xref: /openbsd/gnu/usr.bin/perl/pod/perlhack.pod (revision 274d7c50)
1=encoding utf8
2
3=for comment
4Consistent formatting of this file is achieved with:
5  perl ./Porting/podtidy pod/perlhack.pod
6
7=head1 NAME
8
9perlhack - How to hack on Perl
10
11=head1 DESCRIPTION
12
13This document explains how Perl development works.  It includes details
14about the Perl 5 Porters email list, the Perl repository, the Perlbug
15bug tracker, patch guidelines, and commentary on Perl development
16philosophy.
17
18=head1 SUPER QUICK PATCH GUIDE
19
20If you just want to submit a single small patch like a pod fix, a test
21for a bug, comment fixes, etc., it's easy! Here's how:
22
23=over 4
24
25=item * Check out the source repository
26
27The perl source is in a git repository.  You can clone the repository
28with the following command:
29
30  % git clone git://perl5.git.perl.org/perl.git perl
31
32=item * Ensure you're following the latest advice
33
34In case the advice in this guide has been updated recently, read the
35latest version directly from the perl source:
36
37  % perldoc pod/perlhack.pod
38
39=item * Make your change
40
41Hack, hack, hack.  Keep in mind that Perl runs on many different
42platforms, with different operating systems that have different
43capabilities, different filesystem organizations, and even different
44character sets.  L<perlhacktips> gives advice on this.
45
46=item * Test your change
47
48You can run all the tests with the following commands:
49
50  % ./Configure -des -Dusedevel
51  % make test
52
53Keep hacking until the tests pass.
54
55=item * Commit your change
56
57Committing your work will save the change I<on your local system>:
58
59  % git commit -a -m 'Commit message goes here'
60
61Make sure the commit message describes your change in a single
62sentence.  For example, "Fixed spelling errors in perlhack.pod".
63
64=item * Send your change to perlbug
65
66The next step is to submit your patch to the Perl core ticket system
67via email.
68
69If your changes are in a single git commit, run the following commands
70to generate the patch file and attach it to your bug report:
71
72  % git format-patch -1
73  % ./perl -Ilib utils/perlbug -p 0001-*.patch
74
75The perlbug program will ask you a few questions about your email
76address and the patch you're submitting.  Once you've answered them it
77will submit your patch via email.
78
79If your changes are in multiple commits, generate a patch file for each
80one and provide them to perlbug's C<-p> option separated by commas:
81
82  % git format-patch -3
83  % ./perl -Ilib utils/perlbug -p 0001-fix1.patch,0002-fix2.patch,\
84  > 0003-fix3.patch
85
86When prompted, pick a subject that summarizes your changes.
87
88=item * Thank you
89
90The porters appreciate the time you spent helping to make Perl better.
91Thank you!
92
93=item * Acknowledgement
94
95All contributors are credited (by name and email address) in the
96AUTHORS file, which is part of the perl distribution, as well as the
97Git commit history.
98
99If you don’t want to be included in the AUTHORS file, just let us
100know. Otherwise we will take your submission of a patch as permission
101to credit you in the AUTHORS file.
102
103=item * Next time
104
105The next time you wish to make a patch, you need to start from the
106latest perl in a pristine state.  Check you don't have any local changes
107or added files in your perl check-out which you wish to keep, then run
108these commands:
109
110  % git pull
111  % git reset --hard origin/blead
112  % git clean -dxf
113
114=back
115
116=head1 BUG REPORTING
117
118If you want to report a bug in Perl, you must use the F<perlbug>
119command line tool.  This tool will ensure that your bug report includes
120all the relevant system and configuration information.
121
122To browse existing Perl bugs and patches, you can use the web interface
123at L<http://rt.perl.org/>.
124
125Please check the archive of the perl5-porters list (see below) and/or
126the bug tracking system before submitting a bug report.  Often, you'll
127find that the bug has been reported already.
128
129You can log in to the bug tracking system and comment on existing bug
130reports.  If you have additional information regarding an existing bug,
131please add it.  This will help the porters fix the bug.
132
133=head1 PERL 5 PORTERS
134
135The perl5-porters (p5p) mailing list is where the Perl standard
136distribution is maintained and developed.  The people who maintain Perl
137are also referred to as the "Perl 5 Porters", "p5p" or just the
138"porters".
139
140A searchable archive of the list is available at
141L<http://markmail.org/search/?q=perl5-porters>.  There is also an archive at
142L<http://archive.develooper.com/perl5-porters@perl.org/>.
143
144=head2 perl-changes mailing list
145
146The perl5-changes mailing list receives a copy of each patch that gets
147submitted to the maintenance and development branches of the perl
148repository.  See L<http://lists.perl.org/list/perl5-changes.html> for
149subscription and archive information.
150
151=head2 #p5p on IRC
152
153Many porters are also active on the L<irc://irc.perl.org/#p5p> channel.
154Feel free to join the channel and ask questions about hacking on the
155Perl core.
156
157=head1 GETTING THE PERL SOURCE
158
159All of Perl's source code is kept centrally in a Git repository at
160I<perl5.git.perl.org>.  The repository contains many Perl revisions
161from Perl 1 onwards and all the revisions from Perforce, the previous
162version control system.
163
164For much more detail on using git with the Perl repository, please see
165L<perlgit>.
166
167=head2 Read access via Git
168
169You will need a copy of Git for your computer.  You can fetch a copy of
170the repository using the git protocol:
171
172  % git clone git://perl5.git.perl.org/perl.git perl
173
174This clones the repository and makes a local copy in the F<perl>
175directory.
176
177If you cannot use the git protocol for firewall reasons, you can also
178clone via http, though this is much slower:
179
180  % git clone http://perl5.git.perl.org/perl.git perl
181
182=head2 Read access via the web
183
184You may access the repository over the web.  This allows you to browse
185the tree, see recent commits, subscribe to RSS feeds for the changes,
186search for particular commits and more.  You may access it at
187L<http://perl5.git.perl.org/perl.git>.  A mirror of the repository is
188found at L<https://github.com/Perl/perl5>.
189
190=head2 Read access via rsync
191
192You can also choose to use rsync to get a copy of the current source
193tree for the bleadperl branch and all maintenance branches:
194
195  % rsync -avz rsync://perl5.git.perl.org/perl-current .
196  % rsync -avz rsync://perl5.git.perl.org/perl-5.12.x .
197  % rsync -avz rsync://perl5.git.perl.org/perl-5.10.x .
198  % rsync -avz rsync://perl5.git.perl.org/perl-5.8.x .
199  % rsync -avz rsync://perl5.git.perl.org/perl-5.6.x .
200  % rsync -avz rsync://perl5.git.perl.org/perl-5.005xx .
201
202(Add the C<--delete> option to remove leftover files.)
203
204To get a full list of the available sync points:
205
206  % rsync perl5.git.perl.org::
207
208=head2 Write access via git
209
210If you have a commit bit, please see L<perlgit> for more details on
211using git.
212
213=head1 PATCHING PERL
214
215If you're planning to do more extensive work than a single small fix,
216we encourage you to read the documentation below.  This will help you
217focus your work and make your patches easier to incorporate into the
218Perl source.
219
220=head2 Submitting patches
221
222If you have a small patch to submit, please submit it via perlbug.  You
223can also send email directly to perlbug@perl.org.  Please note that
224messages sent to perlbug may be held in a moderation queue, so you
225won't receive a response immediately.
226
227You'll know your submission has been processed when you receive an
228email from our ticket tracking system.  This email will give you a
229ticket number.  Once your patch has made it to the ticket tracking
230system, it will also be sent to the perl5-porters@perl.org list.
231
232If your patch is related to an already-opened ticket you can also
233attach your patch to that ticket, without having to use perlbug.
234
235Patches are reviewed and discussed on the p5p list.  Simple,
236uncontroversial patches will usually be applied without any discussion.
237When the patch is applied, the ticket will be updated and you will
238receive email.  In addition, an email will be sent to the p5p list.
239
240In other cases, the patch will need more work or discussion.  That will
241happen on the p5p list.
242
243You are encouraged to participate in the discussion and advocate for
244your patch.  Sometimes your patch may get lost in the shuffle.  It's
245appropriate to send a reminder email to p5p if no action has been taken
246in a month.  Please remember that the Perl 5 developers are all
247volunteers, and be polite.
248
249Changes are always applied directly to the main development branch,
250called "blead".  Some patches may be backported to a maintenance
251branch.  If you think your patch is appropriate for the maintenance
252branch (see L<perlpolicy/MAINTENANCE BRANCHES>), please explain why
253when you submit it.
254
255=head2 Getting your patch accepted
256
257If you are submitting a code patch there are several things that you
258can do to help the Perl 5 Porters accept your patch.
259
260=head3 Patch style
261
262If you used git to check out the Perl source, then using C<git
263format-patch> will produce a patch in a style suitable for Perl.  The
264C<format-patch> command produces one patch file for each commit you
265made.  If you prefer to send a single patch for all commits, you can
266use C<git diff>.
267
268  % git checkout blead
269  % git pull
270  % git diff blead my-branch-name
271
272This produces a patch based on the difference between blead and your
273current branch.  It's important to make sure that blead is up to date
274before producing the diff, that's why we call C<git pull> first.
275
276We strongly recommend that you use git if possible.  It will make your
277life easier, and ours as well.
278
279However, if you're not using git, you can still produce a suitable
280patch.  You'll need a pristine copy of the Perl source to diff against.
281The porters prefer unified diffs.  Using GNU C<diff>, you can produce a
282diff like this:
283
284  % diff -Npurd perl.pristine perl.mine
285
286Make sure that you C<make realclean> in your copy of Perl to remove any
287build artifacts, or you may get a confusing result.
288
289=head3 Commit message
290
291As you craft each patch you intend to submit to the Perl core, it's
292important to write a good commit message.  This is especially important
293if your submission will consist of a series of commits.
294
295The first line of the commit message should be a short description
296without a period.  It should be no longer than the subject line of an
297email, 50 characters being a good rule of thumb.
298
299A lot of Git tools (Gitweb, GitHub, git log --pretty=oneline, ...) will
300only display the first line (cut off at 50 characters) when presenting
301commit summaries.
302
303The commit message should include a description of the problem that the
304patch corrects or new functionality that the patch adds.
305
306As a general rule of thumb, your commit message should help a
307programmer who knows the Perl core quickly understand what you were
308trying to do, how you were trying to do it, and why the change matters
309to Perl.
310
311=over 4
312
313=item * Why
314
315Your commit message should describe why the change you are making is
316important.  When someone looks at your change in six months or six
317years, your intent should be clear.
318
319If you're deprecating a feature with the intent of later simplifying
320another bit of code, say so.  If you're fixing a performance problem or
321adding a new feature to support some other bit of the core, mention
322that.
323
324=item * What
325
326Your commit message should describe what part of the Perl core you're
327changing and what you expect your patch to do.
328
329=item * How
330
331While it's not necessary for documentation changes, new tests or
332trivial patches, it's often worth explaining how your change works.
333Even if it's clear to you today, it may not be clear to a porter next
334month or next year.
335
336=back
337
338A commit message isn't intended to take the place of comments in your
339code.  Commit messages should describe the change you made, while code
340comments should describe the current state of the code.
341
342If you've just implemented a new feature, complete with doc, tests and
343well-commented code, a brief commit message will often suffice.  If,
344however, you've just changed a single character deep in the parser or
345lexer, you might need to write a small novel to ensure that future
346readers understand what you did and why you did it.
347
348=head3 Comments, Comments, Comments
349
350Be sure to adequately comment your code.  While commenting every line
351is unnecessary, anything that takes advantage of side effects of
352operators, that creates changes that will be felt outside of the
353function being patched, or that others may find confusing should be
354documented.  If you are going to err, it is better to err on the side
355of adding too many comments than too few.
356
357The best comments explain I<why> the code does what it does, not I<what
358it does>.
359
360=head3 Style
361
362In general, please follow the particular style of the code you are
363patching.
364
365In particular, follow these general guidelines for patching Perl
366sources:
367
368=over 4
369
370=item *
371
3724-wide indents for code, 2-wide indents for nested CPP C<#define>s,
373with 8-wide tabstops.
374
375=item *
376
377Use spaces for indentation, not tab characters.
378
379The codebase is a mixture of tabs and spaces for indentation, and we
380are moving to spaces only.  Converting lines you're patching from 8-wide
381tabs to spaces will help this migration.
382
383=item *
384
385Try hard not to exceed 79-columns
386
387=item *
388
389ANSI C prototypes
390
391=item *
392
393Uncuddled elses and "K&R" style for indenting control constructs
394
395=item *
396
397No C++ style (//) comments
398
399=item *
400
401Mark places that need to be revisited with XXX (and revisit often!)
402
403=item *
404
405Opening brace lines up with "if" when conditional spans multiple lines;
406should be at end-of-line otherwise
407
408=item *
409
410In function definitions, name starts in column 0 (return value-type is on
411previous line)
412
413=item *
414
415Single space after keywords that are followed by parens, no space
416between function name and following paren
417
418=item *
419
420Avoid assignments in conditionals, but if they're unavoidable, use
421extra paren, e.g. "if (a && (b = c)) ..."
422
423=item *
424
425"return foo;" rather than "return(foo);"
426
427=item *
428
429"if (!foo) ..." rather than "if (foo == FALSE) ..." etc.
430
431=item *
432
433Do not declare variables using "register".  It may be counterproductive
434with modern compilers, and is deprecated in C++, under which the Perl
435source is regularly compiled.
436
437=item *
438
439In-line functions that are in headers that are accessible to XS code
440need to be able to compile without warnings with commonly used extra
441compilation flags, such as gcc's C<-Wswitch-default> which warns
442whenever a switch statement does not have a "default" case.  The use of
443these extra flags is to catch potential problems in legal C code, and
444is often used by Perl aggregators, such as Linux distributors.
445
446=back
447
448=head3 Test suite
449
450If your patch changes code (rather than just changing documentation),
451you should also include one or more test cases which illustrate the bug
452you're fixing or validate the new functionality you're adding.  In
453general, you should update an existing test file rather than create a
454new one.
455
456Your test suite additions should generally follow these guidelines
457(courtesy of Gurusamy Sarathy <gsar@activestate.com>):
458
459=over 4
460
461=item *
462
463Know what you're testing.  Read the docs, and the source.
464
465=item *
466
467Tend to fail, not succeed.
468
469=item *
470
471Interpret results strictly.
472
473=item *
474
475Use unrelated features (this will flush out bizarre interactions).
476
477=item *
478
479Use non-standard idioms (otherwise you are not testing TIMTOWTDI).
480
481=item *
482
483Avoid using hardcoded test numbers whenever possible (the EXPECTED/GOT
484found in t/op/tie.t is much more maintainable, and gives better failure
485reports).
486
487=item *
488
489Give meaningful error messages when a test fails.
490
491=item *
492
493Avoid using qx// and system() unless you are testing for them.  If you
494do use them, make sure that you cover _all_ perl platforms.
495
496=item *
497
498Unlink any temporary files you create.
499
500=item *
501
502Promote unforeseen warnings to errors with $SIG{__WARN__}.
503
504=item *
505
506Be sure to use the libraries and modules shipped with the version being
507tested, not those that were already installed.
508
509=item *
510
511Add comments to the code explaining what you are testing for.
512
513=item *
514
515Make updating the '1..42' string unnecessary.  Or make sure that you
516update it.
517
518=item *
519
520Test _all_ behaviors of a given operator, library, or function.
521
522Test all optional arguments.
523
524Test return values in various contexts (boolean, scalar, list, lvalue).
525
526Use both global and lexical variables.
527
528Don't forget the exceptional, pathological cases.
529
530=back
531
532=head2 Patching a core module
533
534This works just like patching anything else, with one extra
535consideration.
536
537Modules in the F<cpan/> directory of the source tree are maintained
538outside of the Perl core.  When the author updates the module, the
539updates are simply copied into the core.  See that module's
540documentation or its listing on L<http://search.cpan.org/> for more
541information on reporting bugs and submitting patches.
542
543In most cases, patches to modules in F<cpan/> should be sent upstream
544and should not be applied to the Perl core individually.  If a patch to
545a file in F<cpan/> absolutely cannot wait for the fix to be made
546upstream, released to CPAN and copied to blead, you must add (or
547update) a C<CUSTOMIZED> entry in the F<"Porting/Maintainers.pl"> file
548to flag that a local modification has been made.  See
549F<"Porting/Maintainers.pl"> for more details.
550
551In contrast, modules in the F<dist/> directory are maintained in the
552core.
553
554=head2 Updating perldelta
555
556For changes significant enough to warrant a F<pod/perldelta.pod> entry,
557the porters will greatly appreciate it if you submit a delta entry
558along with your actual change.  Significant changes include, but are
559not limited to:
560
561=over 4
562
563=item *
564
565Adding, deprecating, or removing core features
566
567=item *
568
569Adding, deprecating, removing, or upgrading core or dual-life modules
570
571=item *
572
573Adding new core tests
574
575=item *
576
577Fixing security issues and user-visible bugs in the core
578
579=item *
580
581Changes that might break existing code, either on the perl or C level
582
583=item *
584
585Significant performance improvements
586
587=item *
588
589Adding, removing, or significantly changing documentation in the
590F<pod/> directory
591
592=item *
593
594Important platform-specific changes
595
596=back
597
598Please make sure you add the perldelta entry to the right section
599within F<pod/perldelta.pod>.  More information on how to write good
600perldelta entries is available in the C<Style> section of
601F<Porting/how_to_write_a_perldelta.pod>.
602
603=head2 What makes for a good patch?
604
605New features and extensions to the language can be contentious.  There
606is no specific set of criteria which determine what features get added,
607but here are some questions to consider when developing a patch:
608
609=head3 Does the concept match the general goals of Perl?
610
611Our goals include, but are not limited to:
612
613=over 4
614
615=item 1.
616
617Keep it fast, simple, and useful.
618
619=item 2.
620
621Keep features/concepts as orthogonal as possible.
622
623=item 3.
624
625No arbitrary limits (platforms, data sizes, cultures).
626
627=item 4.
628
629Keep it open and exciting to use/patch/advocate Perl everywhere.
630
631=item 5.
632
633Either assimilate new technologies, or build bridges to them.
634
635=back
636
637=head3 Where is the implementation?
638
639All the talk in the world is useless without an implementation.  In
640almost every case, the person or people who argue for a new feature
641will be expected to be the ones who implement it.  Porters capable of
642coding new features have their own agendas, and are not available to
643implement your (possibly good) idea.
644
645=head3 Backwards compatibility
646
647It's a cardinal sin to break existing Perl programs.  New warnings can
648be contentious--some say that a program that emits warnings is not
649broken, while others say it is.  Adding keywords has the potential to
650break programs, changing the meaning of existing token sequences or
651functions might break programs.
652
653The Perl 5 core includes mechanisms to help porters make backwards
654incompatible changes more compatible such as the L<feature> and
655L<deprecate> modules.  Please use them when appropriate.
656
657=head3 Could it be a module instead?
658
659Perl 5 has extension mechanisms, modules and XS, specifically to avoid
660the need to keep changing the Perl interpreter.  You can write modules
661that export functions, you can give those functions prototypes so they
662can be called like built-in functions, you can even write XS code to
663mess with the runtime data structures of the Perl interpreter if you
664want to implement really complicated things.
665
666Whenever possible, new features should be prototyped in a CPAN module
667before they will be considered for the core.
668
669=head3 Is the feature generic enough?
670
671Is this something that only the submitter wants added to the language,
672or is it broadly useful?  Sometimes, instead of adding a feature with a
673tight focus, the porters might decide to wait until someone implements
674the more generalized feature.
675
676=head3 Does it potentially introduce new bugs?
677
678Radical rewrites of large chunks of the Perl interpreter have the
679potential to introduce new bugs.
680
681=head3 How big is it?
682
683The smaller and more localized the change, the better.  Similarly, a
684series of small patches is greatly preferred over a single large patch.
685
686=head3 Does it preclude other desirable features?
687
688A patch is likely to be rejected if it closes off future avenues of
689development.  For instance, a patch that placed a true and final
690interpretation on prototypes is likely to be rejected because there are
691still options for the future of prototypes that haven't been addressed.
692
693=head3 Is the implementation robust?
694
695Good patches (tight code, complete, correct) stand more chance of going
696in.  Sloppy or incorrect patches might be placed on the back burner
697until the pumpking has time to fix, or might be discarded altogether
698without further notice.
699
700=head3 Is the implementation generic enough to be portable?
701
702The worst patches make use of system-specific features.  It's highly
703unlikely that non-portable additions to the Perl language will be
704accepted.
705
706=head3 Is the implementation tested?
707
708Patches which change behaviour (fixing bugs or introducing new
709features) must include regression tests to verify that everything works
710as expected.
711
712Without tests provided by the original author, how can anyone else
713changing perl in the future be sure that they haven't unwittingly
714broken the behaviour the patch implements? And without tests, how can
715the patch's author be confident that his/her hard work put into the
716patch won't be accidentally thrown away by someone in the future?
717
718=head3 Is there enough documentation?
719
720Patches without documentation are probably ill-thought out or
721incomplete.  No features can be added or changed without documentation,
722so submitting a patch for the appropriate pod docs as well as the
723source code is important.
724
725=head3 Is there another way to do it?
726
727Larry said "Although the Perl Slogan is I<There's More Than One Way to
728Do It>, I hesitate to make 10 ways to do something".  This is a tricky
729heuristic to navigate, though--one man's essential addition is another
730man's pointless cruft.
731
732=head3 Does it create too much work?
733
734Work for the pumpking, work for Perl programmers, work for module
735authors, ... Perl is supposed to be easy.
736
737=head3 Patches speak louder than words
738
739Working code is always preferred to pie-in-the-sky ideas.  A patch to
740add a feature stands a much higher chance of making it to the language
741than does a random feature request, no matter how fervently argued the
742request might be.  This ties into "Will it be useful?", as the fact
743that someone took the time to make the patch demonstrates a strong
744desire for the feature.
745
746=head1 TESTING
747
748The core uses the same testing style as the rest of Perl, a simple
749"ok/not ok" run through Test::Harness, but there are a few special
750considerations.
751
752There are three ways to write a test in the core: L<Test::More>,
753F<t/test.pl> and ad hoc C<print $test ? "ok 42\n" : "not ok 42\n">.
754The decision of which to use depends on what part of the test suite
755you're working on.  This is a measure to prevent a high-level failure
756(such as Config.pm breaking) from causing basic functionality tests to
757fail.
758
759The F<t/test.pl> library provides some of the features of
760L<Test::More>, but avoids loading most modules and uses as few core
761features as possible.
762
763If you write your own test, use the L<Test Anything
764Protocol|http://testanything.org>.
765
766=over 4
767
768=item * F<t/base>, F<t/comp> and F<t/opbasic>
769
770Since we don't know if C<require> works, or even subroutines, use ad hoc
771tests for these three.  Step carefully to avoid using the feature being
772tested.  Tests in F<t/opbasic>, for instance, have been placed there
773rather than in F<t/op> because they test functionality which
774F<t/test.pl> presumes has already been demonstrated to work.
775
776=item * F<t/cmd>, F<t/run>, F<t/io> and F<t/op>
777
778Now that basic require() and subroutines are tested, you can use the
779F<t/test.pl> library.
780
781You can also use certain libraries like Config conditionally, but be
782sure to skip the test gracefully if it's not there.
783
784=item * Everything else
785
786Now that the core of Perl is tested, L<Test::More> can and should be
787used.  You can also use the full suite of core modules in the tests.
788
789=back
790
791When you say "make test", Perl uses the F<t/TEST> program to run the
792test suite (except under Win32 where it uses F<t/harness> instead).
793All tests are run from the F<t/> directory, B<not> the directory which
794contains the test.  This causes some problems with the tests in
795F<lib/>, so here's some opportunity for some patching.
796
797You must be triply conscious of cross-platform concerns.  This usually
798boils down to using L<File::Spec>, avoiding things like C<fork()>
799and C<system()> unless absolutely necessary, and not assuming that a
800given character has a particular ordinal value (code point) or that its
801UTF-8 representation is composed of particular bytes.
802
803There are several functions available to specify characters and code
804points portably in tests.  The always-preloaded functions
805C<utf8::unicode_to_native()> and its inverse
806C<utf8::native_to_unicode()> take code points and translate
807appropriately.  The file F<t/charset_tools.pl> has several functions
808that can be useful.  It has versions of the previous two functions
809that take strings as inputs -- not single numeric code points:
810C<uni_to_native()> and C<native_to_uni()>.  If you must look at the
811individual bytes comprising a UTF-8 encoded string,
812C<byte_utf8a_to_utf8n()> takes as input a string of those bytes encoded
813for an ASCII platform, and returns the equivalent string in the native
814platform.  For example, C<byte_utf8a_to_utf8n("\xC2\xA0")> returns the
815byte sequence on the current platform that form the UTF-8 for C<U+00A0>,
816since C<"\xC2\xA0"> are the UTF-8 bytes on an ASCII platform for that
817code point.  This function returns C<"\xC2\xA0"> on an ASCII platform, and
818C<"\x80\x41"> on an EBCDIC 1047 one.
819
820But easiest is, if the character is specifiable as a literal, like
821C<"A"> or C<"%">, to use that; if not so specificable, you can use use
822C<\N{}> , if the side effects aren't troublesome.  Simply specify all
823your characters in hex, using C<\N{U+ZZ}> instead of C<\xZZ>.  C<\N{}>
824is the Unicode name, and so it
825always gives you the Unicode character.  C<\N{U+41}> is the character
826whose Unicode code point is C<0x41>, hence is C<'A'> on all platforms.
827The side effects are:
828
829=over 4
830
831=item *
832
833These select Unicode rules.  That means that in double-quotish strings,
834the string is always converted to UTF-8 to force a Unicode
835interpretation (you can C<utf8::downgrade()> afterwards to convert back
836to non-UTF8, if possible).  In regular expression patterns, the
837conversion isn't done, but if the character set modifier would
838otherwise be C</d>, it is changed to C</u>.
839
840=item *
841
842If you use the form C<\N{I<character name>}>, the L<charnames> module
843gets automatically loaded.  This may not be suitable for the test level
844you are doing.
845
846=back
847
848If you are testing locales (see L<perllocale>), there are helper
849functions in F<t/loc_tools.pl> to enable you to see what locales there
850are on the current platform.
851
852=head2 Special C<make test> targets
853
854There are various special make targets that can be used to test Perl
855slightly differently than the standard "test" target.  Not all them are
856expected to give a 100% success rate.  Many of them have several
857aliases, and many of them are not available on certain operating
858systems.
859
860=over 4
861
862=item * test_porting
863
864This runs some basic sanity tests on the source tree and helps catch
865basic errors before you submit a patch.
866
867=item * minitest
868
869Run F<miniperl> on F<t/base>, F<t/comp>, F<t/cmd>, F<t/run>, F<t/io>,
870F<t/op>, F<t/uni> and F<t/mro> tests.
871
872=item * test.valgrind check.valgrind
873
874(Only in Linux) Run all the tests using the memory leak + naughty
875memory access tool "valgrind".  The log files will be named
876F<testname.valgrind>.
877
878=item * test_harness
879
880Run the test suite with the F<t/harness> controlling program, instead
881of F<t/TEST>.  F<t/harness> is more sophisticated, and uses the
882L<Test::Harness> module, thus using this test target supposes that perl
883mostly works.  The main advantage for our purposes is that it prints a
884detailed summary of failed tests at the end.  Also, unlike F<t/TEST>,
885it doesn't redirect stderr to stdout.
886
887Note that under Win32 F<t/harness> is always used instead of F<t/TEST>,
888so there is no special "test_harness" target.
889
890Under Win32's "test" target you may use the TEST_SWITCHES and
891TEST_FILES environment variables to control the behaviour of
892F<t/harness>.  This means you can say
893
894    nmake test TEST_FILES="op/*.t"
895    nmake test TEST_SWITCHES="-torture" TEST_FILES="op/*.t"
896
897=item * test-notty test_notty
898
899Sets PERL_SKIP_TTY_TEST to true before running normal test.
900
901=back
902
903=head2 Parallel tests
904
905The core distribution can now run its regression tests in parallel on
906Unix-like platforms.  Instead of running C<make test>, set C<TEST_JOBS>
907in your environment to the number of tests to run in parallel, and run
908C<make test_harness>.  On a Bourne-like shell, this can be done as
909
910    TEST_JOBS=3 make test_harness  # Run 3 tests in parallel
911
912An environment variable is used, rather than parallel make itself,
913because L<TAP::Harness> needs to be able to schedule individual
914non-conflicting test scripts itself, and there is no standard interface
915to C<make> utilities to interact with their job schedulers.
916
917Note that currently some test scripts may fail when run in parallel
918(most notably F<dist/IO/t/io_dir.t>).  If necessary, run just the
919failing scripts again sequentially and see if the failures go away.
920
921=head2 Running tests by hand
922
923You can run part of the test suite by hand by using one of the
924following commands from the F<t/> directory:
925
926    ./perl -I../lib TEST list-of-.t-files
927
928or
929
930    ./perl -I../lib harness list-of-.t-files
931
932(If you don't specify test scripts, the whole test suite will be run.)
933
934=head2 Using F<t/harness> for testing
935
936If you use C<harness> for testing, you have several command line
937options available to you.  The arguments are as follows, and are in the
938order that they must appear if used together.
939
940    harness -v -torture -re=pattern LIST OF FILES TO TEST
941    harness -v -torture -re LIST OF PATTERNS TO MATCH
942
943If C<LIST OF FILES TO TEST> is omitted, the file list is obtained from
944the manifest.  The file list may include shell wildcards which will be
945expanded out.
946
947=over 4
948
949=item * -v
950
951Run the tests under verbose mode so you can see what tests were run,
952and debug output.
953
954=item * -torture
955
956Run the torture tests as well as the normal set.
957
958=item * -re=PATTERN
959
960Filter the file list so that all the test files run match PATTERN.
961Note that this form is distinct from the B<-re LIST OF PATTERNS> form
962below in that it allows the file list to be provided as well.
963
964=item * -re LIST OF PATTERNS
965
966Filter the file list so that all the test files run match
967/(LIST|OF|PATTERNS)/.  Note that with this form the patterns are joined
968by '|' and you cannot supply a list of files, instead the test files
969are obtained from the MANIFEST.
970
971=back
972
973You can run an individual test by a command similar to
974
975    ./perl -I../lib path/to/foo.t
976
977except that the harnesses set up some environment variables that may
978affect the execution of the test:
979
980=over 4
981
982=item * PERL_CORE=1
983
984indicates that we're running this test as part of the perl core test
985suite.  This is useful for modules that have a dual life on CPAN.
986
987=item * PERL_DESTRUCT_LEVEL=2
988
989is set to 2 if it isn't set already (see
990L<perlhacktips/PERL_DESTRUCT_LEVEL>).
991
992=item * PERL
993
994(used only by F<t/TEST>) if set, overrides the path to the perl
995executable that should be used to run the tests (the default being
996F<./perl>).
997
998=item * PERL_SKIP_TTY_TEST
999
1000if set, tells to skip the tests that need a terminal.  It's actually
1001set automatically by the Makefile, but can also be forced artificially
1002by running 'make test_notty'.
1003
1004=back
1005
1006=head3 Other environment variables that may influence tests
1007
1008=over 4
1009
1010=item * PERL_TEST_Net_Ping
1011
1012Setting this variable runs all the Net::Ping modules tests, otherwise
1013some tests that interact with the outside world are skipped.  See
1014L<perl58delta>.
1015
1016=item * PERL_TEST_NOVREXX
1017
1018Setting this variable skips the vrexx.t tests for OS2::REXX.
1019
1020=item * PERL_TEST_NUMCONVERTS
1021
1022This sets a variable in op/numconvert.t.
1023
1024=item * PERL_TEST_MEMORY
1025
1026Setting this variable includes the tests in F<t/bigmem/>.  This should
1027be set to the number of gigabytes of memory available for testing, eg.
1028C<PERL_TEST_MEMORY=4> indicates that tests that require 4GiB of
1029available memory can be run safely.
1030
1031=back
1032
1033See also the documentation for the Test and Test::Harness modules, for
1034more environment variables that affect testing.
1035
1036=head2 Performance testing
1037
1038The file F<t/perf/benchmarks> contains snippets of perl code which are
1039intended to be benchmarked across a range of perls by the
1040F<Porting/bench.pl> tool. If you fix or enhance a performance issue, you
1041may want to add a representative code sample to the file, then run
1042F<bench.pl> against the previous and current perls to see what difference
1043it has made, and whether anything else has slowed down as a consequence.
1044
1045The file F<t/perf/opcount.t> is designed to test whether a particular
1046code snippet has been compiled into an optree containing specified
1047numbers of particular op types. This is good for testing whether
1048optimisations which alter ops, such as converting an C<aelem> op into an
1049C<aelemfast> op, are really doing that.
1050
1051The files F<t/perf/speed.t> and F<t/re/speed.t> are designed to test
1052things that run thousands of times slower if a particular optimisation
1053is broken (for example, the utf8 length cache on long utf8 strings).
1054Add a test that will take a fraction of a second normally, and minutes
1055otherwise, causing the test file to time out on failure.
1056
1057=head2 Building perl at older commits
1058
1059In the course of hacking on the Perl core distribution, you may have occasion
1060to configure, build and test perl at an old commit.  Sometimes C<make> will
1061fail during this process.  If that happens, you may be able to salvage the
1062situation by using the Devel::PatchPerl library from CPAN (not included in the
1063core) to bring the source code at that commit to a buildable state.
1064
1065Here's a real world example, taken from work done to resolve
1066L<perl #72414|https://rt.perl.org/Ticket/Display.html?id=72414>.
1067Use of F<Porting/bisect.pl> had identified commit
1068C<ba77e4cc9d1ceebf472c9c5c18b2377ee47062e6> as the commit in which a bug was
1069corrected.  To confirm, a P5P developer wanted to configure and build perl at
1070commit C<ba77e4c^> (presumably "bad") and then at C<ba77e4c> (presumably
1071"good").  Normal configuration and build was attempted:
1072
1073    $ sh ./Configure -des -Dusedevel
1074    $ make test_prep
1075
1076C<make>, however, failed with output (excerpted) like this:
1077
1078    cc -fstack-protector -L/usr/local/lib -o miniperl \
1079      gv.o toke.o perly.o pad.o regcomp.o dump.o util.o \
1080      mg.o reentr.o mro.o hv.o av.o run.o pp_hot.o sv.o \
1081      pp.o scope.o pp_ctl.o pp_sys.o doop.o doio.o regexec.o \
1082      utf8.o taint.o deb.o universal.o globals.o perlio.o \
1083      perlapi.o numeric.o mathoms.o locale.o pp_pack.o pp_sort.o  \
1084      miniperlmain.o opmini.o perlmini.o
1085    pp.o: In function `Perl_pp_pow':
1086    pp.c:(.text+0x2db9): undefined reference to `pow'
1087    ...
1088    collect2: error: ld returned 1 exit status
1089    makefile:348: recipe for target 'miniperl' failed
1090    make: *** [miniperl] Error 1
1091
1092Another P5P contributor recommended installation and use of Devel::PatchPerl
1093for this situation, first to determine the version of perl at the commit in
1094question, then to patch the source code at that point to facilitate a build.
1095
1096    $ perl -MDevel::PatchPerl -e \
1097        'print Devel::PatchPerl->determine_version("/path/to/sourcecode"), "\n";'
1098    5.11.1
1099    $ perl -MDevel::PatchPerl -e \
1100        'Devel::PatchPerl->patch_source("5.11.1", "/path/to/sourcecode");'
1101
1102Once the source was patched, C<./Configure> and C<make test_prep> were called
1103and completed successfully, enabling confirmation of the findings in RT
1104#72414.
1105
1106=head1 MORE READING FOR GUTS HACKERS
1107
1108To hack on the Perl guts, you'll need to read the following things:
1109
1110=over 4
1111
1112=item * L<perlsource>
1113
1114An overview of the Perl source tree.  This will help you find the files
1115you're looking for.
1116
1117=item * L<perlinterp>
1118
1119An overview of the Perl interpreter source code and some details on how
1120Perl does what it does.
1121
1122=item * L<perlhacktut>
1123
1124This document walks through the creation of a small patch to Perl's C
1125code.  If you're just getting started with Perl core hacking, this will
1126help you understand how it works.
1127
1128=item * L<perlhacktips>
1129
1130More details on hacking the Perl core.  This document focuses on lower
1131level details such as how to write tests, compilation issues,
1132portability, debugging, etc.
1133
1134If you plan on doing serious C hacking, make sure to read this.
1135
1136=item * L<perlguts>
1137
1138This is of paramount importance, since it's the documentation of what
1139goes where in the Perl source.  Read it over a couple of times and it
1140might start to make sense - don't worry if it doesn't yet, because the
1141best way to study it is to read it in conjunction with poking at Perl
1142source, and we'll do that later on.
1143
1144Gisle Aas's "illustrated perlguts", also known as I<illguts>, has very
1145helpful pictures:
1146
1147L<http://search.cpan.org/dist/illguts/>
1148
1149=item * L<perlxstut> and L<perlxs>
1150
1151A working knowledge of XSUB programming is incredibly useful for core
1152hacking; XSUBs use techniques drawn from the PP code, the portion of
1153the guts that actually executes a Perl program.  It's a lot gentler to
1154learn those techniques from simple examples and explanation than from
1155the core itself.
1156
1157=item * L<perlapi>
1158
1159The documentation for the Perl API explains what some of the internal
1160functions do, as well as the many macros used in the source.
1161
1162=item * F<Porting/pumpkin.pod>
1163
1164This is a collection of words of wisdom for a Perl porter; some of it
1165is only useful to the pumpkin holder, but most of it applies to anyone
1166wanting to go about Perl development.
1167
1168=back
1169
1170=head1 CPAN TESTERS AND PERL SMOKERS
1171
1172The CPAN testers ( L<http://testers.cpan.org/> ) are a group of volunteers
1173who test CPAN modules on a variety of platforms.
1174
1175Perl Smokers ( L<http://www.nntp.perl.org/group/perl.daily-build/> and
1176L<http://www.nntp.perl.org/group/perl.daily-build.reports/> )
1177automatically test Perl source releases on platforms with various
1178configurations.
1179
1180Both efforts welcome volunteers.  In order to get involved in smoke
1181testing of the perl itself visit
1182L<http://search.cpan.org/dist/Test-Smoke/>.  In order to start smoke
1183testing CPAN modules visit
1184L<http://search.cpan.org/dist/CPANPLUS-YACSmoke/> or
1185L<http://search.cpan.org/dist/minismokebox/> or
1186L<http://search.cpan.org/dist/CPAN-Reporter/>.
1187
1188=head1 WHAT NEXT?
1189
1190If you've read all the documentation in the document and the ones
1191listed above, you're more than ready to hack on Perl.
1192
1193Here's some more recommendations
1194
1195=over 4
1196
1197=item *
1198
1199Subscribe to perl5-porters, follow the patches and try and understand
1200them; don't be afraid to ask if there's a portion you're not clear on -
1201who knows, you may unearth a bug in the patch...
1202
1203=item *
1204
1205Do read the README associated with your operating system, e.g.
1206README.aix on the IBM AIX OS.  Don't hesitate to supply patches to that
1207README if you find anything missing or changed over a new OS release.
1208
1209=item *
1210
1211Find an area of Perl that seems interesting to you, and see if you can
1212work out how it works.  Scan through the source, and step over it in
1213the debugger.  Play, poke, investigate, fiddle! You'll probably get to
1214understand not just your chosen area but a much wider range of
1215F<perl>'s activity as well, and probably sooner than you'd think.
1216
1217=back
1218
1219=head2 "The Road goes ever on and on, down from the door where it began."
1220
1221If you can do these things, you've started on the long road to Perl
1222porting.  Thanks for wanting to help make Perl better - and happy
1223hacking!
1224
1225=head2 Metaphoric Quotations
1226
1227If you recognized the quote about the Road above, you're in luck.
1228
1229Most software projects begin each file with a literal description of
1230each file's purpose.  Perl instead begins each with a literary allusion
1231to that file's purpose.
1232
1233Like chapters in many books, all top-level Perl source files (along
1234with a few others here and there) begin with an epigrammatic
1235inscription that alludes, indirectly and metaphorically, to the
1236material you're about to read.
1237
1238Quotations are taken from writings of J.R.R. Tolkien pertaining to his
1239Legendarium, almost always from I<The Lord of the Rings>.  Chapters and
1240page numbers are given using the following editions:
1241
1242=over 4
1243
1244=item *
1245
1246I<The Hobbit>, by J.R.R. Tolkien.  The hardcover, 70th-anniversary
1247edition of 2007 was used, published in the UK by Harper Collins
1248Publishers and in the US by the Houghton Mifflin Company.
1249
1250=item *
1251
1252I<The Lord of the Rings>, by J.R.R. Tolkien.  The hardcover,
125350th-anniversary edition of 2004 was used, published in the UK by
1254Harper Collins Publishers and in the US by the Houghton Mifflin
1255Company.
1256
1257=item *
1258
1259I<The Lays of Beleriand>, by J.R.R. Tolkien and published posthumously
1260by his son and literary executor, C.J.R. Tolkien, being the 3rd of the
126112 volumes in Christopher's mammoth I<History of Middle Earth>.  Page
1262numbers derive from the hardcover edition, first published in 1983 by
1263George Allen & Unwin; no page numbers changed for the special 3-volume
1264omnibus edition of 2002 or the various trade-paper editions, all again
1265now by Harper Collins or Houghton Mifflin.
1266
1267=back
1268
1269Other JRRT books fair game for quotes would thus include I<The
1270Adventures of Tom Bombadil>, I<The Silmarillion>, I<Unfinished Tales>,
1271and I<The Tale of the Children of Hurin>, all but the first
1272posthumously assembled by CJRT.  But I<The Lord of the Rings> itself is
1273perfectly fine and probably best to quote from, provided you can find a
1274suitable quote there.
1275
1276So if you were to supply a new, complete, top-level source file to add
1277to Perl, you should conform to this peculiar practice by yourself
1278selecting an appropriate quotation from Tolkien, retaining the original
1279spelling and punctuation and using the same format the rest of the
1280quotes are in.  Indirect and oblique is just fine; remember, it's a
1281metaphor, so being meta is, after all, what it's for.
1282
1283=head1 AUTHOR
1284
1285This document was originally written by Nathan Torkington, and is
1286maintained by the perl5-porters mailing list.
1287