1=encoding utf8 2 3=for comment 4Consistent formatting of this file is achieved with: 5 perl ./Porting/podtidy pod/perlhack.pod 6 7=head1 NAME 8 9perlhack - How to hack on Perl 10 11=head1 DESCRIPTION 12 13This document explains how Perl development works. It includes details 14about the Perl 5 Porters email list, the Perl repository, the Perlbug 15bug tracker, patch guidelines, and commentary on Perl development 16philosophy. 17 18=head1 SUPER QUICK PATCH GUIDE 19 20If you just want to submit a single small patch like a pod fix, a test 21for a bug, comment fixes, etc., it's easy! Here's how: 22 23=over 4 24 25=item * Check out the source repository 26 27The perl source is in a git repository. You can clone the repository 28with the following command: 29 30 % git clone git://perl5.git.perl.org/perl.git perl 31 32=item * Ensure you're following the latest advice 33 34In case the advice in this guide has been updated recently, read the 35latest version directly from the perl source: 36 37 % perldoc pod/perlhack.pod 38 39=item * Make your change 40 41Hack, hack, hack. Keep in mind that Perl runs on many different 42platforms, with different operating systems that have different 43capabilities, different filesystem organizations, and even different 44character sets. L<perlhacktips> gives advice on this. 45 46=item * Test your change 47 48You can run all the tests with the following commands: 49 50 % ./Configure -des -Dusedevel 51 % make test 52 53Keep hacking until the tests pass. 54 55=item * Commit your change 56 57Committing your work will save the change I<on your local system>: 58 59 % git commit -a -m 'Commit message goes here' 60 61Make sure the commit message describes your change in a single 62sentence. For example, "Fixed spelling errors in perlhack.pod". 63 64=item * Send your change to perlbug 65 66The next step is to submit your patch to the Perl core ticket system 67via email. 68 69If your changes are in a single git commit, run the following commands 70to generate the patch file and attach it to your bug report: 71 72 % git format-patch -1 73 % ./perl -Ilib utils/perlbug -p 0001-*.patch 74 75The perlbug program will ask you a few questions about your email 76address and the patch you're submitting. Once you've answered them it 77will submit your patch via email. 78 79If your changes are in multiple commits, generate a patch file for each 80one and provide them to perlbug's C<-p> option separated by commas: 81 82 % git format-patch -3 83 % ./perl -Ilib utils/perlbug -p 0001-fix1.patch,0002-fix2.patch,\ 84 > 0003-fix3.patch 85 86When prompted, pick a subject that summarizes your changes. 87 88=item * Thank you 89 90The porters appreciate the time you spent helping to make Perl better. 91Thank you! 92 93=item * Acknowledgement 94 95All contributors are credited (by name and email address) in the 96AUTHORS file, which is part of the perl distribution, as well as the 97Git commit history. 98 99If you don’t want to be included in the AUTHORS file, just let us 100know. Otherwise we will take your submission of a patch as permission 101to credit you in the AUTHORS file. 102 103=item * Next time 104 105The next time you wish to make a patch, you need to start from the 106latest perl in a pristine state. Check you don't have any local changes 107or added files in your perl check-out which you wish to keep, then run 108these commands: 109 110 % git pull 111 % git reset --hard origin/blead 112 % git clean -dxf 113 114=back 115 116=head1 BUG REPORTING 117 118If you want to report a bug in Perl, you must use the F<perlbug> 119command line tool. This tool will ensure that your bug report includes 120all the relevant system and configuration information. 121 122To browse existing Perl bugs and patches, you can use the web interface 123at L<http://rt.perl.org/>. 124 125Please check the archive of the perl5-porters list (see below) and/or 126the bug tracking system before submitting a bug report. Often, you'll 127find that the bug has been reported already. 128 129You can log in to the bug tracking system and comment on existing bug 130reports. If you have additional information regarding an existing bug, 131please add it. This will help the porters fix the bug. 132 133=head1 PERL 5 PORTERS 134 135The perl5-porters (p5p) mailing list is where the Perl standard 136distribution is maintained and developed. The people who maintain Perl 137are also referred to as the "Perl 5 Porters", "p5p" or just the 138"porters". 139 140A searchable archive of the list is available at 141L<http://markmail.org/search/?q=perl5-porters>. There is also an archive at 142L<http://archive.develooper.com/perl5-porters@perl.org/>. 143 144=head2 perl-changes mailing list 145 146The perl5-changes mailing list receives a copy of each patch that gets 147submitted to the maintenance and development branches of the perl 148repository. See L<http://lists.perl.org/list/perl5-changes.html> for 149subscription and archive information. 150 151=head2 #p5p on IRC 152 153Many porters are also active on the L<irc://irc.perl.org/#p5p> channel. 154Feel free to join the channel and ask questions about hacking on the 155Perl core. 156 157=head1 GETTING THE PERL SOURCE 158 159All of Perl's source code is kept centrally in a Git repository at 160I<perl5.git.perl.org>. The repository contains many Perl revisions 161from Perl 1 onwards and all the revisions from Perforce, the previous 162version control system. 163 164For much more detail on using git with the Perl repository, please see 165L<perlgit>. 166 167=head2 Read access via Git 168 169You will need a copy of Git for your computer. You can fetch a copy of 170the repository using the git protocol: 171 172 % git clone git://perl5.git.perl.org/perl.git perl 173 174This clones the repository and makes a local copy in the F<perl> 175directory. 176 177If you cannot use the git protocol for firewall reasons, you can also 178clone via http, though this is much slower: 179 180 % git clone http://perl5.git.perl.org/perl.git perl 181 182=head2 Read access via the web 183 184You may access the repository over the web. This allows you to browse 185the tree, see recent commits, subscribe to RSS feeds for the changes, 186search for particular commits and more. You may access it at 187L<http://perl5.git.perl.org/perl.git>. A mirror of the repository is 188found at L<https://github.com/Perl/perl5>. 189 190=head2 Read access via rsync 191 192You can also choose to use rsync to get a copy of the current source 193tree for the bleadperl branch and all maintenance branches: 194 195 % rsync -avz rsync://perl5.git.perl.org/perl-current . 196 % rsync -avz rsync://perl5.git.perl.org/perl-5.12.x . 197 % rsync -avz rsync://perl5.git.perl.org/perl-5.10.x . 198 % rsync -avz rsync://perl5.git.perl.org/perl-5.8.x . 199 % rsync -avz rsync://perl5.git.perl.org/perl-5.6.x . 200 % rsync -avz rsync://perl5.git.perl.org/perl-5.005xx . 201 202(Add the C<--delete> option to remove leftover files.) 203 204To get a full list of the available sync points: 205 206 % rsync perl5.git.perl.org:: 207 208=head2 Write access via git 209 210If you have a commit bit, please see L<perlgit> for more details on 211using git. 212 213=head1 PATCHING PERL 214 215If you're planning to do more extensive work than a single small fix, 216we encourage you to read the documentation below. This will help you 217focus your work and make your patches easier to incorporate into the 218Perl source. 219 220=head2 Submitting patches 221 222If you have a small patch to submit, please submit it via perlbug. You 223can also send email directly to perlbug@perl.org. Please note that 224messages sent to perlbug may be held in a moderation queue, so you 225won't receive a response immediately. 226 227You'll know your submission has been processed when you receive an 228email from our ticket tracking system. This email will give you a 229ticket number. Once your patch has made it to the ticket tracking 230system, it will also be sent to the perl5-porters@perl.org list. 231 232If your patch is related to an already-opened ticket you can also 233attach your patch to that ticket, without having to use perlbug. 234 235Patches are reviewed and discussed on the p5p list. Simple, 236uncontroversial patches will usually be applied without any discussion. 237When the patch is applied, the ticket will be updated and you will 238receive email. In addition, an email will be sent to the p5p list. 239 240In other cases, the patch will need more work or discussion. That will 241happen on the p5p list. 242 243You are encouraged to participate in the discussion and advocate for 244your patch. Sometimes your patch may get lost in the shuffle. It's 245appropriate to send a reminder email to p5p if no action has been taken 246in a month. Please remember that the Perl 5 developers are all 247volunteers, and be polite. 248 249Changes are always applied directly to the main development branch, 250called "blead". Some patches may be backported to a maintenance 251branch. If you think your patch is appropriate for the maintenance 252branch (see L<perlpolicy/MAINTENANCE BRANCHES>), please explain why 253when you submit it. 254 255=head2 Getting your patch accepted 256 257If you are submitting a code patch there are several things that you 258can do to help the Perl 5 Porters accept your patch. 259 260=head3 Patch style 261 262If you used git to check out the Perl source, then using C<git 263format-patch> will produce a patch in a style suitable for Perl. The 264C<format-patch> command produces one patch file for each commit you 265made. If you prefer to send a single patch for all commits, you can 266use C<git diff>. 267 268 % git checkout blead 269 % git pull 270 % git diff blead my-branch-name 271 272This produces a patch based on the difference between blead and your 273current branch. It's important to make sure that blead is up to date 274before producing the diff, that's why we call C<git pull> first. 275 276We strongly recommend that you use git if possible. It will make your 277life easier, and ours as well. 278 279However, if you're not using git, you can still produce a suitable 280patch. You'll need a pristine copy of the Perl source to diff against. 281The porters prefer unified diffs. Using GNU C<diff>, you can produce a 282diff like this: 283 284 % diff -Npurd perl.pristine perl.mine 285 286Make sure that you C<make realclean> in your copy of Perl to remove any 287build artifacts, or you may get a confusing result. 288 289=head3 Commit message 290 291As you craft each patch you intend to submit to the Perl core, it's 292important to write a good commit message. This is especially important 293if your submission will consist of a series of commits. 294 295The first line of the commit message should be a short description 296without a period. It should be no longer than the subject line of an 297email, 50 characters being a good rule of thumb. 298 299A lot of Git tools (Gitweb, GitHub, git log --pretty=oneline, ...) will 300only display the first line (cut off at 50 characters) when presenting 301commit summaries. 302 303The commit message should include a description of the problem that the 304patch corrects or new functionality that the patch adds. 305 306As a general rule of thumb, your commit message should help a 307programmer who knows the Perl core quickly understand what you were 308trying to do, how you were trying to do it, and why the change matters 309to Perl. 310 311=over 4 312 313=item * Why 314 315Your commit message should describe why the change you are making is 316important. When someone looks at your change in six months or six 317years, your intent should be clear. 318 319If you're deprecating a feature with the intent of later simplifying 320another bit of code, say so. If you're fixing a performance problem or 321adding a new feature to support some other bit of the core, mention 322that. 323 324=item * What 325 326Your commit message should describe what part of the Perl core you're 327changing and what you expect your patch to do. 328 329=item * How 330 331While it's not necessary for documentation changes, new tests or 332trivial patches, it's often worth explaining how your change works. 333Even if it's clear to you today, it may not be clear to a porter next 334month or next year. 335 336=back 337 338A commit message isn't intended to take the place of comments in your 339code. Commit messages should describe the change you made, while code 340comments should describe the current state of the code. 341 342If you've just implemented a new feature, complete with doc, tests and 343well-commented code, a brief commit message will often suffice. If, 344however, you've just changed a single character deep in the parser or 345lexer, you might need to write a small novel to ensure that future 346readers understand what you did and why you did it. 347 348=head3 Comments, Comments, Comments 349 350Be sure to adequately comment your code. While commenting every line 351is unnecessary, anything that takes advantage of side effects of 352operators, that creates changes that will be felt outside of the 353function being patched, or that others may find confusing should be 354documented. If you are going to err, it is better to err on the side 355of adding too many comments than too few. 356 357The best comments explain I<why> the code does what it does, not I<what 358it does>. 359 360=head3 Style 361 362In general, please follow the particular style of the code you are 363patching. 364 365In particular, follow these general guidelines for patching Perl 366sources: 367 368=over 4 369 370=item * 371 3724-wide indents for code, 2-wide indents for nested CPP C<#define>s, 373with 8-wide tabstops. 374 375=item * 376 377Use spaces for indentation, not tab characters. 378 379The codebase is a mixture of tabs and spaces for indentation, and we 380are moving to spaces only. Converting lines you're patching from 8-wide 381tabs to spaces will help this migration. 382 383=item * 384 385Try hard not to exceed 79-columns 386 387=item * 388 389ANSI C prototypes 390 391=item * 392 393Uncuddled elses and "K&R" style for indenting control constructs 394 395=item * 396 397No C++ style (//) comments 398 399=item * 400 401Mark places that need to be revisited with XXX (and revisit often!) 402 403=item * 404 405Opening brace lines up with "if" when conditional spans multiple lines; 406should be at end-of-line otherwise 407 408=item * 409 410In function definitions, name starts in column 0 (return value-type is on 411previous line) 412 413=item * 414 415Single space after keywords that are followed by parens, no space 416between function name and following paren 417 418=item * 419 420Avoid assignments in conditionals, but if they're unavoidable, use 421extra paren, e.g. "if (a && (b = c)) ..." 422 423=item * 424 425"return foo;" rather than "return(foo);" 426 427=item * 428 429"if (!foo) ..." rather than "if (foo == FALSE) ..." etc. 430 431=item * 432 433Do not declare variables using "register". It may be counterproductive 434with modern compilers, and is deprecated in C++, under which the Perl 435source is regularly compiled. 436 437=item * 438 439In-line functions that are in headers that are accessible to XS code 440need to be able to compile without warnings with commonly used extra 441compilation flags, such as gcc's C<-Wswitch-default> which warns 442whenever a switch statement does not have a "default" case. The use of 443these extra flags is to catch potential problems in legal C code, and 444is often used by Perl aggregators, such as Linux distributors. 445 446=back 447 448=head3 Test suite 449 450If your patch changes code (rather than just changing documentation), 451you should also include one or more test cases which illustrate the bug 452you're fixing or validate the new functionality you're adding. In 453general, you should update an existing test file rather than create a 454new one. 455 456Your test suite additions should generally follow these guidelines 457(courtesy of Gurusamy Sarathy <gsar@activestate.com>): 458 459=over 4 460 461=item * 462 463Know what you're testing. Read the docs, and the source. 464 465=item * 466 467Tend to fail, not succeed. 468 469=item * 470 471Interpret results strictly. 472 473=item * 474 475Use unrelated features (this will flush out bizarre interactions). 476 477=item * 478 479Use non-standard idioms (otherwise you are not testing TIMTOWTDI). 480 481=item * 482 483Avoid using hardcoded test numbers whenever possible (the EXPECTED/GOT 484found in t/op/tie.t is much more maintainable, and gives better failure 485reports). 486 487=item * 488 489Give meaningful error messages when a test fails. 490 491=item * 492 493Avoid using qx// and system() unless you are testing for them. If you 494do use them, make sure that you cover _all_ perl platforms. 495 496=item * 497 498Unlink any temporary files you create. 499 500=item * 501 502Promote unforeseen warnings to errors with $SIG{__WARN__}. 503 504=item * 505 506Be sure to use the libraries and modules shipped with the version being 507tested, not those that were already installed. 508 509=item * 510 511Add comments to the code explaining what you are testing for. 512 513=item * 514 515Make updating the '1..42' string unnecessary. Or make sure that you 516update it. 517 518=item * 519 520Test _all_ behaviors of a given operator, library, or function. 521 522Test all optional arguments. 523 524Test return values in various contexts (boolean, scalar, list, lvalue). 525 526Use both global and lexical variables. 527 528Don't forget the exceptional, pathological cases. 529 530=back 531 532=head2 Patching a core module 533 534This works just like patching anything else, with one extra 535consideration. 536 537Modules in the F<cpan/> directory of the source tree are maintained 538outside of the Perl core. When the author updates the module, the 539updates are simply copied into the core. See that module's 540documentation or its listing on L<http://search.cpan.org/> for more 541information on reporting bugs and submitting patches. 542 543In most cases, patches to modules in F<cpan/> should be sent upstream 544and should not be applied to the Perl core individually. If a patch to 545a file in F<cpan/> absolutely cannot wait for the fix to be made 546upstream, released to CPAN and copied to blead, you must add (or 547update) a C<CUSTOMIZED> entry in the F<"Porting/Maintainers.pl"> file 548to flag that a local modification has been made. See 549F<"Porting/Maintainers.pl"> for more details. 550 551In contrast, modules in the F<dist/> directory are maintained in the 552core. 553 554=head2 Updating perldelta 555 556For changes significant enough to warrant a F<pod/perldelta.pod> entry, 557the porters will greatly appreciate it if you submit a delta entry 558along with your actual change. Significant changes include, but are 559not limited to: 560 561=over 4 562 563=item * 564 565Adding, deprecating, or removing core features 566 567=item * 568 569Adding, deprecating, removing, or upgrading core or dual-life modules 570 571=item * 572 573Adding new core tests 574 575=item * 576 577Fixing security issues and user-visible bugs in the core 578 579=item * 580 581Changes that might break existing code, either on the perl or C level 582 583=item * 584 585Significant performance improvements 586 587=item * 588 589Adding, removing, or significantly changing documentation in the 590F<pod/> directory 591 592=item * 593 594Important platform-specific changes 595 596=back 597 598Please make sure you add the perldelta entry to the right section 599within F<pod/perldelta.pod>. More information on how to write good 600perldelta entries is available in the C<Style> section of 601F<Porting/how_to_write_a_perldelta.pod>. 602 603=head2 What makes for a good patch? 604 605New features and extensions to the language can be contentious. There 606is no specific set of criteria which determine what features get added, 607but here are some questions to consider when developing a patch: 608 609=head3 Does the concept match the general goals of Perl? 610 611Our goals include, but are not limited to: 612 613=over 4 614 615=item 1. 616 617Keep it fast, simple, and useful. 618 619=item 2. 620 621Keep features/concepts as orthogonal as possible. 622 623=item 3. 624 625No arbitrary limits (platforms, data sizes, cultures). 626 627=item 4. 628 629Keep it open and exciting to use/patch/advocate Perl everywhere. 630 631=item 5. 632 633Either assimilate new technologies, or build bridges to them. 634 635=back 636 637=head3 Where is the implementation? 638 639All the talk in the world is useless without an implementation. In 640almost every case, the person or people who argue for a new feature 641will be expected to be the ones who implement it. Porters capable of 642coding new features have their own agendas, and are not available to 643implement your (possibly good) idea. 644 645=head3 Backwards compatibility 646 647It's a cardinal sin to break existing Perl programs. New warnings can 648be contentious--some say that a program that emits warnings is not 649broken, while others say it is. Adding keywords has the potential to 650break programs, changing the meaning of existing token sequences or 651functions might break programs. 652 653The Perl 5 core includes mechanisms to help porters make backwards 654incompatible changes more compatible such as the L<feature> and 655L<deprecate> modules. Please use them when appropriate. 656 657=head3 Could it be a module instead? 658 659Perl 5 has extension mechanisms, modules and XS, specifically to avoid 660the need to keep changing the Perl interpreter. You can write modules 661that export functions, you can give those functions prototypes so they 662can be called like built-in functions, you can even write XS code to 663mess with the runtime data structures of the Perl interpreter if you 664want to implement really complicated things. 665 666Whenever possible, new features should be prototyped in a CPAN module 667before they will be considered for the core. 668 669=head3 Is the feature generic enough? 670 671Is this something that only the submitter wants added to the language, 672or is it broadly useful? Sometimes, instead of adding a feature with a 673tight focus, the porters might decide to wait until someone implements 674the more generalized feature. 675 676=head3 Does it potentially introduce new bugs? 677 678Radical rewrites of large chunks of the Perl interpreter have the 679potential to introduce new bugs. 680 681=head3 How big is it? 682 683The smaller and more localized the change, the better. Similarly, a 684series of small patches is greatly preferred over a single large patch. 685 686=head3 Does it preclude other desirable features? 687 688A patch is likely to be rejected if it closes off future avenues of 689development. For instance, a patch that placed a true and final 690interpretation on prototypes is likely to be rejected because there are 691still options for the future of prototypes that haven't been addressed. 692 693=head3 Is the implementation robust? 694 695Good patches (tight code, complete, correct) stand more chance of going 696in. Sloppy or incorrect patches might be placed on the back burner 697until the pumpking has time to fix, or might be discarded altogether 698without further notice. 699 700=head3 Is the implementation generic enough to be portable? 701 702The worst patches make use of system-specific features. It's highly 703unlikely that non-portable additions to the Perl language will be 704accepted. 705 706=head3 Is the implementation tested? 707 708Patches which change behaviour (fixing bugs or introducing new 709features) must include regression tests to verify that everything works 710as expected. 711 712Without tests provided by the original author, how can anyone else 713changing perl in the future be sure that they haven't unwittingly 714broken the behaviour the patch implements? And without tests, how can 715the patch's author be confident that his/her hard work put into the 716patch won't be accidentally thrown away by someone in the future? 717 718=head3 Is there enough documentation? 719 720Patches without documentation are probably ill-thought out or 721incomplete. No features can be added or changed without documentation, 722so submitting a patch for the appropriate pod docs as well as the 723source code is important. 724 725=head3 Is there another way to do it? 726 727Larry said "Although the Perl Slogan is I<There's More Than One Way to 728Do It>, I hesitate to make 10 ways to do something". This is a tricky 729heuristic to navigate, though--one man's essential addition is another 730man's pointless cruft. 731 732=head3 Does it create too much work? 733 734Work for the pumpking, work for Perl programmers, work for module 735authors, ... Perl is supposed to be easy. 736 737=head3 Patches speak louder than words 738 739Working code is always preferred to pie-in-the-sky ideas. A patch to 740add a feature stands a much higher chance of making it to the language 741than does a random feature request, no matter how fervently argued the 742request might be. This ties into "Will it be useful?", as the fact 743that someone took the time to make the patch demonstrates a strong 744desire for the feature. 745 746=head1 TESTING 747 748The core uses the same testing style as the rest of Perl, a simple 749"ok/not ok" run through Test::Harness, but there are a few special 750considerations. 751 752There are three ways to write a test in the core: L<Test::More>, 753F<t/test.pl> and ad hoc C<print $test ? "ok 42\n" : "not ok 42\n">. 754The decision of which to use depends on what part of the test suite 755you're working on. This is a measure to prevent a high-level failure 756(such as Config.pm breaking) from causing basic functionality tests to 757fail. 758 759The F<t/test.pl> library provides some of the features of 760L<Test::More>, but avoids loading most modules and uses as few core 761features as possible. 762 763If you write your own test, use the L<Test Anything 764Protocol|http://testanything.org>. 765 766=over 4 767 768=item * F<t/base>, F<t/comp> and F<t/opbasic> 769 770Since we don't know if C<require> works, or even subroutines, use ad hoc 771tests for these three. Step carefully to avoid using the feature being 772tested. Tests in F<t/opbasic>, for instance, have been placed there 773rather than in F<t/op> because they test functionality which 774F<t/test.pl> presumes has already been demonstrated to work. 775 776=item * F<t/cmd>, F<t/run>, F<t/io> and F<t/op> 777 778Now that basic require() and subroutines are tested, you can use the 779F<t/test.pl> library. 780 781You can also use certain libraries like Config conditionally, but be 782sure to skip the test gracefully if it's not there. 783 784=item * Everything else 785 786Now that the core of Perl is tested, L<Test::More> can and should be 787used. You can also use the full suite of core modules in the tests. 788 789=back 790 791When you say "make test", Perl uses the F<t/TEST> program to run the 792test suite (except under Win32 where it uses F<t/harness> instead). 793All tests are run from the F<t/> directory, B<not> the directory which 794contains the test. This causes some problems with the tests in 795F<lib/>, so here's some opportunity for some patching. 796 797You must be triply conscious of cross-platform concerns. This usually 798boils down to using L<File::Spec>, avoiding things like C<fork()> 799and C<system()> unless absolutely necessary, and not assuming that a 800given character has a particular ordinal value (code point) or that its 801UTF-8 representation is composed of particular bytes. 802 803There are several functions available to specify characters and code 804points portably in tests. The always-preloaded functions 805C<utf8::unicode_to_native()> and its inverse 806C<utf8::native_to_unicode()> take code points and translate 807appropriately. The file F<t/charset_tools.pl> has several functions 808that can be useful. It has versions of the previous two functions 809that take strings as inputs -- not single numeric code points: 810C<uni_to_native()> and C<native_to_uni()>. If you must look at the 811individual bytes comprising a UTF-8 encoded string, 812C<byte_utf8a_to_utf8n()> takes as input a string of those bytes encoded 813for an ASCII platform, and returns the equivalent string in the native 814platform. For example, C<byte_utf8a_to_utf8n("\xC2\xA0")> returns the 815byte sequence on the current platform that form the UTF-8 for C<U+00A0>, 816since C<"\xC2\xA0"> are the UTF-8 bytes on an ASCII platform for that 817code point. This function returns C<"\xC2\xA0"> on an ASCII platform, and 818C<"\x80\x41"> on an EBCDIC 1047 one. 819 820But easiest is, if the character is specifiable as a literal, like 821C<"A"> or C<"%">, to use that; if not so specificable, you can use use 822C<\N{}> , if the side effects aren't troublesome. Simply specify all 823your characters in hex, using C<\N{U+ZZ}> instead of C<\xZZ>. C<\N{}> 824is the Unicode name, and so it 825always gives you the Unicode character. C<\N{U+41}> is the character 826whose Unicode code point is C<0x41>, hence is C<'A'> on all platforms. 827The side effects are: 828 829=over 4 830 831=item * 832 833These select Unicode rules. That means that in double-quotish strings, 834the string is always converted to UTF-8 to force a Unicode 835interpretation (you can C<utf8::downgrade()> afterwards to convert back 836to non-UTF8, if possible). In regular expression patterns, the 837conversion isn't done, but if the character set modifier would 838otherwise be C</d>, it is changed to C</u>. 839 840=item * 841 842If you use the form C<\N{I<character name>}>, the L<charnames> module 843gets automatically loaded. This may not be suitable for the test level 844you are doing. 845 846=back 847 848If you are testing locales (see L<perllocale>), there are helper 849functions in F<t/loc_tools.pl> to enable you to see what locales there 850are on the current platform. 851 852=head2 Special C<make test> targets 853 854There are various special make targets that can be used to test Perl 855slightly differently than the standard "test" target. Not all them are 856expected to give a 100% success rate. Many of them have several 857aliases, and many of them are not available on certain operating 858systems. 859 860=over 4 861 862=item * test_porting 863 864This runs some basic sanity tests on the source tree and helps catch 865basic errors before you submit a patch. 866 867=item * minitest 868 869Run F<miniperl> on F<t/base>, F<t/comp>, F<t/cmd>, F<t/run>, F<t/io>, 870F<t/op>, F<t/uni> and F<t/mro> tests. 871 872=item * test.valgrind check.valgrind 873 874(Only in Linux) Run all the tests using the memory leak + naughty 875memory access tool "valgrind". The log files will be named 876F<testname.valgrind>. 877 878=item * test_harness 879 880Run the test suite with the F<t/harness> controlling program, instead 881of F<t/TEST>. F<t/harness> is more sophisticated, and uses the 882L<Test::Harness> module, thus using this test target supposes that perl 883mostly works. The main advantage for our purposes is that it prints a 884detailed summary of failed tests at the end. Also, unlike F<t/TEST>, 885it doesn't redirect stderr to stdout. 886 887Note that under Win32 F<t/harness> is always used instead of F<t/TEST>, 888so there is no special "test_harness" target. 889 890Under Win32's "test" target you may use the TEST_SWITCHES and 891TEST_FILES environment variables to control the behaviour of 892F<t/harness>. This means you can say 893 894 nmake test TEST_FILES="op/*.t" 895 nmake test TEST_SWITCHES="-torture" TEST_FILES="op/*.t" 896 897=item * test-notty test_notty 898 899Sets PERL_SKIP_TTY_TEST to true before running normal test. 900 901=back 902 903=head2 Parallel tests 904 905The core distribution can now run its regression tests in parallel on 906Unix-like platforms. Instead of running C<make test>, set C<TEST_JOBS> 907in your environment to the number of tests to run in parallel, and run 908C<make test_harness>. On a Bourne-like shell, this can be done as 909 910 TEST_JOBS=3 make test_harness # Run 3 tests in parallel 911 912An environment variable is used, rather than parallel make itself, 913because L<TAP::Harness> needs to be able to schedule individual 914non-conflicting test scripts itself, and there is no standard interface 915to C<make> utilities to interact with their job schedulers. 916 917Note that currently some test scripts may fail when run in parallel 918(most notably F<dist/IO/t/io_dir.t>). If necessary, run just the 919failing scripts again sequentially and see if the failures go away. 920 921=head2 Running tests by hand 922 923You can run part of the test suite by hand by using one of the 924following commands from the F<t/> directory: 925 926 ./perl -I../lib TEST list-of-.t-files 927 928or 929 930 ./perl -I../lib harness list-of-.t-files 931 932(If you don't specify test scripts, the whole test suite will be run.) 933 934=head2 Using F<t/harness> for testing 935 936If you use C<harness> for testing, you have several command line 937options available to you. The arguments are as follows, and are in the 938order that they must appear if used together. 939 940 harness -v -torture -re=pattern LIST OF FILES TO TEST 941 harness -v -torture -re LIST OF PATTERNS TO MATCH 942 943If C<LIST OF FILES TO TEST> is omitted, the file list is obtained from 944the manifest. The file list may include shell wildcards which will be 945expanded out. 946 947=over 4 948 949=item * -v 950 951Run the tests under verbose mode so you can see what tests were run, 952and debug output. 953 954=item * -torture 955 956Run the torture tests as well as the normal set. 957 958=item * -re=PATTERN 959 960Filter the file list so that all the test files run match PATTERN. 961Note that this form is distinct from the B<-re LIST OF PATTERNS> form 962below in that it allows the file list to be provided as well. 963 964=item * -re LIST OF PATTERNS 965 966Filter the file list so that all the test files run match 967/(LIST|OF|PATTERNS)/. Note that with this form the patterns are joined 968by '|' and you cannot supply a list of files, instead the test files 969are obtained from the MANIFEST. 970 971=back 972 973You can run an individual test by a command similar to 974 975 ./perl -I../lib path/to/foo.t 976 977except that the harnesses set up some environment variables that may 978affect the execution of the test: 979 980=over 4 981 982=item * PERL_CORE=1 983 984indicates that we're running this test as part of the perl core test 985suite. This is useful for modules that have a dual life on CPAN. 986 987=item * PERL_DESTRUCT_LEVEL=2 988 989is set to 2 if it isn't set already (see 990L<perlhacktips/PERL_DESTRUCT_LEVEL>). 991 992=item * PERL 993 994(used only by F<t/TEST>) if set, overrides the path to the perl 995executable that should be used to run the tests (the default being 996F<./perl>). 997 998=item * PERL_SKIP_TTY_TEST 999 1000if set, tells to skip the tests that need a terminal. It's actually 1001set automatically by the Makefile, but can also be forced artificially 1002by running 'make test_notty'. 1003 1004=back 1005 1006=head3 Other environment variables that may influence tests 1007 1008=over 4 1009 1010=item * PERL_TEST_Net_Ping 1011 1012Setting this variable runs all the Net::Ping modules tests, otherwise 1013some tests that interact with the outside world are skipped. See 1014L<perl58delta>. 1015 1016=item * PERL_TEST_NOVREXX 1017 1018Setting this variable skips the vrexx.t tests for OS2::REXX. 1019 1020=item * PERL_TEST_NUMCONVERTS 1021 1022This sets a variable in op/numconvert.t. 1023 1024=item * PERL_TEST_MEMORY 1025 1026Setting this variable includes the tests in F<t/bigmem/>. This should 1027be set to the number of gigabytes of memory available for testing, eg. 1028C<PERL_TEST_MEMORY=4> indicates that tests that require 4GiB of 1029available memory can be run safely. 1030 1031=back 1032 1033See also the documentation for the Test and Test::Harness modules, for 1034more environment variables that affect testing. 1035 1036=head2 Performance testing 1037 1038The file F<t/perf/benchmarks> contains snippets of perl code which are 1039intended to be benchmarked across a range of perls by the 1040F<Porting/bench.pl> tool. If you fix or enhance a performance issue, you 1041may want to add a representative code sample to the file, then run 1042F<bench.pl> against the previous and current perls to see what difference 1043it has made, and whether anything else has slowed down as a consequence. 1044 1045The file F<t/perf/opcount.t> is designed to test whether a particular 1046code snippet has been compiled into an optree containing specified 1047numbers of particular op types. This is good for testing whether 1048optimisations which alter ops, such as converting an C<aelem> op into an 1049C<aelemfast> op, are really doing that. 1050 1051The files F<t/perf/speed.t> and F<t/re/speed.t> are designed to test 1052things that run thousands of times slower if a particular optimisation 1053is broken (for example, the utf8 length cache on long utf8 strings). 1054Add a test that will take a fraction of a second normally, and minutes 1055otherwise, causing the test file to time out on failure. 1056 1057=head2 Building perl at older commits 1058 1059In the course of hacking on the Perl core distribution, you may have occasion 1060to configure, build and test perl at an old commit. Sometimes C<make> will 1061fail during this process. If that happens, you may be able to salvage the 1062situation by using the Devel::PatchPerl library from CPAN (not included in the 1063core) to bring the source code at that commit to a buildable state. 1064 1065Here's a real world example, taken from work done to resolve 1066L<perl #72414|https://rt.perl.org/Ticket/Display.html?id=72414>. 1067Use of F<Porting/bisect.pl> had identified commit 1068C<ba77e4cc9d1ceebf472c9c5c18b2377ee47062e6> as the commit in which a bug was 1069corrected. To confirm, a P5P developer wanted to configure and build perl at 1070commit C<ba77e4c^> (presumably "bad") and then at C<ba77e4c> (presumably 1071"good"). Normal configuration and build was attempted: 1072 1073 $ sh ./Configure -des -Dusedevel 1074 $ make test_prep 1075 1076C<make>, however, failed with output (excerpted) like this: 1077 1078 cc -fstack-protector -L/usr/local/lib -o miniperl \ 1079 gv.o toke.o perly.o pad.o regcomp.o dump.o util.o \ 1080 mg.o reentr.o mro.o hv.o av.o run.o pp_hot.o sv.o \ 1081 pp.o scope.o pp_ctl.o pp_sys.o doop.o doio.o regexec.o \ 1082 utf8.o taint.o deb.o universal.o globals.o perlio.o \ 1083 perlapi.o numeric.o mathoms.o locale.o pp_pack.o pp_sort.o \ 1084 miniperlmain.o opmini.o perlmini.o 1085 pp.o: In function `Perl_pp_pow': 1086 pp.c:(.text+0x2db9): undefined reference to `pow' 1087 ... 1088 collect2: error: ld returned 1 exit status 1089 makefile:348: recipe for target 'miniperl' failed 1090 make: *** [miniperl] Error 1 1091 1092Another P5P contributor recommended installation and use of Devel::PatchPerl 1093for this situation, first to determine the version of perl at the commit in 1094question, then to patch the source code at that point to facilitate a build. 1095 1096 $ perl -MDevel::PatchPerl -e \ 1097 'print Devel::PatchPerl->determine_version("/path/to/sourcecode"), "\n";' 1098 5.11.1 1099 $ perl -MDevel::PatchPerl -e \ 1100 'Devel::PatchPerl->patch_source("5.11.1", "/path/to/sourcecode");' 1101 1102Once the source was patched, C<./Configure> and C<make test_prep> were called 1103and completed successfully, enabling confirmation of the findings in RT 1104#72414. 1105 1106=head1 MORE READING FOR GUTS HACKERS 1107 1108To hack on the Perl guts, you'll need to read the following things: 1109 1110=over 4 1111 1112=item * L<perlsource> 1113 1114An overview of the Perl source tree. This will help you find the files 1115you're looking for. 1116 1117=item * L<perlinterp> 1118 1119An overview of the Perl interpreter source code and some details on how 1120Perl does what it does. 1121 1122=item * L<perlhacktut> 1123 1124This document walks through the creation of a small patch to Perl's C 1125code. If you're just getting started with Perl core hacking, this will 1126help you understand how it works. 1127 1128=item * L<perlhacktips> 1129 1130More details on hacking the Perl core. This document focuses on lower 1131level details such as how to write tests, compilation issues, 1132portability, debugging, etc. 1133 1134If you plan on doing serious C hacking, make sure to read this. 1135 1136=item * L<perlguts> 1137 1138This is of paramount importance, since it's the documentation of what 1139goes where in the Perl source. Read it over a couple of times and it 1140might start to make sense - don't worry if it doesn't yet, because the 1141best way to study it is to read it in conjunction with poking at Perl 1142source, and we'll do that later on. 1143 1144Gisle Aas's "illustrated perlguts", also known as I<illguts>, has very 1145helpful pictures: 1146 1147L<http://search.cpan.org/dist/illguts/> 1148 1149=item * L<perlxstut> and L<perlxs> 1150 1151A working knowledge of XSUB programming is incredibly useful for core 1152hacking; XSUBs use techniques drawn from the PP code, the portion of 1153the guts that actually executes a Perl program. It's a lot gentler to 1154learn those techniques from simple examples and explanation than from 1155the core itself. 1156 1157=item * L<perlapi> 1158 1159The documentation for the Perl API explains what some of the internal 1160functions do, as well as the many macros used in the source. 1161 1162=item * F<Porting/pumpkin.pod> 1163 1164This is a collection of words of wisdom for a Perl porter; some of it 1165is only useful to the pumpkin holder, but most of it applies to anyone 1166wanting to go about Perl development. 1167 1168=back 1169 1170=head1 CPAN TESTERS AND PERL SMOKERS 1171 1172The CPAN testers ( L<http://testers.cpan.org/> ) are a group of volunteers 1173who test CPAN modules on a variety of platforms. 1174 1175Perl Smokers ( L<http://www.nntp.perl.org/group/perl.daily-build/> and 1176L<http://www.nntp.perl.org/group/perl.daily-build.reports/> ) 1177automatically test Perl source releases on platforms with various 1178configurations. 1179 1180Both efforts welcome volunteers. In order to get involved in smoke 1181testing of the perl itself visit 1182L<http://search.cpan.org/dist/Test-Smoke/>. In order to start smoke 1183testing CPAN modules visit 1184L<http://search.cpan.org/dist/CPANPLUS-YACSmoke/> or 1185L<http://search.cpan.org/dist/minismokebox/> or 1186L<http://search.cpan.org/dist/CPAN-Reporter/>. 1187 1188=head1 WHAT NEXT? 1189 1190If you've read all the documentation in the document and the ones 1191listed above, you're more than ready to hack on Perl. 1192 1193Here's some more recommendations 1194 1195=over 4 1196 1197=item * 1198 1199Subscribe to perl5-porters, follow the patches and try and understand 1200them; don't be afraid to ask if there's a portion you're not clear on - 1201who knows, you may unearth a bug in the patch... 1202 1203=item * 1204 1205Do read the README associated with your operating system, e.g. 1206README.aix on the IBM AIX OS. Don't hesitate to supply patches to that 1207README if you find anything missing or changed over a new OS release. 1208 1209=item * 1210 1211Find an area of Perl that seems interesting to you, and see if you can 1212work out how it works. Scan through the source, and step over it in 1213the debugger. Play, poke, investigate, fiddle! You'll probably get to 1214understand not just your chosen area but a much wider range of 1215F<perl>'s activity as well, and probably sooner than you'd think. 1216 1217=back 1218 1219=head2 "The Road goes ever on and on, down from the door where it began." 1220 1221If you can do these things, you've started on the long road to Perl 1222porting. Thanks for wanting to help make Perl better - and happy 1223hacking! 1224 1225=head2 Metaphoric Quotations 1226 1227If you recognized the quote about the Road above, you're in luck. 1228 1229Most software projects begin each file with a literal description of 1230each file's purpose. Perl instead begins each with a literary allusion 1231to that file's purpose. 1232 1233Like chapters in many books, all top-level Perl source files (along 1234with a few others here and there) begin with an epigrammatic 1235inscription that alludes, indirectly and metaphorically, to the 1236material you're about to read. 1237 1238Quotations are taken from writings of J.R.R. Tolkien pertaining to his 1239Legendarium, almost always from I<The Lord of the Rings>. Chapters and 1240page numbers are given using the following editions: 1241 1242=over 4 1243 1244=item * 1245 1246I<The Hobbit>, by J.R.R. Tolkien. The hardcover, 70th-anniversary 1247edition of 2007 was used, published in the UK by Harper Collins 1248Publishers and in the US by the Houghton Mifflin Company. 1249 1250=item * 1251 1252I<The Lord of the Rings>, by J.R.R. Tolkien. The hardcover, 125350th-anniversary edition of 2004 was used, published in the UK by 1254Harper Collins Publishers and in the US by the Houghton Mifflin 1255Company. 1256 1257=item * 1258 1259I<The Lays of Beleriand>, by J.R.R. Tolkien and published posthumously 1260by his son and literary executor, C.J.R. Tolkien, being the 3rd of the 126112 volumes in Christopher's mammoth I<History of Middle Earth>. Page 1262numbers derive from the hardcover edition, first published in 1983 by 1263George Allen & Unwin; no page numbers changed for the special 3-volume 1264omnibus edition of 2002 or the various trade-paper editions, all again 1265now by Harper Collins or Houghton Mifflin. 1266 1267=back 1268 1269Other JRRT books fair game for quotes would thus include I<The 1270Adventures of Tom Bombadil>, I<The Silmarillion>, I<Unfinished Tales>, 1271and I<The Tale of the Children of Hurin>, all but the first 1272posthumously assembled by CJRT. But I<The Lord of the Rings> itself is 1273perfectly fine and probably best to quote from, provided you can find a 1274suitable quote there. 1275 1276So if you were to supply a new, complete, top-level source file to add 1277to Perl, you should conform to this peculiar practice by yourself 1278selecting an appropriate quotation from Tolkien, retaining the original 1279spelling and punctuation and using the same format the rest of the 1280quotes are in. Indirect and oblique is just fine; remember, it's a 1281metaphor, so being meta is, after all, what it's for. 1282 1283=head1 AUTHOR 1284 1285This document was originally written by Nathan Torkington, and is 1286maintained by the perl5-porters mailing list. 1287