1=begin comment 2 3# !!!!!!! DO NOT EDIT THIS FILE !!!!!!! 4# This file is machine-generated by lib/unicore/mktables from the Unicode 5# database, Version 12.1.0. Any changes made here will be lost! 6 7 8To change this file, edit lib/unicore/mktables instead. 9 10=end comment 11 12=head1 NAME 13 14perluniprops - Index of Unicode Version 12.1.0 character properties in Perl 15 16=head1 DESCRIPTION 17 18This document provides information about the portion of the Unicode database 19that deals with character properties, that is the portion that is defined on 20single code points. (L</Other information in the Unicode data base> 21below briefly mentions other data that Unicode provides.) 22 23Perl can provide access to all non-provisional Unicode character properties, 24though not all are enabled by default. The omitted ones are the Unihan 25properties (accessible via the CPAN module L<Unicode::Unihan>) and certain 26deprecated or Unicode-internal properties. (An installation may choose to 27recompile Perl's tables to change this. See L</Unicode character 28properties that are NOT accepted by Perl>.) 29 30For most purposes, access to Unicode properties from the Perl core is through 31regular expression matches, as described in the next section. 32For some special purposes, and to access the properties that are not suitable 33for regular expression matching, all the Unicode character properties that 34Perl handles are accessible via the standard L<Unicode::UCD> module, as 35described in the section L</Properties accessible through Unicode::UCD>. 36 37Perl also provides some additional extensions and short-cut synonyms 38for Unicode properties. 39 40This document merely lists all available properties and does not attempt to 41explain what each property really means. There is a brief description of each 42Perl extension; see L<perlunicode/Other Properties> for more information on 43these. There is some detail about Blocks, Scripts, General_Category, 44and Bidi_Class in L<perlunicode>, but to find out about the intricacies of the 45official Unicode properties, refer to the Unicode standard. A good starting 46place is L<http://www.unicode.org/reports/tr44/>. 47 48Note that you can define your own properties; see 49L<perlunicode/"User-Defined Character Properties">. 50 51=head1 Properties accessible through C<\p{}> and C<\P{}> 52 53The Perl regular expression C<\p{}> and C<\P{}> constructs give access to 54most of the Unicode character properties. The table below shows all these 55constructs, both single and compound forms. 56 57B<Compound forms> consist of two components, separated by an equals sign or a 58colon. The first component is the property name, and the second component is 59the particular value of the property to match against, for example, 60C<\p{Script_Extensions: Greek}> and C<\p{Script_Extensions=Greek}> both mean 61to match characters whose Script_Extensions property value is Greek. 62(C<Script_Extensions> is an improved version of the C<Script> property.) 63 64B<Single forms>, like C<\p{Greek}>, are mostly Perl-defined shortcuts for 65their equivalent compound forms. The table shows these equivalences. (In our 66example, C<\p{Greek}> is a just a shortcut for 67C<\p{Script_Extensions=Greek}>). There are also a few Perl-defined single 68forms that are not shortcuts for a compound form. One such is C<\p{Word}>. 69These are also listed in the table. 70 71In parsing these constructs, Perl always ignores Upper/lower case differences 72everywhere within the {braces}. Thus C<\p{Greek}> means the same thing as 73C<\p{greek}>. But note that changing the case of the C<"p"> or C<"P"> before 74the left brace completely changes the meaning of the construct, from "match" 75(for C<\p{}>) to "doesn't match" (for C<\P{}>). Casing in this document is 76for improved legibility. 77 78Also, white space, hyphens, and underscores are normally ignored 79everywhere between the {braces}, and hence can be freely added or removed 80even if the C</x> modifier hasn't been specified on the regular expression. 81But in the table below a 'B<T>' at the beginning of an entry 82means that tighter (stricter) rules are used for that entry: 83 84=over 4 85 86=over 4 87 88=item Single form (C<\p{name}>) tighter rules: 89 90White space, hyphens, and underscores ARE significant 91except for: 92 93=over 4 94 95=item * white space adjacent to a non-word character 96 97=item * underscores separating digits in numbers 98 99=back 100 101That means, for example, that you can freely add or remove white space 102adjacent to (but within) the braces without affecting the meaning. 103 104=item Compound form (C<\p{name=value}> or C<\p{name:value}>) tighter rules: 105 106The tighter rules given above for the single form apply to everything to the 107right of the colon or equals; the looser rules still apply to everything to 108the left. 109 110That means, for example, that you can freely add or remove white space 111adjacent to (but within) the braces and the colon or equal sign. 112 113=back 114 115=back 116 117Some properties are considered obsolete by Unicode, but still available. 118There are several varieties of obsolescence: 119 120=over 4 121 122=over 4 123 124=item Stabilized 125 126A property may be stabilized. Such a determination does not indicate 127that the property should or should not be used; instead it is a declaration 128that the property will not be maintained nor extended for newly encoded 129characters. Such properties are marked with an 'B<S>' in the 130table. 131 132=item Deprecated 133 134A property may be deprecated, perhaps because its original intent 135has been replaced by another property, or because its specification was 136somehow defective. This means that its use is strongly 137discouraged, so much so that a warning will be issued if used, unless the 138regular expression is in the scope of a C<S<no warnings 'deprecated'>> 139statement. A 'B<D>' flags each such entry in the table, and 140the entry there for the longest, most descriptive version of the property will 141give the reason it is deprecated, and perhaps advice. Perl may issue such a 142warning, even for properties that aren't officially deprecated by Unicode, 143when there used to be characters or code points that were matched by them, but 144no longer. This is to warn you that your program may not work like it did on 145earlier Unicode releases. 146 147A deprecated property may be made unavailable in a future Perl version, so it 148is best to move away from them. 149 150A deprecated property may also be stabilized, but this fact is not shown. 151 152=item Obsolete 153 154Properties marked with an 'B<O>' in the table are considered (plain) 155obsolete. Generally this designation is given to properties that Unicode once 156used for internal purposes (but not any longer). 157 158=item Discouraged 159 160This is not actually a Unicode-specified obsolescence, but applies to certain 161Perl extensions that are present for backwards compatibility, but are 162discouraged from being used. These are not obsolete, but their meanings are 163not stable. Future Unicode versions could force any of these extensions to be 164removed without warning, replaced by another property with the same name that 165means something different. An 'B<X>' flags each such entry in the 166table. Use the equivalent shown instead. 167 168 169In particular, matches in the Block property have single forms 170defined by Perl that begin with C<"In_">, C<"Is_>, or even with no prefix at 171all, Like all B<DISCOURAGED> forms, these are not stable. For example, 172C<\p{Block=Deseret}> can currently be written as C<\p{In_Deseret}>, 173C<\p{Is_Deseret}>, or C<\p{Deseret}>. But, a new Unicode version may 174come along that would force Perl to change the meaning of one or more of 175these, and your program would no longer be correct. Currently there are no 176such conflicts with the form that begins C<"In_">, but there are many with the 177other two shortcuts, and Unicode continues to define new properties that begin 178with C<"In">, so it's quite possible that a conflict will occur in the future. 179The compound form is guaranteed to not become obsolete, and its meaning is 180clearer anyway. See L<perlunicode/"Blocks"> for more information about this. 181 182 183=back 184 185=back 186 187The table below has two columns. The left column contains the C<\p{}> 188constructs to look up, possibly preceded by the flags mentioned above; and 189the right column contains information about them, like a description, or 190synonyms. The table shows both the single and compound forms for each 191property that has them. If the left column is a short name for a property, 192the right column will give its longer, more descriptive name; and if the left 193column is the longest name, the right column will show any equivalent shortest 194name, in both single and compound forms if applicable. 195 196If braces are not needed to specify a property (e.g., C<\pL>), the left 197column contains both forms, with and without braces. 198 199The right column will also caution you if a property means something different 200than what might normally be expected. 201 202All single forms are Perl extensions; a few compound forms are as well, and 203are noted as such. 204 205Numbers in (parentheses) indicate the total number of Unicode code points 206matched by the property. For the entries that give the longest, most 207descriptive version of the property, the count is followed by a list of some 208of the code points matched by it. The list includes all the matched 209characters in the 0-255 range, enclosed in the familiar [brackets] the same as 210a regular expression bracketed character class. Following that, the next few 211higher matching ranges are also given. To avoid visual ambiguity, the SPACE 212character is represented as C<\x20>. 213 214For emphasis, those properties that match no code points at all are listed as 215well in a separate section following the table. 216 217Most properties match the same code points regardless of whether C<"/i"> 218case-insensitive matching is specified or not. But a few properties are 219affected. These are shown with the notation S<C<(/i= I<other_property>)>> 220in the second column. Under case-insensitive matching they match the 221same code pode points as the property I<other_property>. 222 223There is no description given for most non-Perl defined properties (See 224L<http://www.unicode.org/reports/tr44/> for that). 225 226For compactness, 'B<*>' is used as a wildcard instead of showing all possible 227combinations. For example, entries like: 228 229 \p{Gc: *} \p{General_Category: *} 230 231mean that 'Gc' is a synonym for 'General_Category', and anything that is valid 232for the latter is also valid for the former. Similarly, 233 234 \p{Is_*} \p{*} 235 236means that if and only if, for example, C<\p{Foo}> exists, then 237C<\p{Is_Foo}> and C<\p{IsFoo}> are also valid and all mean the same thing. 238And similarly, C<\p{Foo=Bar}> means the same as C<\p{Is_Foo=Bar}> and 239C<\p{IsFoo=Bar}>. "*" here is restricted to something not beginning with an 240underscore. 241 242Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'. 243And 'No', 'F', and 'False' are all synonyms for 'N'. The table shows 'Y*' and 244'N*' to indicate this, and doesn't have separate entries for the other 245possibilities. Note that not all properties which have values 'Yes' and 'No' 246are binary, and they have all their values spelled out without using this wild 247card, and a C<NOT> clause in their description that highlights their not being 248binary. These also require the compound form to match them, whereas true 249binary properties have both single and compound forms available. 250 251Note that all non-essential underscores are removed in the display of the 252short names below. 253 254B<Legend summary:> 255 256=over 4 257 258=item Z<>B<*> is a wild-card 259 260=item B<(\d+)> in the info column gives the number of Unicode code points matched 261by this property. 262 263=item B<D> means this is deprecated. 264 265=item B<O> means this is obsolete. 266 267=item B<S> means this is stabilized. 268 269=item B<T> means tighter (stricter) name matching applies. 270 271=item B<X> means use of this form is discouraged, and may not be 272stable. 273 274=back 275 276 NAME INFO 277 278 \p{Adlam} \p{Script_Extensions=Adlam} (Short: 279 \p{Adlm}; NOT \p{Block=Adlam}) (89) 280 \p{Adlm} \p{Adlam} (= \p{Script_Extensions=Adlam}) 281 (NOT \p{Block=Adlam}) (89) 282 X \p{Aegean_Numbers} \p{Block=Aegean_Numbers} (64) 283 T \p{Age: 1.1} \p{Age=V1_1} (33_979) 284 \p{Age: V1_1} Code point's usage introduced in version 285 1.1 (33_979: U+0000..01F5, U+01FA..0217, 286 U+0250..02A8, U+02B0..02DE, 287 U+02E0..02E9, U+0300..0345 ...) 288 T \p{Age: 2.0} \p{Age=V2_0} (144_521) 289 \p{Age: V2_0} Code point's usage was introduced in 290 version 2.0; See also Property 291 'Present_In' (144_521: U+0591..05A1, 292 U+05A3..05AF, U+05C4, U+0F00..0F47, 293 U+0F49..0F69, U+0F71..0F8B ...) 294 T \p{Age: 2.1} \p{Age=V2_1} (2) 295 \p{Age: V2_1} Code point's usage was introduced in 296 version 2.1; See also Property 297 'Present_In' (2: U+20AC, U+FFFC) 298 T \p{Age: 3.0} \p{Age=V3_0} (10_307) 299 \p{Age: V3_0} Code point's usage was introduced in 300 version 3.0; See also Property 301 'Present_In' (10_307: U+01F6..01F9, 302 U+0218..021F, U+0222..0233, 303 U+02A9..02AD, U+02DF, U+02EA..02EE ...) 304 T \p{Age: 3.1} \p{Age=V3_1} (44_978) 305 \p{Age: V3_1} Code point's usage was introduced in 306 version 3.1; See also Property 307 'Present_In' (44_978: U+03F4..03F5, 308 U+FDD0..FDEF, U+10300..1031E, 309 U+10320..10323, U+10330..1034A, 310 U+10400..10425 ...) 311 T \p{Age: 3.2} \p{Age=V3_2} (1016) 312 \p{Age: V3_2} Code point's usage was introduced in 313 version 3.2; See also Property 314 'Present_In' (1016: U+0220, U+034F, 315 U+0363..036F, U+03D8..03D9, U+03F6, 316 U+048A..048B ...) 317 T \p{Age: 4.0} \p{Age=V4_0} (1226) 318 \p{Age: V4_0} Code point's usage was introduced in 319 version 4.0; See also Property 320 'Present_In' (1226: U+0221, 321 U+0234..0236, U+02AE..02AF, 322 U+02EF..02FF, U+0350..0357, U+035D..035F 323 ...) 324 T \p{Age: 4.1} \p{Age=V4_1} (1273) 325 \p{Age: V4_1} Code point's usage was introduced in 326 version 4.1; See also Property 327 'Present_In' (1273: U+0237..0241, 328 U+0358..035C, U+03FC..03FF, 329 U+04F6..04F7, U+05A2, U+05C5..05C7 ...) 330 T \p{Age: 5.0} \p{Age=V5_0} (1369) 331 \p{Age: V5_0} Code point's usage was introduced in 332 version 5.0; See also Property 333 'Present_In' (1369: U+0242..024F, 334 U+037B..037D, U+04CF, U+04FA..04FF, 335 U+0510..0513, U+05BA ...) 336 T \p{Age: 5.1} \p{Age=V5_1} (1624) 337 \p{Age: V5_1} Code point's usage was introduced in 338 version 5.1; See also Property 339 'Present_In' (1624: U+0370..0373, 340 U+0376..0377, U+03CF, U+0487, 341 U+0514..0523, U+0606..060A ...) 342 T \p{Age: 5.2} \p{Age=V5_2} (6648) 343 \p{Age: V5_2} Code point's usage was introduced in 344 version 5.2; See also Property 345 'Present_In' (6648: U+0524..0525, 346 U+0800..082D, U+0830..083E, U+0900, 347 U+094E, U+0955 ...) 348 T \p{Age: 6.0} \p{Age=V6_0} (2088) 349 \p{Age: V6_0} Code point's usage was introduced in 350 version 6.0; See also Property 351 'Present_In' (2088: U+0526..0527, 352 U+0620, U+065F, U+0840..085B, U+085E, 353 U+093A..093B ...) 354 T \p{Age: 6.1} \p{Age=V6_1} (732) 355 \p{Age: V6_1} Code point's usage was introduced in 356 version 6.1; See also Property 357 'Present_In' (732: U+058F, U+0604, 358 U+08A0, U+08A2..08AC, U+08E4..08FE, 359 U+0AF0 ...) 360 T \p{Age: 6.2} \p{Age=V6_2} (1) 361 \p{Age: V6_2} Code point's usage was introduced in 362 version 6.2; See also Property 363 'Present_In' (1: U+20BA) 364 T \p{Age: 6.3} \p{Age=V6_3} (5) 365 \p{Age: V6_3} Code point's usage was introduced in 366 version 6.3; See also Property 367 'Present_In' (5: U+061C, U+2066..2069) 368 T \p{Age: 7.0} \p{Age=V7_0} (2834) 369 \p{Age: V7_0} Code point's usage was introduced in 370 version 7.0; See also Property 371 'Present_In' (2834: U+037F, 372 U+0528..052F, U+058D..058E, U+0605, 373 U+08A1, U+08AD..08B2 ...) 374 T \p{Age: 8.0} \p{Age=V8_0} (7716) 375 \p{Age: V8_0} Code point's usage was introduced in 376 version 8.0; See also Property 377 'Present_In' (7716: U+08B3..08B4, 378 U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5 379 ...) 380 T \p{Age: 9.0} \p{Age=V9_0} (7500) 381 \p{Age: V9_0} Code point's usage was introduced in 382 version 9.0; See also Property 383 'Present_In' (7500: U+08B6..08BD, 384 U+08D4..08E2, U+0C80, U+0D4F, 385 U+0D54..0D56, U+0D58..0D5E ...) 386 T \p{Age: 10.0} \p{Age=V10_0} (8518) 387 \p{Age: V10_0} Code point's usage was introduced in 388 version 10.0; See also Property 389 'Present_In' (8518: U+0860..086A, 390 U+09FC..09FD, U+0AFA..0AFF, U+0D00, 391 U+0D3B..0D3C, U+1CF7 ...) 392 T \p{Age: 11.0} \p{Age=V11_0} (684) 393 \p{Age: V11_0} Code point's usage was introduced in 394 version 11.0; See also Property 395 'Present_In' (684: U+0560, U+0588, 396 U+05EF, U+07FD..07FF, U+08D3, U+09FE ...) 397 T \p{Age: 12.0} \p{Age=V12_0} (554) 398 \p{Age: V12_0} Code point's usage was introduced in 399 version 12.0; See also Property 400 'Present_In' (554: U+0C77, U+0E86, 401 U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...) 402 T \p{Age: 12.1} \p{Age=V12_1} (1) 403 \p{Age: V12_1} Code point's usage was introduced in 404 version 12.1; See also Property 405 'Present_In' (1: U+32FF) 406 \p{Age: NA} \p{Age=Unassigned} (836_536 plus all 407 above-Unicode code points) 408 \p{Age: Unassigned} Code point's usage has not been assigned 409 in any Unicode release thus far. (Short: 410 \p{Age=NA}) (836_536 plus all above- 411 Unicode code points: U+0378..0379, 412 U+0380..0383, U+038B, U+038D, U+03A2, 413 U+0530 ...) 414 \p{Aghb} \p{Caucasian_Albanian} (= 415 \p{Script_Extensions= 416 Caucasian_Albanian}) (NOT \p{Block= 417 Caucasian_Albanian}) (53) 418 \p{AHex} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y}) 419 (22) 420 \p{AHex: *} \p{ASCII_Hex_Digit: *} 421 \p{Ahom} \p{Script_Extensions=Ahom} (NOT \p{Block= 422 Ahom}) (58) 423 X \p{Alchemical} \p{Alchemical_Symbols} (= \p{Block= 424 Alchemical_Symbols}) (128) 425 X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short: 426 \p{InAlchemical}) (128) 427 \p{All} All code points, including those above 428 Unicode. Same as qr/./s (1_114_112 plus 429 all above-Unicode code points: 430 U+0000..infinity) 431 \p{Alnum} \p{XPosixAlnum} (127_886) 432 \p{Alpha} \p{XPosixAlpha} (= \p{Alphabetic=Y}) 433 (127_256) 434 \p{Alpha: *} \p{Alphabetic: *} 435 \p{Alphabetic} \p{XPosixAlpha} (= \p{Alphabetic=Y}) 436 (127_256) 437 \p{Alphabetic: N*} (Short: \p{Alpha=N}, \P{Alpha}) (986_856 438 plus all above-Unicode code points: 439 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 440 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab- 441 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], 442 U+02C2..02C5, U+02D2..02DF, 443 U+02E5..02EB, U+02ED, U+02EF..0344 ...) 444 \p{Alphabetic: Y*} (Short: \p{Alpha=Y}, \p{Alpha}) (127_256: 445 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8- 446 \xf6\xf8-\xff], U+0100..02C1, 447 U+02C6..02D1, U+02E0..02E4, U+02EC, 448 U+02EE ...) 449 X \p{Alphabetic_PF} \p{Alphabetic_Presentation_Forms} (= 450 \p{Block=Alphabetic_Presentation_Forms}) 451 (80) 452 X \p{Alphabetic_Presentation_Forms} \p{Block= 453 Alphabetic_Presentation_Forms} (Short: 454 \p{InAlphabeticPF}) (80) 455 \p{Anatolian_Hieroglyphs} \p{Script_Extensions= 456 Anatolian_Hieroglyphs} (Short: \p{Hluw}; 457 NOT \p{Block=Anatolian_Hieroglyphs}) 458 (583) 459 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (= 460 \p{Block= 461 Ancient_Greek_Musical_Notation}) (80) 462 X \p{Ancient_Greek_Musical_Notation} \p{Block= 463 Ancient_Greek_Musical_Notation} (Short: 464 \p{InAncientGreekMusic}) (80) 465 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80) 466 X \p{Ancient_Symbols} \p{Block=Ancient_Symbols} (64) 467 \p{Any} All Unicode code points (1_114_112: 468 U+0000..10FFFF) 469 \p{Arab} \p{Arabic} (= \p{Script_Extensions= 470 Arabic}) (NOT \p{Block=Arabic}) (1325) 471 \p{Arabic} \p{Script_Extensions=Arabic} (Short: 472 \p{Arab}; NOT \p{Block=Arabic}) (1325) 473 X \p{Arabic_Ext_A} \p{Arabic_Extended_A} (= \p{Block= 474 Arabic_Extended_A}) (96) 475 X \p{Arabic_Extended_A} \p{Block=Arabic_Extended_A} (Short: 476 \p{InArabicExtA}) (96) 477 X \p{Arabic_Math} \p{Arabic_Mathematical_Alphabetic_Symbols} 478 (= \p{Block= 479 Arabic_Mathematical_Alphabetic_Symbols}) 480 (256) 481 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block= 482 Arabic_Mathematical_Alphabetic_Symbols} 483 (Short: \p{InArabicMath}) (256) 484 X \p{Arabic_PF_A} \p{Arabic_Presentation_Forms_A} (= 485 \p{Block=Arabic_Presentation_Forms_A}) 486 (688) 487 X \p{Arabic_PF_B} \p{Arabic_Presentation_Forms_B} (= 488 \p{Block=Arabic_Presentation_Forms_B}) 489 (144) 490 X \p{Arabic_Presentation_Forms_A} \p{Block= 491 Arabic_Presentation_Forms_A} (Short: 492 \p{InArabicPFA}) (688) 493 X \p{Arabic_Presentation_Forms_B} \p{Block= 494 Arabic_Presentation_Forms_B} (Short: 495 \p{InArabicPFB}) (144) 496 X \p{Arabic_Sup} \p{Arabic_Supplement} (= \p{Block= 497 Arabic_Supplement}) (48) 498 X \p{Arabic_Supplement} \p{Block=Arabic_Supplement} (Short: 499 \p{InArabicSup}) (48) 500 \p{Armenian} \p{Script_Extensions=Armenian} (Short: 501 \p{Armn}; NOT \p{Block=Armenian}) (96) 502 \p{Armi} \p{Imperial_Aramaic} (= 503 \p{Script_Extensions=Imperial_Aramaic}) 504 (NOT \p{Block=Imperial_Aramaic}) (31) 505 \p{Armn} \p{Armenian} (= \p{Script_Extensions= 506 Armenian}) (NOT \p{Block=Armenian}) (96) 507 X \p{Arrows} \p{Block=Arrows} (112) 508 \p{ASCII} \p{Block=Basic_Latin} (128) 509 \p{ASCII_Hex_Digit} \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y}) 510 (22) 511 \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090 512 plus all above-Unicode code points: 513 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<= 514 >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff], 515 U+0100..infinity) 516 \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A- 517 Fa-f]) 518 \p{Assigned} All assigned code points (277_510: 519 U+0000..0377, U+037A..037F, 520 U+0384..038A, U+038C, U+038E..03A1, 521 U+03A3..052F ...) 522 \p{Avestan} \p{Script_Extensions=Avestan} (Short: 523 \p{Avst}; NOT \p{Block=Avestan}) (61) 524 \p{Avst} \p{Avestan} (= \p{Script_Extensions= 525 Avestan}) (NOT \p{Block=Avestan}) (61) 526 \p{Bali} \p{Balinese} (= \p{Script_Extensions= 527 Balinese}) (NOT \p{Block=Balinese}) (121) 528 \p{Balinese} \p{Script_Extensions=Balinese} (Short: 529 \p{Bali}; NOT \p{Block=Balinese}) (121) 530 \p{Bamu} \p{Bamum} (= \p{Script_Extensions=Bamum}) 531 (NOT \p{Block=Bamum}) (657) 532 \p{Bamum} \p{Script_Extensions=Bamum} (Short: 533 \p{Bamu}; NOT \p{Block=Bamum}) (657) 534 X \p{Bamum_Sup} \p{Bamum_Supplement} (= \p{Block= 535 Bamum_Supplement}) (576) 536 X \p{Bamum_Supplement} \p{Block=Bamum_Supplement} (Short: 537 \p{InBamumSup}) (576) 538 X \p{Basic_Latin} \p{ASCII} (= \p{Block=Basic_Latin}) (128) 539 \p{Bass} \p{Bassa_Vah} (= \p{Script_Extensions= 540 Bassa_Vah}) (NOT \p{Block=Bassa_Vah}) 541 (36) 542 \p{Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short: 543 \p{Bass}; NOT \p{Block=Bassa_Vah}) (36) 544 \p{Batak} \p{Script_Extensions=Batak} (Short: 545 \p{Batk}; NOT \p{Block=Batak}) (56) 546 \p{Batk} \p{Batak} (= \p{Script_Extensions=Batak}) 547 (NOT \p{Block=Batak}) (56) 548 \p{Bc: *} \p{Bidi_Class: *} 549 \p{Beng} \p{Bengali} (= \p{Script_Extensions= 550 Bengali}) (NOT \p{Block=Bengali}) (113) 551 \p{Bengali} \p{Script_Extensions=Bengali} (Short: 552 \p{Beng}; NOT \p{Block=Bengali}) (113) 553 \p{Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short: 554 \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97) 555 \p{Bhks} \p{Bhaiksuki} (= \p{Script_Extensions= 556 Bhaiksuki}) (NOT \p{Block=Bhaiksuki}) 557 (97) 558 \p{Bidi_C} \p{Bidi_Control} (= \p{Bidi_Control=Y}) 559 (12) 560 \p{Bidi_C: *} \p{Bidi_Control: *} 561 \p{Bidi_Class: AL} \p{Bidi_Class=Arabic_Letter} (1698) 562 \p{Bidi_Class: AN} \p{Bidi_Class=Arabic_Number} (61) 563 \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1698: U+0608, 564 U+060B, U+060D, U+061B..064A, 565 U+066D..066F, U+0671..06D5 ...) 566 \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (61: 567 U+0600..0605, U+0660..0669, 568 U+066B..066C, U+06DD, U+08E2, 569 U+10D30..10D39 ...) 570 \p{Bidi_Class: B} \p{Bidi_Class=Paragraph_Separator} (7) 571 \p{Bidi_Class: BN} \p{Bidi_Class=Boundary_Neutral} (4016) 572 \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016: 573 [^\t\n\cK\f\r\x1c-\x7e\x85\xa0-\xac\xae- 574 \xff], U+180E, U+200B..200D, 575 U+2060..2065, U+206A..206F, U+FDD0..FDEF 576 ...) 577 \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15: 578 [,.\/:\xa0], U+060C, U+202F, U+2044, 579 U+FE50, U+FE52 ...) 580 \p{Bidi_Class: CS} \p{Bidi_Class=Common_Separator} (15) 581 \p{Bidi_Class: EN} \p{Bidi_Class=European_Number} (158) 582 \p{Bidi_Class: ES} \p{Bidi_Class=European_Separator} (12) 583 \p{Bidi_Class: ET} \p{Bidi_Class=European_Terminator} (92) 584 \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158: [0-9\xb2- 585 \xb3\xb9], U+06F0..06F9, U+2070, 586 U+2074..2079, U+2080..2089, U+2488..249B 587 ...) 588 \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-], 589 U+207A..207B, U+208A..208B, U+2212, 590 U+FB29, U+FE62..FE63 ...) 591 \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92: 592 [#\$\%\xa2-\xa5\xb0-\xb1], U+058F, 593 U+0609..060A, U+066A, U+09F2..09F3, 594 U+09FB ...) 595 \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1: 596 U+2068) 597 \p{Bidi_Class: FSI} \p{Bidi_Class=First_Strong_Isolate} (1) 598 \p{Bidi_Class: L} \p{Bidi_Class=Left_To_Right} (1_096_767 599 plus all above-Unicode code points) 600 \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_767 plus 601 all above-Unicode code points: [A-Za- 602 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8- 603 \xff], U+0100..02B8, U+02BB..02C1, 604 U+02D0..02D1, U+02E0..02E4, U+02EE ...) 605 \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1: 606 U+202A) 607 \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1: 608 U+2066) 609 \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1: 610 U+202D) 611 \p{Bidi_Class: LRE} \p{Bidi_Class=Left_To_Right_Embedding} (1) 612 \p{Bidi_Class: LRI} \p{Bidi_Class=Left_To_Right_Isolate} (1) 613 \p{Bidi_Class: LRO} \p{Bidi_Class=Left_To_Right_Override} (1) 614 \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1834: 615 U+0300..036F, U+0483..0489, 616 U+0591..05BD, U+05BF, U+05C1..05C2, 617 U+05C4..05C5 ...) 618 \p{Bidi_Class: NSM} \p{Bidi_Class=Nonspacing_Mark} (1834) 619 \p{Bidi_Class: ON} \p{Bidi_Class=Other_Neutral} (5658) 620 \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5658: 621 [!\"&\'\(\)*;<= 622 >?\@\[\\\]\^_`\{\|\}~\xa1\xa6-\xa9\xab- 623 \xac\xae-\xaf\xb4\xb6-\xb8\xbb- 624 \xbf\xd7\xf7], U+02B9..02BA, 625 U+02C2..02CF, U+02D2..02DF, 626 U+02E5..02ED, U+02EF..02FF ...) 627 \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7: 628 [\n\r\x1c-\x1e\x85], U+2029) 629 \p{Bidi_Class: PDF} \p{Bidi_Class=Pop_Directional_Format} (1) 630 \p{Bidi_Class: PDI} \p{Bidi_Class=Pop_Directional_Isolate} (1) 631 \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1: 632 U+202C) 633 \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1: 634 U+2069) 635 \p{Bidi_Class: R} \p{Bidi_Class=Right_To_Left} (3765) 636 \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3765: U+0590, 637 U+05BE, U+05C0, U+05C3, U+05C6, 638 U+05C8..05FF ...) 639 \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1: 640 U+202B) 641 \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1: 642 U+2067) 643 \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1: 644 U+202E) 645 \p{Bidi_Class: RLE} \p{Bidi_Class=Right_To_Left_Embedding} (1) 646 \p{Bidi_Class: RLI} \p{Bidi_Class=Right_To_Left_Isolate} (1) 647 \p{Bidi_Class: RLO} \p{Bidi_Class=Right_To_Left_Override} (1) 648 \p{Bidi_Class: S} \p{Bidi_Class=Segment_Separator} (3) 649 \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3: 650 [\t\cK\x1f]) 651 \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20], 652 U+1680, U+2000..200A, U+2028, U+205F, 653 U+3000) 654 \p{Bidi_Class: WS} \p{Bidi_Class=White_Space} (17) 655 \p{Bidi_Control} \p{Bidi_Control=Y} (Short: \p{BidiC}) (12) 656 \p{Bidi_Control: N*} (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100 657 plus all above-Unicode code points: 658 U+0000..061B, U+061D..200D, 659 U+2010..2029, U+202F..2065, 660 U+206A..infinity) 661 \p{Bidi_Control: Y*} (Short: \p{BidiC=Y}, \p{BidiC}) (12: 662 U+061C, U+200E..200F, U+202A..202E, 663 U+2066..2069) 664 \p{Bidi_M} \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y}) 665 (545) 666 \p{Bidi_M: *} \p{Bidi_Mirrored: *} 667 \p{Bidi_Mirrored} \p{Bidi_Mirrored=Y} (Short: \p{BidiM}) 668 (545) 669 \p{Bidi_Mirrored: N*} (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567 670 plus all above-Unicode code points: 671 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A- 672 Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc- 673 \xff], U+0100..0F39, U+0F3E..169A, 674 U+169D..2038, U+203B..2044, U+2047..207C 675 ...) 676 \p{Bidi_Mirrored: Y*} (Short: \p{BidiM=Y}, \p{BidiM}) (545: 677 [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D, 678 U+169B..169C, U+2039..203A, 679 U+2045..2046, U+207D..207E ...) 680 \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close} 681 (60) 682 \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60: 683 [\)\]\}], U+0F3B, U+0F3D, U+169C, 684 U+2046, U+207E ...) 685 \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None} 686 (1_113_992 plus all above-Unicode code 687 points) 688 \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992 689 plus all above-Unicode code points: 690 [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<= 691 >?\@A-Z\\\^_`a-z\|~\x7f-\xff], 692 U+0100..0F39, U+0F3E..169A, 693 U+169D..2044, U+2047..207C, U+207F..208C 694 ...) 695 \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open} 696 (60) 697 \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60: 698 [\(\[\{], U+0F3A, U+0F3C, U+169B, 699 U+2045, U+207D ...) 700 \p{Blank} \p{XPosixBlank} (18) 701 \p{Blk: *} \p{Block: *} 702 \p{Block: Adlam} (NOT \p{Adlam} NOR \p{Is_Adlam}) (96: 703 U+1E900..1E95F) 704 \p{Block: Aegean_Numbers} (64: U+10100..1013F) 705 \p{Block: Ahom} (NOT \p{Ahom} NOR \p{Is_Ahom}) (64: 706 U+11700..1173F) 707 \p{Block: Alchemical} \p{Block=Alchemical_Symbols} (128) 708 \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128: 709 U+1F700..1F77F) 710 \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms} 711 (80) 712 \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk= 713 AlphabeticPF}) (80: U+FB00..FB4F) 714 \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs} 715 NOR \p{Is_Anatolian_Hieroglyphs}) (640: 716 U+14400..1467F) 717 \p{Block: Ancient_Greek_Music} \p{Block= 718 Ancient_Greek_Musical_Notation} (80) 719 \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk= 720 AncientGreekMusic}) (80: U+1D200..1D24F) 721 \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F) 722 \p{Block: Ancient_Symbols} (64: U+10190..101CF) 723 \p{Block: Arabic} (NOT \p{Arabic} NOR \p{Is_Arabic}) (256: 724 U+0600..06FF) 725 \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96) 726 \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96: 727 U+08A0..08FF) 728 \p{Block: Arabic_Math} \p{Block= 729 Arabic_Mathematical_Alphabetic_Symbols} 730 (256) 731 \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk= 732 ArabicMath}) (256: U+1EE00..1EEFF) 733 \p{Block: Arabic_PF_A} \p{Block=Arabic_Presentation_Forms_A} (688) 734 \p{Block: Arabic_PF_B} \p{Block=Arabic_Presentation_Forms_B} (144) 735 \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA}) 736 (688: U+FB50..FDFF) 737 \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB}) 738 (144: U+FE70..FEFF) 739 \p{Block: Arabic_Sup} \p{Block=Arabic_Supplement} (48) 740 \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48: 741 U+0750..077F) 742 \p{Block: Armenian} (NOT \p{Armenian} NOR \p{Is_Armenian}) 743 (96: U+0530..058F) 744 \p{Block: Arrows} (112: U+2190..21FF) 745 \p{Block: ASCII} \p{Block=Basic_Latin} (128) 746 \p{Block: Avestan} (NOT \p{Avestan} NOR \p{Is_Avestan}) (64: 747 U+10B00..10B3F) 748 \p{Block: Balinese} (NOT \p{Balinese} NOR \p{Is_Balinese}) 749 (128: U+1B00..1B7F) 750 \p{Block: Bamum} (NOT \p{Bamum} NOR \p{Is_Bamum}) (96: 751 U+A6A0..A6FF) 752 \p{Block: Bamum_Sup} \p{Block=Bamum_Supplement} (576) 753 \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576: 754 U+16800..16A3F) 755 \p{Block: Basic_Latin} (Short: \p{Blk=ASCII}) (128: [\x00-\x7f]) 756 \p{Block: Bassa_Vah} (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah}) 757 (48: U+16AD0..16AFF) 758 \p{Block: Batak} (NOT \p{Batak} NOR \p{Is_Batak}) (64: 759 U+1BC0..1BFF) 760 \p{Block: Bengali} (NOT \p{Bengali} NOR \p{Is_Bengali}) (128: 761 U+0980..09FF) 762 \p{Block: Bhaiksuki} (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki}) 763 (112: U+11C00..11C6F) 764 \p{Block: Block_Elements} (32: U+2580..259F) 765 \p{Block: Bopomofo} (NOT \p{Bopomofo} NOR \p{Is_Bopomofo}) 766 (48: U+3100..312F) 767 \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32) 768 \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32: 769 U+31A0..31BF) 770 \p{Block: Box_Drawing} (128: U+2500..257F) 771 \p{Block: Brahmi} (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128: 772 U+11000..1107F) 773 \p{Block: Braille} \p{Block=Braille_Patterns} (256) 774 \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256: 775 U+2800..28FF) 776 \p{Block: Buginese} (NOT \p{Buginese} NOR \p{Is_Buginese}) 777 (32: U+1A00..1A1F) 778 \p{Block: Buhid} (NOT \p{Buhid} NOR \p{Is_Buhid}) (32: 779 U+1740..175F) 780 \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols} 781 (256) 782 \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk= 783 ByzantineMusic}) (256: U+1D000..1D0FF) 784 \p{Block: Canadian_Syllabics} \p{Block= 785 Unified_Canadian_Aboriginal_Syllabics} 786 (640) 787 \p{Block: Carian} (NOT \p{Carian} NOR \p{Is_Carian}) (64: 788 U+102A0..102DF) 789 \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR 790 \p{Is_Caucasian_Albanian}) (64: 791 U+10530..1056F) 792 \p{Block: Chakma} (NOT \p{Chakma} NOR \p{Is_Chakma}) (80: 793 U+11100..1114F) 794 \p{Block: Cham} (NOT \p{Cham} NOR \p{Is_Cham}) (96: 795 U+AA00..AA5F) 796 \p{Block: Cherokee} (NOT \p{Cherokee} NOR \p{Is_Cherokee}) 797 (96: U+13A0..13FF) 798 \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80) 799 \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80: 800 U+AB70..ABBF) 801 \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F) 802 \p{Block: CJK} \p{Block=CJK_Unified_Ideographs} (20_992) 803 \p{Block: CJK_Compat} \p{Block=CJK_Compatibility} (256) 804 \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32) 805 \p{Block: CJK_Compat_Ideographs} \p{Block= 806 CJK_Compatibility_Ideographs} (512) 807 \p{Block: CJK_Compat_Ideographs_Sup} \p{Block= 808 CJK_Compatibility_Ideographs_Supplement} 809 (544) 810 \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256: 811 U+3300..33FF) 812 \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms}) 813 (32: U+FE30..FE4F) 814 \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk= 815 CJKCompatIdeographs}) (512: U+F900..FAFF) 816 \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk= 817 CJKCompatIdeographsSup}) (544: 818 U+2F800..2FA1F) 819 \p{Block: CJK_Ext_A} \p{Block= 820 CJK_Unified_Ideographs_Extension_A} 821 (6592) 822 \p{Block: CJK_Ext_B} \p{Block= 823 CJK_Unified_Ideographs_Extension_B} 824 (42_720) 825 \p{Block: CJK_Ext_C} \p{Block= 826 CJK_Unified_Ideographs_Extension_C} 827 (4160) 828 \p{Block: CJK_Ext_D} \p{Block= 829 CJK_Unified_Ideographs_Extension_D} (224) 830 \p{Block: CJK_Ext_E} \p{Block= 831 CJK_Unified_Ideographs_Extension_E} 832 (5776) 833 \p{Block: CJK_Ext_F} \p{Block= 834 CJK_Unified_Ideographs_Extension_F} 835 (7488) 836 \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128) 837 \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup}) 838 (128: U+2E80..2EFF) 839 \p{Block: CJK_Strokes} (48: U+31C0..31EF) 840 \p{Block: CJK_Symbols} \p{Block=CJK_Symbols_And_Punctuation} (64) 841 \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols}) 842 (64: U+3000..303F) 843 \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992: 844 U+4E00..9FFF) 845 \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk= 846 CJKExtA}) (6592: U+3400..4DBF) 847 \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk= 848 CJKExtB}) (42_720: U+20000..2A6DF) 849 \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk= 850 CJKExtC}) (4160: U+2A700..2B73F) 851 \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk= 852 CJKExtD}) (224: U+2B740..2B81F) 853 \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk= 854 CJKExtE}) (5776: U+2B820..2CEAF) 855 \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk= 856 CJKExtF}) (7488: U+2CEB0..2EBEF) 857 \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk= 858 Diacriticals}) (112: U+0300..036F) 859 \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk= 860 DiacriticalsExt}) (80: U+1AB0..1AFF) 861 \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk= 862 DiacriticalsForSymbols}) (48: 863 U+20D0..20FF) 864 \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk= 865 DiacriticalsSup}) (64: U+1DC0..1DFF) 866 \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16: 867 U+FE20..FE2F) 868 \p{Block: Combining_Marks_For_Symbols} \p{Block= 869 Combining_Diacritical_Marks_For_Symbols} 870 (48) 871 \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk= 872 IndicNumberForms}) (16: U+A830..A83F) 873 \p{Block: Compat_Jamo} \p{Block=Hangul_Compatibility_Jamo} (96) 874 \p{Block: Control_Pictures} (64: U+2400..243F) 875 \p{Block: Coptic} (NOT \p{Coptic} NOR \p{Is_Coptic}) (128: 876 U+2C80..2CFF) 877 \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF) 878 \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32) 879 \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32: 880 U+1D360..1D37F) 881 \p{Block: Cuneiform} (NOT \p{Cuneiform} NOR \p{Is_Cuneiform}) 882 (1024: U+12000..123FF) 883 \p{Block: Cuneiform_Numbers} \p{Block= 884 Cuneiform_Numbers_And_Punctuation} (128) 885 \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk= 886 CuneiformNumbers}) (128: U+12400..1247F) 887 \p{Block: Currency_Symbols} (48: U+20A0..20CF) 888 \p{Block: Cypriot_Syllabary} (64: U+10800..1083F) 889 \p{Block: Cyrillic} (NOT \p{Cyrillic} NOR \p{Is_Cyrillic}) 890 (256: U+0400..04FF) 891 \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32) 892 \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96) 893 \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16) 894 \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32: 895 U+2DE0..2DFF) 896 \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96: 897 U+A640..A69F) 898 \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16: 899 U+1C80..1C8F) 900 \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48) 901 \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48: 902 U+0500..052F) 903 \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement} 904 (48) 905 \p{Block: Deseret} (80: U+10400..1044F) 906 \p{Block: Devanagari} (NOT \p{Devanagari} NOR \p{Is_Devanagari}) 907 (128: U+0900..097F) 908 \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32) 909 \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32: 910 U+A8E0..A8FF) 911 \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112) 912 \p{Block: Diacriticals_Ext} \p{Block= 913 Combining_Diacritical_Marks_Extended} 914 (80) 915 \p{Block: Diacriticals_For_Symbols} \p{Block= 916 Combining_Diacritical_Marks_For_Symbols} 917 (48) 918 \p{Block: Diacriticals_Sup} \p{Block= 919 Combining_Diacritical_Marks_Supplement} 920 (64) 921 \p{Block: Dingbats} (192: U+2700..27BF) 922 \p{Block: Dogra} (NOT \p{Dogra} NOR \p{Is_Dogra}) (80: 923 U+11800..1184F) 924 \p{Block: Domino} \p{Block=Domino_Tiles} (112) 925 \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112: 926 U+1F030..1F09F) 927 \p{Block: Duployan} (NOT \p{Duployan} NOR \p{Is_Duployan}) 928 (160: U+1BC00..1BC9F) 929 \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F) 930 \p{Block: Egyptian_Hieroglyph_Format_Controls} (16: U+13430..1343F) 931 \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR 932 \p{Is_Egyptian_Hieroglyphs}) (1072: 933 U+13000..1342F) 934 \p{Block: Elbasan} (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48: 935 U+10500..1052F) 936 \p{Block: Elymaic} (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32: 937 U+10FE0..10FFF) 938 \p{Block: Emoticons} (80: U+1F600..1F64F) 939 \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160) 940 \p{Block: Enclosed_Alphanum_Sup} \p{Block= 941 Enclosed_Alphanumeric_Supplement} (256) 942 \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk= 943 EnclosedAlphanumSup}) (256: 944 U+1F100..1F1FF) 945 \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk= 946 EnclosedAlphanum}) (160: U+2460..24FF) 947 \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months} 948 (256) 949 \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk= 950 EnclosedCJK}) (256: U+3200..32FF) 951 \p{Block: Enclosed_Ideographic_Sup} \p{Block= 952 Enclosed_Ideographic_Supplement} (256) 953 \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk= 954 EnclosedIdeographicSup}) (256: 955 U+1F200..1F2FF) 956 \p{Block: Ethiopic} (NOT \p{Ethiopic} NOR \p{Is_Ethiopic}) 957 (384: U+1200..137F) 958 \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96) 959 \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48) 960 \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96: 961 U+2D80..2DDF) 962 \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48: 963 U+AB00..AB2F) 964 \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32) 965 \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32: 966 U+1380..139F) 967 \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT 968 \p{Punct} NOR \p{Is_Punctuation}) (112: 969 U+2000..206F) 970 \p{Block: Geometric_Shapes} (96: U+25A0..25FF) 971 \p{Block: Geometric_Shapes_Ext} \p{Block= 972 Geometric_Shapes_Extended} (128) 973 \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk= 974 GeometricShapesExt}) (128: 975 U+1F780..1F7FF) 976 \p{Block: Georgian} (NOT \p{Georgian} NOR \p{Is_Georgian}) 977 (96: U+10A0..10FF) 978 \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48) 979 \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48: 980 U+1C90..1CBF) 981 \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48) 982 \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48: 983 U+2D00..2D2F) 984 \p{Block: Glagolitic} (NOT \p{Glagolitic} NOR \p{Is_Glagolitic}) 985 (96: U+2C00..2C5F) 986 \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48) 987 \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup}) 988 (48: U+1E000..1E02F) 989 \p{Block: Gothic} (NOT \p{Gothic} NOR \p{Is_Gothic}) (32: 990 U+10330..1034F) 991 \p{Block: Grantha} (NOT \p{Grantha} NOR \p{Is_Grantha}) (128: 992 U+11300..1137F) 993 \p{Block: Greek} \p{Block=Greek_And_Coptic} (NOT \p{Greek} 994 NOR \p{Is_Greek}) (144) 995 \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek} 996 NOR \p{Is_Greek}) (144: U+0370..03FF) 997 \p{Block: Greek_Ext} \p{Block=Greek_Extended} (256) 998 \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256: 999 U+1F00..1FFF) 1000 \p{Block: Gujarati} (NOT \p{Gujarati} NOR \p{Is_Gujarati}) 1001 (128: U+0A80..0AFF) 1002 \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR 1003 \p{Is_Gunjala_Gondi}) (80: 1004 U+11D60..11DAF) 1005 \p{Block: Gurmukhi} (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi}) 1006 (128: U+0A00..0A7F) 1007 \p{Block: Half_And_Full_Forms} \p{Block= 1008 Halfwidth_And_Fullwidth_Forms} (240) 1009 \p{Block: Half_Marks} \p{Block=Combining_Half_Marks} (16) 1010 \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk= 1011 HalfAndFullForms}) (240: U+FF00..FFEF) 1012 \p{Block: Hangul} \p{Block=Hangul_Syllables} (NOT \p{Hangul} 1013 NOR \p{Is_Hangul}) (11_184) 1014 \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo}) 1015 (96: U+3130..318F) 1016 \p{Block: Hangul_Jamo} (Short: \p{Blk=Jamo}) (256: U+1100..11FF) 1017 \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32: 1018 U+A960..A97F) 1019 \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80: 1020 U+D7B0..D7FF) 1021 \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul} 1022 NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF) 1023 \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR 1024 \p{Is_Hanifi_Rohingya}) (64: 1025 U+10D00..10D3F) 1026 \p{Block: Hanunoo} (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32: 1027 U+1720..173F) 1028 \p{Block: Hatran} (NOT \p{Hatran} NOR \p{Is_Hatran}) (32: 1029 U+108E0..108FF) 1030 \p{Block: Hebrew} (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112: 1031 U+0590..05FF) 1032 \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk= 1033 HighPUSurrogates}) (128: U+DB80..DBFF) 1034 \p{Block: High_PU_Surrogates} \p{Block= 1035 High_Private_Use_Surrogates} (128) 1036 \p{Block: High_Surrogates} (896: U+D800..DB7F) 1037 \p{Block: Hiragana} (NOT \p{Hiragana} NOR \p{Is_Hiragana}) 1038 (96: U+3040..309F) 1039 \p{Block: IDC} \p{Block= 1040 Ideographic_Description_Characters} (NOT 1041 \p{ID_Continue} NOR \p{Is_IDC}) (16) 1042 \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC}; 1043 NOT \p{ID_Continue} NOR \p{Is_IDC}) (16: 1044 U+2FF0..2FFF) 1045 \p{Block: Ideographic_Symbols} \p{Block= 1046 Ideographic_Symbols_And_Punctuation} (32) 1047 \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk= 1048 IdeographicSymbols}) (32: U+16FE0..16FFF) 1049 \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR 1050 \p{Is_Imperial_Aramaic}) (32: 1051 U+10840..1085F) 1052 \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms} 1053 (16) 1054 \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF) 1055 \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi} 1056 NOR \p{Is_Inscriptional_Pahlavi}) (32: 1057 U+10B60..10B7F) 1058 \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian} 1059 NOR \p{Is_Inscriptional_Parthian}) (32: 1060 U+10B40..10B5F) 1061 \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96) 1062 \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96: 1063 U+0250..02AF) 1064 \p{Block: Jamo} \p{Block=Hangul_Jamo} (256) 1065 \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32) 1066 \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80) 1067 \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese}) 1068 (96: U+A980..A9DF) 1069 \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80: 1070 U+11080..110CF) 1071 \p{Block: Kana_Ext_A} \p{Block=Kana_Extended_A} (48) 1072 \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48: 1073 U+1B100..1B12F) 1074 \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256) 1075 \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256: 1076 U+1B000..1B0FF) 1077 \p{Block: Kanbun} (16: U+3190..319F) 1078 \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224) 1079 \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224: 1080 U+2F00..2FDF) 1081 \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128: 1082 U+0C80..0CFF) 1083 \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana}) 1084 (96: U+30A0..30FF) 1085 \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16) 1086 \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk= 1087 KatakanaExt}) (16: U+31F0..31FF) 1088 \p{Block: Kayah_Li} (48: U+A900..A92F) 1089 \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi}) 1090 (96: U+10A00..10A5F) 1091 \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128: 1092 U+1780..17FF) 1093 \p{Block: Khmer_Symbols} (32: U+19E0..19FF) 1094 \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80: 1095 U+11200..1124F) 1096 \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi}) 1097 (80: U+112B0..112FF) 1098 \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128: 1099 U+0E80..0EFF) 1100 \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128) 1101 \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128) 1102 \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80- 1103 \xff]) 1104 \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128) 1105 \p{Block: Latin_Ext_Additional} \p{Block= 1106 Latin_Extended_Additional} (256) 1107 \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208) 1108 \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32) 1109 \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224) 1110 \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64) 1111 \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128: 1112 U+0100..017F) 1113 \p{Block: Latin_Extended_Additional} (Short: \p{Blk= 1114 LatinExtAdditional}) (256: U+1E00..1EFF) 1115 \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208: 1116 U+0180..024F) 1117 \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32: 1118 U+2C60..2C7F) 1119 \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224: 1120 U+A720..A7FF) 1121 \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64: 1122 U+AB30..AB6F) 1123 \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80: 1124 U+1C00..1C4F) 1125 \p{Block: Letterlike_Symbols} (80: U+2100..214F) 1126 \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80: 1127 U+1900..194F) 1128 \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A}) 1129 (384: U+10600..1077F) 1130 \p{Block: Linear_B_Ideograms} (128: U+10080..100FF) 1131 \p{Block: Linear_B_Syllabary} (128: U+10000..1007F) 1132 \p{Block: Lisu} (48: U+A4D0..A4FF) 1133 \p{Block: Low_Surrogates} (1024: U+DC00..DFFF) 1134 \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32: 1135 U+10280..1029F) 1136 \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32: 1137 U+10920..1093F) 1138 \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani}) 1139 (48: U+11150..1117F) 1140 \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48) 1141 \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48: 1142 U+1F000..1F02F) 1143 \p{Block: Makasar} (NOT \p{Makasar} NOR \p{Is_Makasar}) (32: 1144 U+11EE0..11EFF) 1145 \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam}) 1146 (128: U+0D00..0D7F) 1147 \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32: 1148 U+0840..085F) 1149 \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean}) 1150 (64: U+10AC0..10AFF) 1151 \p{Block: Marchen} (NOT \p{Marchen} NOR \p{Is_Marchen}) (80: 1152 U+11C70..11CBF) 1153 \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR 1154 \p{Is_Masaram_Gondi}) (96: 1155 U+11D00..11D5F) 1156 \p{Block: Math_Alphanum} \p{Block= 1157 Mathematical_Alphanumeric_Symbols} (1024) 1158 \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256) 1159 \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk= 1160 MathAlphanum}) (1024: U+1D400..1D7FF) 1161 \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators}) 1162 (256: U+2200..22FF) 1163 \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF) 1164 \p{Block: Medefaidrin} (NOT \p{Medefaidrin} NOR 1165 \p{Is_Medefaidrin}) (96: U+16E40..16E9F) 1166 \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR 1167 \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF) 1168 \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32) 1169 \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt}) 1170 (32: U+AAE0..AAFF) 1171 \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR 1172 \p{Is_Mende_Kikakui}) (224: 1173 U+1E800..1E8DF) 1174 \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR 1175 \p{Is_Meroitic_Cursive}) (96: 1176 U+109A0..109FF) 1177 \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F) 1178 \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160: 1179 U+16F00..16F9F) 1180 \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows} 1181 (256) 1182 \p{Block: Misc_Math_Symbols_A} \p{Block= 1183 Miscellaneous_Mathematical_Symbols_A} 1184 (48) 1185 \p{Block: Misc_Math_Symbols_B} \p{Block= 1186 Miscellaneous_Mathematical_Symbols_B} 1187 (128) 1188 \p{Block: Misc_Pictographs} \p{Block= 1189 Miscellaneous_Symbols_And_Pictographs} 1190 (768) 1191 \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256) 1192 \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256) 1193 \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk= 1194 MiscMathSymbolsA}) (48: U+27C0..27EF) 1195 \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk= 1196 MiscMathSymbolsB}) (128: U+2980..29FF) 1197 \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols}) 1198 (256: U+2600..26FF) 1199 \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk= 1200 MiscArrows}) (256: U+2B00..2BFF) 1201 \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk= 1202 MiscPictographs}) (768: U+1F300..1F5FF) 1203 \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical}) 1204 (256: U+2300..23FF) 1205 \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96: 1206 U+11600..1165F) 1207 \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80) 1208 \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F) 1209 \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian}) 1210 (176: U+1800..18AF) 1211 \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32) 1212 \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32: 1213 U+11660..1167F) 1214 \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48: 1215 U+16A40..16A6F) 1216 \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48: 1217 U+11280..112AF) 1218 \p{Block: Music} \p{Block=Musical_Symbols} (256) 1219 \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256: 1220 U+1D100..1D1FF) 1221 \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160: 1222 U+1000..109F) 1223 \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32) 1224 \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32) 1225 \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32: 1226 U+AA60..AA7F) 1227 \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32: 1228 U+A9E0..A9FF) 1229 \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean}) 1230 (48: U+10880..108AF) 1231 \p{Block: Nandinagari} (NOT \p{Nandinagari} NOR 1232 \p{Is_Nandinagari}) (96: U+119A0..119FF) 1233 \p{Block: NB} \p{Block=No_Block} (832_720 plus all 1234 above-Unicode code points) 1235 \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR 1236 \p{Is_New_Tai_Lue}) (96: U+1980..19DF) 1237 \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128: 1238 U+11400..1147F) 1239 \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64: 1240 U+07C0..07FF) 1241 \p{Block: No_Block} (Short: \p{Blk=NB}) (832_720 plus all 1242 above-Unicode code points: U+0870..089F, 1243 U+2FE0..2FEF, U+10200..1027F, 1244 U+103E0..103FF, U+10570..105FF, 1245 U+10780..107FF ...) 1246 \p{Block: Number_Forms} (64: U+2150..218F) 1247 \p{Block: Nushu} (NOT \p{Nushu} NOR \p{Is_Nushu}) (400: 1248 U+1B170..1B2FF) 1249 \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong} 1250 NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80: 1251 U+1E100..1E14F) 1252 \p{Block: OCR} \p{Block=Optical_Character_Recognition} 1253 (32) 1254 \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32: 1255 U+1680..169F) 1256 \p{Block: Ol_Chiki} (48: U+1C50..1C7F) 1257 \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR 1258 \p{Is_Old_Hungarian}) (128: 1259 U+10C80..10CFF) 1260 \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic}) 1261 (48: U+10300..1032F) 1262 \p{Block: Old_North_Arabian} (32: U+10A80..10A9F) 1263 \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic}) 1264 (48: U+10350..1037F) 1265 \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR 1266 \p{Is_Old_Persian}) (64: U+103A0..103DF) 1267 \p{Block: Old_Sogdian} (NOT \p{Old_Sogdian} NOR 1268 \p{Is_Old_Sogdian}) (48: U+10F00..10F2F) 1269 \p{Block: Old_South_Arabian} (32: U+10A60..10A7F) 1270 \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic}) 1271 (80: U+10C00..10C4F) 1272 \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32: 1273 U+2440..245F) 1274 \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128: 1275 U+0B00..0B7F) 1276 \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F) 1277 \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80: 1278 U+104B0..104FF) 1279 \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48: 1280 U+10480..104AF) 1281 \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F) 1282 \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR 1283 \p{Is_Pahawh_Hmong}) (144: 1284 U+16B00..16B8F) 1285 \p{Block: Palmyrene} (32: U+10860..1087F) 1286 \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR 1287 \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF) 1288 \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa}) 1289 (64: U+A840..A87F) 1290 \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48) 1291 \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48: 1292 U+101D0..101FF) 1293 \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician}) 1294 (32: U+10900..1091F) 1295 \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128) 1296 \p{Block: Phonetic_Ext_Sup} \p{Block= 1297 Phonetic_Extensions_Supplement} (64) 1298 \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128: 1299 U+1D00..1D7F) 1300 \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk= 1301 PhoneticExtSup}) (64: U+1D80..1DBF) 1302 \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF) 1303 \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT 1304 \p{Private_Use} NOR \p{Is_Private_Use}) 1305 (6400) 1306 \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT 1307 \p{Private_Use} NOR \p{Is_Private_Use}) 1308 (6400: U+E000..F8FF) 1309 \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR 1310 \p{Is_Psalter_Pahlavi}) (48: 1311 U+10B80..10BAF) 1312 \p{Block: PUA} \p{Block=Private_Use_Area} (NOT 1313 \p{Private_Use} NOR \p{Is_Private_Use}) 1314 (6400) 1315 \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT 1316 \p{Punct} NOR \p{Is_Punctuation}) (112) 1317 \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48: 1318 U+A930..A95F) 1319 \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32) 1320 \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32: 1321 U+10E60..10E7F) 1322 \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96: 1323 U+16A0..16FF) 1324 \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan}) 1325 (64: U+0800..083F) 1326 \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra}) 1327 (96: U+A880..A8DF) 1328 \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96: 1329 U+11180..111DF) 1330 \p{Block: Shavian} (48: U+10450..1047F) 1331 \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF) 1332 \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128: 1333 U+11580..115FF) 1334 \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128: 1335 U+0D80..0DFF) 1336 \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF) 1337 \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32: 1338 U+FE50..FE6F) 1339 \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32) 1340 \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64) 1341 \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64: 1342 U+1B130..1B16F) 1343 \p{Block: Sogdian} (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64: 1344 U+10F30..10F6F) 1345 \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR 1346 \p{Is_Sora_Sompeng}) (48: U+110D0..110FF) 1347 \p{Block: Soyombo} (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96: 1348 U+11A50..11AAF) 1349 \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk= 1350 ModifierLetters}) (80: U+02B0..02FF) 1351 \p{Block: Specials} (16: U+FFF0..FFFF) 1352 \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese}) 1353 (64: U+1B80..1BBF) 1354 \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16) 1355 \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16: 1356 U+1CC0..1CCF) 1357 \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16) 1358 \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128) 1359 \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256) 1360 \p{Block: Sup_Math_Operators} \p{Block= 1361 Supplemental_Mathematical_Operators} 1362 (256) 1363 \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A} 1364 (65_536) 1365 \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B} 1366 (65_536) 1367 \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128) 1368 \p{Block: Sup_Symbols_And_Pictographs} \p{Block= 1369 Supplemental_Symbols_And_Pictographs} 1370 (256) 1371 \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48) 1372 \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk= 1373 SuperAndSub}) (48: U+2070..209F) 1374 \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16: 1375 U+27F0..27FF) 1376 \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128: 1377 U+2900..297F) 1378 \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256: 1379 U+1F800..1F8FF) 1380 \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk= 1381 SupMathOperators}) (256: U+2A00..2AFF) 1382 \p{Block: Supplemental_Punctuation} (Short: \p{Blk= 1383 SupPunctuation}) (128: U+2E00..2E7F) 1384 \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk= 1385 SupSymbolsAndPictographs}) (256: 1386 U+1F900..1F9FF) 1387 \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk= 1388 SupPUAA}) (65_536: U+F0000..FFFFF) 1389 \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk= 1390 SupPUAB}) (65_536: U+100000..10FFFF) 1391 \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF) 1392 \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR 1393 \p{Is_Syloti_Nagri}) (48: U+A800..A82F) 1394 \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block= 1395 Symbols_And_Pictographs_Extended_A} (144) 1396 \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk= 1397 SymbolsAndPictographsExtA}) (144: 1398 U+1FA70..1FAFF) 1399 \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80: 1400 U+0700..074F) 1401 \p{Block: Syriac_Sup} \p{Block=Syriac_Supplement} (16) 1402 \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16: 1403 U+0860..086F) 1404 \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32: 1405 U+1700..171F) 1406 \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa}) 1407 (32: U+1760..177F) 1408 \p{Block: Tags} (128: U+E0000..E007F) 1409 \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48: 1410 U+1950..197F) 1411 \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham}) 1412 (144: U+1A20..1AAF) 1413 \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet}) 1414 (96: U+AA80..AADF) 1415 \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96) 1416 \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96: 1417 U+1D300..1D35F) 1418 \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80: 1419 U+11680..116CF) 1420 \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128: 1421 U+0B80..0BFF) 1422 \p{Block: Tamil_Sup} \p{Block=Tamil_Supplement} (64) 1423 \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64: 1424 U+11FC0..11FFF) 1425 \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144: 1426 U+17000..187FF) 1427 \p{Block: Tangut_Components} (768: U+18800..18AFF) 1428 \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128: 1429 U+0C00..0C7F) 1430 \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64: 1431 U+0780..07BF) 1432 \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128: 1433 U+0E00..0E7F) 1434 \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256: 1435 U+0F00..0FFF) 1436 \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh}) 1437 (80: U+2D30..2D7F) 1438 \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96: 1439 U+11480..114DF) 1440 \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols} 1441 (128) 1442 \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk= 1443 TransportAndMap}) (128: U+1F680..1F6FF) 1444 \p{Block: UCAS} \p{Block= 1445 Unified_Canadian_Aboriginal_Syllabics} 1446 (640) 1447 \p{Block: UCAS_Ext} \p{Block= 1448 Unified_Canadian_Aboriginal_Syllabics_- 1449 Extended} (80) 1450 \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic}) 1451 (32: U+10380..1039F) 1452 \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk= 1453 UCAS}) (640: U+1400..167F) 1454 \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short: 1455 \p{Blk=UCASExt}) (80: U+18B0..18FF) 1456 \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320: 1457 U+A500..A63F) 1458 \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT 1459 \p{Variation_Selector} NOR \p{Is_VS}) 1460 (16: U+FE00..FE0F) 1461 \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup}) 1462 (240: U+E0100..E01EF) 1463 \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48) 1464 \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48: 1465 U+1CD0..1CFF) 1466 \p{Block: Vertical_Forms} (16: U+FE10..FE1F) 1467 \p{Block: VS} \p{Block=Variation_Selectors} (NOT 1468 \p{Variation_Selector} NOR \p{Is_VS}) 1469 (16) 1470 \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement} 1471 (240) 1472 \p{Block: Wancho} (NOT \p{Wancho} NOR \p{Is_Wancho}) (64: 1473 U+1E2C0..1E2FF) 1474 \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR 1475 \p{Is_Warang_Citi}) (96: U+118A0..118FF) 1476 \p{Block: Yi_Radicals} (64: U+A490..A4CF) 1477 \p{Block: Yi_Syllables} (1168: U+A000..A48F) 1478 \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64) 1479 \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64: 1480 U+4DC0..4DFF) 1481 \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR 1482 \p{Is_Zanabazar_Square}) (80: 1483 U+11A00..11A4F) 1484 X \p{Block_Elements} \p{Block=Block_Elements} (32) 1485 \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions= 1486 Bopomofo}) (NOT \p{Block=Bopomofo}) (112) 1487 \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short: 1488 \p{Bopo}; NOT \p{Block=Bopomofo}) (112) 1489 X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block= 1490 Bopomofo_Extended}) (32) 1491 X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short: 1492 \p{InBopomofoExt}) (32) 1493 X \p{Box_Drawing} \p{Block=Box_Drawing} (128) 1494 \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *} 1495 \p{Brah} \p{Brahmi} (= \p{Script_Extensions= 1496 Brahmi}) (NOT \p{Block=Brahmi}) (109) 1497 \p{Brahmi} \p{Script_Extensions=Brahmi} (Short: 1498 \p{Brah}; NOT \p{Block=Brahmi}) (109) 1499 \p{Brai} \p{Braille} (= \p{Script_Extensions= 1500 Braille}) (256) 1501 \p{Braille} \p{Script_Extensions=Braille} (Short: 1502 \p{Brai}) (256) 1503 X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short: 1504 \p{InBraille}) (256) 1505 \p{Bugi} \p{Buginese} (= \p{Script_Extensions= 1506 Buginese}) (NOT \p{Block=Buginese}) (31) 1507 \p{Buginese} \p{Script_Extensions=Buginese} (Short: 1508 \p{Bugi}; NOT \p{Block=Buginese}) (31) 1509 \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid}) 1510 (NOT \p{Block=Buhid}) (22) 1511 \p{Buhid} \p{Script_Extensions=Buhid} (Short: 1512 \p{Buhd}; NOT \p{Block=Buhid}) (22) 1513 X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block= 1514 Byzantine_Musical_Symbols}) (256) 1515 X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols} 1516 (Short: \p{InByzantineMusic}) (256) 1517 \p{C} \pC \p{Other} (= \p{General_Category=Other}) 1518 (976_344 plus all above-Unicode code 1519 points) 1520 \p{Cakm} \p{Chakma} (= \p{Script_Extensions= 1521 Chakma}) (NOT \p{Block=Chakma}) (90) 1522 \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal} 1523 (Short: \p{Cans}) (710) 1524 X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics} 1525 (= \p{Block= 1526 Unified_Canadian_Aboriginal_Syllabics}) 1527 (640) 1528 T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class= 1529 Not_Reordered} (1_113_250 plus all 1530 above-Unicode code points) 1531 T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class= 1532 Overlay} (32) 1533 T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class= 1534 Nukta} (25) 1535 T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class= 1536 Kana_Voicing} (2) 1537 T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class= 1538 Virama} (58) 1539 T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class= 1540 CCC10} (1) 1541 \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1: 1542 U+05B0) 1543 T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class= 1544 CCC11} (1) 1545 \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1: 1546 U+05B1) 1547 T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class= 1548 CCC12} (1) 1549 \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1: 1550 U+05B2) 1551 T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class= 1552 CCC13} (1) 1553 \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1: 1554 U+05B3) 1555 T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class= 1556 CCC14} (1) 1557 \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1: 1558 U+05B4) 1559 T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class= 1560 CCC15} (1) 1561 \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1: 1562 U+05B5) 1563 T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class= 1564 CCC16} (1) 1565 \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1: 1566 U+05B6) 1567 T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class= 1568 CCC17} (1) 1569 \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1: 1570 U+05B7) 1571 T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class= 1572 CCC18} (2) 1573 \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2: 1574 U+05B8, U+05C7) 1575 T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class= 1576 CCC19} (2) 1577 \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2: 1578 U+05B9..05BA) 1579 T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class= 1580 CCC20} (1) 1581 \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1: 1582 U+05BB) 1583 T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class= 1584 CCC21} (1) 1585 \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1: 1586 U+05BC) 1587 T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class= 1588 CCC22} (1) 1589 \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1: 1590 U+05BD) 1591 T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class= 1592 CCC23} (1) 1593 \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1: 1594 U+05BF) 1595 T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class= 1596 CCC24} (1) 1597 \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1: 1598 U+05C1) 1599 T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class= 1600 CCC25} (1) 1601 \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1: 1602 U+05C2) 1603 T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class= 1604 CCC26} (1) 1605 \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1: 1606 U+FB1E) 1607 T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class= 1608 CCC27} (2) 1609 \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2: 1610 U+064B, U+08F0) 1611 T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class= 1612 CCC28} (2) 1613 \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2: 1614 U+064C, U+08F1) 1615 T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class= 1616 CCC29} (2) 1617 \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2: 1618 U+064D, U+08F2) 1619 T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class= 1620 CCC30} (2) 1621 \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2: 1622 U+0618, U+064E) 1623 T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class= 1624 CCC31} (2) 1625 \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2: 1626 U+0619, U+064F) 1627 T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class= 1628 CCC32} (2) 1629 \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2: 1630 U+061A, U+0650) 1631 T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class= 1632 CCC33} (1) 1633 \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1: 1634 U+0651) 1635 T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class= 1636 CCC34} (1) 1637 \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1: 1638 U+0652) 1639 T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class= 1640 CCC35} (1) 1641 \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1: 1642 U+0670) 1643 T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class= 1644 CCC36} (1) 1645 \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1: 1646 U+0711) 1647 T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class= 1648 CCC84} (1) 1649 \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1: 1650 U+0C55) 1651 T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class= 1652 CCC91} (1) 1653 \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1: 1654 U+0C56) 1655 T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class= 1656 CCC103} (2) 1657 \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2: 1658 U+0E38..0E39) 1659 T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class= 1660 CCC107} (4) 1661 \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4: 1662 U+0E48..0E4B) 1663 T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class= 1664 CCC118} (2) 1665 \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2: 1666 U+0EB8..0EB9) 1667 T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class= 1668 CCC122} (4) 1669 \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4: 1670 U+0EC8..0ECB) 1671 T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class= 1672 CCC129} (1) 1673 \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1: 1674 U+0F71) 1675 T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class= 1676 CCC130} (6) 1677 \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6: 1678 U+0F72, U+0F7A..0F7D, U+0F80) 1679 T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class= 1680 CCC132} (1) 1681 \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1: 1682 U+0F74) 1683 T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class= 1684 CCC133} (0) 1685 \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0) 1686 T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class= 1687 Attached_Below_Left} (0) 1688 T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class= 1689 Attached_Below} (5) 1690 T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class= 1691 Attached_Above} (1) 1692 T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class= 1693 Attached_Above_Right} (9) 1694 T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class= 1695 Below_Left} (1) 1696 T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class= 1697 Below} (163) 1698 T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class= 1699 Below_Right} (4) 1700 T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class= 1701 Left} (2) 1702 T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class= 1703 Right} (1) 1704 T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class= 1705 Above_Left} (5) 1706 T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class= 1707 Above} (482) 1708 T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class= 1709 Above_Right} (5) 1710 T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class= 1711 Double_Below} (4) 1712 T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class= 1713 Double_Above} (5) 1714 T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class= 1715 Iota_Subscript} (1) 1716 \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class= 1717 Above} (482) 1718 \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (482: 1719 U+0300..0314, U+033D..0344, U+0346, 1720 U+034A..034C, U+0350..0352, U+0357 ...) 1721 \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5: 1722 U+05AE, U+18A9, U+1DF7..1DF8, U+302B) 1723 \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5: 1724 U+0315, U+031A, U+0358, U+1DF6, U+302C) 1725 \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class= 1726 Above_Left} (5) 1727 \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class= 1728 Above_Right} (5) 1729 \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class= 1730 Attached_Above} (1) 1731 \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class= 1732 Attached_Above_Right} (9) 1733 \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class= 1734 Attached_Below} (5) 1735 \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class= 1736 Attached_Below_Left} (0) 1737 \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA}) 1738 (1: U+1DCE) 1739 \p{Canonical_Combining_Class: Attached_Above_Right} (Short: 1740 \p{Ccc=ATAR}) (9: U+031B, U+0F39, 1741 U+1D165..1D166, U+1D16E..1D172) 1742 \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB}) 1743 (5: U+0321..0322, U+0327..0328, U+1DD0) 1744 \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc= 1745 ATBL}) (0) 1746 \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class= 1747 Below} (163) 1748 \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (163: 1749 U+0316..0319, U+031C..0320, 1750 U+0323..0326, U+0329..0333, 1751 U+0339..033C, U+0347..0349 ...) 1752 \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1: 1753 U+302A) 1754 \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4: 1755 U+059A, U+05AD, U+1939, U+302D) 1756 \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class= 1757 Below_Left} (1) 1758 \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class= 1759 Below_Right} (4) 1760 \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class= 1761 Double_Above} (5) 1762 \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class= 1763 Double_Below} (4) 1764 \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) 1765 (5: U+035D..035E, U+0360..0361, U+1DCD) 1766 \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) 1767 (4: U+035C, U+035F, U+0362, U+1DFC) 1768 \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS}) 1769 (1: U+0345) 1770 \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class= 1771 Iota_Subscript} (1) 1772 \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) 1773 (2: U+3099..309A) 1774 \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class= 1775 Kana_Voicing} (2) 1776 \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class= 1777 Left} (2) 1778 \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2: 1779 U+302E..302F) 1780 \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class= 1781 Nukta} (25) 1782 \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR}) 1783 (1_113_250 plus all above-Unicode code 1784 points: U+0000..02FF, U+034F, 1785 U+0370..0482, U+0488..0590, U+05BE, 1786 U+05C0 ...) 1787 \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class= 1788 Not_Reordered} (1_113_250 plus all 1789 above-Unicode code points) 1790 \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (25: 1791 U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C, 1792 U+0CBC ...) 1793 \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class= 1794 Overlay} (32) 1795 \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32: 1796 U+0334..0338, U+1CD4, U+1CE2..1CE8, 1797 U+20D2..20D3, U+20D8..20DA, U+20E5..20E6 1798 ...) 1799 \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class= 1800 Right} (1) 1801 \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1: 1802 U+1D16D) 1803 \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (58: 1804 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D, 1805 U+0BCD ...) 1806 \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class= 1807 Virama} (58) 1808 \p{Cans} \p{Canadian_Aboriginal} (= 1809 \p{Script_Extensions= 1810 Canadian_Aboriginal}) (710) 1811 \p{Cari} \p{Carian} (= \p{Script_Extensions= 1812 Carian}) (NOT \p{Block=Carian}) (49) 1813 \p{Carian} \p{Script_Extensions=Carian} (Short: 1814 \p{Cari}; NOT \p{Block=Carian}) (49) 1815 \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2396) 1816 \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_716 plus 1817 all above-Unicode code points: [\x00- 1818 \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A- 1819 Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9- 1820 \xac\xae\xb0-\xb3\xb5-\xb6\xb9-\xff], 1821 U+0100..02AF, U+0370..0373, 1822 U+0376..0379, U+037B..0383, U+0386 ...) 1823 \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2396: 1824 [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8], 1825 U+02B0..036F, U+0374..0375, U+037A, 1826 U+0384..0385, U+0387 ...) 1827 \p{Cased} \p{Cased=Y} (4279) 1828 \p{Cased: N*} (Single: \P{Cased}) (1_109_833 plus all 1829 above-Unicode code points: [\x00- 1830 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 1831 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab- 1832 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], U+01BB, 1833 U+01C0..01C3, U+0294, U+02B9..02BF, 1834 U+02C2..02DF ...) 1835 \p{Cased: Y*} (Single: \p{Cased}) (4279: [A-Za- 1836 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8- 1837 \xff], U+0100..01BA, U+01BC..01BF, 1838 U+01C4..0293, U+0295..02B8, U+02C0..02C1 1839 ...) 1840 \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short: 1841 \p{LC}) (3970) 1842 \p{Category: *} \p{General_Category: *} 1843 \p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian} 1844 (Short: \p{Aghb}; NOT \p{Block= 1845 Caucasian_Albanian}) (53) 1846 \p{Cc} \p{XPosixCntrl} (= \p{General_Category= 1847 Control}) (65) 1848 \p{Ccc: *} \p{Canonical_Combining_Class: *} 1849 \p{CE} \p{Composition_Exclusion} (= 1850 \p{Composition_Exclusion=Y}) (81) 1851 \p{CE: *} \p{Composition_Exclusion: *} 1852 \p{Cf} \p{Format} (= \p{General_Category=Format}) 1853 (161) 1854 \p{Chakma} \p{Script_Extensions=Chakma} (Short: 1855 \p{Cakm}; NOT \p{Block=Chakma}) (90) 1856 \p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block= 1857 Cham}) (83) 1858 \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short: 1859 \p{CWCF}) (1463) 1860 \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF}) 1861 (1_112_649 plus all above-Unicode code 1862 points: [\x00- 1863 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 1864 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\xb4\xb6- 1865 \xbf\xd7\xe0-\xff], U+0101, U+0103, 1866 U+0105, U+0107, U+0109 ...) 1867 \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF}) 1868 (1463: [A-Z\xb5\xc0-\xd6\xd8-\xdf], 1869 U+0100, U+0102, U+0104, U+0106, U+0108 1870 ...) 1871 \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short: 1872 \p{CWCM}) (2841) 1873 \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM}) 1874 (1_111_271 plus all above-Unicode code 1875 points: [\x00- 1876 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 1877 >?\@\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6- 1878 \xbf\xd7\xf7], U+0138, U+018D, U+019B, 1879 U+01AA..01AB, U+01BA..01BB ...) 1880 \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM}) 1881 (2841: [A-Za-z\xb5\xc0-\xd6\xd8- 1882 \xf6\xf8-\xff], U+0100..0137, 1883 U+0139..018C, U+018E..019A, 1884 U+019C..01A9, U+01AC..01B9 ...) 1885 \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short: 1886 \p{CWL}) (1390) 1887 \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL}) 1888 (1_112_722 plus all above-Unicode code 1889 points: [\x00- 1890 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 1891 >?\@\[\\\]\^_`a-z\{\|\}~\x7f- 1892 \xbf\xd7\xdf-\xff], U+0101, U+0103, 1893 U+0105, U+0107, U+0109 ...) 1894 \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1390: 1895 [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102, 1896 U+0104, U+0106, U+0108 ...) 1897 \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded= 1898 Y} (Short: \p{CWKCF}) (10_315) 1899 \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N}, 1900 \P{CWKCF}) (1_103_797 plus all above- 1901 Unicode code points: [\x00- 1902 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 1903 >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1- 1904 \xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6- 1905 \xb7\xbb\xbf\xd7\xe0-\xff], U+0101, 1906 U+0103, U+0105, U+0107, U+0109 ...) 1907 \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y}, 1908 \p{CWKCF}) (10_315: [A- 1909 Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8- 1910 \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf], 1911 U+0100, U+0102, U+0104, U+0106, U+0108 1912 ...) 1913 \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short: 1914 \p{CWT}) (1409) 1915 \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT}) 1916 (1_112_703 plus all above-Unicode code 1917 points: [\x00- 1918 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A- 1919 Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6- 1920 \xde\xf7], U+0100, U+0102, U+0104, 1921 U+0106, U+0108 ...) 1922 \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1409: 1923 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101, 1924 U+0103, U+0105, U+0107, U+0109 ...) 1925 \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short: 1926 \p{CWU}) (1482) 1927 \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU}) 1928 (1_112_630 plus all above-Unicode code 1929 points: [\x00- 1930 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A- 1931 Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6- 1932 \xde\xf7], U+0100, U+0102, U+0104, 1933 U+0106, U+0108 ...) 1934 \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1482: 1935 [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101, 1936 U+0103, U+0105, U+0107, U+0109 ...) 1937 \p{Cher} \p{Cherokee} (= \p{Script_Extensions= 1938 Cherokee}) (NOT \p{Block=Cherokee}) (172) 1939 \p{Cherokee} \p{Script_Extensions=Cherokee} (Short: 1940 \p{Cher}; NOT \p{Block=Cherokee}) (172) 1941 X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block= 1942 Cherokee_Supplement}) (80) 1943 X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short: 1944 \p{InCherokeeSup}) (80) 1945 X \p{Chess_Symbols} \p{Block=Chess_Symbols} (112) 1946 \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable= 1947 Y}) (2396) 1948 \p{CI: *} \p{Case_Ignorable: *} 1949 X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block= 1950 CJK_Unified_Ideographs}) (20_992) 1951 X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block= 1952 CJK_Compatibility}) (256) 1953 X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block= 1954 CJK_Compatibility_Forms}) (32) 1955 X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (= 1956 \p{Block=CJK_Compatibility_Ideographs}) 1957 (512) 1958 X \p{CJK_Compat_Ideographs_Sup} 1959 \p{CJK_Compatibility_Ideographs_- 1960 Supplement} (= \p{Block= 1961 CJK_Compatibility_Ideographs_- 1962 Supplement}) (544) 1963 X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short: 1964 \p{InCJKCompat}) (256) 1965 X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms} 1966 (Short: \p{InCJKCompatForms}) (32) 1967 X \p{CJK_Compatibility_Ideographs} \p{Block= 1968 CJK_Compatibility_Ideographs} (Short: 1969 \p{InCJKCompatIdeographs}) (512) 1970 X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block= 1971 CJK_Compatibility_Ideographs_Supplement} 1972 (Short: \p{InCJKCompatIdeographsSup}) 1973 (544) 1974 X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (= 1975 \p{Block= 1976 CJK_Unified_Ideographs_Extension_A}) 1977 (6592) 1978 X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (= 1979 \p{Block= 1980 CJK_Unified_Ideographs_Extension_B}) 1981 (42_720) 1982 X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (= 1983 \p{Block= 1984 CJK_Unified_Ideographs_Extension_C}) 1985 (4160) 1986 X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (= 1987 \p{Block= 1988 CJK_Unified_Ideographs_Extension_D}) 1989 (224) 1990 X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (= 1991 \p{Block= 1992 CJK_Unified_Ideographs_Extension_E}) 1993 (5776) 1994 X \p{CJK_Ext_F} \p{CJK_Unified_Ideographs_Extension_F} (= 1995 \p{Block= 1996 CJK_Unified_Ideographs_Extension_F}) 1997 (7488) 1998 X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block= 1999 CJK_Radicals_Supplement}) (128) 2000 X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement} 2001 (Short: \p{InCJKRadicalsSup}) (128) 2002 X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48) 2003 X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (= 2004 \p{Block=CJK_Symbols_And_Punctuation}) 2005 (64) 2006 X \p{CJK_Symbols_And_Punctuation} \p{Block= 2007 CJK_Symbols_And_Punctuation} (Short: 2008 \p{InCJKSymbols}) (64) 2009 X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs} 2010 (Short: \p{InCJK}) (20_992) 2011 X \p{CJK_Unified_Ideographs_Extension_A} \p{Block= 2012 CJK_Unified_Ideographs_Extension_A} 2013 (Short: \p{InCJKExtA}) (6592) 2014 X \p{CJK_Unified_Ideographs_Extension_B} \p{Block= 2015 CJK_Unified_Ideographs_Extension_B} 2016 (Short: \p{InCJKExtB}) (42_720) 2017 X \p{CJK_Unified_Ideographs_Extension_C} \p{Block= 2018 CJK_Unified_Ideographs_Extension_C} 2019 (Short: \p{InCJKExtC}) (4160) 2020 X \p{CJK_Unified_Ideographs_Extension_D} \p{Block= 2021 CJK_Unified_Ideographs_Extension_D} 2022 (Short: \p{InCJKExtD}) (224) 2023 X \p{CJK_Unified_Ideographs_Extension_E} \p{Block= 2024 CJK_Unified_Ideographs_Extension_E} 2025 (Short: \p{InCJKExtE}) (5776) 2026 X \p{CJK_Unified_Ideographs_Extension_F} \p{Block= 2027 CJK_Unified_Ideographs_Extension_F} 2028 (Short: \p{InCJKExtF}) (7488) 2029 \p{Close_Punctuation} \p{General_Category=Close_Punctuation} 2030 (Short: \p{Pe}) (73) 2031 \p{Cn} \p{Unassigned} (= \p{General_Category= 2032 Unassigned}) (836_602 plus all above- 2033 Unicode code points) 2034 \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category= 2035 Control}) (65) 2036 \p{Co} \p{Private_Use} (= \p{General_Category= 2037 Private_Use}) (NOT \p{Private_Use_Area}) 2038 (137_468) 2039 X \p{Combining_Diacritical_Marks} \p{Block= 2040 Combining_Diacritical_Marks} (Short: 2041 \p{InDiacriticals}) (112) 2042 X \p{Combining_Diacritical_Marks_Extended} \p{Block= 2043 Combining_Diacritical_Marks_Extended} 2044 (Short: \p{InDiacriticalsExt}) (80) 2045 X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block= 2046 Combining_Diacritical_Marks_For_Symbols} 2047 (Short: \p{InDiacriticalsForSymbols}) 2048 (48) 2049 X \p{Combining_Diacritical_Marks_Supplement} \p{Block= 2050 Combining_Diacritical_Marks_Supplement} 2051 (Short: \p{InDiacriticalsSup}) (64) 2052 X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short: 2053 \p{InHalfMarks}) (16) 2054 \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark}) 2055 (2268) 2056 X \p{Combining_Marks_For_Symbols} 2057 \p{Combining_Diacritical_Marks_For_- 2058 Symbols} (= \p{Block= 2059 Combining_Diacritical_Marks_For_- 2060 Symbols}) (48) 2061 \p{Common} \p{Script_Extensions=Common} (Short: 2062 \p{Zyyy}) (7386) 2063 X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms} 2064 (Short: \p{InIndicNumberForms}) (16) 2065 \p{Comp_Ex} \p{Full_Composition_Exclusion} (= 2066 \p{Full_Composition_Exclusion=Y}) (1120) 2067 \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *} 2068 X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block= 2069 Hangul_Compatibility_Jamo}) (96) 2070 \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short: 2071 \p{CE}) (81) 2072 \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031 2073 plus all above-Unicode code points: 2074 U+0000..0957, U+0960..09DB, U+09DE, 2075 U+09E0..0A32, U+0A34..0A35, U+0A37..0A58 2076 ...) 2077 \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81: 2078 U+0958..095F, U+09DC..09DD, U+09DF, 2079 U+0A33, U+0A36, U+0A59..0A5B ...) 2080 \p{Connector_Punctuation} \p{General_Category= 2081 Connector_Punctuation} (Short: \p{Pc}) 2082 (10) 2083 \p{Control} \p{XPosixCntrl} (= \p{General_Category= 2084 Control}) (65) 2085 X \p{Control_Pictures} \p{Block=Control_Pictures} (64) 2086 \p{Copt} \p{Coptic} (= \p{Script_Extensions= 2087 Coptic}) (NOT \p{Block=Coptic}) (165) 2088 \p{Coptic} \p{Script_Extensions=Coptic} (Short: 2089 \p{Copt}; NOT \p{Block=Coptic}) (165) 2090 X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32) 2091 X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block= 2092 Counting_Rod_Numerals}) (32) 2093 X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short: 2094 \p{InCountingRod}) (32) 2095 \p{Cprt} \p{Cypriot} (= \p{Script_Extensions= 2096 Cypriot}) (112) 2097 \p{Cs} \p{Surrogate} (= \p{General_Category= 2098 Surrogate}) (2048) 2099 \p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short: 2100 \p{Xsux}; NOT \p{Block=Cuneiform}) (1234) 2101 X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (= 2102 \p{Block= 2103 Cuneiform_Numbers_And_Punctuation}) (128) 2104 X \p{Cuneiform_Numbers_And_Punctuation} \p{Block= 2105 Cuneiform_Numbers_And_Punctuation} 2106 (Short: \p{InCuneiformNumbers}) (128) 2107 \p{Currency_Symbol} \p{General_Category=Currency_Symbol} 2108 (Short: \p{Sc}) (62) 2109 X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48) 2110 \p{CWCF} \p{Changes_When_Casefolded} (= 2111 \p{Changes_When_Casefolded=Y}) (1463) 2112 \p{CWCF: *} \p{Changes_When_Casefolded: *} 2113 \p{CWCM} \p{Changes_When_Casemapped} (= 2114 \p{Changes_When_Casemapped=Y}) (2841) 2115 \p{CWCM: *} \p{Changes_When_Casemapped: *} 2116 \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (= 2117 \p{Changes_When_NFKC_Casefolded=Y}) 2118 (10_315) 2119 \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *} 2120 \p{CWL} \p{Changes_When_Lowercased} (= 2121 \p{Changes_When_Lowercased=Y}) (1390) 2122 \p{CWL: *} \p{Changes_When_Lowercased: *} 2123 \p{CWT} \p{Changes_When_Titlecased} (= 2124 \p{Changes_When_Titlecased=Y}) (1409) 2125 \p{CWT: *} \p{Changes_When_Titlecased: *} 2126 \p{CWU} \p{Changes_When_Uppercased} (= 2127 \p{Changes_When_Uppercased=Y}) (1482) 2128 \p{CWU: *} \p{Changes_When_Uppercased: *} 2129 \p{Cypriot} \p{Script_Extensions=Cypriot} (Short: 2130 \p{Cprt}) (112) 2131 X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64) 2132 \p{Cyrillic} \p{Script_Extensions=Cyrillic} (Short: 2133 \p{Cyrl}; NOT \p{Block=Cyrillic}) (446) 2134 X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block= 2135 Cyrillic_Extended_A}) (32) 2136 X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block= 2137 Cyrillic_Extended_B}) (96) 2138 X \p{Cyrillic_Ext_C} \p{Cyrillic_Extended_C} (= \p{Block= 2139 Cyrillic_Extended_C}) (16) 2140 X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short: 2141 \p{InCyrillicExtA}) (32) 2142 X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short: 2143 \p{InCyrillicExtB}) (96) 2144 X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short: 2145 \p{InCyrillicExtC}) (16) 2146 X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block= 2147 Cyrillic_Supplement}) (48) 2148 X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short: 2149 \p{InCyrillicSup}) (48) 2150 X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block= 2151 Cyrillic_Supplement}) (48) 2152 \p{Cyrl} \p{Cyrillic} (= \p{Script_Extensions= 2153 Cyrillic}) (NOT \p{Block=Cyrillic}) (446) 2154 \p{Dash} \p{Dash=Y} (28) 2155 \p{Dash: N*} (Single: \P{Dash}) (1_114_084 plus all 2156 above-Unicode code points: [\x00- 2157 \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A- 2158 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff], 2159 U+0100..0589, U+058B..05BD, 2160 U+05BF..13FF, U+1401..1805, U+1807..200F 2161 ...) 2162 \p{Dash: Y*} (Single: \p{Dash}) (28: [\-], U+058A, 2163 U+05BE, U+1400, U+1806, U+2010..2015 ...) 2164 \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation} 2165 (Short: \p{Pd}) (24) 2166 \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category= 2167 Decimal_Number}) (630) 2168 \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical} 2169 (13_232) 2170 \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232: 2171 [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9- 2172 \xdd\xe0-\xe5\xe7-\xef\xf1-\xf6\xf9- 2173 \xfd\xff], U+0100..010F, U+0112..0125, 2174 U+0128..0130, U+0134..0137, U+0139..013E 2175 ...) 2176 \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240: 2177 U+2460..2473, U+24B6..24EA, 2178 U+3244..3247, U+3251..327E, 2179 U+3280..32BF, U+32D0..32FE ...) 2180 \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720) 2181 \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720: 2182 [\xa8\xaf\xb4-\xb5\xb8], U+0132..0133, 2183 U+013F..0140, U+0149, U+017F, 2184 U+01C4..01CC ...) 2185 \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240) 2186 \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240) 2187 \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51, 2188 U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63 2189 ...) 2190 \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184: U+2102, 2191 U+210A..2113, U+2115, U+2119..211D, 2192 U+2124, U+2128 ...) 2193 \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20) 2194 \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc- 2195 \xbe], U+2150..215F, U+2189) 2196 \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171) 2197 \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54, 2198 U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68 2199 ...) 2200 \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238) 2201 \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50, 2202 U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62 2203 ...) 2204 \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82) 2205 \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55, 2206 U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69 2207 ...) 2208 \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122) 2209 \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122: 2210 U+FF61..FFBE, U+FFC2..FFC7, 2211 U+FFCA..FFCF, U+FFD2..FFD7, 2212 U+FFDA..FFDC, U+FFE8..FFEE) 2213 \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5) 2214 \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0], 2215 U+0F0C, U+2007, U+2011, U+202F) 2216 \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type= 2217 Non_Canonical} (Perl extension) (3664) 2218 \p{Decomposition_Type: Non_Canonical} Union of all non-canonical 2219 decompositions (Short: \p{Dt=NonCanon}) 2220 (Perl extension) (3664: 2221 [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-\xba\xbc- 2222 \xbe], U+0132..0133, U+013F..0140, 2223 U+0149, U+017F, U+01C4..01CC ...) 2224 \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_216 plus 2225 all above-Unicode code points: [\x00- 2226 \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6- 2227 \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde- 2228 \xdf\xe6\xf0\xf7-\xf8\xfe], 2229 U+0110..0111, U+0126..0127, U+0131, 2230 U+0138, U+0141..0142 ...) 2231 \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26: 2232 U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B) 2233 \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26) 2234 \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286) 2235 \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250, 2236 U+32CC..32CF, U+32FF..3357, 2237 U+3371..33DF, U+33FF, U+1F130..1F14F ...) 2238 \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A, 2239 U+2080..208E, U+2090..209C, U+2C7C) 2240 \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (153) 2241 \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (153: [\xaa\xb2- 2242 \xb3\xb9-\xba], U+02B0..02B8, 2243 U+02E0..02E4, U+10FC, U+1D2C..1D2E, 2244 U+1D30..1D3A ...) 2245 \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35) 2246 \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F, 2247 U+30FF, U+FE10..FE19, U+FE30..FE44, 2248 U+FE47..FE48) 2249 \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000, 2250 U+FF01..FF60, U+FFE0..FFE6) 2251 \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point= 2252 Y} (Short: \p{DI}) (4173) 2253 \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI}) 2254 (1_109_939 plus all above-Unicode code 2255 points: [\x00-\xac\xae-\xff], 2256 U+0100..034E, U+0350..061B, 2257 U+061D..115E, U+1161..17B3, U+17B6..180A 2258 ...) 2259 \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI}) 2260 (4173: [\xad], U+034F, U+061C, 2261 U+115F..1160, U+17B4..17B5, U+180B..180E 2262 ...) 2263 \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (15) 2264 \p{Dep: *} \p{Deprecated: *} 2265 \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (15) 2266 \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_097 2267 plus all above-Unicode code points: 2268 U+0000..0148, U+014A..0672, 2269 U+0674..0F76, U+0F78, U+0F7A..17A2, 2270 U+17A5..2069 ...) 2271 \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149, 2272 U+0673, U+0F77, U+0F79, U+17A3..17A4, 2273 U+206A..206F ...) 2274 \p{Deseret} \p{Script_Extensions=Deseret} (Short: 2275 \p{Dsrt}) (80) 2276 \p{Deva} \p{Devanagari} (= \p{Script_Extensions= 2277 Devanagari}) (NOT \p{Block=Devanagari}) 2278 (210) 2279 \p{Devanagari} \p{Script_Extensions=Devanagari} (Short: 2280 \p{Deva}; NOT \p{Block=Devanagari}) (210) 2281 X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block= 2282 Devanagari_Extended}) (32) 2283 X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short: 2284 \p{InDevanagariExt}) (32) 2285 \p{DI} \p{Default_Ignorable_Code_Point} (= 2286 \p{Default_Ignorable_Code_Point=Y}) 2287 (4173) 2288 \p{DI: *} \p{Default_Ignorable_Code_Point: *} 2289 \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (873) 2290 \p{Dia: *} \p{Diacritic: *} 2291 \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (873) 2292 \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_239 2293 plus all above-Unicode code points: 2294 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 2295 >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9- 2296 \xae\xb0-\xb3\xb5-\xb6\xb9-\xff], 2297 U+0100..02AF, U+034F, U+0358..035C, 2298 U+0363..0373, U+0376..0379 ...) 2299 \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (873: 2300 [\^`\xa8\xaf\xb4\xb7-\xb8], 2301 U+02B0..034E, U+0350..0357, 2302 U+035D..0362, U+0374..0375, U+037A ...) 2303 X \p{Diacriticals} \p{Combining_Diacritical_Marks} (= 2304 \p{Block=Combining_Diacritical_Marks}) 2305 (112) 2306 X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended} 2307 (= \p{Block= 2308 Combining_Diacritical_Marks_Extended}) 2309 (80) 2310 X \p{Diacriticals_For_Symbols} 2311 \p{Combining_Diacritical_Marks_For_- 2312 Symbols} (= \p{Block= 2313 Combining_Diacritical_Marks_For_- 2314 Symbols}) (48) 2315 X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement} 2316 (= \p{Block= 2317 Combining_Diacritical_Marks_Supplement}) 2318 (64) 2319 \p{Digit} \p{XPosixDigit} (= \p{General_Category= 2320 Decimal_Number}) (630) 2321 X \p{Dingbats} \p{Block=Dingbats} (192) 2322 \p{Dogr} \p{Dogra} (= \p{Script_Extensions=Dogra}) 2323 (NOT \p{Block=Dogra}) (82) 2324 \p{Dogra} \p{Script_Extensions=Dogra} (Short: 2325 \p{Dogr}; NOT \p{Block=Dogra}) (82) 2326 X \p{Domino} \p{Domino_Tiles} (= \p{Block= 2327 Domino_Tiles}) (112) 2328 X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short: 2329 \p{InDomino}) (112) 2330 \p{Dsrt} \p{Deseret} (= \p{Script_Extensions= 2331 Deseret}) (80) 2332 \p{Dt: *} \p{Decomposition_Type: *} 2333 \p{Dupl} \p{Duployan} (= \p{Script_Extensions= 2334 Duployan}) (NOT \p{Block=Duployan}) (147) 2335 \p{Duployan} \p{Script_Extensions=Duployan} (Short: 2336 \p{Dupl}; NOT \p{Block=Duployan}) (147) 2337 \p{Ea: *} \p{East_Asian_Width: *} 2338 X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform} 2339 (208) 2340 \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739) 2341 \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739: 2342 [\xa1\xa4\xa7-\xa8\xaa\xad-\xae\xb0- 2343 \xb4\xb6-\xba\xbc-\xbf\xc6\xd0\xd7- 2344 \xd8\xde-\xe1\xe6\xe8-\xea\xec- 2345 \xed\xf0\xf2-\xf3\xf7-\xfa\xfc\xfe], 2346 U+0101, U+0111, U+0113, U+011B, 2347 U+0126..0127 ...) 2348 \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104) 2349 \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000, 2350 U+FF01..FF60, U+FFE0..FFE6) 2351 \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123) 2352 \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9, 2353 U+FF61..FFBE, U+FFC2..FFC7, 2354 U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC 2355 ...) 2356 \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (793_252 plus 2357 all above-Unicode code points) 2358 \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111) 2359 \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20- 2360 \x7e\xa2-\xa3\xa5-\xa6\xac\xaf], 2361 U+27E6..27ED, U+2985..2986) 2362 \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (793_252 plus all 2363 above-Unicode code points: [\x00- 2364 \x1f\x7f-\xa0\xa9\xab\xb5\xbb\xc0- 2365 \xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd\xe2- 2366 \xe5\xe7\xeb\xee-\xef\xf1\xf4- 2367 \xf6\xfb\xfd\xff], U+00FF..0100, 2368 U+0102..0110, U+0112, U+0114..011A, 2369 U+011C..0125 ...) 2370 \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (181_783) 2371 \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (181_783: 2372 U+1100..115F, U+231A..231B, 2373 U+2329..232A, U+23E9..23EC, U+23F0, 2374 U+23F3 ...) 2375 \p{Egyp} \p{Egyptian_Hieroglyphs} (= 2376 \p{Script_Extensions= 2377 Egyptian_Hieroglyphs}) (NOT \p{Block= 2378 Egyptian_Hieroglyphs}) (1080) 2379 X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block= 2380 Egyptian_Hieroglyph_Format_Controls} (16) 2381 \p{Egyptian_Hieroglyphs} \p{Script_Extensions= 2382 Egyptian_Hieroglyphs} (Short: \p{Egyp}; 2383 NOT \p{Block=Egyptian_Hieroglyphs}) 2384 (1080) 2385 \p{Elba} \p{Elbasan} (= \p{Script_Extensions= 2386 Elbasan}) (NOT \p{Block=Elbasan}) (40) 2387 \p{Elbasan} \p{Script_Extensions=Elbasan} (Short: 2388 \p{Elba}; NOT \p{Block=Elbasan}) (40) 2389 \p{Elym} \p{Elymaic} (= \p{Script_Extensions= 2390 Elymaic}) (NOT \p{Block=Elymaic}) (23) 2391 \p{Elymaic} \p{Script_Extensions=Elymaic} (Short: 2392 \p{Elym}; NOT \p{Block=Elymaic}) (23) 2393 X \p{Emoticons} \p{Block=Emoticons} (80) 2394 X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block= 2395 Enclosed_Alphanumerics}) (160) 2396 X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (= 2397 \p{Block= 2398 Enclosed_Alphanumeric_Supplement}) (256) 2399 X \p{Enclosed_Alphanumeric_Supplement} \p{Block= 2400 Enclosed_Alphanumeric_Supplement} 2401 (Short: \p{InEnclosedAlphanumSup}) (256) 2402 X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics} 2403 (Short: \p{InEnclosedAlphanum}) (160) 2404 X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (= 2405 \p{Block= 2406 Enclosed_CJK_Letters_And_Months}) (256) 2407 X \p{Enclosed_CJK_Letters_And_Months} \p{Block= 2408 Enclosed_CJK_Letters_And_Months} (Short: 2409 \p{InEnclosedCJK}) (256) 2410 X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement} 2411 (= \p{Block= 2412 Enclosed_Ideographic_Supplement}) (256) 2413 X \p{Enclosed_Ideographic_Supplement} \p{Block= 2414 Enclosed_Ideographic_Supplement} (Short: 2415 \p{InEnclosedIdeographicSup}) (256) 2416 \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark} 2417 (Short: \p{Me}) (13) 2418 \p{Ethi} \p{Ethiopic} (= \p{Script_Extensions= 2419 Ethiopic}) (NOT \p{Block=Ethiopic}) (495) 2420 \p{Ethiopic} \p{Script_Extensions=Ethiopic} (Short: 2421 \p{Ethi}; NOT \p{Block=Ethiopic}) (495) 2422 X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block= 2423 Ethiopic_Extended}) (96) 2424 X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block= 2425 Ethiopic_Extended_A}) (48) 2426 X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short: 2427 \p{InEthiopicExt}) (96) 2428 X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short: 2429 \p{InEthiopicExtA}) (48) 2430 X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block= 2431 Ethiopic_Supplement}) (32) 2432 X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short: 2433 \p{InEthiopicSup}) (32) 2434 \p{Ext} \p{Extender} (= \p{Extender=Y}) (47) 2435 \p{Ext: *} \p{Extender: *} 2436 \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (47) 2437 \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_065 2438 plus all above-Unicode code points: 2439 [\x00-\xb6\xb8-\xff], U+0100..02CF, 2440 U+02D2..063F, U+0641..07F9, 2441 U+07FB..0E45, U+0E47..0EC5 ...) 2442 \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (47: [\xb7], 2443 U+02D0..02D1, U+0640, U+07FA, U+0E46, 2444 U+0EC6 ...) 2445 \p{Final_Punctuation} \p{General_Category=Final_Punctuation} 2446 (Short: \p{Pf}) (10) 2447 \p{Format} \p{General_Category=Format} (Short: 2448 \p{Cf}) (161) 2449 \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y} 2450 (Short: \p{CompEx}) (1120) 2451 \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N}, 2452 \P{CompEx}) (1_112_992 plus all above- 2453 Unicode code points: U+0000..033F, 2454 U+0342, U+0345..0373, U+0375..037D, 2455 U+037F..0386, U+0388..0957 ...) 2456 \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y}, 2457 \p{CompEx}) (1120: U+0340..0341, 2458 U+0343..0344, U+0374, U+037E, U+0387, 2459 U+0958..095F ...) 2460 \p{Gc: *} \p{General_Category: *} 2461 \p{GCB: *} \p{Grapheme_Cluster_Break: *} 2462 \p{General_Category: C} \p{General_Category=Other} (976_344 plus 2463 all above-Unicode code points) 2464 \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short: 2465 \p{Gc=LC}, \p{LC}) (3970: [A-Za- 2466 z\xb5\xc0-\xd6\xd8-\xf6\xf8-\xff], 2467 U+0100..01BA, U+01BC..01BF, 2468 U+01C4..0293, U+0295..02AF, U+0370..0373 2469 ...) 2470 \p{General_Category: Cc} \p{General_Category=Control} (65) 2471 \p{General_Category: Cf} \p{General_Category=Format} (161) 2472 \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe}) 2473 (73: [\)\]\}], U+0F3B, U+0F3D, U+169C, 2474 U+2046, U+207E ...) 2475 \p{General_Category: Cn} \p{General_Category=Unassigned} (836_602 2476 plus all above-Unicode code points) 2477 \p{General_Category: Cntrl} \p{General_Category=Control} (65) 2478 \p{General_Category: Co} \p{General_Category=Private_Use} (137_468) 2479 \p{General_Category: Combining_Mark} \p{General_Category=Mark} 2480 (2268) 2481 \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc}, 2482 \p{Pc}) (10: [_], U+203F..2040, U+2054, 2483 U+FE33..FE34, U+FE4D..FE4F, U+FF3F) 2484 \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65: 2485 [\x00-\x1f\x7f-\x9f]) 2486 \p{General_Category: Cs} \p{General_Category=Surrogate} (2048) 2487 \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc}) 2488 (62: [\$\xa2-\xa5], U+058F, U+060B, 2489 U+07FE..07FF, U+09F2..09F3, U+09FB ...) 2490 \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd}) 2491 (24: [\-], U+058A, U+05BE, U+1400, 2492 U+1806, U+2010..2015 ...) 2493 \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd}) 2494 (630: [0-9], U+0660..0669, U+06F0..06F9, 2495 U+07C0..07C9, U+0966..096F, U+09E6..09EF 2496 ...) 2497 \p{General_Category: Digit} \p{General_Category=Decimal_Number} 2498 (630) 2499 \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me}) 2500 (13: U+0488..0489, U+1ABE, U+20DD..20E0, 2501 U+20E2..20E4, U+A670..A672) 2502 \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf}) 2503 (10: [\xbb], U+2019, U+201D, U+203A, 2504 U+2E03, U+2E05 ...) 2505 \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (161: 2506 [\xad], U+0600..0605, U+061C, U+06DD, 2507 U+070F, U+08E2 ...) 2508 \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi}, 2509 \p{Pi}) (12: [\xab], U+2018, 2510 U+201B..201C, U+201F, U+2039, U+2E02 ...) 2511 \p{General_Category: L} \p{General_Category=Letter} (125_643) 2512 X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3970) 2513 X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note 2514 the trailing '_' matters in spite of 2515 loose matching rules. (3970) 2516 \p{General_Category: LC} \p{General_Category=Cased_Letter} (3970) 2517 \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (125_643: 2518 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8- 2519 \xf6\xf8-\xff], U+0100..02C1, 2520 U+02C6..02D1, U+02E0..02E4, U+02EC, 2521 U+02EE ...) 2522 \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl}) 2523 (236: U+16EE..16F0, U+2160..2182, 2524 U+2185..2188, U+3007, U+3021..3029, 2525 U+3038..303A ...) 2526 \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) 2527 (1: U+2028) 2528 \p{General_Category: Ll} \p{General_Category=Lowercase_Letter} 2529 (/i= General_Category=Cased_Letter) 2530 (2151) 2531 \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (259) 2532 \p{General_Category: Lo} \p{General_Category=Other_Letter} 2533 (121_414) 2534 \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll}; 2535 /i= General_Category=Cased_Letter) 2536 (2151: [a-z\xb5\xdf-\xf6\xf8-\xff], 2537 U+0101, U+0103, U+0105, U+0107, U+0109 2538 ...) 2539 \p{General_Category: Lt} \p{General_Category=Titlecase_Letter} 2540 (/i= General_Category=Cased_Letter) (31) 2541 \p{General_Category: Lu} \p{General_Category=Uppercase_Letter} 2542 (/i= General_Category=Cased_Letter) 2543 (1788) 2544 \p{General_Category: M} \p{General_Category=Mark} (2268) 2545 \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2268: 2546 U+0300..036F, U+0483..0489, 2547 U+0591..05BD, U+05BF, U+05C1..05C2, 2548 U+05C4..05C5 ...) 2549 \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948: 2550 [+<=>\|~\xac\xb1\xd7\xf7], U+03F6, 2551 U+0606..0608, U+2044, U+2052, 2552 U+207A..207C ...) 2553 \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (429) 2554 \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13) 2555 \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark} 2556 (1826) 2557 \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm}) 2558 (259: U+02B0..02C1, U+02C6..02D1, 2559 U+02E0..02E4, U+02EC, U+02EE, U+0374 ...) 2560 \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk}) 2561 (121: [\^`\xa8\xaf\xb4\xb8], 2562 U+02C2..02C5, U+02D2..02DF, 2563 U+02E5..02EB, U+02ED, U+02EF..02FF ...) 2564 \p{General_Category: N} \p{General_Category=Number} (1754) 2565 \p{General_Category: Nd} \p{General_Category=Decimal_Number} (630) 2566 \p{General_Category: Nl} \p{General_Category=Letter_Number} (236) 2567 \p{General_Category: No} \p{General_Category=Other_Number} (888) 2568 \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn}) 2569 (1826: U+0300..036F, U+0483..0487, 2570 U+0591..05BD, U+05BF, U+05C1..05C2, 2571 U+05C4..05C5 ...) 2572 \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1754: 2573 [0-9\xb2-\xb3\xb9\xbc-\xbe], 2574 U+0660..0669, U+06F0..06F9, 2575 U+07C0..07C9, U+0966..096F, U+09E6..09EF 2576 ...) 2577 \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps}) 2578 (75: [\(\[\{], U+0F3A, U+0F3C, U+169B, 2579 U+201A, U+201E ...) 2580 \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (976_344 plus 2581 all above-Unicode code points: [\x00- 2582 \x1f\x7f-\x9f\xad], U+0378..0379, 2583 U+0380..0383, U+038B, U+038D, U+03A2 ...) 2584 \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo}) 2585 (121_414: [\xaa\xba], U+01BB, 2586 U+01C0..01C3, U+0294, U+05D0..05EA, 2587 U+05EF..05F2 ...) 2588 \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) 2589 (888: [\xb2-\xb3\xb9\xbc-\xbe], 2590 U+09F4..09F9, U+0B72..0B77, 2591 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E 2592 ...) 2593 \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po}) 2594 (588: 2595 [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7\xb6- 2596 \xb7\xbf], U+037E, U+0387, U+055A..055F, 2597 U+0589, U+05C0 ...) 2598 \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So}) 2599 (6161: [\xa6\xa9\xae\xb0], U+0482, 2600 U+058D..058E, U+060E..060F, U+06DE, 2601 U+06E9 ...) 2602 \p{General_Category: P} \p{General_Category=Punctuation} (792) 2603 \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp}, 2604 \p{Zp}) (1: U+2029) 2605 \p{General_Category: Pc} \p{General_Category= 2606 Connector_Punctuation} (10) 2607 \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24) 2608 \p{General_Category: Pe} \p{General_Category=Close_Punctuation} 2609 (73) 2610 \p{General_Category: Pf} \p{General_Category=Final_Punctuation} 2611 (10) 2612 \p{General_Category: Pi} \p{General_Category=Initial_Punctuation} 2613 (12) 2614 \p{General_Category: Po} \p{General_Category=Other_Punctuation} 2615 (588) 2616 \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co}) 2617 (137_468: U+E000..F8FF, U+F0000..FFFFD, 2618 U+100000..10FFFD) 2619 \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75) 2620 \p{General_Category: Punct} \p{General_Category=Punctuation} (792) 2621 \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (792: 2622 [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_- 2623 \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf], 2624 U+037E, U+0387, U+055A..055F, 2625 U+0589..058A, U+05BE ...) 2626 \p{General_Category: S} \p{General_Category=Symbol} (7292) 2627 \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (62) 2628 \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19: 2629 [\x20\xa0], U+1680, U+2000..200A, 2630 U+2028..2029, U+202F, U+205F ...) 2631 \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121) 2632 \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948) 2633 \p{General_Category: So} \p{General_Category=Other_Symbol} (6161) 2634 \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs}) 2635 (17: [\x20\xa0], U+1680, U+2000..200A, 2636 U+202F, U+205F, U+3000) 2637 \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) 2638 (429: U+0903, U+093B, U+093E..0940, 2639 U+0949..094C, U+094E..094F, U+0982..0983 2640 ...) 2641 \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048: 2642 U+D800..DFFF) 2643 \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7292: 2644 [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae- 2645 \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5, 2646 U+02D2..02DF, U+02E5..02EB, U+02ED, 2647 U+02EF..02FF ...) 2648 \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt}; 2649 /i= General_Category=Cased_Letter) (31: 2650 U+01C5, U+01C8, U+01CB, U+01F2, 2651 U+1F88..1F8F, U+1F98..1F9F ...) 2652 \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn}) 2653 (836_602 plus all above-Unicode code 2654 points: U+0378..0379, U+0380..0383, 2655 U+038B, U+038D, U+03A2, U+0530 ...) 2656 \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu}; 2657 /i= General_Category=Cased_Letter) 2658 (1788: [A-Z\xc0-\xd6\xd8-\xde], U+0100, 2659 U+0102, U+0104, U+0106, U+0108 ...) 2660 \p{General_Category: Z} \p{General_Category=Separator} (19) 2661 \p{General_Category: Zl} \p{General_Category=Line_Separator} (1) 2662 \p{General_Category: Zp} \p{General_Category=Paragraph_Separator} 2663 (1) 2664 \p{General_Category: Zs} \p{General_Category=Space_Separator} (17) 2665 X \p{General_Punctuation} \p{Block=General_Punctuation} (Short: 2666 \p{InPunctuation}) (112) 2667 X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96) 2668 X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (= 2669 \p{Block=Geometric_Shapes_Extended}) 2670 (128) 2671 X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended} 2672 (Short: \p{InGeometricShapesExt}) (128) 2673 \p{Geor} \p{Georgian} (= \p{Script_Extensions= 2674 Georgian}) (NOT \p{Block=Georgian}) (175) 2675 \p{Georgian} \p{Script_Extensions=Georgian} (Short: 2676 \p{Geor}; NOT \p{Block=Georgian}) (175) 2677 X \p{Georgian_Ext} \p{Georgian_Extended} (= \p{Block= 2678 Georgian_Extended}) (48) 2679 X \p{Georgian_Extended} \p{Block=Georgian_Extended} (Short: 2680 \p{InGeorgianExt}) (48) 2681 X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block= 2682 Georgian_Supplement}) (48) 2683 X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short: 2684 \p{InGeorgianSup}) (48) 2685 \p{Glag} \p{Glagolitic} (= \p{Script_Extensions= 2686 Glagolitic}) (NOT \p{Block=Glagolitic}) 2687 (136) 2688 \p{Glagolitic} \p{Script_Extensions=Glagolitic} (Short: 2689 \p{Glag}; NOT \p{Block=Glagolitic}) (136) 2690 X \p{Glagolitic_Sup} \p{Glagolitic_Supplement} (= \p{Block= 2691 Glagolitic_Supplement}) (48) 2692 X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short: 2693 \p{InGlagoliticSup}) (48) 2694 \p{Gong} \p{Gunjala_Gondi} (= \p{Script_Extensions= 2695 Gunjala_Gondi}) (NOT \p{Block= 2696 Gunjala_Gondi}) (65) 2697 \p{Gonm} \p{Masaram_Gondi} (= \p{Script_Extensions= 2698 Masaram_Gondi}) (NOT \p{Block= 2699 Masaram_Gondi}) (77) 2700 \p{Goth} \p{Gothic} (= \p{Script_Extensions= 2701 Gothic}) (NOT \p{Block=Gothic}) (27) 2702 \p{Gothic} \p{Script_Extensions=Gothic} (Short: 2703 \p{Goth}; NOT \p{Block=Gothic}) (27) 2704 \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y}) 2705 (135_898) 2706 \p{Gr_Base: *} \p{Grapheme_Base: *} 2707 \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend= 2708 Y}) (1965) 2709 \p{Gr_Ext: *} \p{Grapheme_Extend: *} 2710 \p{Gran} \p{Grantha} (= \p{Script_Extensions= 2711 Grantha}) (NOT \p{Block=Grantha}) (116) 2712 \p{Grantha} \p{Script_Extensions=Grantha} (Short: 2713 \p{Gran}; NOT \p{Block=Grantha}) (116) 2714 \p{Graph} \p{XPosixGraph} (275_378) 2715 \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase}) 2716 (135_898) 2717 \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (978_214 2718 plus all above-Unicode code points: 2719 [\x00-\x1f\x7f-\x9f\xad], U+0300..036F, 2720 U+0378..0379, U+0380..0383, U+038B, 2721 U+038D ...) 2722 \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase}) 2723 (135_898: [\x20-\x7e\xa0-\xac\xae-\xff], 2724 U+0100..02FF, U+0370..0377, 2725 U+037A..037F, U+0384..038A, U+038C ...) 2726 \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control} 2727 (3886) 2728 \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3886: 2729 [^\n\r\x20-\x7e\xa0-\xac\xae-\xff], 2730 U+061C, U+180E, U+200B, U+200E..200F, 2731 U+2028..202E ...) 2732 \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r]) 2733 \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0) 2734 \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0) 2735 \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0) 2736 \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base} 2737 (0) 2738 \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break= 2739 E_Base_GAZ} (0) 2740 \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break= 2741 E_Modifier} (0) 2742 \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend} 2743 (1970) 2744 \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1970: 2745 U+0300..036F, U+0483..0489, 2746 U+0591..05BD, U+05BF, U+05C1..05C2, 2747 U+05C4..05C5 ...) 2748 \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break= 2749 Glue_After_Zwj} (0) 2750 \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0) 2751 \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125: 2752 U+1100..115F, U+A960..A97C) 2753 \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n]) 2754 \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00, 2755 U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C 2756 ...) 2757 \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773: 2758 U+AC01..AC1B, U+AC1D..AC37, 2759 U+AC39..AC53, U+AC55..AC6F, 2760 U+AC71..AC8B, U+AC8D..ACA7 ...) 2761 \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_301 2762 plus all above-Unicode code points: 2763 [\x20-\x7e\xa0-\xac\xae-\xff], 2764 U+0100..02FF, U+0370..0482, 2765 U+048A..0590, U+05BE, U+05C0 ...) 2766 \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend} 2767 (22) 2768 \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (22: 2769 U+0600..0605, U+06DD, U+070F, U+08E2, 2770 U+0D4E, U+110BD ...) 2771 \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI}) 2772 (26: U+1F1E6..1F1FF) 2773 \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break= 2774 Regional_Indicator} (26) 2775 \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break= 2776 SpacingMark} (375) 2777 \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (375: 2778 U+0903, U+093B, U+093E..0940, 2779 U+0949..094C, U+094E..094F, U+0982..0983 2780 ...) 2781 \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137: 2782 U+11A8..11FF, U+D7CB..D7FB) 2783 \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95: 2784 U+1160..11A7, U+D7B0..D7C6) 2785 \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other} 2786 (1_096_301 plus all above-Unicode code 2787 points) 2788 \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D) 2789 \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt}) 2790 (1965) 2791 \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_147 2792 plus all above-Unicode code points: 2793 U+0000..02FF, U+0370..0482, 2794 U+048A..0590, U+05BE, U+05C0, U+05C3 ...) 2795 \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1965: 2796 U+0300..036F, U+0483..0489, 2797 U+0591..05BD, U+05BF, U+05C1..05C2, 2798 U+05C4..05C5 ...) 2799 \p{Greek} \p{Script_Extensions=Greek} (Short: 2800 \p{Grek}; NOT \p{Greek_And_Coptic}) (522) 2801 X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short: 2802 \p{InGreek}) (144) 2803 X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block= 2804 Greek_Extended}) (256) 2805 X \p{Greek_Extended} \p{Block=Greek_Extended} (Short: 2806 \p{InGreekExt}) (256) 2807 \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek}) 2808 (NOT \p{Greek_And_Coptic}) (522) 2809 \p{Gujarati} \p{Script_Extensions=Gujarati} (Short: 2810 \p{Gujr}; NOT \p{Block=Gujarati}) (105) 2811 \p{Gujr} \p{Gujarati} (= \p{Script_Extensions= 2812 Gujarati}) (NOT \p{Block=Gujarati}) (105) 2813 \p{Gunjala_Gondi} \p{Script_Extensions=Gunjala_Gondi} 2814 (Short: \p{Gong}; NOT \p{Block= 2815 Gunjala_Gondi}) (65) 2816 \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short: 2817 \p{Guru}; NOT \p{Block=Gurmukhi}) (94) 2818 \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions= 2819 Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94) 2820 X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (= 2821 \p{Block=Halfwidth_And_Fullwidth_Forms}) 2822 (240) 2823 X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block= 2824 Combining_Half_Marks}) (16) 2825 X \p{Halfwidth_And_Fullwidth_Forms} \p{Block= 2826 Halfwidth_And_Fullwidth_Forms} (Short: 2827 \p{InHalfAndFullForms}) (240) 2828 \p{Han} \p{Script_Extensions=Han} (89_513) 2829 \p{Hang} \p{Hangul} (= \p{Script_Extensions= 2830 Hangul}) (NOT \p{Hangul_Syllables}) 2831 (11_775) 2832 \p{Hangul} \p{Script_Extensions=Hangul} (Short: 2833 \p{Hang}; NOT \p{Hangul_Syllables}) 2834 (11_775) 2835 X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo} 2836 (Short: \p{InCompatJamo}) (96) 2837 X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo}) 2838 (256) 2839 X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A} 2840 (Short: \p{InJamoExtA}) (32) 2841 X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B} 2842 (Short: \p{InJamoExtB}) (80) 2843 \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo} 2844 (125) 2845 \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125: 2846 U+1100..115F, U+A960..A97C) 2847 \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable} 2848 (399) 2849 \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399: 2850 U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70, 2851 U+AC8C ...) 2852 \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type= 2853 LVT_Syllable} (10_773) 2854 \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT}) 2855 (10_773: U+AC01..AC1B, U+AC1D..AC37, 2856 U+AC39..AC53, U+AC55..AC6F, 2857 U+AC71..AC8B, U+AC8D..ACA7 ...) 2858 \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type= 2859 Not_Applicable} (1_102_583 plus all 2860 above-Unicode code points) 2861 \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA}) 2862 (1_102_583 plus all above-Unicode code 2863 points: U+0000..10FF, U+1200..A95F, 2864 U+A97D..ABFF, U+D7A4..D7AF, 2865 U+D7C7..D7CA, U+D7FC..infinity) 2866 \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo} 2867 (137) 2868 \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137: 2869 U+11A8..11FF, U+D7CB..D7FB) 2870 \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo} 2871 (95) 2872 \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95: 2873 U+1160..11A7, U+D7B0..D7C6) 2874 X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short: 2875 \p{InHangul}) (11_184) 2876 \p{Hani} \p{Han} (= \p{Script_Extensions=Han}) 2877 (89_513) 2878 \p{Hanifi_Rohingya} \p{Script_Extensions=Hanifi_Rohingya} 2879 (Short: \p{Rohg}; NOT \p{Block= 2880 Hanifi_Rohingya}) (55) 2881 \p{Hano} \p{Hanunoo} (= \p{Script_Extensions= 2882 Hanunoo}) (NOT \p{Block=Hanunoo}) (23) 2883 \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short: 2884 \p{Hano}; NOT \p{Block=Hanunoo}) (23) 2885 \p{Hatr} \p{Hatran} (= \p{Script_Extensions= 2886 Hatran}) (NOT \p{Block=Hatran}) (26) 2887 \p{Hatran} \p{Script_Extensions=Hatran} (Short: 2888 \p{Hatr}; NOT \p{Block=Hatran}) (26) 2889 \p{Hebr} \p{Hebrew} (= \p{Script_Extensions= 2890 Hebrew}) (NOT \p{Block=Hebrew}) (134) 2891 \p{Hebrew} \p{Script_Extensions=Hebrew} (Short: 2892 \p{Hebr}; NOT \p{Block=Hebrew}) (134) 2893 \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44) 2894 \p{Hex: *} \p{Hex_Digit: *} 2895 \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44) 2896 \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068 2897 plus all above-Unicode code points: 2898 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<= 2899 >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff], 2900 U+0100..FF0F, U+FF1A..FF20, 2901 U+FF27..FF40, U+FF47..infinity) 2902 \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa- 2903 f], U+FF10..FF19, U+FF21..FF26, 2904 U+FF41..FF46) 2905 X \p{High_Private_Use_Surrogates} \p{Block= 2906 High_Private_Use_Surrogates} (Short: 2907 \p{InHighPUSurrogates}) (128) 2908 X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (= 2909 \p{Block=High_Private_Use_Surrogates}) 2910 (128) 2911 X \p{High_Surrogates} \p{Block=High_Surrogates} (896) 2912 \p{Hira} \p{Hiragana} (= \p{Script_Extensions= 2913 Hiragana}) (NOT \p{Block=Hiragana}) (431) 2914 \p{Hiragana} \p{Script_Extensions=Hiragana} (Short: 2915 \p{Hira}; NOT \p{Block=Hiragana}) (431) 2916 \p{Hluw} \p{Anatolian_Hieroglyphs} (= 2917 \p{Script_Extensions= 2918 Anatolian_Hieroglyphs}) (NOT \p{Block= 2919 Anatolian_Hieroglyphs}) (583) 2920 \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions= 2921 Pahawh_Hmong}) (NOT \p{Block= 2922 Pahawh_Hmong}) (127) 2923 \p{Hmnp} \p{Nyiakeng_Puachue_Hmong} (= 2924 \p{Script_Extensions= 2925 Nyiakeng_Puachue_Hmong}) (NOT \p{Block= 2926 Nyiakeng_Puachue_Hmong}) (71) 2927 \p{HorizSpace} \p{XPosixBlank} (18) 2928 \p{Hst: *} \p{Hangul_Syllable_Type: *} 2929 \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions= 2930 Old_Hungarian}) (NOT \p{Block= 2931 Old_Hungarian}) (108) 2932 D \p{Hyphen} \p{Hyphen=Y} (11) 2933 D \p{Hyphen: N*} Supplanted by Line_Break property values; 2934 see www.unicode.org/reports/tr14 2935 (Single: \P{Hyphen}) (1_114_101 plus all 2936 above-Unicode code points: [\x00- 2937 \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A- 2938 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae- 2939 \xff], U+0100..0589, U+058B..1805, 2940 U+1807..200F, U+2012..2E16, U+2E18..30FA 2941 ...) 2942 D \p{Hyphen: Y*} Supplanted by Line_Break property values; 2943 see www.unicode.org/reports/tr14 2944 (Single: \p{Hyphen}) (11: [\-\xad], 2945 U+058A, U+1806, U+2010..2011, U+2E17, 2946 U+30FB ...) 2947 \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT 2948 \p{Ideographic_Description_Characters}) 2949 (128_789) 2950 \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (985_323 plus 2951 all above-Unicode code points: [\x00- 2952 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<= 2953 >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab- 2954 \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7], 2955 U+02C2..02C5, U+02D2..02DF, 2956 U+02E5..02EB, U+02ED, U+02EF..02FF ...) 2957 \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (128_789: 2958 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0- 2959 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1, 2960 U+02C6..02D1, U+02E0..02E4, U+02EC, 2961 U+02EE ...) 2962 \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (125_884) 2963 \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (988_228 plus 2964 all above-Unicode code points: [\x00- 2965 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 2966 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab- 2967 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], 2968 U+02C2..02C5, U+02D2..02DF, 2969 U+02E5..02EB, U+02ED, U+02EF..036F ...) 2970 \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (125_884: [A- 2971 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8- 2972 \xff], U+0100..02C1, U+02C6..02D1, 2973 U+02E0..02E4, U+02EC, U+02EE ...) 2974 \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT 2975 \p{Ideographic_Description_Characters}) 2976 (128_789) 2977 \p{IDC: *} \p{ID_Continue: *} 2978 \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y}) 2979 (96_190) 2980 \p{Ideo: *} \p{Ideographic: *} 2981 \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo}) 2982 (96_190) 2983 \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_017_922 2984 plus all above-Unicode code points: 2985 U+0000..3005, U+3008..3020, 2986 U+302A..3037, U+303B..33FF, 2987 U+4DB6..4DFF, U+9FF0..F8FF ...) 2988 \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (96_190: 2989 U+3006..3007, U+3021..3029, 2990 U+3038..303A, U+3400..4DB5, 2991 U+4E00..9FEF, U+F900..FA6D ...) 2992 X \p{Ideographic_Description_Characters} \p{Block= 2993 Ideographic_Description_Characters} 2994 (Short: \p{InIDC}) (16) 2995 X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (= 2996 \p{Block= 2997 Ideographic_Symbols_And_Punctuation}) 2998 (32) 2999 X \p{Ideographic_Symbols_And_Punctuation} \p{Block= 3000 Ideographic_Symbols_And_Punctuation} 3001 (Short: \p{InIdeographicSymbols}) (32) 3002 \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (125_884) 3003 \p{IDS: *} \p{ID_Start: *} 3004 \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short: 3005 \p{IDSB}) (10) 3006 \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB}) 3007 (1_114_102 plus all above-Unicode code 3008 points: U+0000..2FEF, U+2FF2..2FF3, 3009 U+2FFC..infinity) 3010 \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10: 3011 U+2FF0..2FF1, U+2FF4..2FFB) 3012 \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short: 3013 \p{IDST}) (2) 3014 \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST}) 3015 (1_114_110 plus all above-Unicode code 3016 points: U+0000..2FF1, U+2FF4..infinity) 3017 \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2: 3018 U+2FF2..2FF3) 3019 \p{IDSB} \p{IDS_Binary_Operator} (= 3020 \p{IDS_Binary_Operator=Y}) (10) 3021 \p{IDSB: *} \p{IDS_Binary_Operator: *} 3022 \p{IDST} \p{IDS_Trinary_Operator} (= 3023 \p{IDS_Trinary_Operator=Y}) (2) 3024 \p{IDST: *} \p{IDS_Trinary_Operator: *} 3025 \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic} 3026 (Short: \p{Armi}; NOT \p{Block= 3027 Imperial_Aramaic}) (31) 3028 \p{In: *} \p{Present_In: *} (Perl extension) 3029 X \p{In_*} \p{Block: *} 3030 X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block= 3031 Common_Indic_Number_Forms}) (16) 3032 \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom}) 3033 (349: U+093C, U+0941..0944, U+094D, 3034 U+0952, U+0956..0957, U+0962..0963 ...) 3035 \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC= 3036 BottomAndLeft}) (1: U+A9BF) 3037 \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC= 3038 BottomAndRight}) (2: U+1B3B, U+A9C0) 3039 \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (61: 3040 U+093F, U+094E, U+09BF, U+09C7..09C8, 3041 U+0A3F, U+0ABF ...) 3042 \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC= 3043 LeftAndRight}) (21: U+09CB..09CC, 3044 U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C, 3045 U+0DDC, U+0DDE ...) 3046 \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_936 3047 plus all above-Unicode code points: 3048 U+0000..08FF, U+0904..0939, U+093D, 3049 U+0950, U+0958..0961, U+0964..0980 ...) 3050 \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC= 3051 Overstruck}) (10: U+1CD4, U+1CE2..1CE8, 3052 U+10A01, U+10A06) 3053 \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (281: 3054 U+0903, U+093B, U+093E, U+0940, 3055 U+0949..094C, U+094F ...) 3056 \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (398: 3057 U+0900..0902, U+093A, U+0945..0948, 3058 U+0951, U+0953..0955, U+0981 ...) 3059 \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC= 3060 TopAndBottom}) (10: U+0C48, U+0F73, 3061 U+0F76..0F79, U+0F81, U+1B3C, 3062 U+1112E..1112F) 3063 \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short: 3064 \p{InPC=TopAndBottomAndRight}) (1: 3065 U+1B3D) 3066 \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC= 3067 TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE, 3068 U+1C29, U+114BB, U+115B9) 3069 \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short: 3070 \p{InPC=TopAndLeftAndRight}) (4: U+0B4C, 3071 U+0DDD, U+17BF, U+115BB) 3072 \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC= 3073 TopAndRight}) (13: U+0AC9, U+0B57, 3074 U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB, 3075 U+1925..1926 ...) 3076 \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC= 3077 VisualOrderLeft}) (19: U+0E40..0E44, 3078 U+0EC0..0EC4, U+19B5..19B7, U+19BA, 3079 U+AAB5..AAB6, U+AAB9 ...) 3080 X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80) 3081 \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha}) 3082 (17: U+093D, U+09BD, U+0ABD, U+0B3D, 3083 U+0C3D, U+0CBD ...) 3084 \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (86: 3085 U+0900..0902, U+0981..0982, U+09FC, 3086 U+0A01..0A02, U+0A70, U+0A81..0A82 ...) 3087 \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short: 3088 \p{InSC=BrahmiJoiningNumber}) (20: 3089 U+11052..11065) 3090 \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC= 3091 CantillationMark}) (59: U+0951..0952, 3092 U+0A51, U+0AFA..0AFC, U+1CD0..1CD2, 3093 U+1CD4..1CE1, U+1CF4 ...) 3094 \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant}) 3095 (2160: U+0915..0939, U+0958..095F, 3096 U+0978..097F, U+0995..09A8, 3097 U+09AA..09B0, U+09B2 ...) 3098 \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC= 3099 ConsonantDead}) (12: U+09CE, 3100 U+0D54..0D56, U+0D7A..0D7F, U+1CF2..1CF3) 3101 \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC= 3102 ConsonantFinal}) (67: U+1930..1931, 3103 U+1933..1939, U+19C1..19C7, 3104 U+1A58..1A59, U+1BBE..1BBF, U+1BF0..1BF1 3105 ...) 3106 \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short: 3107 \p{InSC=ConsonantHeadLetter}) (5: 3108 U+0F88..0F8C) 3109 \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short: 3110 \p{InSC=ConsonantInitialPostfixed}) (1: 3111 U+1A5A) 3112 \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC= 3113 ConsonantKiller}) (2: U+0E4C, U+17CD) 3114 \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC= 3115 ConsonantMedial}) (29: U+0A75, 3116 U+0EBC..0EBD, U+103B..103E, 3117 U+105E..1060, U+1082, U+1A55..1A56 ...) 3118 \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short: 3119 \p{InSC=ConsonantPlaceholder}) (22: [\- 3120 \xa0\xd7], U+0980, U+0A72..0A73, U+104B, 3121 U+104E, U+1900 ...) 3122 \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short: 3123 \p{InSC=ConsonantPrecedingRepha}) (2: 3124 U+0D4E, U+11D46) 3125 \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC= 3126 ConsonantPrefixed}) (9: U+111C2..111C3, 3127 U+11A3A, U+11A84..11A89) 3128 \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC= 3129 ConsonantSubjoined}) (94: U+0F8D..0F97, 3130 U+0F99..0FBC, U+1929..192B, U+1A57, 3131 U+1A5B..1A5E, U+1BA1..1BA3 ...) 3132 \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short: 3133 \p{InSC=ConsonantSucceedingRepha}) (4: 3134 U+17CC, U+1B03, U+1B81, U+A982) 3135 \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short: 3136 \p{InSC=ConsonantWithStacker}) (6: 3137 U+0CF1..0CF2, U+1CF5..1CF6, 3138 U+11003..11004) 3139 \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC= 3140 GeminationMark}) (3: U+0A71, U+11237, 3141 U+11A98) 3142 \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC= 3143 InvisibleStacker}) (11: U+1039, U+17D2, 3144 U+1A60, U+1BAB, U+AAF6, U+10A3F ...) 3145 \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1: 3146 U+200D) 3147 \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC= 3148 ModifyingLetter}) (1: U+0B83) 3149 \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC= 3150 NonJoiner}) (1: U+200C) 3151 \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (30: 3152 U+093C, U+09BC, U+0A3C, U+0ABC, 3153 U+0AFD..0AFF, U+0B3C ...) 3154 \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (481: 3155 [0-9], U+0966..096F, U+09E6..09EF, 3156 U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F 3157 ...) 3158 \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC= 3159 NumberJoiner}) (1: U+1107F) 3160 \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other}) 3161 (1_109_650 plus all above-Unicode code 3162 points: [\x00- 3163 \x20!\"#\$\%&\'\(\)*+,.\/:;<=>?\@A- 3164 Z\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1- 3165 \xb1\xb4-\xd6\xd8-\xff], U+0100..08FF, 3166 U+0950, U+0953..0954, U+0964..0965, 3167 U+0970..0971 ...) 3168 \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC= 3169 PureKiller}) (21: U+0D3B..0D3C, U+0E3A, 3170 U+0E4E, U+0EBA, U+0F84, U+103A ...) 3171 \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC= 3172 RegisterShifter}) (2: U+17C9..17CA) 3173 \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC= 3174 SyllableModifier}) (25: [\xb2-\xb3], 3175 U+09FE, U+0F35, U+0F37, U+0FC6, U+17CB 3176 ...) 3177 \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC= 3178 ToneLetter}) (7: U+1970..1974, U+AAC0, 3179 U+AAC2) 3180 \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark}) 3181 (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037, 3182 U+1063..1064, U+1069..106D, U+1087..108D 3183 ...) 3184 \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27: 3185 U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D, 3186 U+0BCD ...) 3187 \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga}) 3188 (35: U+0903, U+0983, U+0A03, U+0A83, 3189 U+0B03, U+0C03 ...) 3190 \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30: 3191 U+1963..196D, U+A85E..A861, U+A866, 3192 U+A922..A92A, U+11150..11154) 3193 \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC= 3194 VowelDependent}) (673: U+093A..093B, 3195 U+093E..094C, U+094E..094F, 3196 U+0955..0957, U+0962..0963, U+09BE..09C4 3197 ...) 3198 \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC= 3199 VowelIndependent}) (476: U+0904..0914, 3200 U+0960..0961, U+0972..0977, 3201 U+0985..098C, U+098F..0990, U+0993..0994 3202 ...) 3203 \p{Inherited} \p{Script_Extensions=Inherited} (Short: 3204 \p{Zinh}) (502) 3205 \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation} 3206 (Short: \p{Pi}) (12) 3207 \p{InPC: *} \p{Indic_Positional_Category: *} 3208 \p{InSC: *} \p{Indic_Syllabic_Category: *} 3209 \p{Inscriptional_Pahlavi} \p{Script_Extensions= 3210 Inscriptional_Pahlavi} (Short: \p{Phli}; 3211 NOT \p{Block=Inscriptional_Pahlavi}) (27) 3212 \p{Inscriptional_Parthian} \p{Script_Extensions= 3213 Inscriptional_Parthian} (Short: 3214 \p{Prti}; NOT \p{Block= 3215 Inscriptional_Parthian}) (30) 3216 X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block= 3217 IPA_Extensions}) (96) 3218 X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short: 3219 \p{InIPAExt}) (96) 3220 \p{Is_*} \p{*} (Any exceptions are individually 3221 noted beginning with the word NOT.) If 3222 an entry has flag(s) at its beginning, 3223 like "D", the "Is_" form has the same 3224 flag(s) 3225 \p{Ital} \p{Old_Italic} (= \p{Script_Extensions= 3226 Old_Italic}) (NOT \p{Block=Old_Italic}) 3227 (39) 3228 X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo}) 3229 (256) 3230 X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block= 3231 Hangul_Jamo_Extended_A}) (32) 3232 X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block= 3233 Hangul_Jamo_Extended_B}) (80) 3234 \p{Java} \p{Javanese} (= \p{Script_Extensions= 3235 Javanese}) (NOT \p{Block=Javanese}) (91) 3236 \p{Javanese} \p{Script_Extensions=Javanese} (Short: 3237 \p{Java}; NOT \p{Block=Javanese}) (91) 3238 \p{Jg: *} \p{Joining_Group: *} 3239 \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2) 3240 \p{Join_C: *} \p{Join_Control: *} 3241 \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2) 3242 \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110 3243 plus all above-Unicode code points: 3244 U+0000..200B, U+200E..infinity) 3245 \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2: 3246 U+200C..200D) 3247 \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1: 3248 U+08BB) 3249 \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1: 3250 U+08BD) 3251 \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1: 3252 U+08BC) 3253 \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (8: U+0639..063A, 3254 U+06A0, U+06FC, U+075D..075F, U+08B3) 3255 \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710) 3256 \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10: U+0622..0623, 3257 U+0625, U+0627, U+0671..0673, U+0675, 3258 U+0773..0774) 3259 \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (24: U+0628, 3260 U+062A..062B, U+066E, U+0679..0680, 3261 U+0750..0756, U+08A0..08A1 ...) 3262 \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D) 3263 \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg= 3264 BurushaskiYehBarree}) (2: U+077A..077B) 3265 \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15: U+062F..0630, 3266 U+0688..0690, U+06EE, U+0759..075A, 3267 U+08AE) 3268 \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4: 3269 U+0715..0716, U+072A, U+072F) 3270 \p{Joining_Group: E} (Short: \p{Jg=E}) (1: U+0725) 3271 \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7: 3272 U+063D..063F, U+06CC, U+06CE, 3273 U+0775..0776) 3274 \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1: U+074F) 3275 \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10: U+0641, 3276 U+06A1..06A6, U+0760..0761, U+08A4) 3277 \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1: 3278 U+0724) 3279 \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (14: U+063B..063C, 3280 U+06A9, U+06AB, U+06AF..06B4, 3281 U+0762..0764, U+08B0) 3282 \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714, 3283 U+072E) 3284 \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18: U+062C..062E, 3285 U+0681..0687, U+06BF, U+0757..0758, 3286 U+076E..076F, U+0772 ...) 3287 \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg= 3288 HamzaOnHehGoal}) (1: U+06C3) 3289 \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg= 3290 HanifiRohingyaKinnaYa}) (4: U+10D19, 3291 U+10D1E, U+10D20, U+10D23) 3292 \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg= 3293 HanifiRohingyaPa}) (3: U+10D02, U+10D09, 3294 U+10D1C) 3295 \p{Joining_Group: He} (Short: \p{Jg=He}) (1: U+0717) 3296 \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1: U+0647) 3297 \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2: 3298 U+06C1..06C2) 3299 \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A) 3300 \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6: U+0643, 3301 U+06AC..06AE, U+077F, U+08B4) 3302 \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F) 3303 \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E) 3304 \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2: 3305 U+06BE, U+06FF) 3306 \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7: U+0644, 3307 U+06B5..06B8, U+076A, U+08A6) 3308 \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720) 3309 \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1: 3310 U+0866) 3311 \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1: 3312 U+0861) 3313 \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1: 3314 U+0868) 3315 \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla}) 3316 (1: U+0869) 3317 \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1: 3318 U+0860) 3319 \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1: 3320 U+0864) 3321 \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna}) 3322 (1: U+0865) 3323 \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1: 3324 U+0862) 3325 \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1: 3326 U+0867) 3327 \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1: 3328 U+086A) 3329 \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1: 3330 U+0863) 3331 \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg= 3332 ManichaeanAleph}) (1: U+10AC0) 3333 \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin}) 3334 (2: U+10AD9..10ADA) 3335 \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth}) 3336 (2: U+10AC1..10AC2) 3337 \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg= 3338 ManichaeanDaleth}) (1: U+10AC5) 3339 \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg= 3340 ManichaeanDhamedh}) (1: U+10AD4) 3341 \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive}) 3342 (1: U+10AEC) 3343 \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg= 3344 ManichaeanGimel}) (2: U+10AC3..10AC4) 3345 \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth}) 3346 (1: U+10ACD) 3347 \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg= 3348 ManichaeanHundred}) (1: U+10AEF) 3349 \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph}) 3350 (3: U+10AD0..10AD2) 3351 \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg= 3352 ManichaeanLamedh}) (1: U+10AD3) 3353 \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem}) 3354 (1: U+10AD6) 3355 \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun}) 3356 (1: U+10AD7) 3357 \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne}) 3358 (1: U+10AEB) 3359 \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2: 3360 U+10ADB..10ADC) 3361 \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph}) 3362 (3: U+10ADE..10AE0) 3363 \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh}) 3364 (1: U+10AE1) 3365 \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg= 3366 ManichaeanSadhe}) (1: U+10ADD) 3367 \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg= 3368 ManichaeanSamekh}) (1: U+10AD8) 3369 \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw}) 3370 (1: U+10AE4) 3371 \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen}) 3372 (1: U+10AED) 3373 \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth}) 3374 (1: U+10ACE) 3375 \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg= 3376 ManichaeanThamedh}) (1: U+10AD5) 3377 \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg= 3378 ManichaeanTwenty}) (1: U+10AEE) 3379 \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw}) 3380 (1: U+10AC7) 3381 \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh}) 3382 (1: U+10ACF) 3383 \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg= 3384 ManichaeanZayin}) (2: U+10AC9..10ACA) 3385 \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645, 3386 U+0765..0766, U+08A7) 3387 \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1: U+0721) 3388 \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup}) 3389 (1_113_800 plus all above-Unicode code 3390 points: U+0000..061F, U+0621, U+0640, 3391 U+064B..066D, U+0670, U+0674 ...) 3392 \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8: U+0646, 3393 U+06B9..06BC, U+0767..0769) 3394 \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1: U+0722) 3395 \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1: U+06BD) 3396 \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1: U+0726) 3397 \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5: U+0642, U+066F, 3398 U+06A7..06A8, U+08A5) 3399 \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729) 3400 \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19: U+0631..0632, 3401 U+0691..0699, U+06EF, U+075B, 3402 U+076B..076C, U+0771 ...) 3403 \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1: 3404 U+0727) 3405 \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1: 3406 U+08AC) 3407 \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6: U+0635..0636, 3408 U+069D..069E, U+06FB, U+08AF) 3409 \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728) 3410 \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634, 3411 U+069A..069C, U+06FA, U+075C, U+076D, 3412 U+0770 ...) 3413 \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723) 3414 \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B) 3415 \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1: 3416 U+08B1) 3417 \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA) 3418 \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718) 3419 \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4: U+0637..0638, 3420 U+069F, U+08A3) 3421 \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1: U+072C) 3422 \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3: 3423 U+0629, U+06C0, U+06D5) 3424 \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group= 3425 Hamza_On_Heh_Goal} (1) 3426 \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C) 3427 \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16: U+0624, U+0648, 3428 U+0676..0677, U+06C4..06CB, U+06CF, 3429 U+0778..0779 ...) 3430 \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11: U+0620, U+0626, 3431 U+0649..064A, U+0678, U+06D0..06D1, 3432 U+0777 ...) 3433 \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2: 3434 U+06D2..06D3) 3435 \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1: 3436 U+06CD) 3437 \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D) 3438 \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E) 3439 \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719) 3440 \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D) 3441 \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4) 3442 \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (565) 3443 \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (565: U+0620, 3444 U+0626, U+0628, U+062A..062E, 3445 U+0633..063F, U+0641..0647 ...) 3446 \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4: U+0640, 3447 U+07FA, U+180A, U+200D) 3448 \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (4) 3449 \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (4: U+A872, 3450 U+10ACD, U+10AD7, U+10D00) 3451 \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_437 plus 3452 all above-Unicode code points: [\x00- 3453 \xac\xae-\xff], U+0100..02FF, 3454 U+0370..0482, U+048A..0590, U+05BE, 3455 U+05C0 ...) 3456 \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (118) 3457 \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (118: 3458 U+0622..0625, U+0627, U+0629, 3459 U+062F..0632, U+0648, U+0671..0673 ...) 3460 \p{Joining_Type: T} \p{Joining_Type=Transparent} (1984) 3461 \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1984: [\xad], 3462 U+0300..036F, U+0483..0489, 3463 U+0591..05BD, U+05BF, U+05C1..05C2 ...) 3464 \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_437 3465 plus all above-Unicode code points) 3466 \p{Jt: *} \p{Joining_Type: *} 3467 \p{Kaithi} \p{Script_Extensions=Kaithi} (Short: 3468 \p{Kthi}; NOT \p{Block=Kaithi}) (87) 3469 \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions= 3470 Kayah_Li}) (48) 3471 \p{Kana} \p{Katakana} (= \p{Script_Extensions= 3472 Katakana}) (NOT \p{Block=Katakana}) (356) 3473 X \p{Kana_Ext_A} \p{Kana_Extended_A} (= \p{Block= 3474 Kana_Extended_A}) (48) 3475 X \p{Kana_Extended_A} \p{Block=Kana_Extended_A} (Short: 3476 \p{InKanaExtA}) (48) 3477 X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block= 3478 Kana_Supplement}) (256) 3479 X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short: 3480 \p{InKanaSup}) (256) 3481 X \p{Kanbun} \p{Block=Kanbun} (16) 3482 X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block= 3483 Kangxi_Radicals}) (224) 3484 X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short: 3485 \p{InKangxi}) (224) 3486 \p{Kannada} \p{Script_Extensions=Kannada} (Short: 3487 \p{Knda}; NOT \p{Block=Kannada}) (104) 3488 \p{Katakana} \p{Script_Extensions=Katakana} (Short: 3489 \p{Kana}; NOT \p{Block=Katakana}) (356) 3490 X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (= 3491 \p{Block=Katakana_Phonetic_Extensions}) 3492 (16) 3493 X \p{Katakana_Phonetic_Extensions} \p{Block= 3494 Katakana_Phonetic_Extensions} (Short: 3495 \p{InKatakanaExt}) (16) 3496 \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short: 3497 \p{Kali}) (48) 3498 \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions= 3499 Kharoshthi}) (NOT \p{Block=Kharoshthi}) 3500 (68) 3501 \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short: 3502 \p{Khar}; NOT \p{Block=Kharoshthi}) (68) 3503 \p{Khmer} \p{Script_Extensions=Khmer} (Short: 3504 \p{Khmr}; NOT \p{Block=Khmer}) (146) 3505 X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32) 3506 \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer}) 3507 (NOT \p{Block=Khmer}) (146) 3508 \p{Khoj} \p{Khojki} (= \p{Script_Extensions= 3509 Khojki}) (NOT \p{Block=Khojki}) (82) 3510 \p{Khojki} \p{Script_Extensions=Khojki} (Short: 3511 \p{Khoj}; NOT \p{Block=Khojki}) (82) 3512 \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short: 3513 \p{Sind}; NOT \p{Block=Khudawadi}) (81) 3514 \p{Knda} \p{Kannada} (= \p{Script_Extensions= 3515 Kannada}) (NOT \p{Block=Kannada}) (104) 3516 \p{Kthi} \p{Kaithi} (= \p{Script_Extensions= 3517 Kaithi}) (NOT \p{Block=Kaithi}) (87) 3518 \p{L} \pL \p{Letter} (= \p{General_Category=Letter}) 3519 (125_643) 3520 X \p{L&} \p{Cased_Letter} (= \p{General_Category= 3521 Cased_Letter}) (3970) 3522 X \p{L_} \p{Cased_Letter} (= \p{General_Category= 3523 Cased_Letter}) Note the trailing '_' 3524 matters in spite of loose matching 3525 rules. (3970) 3526 \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions= 3527 Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127) 3528 \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block= 3529 Lao}) (82) 3530 \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT 3531 \p{Block=Lao}) (82) 3532 \p{Latin} \p{Script_Extensions=Latin} (Short: 3533 \p{Latn}) (1387) 3534 X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block= 3535 Latin_1_Supplement}) (128) 3536 X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block= 3537 Latin_1_Supplement}) (128) 3538 X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short: 3539 \p{InLatin1}) (128) 3540 X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block= 3541 Latin_Extended_A}) (128) 3542 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (= 3543 \p{Block=Latin_Extended_Additional}) 3544 (256) 3545 X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block= 3546 Latin_Extended_B}) (208) 3547 X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block= 3548 Latin_Extended_C}) (32) 3549 X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block= 3550 Latin_Extended_D}) (224) 3551 X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block= 3552 Latin_Extended_E}) (64) 3553 X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short: 3554 \p{InLatinExtA}) (128) 3555 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional} 3556 (Short: \p{InLatinExtAdditional}) (256) 3557 X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short: 3558 \p{InLatinExtB}) (208) 3559 X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short: 3560 \p{InLatinExtC}) (32) 3561 X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short: 3562 \p{InLatinExtD}) (224) 3563 X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short: 3564 \p{InLatinExtE}) (64) 3565 \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin}) 3566 (1387) 3567 \p{Lb: *} \p{Line_Break: *} 3568 \p{LC} \p{Cased_Letter} (= \p{General_Category= 3569 Cased_Letter}) (3970) 3570 \p{Lepc} \p{Lepcha} (= \p{Script_Extensions= 3571 Lepcha}) (NOT \p{Block=Lepcha}) (74) 3572 \p{Lepcha} \p{Script_Extensions=Lepcha} (Short: 3573 \p{Lepc}; NOT \p{Block=Lepcha}) (74) 3574 \p{Letter} \p{General_Category=Letter} (Short: \p{L}) 3575 (125_643) 3576 \p{Letter_Number} \p{General_Category=Letter_Number} (Short: 3577 \p{Nl}) (236) 3578 X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80) 3579 \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu}) 3580 (NOT \p{Block=Limbu}) (69) 3581 \p{Limbu} \p{Script_Extensions=Limbu} (Short: 3582 \p{Limb}; NOT \p{Block=Limbu}) (69) 3583 \p{Lina} \p{Linear_A} (= \p{Script_Extensions= 3584 Linear_A}) (NOT \p{Block=Linear_A}) (386) 3585 \p{Linb} \p{Linear_B} (= \p{Script_Extensions= 3586 Linear_B}) (268) 3587 \p{Line_Break: AI} \p{Line_Break=Ambiguous} (707) 3588 \p{Line_Break: AL} \p{Line_Break=Alphabetic} (20_582) 3589 \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (20_582: [#&*<=>\@A- 3590 Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0- 3591 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6, 3592 U+02CE..02CF, U+02D1..02D7, U+02DC, 3593 U+02DE ...) 3594 \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7- 3595 \xa8\xaa\xb2-\xb3\xb6-\xba\xbc- 3596 \xbe\xd7\xf7], U+02C7, U+02C9..02CB, 3597 U+02CD, U+02D0, U+02D8..02DB ...) 3598 \p{Line_Break: B2} \p{Line_Break=Break_Both} (3) 3599 \p{Line_Break: BA} \p{Line_Break=Break_After} (239) 3600 \p{Line_Break: BB} \p{Line_Break=Break_Before} (45) 3601 \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4) 3602 \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (239: [\t\|\xad], 3603 U+058A, U+05BE, U+0964..0965, 3604 U+0E5A..0E5B, U+0F0B ...) 3605 \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (45: [\xb4], 3606 U+02C8, U+02CC, U+02DF, U+0C77, U+0C84 3607 ...) 3608 \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014, 3609 U+2E3A..2E3B) 3610 \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/]) 3611 \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r]) 3612 \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1) 3613 \p{Line_Break: CJ} \p{Line_Break= 3614 Conditional_Japanese_Starter} (58) 3615 \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (91) 3616 \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]]) 3617 \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (91: [\}], 3618 U+0F3B, U+0F3D, U+169C, U+2046, U+207E 3619 ...) 3620 \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2260) 3621 \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2260: 3622 [^\t\n\cK\f\r\x20-\x7e\x85\xa0-\xff], 3623 U+0300..034E, U+0350..035B, 3624 U+0363..036F, U+0483..0489, U+0591..05BD 3625 ...) 3626 \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (750: 3627 U+0E01..0E3A, U+0E40..0E4E, 3628 U+0E81..0E82, U+0E84, U+0E86..0E8A, 3629 U+0E8C..0EA3 ...) 3630 \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ}) 3631 (58: U+3041, U+3043, U+3045, U+3047, 3632 U+3049, U+3063 ...) 3633 \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC) 3634 \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2) 3635 \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1) 3636 \p{Line_Break: E_Base} (Short: \p{Lb=EB}) (120: U+261D, U+26F9, 3637 U+270A..270D, U+1F385, U+1F3C2..1F3C4, 3638 U+1F3C7 ...) 3639 \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF) 3640 \p{Line_Break: EB} \p{Line_Break=E_Base} (120) 3641 \p{Line_Break: EM} \p{Line_Break=E_Modifier} (5) 3642 \p{Line_Break: EX} \p{Line_Break=Exclamation} (37) 3643 \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37: [!?], U+05C6, 3644 U+061B, U+061E..061F, U+06D4, U+07F9 ...) 3645 \p{Line_Break: GL} \p{Line_Break=Glue} (25) 3646 \p{Line_Break: Glue} (Short: \p{Lb=GL}) (25: [\xa0], U+034F, 3647 U+035C..0362, U+0F08, U+0F0C, U+0F12 ...) 3648 \p{Line_Break: H2} (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C, 3649 U+AC38, U+AC54, U+AC70, U+AC8C ...) 3650 \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B, 3651 U+AC1D..AC37, U+AC39..AC53, 3652 U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7 3653 ...) 3654 \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75: 3655 U+05D0..05EA, U+05EF..05F2, U+FB1D, 3656 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C 3657 ...) 3658 \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (75) 3659 \p{Line_Break: HY} \p{Line_Break=Hyphen} (1) 3660 \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1: [\-]) 3661 \p{Line_Break: ID} \p{Line_Break=Ideographic} (172_693) 3662 \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_693: 3663 U+231A..231B, U+23F0..23F3, 3664 U+2600..2603, U+2614..2615, U+2618, 3665 U+261A..261C ...) 3666 \p{Line_Break: IN} \p{Line_Break=Inseparable} (6) 3667 \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;], 3668 U+037E, U+0589, U+060C..060D, U+07F8, 3669 U+2044 ...) 3670 \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026, 3671 U+22EF, U+FE19, U+10AF6) 3672 \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6) 3673 \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13) 3674 \p{Line_Break: JL} (Short: \p{Lb=JL}) (125: U+1100..115F, 3675 U+A960..A97C) 3676 \p{Line_Break: JT} (Short: \p{Lb=JT}) (137: U+11A8..11FF, 3677 U+D7CB..D7FB) 3678 \p{Line_Break: JV} (Short: \p{Lb=JV}) (95: U+1160..11A7, 3679 U+D7B0..D7C6) 3680 \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1) 3681 \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n]) 3682 \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f], 3683 U+2028..2029) 3684 \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85]) 3685 \p{Line_Break: NL} \p{Line_Break=Next_Line} (1) 3686 \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6, 3687 U+203C..203D, U+2047..2049, U+3005, 3688 U+301C, U+303B..303C ...) 3689 \p{Line_Break: NS} \p{Line_Break=Nonstarter} (33) 3690 \p{Line_Break: NU} \p{Line_Break=Numeric} (622) 3691 \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (622: [0-9], 3692 U+0660..0669, U+066B..066C, 3693 U+06F0..06F9, U+07C0..07C9, U+0966..096F 3694 ...) 3695 \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (88) 3696 \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (88: 3697 [\(\[\{\xa1\xbf], U+0F3A, U+0F3C, 3698 U+169B, U+201A, U+201E ...) 3699 \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (36) 3700 \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (36: 3701 [\%\xa2\xb0], U+0609..060B, U+066A, 3702 U+09F2..09F3, U+09F9, U+0D79 ...) 3703 \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (68) 3704 \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (68: [\$+\\\xa3- 3705 \xa5\xb1], U+058F, U+07FE..07FF, U+09FB, 3706 U+0AF1, U+0BF9 ...) 3707 \p{Line_Break: QU} \p{Line_Break=Quotation} (39) 3708 \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb], 3709 U+2018..2019, U+201B..201D, U+201F, 3710 U+2039..203A, U+275B..2760 ...) 3711 \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26: 3712 U+1F1E6..1F1FF) 3713 \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26) 3714 \p{Line_Break: SA} \p{Line_Break=Complex_Context} (750) 3715 D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048) 3716 \p{Line_Break: SP} \p{Line_Break=Space} (1) 3717 \p{Line_Break: Space} (Short: \p{Lb=SP}) (1: [\x20]) 3718 D \p{Line_Break: Surrogate} Surrogates should never appear in well- 3719 formed text, and therefore shouldn't be 3720 the basis for line breaking (Short: 3721 \p{Lb=SG}) (2048: U+D800..DFFF) 3722 \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1) 3723 \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (901_897 plus all 3724 above-Unicode code points: U+0378..0379, 3725 U+0380..0383, U+038B, U+038D, U+03A2, 3726 U+0530 ...) 3727 \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2) 3728 \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF) 3729 \p{Line_Break: XX} \p{Line_Break=Unknown} (901_897 plus all 3730 above-Unicode code points) 3731 \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1) 3732 \p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1: U+200D) 3733 \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B) 3734 \p{Line_Separator} \p{General_Category=Line_Separator} 3735 (Short: \p{Zl}) (1) 3736 \p{Linear_A} \p{Script_Extensions=Linear_A} (Short: 3737 \p{Lina}; NOT \p{Block=Linear_A}) (386) 3738 \p{Linear_B} \p{Script_Extensions=Linear_B} (Short: 3739 \p{Linb}) (268) 3740 X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128) 3741 X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128) 3742 \p{Lisu} \p{Script_Extensions=Lisu} (48) 3743 \p{Ll} \p{Lowercase_Letter} (= 3744 \p{General_Category=Lowercase_Letter}) 3745 (/i= General_Category=Cased_Letter) 3746 (2151) 3747 \p{Lm} \p{Modifier_Letter} (= 3748 \p{General_Category=Modifier_Letter}) 3749 (259) 3750 \p{Lo} \p{Other_Letter} (= \p{General_Category= 3751 Other_Letter}) (121_414) 3752 \p{LOE} \p{Logical_Order_Exception} (= 3753 \p{Logical_Order_Exception=Y}) (19) 3754 \p{LOE: *} \p{Logical_Order_Exception: *} 3755 \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short: 3756 \p{LOE}) (19) 3757 \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE}) 3758 (1_114_093 plus all above-Unicode code 3759 points: U+0000..0E3F, U+0E45..0EBF, 3760 U+0EC5..19B4, U+19B8..19B9, 3761 U+19BB..AAB4, U+AAB7..AAB8 ...) 3762 \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19: 3763 U+0E40..0E44, U+0EC0..0EC4, 3764 U+19B5..19B7, U+19BA, U+AAB5..AAB6, 3765 U+AAB9 ...) 3766 X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024) 3767 \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i= 3768 Cased=Yes) (2340) 3769 \p{Lower: *} \p{Lowercase: *} 3770 \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i= 3771 Cased=Yes) (2340) 3772 \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased= 3773 No) (1_111_772 plus all above-Unicode 3774 code points: [\x00- 3775 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A- 3776 Z\[\\\]\^_`\{\|\}~\x7f-\xa9\xab- 3777 \xb4\xb6-\xb9\xbb-\xde\xf7], U+0100, 3778 U+0102, U+0104, U+0106, U+0108 ...) 3779 \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased= 3780 Yes) (2340: [a-z\xaa\xb5\xba\xdf- 3781 \xf6\xf8-\xff], U+0101, U+0103, U+0105, 3782 U+0107, U+0109 ...) 3783 \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter} 3784 (Short: \p{Ll}; /i= General_Category= 3785 Cased_Letter) (2151) 3786 \p{Lt} \p{Titlecase_Letter} (= 3787 \p{General_Category=Titlecase_Letter}) 3788 (/i= General_Category=Cased_Letter) (31) 3789 \p{Lu} \p{Uppercase_Letter} (= 3790 \p{General_Category=Uppercase_Letter}) 3791 (/i= General_Category=Cased_Letter) 3792 (1788) 3793 \p{Lyci} \p{Lycian} (= \p{Script_Extensions= 3794 Lycian}) (NOT \p{Block=Lycian}) (29) 3795 \p{Lycian} \p{Script_Extensions=Lycian} (Short: 3796 \p{Lyci}; NOT \p{Block=Lycian}) (29) 3797 \p{Lydi} \p{Lydian} (= \p{Script_Extensions= 3798 Lydian}) (NOT \p{Block=Lydian}) (27) 3799 \p{Lydian} \p{Script_Extensions=Lydian} (Short: 3800 \p{Lydi}; NOT \p{Block=Lydian}) (27) 3801 \p{M} \pM \p{Mark} (= \p{General_Category=Mark}) 3802 (2268) 3803 \p{Mahajani} \p{Script_Extensions=Mahajani} (Short: 3804 \p{Mahj}; NOT \p{Block=Mahajani}) (61) 3805 \p{Mahj} \p{Mahajani} (= \p{Script_Extensions= 3806 Mahajani}) (NOT \p{Block=Mahajani}) (61) 3807 X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block= 3808 Mahjong_Tiles}) (48) 3809 X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short: 3810 \p{InMahjong}) (48) 3811 \p{Maka} \p{Makasar} (= \p{Script_Extensions= 3812 Makasar}) (NOT \p{Block=Makasar}) (25) 3813 \p{Makasar} \p{Script_Extensions=Makasar} (Short: 3814 \p{Maka}; NOT \p{Block=Makasar}) (25) 3815 \p{Malayalam} \p{Script_Extensions=Malayalam} (Short: 3816 \p{Mlym}; NOT \p{Block=Malayalam}) (125) 3817 \p{Mand} \p{Mandaic} (= \p{Script_Extensions= 3818 Mandaic}) (NOT \p{Block=Mandaic}) (30) 3819 \p{Mandaic} \p{Script_Extensions=Mandaic} (Short: 3820 \p{Mand}; NOT \p{Block=Mandaic}) (30) 3821 \p{Mani} \p{Manichaean} (= \p{Script_Extensions= 3822 Manichaean}) (NOT \p{Block=Manichaean}) 3823 (52) 3824 \p{Manichaean} \p{Script_Extensions=Manichaean} (Short: 3825 \p{Mani}; NOT \p{Block=Manichaean}) (52) 3826 \p{Marc} \p{Marchen} (= \p{Script_Extensions= 3827 Marchen}) (NOT \p{Block=Marchen}) (68) 3828 \p{Marchen} \p{Script_Extensions=Marchen} (Short: 3829 \p{Marc}; NOT \p{Block=Marchen}) (68) 3830 \p{Mark} \p{General_Category=Mark} (Short: \p{M}) 3831 (2268) 3832 \p{Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi} 3833 (Short: \p{Gonm}; NOT \p{Block= 3834 Masaram_Gondi}) (77) 3835 \p{Math} \p{Math=Y} (2310) 3836 \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all 3837 above-Unicode code points: [\x00- 3838 \x20!\"#\$\%&\'\(\)*,\-.\/0-9:;?\@A- 3839 Z\[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2- 3840 \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF, 3841 U+03D3..03D4, U+03D6..03EF, 3842 U+03F2..03F3, U+03F7..0605 ...) 3843 \p{Math: Y*} (Single: \p{Math}) (2310: [+<= 3844 >\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2, 3845 U+03D5, U+03F0..03F1, U+03F4..03F6, 3846 U+0606..0608 ...) 3847 X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (= 3848 \p{Block= 3849 Mathematical_Alphanumeric_Symbols}) 3850 (1024) 3851 X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block= 3852 Mathematical_Operators}) (256) 3853 \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short: 3854 \p{Sm}) (948) 3855 X \p{Mathematical_Alphanumeric_Symbols} \p{Block= 3856 Mathematical_Alphanumeric_Symbols} 3857 (Short: \p{InMathAlphanum}) (1024) 3858 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators} 3859 (Short: \p{InMathOperators}) (256) 3860 X \p{Mayan_Numerals} \p{Block=Mayan_Numerals} (32) 3861 \p{Mc} \p{Spacing_Mark} (= \p{General_Category= 3862 Spacing_Mark}) (429) 3863 \p{Me} \p{Enclosing_Mark} (= \p{General_Category= 3864 Enclosing_Mark}) (13) 3865 \p{Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short: 3866 \p{Medf}; NOT \p{Block=Medefaidrin}) (91) 3867 \p{Medf} \p{Medefaidrin} (= \p{Script_Extensions= 3868 Medefaidrin}) (NOT \p{Block= 3869 Medefaidrin}) (91) 3870 \p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short: 3871 \p{Mtei}; NOT \p{Block=Meetei_Mayek}) 3872 (79) 3873 X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block= 3874 Meetei_Mayek_Extensions}) (32) 3875 X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions} 3876 (Short: \p{InMeeteiMayekExt}) (32) 3877 \p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions= 3878 Mende_Kikakui}) (NOT \p{Block= 3879 Mende_Kikakui}) (213) 3880 \p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui} 3881 (Short: \p{Mend}; NOT \p{Block= 3882 Mende_Kikakui}) (213) 3883 \p{Merc} \p{Meroitic_Cursive} (= 3884 \p{Script_Extensions=Meroitic_Cursive}) 3885 (NOT \p{Block=Meroitic_Cursive}) (90) 3886 \p{Mero} \p{Meroitic_Hieroglyphs} (= 3887 \p{Script_Extensions= 3888 Meroitic_Hieroglyphs}) (32) 3889 \p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive} 3890 (Short: \p{Merc}; NOT \p{Block= 3891 Meroitic_Cursive}) (90) 3892 \p{Meroitic_Hieroglyphs} \p{Script_Extensions= 3893 Meroitic_Hieroglyphs} (Short: \p{Mero}) 3894 (32) 3895 \p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block= 3896 Miao}) (149) 3897 X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (= 3898 \p{Block= 3899 Miscellaneous_Symbols_And_Arrows}) (256) 3900 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A} 3901 (= \p{Block= 3902 Miscellaneous_Mathematical_Symbols_A}) 3903 (48) 3904 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B} 3905 (= \p{Block= 3906 Miscellaneous_Mathematical_Symbols_B}) 3907 (128) 3908 X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs} 3909 (= \p{Block= 3910 Miscellaneous_Symbols_And_Pictographs}) 3911 (768) 3912 X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block= 3913 Miscellaneous_Symbols}) (256) 3914 X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block= 3915 Miscellaneous_Technical}) (256) 3916 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block= 3917 Miscellaneous_Mathematical_Symbols_A} 3918 (Short: \p{InMiscMathSymbolsA}) (48) 3919 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block= 3920 Miscellaneous_Mathematical_Symbols_B} 3921 (Short: \p{InMiscMathSymbolsB}) (128) 3922 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short: 3923 \p{InMiscSymbols}) (256) 3924 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block= 3925 Miscellaneous_Symbols_And_Arrows} 3926 (Short: \p{InMiscArrows}) (256) 3927 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block= 3928 Miscellaneous_Symbols_And_Pictographs} 3929 (Short: \p{InMiscPictographs}) (768) 3930 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical} 3931 (Short: \p{InMiscTechnical}) (256) 3932 \p{Mlym} \p{Malayalam} (= \p{Script_Extensions= 3933 Malayalam}) (NOT \p{Block=Malayalam}) 3934 (125) 3935 \p{Mn} \p{Nonspacing_Mark} (= 3936 \p{General_Category=Nonspacing_Mark}) 3937 (1826) 3938 \p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block= 3939 Modi}) (89) 3940 \p{Modifier_Letter} \p{General_Category=Modifier_Letter} 3941 (Short: \p{Lm}) (259) 3942 X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block= 3943 Spacing_Modifier_Letters}) (80) 3944 \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol} 3945 (Short: \p{Sk}) (121) 3946 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32) 3947 \p{Mong} \p{Mongolian} (= \p{Script_Extensions= 3948 Mongolian}) (NOT \p{Block=Mongolian}) 3949 (171) 3950 \p{Mongolian} \p{Script_Extensions=Mongolian} (Short: 3951 \p{Mong}; NOT \p{Block=Mongolian}) (171) 3952 X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block= 3953 Mongolian_Supplement}) (32) 3954 X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short: 3955 \p{InMongolianSup}) (32) 3956 \p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block= 3957 Mro}) (43) 3958 \p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT 3959 \p{Block=Mro}) (43) 3960 \p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions= 3961 Meetei_Mayek}) (NOT \p{Block= 3962 Meetei_Mayek}) (79) 3963 \p{Mult} \p{Multani} (= \p{Script_Extensions= 3964 Multani}) (NOT \p{Block=Multani}) (48) 3965 \p{Multani} \p{Script_Extensions=Multani} (Short: 3966 \p{Mult}; NOT \p{Block=Multani}) (48) 3967 X \p{Music} \p{Musical_Symbols} (= \p{Block= 3968 Musical_Symbols}) (256) 3969 X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short: 3970 \p{InMusic}) (256) 3971 \p{Myanmar} \p{Script_Extensions=Myanmar} (Short: 3972 \p{Mymr}; NOT \p{Block=Myanmar}) (224) 3973 X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block= 3974 Myanmar_Extended_A}) (32) 3975 X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block= 3976 Myanmar_Extended_B}) (32) 3977 X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short: 3978 \p{InMyanmarExtA}) (32) 3979 X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short: 3980 \p{InMyanmarExtB}) (32) 3981 \p{Mymr} \p{Myanmar} (= \p{Script_Extensions= 3982 Myanmar}) (NOT \p{Block=Myanmar}) (224) 3983 \p{N} \pN \p{Number} (= \p{General_Category=Number}) 3984 (1754) 3985 \p{Nabataean} \p{Script_Extensions=Nabataean} (Short: 3986 \p{Nbat}; NOT \p{Block=Nabataean}) (40) 3987 \p{Nand} \p{Nandinagari} (= \p{Script_Extensions= 3988 Nandinagari}) (NOT \p{Block= 3989 Nandinagari}) (86) 3990 \p{Nandinagari} \p{Script_Extensions=Nandinagari} (Short: 3991 \p{Nand}; NOT \p{Block=Nandinagari}) (86) 3992 \p{Narb} \p{Old_North_Arabian} (= 3993 \p{Script_Extensions=Old_North_Arabian}) 3994 (32) 3995 X \p{NB} \p{No_Block} (= \p{Block=No_Block}) 3996 (832_720 plus all above-Unicode code 3997 points) 3998 \p{Nbat} \p{Nabataean} (= \p{Script_Extensions= 3999 Nabataean}) (NOT \p{Block=Nabataean}) 4000 (40) 4001 \p{NChar} \p{Noncharacter_Code_Point} (= 4002 \p{Noncharacter_Code_Point=Y}) (66) 4003 \p{NChar: *} \p{Noncharacter_Code_Point: *} 4004 \p{Nd} \p{XPosixDigit} (= \p{General_Category= 4005 Decimal_Number}) (630) 4006 \p{New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short: 4007 \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83) 4008 \p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block= 4009 Newa}) (94) 4010 \p{NFC_QC: *} \p{NFC_Quick_Check: *} 4011 \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (110) 4012 \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110: 4013 U+0300..0304, U+0306..030C, U+030F, 4014 U+0311, U+0313..0314, U+031B ...) 4015 \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT 4016 \P{NFC_Quick_Check} NOR \P{NFC_QC}) 4017 (1120) 4018 \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT 4019 \P{NFC_Quick_Check} NOR \P{NFC_QC}) 4020 (1120: U+0340..0341, U+0343..0344, 4021 U+0374, U+037E, U+0387, U+0958..095F ...) 4022 \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT 4023 \p{NFC_Quick_Check} NOR \p{NFC_QC}) 4024 (1_112_882 plus all above-Unicode code 4025 points) 4026 \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT 4027 \p{NFC_Quick_Check} NOR \p{NFC_QC}) 4028 (1_112_882 plus all above-Unicode code 4029 points: U+0000..02FF, U+0305, 4030 U+030D..030E, U+0310, U+0312, 4031 U+0315..031A ...) 4032 \p{NFD_QC: *} \p{NFD_Quick_Check: *} 4033 \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT 4034 \P{NFD_Quick_Check} NOR \P{NFD_QC}) 4035 (13_232) 4036 \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT 4037 \P{NFD_Quick_Check} NOR \P{NFD_QC}) 4038 (13_232: [\xc0-\xc5\xc7-\xcf\xd1- 4039 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1- 4040 \xf6\xf9-\xfd\xff], U+0100..010F, 4041 U+0112..0125, U+0128..0130, 4042 U+0134..0137, U+0139..013E ...) 4043 \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT 4044 \p{NFD_Quick_Check} NOR \p{NFD_QC}) 4045 (1_100_880 plus all above-Unicode code 4046 points) 4047 \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT 4048 \p{NFD_Quick_Check} NOR \p{NFD_QC}) 4049 (1_100_880 plus all above-Unicode code 4050 points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde- 4051 \xdf\xe6\xf0\xf7-\xf8\xfe], 4052 U+0110..0111, U+0126..0127, 4053 U+0131..0133, U+0138, U+013F..0142 ...) 4054 \p{NFKC_QC: *} \p{NFKC_Quick_Check: *} 4055 \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110) 4056 \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110: 4057 U+0300..0304, U+0306..030C, U+030F, 4058 U+0311, U+0313..0314, U+031B ...) 4059 \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT 4060 \P{NFKC_Quick_Check} NOR \P{NFKC_QC}) 4061 (4796) 4062 \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT 4063 \P{NFKC_Quick_Check} NOR \P{NFKC_QC}) 4064 (4796: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8- 4065 \xba\xbc-\xbe], U+0132..0133, 4066 U+013F..0140, U+0149, U+017F, 4067 U+01C4..01CC ...) 4068 \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT 4069 \p{NFKC_Quick_Check} NOR \p{NFKC_QC}) 4070 (1_109_206 plus all above-Unicode code 4071 points) 4072 \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT 4073 \p{NFKC_Quick_Check} NOR \p{NFKC_QC}) 4074 (1_109_206 plus all above-Unicode code 4075 points: [\x00-\x9f\xa1-\xa7\xa9\xab- 4076 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff], 4077 U+0100..0131, U+0134..013E, 4078 U+0141..0148, U+014A..017E, U+0180..01C3 4079 ...) 4080 \p{NFKD_QC: *} \p{NFKD_Quick_Check: *} 4081 \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT 4082 \P{NFKD_Quick_Check} NOR \P{NFKD_QC}) 4083 (16_896) 4084 \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT 4085 \P{NFKD_Quick_Check} NOR \P{NFKD_QC}) 4086 (16_896: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8- 4087 \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1- 4088 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1- 4089 \xf6\xf9-\xfd\xff], U+0100..010F, 4090 U+0112..0125, U+0128..0130, 4091 U+0132..0137, U+0139..0140 ...) 4092 \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT 4093 \p{NFKD_Quick_Check} NOR \p{NFKD_QC}) 4094 (1_097_216 plus all above-Unicode code 4095 points) 4096 \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT 4097 \p{NFKD_Quick_Check} NOR \p{NFKD_QC}) 4098 (1_097_216 plus all above-Unicode code 4099 points: [\x00-\x9f\xa1-\xa7\xa9\xab- 4100 \xae\xb0-\xb1\xb6- 4101 \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde- 4102 \xdf\xe6\xf0\xf7-\xf8\xfe], 4103 U+0110..0111, U+0126..0127, U+0131, 4104 U+0138, U+0141..0142 ...) 4105 \p{Nko} \p{Script_Extensions=Nko} (NOT \p{Block= 4106 NKo}) (62) 4107 \p{Nkoo} \p{Nko} (= \p{Script_Extensions=Nko}) (NOT 4108 \p{Block=NKo}) (62) 4109 \p{Nl} \p{Letter_Number} (= \p{General_Category= 4110 Letter_Number}) (236) 4111 \p{No} \p{Other_Number} (= \p{General_Category= 4112 Other_Number}) (888) 4113 X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB}) 4114 (832_720 plus all above-Unicode code 4115 points) 4116 \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short: 4117 \p{NChar}) (66) 4118 \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar}) 4119 (1_114_046 plus all above-Unicode code 4120 points: U+0000..FDCF, U+FDF0..FFFD, 4121 U+10000..1FFFD, U+20000..2FFFD, 4122 U+30000..3FFFD, U+40000..4FFFD ...) 4123 \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar}) 4124 (66: U+FDD0..FDEF, U+FFFE..FFFF, 4125 U+1FFFE..1FFFF, U+2FFFE..2FFFF, 4126 U+3FFFE..3FFFF, U+4FFFE..4FFFF ...) 4127 \p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark} 4128 (Short: \p{Mn}) (1826) 4129 \p{Nshu} \p{Nushu} (= \p{Script_Extensions=Nushu}) 4130 (NOT \p{Block=Nushu}) (397) 4131 \p{Nt: *} \p{Numeric_Type: *} 4132 \p{Number} \p{General_Category=Number} (Short: \p{N}) 4133 (1754) 4134 X \p{Number_Forms} \p{Block=Number_Forms} (64) 4135 \p{Numeric_Type: De} \p{Numeric_Type=Decimal} (630) 4136 \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (630: [0-9], 4137 U+0660..0669, U+06F0..06F9, 4138 U+07C0..07C9, U+0966..096F, U+09E6..09EF 4139 ...) 4140 \p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128) 4141 \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9], 4142 U+1369..1371, U+19DA, U+2070, 4143 U+2074..2079, U+2080..2089 ...) 4144 \p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_277 plus all 4145 above-Unicode code points: [\x00- 4146 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A- 4147 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4- 4148 \xb8\xba-\xbb\xbf-\xff], U+0100..065F, 4149 U+066A..06EF, U+06FA..07BF, 4150 U+07CA..0965, U+0970..09E5 ...) 4151 \p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (1077) 4152 \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1077: [\xbc-\xbe], 4153 U+09F4..09F9, U+0B72..0B77, 4154 U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E 4155 ...) 4156 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33) 4157 T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (81: [0], U+0660, 4158 U+06F0, U+07C0, U+0966, U+09E6 ...) 4159 T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0, 4160 U+11FD4) 4161 T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1) 4162 T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2) 4163 T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3) 4164 T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4) 4165 T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5) 4166 T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6) 4167 T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7) 4168 T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8) 4169 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75, 4170 U+0D76, U+A833, U+11FC9..11FCA) 4171 T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6) 4172 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152, 4173 U+11FCB) 4174 T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1: U+2151) 4175 T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76, 4176 U+0D77, U+215B, U+A834, U+11FCC ...) 4177 T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1: U+2150) 4178 T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD) 4179 T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (4: U+2159, U+109F7, 4180 U+12461, U+1ED3D) 4181 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77, 4182 U+0D78, U+A835, U+11FCE) 4183 T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155, 4184 U+11FCF) 4185 T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7, 4186 U+0B72, U+0D73, U+A830, U+10140 ...) 4187 T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6: U+2153, U+109F9, 4188 U+10E7D, U+1245A, U+1245D, U+12465) 4189 T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1: U+215C) 4190 T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1: U+2156) 4191 T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA) 4192 T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73, 4193 U+0D74, U+0F2A, U+2CFD, U+A831 ...) 4194 T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC) 4195 T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1: U+2157) 4196 T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1: U+215D) 4197 T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7: U+2154, U+10177, 4198 U+109FD, U+10E7E, U+1245B, U+1245E ...) 4199 T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8, 4200 U+0B74, U+0D75, U+A832, U+10178 ...) 4201 T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1: U+2158) 4202 T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3: U+215A, U+109FF, 4203 U+1245C) 4204 T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1: U+215E) 4205 T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC) 4206 T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (137: [1\xb9], U+0661, 4207 U+06F1, U+07C1, U+0967, U+09E7 ...) 4208 T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1: U+0F2B) 4209 T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (136: [2\xb2], U+0662, 4210 U+06F2, U+07C2, U+0968, U+09E8 ...) 4211 T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1: U+0F2C) 4212 T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (137: [3\xb3], U+0663, 4213 U+06F3, U+07C3, U+0969, U+09E9 ...) 4214 T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1: U+0F2D) 4215 T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (128: [4], U+0664, 4216 U+06F4, U+07C4, U+096A, U+09EA ...) 4217 T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1: U+0F2E) 4218 T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (127: [5], U+0665, 4219 U+06F5, U+07C5, U+096B, U+09EB ...) 4220 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F) 4221 T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (111: [6], U+0666, 4222 U+06F6, U+07C6, U+096C, U+09EC ...) 4223 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30) 4224 T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (110: [7], U+0667, 4225 U+06F7, U+07C7, U+096D, U+09ED ...) 4226 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31) 4227 T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (106: [8], U+0668, 4228 U+06F8, U+07C8, U+096E, U+09EE ...) 4229 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32) 4230 T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (110: [9], U+0669, 4231 U+06F9, U+07C9, U+096F, U+09EF ...) 4232 T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (61: U+0BF0, U+0D70, 4233 U+1372, U+2169, U+2179, U+2469 ...) 4234 T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (8: U+216A, U+217A, 4235 U+246A, U+247E, U+2492, U+24EB ...) 4236 T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (8: U+216B, U+217B, 4237 U+246B, U+247F, U+2493, U+24EC ...) 4238 T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (6: U+246C, U+2480, 4239 U+2494, U+24ED, U+16E8D, U+1D2ED) 4240 T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (6: U+246D, U+2481, 4241 U+2495, U+24EE, U+16E8E, U+1D2EE) 4242 T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (6: U+246E, U+2482, 4243 U+2496, U+24EF, U+16E8F, U+1D2EF) 4244 T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (7: U+09F9, U+246F, 4245 U+2483, U+2497, U+24F0, U+16E90 ...) 4246 T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (7: U+16EE, U+2470, 4247 U+2484, U+2498, U+24F1, U+16E91 ...) 4248 T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (7: U+16EF, U+2471, 4249 U+2485, U+2499, U+24F2, U+16E92 ...) 4250 T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (7: U+16F0, U+2472, 4251 U+2486, U+249A, U+24F3, U+16E93 ...) 4252 T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (35: U+1373, U+2473, 4253 U+2487, U+249B, U+24F4, U+3039 ...) 4254 T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1: U+3251) 4255 T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1: U+3252) 4256 T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1: U+3253) 4257 T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1: U+3254) 4258 T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1: U+3255) 4259 T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1: U+3256) 4260 T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1: U+3257) 4261 T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1: U+3258) 4262 T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1: U+3259) 4263 T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (19: U+1374, U+303A, 4264 U+324A, U+325A, U+5345, U+10112 ...) 4265 T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1: U+325B) 4266 T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1: U+325C) 4267 T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1: U+325D) 4268 T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1: U+325E) 4269 T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1: U+325F) 4270 T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1: U+32B1) 4271 T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1: U+32B2) 4272 T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1: U+32B3) 4273 T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1: U+32B4) 4274 T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (18: U+1375, U+324B, 4275 U+32B5, U+534C, U+10113, U+102ED ...) 4276 T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1: U+32B6) 4277 T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1: U+32B7) 4278 T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1: U+32B8) 4279 T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1: U+32B9) 4280 T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1: U+32BA) 4281 T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1: U+32BB) 4282 T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1: U+32BC) 4283 T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1: U+32BD) 4284 T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1: U+32BE) 4285 T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (29: U+1376, U+216C, 4286 U+217C, U+2186, U+324C, U+32BF ...) 4287 T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (13: U+1377, U+324D, 4288 U+10115, U+102EF, U+109CE, U+10E6E ...) 4289 T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (13: U+1378, U+324E, 4290 U+10116, U+102F0, U+109CF, U+10E6F ...) 4291 T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (12: U+1379, U+324F, 4292 U+10117, U+102F1, U+10E70, U+11062 ...) 4293 T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (12: U+137A, U+10118, 4294 U+102F2, U+10341, U+10E71, U+11063 ...) 4295 T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (34: U+0BF1, U+0D71, 4296 U+137B, U+216D, U+217D, U+4F70 ...) 4297 T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (6: U+1011A, U+102F4, 4298 U+109D3, U+10E73, U+1EC84, U+1ED14) 4299 T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (7: U+1011B, U+1016B, 4300 U+102F5, U+109D4, U+10E74, U+1EC85 ...) 4301 T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (7: U+1011C, U+102F6, 4302 U+109D5, U+10E75, U+1EC86, U+1ED16 ...) 4303 T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (16: U+216E, U+217E, 4304 U+1011D, U+10145, U+1014C, U+10153 ...) 4305 T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (7: U+1011E, U+102F8, 4306 U+109D7, U+10E77, U+1EC88, U+1ED18 ...) 4307 T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (6: U+1011F, U+102F9, 4308 U+109D8, U+10E78, U+1EC89, U+1ED19) 4309 T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (6: U+10120, U+102FA, 4310 U+109D9, U+10E79, U+1EC8A, U+1ED1A) 4311 T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (7: U+10121, U+102FB, 4312 U+1034A, U+109DA, U+10E7A, U+1EC8B ...) 4313 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72, 4314 U+216F, U+217F..2180, U+4EDF, U+5343 ...) 4315 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC, 4316 U+1EC8D, U+1ED1D, U+1ED3A) 4317 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD, 4318 U+1EC8E, U+1ED1E) 4319 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE, 4320 U+1EC8F, U+1ED1F) 4321 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126, 4322 U+10146, U+1014E, U+10172, U+109DF ...) 4323 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0, 4324 U+1EC91, U+1ED21) 4325 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1, 4326 U+1EC92, U+1ED22) 4327 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2, 4328 U+1EC93, U+1ED23) 4329 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3, 4330 U+1EC94, U+1ED24) 4331 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13: 4332 U+137C, U+2182, U+4E07, U+842C, U+1012B, 4333 U+10155 ...) 4334 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4: 4335 U+1012C, U+109E5, U+1EC96, U+1ED26) 4336 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4: 4337 U+1012D, U+109E6, U+1EC97, U+1ED27) 4338 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4: 4339 U+1012E, U+109E7, U+1EC98, U+1ED28) 4340 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7: 4341 U+2187, U+1012F, U+10147, U+10156, 4342 U+109E8, U+1EC99 ...) 4343 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4: 4344 U+10130, U+109E9, U+1EC9A, U+1ED2A) 4345 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4: 4346 U+10131, U+109EA, U+1EC9B, U+1ED2B) 4347 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4: 4348 U+10132, U+109EB, U+1EC9C, U+1ED2C) 4349 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4: 4350 U+10133, U+109EC, U+1EC9D, U+1ED2D) 4351 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5: 4352 U+2188, U+109ED, U+1EC9E, U+1ECA0, 4353 U+1ECB4) 4354 T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2: 4355 U+109EE, U+1EC9F) 4356 T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1: 4357 U+12432) 4358 T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1: 4359 U+109EF) 4360 T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1: 4361 U+109F0) 4362 T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1: 4363 U+12433) 4364 T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1: 4365 U+109F1) 4366 T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1: 4367 U+109F2) 4368 T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1: 4369 U+109F3) 4370 T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1: 4371 U+109F4) 4372 T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1: 4373 U+109F5) 4374 T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1: 4375 U+16B5E) 4376 T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000}) 4377 (1: U+1ECA1) 4378 T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000}) 4379 (1: U+1ECA2) 4380 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000}) 4381 (3: U+4EBF, U+5104, U+16B5F) 4382 T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv= 4383 10000000000}) (1: U+16B60) 4384 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv= 4385 1000000000000}) (2: U+5146, U+16B61) 4386 \p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_277 plus all 4387 above-Unicode code points: [\x00- 4388 \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A- 4389 Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4- 4390 \xb8\xba-\xbb\xbf-\xff], U+0100..065F, 4391 U+066A..06EF, U+06FA..07BF, 4392 U+07CA..0965, U+0970..09E5 ...) 4393 \p{Nushu} \p{Script_Extensions=Nushu} (Short: 4394 \p{Nshu}; NOT \p{Block=Nushu}) (397) 4395 \p{Nv: *} \p{Numeric_Value: *} 4396 \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions= 4397 Nyiakeng_Puachue_Hmong} (Short: 4398 \p{Hmnp}; NOT \p{Block= 4399 Nyiakeng_Puachue_Hmong}) (71) 4400 X \p{OCR} \p{Optical_Character_Recognition} (= 4401 \p{Block=Optical_Character_Recognition}) 4402 (32) 4403 \p{Ogam} \p{Ogham} (= \p{Script_Extensions=Ogham}) 4404 (NOT \p{Block=Ogham}) (29) 4405 \p{Ogham} \p{Script_Extensions=Ogham} (Short: 4406 \p{Ogam}; NOT \p{Block=Ogham}) (29) 4407 \p{Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short: 4408 \p{Olck}) (48) 4409 \p{Olck} \p{Ol_Chiki} (= \p{Script_Extensions= 4410 Ol_Chiki}) (48) 4411 \p{Old_Hungarian} \p{Script_Extensions=Old_Hungarian} 4412 (Short: \p{Hung}; NOT \p{Block= 4413 Old_Hungarian}) (108) 4414 \p{Old_Italic} \p{Script_Extensions=Old_Italic} (Short: 4415 \p{Ital}; NOT \p{Block=Old_Italic}) (39) 4416 \p{Old_North_Arabian} \p{Script_Extensions=Old_North_Arabian} 4417 (Short: \p{Narb}) (32) 4418 \p{Old_Permic} \p{Script_Extensions=Old_Permic} (Short: 4419 \p{Perm}; NOT \p{Block=Old_Permic}) (44) 4420 \p{Old_Persian} \p{Script_Extensions=Old_Persian} (Short: 4421 \p{Xpeo}; NOT \p{Block=Old_Persian}) (50) 4422 \p{Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short: 4423 \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40) 4424 \p{Old_South_Arabian} \p{Script_Extensions=Old_South_Arabian} 4425 (Short: \p{Sarb}) (32) 4426 \p{Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short: 4427 \p{Orkh}; NOT \p{Block=Old_Turkic}) (73) 4428 \p{Open_Punctuation} \p{General_Category=Open_Punctuation} 4429 (Short: \p{Ps}) (75) 4430 X \p{Optical_Character_Recognition} \p{Block= 4431 Optical_Character_Recognition} (Short: 4432 \p{InOCR}) (32) 4433 \p{Oriya} \p{Script_Extensions=Oriya} (Short: 4434 \p{Orya}; NOT \p{Block=Oriya}) (96) 4435 \p{Orkh} \p{Old_Turkic} (= \p{Script_Extensions= 4436 Old_Turkic}) (NOT \p{Block=Old_Turkic}) 4437 (73) 4438 X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48) 4439 \p{Orya} \p{Oriya} (= \p{Script_Extensions=Oriya}) 4440 (NOT \p{Block=Oriya}) (96) 4441 \p{Osage} \p{Script_Extensions=Osage} (Short: 4442 \p{Osge}; NOT \p{Block=Osage}) (72) 4443 \p{Osge} \p{Osage} (= \p{Script_Extensions=Osage}) 4444 (NOT \p{Block=Osage}) (72) 4445 \p{Osma} \p{Osmanya} (= \p{Script_Extensions= 4446 Osmanya}) (NOT \p{Block=Osmanya}) (40) 4447 \p{Osmanya} \p{Script_Extensions=Osmanya} (Short: 4448 \p{Osma}; NOT \p{Block=Osmanya}) (40) 4449 \p{Other} \p{General_Category=Other} (Short: \p{C}) 4450 (976_344 plus all above-Unicode code 4451 points) 4452 \p{Other_Letter} \p{General_Category=Other_Letter} (Short: 4453 \p{Lo}) (121_414) 4454 \p{Other_Number} \p{General_Category=Other_Number} (Short: 4455 \p{No}) (888) 4456 \p{Other_Punctuation} \p{General_Category=Other_Punctuation} 4457 (Short: \p{Po}) (588) 4458 \p{Other_Symbol} \p{General_Category=Other_Symbol} (Short: 4459 \p{So}) (6161) 4460 X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80) 4461 \p{P} \pP \p{Punct} (= \p{General_Category= 4462 Punctuation}) (NOT 4463 \p{General_Punctuation}) (792) 4464 \p{Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} (Short: 4465 \p{Hmng}; NOT \p{Block=Pahawh_Hmong}) 4466 (127) 4467 \p{Palm} \p{Palmyrene} (= \p{Script_Extensions= 4468 Palmyrene}) (32) 4469 \p{Palmyrene} \p{Script_Extensions=Palmyrene} (Short: 4470 \p{Palm}) (32) 4471 \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator} 4472 (Short: \p{Zp}) (1) 4473 \p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax= 4474 Y}) (2760) 4475 \p{Pat_Syn: *} \p{Pattern_Syntax: *} 4476 \p{Pat_WS} \p{Pattern_White_Space} (= 4477 \p{Pattern_White_Space=Y}) (11) 4478 \p{Pat_WS: *} \p{Pattern_White_Space: *} 4479 \p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn}) 4480 (2760) 4481 \p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn}) 4482 (1_111_352 plus all above-Unicode code 4483 points: [\x00-\x200-9A-Z_a-z\x7f- 4484 \xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb7- 4485 \xba\xbc-\xbe\xc0-\xd6\xd8-\xf6\xf8- 4486 \xff], U+0100..200F, U+2028..202F, 4487 U+203F..2040, U+2054, U+205F..218F ...) 4488 \p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760: 4489 [!\"#\$\%&\'\(\)*+,\-.\/:;<= 4490 >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab- 4491 \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7], 4492 U+2010..2027, U+2030..203E, 4493 U+2041..2053, U+2055..205E, U+2190..245F 4494 ...) 4495 \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short: 4496 \p{PatWS}) (11) 4497 \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS}) 4498 (1_114_101 plus all above-Unicode code 4499 points: [^\t\n\cK\f\r\x20\x85], 4500 U+0100..200D, U+2010..2027, 4501 U+202A..infinity) 4502 \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11: 4503 [\t\n\cK\f\r\x20\x85], U+200E..200F, 4504 U+2028..2029) 4505 \p{Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short: 4506 \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57) 4507 \p{Pauc} \p{Pau_Cin_Hau} (= \p{Script_Extensions= 4508 Pau_Cin_Hau}) (NOT \p{Block= 4509 Pau_Cin_Hau}) (57) 4510 \p{Pc} \p{Connector_Punctuation} (= 4511 \p{General_Category= 4512 Connector_Punctuation}) (10) 4513 \p{PCM} \p{Prepended_Concatenation_Mark} (= 4514 \p{Prepended_Concatenation_Mark=Y}) (11) 4515 \p{PCM: *} \p{Prepended_Concatenation_Mark: *} 4516 \p{Pd} \p{Dash_Punctuation} (= 4517 \p{General_Category=Dash_Punctuation}) 4518 (24) 4519 \p{Pe} \p{Close_Punctuation} (= 4520 \p{General_Category=Close_Punctuation}) 4521 (73) 4522 \p{PerlSpace} \p{PosixSpace} (6) 4523 \p{PerlWord} \p{PosixWord} (63) 4524 \p{Perm} \p{Old_Permic} (= \p{Script_Extensions= 4525 Old_Permic}) (NOT \p{Block=Old_Permic}) 4526 (44) 4527 \p{Pf} \p{Final_Punctuation} (= 4528 \p{General_Category=Final_Punctuation}) 4529 (10) 4530 \p{Phag} \p{Phags_Pa} (= \p{Script_Extensions= 4531 Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59) 4532 \p{Phags_Pa} \p{Script_Extensions=Phags_Pa} (Short: 4533 \p{Phag}; NOT \p{Block=Phags_Pa}) (59) 4534 X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block= 4535 Phaistos_Disc}) (48) 4536 X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short: 4537 \p{InPhaistos}) (48) 4538 \p{Phli} \p{Inscriptional_Pahlavi} (= 4539 \p{Script_Extensions= 4540 Inscriptional_Pahlavi}) (NOT \p{Block= 4541 Inscriptional_Pahlavi}) (27) 4542 \p{Phlp} \p{Psalter_Pahlavi} (= 4543 \p{Script_Extensions=Psalter_Pahlavi}) 4544 (NOT \p{Block=Psalter_Pahlavi}) (30) 4545 \p{Phnx} \p{Phoenician} (= \p{Script_Extensions= 4546 Phoenician}) (NOT \p{Block=Phoenician}) 4547 (29) 4548 \p{Phoenician} \p{Script_Extensions=Phoenician} (Short: 4549 \p{Phnx}; NOT \p{Block=Phoenician}) (29) 4550 X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block= 4551 Phonetic_Extensions}) (128) 4552 X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (= 4553 \p{Block= 4554 Phonetic_Extensions_Supplement}) (64) 4555 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short: 4556 \p{InPhoneticExt}) (128) 4557 X \p{Phonetic_Extensions_Supplement} \p{Block= 4558 Phonetic_Extensions_Supplement} (Short: 4559 \p{InPhoneticExtSup}) (64) 4560 \p{Pi} \p{Initial_Punctuation} (= 4561 \p{General_Category= 4562 Initial_Punctuation}) (12) 4563 X \p{Playing_Cards} \p{Block=Playing_Cards} (96) 4564 \p{Plrd} \p{Miao} (= \p{Script_Extensions=Miao}) 4565 (NOT \p{Block=Miao}) (149) 4566 \p{Po} \p{Other_Punctuation} (= 4567 \p{General_Category=Other_Punctuation}) 4568 (588) 4569 \p{PosixAlnum} (62: [0-9A-Za-z]) 4570 \p{PosixAlpha} (52: [A-Za-z]) 4571 \p{PosixBlank} (2: [\t\x20]) 4572 \p{PosixCntrl} ASCII control characters (33: ACK, BEL, 4573 BS, CAN, CR, DC1, DC2, DC3, DC4, DEL, 4574 DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF, 4575 FS, GS, HT, LF, NAK, NUL, RS, SI, SO, 4576 SOH, STX, SUB, SYN, US, VT) 4577 \p{PosixDigit} (10: [0-9]) 4578 \p{PosixGraph} (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A- 4579 Z\[\\\]\^_`a-z\{\|\}~]) 4580 \p{PosixLower} (/i= PosixAlpha) (26: [a-z]) 4581 \p{PosixPrint} (95: [\x20-\x7e]) 4582 \p{PosixPunct} (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<= 4583 >?\@\[\\\]\^_`\{\|\}~]) 4584 \p{PosixSpace} (Short: \p{PerlSpace}) (6: 4585 [\t\n\cK\f\r\x20]) 4586 \p{PosixUpper} (/i= PosixAlpha) (26: [A-Z]) 4587 \p{PosixWord} \w, restricted to ASCII (Short: 4588 \p{PerlWord}) (63: [0-9A-Z_a-z]) 4589 \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} (Short: \p{AHex}) 4590 (22) 4591 \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark= 4592 Y} (Short: \p{PCM}) (11) 4593 \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM}) 4594 (1_114_101 plus all above-Unicode code 4595 points: U+0000..05FF, U+0606..06DC, 4596 U+06DE..070E, U+0710..08E1, 4597 U+08E3..110BC, U+110BE..110CC ...) 4598 \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM}) 4599 (11: U+0600..0605, U+06DD, U+070F, 4600 U+08E2, U+110BD, U+110CD) 4601 T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl 4602 extension) (33_979) 4603 T \p{Present_In: 2.0} Code point's usage introduced in version 4604 2.0 or earlier (Short: \p{In=2.0}) (Perl 4605 extension) (178_500: U+0000..01F5, 4606 U+01FA..0217, U+0250..02A8, 4607 U+02B0..02DE, U+02E0..02E9, U+0300..0345 4608 ...) 4609 \p{Present_In: V2_0} \p{Present_In=2.0} (Perl extension) 4610 (178_500) 4611 T \p{Present_In: 2.1} Code point's usage introduced in version 4612 2.1 or earlier (Short: \p{In=2.1}) (Perl 4613 extension) (178_502: U+0000..01F5, 4614 U+01FA..0217, U+0250..02A8, 4615 U+02B0..02DE, U+02E0..02E9, U+0300..0345 4616 ...) 4617 \p{Present_In: V2_1} \p{Present_In=2.1} (Perl extension) 4618 (178_502) 4619 T \p{Present_In: 3.0} Code point's usage introduced in version 4620 3.0 or earlier (Short: \p{In=3.0}) (Perl 4621 extension) (188_809: U+0000..021F, 4622 U+0222..0233, U+0250..02AD, 4623 U+02B0..02EE, U+0300..034E, U+0360..0362 4624 ...) 4625 \p{Present_In: V3_0} \p{Present_In=3.0} (Perl extension) 4626 (188_809) 4627 T \p{Present_In: 3.1} Code point's usage introduced in version 4628 3.1 or earlier (Short: \p{In=3.1}) (Perl 4629 extension) (233_787: U+0000..021F, 4630 U+0222..0233, U+0250..02AD, 4631 U+02B0..02EE, U+0300..034E, U+0360..0362 4632 ...) 4633 \p{Present_In: V3_1} \p{Present_In=3.1} (Perl extension) 4634 (233_787) 4635 T \p{Present_In: 3.2} Code point's usage introduced in version 4636 3.2 or earlier (Short: \p{In=3.2}) (Perl 4637 extension) (234_803: U+0000..0220, 4638 U+0222..0233, U+0250..02AD, 4639 U+02B0..02EE, U+0300..034F, U+0360..036F 4640 ...) 4641 \p{Present_In: V3_2} \p{Present_In=3.2} (Perl extension) 4642 (234_803) 4643 T \p{Present_In: 4.0} Code point's usage introduced in version 4644 4.0 or earlier (Short: \p{In=4.0}) (Perl 4645 extension) (236_029: U+0000..0236, 4646 U+0250..0357, U+035D..036F, 4647 U+0374..0375, U+037A, U+037E ...) 4648 \p{Present_In: V4_0} \p{Present_In=4.0} (Perl extension) 4649 (236_029) 4650 T \p{Present_In: 4.1} Code point's usage introduced in version 4651 4.1 or earlier (Short: \p{In=4.1}) (Perl 4652 extension) (237_302: U+0000..0241, 4653 U+0250..036F, U+0374..0375, U+037A, 4654 U+037E, U+0384..038A ...) 4655 \p{Present_In: V4_1} \p{Present_In=4.1} (Perl extension) 4656 (237_302) 4657 T \p{Present_In: 5.0} Code point's usage introduced in version 4658 5.0 or earlier (Short: \p{In=5.0}) (Perl 4659 extension) (238_671: U+0000..036F, 4660 U+0374..0375, U+037A..037E, 4661 U+0384..038A, U+038C, U+038E..03A1 ...) 4662 \p{Present_In: V5_0} \p{Present_In=5.0} (Perl extension) 4663 (238_671) 4664 T \p{Present_In: 5.1} Code point's usage introduced in version 4665 5.1 or earlier (Short: \p{In=5.1}) (Perl 4666 extension) (240_295: U+0000..0377, 4667 U+037A..037E, U+0384..038A, U+038C, 4668 U+038E..03A1, U+03A3..0523 ...) 4669 \p{Present_In: V5_1} \p{Present_In=5.1} (Perl extension) 4670 (240_295) 4671 T \p{Present_In: 5.2} Code point's usage introduced in version 4672 5.2 or earlier (Short: \p{In=5.2}) (Perl 4673 extension) (246_943: U+0000..0377, 4674 U+037A..037E, U+0384..038A, U+038C, 4675 U+038E..03A1, U+03A3..0525 ...) 4676 \p{Present_In: V5_2} \p{Present_In=5.2} (Perl extension) 4677 (246_943) 4678 T \p{Present_In: 6.0} Code point's usage introduced in version 4679 6.0 or earlier (Short: \p{In=6.0}) (Perl 4680 extension) (249_031: U+0000..0377, 4681 U+037A..037E, U+0384..038A, U+038C, 4682 U+038E..03A1, U+03A3..0527 ...) 4683 \p{Present_In: V6_0} \p{Present_In=6.0} (Perl extension) 4684 (249_031) 4685 T \p{Present_In: 6.1} Code point's usage introduced in version 4686 6.1 or earlier (Short: \p{In=6.1}) (Perl 4687 extension) (249_763: U+0000..0377, 4688 U+037A..037E, U+0384..038A, U+038C, 4689 U+038E..03A1, U+03A3..0527 ...) 4690 \p{Present_In: V6_1} \p{Present_In=6.1} (Perl extension) 4691 (249_763) 4692 T \p{Present_In: 6.2} Code point's usage introduced in version 4693 6.2 or earlier (Short: \p{In=6.2}) (Perl 4694 extension) (249_764: U+0000..0377, 4695 U+037A..037E, U+0384..038A, U+038C, 4696 U+038E..03A1, U+03A3..0527 ...) 4697 \p{Present_In: V6_2} \p{Present_In=6.2} (Perl extension) 4698 (249_764) 4699 T \p{Present_In: 6.3} Code point's usage introduced in version 4700 6.3 or earlier (Short: \p{In=6.3}) (Perl 4701 extension) (249_769: U+0000..0377, 4702 U+037A..037E, U+0384..038A, U+038C, 4703 U+038E..03A1, U+03A3..0527 ...) 4704 \p{Present_In: V6_3} \p{Present_In=6.3} (Perl extension) 4705 (249_769) 4706 T \p{Present_In: 7.0} Code point's usage introduced in version 4707 7.0 or earlier (Short: \p{In=7.0}) (Perl 4708 extension) (252_603: U+0000..0377, 4709 U+037A..037F, U+0384..038A, U+038C, 4710 U+038E..03A1, U+03A3..052F ...) 4711 \p{Present_In: V7_0} \p{Present_In=7.0} (Perl extension) 4712 (252_603) 4713 T \p{Present_In: 8.0} Code point's usage introduced in version 4714 8.0 or earlier (Short: \p{In=8.0}) (Perl 4715 extension) (260_319: U+0000..0377, 4716 U+037A..037F, U+0384..038A, U+038C, 4717 U+038E..03A1, U+03A3..052F ...) 4718 \p{Present_In: V8_0} \p{Present_In=8.0} (Perl extension) 4719 (260_319) 4720 T \p{Present_In: 9.0} Code point's usage introduced in version 4721 9.0 or earlier (Short: \p{In=9.0}) (Perl 4722 extension) (267_819: U+0000..0377, 4723 U+037A..037F, U+0384..038A, U+038C, 4724 U+038E..03A1, U+03A3..052F ...) 4725 \p{Present_In: V9_0} \p{Present_In=9.0} (Perl extension) 4726 (267_819) 4727 T \p{Present_In: 10.0} Code point's usage introduced in version 4728 10.0 or earlier (Short: \p{In=10.0}) 4729 (Perl extension) (276_337: U+0000..0377, 4730 U+037A..037F, U+0384..038A, U+038C, 4731 U+038E..03A1, U+03A3..052F ...) 4732 \p{Present_In: V10_0} \p{Present_In=10.0} (Perl extension) 4733 (276_337) 4734 T \p{Present_In: 11.0} Code point's usage introduced in version 4735 11.0 or earlier (Short: \p{In=11.0}) 4736 (Perl extension) (277_021: U+0000..0377, 4737 U+037A..037F, U+0384..038A, U+038C, 4738 U+038E..03A1, U+03A3..052F ...) 4739 \p{Present_In: V11_0} \p{Present_In=11.0} (Perl extension) 4740 (277_021) 4741 T \p{Present_In: 12.0} Code point's usage introduced in version 4742 12.0 or earlier (Short: \p{In=12.0}) 4743 (Perl extension) (277_575: U+0000..0377, 4744 U+037A..037F, U+0384..038A, U+038C, 4745 U+038E..03A1, U+03A3..052F ...) 4746 \p{Present_In: V12_0} \p{Present_In=12.0} (Perl extension) 4747 (277_575) 4748 T \p{Present_In: 12.1} Code point's usage introduced in version 4749 12.1 or earlier (Short: \p{In=12.1}) 4750 (Perl extension) (277_576: U+0000..0377, 4751 U+037A..037F, U+0384..038A, U+038C, 4752 U+038E..03A1, U+03A3..052F ...) 4753 \p{Present_In: V12_1} \p{Present_In=12.1} (Perl extension) 4754 (277_576) 4755 \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In= 4756 Unassigned}) (Perl extension) (836_536 4757 plus all above-Unicode code points) 4758 \p{Print} \p{XPosixPrint} (275_395) 4759 \p{Private_Use} \p{General_Category=Private_Use} (Short: 4760 \p{Co}; NOT \p{Private_Use_Area}) 4761 (137_468) 4762 X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short: 4763 \p{InPUA}) (6400) 4764 \p{Prti} \p{Inscriptional_Parthian} (= 4765 \p{Script_Extensions= 4766 Inscriptional_Parthian}) (NOT \p{Block= 4767 Inscriptional_Parthian}) (30) 4768 \p{Ps} \p{Open_Punctuation} (= 4769 \p{General_Category=Open_Punctuation}) 4770 (75) 4771 \p{Psalter_Pahlavi} \p{Script_Extensions=Psalter_Pahlavi} 4772 (Short: \p{Phlp}; NOT \p{Block= 4773 Psalter_Pahlavi}) (30) 4774 X \p{PUA} \p{Private_Use_Area} (= \p{Block= 4775 Private_Use_Area}) (6400) 4776 \p{Punct} \p{General_Category=Punctuation} (Short: 4777 \p{P}; NOT \p{General_Punctuation}) (792) 4778 \p{Punctuation} \p{Punct} (= \p{General_Category= 4779 Punctuation}) (NOT 4780 \p{General_Punctuation}) (792) 4781 \p{Qaac} \p{Coptic} (= \p{Script_Extensions= 4782 Coptic}) (NOT \p{Block=Coptic}) (165) 4783 \p{Qaai} \p{Inherited} (= \p{Script_Extensions= 4784 Inherited}) (502) 4785 \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark= 4786 Y}) (30) 4787 \p{QMark: *} \p{Quotation_Mark: *} 4788 \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark}) 4789 (30) 4790 \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082 4791 plus all above-Unicode code points: 4792 [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<= 4793 >?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f- 4794 \xaa\xac-\xba\xbc-\xff], U+0100..2017, 4795 U+2020..2038, U+203B..2E41, 4796 U+2E43..300B, U+3010..301C ...) 4797 \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30: 4798 [\"\'\xab\xbb], U+2018..201F, 4799 U+2039..203A, U+2E42, U+300C..300F, 4800 U+301D..301F ...) 4801 \p{Radical} \p{Radical=Y} (329) 4802 \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all 4803 above-Unicode code points: U+0000..2E7F, 4804 U+2E9A, U+2EF4..2EFF, U+2FD6..infinity) 4805 \p{Radical: Y*} (Single: \p{Radical}) (329: U+2E80..2E99, 4806 U+2E9B..2EF3, U+2F00..2FD5) 4807 \p{Regional_Indicator} \p{Regional_Indicator=Y} (Short: \p{RI}) 4808 (26) 4809 \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086 4810 plus all above-Unicode code points: 4811 U+0000..1F1E5, U+1F200..infinity) 4812 \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26: 4813 U+1F1E6..1F1FF) 4814 \p{Rejang} \p{Script_Extensions=Rejang} (Short: 4815 \p{Rjng}; NOT \p{Block=Rejang}) (37) 4816 \p{RI} \p{Regional_Indicator} (= 4817 \p{Regional_Indicator=Y}) (26) 4818 \p{RI: *} \p{Regional_Indicator: *} 4819 \p{Rjng} \p{Rejang} (= \p{Script_Extensions= 4820 Rejang}) (NOT \p{Block=Rejang}) (37) 4821 \p{Rohg} \p{Hanifi_Rohingya} (= 4822 \p{Script_Extensions=Hanifi_Rohingya}) 4823 (NOT \p{Block=Hanifi_Rohingya}) (55) 4824 X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block= 4825 Rumi_Numeral_Symbols}) (32) 4826 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short: 4827 \p{InRumi}) (32) 4828 \p{Runic} \p{Script_Extensions=Runic} (Short: 4829 \p{Runr}; NOT \p{Block=Runic}) (86) 4830 \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic}) 4831 (NOT \p{Block=Runic}) (86) 4832 \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol}) 4833 (7292) 4834 \p{Samaritan} \p{Script_Extensions=Samaritan} (Short: 4835 \p{Samr}; NOT \p{Block=Samaritan}) (61) 4836 \p{Samr} \p{Samaritan} (= \p{Script_Extensions= 4837 Samaritan}) (NOT \p{Block=Samaritan}) 4838 (61) 4839 \p{Sarb} \p{Old_South_Arabian} (= 4840 \p{Script_Extensions=Old_South_Arabian}) 4841 (32) 4842 \p{Saur} \p{Saurashtra} (= \p{Script_Extensions= 4843 Saurashtra}) (NOT \p{Block=Saurashtra}) 4844 (82) 4845 \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short: 4846 \p{Saur}; NOT \p{Block=Saurashtra}) (82) 4847 \p{SB: *} \p{Sentence_Break: *} 4848 \p{Sc} \p{Currency_Symbol} (= 4849 \p{General_Category=Currency_Symbol}) 4850 (62) 4851 \p{Sc: *} \p{Script: *} 4852 \p{Script: Adlam} (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B, 4853 U+1E950..1E959, U+1E95E..1E95F) 4854 \p{Script: Adlm} \p{Script=Adlam} (88) 4855 \p{Script: Aghb} \p{Script=Caucasian_Albanian} (= 4856 \p{Script_Extensions= 4857 Caucasian_Albanian}) (53) 4858 \p{Script: Ahom} \p{Script_Extensions=Ahom} (Short: \p{Sc= 4859 Ahom}, \p{Ahom}) (58) 4860 \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions= 4861 Anatolian_Hieroglyphs} (Short: \p{Sc= 4862 Hluw}, \p{Hluw}) (583) 4863 \p{Script: Arab} \p{Script=Arabic} (1281) 4864 \p{Script: Arabic} (Short: \p{Sc=Arab}) (1281: U+0600..0604, 4865 U+0606..060B, U+060D..061A, U+061C, 4866 U+061E, U+0620..063F ...) 4867 \p{Script: Armenian} (Short: \p{Sc=Armn}) (95: U+0531..0556, 4868 U+0559..0588, U+058A, U+058D..058F, 4869 U+FB13..FB17) 4870 \p{Script: Armi} \p{Script=Imperial_Aramaic} (= 4871 \p{Script_Extensions=Imperial_Aramaic}) 4872 (31) 4873 \p{Script: Armn} \p{Script=Armenian} (95) 4874 \p{Script: Avestan} \p{Script_Extensions=Avestan} (Short: 4875 \p{Sc=Avst}, \p{Avst}) (61) 4876 \p{Script: Avst} \p{Script=Avestan} (= 4877 \p{Script_Extensions=Avestan}) (61) 4878 \p{Script: Bali} \p{Script=Balinese} (= 4879 \p{Script_Extensions=Balinese}) (121) 4880 \p{Script: Balinese} \p{Script_Extensions=Balinese} (Short: 4881 \p{Sc=Bali}, \p{Bali}) (121) 4882 \p{Script: Bamu} \p{Script=Bamum} (= \p{Script_Extensions= 4883 Bamum}) (657) 4884 \p{Script: Bamum} \p{Script_Extensions=Bamum} (Short: \p{Sc= 4885 Bamu}, \p{Bamu}) (657) 4886 \p{Script: Bass} \p{Script=Bassa_Vah} (= 4887 \p{Script_Extensions=Bassa_Vah}) (36) 4888 \p{Script: Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short: 4889 \p{Sc=Bass}, \p{Bass}) (36) 4890 \p{Script: Batak} \p{Script_Extensions=Batak} (Short: \p{Sc= 4891 Batk}, \p{Batk}) (56) 4892 \p{Script: Batk} \p{Script=Batak} (= \p{Script_Extensions= 4893 Batak}) (56) 4894 \p{Script: Beng} \p{Script=Bengali} (96) 4895 \p{Script: Bengali} (Short: \p{Sc=Beng}) (96: U+0980..0983, 4896 U+0985..098C, U+098F..0990, 4897 U+0993..09A8, U+09AA..09B0, U+09B2 ...) 4898 \p{Script: Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short: 4899 \p{Sc=Bhks}, \p{Bhks}) (97) 4900 \p{Script: Bhks} \p{Script=Bhaiksuki} (= 4901 \p{Script_Extensions=Bhaiksuki}) (97) 4902 \p{Script: Bopo} \p{Script=Bopomofo} (72) 4903 \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (72: U+02EA..02EB, 4904 U+3105..312F, U+31A0..31BA) 4905 \p{Script: Brah} \p{Script=Brahmi} (= \p{Script_Extensions= 4906 Brahmi}) (109) 4907 \p{Script: Brahmi} \p{Script_Extensions=Brahmi} (Short: 4908 \p{Sc=Brah}, \p{Brah}) (109) 4909 \p{Script: Brai} \p{Script=Braille} (= 4910 \p{Script_Extensions=Braille}) (256) 4911 \p{Script: Braille} \p{Script_Extensions=Braille} (Short: 4912 \p{Sc=Brai}, \p{Brai}) (256) 4913 \p{Script: Bugi} \p{Script=Buginese} (30) 4914 \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B, 4915 U+1A1E..1A1F) 4916 \p{Script: Buhd} \p{Script=Buhid} (20) 4917 \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20: U+1740..1753) 4918 \p{Script: Cakm} \p{Script=Chakma} (70) 4919 \p{Script: Canadian_Aboriginal} \p{Script_Extensions= 4920 Canadian_Aboriginal} (Short: \p{Sc= 4921 Cans}, \p{Cans}) (710) 4922 \p{Script: Cans} \p{Script=Canadian_Aboriginal} (= 4923 \p{Script_Extensions= 4924 Canadian_Aboriginal}) (710) 4925 \p{Script: Cari} \p{Script=Carian} (= \p{Script_Extensions= 4926 Carian}) (49) 4927 \p{Script: Carian} \p{Script_Extensions=Carian} (Short: 4928 \p{Sc=Cari}, \p{Cari}) (49) 4929 \p{Script: Caucasian_Albanian} \p{Script_Extensions= 4930 Caucasian_Albanian} (Short: \p{Sc=Aghb}, 4931 \p{Aghb}) (53) 4932 \p{Script: Chakma} (Short: \p{Sc=Cakm}) (70: U+11100..11134, 4933 U+11136..11146) 4934 \p{Script: Cham} \p{Script_Extensions=Cham} (Short: \p{Sc= 4935 Cham}, \p{Cham}) (83) 4936 \p{Script: Cher} \p{Script=Cherokee} (= 4937 \p{Script_Extensions=Cherokee}) (172) 4938 \p{Script: Cherokee} \p{Script_Extensions=Cherokee} (Short: 4939 \p{Sc=Cher}, \p{Cher}) (172) 4940 \p{Script: Common} (Short: \p{Sc=Zyyy}) (7805: [\x00- 4941 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 4942 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab- 4943 \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF, 4944 U+02E5..02E9, U+02EC..02FF, U+0374, 4945 U+037E ...) 4946 \p{Script: Copt} \p{Script=Coptic} (137) 4947 \p{Script: Coptic} (Short: \p{Sc=Copt}) (137: U+03E2..03EF, 4948 U+2C80..2CF3, U+2CF9..2CFF) 4949 \p{Script: Cprt} \p{Script=Cypriot} (55) 4950 \p{Script: Cuneiform} \p{Script_Extensions=Cuneiform} (Short: 4951 \p{Sc=Xsux}, \p{Xsux}) (1234) 4952 \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55: U+10800..10805, 4953 U+10808, U+1080A..10835, U+10837..10838, 4954 U+1083C, U+1083F) 4955 \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (443: U+0400..0484, 4956 U+0487..052F, U+1C80..1C88, U+1D2B, 4957 U+1D78, U+2DE0..2DFF ...) 4958 \p{Script: Cyrl} \p{Script=Cyrillic} (443) 4959 \p{Script: Deseret} \p{Script_Extensions=Deseret} (Short: 4960 \p{Sc=Dsrt}, \p{Dsrt}) (80) 4961 \p{Script: Deva} \p{Script=Devanagari} (154) 4962 \p{Script: Devanagari} (Short: \p{Sc=Deva}) (154: U+0900..0950, 4963 U+0955..0963, U+0966..097F, U+A8E0..A8FF) 4964 \p{Script: Dogr} \p{Script=Dogra} (60) 4965 \p{Script: Dogra} (Short: \p{Sc=Dogr}) (60: U+11800..1183B) 4966 \p{Script: Dsrt} \p{Script=Deseret} (= 4967 \p{Script_Extensions=Deseret}) (80) 4968 \p{Script: Dupl} \p{Script=Duployan} (143) 4969 \p{Script: Duployan} (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A, 4970 U+1BC70..1BC7C, U+1BC80..1BC88, 4971 U+1BC90..1BC99, U+1BC9C..1BC9F) 4972 \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (= 4973 \p{Script_Extensions= 4974 Egyptian_Hieroglyphs}) (1080) 4975 \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions= 4976 Egyptian_Hieroglyphs} (Short: \p{Sc= 4977 Egyp}, \p{Egyp}) (1080) 4978 \p{Script: Elba} \p{Script=Elbasan} (= 4979 \p{Script_Extensions=Elbasan}) (40) 4980 \p{Script: Elbasan} \p{Script_Extensions=Elbasan} (Short: 4981 \p{Sc=Elba}, \p{Elba}) (40) 4982 \p{Script: Elym} \p{Script=Elymaic} (= 4983 \p{Script_Extensions=Elymaic}) (23) 4984 \p{Script: Elymaic} \p{Script_Extensions=Elymaic} (Short: 4985 \p{Sc=Elym}, \p{Elym}) (23) 4986 \p{Script: Ethi} \p{Script=Ethiopic} (= 4987 \p{Script_Extensions=Ethiopic}) (495) 4988 \p{Script: Ethiopic} \p{Script_Extensions=Ethiopic} (Short: 4989 \p{Sc=Ethi}, \p{Ethi}) (495) 4990 \p{Script: Geor} \p{Script=Georgian} (173) 4991 \p{Script: Georgian} (Short: \p{Sc=Geor}) (173: U+10A0..10C5, 4992 U+10C7, U+10CD, U+10D0..10FA, 4993 U+10FC..10FF, U+1C90..1CBA ...) 4994 \p{Script: Glag} \p{Script=Glagolitic} (132) 4995 \p{Script: Glagolitic} (Short: \p{Sc=Glag}) (132: U+2C00..2C2E, 4996 U+2C30..2C5E, U+1E000..1E006, 4997 U+1E008..1E018, U+1E01B..1E021, 4998 U+1E023..1E024 ...) 4999 \p{Script: Gong} \p{Script=Gunjala_Gondi} (63) 5000 \p{Script: Gonm} \p{Script=Masaram_Gondi} (75) 5001 \p{Script: Goth} \p{Script=Gothic} (= \p{Script_Extensions= 5002 Gothic}) (27) 5003 \p{Script: Gothic} \p{Script_Extensions=Gothic} (Short: 5004 \p{Sc=Goth}, \p{Goth}) (27) 5005 \p{Script: Gran} \p{Script=Grantha} (85) 5006 \p{Script: Grantha} (Short: \p{Sc=Gran}) (85: U+11300..11303, 5007 U+11305..1130C, U+1130F..11310, 5008 U+11313..11328, U+1132A..11330, 5009 U+11332..11333 ...) 5010 \p{Script: Greek} (Short: \p{Sc=Grek}) (518: U+0370..0373, 5011 U+0375..0377, U+037A..037D, U+037F, 5012 U+0384, U+0386 ...) 5013 \p{Script: Grek} \p{Script=Greek} (518) 5014 \p{Script: Gujarati} (Short: \p{Sc=Gujr}) (91: U+0A81..0A83, 5015 U+0A85..0A8D, U+0A8F..0A91, 5016 U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3 5017 ...) 5018 \p{Script: Gujr} \p{Script=Gujarati} (91) 5019 \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63: 5020 U+11D60..11D65, U+11D67..11D68, 5021 U+11D6A..11D8E, U+11D90..11D91, 5022 U+11D93..11D98, U+11DA0..11DA9) 5023 \p{Script: Gurmukhi} (Short: \p{Sc=Guru}) (80: U+0A01..0A03, 5024 U+0A05..0A0A, U+0A0F..0A10, 5025 U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33 5026 ...) 5027 \p{Script: Guru} \p{Script=Gurmukhi} (80) 5028 \p{Script: Han} (Short: \p{Sc=Han}) (89_233: U+2E80..2E99, 5029 U+2E9B..2EF3, U+2F00..2FD5, U+3005, 5030 U+3007, U+3021..3029 ...) 5031 \p{Script: Hang} \p{Script=Hangul} (11_739) 5032 \p{Script: Hangul} (Short: \p{Sc=Hang}) (11_739: 5033 U+1100..11FF, U+302E..302F, 5034 U+3131..318E, U+3200..321E, 5035 U+3260..327E, U+A960..A97C ...) 5036 \p{Script: Hani} \p{Script=Han} (89_233) 5037 \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50: 5038 U+10D00..10D27, U+10D30..10D39) 5039 \p{Script: Hano} \p{Script=Hanunoo} (21) 5040 \p{Script: Hanunoo} (Short: \p{Sc=Hano}) (21: U+1720..1734) 5041 \p{Script: Hatr} \p{Script=Hatran} (= \p{Script_Extensions= 5042 Hatran}) (26) 5043 \p{Script: Hatran} \p{Script_Extensions=Hatran} (Short: 5044 \p{Sc=Hatr}, \p{Hatr}) (26) 5045 \p{Script: Hebr} \p{Script=Hebrew} (= \p{Script_Extensions= 5046 Hebrew}) (134) 5047 \p{Script: Hebrew} \p{Script_Extensions=Hebrew} (Short: 5048 \p{Sc=Hebr}, \p{Hebr}) (134) 5049 \p{Script: Hira} \p{Script=Hiragana} (379) 5050 \p{Script: Hiragana} (Short: \p{Sc=Hira}) (379: U+3041..3096, 5051 U+309D..309F, U+1B001..1B11E, 5052 U+1B150..1B152, U+1F200) 5053 \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (= 5054 \p{Script_Extensions= 5055 Anatolian_Hieroglyphs}) (583) 5056 \p{Script: Hmng} \p{Script=Pahawh_Hmong} (= 5057 \p{Script_Extensions=Pahawh_Hmong}) (127) 5058 \p{Script: Hmnp} \p{Script=Nyiakeng_Puachue_Hmong} (= 5059 \p{Script_Extensions= 5060 Nyiakeng_Puachue_Hmong}) (71) 5061 \p{Script: Hung} \p{Script=Old_Hungarian} (= 5062 \p{Script_Extensions=Old_Hungarian}) 5063 (108) 5064 \p{Script: Imperial_Aramaic} \p{Script_Extensions= 5065 Imperial_Aramaic} (Short: \p{Sc=Armi}, 5066 \p{Armi}) (31) 5067 \p{Script: Inherited} (Short: \p{Sc=Zinh}) (571: U+0300..036F, 5068 U+0485..0486, U+064B..0655, U+0670, 5069 U+0951..0954, U+1AB0..1ABE ...) 5070 \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions= 5071 Inscriptional_Pahlavi} (Short: \p{Sc= 5072 Phli}, \p{Phli}) (27) 5073 \p{Script: Inscriptional_Parthian} \p{Script_Extensions= 5074 Inscriptional_Parthian} (Short: \p{Sc= 5075 Prti}, \p{Prti}) (30) 5076 \p{Script: Ital} \p{Script=Old_Italic} (= 5077 \p{Script_Extensions=Old_Italic}) (39) 5078 \p{Script: Java} \p{Script=Javanese} (90) 5079 \p{Script: Javanese} (Short: \p{Sc=Java}) (90: U+A980..A9CD, 5080 U+A9D0..A9D9, U+A9DE..A9DF) 5081 \p{Script: Kaithi} (Short: \p{Sc=Kthi}) (67: U+11080..110C1, 5082 U+110CD) 5083 \p{Script: Kali} \p{Script=Kayah_Li} (47) 5084 \p{Script: Kana} \p{Script=Katakana} (304) 5085 \p{Script: Kannada} (Short: \p{Sc=Knda}) (89: U+0C80..0C8C, 5086 U+0C8E..0C90, U+0C92..0CA8, 5087 U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4 5088 ...) 5089 \p{Script: Katakana} (Short: \p{Sc=Kana}) (304: U+30A1..30FA, 5090 U+30FD..30FF, U+31F0..31FF, 5091 U+32D0..32FE, U+3300..3357, U+FF66..FF6F 5092 ...) 5093 \p{Script: Kayah_Li} (Short: \p{Sc=Kali}) (47: U+A900..A92D, 5094 U+A92F) 5095 \p{Script: Khar} \p{Script=Kharoshthi} (= 5096 \p{Script_Extensions=Kharoshthi}) (68) 5097 \p{Script: Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short: 5098 \p{Sc=Khar}, \p{Khar}) (68) 5099 \p{Script: Khmer} \p{Script_Extensions=Khmer} (Short: \p{Sc= 5100 Khmr}, \p{Khmr}) (146) 5101 \p{Script: Khmr} \p{Script=Khmer} (= \p{Script_Extensions= 5102 Khmer}) (146) 5103 \p{Script: Khoj} \p{Script=Khojki} (62) 5104 \p{Script: Khojki} (Short: \p{Sc=Khoj}) (62: U+11200..11211, 5105 U+11213..1123E) 5106 \p{Script: Khudawadi} (Short: \p{Sc=Sind}) (69: U+112B0..112EA, 5107 U+112F0..112F9) 5108 \p{Script: Knda} \p{Script=Kannada} (89) 5109 \p{Script: Kthi} \p{Script=Kaithi} (67) 5110 \p{Script: Lana} \p{Script=Tai_Tham} (= 5111 \p{Script_Extensions=Tai_Tham}) (127) 5112 \p{Script: Lao} \p{Script_Extensions=Lao} (Short: \p{Sc= 5113 Lao}, \p{Lao}) (82) 5114 \p{Script: Laoo} \p{Script=Lao} (= \p{Script_Extensions= 5115 Lao}) (82) 5116 \p{Script: Latin} (Short: \p{Sc=Latn}) (1366: [A-Za- 5117 z\xaa\xba\xc0-\xd6\xd8-\xf6\xf8-\xff], 5118 U+0100..02B8, U+02E0..02E4, 5119 U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65 5120 ...) 5121 \p{Script: Latn} \p{Script=Latin} (1366) 5122 \p{Script: Lepc} \p{Script=Lepcha} (= \p{Script_Extensions= 5123 Lepcha}) (74) 5124 \p{Script: Lepcha} \p{Script_Extensions=Lepcha} (Short: 5125 \p{Sc=Lepc}, \p{Lepc}) (74) 5126 \p{Script: Limb} \p{Script=Limbu} (68) 5127 \p{Script: Limbu} (Short: \p{Sc=Limb}) (68: U+1900..191E, 5128 U+1920..192B, U+1930..193B, U+1940, 5129 U+1944..194F) 5130 \p{Script: Lina} \p{Script=Linear_A} (341) 5131 \p{Script: Linb} \p{Script=Linear_B} (211) 5132 \p{Script: Linear_A} (Short: \p{Sc=Lina}) (341: U+10600..10736, 5133 U+10740..10755, U+10760..10767) 5134 \p{Script: Linear_B} (Short: \p{Sc=Linb}) (211: U+10000..1000B, 5135 U+1000D..10026, U+10028..1003A, 5136 U+1003C..1003D, U+1003F..1004D, 5137 U+10050..1005D ...) 5138 \p{Script: Lisu} \p{Script_Extensions=Lisu} (Short: \p{Sc= 5139 Lisu}, \p{Lisu}) (48) 5140 \p{Script: Lyci} \p{Script=Lycian} (= \p{Script_Extensions= 5141 Lycian}) (29) 5142 \p{Script: Lycian} \p{Script_Extensions=Lycian} (Short: 5143 \p{Sc=Lyci}, \p{Lyci}) (29) 5144 \p{Script: Lydi} \p{Script=Lydian} (= \p{Script_Extensions= 5145 Lydian}) (27) 5146 \p{Script: Lydian} \p{Script_Extensions=Lydian} (Short: 5147 \p{Sc=Lydi}, \p{Lydi}) (27) 5148 \p{Script: Mahajani} (Short: \p{Sc=Mahj}) (39: U+11150..11176) 5149 \p{Script: Mahj} \p{Script=Mahajani} (39) 5150 \p{Script: Maka} \p{Script=Makasar} (= 5151 \p{Script_Extensions=Makasar}) (25) 5152 \p{Script: Makasar} \p{Script_Extensions=Makasar} (Short: 5153 \p{Sc=Maka}, \p{Maka}) (25) 5154 \p{Script: Malayalam} (Short: \p{Sc=Mlym}) (117: U+0D00..0D03, 5155 U+0D05..0D0C, U+0D0E..0D10, 5156 U+0D12..0D44, U+0D46..0D48, U+0D4A..0D4F 5157 ...) 5158 \p{Script: Mand} \p{Script=Mandaic} (29) 5159 \p{Script: Mandaic} (Short: \p{Sc=Mand}) (29: U+0840..085B, 5160 U+085E) 5161 \p{Script: Mani} \p{Script=Manichaean} (51) 5162 \p{Script: Manichaean} (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6, 5163 U+10AEB..10AF6) 5164 \p{Script: Marc} \p{Script=Marchen} (= 5165 \p{Script_Extensions=Marchen}) (68) 5166 \p{Script: Marchen} \p{Script_Extensions=Marchen} (Short: 5167 \p{Sc=Marc}, \p{Marc}) (68) 5168 \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75: 5169 U+11D00..11D06, U+11D08..11D09, 5170 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D, 5171 U+11D3F..11D47 ...) 5172 \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short: 5173 \p{Sc=Medf}, \p{Medf}) (91) 5174 \p{Script: Medf} \p{Script=Medefaidrin} (= 5175 \p{Script_Extensions=Medefaidrin}) (91) 5176 \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} 5177 (Short: \p{Sc=Mtei}, \p{Mtei}) (79) 5178 \p{Script: Mend} \p{Script=Mende_Kikakui} (= 5179 \p{Script_Extensions=Mende_Kikakui}) 5180 (213) 5181 \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui} 5182 (Short: \p{Sc=Mend}, \p{Mend}) (213) 5183 \p{Script: Merc} \p{Script=Meroitic_Cursive} (= 5184 \p{Script_Extensions=Meroitic_Cursive}) 5185 (90) 5186 \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (= 5187 \p{Script_Extensions= 5188 Meroitic_Hieroglyphs}) (32) 5189 \p{Script: Meroitic_Cursive} \p{Script_Extensions= 5190 Meroitic_Cursive} (Short: \p{Sc=Merc}, 5191 \p{Merc}) (90) 5192 \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions= 5193 Meroitic_Hieroglyphs} (Short: \p{Sc= 5194 Mero}, \p{Mero}) (32) 5195 \p{Script: Miao} \p{Script_Extensions=Miao} (Short: \p{Sc= 5196 Miao}, \p{Miao}) (149) 5197 \p{Script: Mlym} \p{Script=Malayalam} (117) 5198 \p{Script: Modi} (Short: \p{Sc=Modi}) (79: U+11600..11644, 5199 U+11650..11659) 5200 \p{Script: Mong} \p{Script=Mongolian} (167) 5201 \p{Script: Mongolian} (Short: \p{Sc=Mong}) (167: U+1800..1801, 5202 U+1804, U+1806..180E, U+1810..1819, 5203 U+1820..1878, U+1880..18AA ...) 5204 \p{Script: Mro} \p{Script_Extensions=Mro} (Short: \p{Sc= 5205 Mro}, \p{Mro}) (43) 5206 \p{Script: Mroo} \p{Script=Mro} (= \p{Script_Extensions= 5207 Mro}) (43) 5208 \p{Script: Mtei} \p{Script=Meetei_Mayek} (= 5209 \p{Script_Extensions=Meetei_Mayek}) (79) 5210 \p{Script: Mult} \p{Script=Multani} (38) 5211 \p{Script: Multani} (Short: \p{Sc=Mult}) (38: U+11280..11286, 5212 U+11288, U+1128A..1128D, U+1128F..1129D, 5213 U+1129F..112A9) 5214 \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223: U+1000..109F, 5215 U+A9E0..A9FE, U+AA60..AA7F) 5216 \p{Script: Mymr} \p{Script=Myanmar} (223) 5217 \p{Script: Nabataean} \p{Script_Extensions=Nabataean} (Short: 5218 \p{Sc=Nbat}, \p{Nbat}) (40) 5219 \p{Script: Nand} \p{Script=Nandinagari} (65) 5220 \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7, 5221 U+119AA..119D7, U+119DA..119E4) 5222 \p{Script: Narb} \p{Script=Old_North_Arabian} (= 5223 \p{Script_Extensions=Old_North_Arabian}) 5224 (32) 5225 \p{Script: Nbat} \p{Script=Nabataean} (= 5226 \p{Script_Extensions=Nabataean}) (40) 5227 \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short: 5228 \p{Sc=Talu}, \p{Talu}) (83) 5229 \p{Script: Newa} \p{Script_Extensions=Newa} (Short: \p{Sc= 5230 Newa}, \p{Newa}) (94) 5231 \p{Script: Nko} \p{Script_Extensions=Nko} (Short: \p{Sc= 5232 Nko}, \p{Nko}) (62) 5233 \p{Script: Nkoo} \p{Script=Nko} (= \p{Script_Extensions= 5234 Nko}) (62) 5235 \p{Script: Nshu} \p{Script=Nushu} (= \p{Script_Extensions= 5236 Nushu}) (397) 5237 \p{Script: Nushu} \p{Script_Extensions=Nushu} (Short: \p{Sc= 5238 Nshu}, \p{Nshu}) (397) 5239 \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions= 5240 Nyiakeng_Puachue_Hmong} (Short: \p{Sc= 5241 Hmnp}, \p{Hmnp}) (71) 5242 \p{Script: Ogam} \p{Script=Ogham} (= \p{Script_Extensions= 5243 Ogham}) (29) 5244 \p{Script: Ogham} \p{Script_Extensions=Ogham} (Short: \p{Sc= 5245 Ogam}, \p{Ogam}) (29) 5246 \p{Script: Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short: 5247 \p{Sc=Olck}, \p{Olck}) (48) 5248 \p{Script: Olck} \p{Script=Ol_Chiki} (= 5249 \p{Script_Extensions=Ol_Chiki}) (48) 5250 \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian} 5251 (Short: \p{Sc=Hung}, \p{Hung}) (108) 5252 \p{Script: Old_Italic} \p{Script_Extensions=Old_Italic} (Short: 5253 \p{Sc=Ital}, \p{Ital}) (39) 5254 \p{Script: Old_North_Arabian} \p{Script_Extensions= 5255 Old_North_Arabian} (Short: \p{Sc=Narb}, 5256 \p{Narb}) (32) 5257 \p{Script: Old_Permic} (Short: \p{Sc=Perm}) (43: U+10350..1037A) 5258 \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short: 5259 \p{Sc=Xpeo}, \p{Xpeo}) (50) 5260 \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short: 5261 \p{Sc=Sogo}, \p{Sogo}) (40) 5262 \p{Script: Old_South_Arabian} \p{Script_Extensions= 5263 Old_South_Arabian} (Short: \p{Sc=Sarb}, 5264 \p{Sarb}) (32) 5265 \p{Script: Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short: 5266 \p{Sc=Orkh}, \p{Orkh}) (73) 5267 \p{Script: Oriya} (Short: \p{Sc=Orya}) (90: U+0B01..0B03, 5268 U+0B05..0B0C, U+0B0F..0B10, 5269 U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33 5270 ...) 5271 \p{Script: Orkh} \p{Script=Old_Turkic} (= 5272 \p{Script_Extensions=Old_Turkic}) (73) 5273 \p{Script: Orya} \p{Script=Oriya} (90) 5274 \p{Script: Osage} \p{Script_Extensions=Osage} (Short: \p{Sc= 5275 Osge}, \p{Osge}) (72) 5276 \p{Script: Osge} \p{Script=Osage} (= \p{Script_Extensions= 5277 Osage}) (72) 5278 \p{Script: Osma} \p{Script=Osmanya} (= 5279 \p{Script_Extensions=Osmanya}) (40) 5280 \p{Script: Osmanya} \p{Script_Extensions=Osmanya} (Short: 5281 \p{Sc=Osma}, \p{Osma}) (40) 5282 \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} 5283 (Short: \p{Sc=Hmng}, \p{Hmng}) (127) 5284 \p{Script: Palm} \p{Script=Palmyrene} (= 5285 \p{Script_Extensions=Palmyrene}) (32) 5286 \p{Script: Palmyrene} \p{Script_Extensions=Palmyrene} (Short: 5287 \p{Sc=Palm}, \p{Palm}) (32) 5288 \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short: 5289 \p{Sc=Pauc}, \p{Pauc}) (57) 5290 \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (= 5291 \p{Script_Extensions=Pau_Cin_Hau}) (57) 5292 \p{Script: Perm} \p{Script=Old_Permic} (43) 5293 \p{Script: Phag} \p{Script=Phags_Pa} (56) 5294 \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56: U+A840..A877) 5295 \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (= 5296 \p{Script_Extensions= 5297 Inscriptional_Pahlavi}) (27) 5298 \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29) 5299 \p{Script: Phnx} \p{Script=Phoenician} (= 5300 \p{Script_Extensions=Phoenician}) (29) 5301 \p{Script: Phoenician} \p{Script_Extensions=Phoenician} (Short: 5302 \p{Sc=Phnx}, \p{Phnx}) (29) 5303 \p{Script: Plrd} \p{Script=Miao} (= \p{Script_Extensions= 5304 Miao}) (149) 5305 \p{Script: Prti} \p{Script=Inscriptional_Parthian} (= 5306 \p{Script_Extensions= 5307 Inscriptional_Parthian}) (30) 5308 \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29: 5309 U+10B80..10B91, U+10B99..10B9C, 5310 U+10BA9..10BAF) 5311 \p{Script: Qaac} \p{Script=Coptic} (137) 5312 \p{Script: Qaai} \p{Script=Inherited} (571) 5313 \p{Script: Rejang} \p{Script_Extensions=Rejang} (Short: 5314 \p{Sc=Rjng}, \p{Rjng}) (37) 5315 \p{Script: Rjng} \p{Script=Rejang} (= \p{Script_Extensions= 5316 Rejang}) (37) 5317 \p{Script: Rohg} \p{Script=Hanifi_Rohingya} (50) 5318 \p{Script: Runic} \p{Script_Extensions=Runic} (Short: \p{Sc= 5319 Runr}, \p{Runr}) (86) 5320 \p{Script: Runr} \p{Script=Runic} (= \p{Script_Extensions= 5321 Runic}) (86) 5322 \p{Script: Samaritan} \p{Script_Extensions=Samaritan} (Short: 5323 \p{Sc=Samr}, \p{Samr}) (61) 5324 \p{Script: Samr} \p{Script=Samaritan} (= 5325 \p{Script_Extensions=Samaritan}) (61) 5326 \p{Script: Sarb} \p{Script=Old_South_Arabian} (= 5327 \p{Script_Extensions=Old_South_Arabian}) 5328 (32) 5329 \p{Script: Saur} \p{Script=Saurashtra} (= 5330 \p{Script_Extensions=Saurashtra}) (82) 5331 \p{Script: Saurashtra} \p{Script_Extensions=Saurashtra} (Short: 5332 \p{Sc=Saur}, \p{Saur}) (82) 5333 \p{Script: Sgnw} \p{Script=SignWriting} (= 5334 \p{Script_Extensions=SignWriting}) (672) 5335 \p{Script: Sharada} (Short: \p{Sc=Shrd}) (94: U+11180..111CD, 5336 U+111D0..111DF) 5337 \p{Script: Shavian} \p{Script_Extensions=Shavian} (Short: 5338 \p{Sc=Shaw}, \p{Shaw}) (48) 5339 \p{Script: Shaw} \p{Script=Shavian} (= 5340 \p{Script_Extensions=Shavian}) (48) 5341 \p{Script: Shrd} \p{Script=Sharada} (94) 5342 \p{Script: Sidd} \p{Script=Siddham} (= 5343 \p{Script_Extensions=Siddham}) (92) 5344 \p{Script: Siddham} \p{Script_Extensions=Siddham} (Short: 5345 \p{Sc=Sidd}, \p{Sidd}) (92) 5346 \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short: 5347 \p{Sc=Sgnw}, \p{Sgnw}) (672) 5348 \p{Script: Sind} \p{Script=Khudawadi} (69) 5349 \p{Script: Sinh} \p{Script=Sinhala} (110) 5350 \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (110: U+0D82..0D83, 5351 U+0D85..0D96, U+0D9A..0DB1, 5352 U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...) 5353 \p{Script: Sogd} \p{Script=Sogdian} (42) 5354 \p{Script: Sogdian} (Short: \p{Sc=Sogd}) (42: U+10F30..10F59) 5355 \p{Script: Sogo} \p{Script=Old_Sogdian} (= 5356 \p{Script_Extensions=Old_Sogdian}) (40) 5357 \p{Script: Sora} \p{Script=Sora_Sompeng} (= 5358 \p{Script_Extensions=Sora_Sompeng}) (35) 5359 \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} 5360 (Short: \p{Sc=Sora}, \p{Sora}) (35) 5361 \p{Script: Soyo} \p{Script=Soyombo} (= 5362 \p{Script_Extensions=Soyombo}) (83) 5363 \p{Script: Soyombo} \p{Script_Extensions=Soyombo} (Short: 5364 \p{Sc=Soyo}, \p{Soyo}) (83) 5365 \p{Script: Sund} \p{Script=Sundanese} (= 5366 \p{Script_Extensions=Sundanese}) (72) 5367 \p{Script: Sundanese} \p{Script_Extensions=Sundanese} (Short: 5368 \p{Sc=Sund}, \p{Sund}) (72) 5369 \p{Script: Sylo} \p{Script=Syloti_Nagri} (44) 5370 \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44: U+A800..A82B) 5371 \p{Script: Syrc} \p{Script=Syriac} (88) 5372 \p{Script: Syriac} (Short: \p{Sc=Syrc}) (88: U+0700..070D, 5373 U+070F..074A, U+074D..074F, U+0860..086A) 5374 \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (20: U+1700..170C, 5375 U+170E..1714) 5376 \p{Script: Tagb} \p{Script=Tagbanwa} (18) 5377 \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18: U+1760..176C, 5378 U+176E..1770, U+1772..1773) 5379 \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35: U+1950..196D, 5380 U+1970..1974) 5381 \p{Script: Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short: 5382 \p{Sc=Lana}, \p{Lana}) (127) 5383 \p{Script: Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short: 5384 \p{Sc=Tavt}, \p{Tavt}) (72) 5385 \p{Script: Takr} \p{Script=Takri} (67) 5386 \p{Script: Takri} (Short: \p{Sc=Takr}) (67: U+11680..116B8, 5387 U+116C0..116C9) 5388 \p{Script: Tale} \p{Script=Tai_Le} (35) 5389 \p{Script: Talu} \p{Script=New_Tai_Lue} (= 5390 \p{Script_Extensions=New_Tai_Lue}) (83) 5391 \p{Script: Tamil} (Short: \p{Sc=Taml}) (123: U+0B82..0B83, 5392 U+0B85..0B8A, U+0B8E..0B90, 5393 U+0B92..0B95, U+0B99..0B9A, U+0B9C ...) 5394 \p{Script: Taml} \p{Script=Tamil} (123) 5395 \p{Script: Tang} \p{Script=Tangut} (= \p{Script_Extensions= 5396 Tangut}) (6892) 5397 \p{Script: Tangut} \p{Script_Extensions=Tangut} (Short: 5398 \p{Sc=Tang}, \p{Tang}) (6892) 5399 \p{Script: Tavt} \p{Script=Tai_Viet} (= 5400 \p{Script_Extensions=Tai_Viet}) (72) 5401 \p{Script: Telu} \p{Script=Telugu} (98) 5402 \p{Script: Telugu} (Short: \p{Sc=Telu}) (98: U+0C00..0C0C, 5403 U+0C0E..0C10, U+0C12..0C28, 5404 U+0C2A..0C39, U+0C3D..0C44, U+0C46..0C48 5405 ...) 5406 \p{Script: Tfng} \p{Script=Tifinagh} (= 5407 \p{Script_Extensions=Tifinagh}) (59) 5408 \p{Script: Tglg} \p{Script=Tagalog} (20) 5409 \p{Script: Thaa} \p{Script=Thaana} (50) 5410 \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50: U+0780..07B1) 5411 \p{Script: Thai} \p{Script_Extensions=Thai} (Short: \p{Sc= 5412 Thai}, \p{Thai}) (86) 5413 \p{Script: Tibetan} \p{Script_Extensions=Tibetan} (Short: 5414 \p{Sc=Tibt}, \p{Tibt}) (207) 5415 \p{Script: Tibt} \p{Script=Tibetan} (= 5416 \p{Script_Extensions=Tibetan}) (207) 5417 \p{Script: Tifinagh} \p{Script_Extensions=Tifinagh} (Short: 5418 \p{Sc=Tfng}, \p{Tfng}) (59) 5419 \p{Script: Tirh} \p{Script=Tirhuta} (82) 5420 \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82: U+11480..114C7, 5421 U+114D0..114D9) 5422 \p{Script: Ugar} \p{Script=Ugaritic} (= 5423 \p{Script_Extensions=Ugaritic}) (31) 5424 \p{Script: Ugaritic} \p{Script_Extensions=Ugaritic} (Short: 5425 \p{Sc=Ugar}, \p{Ugar}) (31) 5426 \p{Script: Unknown} \p{Script_Extensions=Unknown} (Short: 5427 \p{Sc=Zzzz}, \p{Zzzz}) (976_118 plus all 5428 above-Unicode code points) 5429 \p{Script: Vai} \p{Script_Extensions=Vai} (Short: \p{Sc= 5430 Vai}, \p{Vai}) (300) 5431 \p{Script: Vaii} \p{Script=Vai} (= \p{Script_Extensions= 5432 Vai}) (300) 5433 \p{Script: Wancho} \p{Script_Extensions=Wancho} (Short: 5434 \p{Sc=Wcho}, \p{Wcho}) (59) 5435 \p{Script: Wara} \p{Script=Warang_Citi} (= 5436 \p{Script_Extensions=Warang_Citi}) (84) 5437 \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short: 5438 \p{Sc=Wara}, \p{Wara}) (84) 5439 \p{Script: Wcho} \p{Script=Wancho} (= \p{Script_Extensions= 5440 Wancho}) (59) 5441 \p{Script: Xpeo} \p{Script=Old_Persian} (= 5442 \p{Script_Extensions=Old_Persian}) (50) 5443 \p{Script: Xsux} \p{Script=Cuneiform} (= 5444 \p{Script_Extensions=Cuneiform}) (1234) 5445 \p{Script: Yi} (Short: \p{Sc=Yi}) (1220: U+A000..A48C, 5446 U+A490..A4C6) 5447 \p{Script: Yiii} \p{Script=Yi} (1220) 5448 \p{Script: Zanabazar_Square} \p{Script_Extensions= 5449 Zanabazar_Square} (Short: \p{Sc=Zanb}, 5450 \p{Zanb}) (72) 5451 \p{Script: Zanb} \p{Script=Zanabazar_Square} (= 5452 \p{Script_Extensions=Zanabazar_Square}) 5453 (72) 5454 \p{Script: Zinh} \p{Script=Inherited} (571) 5455 \p{Script: Zyyy} \p{Script=Common} (7805) 5456 \p{Script: Zzzz} \p{Script=Unknown} (= 5457 \p{Script_Extensions=Unknown}) (976_118 5458 plus all above-Unicode code points) 5459 \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (89: 5460 U+0640, U+1E900..1E94B, U+1E950..1E959, 5461 U+1E95E..1E95F) 5462 \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (89) 5463 \p{Script_Extensions: Aghb} \p{Script_Extensions= 5464 Caucasian_Albanian} (53) 5465 \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (58: 5466 U+11700..1171A, U+1171D..1172B, 5467 U+11730..1173F) 5468 \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw}, 5469 \p{Hluw}) (583: U+14400..14646) 5470 \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1325) 5471 \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab}) 5472 (1325: U+0600..0604, U+0606..061C, 5473 U+061E..06DC, U+06DE..06FF, 5474 U+0750..077F, U+08A0..08B4 ...) 5475 \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn}) 5476 (96: U+0531..0556, U+0559..058A, 5477 U+058D..058F, U+FB13..FB17) 5478 \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic} 5479 (31) 5480 \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96) 5481 \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst}) 5482 (61: U+10B00..10B35, U+10B39..10B3F) 5483 \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61) 5484 \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121) 5485 \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali}) 5486 (121: U+1B00..1B4B, U+1B50..1B7C) 5487 \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657) 5488 \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657: 5489 U+A6A0..A6F7, U+16800..16A38) 5490 \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36) 5491 \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass}) 5492 (36: U+16AD0..16AED, U+16AF0..16AF5) 5493 \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56: 5494 U+1BC0..1BF3, U+1BFC..1BFF) 5495 \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56) 5496 \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113) 5497 \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng}) 5498 (113: U+0951..0952, U+0964..0965, 5499 U+0980..0983, U+0985..098C, 5500 U+098F..0990, U+0993..09A8 ...) 5501 \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks}) 5502 (97: U+11C00..11C08, U+11C0A..11C36, 5503 U+11C38..11C45, U+11C50..11C6C) 5504 \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97) 5505 \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (112) 5506 \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo}) 5507 (112: U+02EA..02EB, U+3001..3003, 5508 U+3008..3011, U+3013..301F, 5509 U+302A..302D, U+3030 ...) 5510 \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109) 5511 \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah}) 5512 (109: U+11000..1104D, U+11052..1106F, 5513 U+1107F) 5514 \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256) 5515 \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai}) 5516 (256: U+2800..28FF) 5517 \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31) 5518 \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi}) 5519 (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF) 5520 \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22) 5521 \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22: 5522 U+1735..1736, U+1740..1753) 5523 \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (90) 5524 \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans}, 5525 \p{Cans}) (710: U+1400..167F, 5526 U+18B0..18F5) 5527 \p{Script_Extensions: Cans} \p{Script_Extensions= 5528 Canadian_Aboriginal} (710) 5529 \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49) 5530 \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49: 5531 U+102A0..102D0) 5532 \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb}, 5533 \p{Aghb}) (53: U+10530..10563, U+1056F) 5534 \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (90: 5535 U+09E6..09EF, U+1040..1049, 5536 U+11100..11134, U+11136..11146) 5537 \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83: 5538 U+AA00..AA36, U+AA40..AA4D, 5539 U+AA50..AA59, U+AA5C..AA5F) 5540 \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172) 5541 \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher}) 5542 (172: U+13A0..13F5, U+13F8..13FD, 5543 U+AB70..ABBF) 5544 \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy}) 5545 (7386: [\x00- 5546 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 5547 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab- 5548 \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF, 5549 U+02E5..02E9, U+02EC..02FF, U+0374, 5550 U+037E ...) 5551 \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165) 5552 \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt}) 5553 (165: U+03E2..03EF, U+2C80..2CF3, 5554 U+2CF9..2CFF, U+102E0..102FB) 5555 \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112) 5556 \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux}) 5557 (1234: U+12000..12399, U+12400..1246E, 5558 U+12470..12474, U+12480..12543) 5559 \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt}) 5560 (112: U+10100..10102, U+10107..10133, 5561 U+10137..1013F, U+10800..10805, U+10808, 5562 U+1080A..10835 ...) 5563 \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl}) 5564 (446: U+0400..052F, U+1C80..1C88, 5565 U+1D2B, U+1D78, U+2DE0..2DFF, U+2E43 ...) 5566 \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446) 5567 \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt}) 5568 (80: U+10400..1044F) 5569 \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210) 5570 \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva}) 5571 (210: U+0900..0952, U+0955..097F, 5572 U+1CD0..1CF6, U+1CF8..1CF9, U+20F0, 5573 U+A830..A839 ...) 5574 \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82) 5575 \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82: 5576 U+0964..096F, U+A830..A839, 5577 U+11800..1183B) 5578 \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80) 5579 \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147) 5580 \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl}) 5581 (147: U+1BC00..1BC6A, U+1BC70..1BC7C, 5582 U+1BC80..1BC88, U+1BC90..1BC99, 5583 U+1BC9C..1BCA3) 5584 \p{Script_Extensions: Egyp} \p{Script_Extensions= 5585 Egyptian_Hieroglyphs} (1080) 5586 \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp}, 5587 \p{Egyp}) (1080: U+13000..1342E, 5588 U+13430..13438) 5589 \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40) 5590 \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba}) 5591 (40: U+10500..10527) 5592 \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23) 5593 \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym}) 5594 (23: U+10FE0..10FF6) 5595 \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495) 5596 \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi}) 5597 (495: U+1200..1248, U+124A..124D, 5598 U+1250..1256, U+1258, U+125A..125D, 5599 U+1260..1288 ...) 5600 \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (175) 5601 \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor}) 5602 (175: U+0589, U+10A0..10C5, U+10C7, 5603 U+10CD, U+10D0..10FF, U+1C90..1CBA ...) 5604 \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136) 5605 \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag}) 5606 (136: U+0484, U+0487, U+2C00..2C2E, 5607 U+2C30..2C5E, U+2E43, U+A66F ...) 5608 \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi} 5609 (65) 5610 \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi} 5611 (77) 5612 \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27) 5613 \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27: 5614 U+10330..1034A) 5615 \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116) 5616 \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran}) 5617 (116: U+0951..0952, U+0964..0965, 5618 U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3, 5619 U+1CF2..1CF4 ...) 5620 \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522: 5621 U+0342, U+0345, U+0370..0373, 5622 U+0375..0377, U+037A..037D, U+037F ...) 5623 \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522) 5624 \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr}) 5625 (105: U+0951..0952, U+0964..0965, 5626 U+0A81..0A83, U+0A85..0A8D, 5627 U+0A8F..0A91, U+0A93..0AA8 ...) 5628 \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105) 5629 \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong}, 5630 \p{Gong}) (65: U+0964..0965, 5631 U+11D60..11D65, U+11D67..11D68, 5632 U+11D6A..11D8E, U+11D90..11D91, 5633 U+11D93..11D98 ...) 5634 \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru}) 5635 (94: U+0951..0952, U+0964..0965, 5636 U+0A01..0A03, U+0A05..0A0A, 5637 U+0A0F..0A10, U+0A13..0A28 ...) 5638 \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94) 5639 \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (89_513: 5640 U+2E80..2E99, U+2E9B..2EF3, 5641 U+2F00..2FD5, U+3001..3003, 5642 U+3005..3011, U+3013..301F ...) 5643 \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775) 5644 \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang}) 5645 (11_775: U+1100..11FF, U+3001..3003, 5646 U+3008..3011, U+3013..301F, 5647 U+302E..3030, U+3037 ...) 5648 \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (89_513) 5649 \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg}, 5650 \p{Rohg}) (55: U+060C, U+061B, U+061F, 5651 U+0640, U+06D4, U+10D00..10D27 ...) 5652 \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23) 5653 \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano}) 5654 (23: U+1720..1736) 5655 \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26) 5656 \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26: 5657 U+108E0..108F2, U+108F4..108F5, 5658 U+108FB..108FF) 5659 \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134) 5660 \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr}) 5661 (134: U+0591..05C7, U+05D0..05EA, 5662 U+05EF..05F4, U+FB1D..FB36, 5663 U+FB38..FB3C, U+FB3E ...) 5664 \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (431) 5665 \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira}) 5666 (431: U+3001..3003, U+3008..3011, 5667 U+3013..301F, U+3030..3035, U+3037, 5668 U+303C..303D ...) 5669 \p{Script_Extensions: Hluw} \p{Script_Extensions= 5670 Anatolian_Hieroglyphs} (583) 5671 \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong} 5672 (127) 5673 \p{Script_Extensions: Hmnp} \p{Script_Extensions= 5674 Nyiakeng_Puachue_Hmong} (71) 5675 \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian} 5676 (108) 5677 \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi}, 5678 \p{Armi}) (31: U+10840..10855, 5679 U+10857..1085F) 5680 \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh}) 5681 (502: U+0300..0341, U+0343..0344, 5682 U+0346..0362, U+0953..0954, 5683 U+1AB0..1ABE, U+1DC2..1DF9 ...) 5684 \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli}, 5685 \p{Phli}) (27: U+10B60..10B72, 5686 U+10B78..10B7F) 5687 \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx= 5688 Prti}, \p{Prti}) (30: U+10B40..10B55, 5689 U+10B58..10B5F) 5690 \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39) 5691 \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91) 5692 \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java}) 5693 (91: U+A980..A9CD, U+A9CF..A9D9, 5694 U+A9DE..A9DF) 5695 \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (87: 5696 U+0966..096F, U+A830..A839, 5697 U+11080..110C1, U+110CD) 5698 \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48) 5699 \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (356) 5700 \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda}) 5701 (104: U+0951..0952, U+0964..0965, 5702 U+0C80..0C8C, U+0C8E..0C90, 5703 U+0C92..0CA8, U+0CAA..0CB3 ...) 5704 \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana}) 5705 (356: U+3001..3003, U+3008..3011, 5706 U+3013..301F, U+3030..3035, U+3037, 5707 U+303C..303D ...) 5708 \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali}) 5709 (48: U+A900..A92F) 5710 \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68) 5711 \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar}) 5712 (68: U+10A00..10A03, U+10A05..10A06, 5713 U+10A0C..10A13, U+10A15..10A17, 5714 U+10A19..10A35, U+10A38..10A3A ...) 5715 \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146: 5716 U+1780..17DD, U+17E0..17E9, 5717 U+17F0..17F9, U+19E0..19FF) 5718 \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146) 5719 \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (82) 5720 \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (82: 5721 U+0AE6..0AEF, U+A830..A839, 5722 U+11200..11211, U+11213..1123E) 5723 \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind}) 5724 (81: U+0964..0965, U+A830..A839, 5725 U+112B0..112EA, U+112F0..112F9) 5726 \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (104) 5727 \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (87) 5728 \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127) 5729 \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (82: 5730 U+0E81..0E82, U+0E84, U+0E86..0E8A, 5731 U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...) 5732 \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (82) 5733 \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn}) 5734 (1387: [A-Za-z\xaa\xba\xc0-\xd6\xd8- 5735 \xf6\xf8-\xff], U+0100..02B8, 5736 U+02E0..02E4, U+0363..036F, 5737 U+0485..0486, U+0951..0952 ...) 5738 \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1387) 5739 \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74) 5740 \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74: 5741 U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F) 5742 \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69) 5743 \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69: 5744 U+0965, U+1900..191E, U+1920..192B, 5745 U+1930..193B, U+1940, U+1944..194F) 5746 \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386) 5747 \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268) 5748 \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina}) 5749 (386: U+10107..10133, U+10600..10736, 5750 U+10740..10755, U+10760..10767) 5751 \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb}) 5752 (268: U+10000..1000B, U+1000D..10026, 5753 U+10028..1003A, U+1003C..1003D, 5754 U+1003F..1004D, U+10050..1005D ...) 5755 \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48: 5756 U+A4D0..A4FF) 5757 \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29) 5758 \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29: 5759 U+10280..1029C) 5760 \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27) 5761 \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27: 5762 U+10920..10939, U+1093F) 5763 \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj}) 5764 (61: U+0964..096F, U+A830..A839, 5765 U+11150..11176) 5766 \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61) 5767 \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25) 5768 \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka}) 5769 (25: U+11EE0..11EF8) 5770 \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym}) 5771 (125: U+0951..0952, U+0964..0965, 5772 U+0D00..0D03, U+0D05..0D0C, 5773 U+0D0E..0D10, U+0D12..0D44 ...) 5774 \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30) 5775 \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand}) 5776 (30: U+0640, U+0840..085B, U+085E) 5777 \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52) 5778 \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani}) 5779 (52: U+0640, U+10AC0..10AE6, 5780 U+10AEB..10AF6) 5781 \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68) 5782 \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc}) 5783 (68: U+11C70..11C8F, U+11C92..11CA7, 5784 U+11CA9..11CB6) 5785 \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm}, 5786 \p{Gonm}) (77: U+0964..0965, 5787 U+11D00..11D06, U+11D08..11D09, 5788 U+11D0B..11D36, U+11D3A, U+11D3C..11D3D 5789 ...) 5790 \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf}) 5791 (91: U+16E40..16E9A) 5792 \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91) 5793 \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei}, 5794 \p{Mtei}) (79: U+AAE0..AAF6, 5795 U+ABC0..ABED, U+ABF0..ABF9) 5796 \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui} 5797 (213) 5798 \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend}, 5799 \p{Mend}) (213: U+1E800..1E8C4, 5800 U+1E8C7..1E8D6) 5801 \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive} 5802 (90) 5803 \p{Script_Extensions: Mero} \p{Script_Extensions= 5804 Meroitic_Hieroglyphs} (32) 5805 \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc}, 5806 \p{Merc}) (90: U+109A0..109B7, 5807 U+109BC..109CF, U+109D2..109FF) 5808 \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero}, 5809 \p{Mero}) (32: U+10980..1099F) 5810 \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149: 5811 U+16F00..16F4A, U+16F4F..16F87, 5812 U+16F8F..16F9F) 5813 \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (125) 5814 \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89: 5815 U+A830..A839, U+11600..11644, 5816 U+11650..11659) 5817 \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (171) 5818 \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong}) 5819 (171: U+1800..180E, U+1810..1819, 5820 U+1820..1878, U+1880..18AA, U+202F, 5821 U+11660..1166C) 5822 \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43: 5823 U+16A40..16A5E, U+16A60..16A69, 5824 U+16A6E..16A6F) 5825 \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43) 5826 \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79) 5827 \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48) 5828 \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult}) 5829 (48: U+0A66..0A6F, U+11280..11286, 5830 U+11288, U+1128A..1128D, U+1128F..1129D, 5831 U+1129F..112A9) 5832 \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr}) 5833 (224: U+1000..109F, U+A92E, 5834 U+A9E0..A9FE, U+AA60..AA7F) 5835 \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224) 5836 \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat}) 5837 (40: U+10880..1089E, U+108A7..108AF) 5838 \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86) 5839 \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand}) 5840 (86: U+0964..0965, U+0CE6..0CEF, U+1CE9, 5841 U+1CF2, U+1CFA, U+A830..A835 ...) 5842 \p{Script_Extensions: Narb} \p{Script_Extensions= 5843 Old_North_Arabian} (32) 5844 \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40) 5845 \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu}) 5846 (83: U+1980..19AB, U+19B0..19C9, 5847 U+19D0..19DA, U+19DE..19DF) 5848 \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (94: 5849 U+11400..11459, U+1145B, U+1145D..1145F) 5850 \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (62: 5851 U+07C0..07FA, U+07FD..07FF) 5852 \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (62) 5853 \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397) 5854 \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397: 5855 U+16FE1, U+1B170..1B2FB) 5856 \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx= 5857 Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C, 5858 U+1E130..1E13D, U+1E140..1E149, 5859 U+1E14E..1E14F) 5860 \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29) 5861 \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29: 5862 U+1680..169C) 5863 \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck}) 5864 (48: U+1C50..1C7F) 5865 \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48) 5866 \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung}, 5867 \p{Hung}) (108: U+10C80..10CB2, 5868 U+10CC0..10CF2, U+10CFA..10CFF) 5869 \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital}) 5870 (39: U+10300..10323, U+1032D..1032F) 5871 \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb}, 5872 \p{Narb}) (32: U+10A80..10A9F) 5873 \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm}) 5874 (44: U+0483, U+10350..1037A) 5875 \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo}) 5876 (50: U+103A0..103C3, U+103C8..103D5) 5877 \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo}) 5878 (40: U+10F00..10F27) 5879 \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb}, 5880 \p{Sarb}) (32: U+10A60..10A7F) 5881 \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh}) 5882 (73: U+10C00..10C48) 5883 \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (96: 5884 U+0951..0952, U+0964..0965, 5885 U+0B01..0B03, U+0B05..0B0C, 5886 U+0B0F..0B10, U+0B13..0B28 ...) 5887 \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73) 5888 \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (96) 5889 \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72: 5890 U+104B0..104D3, U+104D8..104FB) 5891 \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72) 5892 \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40) 5893 \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma}) 5894 (40: U+10480..1049D, U+104A0..104A9) 5895 \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng}, 5896 \p{Hmng}) (127: U+16B00..16B45, 5897 U+16B50..16B59, U+16B5B..16B61, 5898 U+16B63..16B77, U+16B7D..16B8F) 5899 \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32) 5900 \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm}) 5901 (32: U+10860..1087F) 5902 \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc}) 5903 (57: U+11AC0..11AF8) 5904 \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57) 5905 \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44) 5906 \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59) 5907 \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag}) 5908 (59: U+1802..1803, U+1805, U+A840..A877) 5909 \p{Script_Extensions: Phli} \p{Script_Extensions= 5910 Inscriptional_Pahlavi} (27) 5911 \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi} 5912 (30) 5913 \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29) 5914 \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx}) 5915 (29: U+10900..1091B, U+1091F) 5916 \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149) 5917 \p{Script_Extensions: Prti} \p{Script_Extensions= 5918 Inscriptional_Parthian} (30) 5919 \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp}, 5920 \p{Phlp}) (30: U+0640, U+10B80..10B91, 5921 U+10B99..10B9C, U+10BA9..10BAF) 5922 \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165) 5923 \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (502) 5924 \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37: 5925 U+A930..A953, U+A95F) 5926 \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37) 5927 \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya} 5928 (55) 5929 \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86: 5930 U+16A0..16EA, U+16EE..16F8) 5931 \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86) 5932 \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr}) 5933 (61: U+0800..082D, U+0830..083E) 5934 \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61) 5935 \p{Script_Extensions: Sarb} \p{Script_Extensions= 5936 Old_South_Arabian} (32) 5937 \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82) 5938 \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur}) 5939 (82: U+A880..A8C5, U+A8CE..A8D9) 5940 \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672) 5941 \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd}) 5942 (100: U+0951, U+1CD7, U+1CD9, 5943 U+1CDC..1CDD, U+1CE0, U+11180..111CD ...) 5944 \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw}) 5945 (48: U+10450..1047F) 5946 \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48) 5947 \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100) 5948 \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92) 5949 \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd}) 5950 (92: U+11580..115B5, U+115B8..115DD) 5951 \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw}) 5952 (672: U+1D800..1DA8B, U+1DA9B..1DA9F, 5953 U+1DAA1..1DAAF) 5954 \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81) 5955 \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112) 5956 \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh}) 5957 (112: U+0964..0965, U+0D82..0D83, 5958 U+0D85..0D96, U+0D9A..0DB1, 5959 U+0DB3..0DBB, U+0DBD ...) 5960 \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43) 5961 \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd}) 5962 (43: U+0640, U+10F30..10F59) 5963 \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40) 5964 \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35) 5965 \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora}, 5966 \p{Sora}) (35: U+110D0..110E8, 5967 U+110F0..110F9) 5968 \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83) 5969 \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo}) 5970 (83: U+11A50..11AA2) 5971 \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72) 5972 \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund}) 5973 (72: U+1B80..1BBF, U+1CC0..1CC7) 5974 \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56) 5975 \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo}, 5976 \p{Sylo}) (56: U+0964..0965, 5977 U+09E6..09EF, U+A800..A82B) 5978 \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (105) 5979 \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc}) 5980 (105: U+060C, U+061B..061C, U+061F, 5981 U+0640, U+064B..0655, U+0670 ...) 5982 \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg}) 5983 (22: U+1700..170C, U+170E..1714, 5984 U+1735..1736) 5985 \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20) 5986 \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb}) 5987 (20: U+1735..1736, U+1760..176C, 5988 U+176E..1770, U+1772..1773) 5989 \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45: 5990 U+1040..1049, U+1950..196D, U+1970..1974) 5991 \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana}) 5992 (127: U+1A20..1A5E, U+1A60..1A7C, 5993 U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD) 5994 \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt}) 5995 (72: U+AA80..AAC2, U+AADB..AADF) 5996 \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (79) 5997 \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (79: 5998 U+0964..0965, U+A830..A839, 5999 U+11680..116B8, U+116C0..116C9) 6000 \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45) 6001 \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83) 6002 \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133: 6003 U+0951..0952, U+0964..0965, 6004 U+0B82..0B83, U+0B85..0B8A, 6005 U+0B8E..0B90, U+0B92..0B95 ...) 6006 \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133) 6007 \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6892) 6008 \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang}) 6009 (6892: U+16FE0, U+17000..187F7, 6010 U+18800..18AF2) 6011 \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72) 6012 \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (104) 6013 \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu}) 6014 (104: U+0951..0952, U+0964..0965, 6015 U+0C00..0C0C, U+0C0E..0C10, 6016 U+0C12..0C28, U+0C2A..0C39 ...) 6017 \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59) 6018 \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22) 6019 \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66) 6020 \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66: 6021 U+060C, U+061B..061C, U+061F, 6022 U+0660..0669, U+0780..07B1, U+FDF2 ...) 6023 \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86: 6024 U+0E01..0E3A, U+0E40..0E5B) 6025 \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt}) 6026 (207: U+0F00..0F47, U+0F49..0F6C, 6027 U+0F71..0F97, U+0F99..0FBC, 6028 U+0FBE..0FCC, U+0FCE..0FD4 ...) 6029 \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207) 6030 \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng}) 6031 (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F) 6032 \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97) 6033 \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh}) 6034 (97: U+0951..0952, U+0964..0965, U+1CF2, 6035 U+A830..A839, U+11480..114C7, 6036 U+114D0..114D9) 6037 \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31) 6038 \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar}) 6039 (31: U+10380..1039D, U+1039F) 6040 \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz}) 6041 (976_118 plus all above-Unicode code 6042 points: U+0378..0379, U+0380..0383, 6043 U+038B, U+038D, U+03A2, U+0530 ...) 6044 \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300: 6045 U+A500..A62B) 6046 \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300) 6047 \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59: 6048 U+1E2C0..1E2F9, U+1E2FF) 6049 \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84) 6050 \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara}) 6051 (84: U+118A0..118F2, U+118FF) 6052 \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59) 6053 \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50) 6054 \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234) 6055 \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246: 6056 U+3001..3002, U+3008..3011, 6057 U+3014..301B, U+30FB, U+A000..A48C, 6058 U+A490..A4C6 ...) 6059 \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246) 6060 \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb}, 6061 \p{Zanb}) (72: U+11A00..11A47) 6062 \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square} 6063 (72) 6064 \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (502) 6065 \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7386) 6066 \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (976_118 6067 plus all above-Unicode code points) 6068 \p{Scx: *} \p{Script_Extensions: *} 6069 \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46) 6070 \p{SD: *} \p{Soft_Dotted: *} 6071 \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4) 6072 \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024, 6073 U+FE52, U+FF0E) 6074 \p{Sentence_Break: CL} \p{Sentence_Break=Close} (187) 6075 \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187: 6076 [\"\'\(\)\[\]\{\}\xab\xbb], 6077 U+0F3A..0F3D, U+169B..169C, 6078 U+2018..201F, U+2039..203A, U+2045..2046 6079 ...) 6080 \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1: [\r]) 6081 \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2368) 6082 \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2368: U+0300..036F, 6083 U+0483..0489, U+0591..05BD, U+05BF, 6084 U+05C1..05C2, U+05C4..05C5 ...) 6085 \p{Sentence_Break: FO} \p{Sentence_Break=Format} (63) 6086 \p{Sentence_Break: Format} (Short: \p{SB=FO}) (63: [\xad], 6087 U+0600..0605, U+061C, U+06DD, U+070F, 6088 U+08E2 ...) 6089 \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (121_822) 6090 \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1: [\n]) 6091 \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2293) 6092 \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2293: [a- 6093 z\xaa\xb5\xba\xdf-\xf6\xf8-\xff], 6094 U+0101, U+0103, U+0105, U+0107, U+0109 6095 ...) 6096 \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (632) 6097 \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (632: [0-9], 6098 U+0660..0669, U+066B..066C, 6099 U+06F0..06F9, U+07C0..07C9, U+0966..096F 6100 ...) 6101 \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (121_822: U+01BB, 6102 U+01C0..01C3, U+0294, U+02B9..02BF, 6103 U+02C6..02D1, U+02EC ...) 6104 \p{Sentence_Break: Other} (Short: \p{SB=XX}) (984_661 plus all 6105 above-Unicode code points: 6106 [^\t\n\cK\f\r\x20!\"\'\(\),\-.0-9:?A- 6107 Z\[\]a-z\{\}\x85\xa0\xaa- 6108 \xab\xad\xb5\xba-\xbb\xc0-\xd6\xd8- 6109 \xf6\xf8-\xff], U+02C2..02C5, 6110 U+02D2..02DF, U+02E5..02EB, U+02ED, 6111 U+02EF..02FF ...) 6112 \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26) 6113 \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:], 6114 U+055D, U+060C..060D, U+07F8, U+1802, 6115 U+1808 ...) 6116 \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3) 6117 \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85], 6118 U+2028..2029) 6119 \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0], 6120 U+1680, U+2000..200A, U+202F, U+205F, 6121 U+3000) 6122 \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (138) 6123 \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (138: [!?], U+0589, 6124 U+061E..061F, U+06D4, U+0700..0702, 6125 U+07F9 ...) 6126 \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1893) 6127 \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1893: [A-Z\xc0- 6128 \xd6\xd8-\xde], U+0100, U+0102, U+0104, 6129 U+0106, U+0108 ...) 6130 \p{Sentence_Break: XX} \p{Sentence_Break=Other} (984_661 plus all 6131 above-Unicode code points) 6132 \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm}) 6133 (141) 6134 \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm}) 6135 (1_113_971 plus all above-Unicode code 6136 points: [\x00-\x20\"#\$\%&\'\(\)*+,\- 6137 \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f- 6138 \xff], U+0100..0588, U+058A..061D, 6139 U+0620..06D3, U+06D5..06FF, U+0703..07F8 6140 ...) 6141 \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (141: 6142 [!.?], U+0589, U+061E..061F, U+06D4, 6143 U+0700..0702, U+07F9 ...) 6144 \p{Separator} \p{General_Category=Separator} (Short: 6145 \p{Z}) (19) 6146 \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions= 6147 SignWriting}) (672) 6148 \p{Sharada} \p{Script_Extensions=Sharada} (Short: 6149 \p{Shrd}; NOT \p{Block=Sharada}) (100) 6150 \p{Shavian} \p{Script_Extensions=Shavian} (Short: 6151 \p{Shaw}) (48) 6152 \p{Shaw} \p{Shavian} (= \p{Script_Extensions= 6153 Shavian}) (48) 6154 X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls} 6155 (16) 6156 \p{Shrd} \p{Sharada} (= \p{Script_Extensions= 6157 Sharada}) (NOT \p{Block=Sharada}) (100) 6158 \p{Sidd} \p{Siddham} (= \p{Script_Extensions= 6159 Siddham}) (NOT \p{Block=Siddham}) (92) 6160 \p{Siddham} \p{Script_Extensions=Siddham} (Short: 6161 \p{Sidd}; NOT \p{Block=Siddham}) (92) 6162 \p{SignWriting} \p{Script_Extensions=SignWriting} (Short: 6163 \p{Sgnw}) (672) 6164 \p{Sind} \p{Khudawadi} (= \p{Script_Extensions= 6165 Khudawadi}) (NOT \p{Block=Khudawadi}) 6166 (81) 6167 \p{Sinh} \p{Sinhala} (= \p{Script_Extensions= 6168 Sinhala}) (NOT \p{Block=Sinhala}) (112) 6169 \p{Sinhala} \p{Script_Extensions=Sinhala} (Short: 6170 \p{Sinh}; NOT \p{Block=Sinhala}) (112) 6171 X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32) 6172 \p{Sk} \p{Modifier_Symbol} (= 6173 \p{General_Category=Modifier_Symbol}) 6174 (121) 6175 \p{Sm} \p{Math_Symbol} (= \p{General_Category= 6176 Math_Symbol}) (948) 6177 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short: 6178 \p{InSmallForms}) (32) 6179 X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block= 6180 Small_Form_Variants}) (32) 6181 X \p{Small_Kana_Ext} \p{Small_Kana_Extension} (= \p{Block= 6182 Small_Kana_Extension}) (64) 6183 X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short: 6184 \p{InSmallKanaExt}) (64) 6185 \p{So} \p{Other_Symbol} (= \p{General_Category= 6186 Other_Symbol}) (6161) 6187 \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46) 6188 \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus 6189 all above-Unicode code points: [\x00- 6190 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A- 6191 Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff], 6192 U+0100..012E, U+0130..0248, 6193 U+024A..0267, U+0269..029C, U+029E..02B1 6194 ...) 6195 \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46: [i-j], 6196 U+012F, U+0249, U+0268, U+029D, U+02B2 6197 ...) 6198 \p{Sogd} \p{Sogdian} (= \p{Script_Extensions= 6199 Sogdian}) (NOT \p{Block=Sogdian}) (43) 6200 \p{Sogdian} \p{Script_Extensions=Sogdian} (Short: 6201 \p{Sogd}; NOT \p{Block=Sogdian}) (43) 6202 \p{Sogo} \p{Old_Sogdian} (= \p{Script_Extensions= 6203 Old_Sogdian}) (NOT \p{Block= 6204 Old_Sogdian}) (40) 6205 \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions= 6206 Sora_Sompeng}) (NOT \p{Block= 6207 Sora_Sompeng}) (35) 6208 \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short: 6209 \p{Sora}; NOT \p{Block=Sora_Sompeng}) 6210 (35) 6211 \p{Soyo} \p{Soyombo} (= \p{Script_Extensions= 6212 Soyombo}) (NOT \p{Block=Soyombo}) (83) 6213 \p{Soyombo} \p{Script_Extensions=Soyombo} (Short: 6214 \p{Soyo}; NOT \p{Block=Soyombo}) (83) 6215 \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25) 6216 \p{Space: *} \p{White_Space: *} 6217 \p{Space_Separator} \p{General_Category=Space_Separator} 6218 (Short: \p{Zs}) (17) 6219 \p{SpacePerl} \p{XPosixSpace} (25) 6220 \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short: 6221 \p{Mc}) (429) 6222 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters} 6223 (Short: \p{InModifierLetters}) (80) 6224 X \p{Specials} \p{Block=Specials} (16) 6225 \p{STerm} \p{Sentence_Terminal} (= 6226 \p{Sentence_Terminal=Y}) (141) 6227 \p{STerm: *} \p{Sentence_Terminal: *} 6228 \p{Sund} \p{Sundanese} (= \p{Script_Extensions= 6229 Sundanese}) (NOT \p{Block=Sundanese}) 6230 (72) 6231 \p{Sundanese} \p{Script_Extensions=Sundanese} (Short: 6232 \p{Sund}; NOT \p{Block=Sundanese}) (72) 6233 X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block= 6234 Sundanese_Supplement}) (16) 6235 X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short: 6236 \p{InSundaneseSup}) (16) 6237 X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block= 6238 Supplemental_Arrows_A}) (16) 6239 X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block= 6240 Supplemental_Arrows_B}) (128) 6241 X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block= 6242 Supplemental_Arrows_C}) (256) 6243 X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (= 6244 \p{Block= 6245 Supplemental_Mathematical_Operators}) 6246 (256) 6247 X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (= 6248 \p{Block= 6249 Supplementary_Private_Use_Area_A}) 6250 (65_536) 6251 X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (= 6252 \p{Block= 6253 Supplementary_Private_Use_Area_B}) 6254 (65_536) 6255 X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block= 6256 Supplemental_Punctuation}) (128) 6257 X \p{Sup_Symbols_And_Pictographs} 6258 \p{Supplemental_Symbols_And_Pictographs} 6259 (= \p{Block= 6260 Supplemental_Symbols_And_Pictographs}) 6261 (256) 6262 X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (= 6263 \p{Block=Superscripts_And_Subscripts}) 6264 (48) 6265 X \p{Superscripts_And_Subscripts} \p{Block= 6266 Superscripts_And_Subscripts} (Short: 6267 \p{InSuperAndSub}) (48) 6268 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short: 6269 \p{InSupArrowsA}) (16) 6270 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short: 6271 \p{InSupArrowsB}) (128) 6272 X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short: 6273 \p{InSupArrowsC}) (256) 6274 X \p{Supplemental_Mathematical_Operators} \p{Block= 6275 Supplemental_Mathematical_Operators} 6276 (Short: \p{InSupMathOperators}) (256) 6277 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation} 6278 (Short: \p{InSupPunctuation}) (128) 6279 X \p{Supplemental_Symbols_And_Pictographs} \p{Block= 6280 Supplemental_Symbols_And_Pictographs} 6281 (Short: \p{InSupSymbolsAndPictographs}) 6282 (256) 6283 X \p{Supplementary_Private_Use_Area_A} \p{Block= 6284 Supplementary_Private_Use_Area_A} 6285 (Short: \p{InSupPUAA}) (65_536) 6286 X \p{Supplementary_Private_Use_Area_B} \p{Block= 6287 Supplementary_Private_Use_Area_B} 6288 (Short: \p{InSupPUAB}) (65_536) 6289 \p{Surrogate} \p{General_Category=Surrogate} (Short: 6290 \p{Cs}) (2048) 6291 X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688) 6292 \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions= 6293 Syloti_Nagri}) (NOT \p{Block= 6294 Syloti_Nagri}) (56) 6295 \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short: 6296 \p{Sylo}; NOT \p{Block=Syloti_Nagri}) 6297 (56) 6298 \p{Symbol} \p{General_Category=Symbol} (Short: \p{S}) 6299 (7292) 6300 X \p{Symbols_And_Pictographs_Ext_A} 6301 \p{Symbols_And_Pictographs_Extended_A} 6302 (= \p{Block= 6303 Symbols_And_Pictographs_Extended_A}) 6304 (144) 6305 X \p{Symbols_And_Pictographs_Extended_A} \p{Block= 6306 Symbols_And_Pictographs_Extended_A} (144) 6307 \p{Syrc} \p{Syriac} (= \p{Script_Extensions= 6308 Syriac}) (NOT \p{Block=Syriac}) (105) 6309 \p{Syriac} \p{Script_Extensions=Syriac} (Short: 6310 \p{Syrc}; NOT \p{Block=Syriac}) (105) 6311 X \p{Syriac_Sup} \p{Syriac_Supplement} (= \p{Block= 6312 Syriac_Supplement}) (16) 6313 X \p{Syriac_Supplement} \p{Block=Syriac_Supplement} (Short: 6314 \p{InSyriacSup}) (16) 6315 \p{Tagalog} \p{Script_Extensions=Tagalog} (Short: 6316 \p{Tglg}; NOT \p{Block=Tagalog}) (22) 6317 \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions= 6318 Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20) 6319 \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short: 6320 \p{Tagb}; NOT \p{Block=Tagbanwa}) (20) 6321 X \p{Tags} \p{Block=Tags} (128) 6322 \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short: 6323 \p{Tale}; NOT \p{Block=Tai_Le}) (45) 6324 \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short: 6325 \p{Lana}; NOT \p{Block=Tai_Tham}) (127) 6326 \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short: 6327 \p{Tavt}; NOT \p{Block=Tai_Viet}) (72) 6328 X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block= 6329 Tai_Xuan_Jing_Symbols}) (96) 6330 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short: 6331 \p{InTaiXuanJing}) (96) 6332 \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri}) 6333 (NOT \p{Block=Takri}) (79) 6334 \p{Takri} \p{Script_Extensions=Takri} (Short: 6335 \p{Takr}; NOT \p{Block=Takri}) (79) 6336 \p{Tale} \p{Tai_Le} (= \p{Script_Extensions= 6337 Tai_Le}) (NOT \p{Block=Tai_Le}) (45) 6338 \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions= 6339 New_Tai_Lue}) (NOT \p{Block= 6340 New_Tai_Lue}) (83) 6341 \p{Tamil} \p{Script_Extensions=Tamil} (Short: 6342 \p{Taml}; NOT \p{Block=Tamil}) (133) 6343 X \p{Tamil_Sup} \p{Tamil_Supplement} (= \p{Block= 6344 Tamil_Supplement}) (64) 6345 X \p{Tamil_Supplement} \p{Block=Tamil_Supplement} (Short: 6346 \p{InTamilSup}) (64) 6347 \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil}) 6348 (NOT \p{Block=Tamil}) (133) 6349 \p{Tang} \p{Tangut} (= \p{Script_Extensions= 6350 Tangut}) (NOT \p{Block=Tangut}) (6892) 6351 \p{Tangut} \p{Script_Extensions=Tangut} (Short: 6352 \p{Tang}; NOT \p{Block=Tangut}) (6892) 6353 X \p{Tangut_Components} \p{Block=Tangut_Components} (768) 6354 \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions= 6355 Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72) 6356 \p{Telu} \p{Telugu} (= \p{Script_Extensions= 6357 Telugu}) (NOT \p{Block=Telugu}) (104) 6358 \p{Telugu} \p{Script_Extensions=Telugu} (Short: 6359 \p{Telu}; NOT \p{Block=Telugu}) (104) 6360 \p{Term} \p{Terminal_Punctuation} (= 6361 \p{Terminal_Punctuation=Y}) (264) 6362 \p{Term: *} \p{Terminal_Punctuation: *} 6363 \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short: 6364 \p{Term}) (264) 6365 \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term}) 6366 (1_113_848 plus all above-Unicode code 6367 points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0- 6368 9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff], 6369 U+0100..037D, U+037F..0386, 6370 U+0388..0588, U+058A..05C2, U+05C4..060B 6371 ...) 6372 \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (264: 6373 [!,.:;?], U+037E, U+0387, U+0589, 6374 U+05C3, U+060C ...) 6375 \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions= 6376 Tifinagh}) (NOT \p{Block=Tifinagh}) (59) 6377 \p{Tglg} \p{Tagalog} (= \p{Script_Extensions= 6378 Tagalog}) (NOT \p{Block=Tagalog}) (22) 6379 \p{Thaa} \p{Thaana} (= \p{Script_Extensions= 6380 Thaana}) (NOT \p{Block=Thaana}) (66) 6381 \p{Thaana} \p{Script_Extensions=Thaana} (Short: 6382 \p{Thaa}; NOT \p{Block=Thaana}) (66) 6383 \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block= 6384 Thai}) (86) 6385 \p{Tibetan} \p{Script_Extensions=Tibetan} (Short: 6386 \p{Tibt}; NOT \p{Block=Tibetan}) (207) 6387 \p{Tibt} \p{Tibetan} (= \p{Script_Extensions= 6388 Tibetan}) (NOT \p{Block=Tibetan}) (207) 6389 \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short: 6390 \p{Tfng}; NOT \p{Block=Tifinagh}) (59) 6391 \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions= 6392 Tirhuta}) (NOT \p{Block=Tirhuta}) (97) 6393 \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short: 6394 \p{Tirh}; NOT \p{Block=Tirhuta}) (97) 6395 \p{Title} \p{Titlecase} (/i= Cased=Yes) (31) 6396 \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i= 6397 Cased=Yes) (31: U+01C5, U+01C8, U+01CB, 6398 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...) 6399 \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter} 6400 (Short: \p{Lt}; /i= General_Category= 6401 Cased_Letter) (31) 6402 X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block= 6403 Transport_And_Map_Symbols}) (128) 6404 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols} 6405 (Short: \p{InTransportAndMap}) (128) 6406 X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics} 6407 (= \p{Block= 6408 Unified_Canadian_Aboriginal_Syllabics}) 6409 (640) 6410 X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_- 6411 Extended} (= \p{Block= 6412 Unified_Canadian_Aboriginal_Syllabics_- 6413 Extended}) (80) 6414 \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions= 6415 Ugaritic}) (NOT \p{Block=Ugaritic}) (31) 6416 \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short: 6417 \p{Ugar}; NOT \p{Block=Ugaritic}) (31) 6418 \p{UIdeo} \p{Unified_Ideograph} (= 6419 \p{Unified_Ideograph=Y}) (87_887) 6420 \p{UIdeo: *} \p{Unified_Ideograph: *} 6421 \p{Unassigned} \p{General_Category=Unassigned} (Short: 6422 \p{Cn}) (836_602 plus all above-Unicode 6423 code points) 6424 \p{Unicode} \p{Any} (1_114_112) 6425 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block= 6426 Unified_Canadian_Aboriginal_Syllabics} 6427 (Short: \p{InUCAS}) (640) 6428 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block= 6429 Unified_Canadian_Aboriginal_Syllabics_- 6430 Extended} (Short: \p{InUCASExt}) (80) 6431 \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo}) 6432 (87_887) 6433 \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo}) 6434 (1_026_225 plus all above-Unicode code 6435 points: U+0000..33FF, U+4DB6..4DFF, 6436 U+9FF0..FA0D, U+FA10, U+FA12, 6437 U+FA15..FA1E ...) 6438 \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (87_887: 6439 U+3400..4DB5, U+4E00..9FEF, 6440 U+FA0E..FA0F, U+FA11, U+FA13..FA14, 6441 U+FA1F ...) 6442 \p{Unknown} \p{Script_Extensions=Unknown} (Short: 6443 \p{Zzzz}) (976_118 plus all above- 6444 Unicode code points) 6445 \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i= 6446 Cased=Yes) (1908) 6447 \p{Upper: *} \p{Uppercase: *} 6448 \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i= 6449 Cased=Yes) (1908) 6450 \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased= 6451 No) (1_112_204 plus all above-Unicode 6452 code points: [\x00- 6453 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 6454 >?\@\[\\\]\^_`a-z\{\|\}~\x7f- 6455 \xbf\xd7\xdf-\xff], U+0101, U+0103, 6456 U+0105, U+0107, U+0109 ...) 6457 \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased= 6458 Yes) (1908: [A-Z\xc0-\xd6\xd8-\xde], 6459 U+0100, U+0102, U+0104, U+0106, U+0108 6460 ...) 6461 \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter} 6462 (Short: \p{Lu}; /i= General_Category= 6463 Cased_Letter) (1788) 6464 \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block= 6465 Vai}) (300) 6466 \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT 6467 \p{Block=Vai}) (300) 6468 \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS}; 6469 NOT \p{Variation_Selectors}) (259) 6470 \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853 6471 plus all above-Unicode code points: 6472 U+0000..180A, U+180E..FDFF, 6473 U+FE10..E00FF, U+E01F0..infinity) 6474 \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259: 6475 U+180B..180D, U+FE00..FE0F, 6476 U+E0100..E01EF) 6477 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short: 6478 \p{InVS}) (16) 6479 X \p{Variation_Selectors_Supplement} \p{Block= 6480 Variation_Selectors_Supplement} (Short: 6481 \p{InVSSup}) (240) 6482 X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block= 6483 Vedic_Extensions}) (48) 6484 X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short: 6485 \p{InVedicExt}) (48) 6486 X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16) 6487 \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated} 6488 (787_620 plus all above-Unicode code 6489 points) 6490 \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (787_620 plus 6491 all above-Unicode code points: [\x00- 6492 \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf- 6493 \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9, 6494 U+02EC..10FF, U+1200..1400, 6495 U+1680..18AF, U+1900..2015 ...) 6496 \p{Vertical_Orientation: Tr} \p{Vertical_Orientation= 6497 Transformed_Rotated} (47) 6498 \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr}) 6499 (47: U+2329..232A, U+3008..3011, 6500 U+3014..301F, U+3030, U+30A0, U+30FC ...) 6501 \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu}) 6502 (148: U+3001..3002, U+3041, U+3043, 6503 U+3045, U+3047, U+3049 ...) 6504 \p{Vertical_Orientation: Tu} \p{Vertical_Orientation= 6505 Transformed_Upright} (148) 6506 \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright} 6507 (326_297) 6508 \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (326_297: 6509 [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7], 6510 U+02EA..02EB, U+1100..11FF, 6511 U+1401..167F, U+18B0..18FF, U+2016 ...) 6512 \p{VertSpace} \v (7: [\n\cK\f\r\x85], U+2028..2029) 6513 \p{Vo: *} \p{Vertical_Orientation: *} 6514 \p{VS} \p{Variation_Selector} (= 6515 \p{Variation_Selector=Y}) (NOT 6516 \p{Variation_Selectors}) (259) 6517 \p{VS: *} \p{Variation_Selector: *} 6518 X \p{VS_Sup} \p{Variation_Selectors_Supplement} (= 6519 \p{Block= 6520 Variation_Selectors_Supplement}) (240) 6521 \p{Wancho} \p{Script_Extensions=Wancho} (Short: 6522 \p{Wcho}; NOT \p{Block=Wancho}) (59) 6523 \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions= 6524 Warang_Citi}) (NOT \p{Block= 6525 Warang_Citi}) (84) 6526 \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short: 6527 \p{Wara}; NOT \p{Block=Warang_Citi}) (84) 6528 \p{WB: *} \p{Word_Break: *} 6529 \p{Wcho} \p{Wancho} (= \p{Script_Extensions= 6530 Wancho}) (NOT \p{Block=Wancho}) (59) 6531 \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25) 6532 \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087 6533 plus all above-Unicode code points: 6534 [^\t\n\cK\f\r\x20\x85\xa0], 6535 U+0100..167F, U+1681..1FFF, 6536 U+200B..2027, U+202A..202E, U+2030..205E 6537 ...) 6538 \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25: 6539 [\t\n\cK\f\r\x20\x85\xa0], U+1680, 6540 U+2000..200A, U+2028..2029, U+202F, 6541 U+205F ...) 6542 \p{Word} \p{XPosixWord} (128_919) 6543 \p{Word_Break: ALetter} (Short: \p{WB=LE}) (28_693: [A-Za- 6544 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8- 6545 \xff], U+0100..02D7, U+02DE..02E4, 6546 U+02EC..02FF, U+0370..0374, U+0376..0377 6547 ...) 6548 \p{Word_Break: CR} (Short: \p{WB=CR}) (1: [\r]) 6549 \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"]) 6550 \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1) 6551 \p{Word_Break: E_Base} (Short: \p{WB=EB}) (0) 6552 \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0) 6553 \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0) 6554 \p{Word_Break: EB} \p{Word_Break=E_Base} (0) 6555 \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (0) 6556 \p{Word_Break: EM} \p{Word_Break=E_Modifier} (0) 6557 \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11) 6558 \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2372: 6559 U+0300..036F, U+0483..0489, 6560 U+0591..05BD, U+05BF, U+05C1..05C2, 6561 U+05C4..05C5 ...) 6562 \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F, 6563 U+203F..2040, U+2054, U+FE33..FE34, 6564 U+FE4D..FE4F ...) 6565 \p{Word_Break: FO} \p{Word_Break=Format} (62) 6566 \p{Word_Break: Format} (Short: \p{WB=FO}) (62: [\xad], 6567 U+0600..0605, U+061C, U+06DD, U+070F, 6568 U+08E2 ...) 6569 \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (0) 6570 \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0) 6571 \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75: 6572 U+05D0..05EA, U+05EF..05F2, U+FB1D, 6573 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C 6574 ...) 6575 \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (75) 6576 \p{Word_Break: KA} \p{Word_Break=Katakana} (314) 6577 \p{Word_Break: Katakana} (Short: \p{WB=KA}) (314: U+3031..3035, 6578 U+309B..309C, U+30A0..30FA, 6579 U+30FC..30FF, U+31F0..31FF, U+32D0..32FE 6580 ...) 6581 \p{Word_Break: LE} \p{Word_Break=ALetter} (28_693) 6582 \p{Word_Break: LF} (Short: \p{WB=LF}) (1: [\n]) 6583 \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7) 6584 \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8: [:\xb7], U+0387, 6585 U+05F4, U+2027, U+FE13, U+FE55 ...) 6586 \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15: [,;], U+037E, 6587 U+0589, U+060C..060D, U+066C, U+07F8 ...) 6588 \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.], 6589 U+2018..2019, U+2024, U+FE52, U+FF07, 6590 U+FF0E) 6591 \p{Word_Break: ML} \p{Word_Break=MidLetter} (8) 6592 \p{Word_Break: MN} \p{Word_Break=MidNum} (15) 6593 \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85], 6594 U+2028..2029) 6595 \p{Word_Break: NL} \p{Word_Break=Newline} (5) 6596 \p{Word_Break: NU} \p{Word_Break=Numeric} (631) 6597 \p{Word_Break: Numeric} (Short: \p{WB=NU}) (631: [0-9], 6598 U+0660..0669, U+066B, U+06F0..06F9, 6599 U+07C0..07C9, U+0966..096F ...) 6600 \p{Word_Break: Other} (Short: \p{WB=XX}) (1_081_874 plus all 6601 above-Unicode code points: 6602 [^\n\cK\f\r\x20\"\',.0-9:;A-Z_a- 6603 z\x85\xaa\xad\xb5\xb7\xba\xc0-\xd6\xd8- 6604 \xf6\xf8-\xff], U+02D8..02DD, 6605 U+02E5..02EB, U+0375, U+0378..0379, 6606 U+0380..0385 ...) 6607 \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26: 6608 U+1F1E6..1F1FF) 6609 \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26) 6610 \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\']) 6611 \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1) 6612 \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20], 6613 U+1680, U+2000..2006, U+2008..200A, 6614 U+205F, U+3000) 6615 \p{Word_Break: XX} \p{Word_Break=Other} (1_081_874 plus all 6616 above-Unicode code points) 6617 \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1: U+200D) 6618 \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25) 6619 \p{WSpace: *} \p{White_Space: *} 6620 \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44) 6621 \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC}) 6622 (128_770) 6623 \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (985_342 6624 plus all above-Unicode code points: 6625 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<= 6626 >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab- 6627 \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7], 6628 U+02C2..02C5, U+02D2..02DF, 6629 U+02E5..02EB, U+02ED, U+02EF..02FF ...) 6630 \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (128_770: 6631 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0- 6632 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1, 6633 U+02C6..02D1, U+02E0..02E4, U+02EC, 6634 U+02EE ...) 6635 \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (125_861) 6636 \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (988_251 6637 plus all above-Unicode code points: 6638 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 6639 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab- 6640 \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], 6641 U+02C2..02C5, U+02D2..02DF, 6642 U+02E5..02EB, U+02ED, U+02EF..036F ...) 6643 \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (125_861: 6644 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8- 6645 \xf6\xf8-\xff], U+0100..02C1, 6646 U+02C6..02D1, U+02E0..02E4, U+02EC, 6647 U+02EE ...) 6648 \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y}) 6649 (128_770) 6650 \p{XIDC: *} \p{XID_Continue: *} 6651 \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (125_861) 6652 \p{XIDS: *} \p{XID_Start: *} 6653 \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions= 6654 Old_Persian}) (NOT \p{Block= 6655 Old_Persian}) (50) 6656 \p{XPerlSpace} \p{XPosixSpace} (25) 6657 \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short: 6658 \p{Alnum}) (127_886: [0-9A-Za- 6659 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8- 6660 \xff], U+0100..02C1, U+02C6..02D1, 6661 U+02E0..02E4, U+02EC, U+02EE ...) 6662 \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha}) 6663 (127_256) 6664 \p{XPosixBlank} \h, Horizontal white space (Short: 6665 \p{Blank}) (18: [\t\x20\xa0], U+1680, 6666 U+2000..200A, U+202F, U+205F, U+3000) 6667 \p{XPosixCntrl} \p{General_Category=Control} Control 6668 characters (Short: \p{Cc}) (65) 6669 \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9] 6670 + all other decimal digits (Short: 6671 \p{Nd}) (630) 6672 \p{XPosixGraph} Characters that are graphical (Short: 6673 \p{Graph}) (275_378: 6674 [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A- 6675 Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff], 6676 U+0100..0377, U+037A..037F, 6677 U+0384..038A, U+038C, U+038E..03A1 ...) 6678 \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i= 6679 Cased=Yes) (2340) 6680 \p{XPosixPrint} Characters that are graphical plus space 6681 characters (but no controls) (Short: 6682 \p{Print}) (275_395: [\x20-\x7e\xa0- 6683 \xff], U+0100..0377, U+037A..037F, 6684 U+0384..038A, U+038C, U+038E..03A1 ...) 6685 \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (801: 6686 [!\"#\$\%&\'\(\)*+,\-.\/:;<= 6687 >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6- 6688 \xb7\xbb\xbf], U+037E, U+0387, 6689 U+055A..055F, U+0589..058A, U+05BE ...) 6690 \p{XPosixSpace} \s including beyond ASCII and vertical tab 6691 (Short: \p{SpacePerl}) (25: 6692 [\t\n\cK\f\r\x20\x85\xa0], U+1680, 6693 U+2000..200A, U+2028..2029, U+202F, 6694 U+205F ...) 6695 \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i= 6696 Cased=Yes) (1908) 6697 \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} + 6698 \pM + \p{Pc} + \p{Join_Control} (Short: 6699 \p{Word}) (128_919: [0-9A-Z_a- 6700 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8- 6701 \xff], U+0100..02C1, U+02C6..02D1, 6702 U+02E0..02E4, U+02EC, U+02EE ...) 6703 \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44) 6704 \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions= 6705 Cuneiform}) (NOT \p{Block=Cuneiform}) 6706 (1234) 6707 \p{Yi} \p{Script_Extensions=Yi} (1246) 6708 X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64) 6709 X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168) 6710 \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246) 6711 X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block= 6712 Yijing_Hexagram_Symbols}) (64) 6713 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols} 6714 (Short: \p{InYijing}) (64) 6715 \p{Z} \pZ \p{Separator} (= \p{General_Category= 6716 Separator}) (19) 6717 \p{Zanabazar_Square} \p{Script_Extensions=Zanabazar_Square} 6718 (Short: \p{Zanb}; NOT \p{Block= 6719 Zanabazar_Square}) (72) 6720 \p{Zanb} \p{Zanabazar_Square} (= 6721 \p{Script_Extensions=Zanabazar_Square}) 6722 (NOT \p{Block=Zanabazar_Square}) (72) 6723 \p{Zinh} \p{Inherited} (= \p{Script_Extensions= 6724 Inherited}) (502) 6725 \p{Zl} \p{Line_Separator} (= \p{General_Category= 6726 Line_Separator}) (1) 6727 \p{Zp} \p{Paragraph_Separator} (= 6728 \p{General_Category= 6729 Paragraph_Separator}) (1) 6730 \p{Zs} \p{Space_Separator} (= 6731 \p{General_Category=Space_Separator}) 6732 (17) 6733 \p{Zyyy} \p{Common} (= \p{Script_Extensions= 6734 Common}) (7386) 6735 \p{Zzzz} \p{Unknown} (= \p{Script_Extensions= 6736 Unknown}) (976_118 plus all above- 6737 Unicode code points) 6738 6739 6740 6741=head2 Legal C<\p{}> and C<\P{}> constructs that match no characters 6742 6743Unicode has some property-value pairs that currently don't match anything. 6744This happens generally either because they are obsolete, or they exist for 6745symmetry with other forms, but no language has yet been encoded that uses 6746them. In this version of Unicode, the following match zero code points: 6747 6748=over 4 6749 6750=item \p{Canonical_Combining_Class=Attached_Below_Left} 6751 6752=item \p{Canonical_Combining_Class=CCC133} 6753 6754=item \p{Grapheme_Cluster_Break=E_Base} 6755 6756=item \p{Grapheme_Cluster_Break=E_Base_GAZ} 6757 6758=item \p{Grapheme_Cluster_Break=E_Modifier} 6759 6760=item \p{Grapheme_Cluster_Break=Glue_After_Zwj} 6761 6762=item \p{Word_Break=E_Base} 6763 6764=item \p{Word_Break=E_Base_GAZ} 6765 6766=item \p{Word_Break=E_Modifier} 6767 6768=item \p{Word_Break=Glue_After_Zwj} 6769 6770=back 6771 6772 6773 6774=head1 Properties accessible through Unicode::UCD 6775 6776The value of any Unicode (not including Perl extensions) character 6777property mentioned above for any single code point is available through 6778L<Unicode::UCD/charprop()>. L<Unicode::UCD/charprops_all()> returns the 6779values of all the Unicode properties for a given code point. 6780 6781Besides these, all the Unicode character properties mentioned above 6782(except for those marked as for internal use by Perl) are also 6783accessible by L<Unicode::UCD/prop_invlist()>. 6784 6785Due to their nature, not all Unicode character properties are suitable for 6786regular expression matches, nor C<prop_invlist()>. The remaining 6787non-provisional, non-internal ones are accessible via 6788L<Unicode::UCD/prop_invmap()> (except for those that this Perl installation 6789hasn't included; see L<below for which those are|/Unicode character properties 6790that are NOT accepted by Perl>). 6791 6792For compatibility with other parts of Perl, all the single forms given in the 6793table in the L<section above|/Properties accessible through \p{} and \P{}> 6794are recognized. BUT, there are some ambiguities between some Perl extensions 6795and the Unicode properties, all of which are silently resolved in favor of the 6796official Unicode property. To avoid surprises, you should only use 6797C<prop_invmap()> for forms listed in the table below, which omits the 6798non-recommended ones. The affected forms are the Perl single form equivalents 6799of Unicode properties, such as C<\p{sc}> being a single-form equivalent of 6800C<\p{gc=sc}>, which is treated by C<prop_invmap()> as the C<Script> property, 6801whose short name is C<sc>. The table indicates the current ambiguities in the 6802INFO column, beginning with the word C<"NOT">. 6803 6804The standard Unicode properties listed below are documented in 6805L<http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is documented in 6806L<Unicode::UCD/prop_invmap()>. The other Perl extensions are in 6807L<perlunicode/Other Properties>; 6808 6809The first column in the table is a name for the property; the second column is 6810an alternative name, if any, plus possibly some annotations. The alternative 6811name is the property's full name, unless that would simply repeat the first 6812column, in which case the second column indicates the property's short name 6813(if different). The annotations are given only in the entry for the full 6814name. The annotations for binary properties include a list of the first few 6815ranges that the property matches. To avoid any ambiguity, the SPACE character 6816is represented as C<\x20>. 6817 6818If a property is obsolete, etc, the entry will be flagged with the same 6819characters used in the table in the L<section above|/Properties accessible 6820through \p{} and \P{}>, like B<D> or B<S>. 6821 6822 NAME INFO 6823 6824 Age 6825 AHex ASCII_Hex_Digit 6826 All (Perl extension). All code points, 6827 including those above Unicode. Same as 6828 qr/./s. U+0000..infinity 6829 Alnum XPosixAlnum. (Perl extension) 6830 Alpha Alphabetic 6831 Alphabetic (Short: Alpha). [A-Za-z\xaa\xb5\xba\xc0- 6832 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1, 6833 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE 6834 ... 6835 Any (Perl extension). All Unicode code 6836 points. U+0000..10FFFF 6837 ASCII Block=Basic_Latin. (Perl extension). 6838 [\x00-\x7f] 6839 ASCII_Hex_Digit (Short: AHex). [0-9A-Fa-f] 6840 Assigned (Perl extension). All assigned code 6841 points. U+0000..0377, U+037A..037F, 6842 U+0384..038A, U+038C, U+038E..03A1, 6843 U+03A3..052F ... 6844 Bc Bidi_Class 6845 Bidi_C Bidi_Control 6846 Bidi_Class (Short: bc) 6847 Bidi_Control (Short: Bidi_C). U+061C, U+200E..200F, 6848 U+202A..202E, U+2066..2069 6849 Bidi_M Bidi_Mirrored 6850 Bidi_Mirrored (Short: Bidi_M). 6851 [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D, 6852 U+169B..169C, U+2039..203A, U+2045..2046, 6853 U+207D..207E ... 6854 Bidi_Mirroring_Glyph (Short: bmg) 6855 Bidi_Paired_Bracket (Short: bpb) 6856 Bidi_Paired_Bracket_Type (Short: bpt) 6857 Blank XPosixBlank. (Perl extension) 6858 Blk Block 6859 Block (Short: blk) 6860 Bmg Bidi_Mirroring_Glyph 6861 Bpb Bidi_Paired_Bracket 6862 Bpt Bidi_Paired_Bracket_Type 6863 Canonical_Combining_Class (Short: ccc) 6864 Case_Folding (Short: cf) 6865 Case_Ignorable (Short: CI). 6866 [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8], 6867 U+02B0..036F, U+0374..0375, U+037A, 6868 U+0384..0385, U+0387 ... 6869 Cased [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8- 6870 \xff], U+0100..01BA, U+01BC..01BF, 6871 U+01C4..0293, U+0295..02B8, U+02C0..02C1 6872 ... 6873 Category General_Category 6874 Ccc Canonical_Combining_Class 6875 CE Composition_Exclusion 6876 Cf Case_Folding; NOT 'cf' meaning 6877 'General_Category=Format' 6878 Changes_When_Casefolded (Short: CWCF). [A-Z\xb5\xc0-\xd6\xd8- 6879 \xdf], U+0100, U+0102, U+0104, U+0106, 6880 U+0108 ... 6881 Changes_When_Casemapped (Short: CWCM). [A-Za-z\xb5\xc0-\xd6\xd8- 6882 \xf6\xf8-\xff], U+0100..0137, 6883 U+0139..018C, U+018E..019A, U+019C..01A9, 6884 U+01AC..01B9 ... 6885 Changes_When_Lowercased (Short: CWL). [A-Z\xc0-\xd6\xd8-\xde], 6886 U+0100, U+0102, U+0104, U+0106, U+0108 ... 6887 Changes_When_NFKC_Casefolded (Short: CWKCF). [A- 6888 Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8- 6889 \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf], U+0100, 6890 U+0102, U+0104, U+0106, U+0108 ... 6891 Changes_When_Titlecased (Short: CWT). [a-z\xb5\xdf-\xf6\xf8- 6892 \xff], U+0101, U+0103, U+0105, U+0107, 6893 U+0109 ... 6894 Changes_When_Uppercased (Short: CWU). [a-z\xb5\xdf-\xf6\xf8- 6895 \xff], U+0101, U+0103, U+0105, U+0107, 6896 U+0109 ... 6897 CI Case_Ignorable 6898 Cntrl XPosixCntrl (=General_Category=Control). 6899 (Perl extension) 6900 Comp_Ex Full_Composition_Exclusion 6901 Composition_Exclusion (Short: CE). U+0958..095F, U+09DC..09DD, 6902 U+09DF, U+0A33, U+0A36, U+0A59..0A5B ... 6903 CWCF Changes_When_Casefolded 6904 CWCM Changes_When_Casemapped 6905 CWKCF Changes_When_NFKC_Casefolded 6906 CWL Changes_When_Lowercased 6907 CWT Changes_When_Titlecased 6908 CWU Changes_When_Uppercased 6909 Dash [\-], U+058A, U+05BE, U+1400, U+1806, 6910 U+2010..2015 ... 6911 Decomposition_Mapping (Short: dm) 6912 Decomposition_Type (Short: dt) 6913 Default_Ignorable_Code_Point (Short: DI). [\xad], U+034F, U+061C, 6914 U+115F..1160, U+17B4..17B5, U+180B..180E 6915 ... 6916 Dep Deprecated 6917 Deprecated (Short: Dep). U+0149, U+0673, U+0F77, 6918 U+0F79, U+17A3..17A4, U+206A..206F ... 6919 DI Default_Ignorable_Code_Point 6920 Dia Diacritic 6921 Diacritic (Short: Dia). [\^`\xa8\xaf\xb4\xb7-\xb8], 6922 U+02B0..034E, U+0350..0357, U+035D..0362, 6923 U+0374..0375, U+037A ... 6924 Digit XPosixDigit (=General_Category= 6925 Decimal_Number). (Perl extension) 6926 Dm Decomposition_Mapping 6927 Dt Decomposition_Type 6928 Ea East_Asian_Width 6929 East_Asian_Width (Short: ea) 6930 EqUIdeo Equivalent_Unified_Ideograph 6931 Equivalent_Unified_Ideograph (Short: EqUIdeo) 6932 Ext Extender 6933 Extender (Short: Ext). [\xb7], U+02D0..02D1, 6934 U+0640, U+07FA, U+0E46, U+0EC6 ... 6935 Full_Composition_Exclusion (Short: Comp_Ex). U+0340..0341, 6936 U+0343..0344, U+0374, U+037E, U+0387, 6937 U+0958..095F ... 6938 Gc General_Category 6939 GCB Grapheme_Cluster_Break 6940 General_Category (Short: gc) 6941 Gr_Base Grapheme_Base 6942 Gr_Ext Grapheme_Extend 6943 Graph XPosixGraph. (Perl extension) 6944 Grapheme_Base (Short: Gr_Base). [\x20-\x7e\xa0- 6945 \xac\xae-\xff], U+0100..02FF, 6946 U+0370..0377, U+037A..037F, U+0384..038A, 6947 U+038C ... 6948 Grapheme_Cluster_Break (Short: GCB) 6949 Grapheme_Extend (Short: Gr_Ext). U+0300..036F, 6950 U+0483..0489, U+0591..05BD, U+05BF, 6951 U+05C1..05C2, U+05C4..05C5 ... 6952 Hangul_Syllable_Type (Short: hst) 6953 Hex Hex_Digit 6954 Hex_Digit (Short: Hex). [0-9A-Fa-f], U+FF10..FF19, 6955 U+FF21..FF26, U+FF41..FF46 6956 HorizSpace XPosixBlank. (Perl extension) 6957 Hst Hangul_Syllable_Type 6958 D Hyphen [\-\xad], U+058A, U+1806, U+2010..2011, 6959 U+2E17, U+30FB ... Supplanted by 6960 Line_Break property values; see 6961 www.unicode.org/reports/tr14 6962 ID_Continue (Short: IDC). [0-9A-Z_a- 6963 z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8- 6964 \xff], U+0100..02C1, U+02C6..02D1, 6965 U+02E0..02E4, U+02EC, U+02EE ... 6966 ID_Start (Short: IDS). [A-Za-z\xaa\xb5\xba\xc0- 6967 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1, 6968 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE 6969 ... 6970 IDC ID_Continue 6971 Ideo Ideographic 6972 Ideographic (Short: Ideo). U+3006..3007, 6973 U+3021..3029, U+3038..303A, U+3400..4DB5, 6974 U+4E00..9FEF, U+F900..FA6D ... 6975 IDS ID_Start 6976 IDS_Binary_Operator (Short: IDSB). U+2FF0..2FF1, U+2FF4..2FFB 6977 IDS_Trinary_Operator (Short: IDST). U+2FF2..2FF3 6978 IDSB IDS_Binary_Operator 6979 IDST IDS_Trinary_Operator 6980 In Present_In. (Perl extension) 6981 Indic_Positional_Category (Short: InPC) 6982 Indic_Syllabic_Category (Short: InSC) 6983 InPC Indic_Positional_Category 6984 InSC Indic_Syllabic_Category 6985 Isc ISO_Comment; NOT 'isc' meaning 6986 'General_Category=Other' 6987 ISO_Comment (Short: isc) 6988 Jg Joining_Group 6989 Join_C Join_Control 6990 Join_Control (Short: Join_C). U+200C..200D 6991 Joining_Group (Short: jg) 6992 Joining_Type (Short: jt) 6993 Jt Joining_Type 6994 Lb Line_Break 6995 Lc Lowercase_Mapping; NOT 'lc' meaning 6996 'General_Category=Cased_Letter' 6997 Line_Break (Short: lb) 6998 LOE Logical_Order_Exception 6999 Logical_Order_Exception (Short: LOE). U+0E40..0E44, U+0EC0..0EC4, 7000 U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9 7001 ... 7002 Lower Lowercase 7003 Lowercase (Short: Lower). [a-z\xaa\xb5\xba\xdf- 7004 \xf6\xf8-\xff], U+0101, U+0103, U+0105, 7005 U+0107, U+0109 ... 7006 Lowercase_Mapping (Short: lc) 7007 Math [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2, 7008 U+03D5, U+03F0..03F1, U+03F4..03F6, 7009 U+0606..0608 ... 7010 Na Name 7011 Na1 Unicode_1_Name 7012 Name (Short: na) 7013 Name_Alias 7014 NChar Noncharacter_Code_Point 7015 NFC_QC NFC_Quick_Check 7016 NFC_Quick_Check (Short: NFC_QC) 7017 NFD_QC NFD_Quick_Check 7018 NFD_Quick_Check (Short: NFD_QC) 7019 NFKC_Casefold (Short: NFKC_CF) 7020 NFKC_CF NFKC_Casefold 7021 NFKC_QC NFKC_Quick_Check 7022 NFKC_Quick_Check (Short: NFKC_QC) 7023 NFKD_QC NFKD_Quick_Check 7024 NFKD_Quick_Check (Short: NFKD_QC) 7025 Noncharacter_Code_Point (Short: NChar). U+FDD0..FDEF, 7026 U+FFFE..FFFF, U+1FFFE..1FFFF, 7027 U+2FFFE..2FFFF, U+3FFFE..3FFFF, 7028 U+4FFFE..4FFFF ... 7029 Nt Numeric_Type 7030 Numeric_Type (Short: nt) 7031 Numeric_Value (Short: nv) 7032 Nv Numeric_Value 7033 Pat_Syn Pattern_Syntax 7034 Pat_WS Pattern_White_Space 7035 Pattern_Syntax (Short: Pat_Syn). 7036 [!\"#\$\%&\'\(\)*+,\-.\/:;<= 7037 >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab- 7038 \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7], 7039 U+2010..2027, U+2030..203E, U+2041..2053, 7040 U+2055..205E, U+2190..245F ... 7041 Pattern_White_Space (Short: Pat_WS). [\t\n\cK\f\r\x20\x85], 7042 U+200E..200F, U+2028..2029 7043 PCM Prepended_Concatenation_Mark 7044 Perl_Decimal_Digit (Perl extension) 7045 PerlSpace PosixSpace. (Perl extension) 7046 PerlWord PosixWord. (Perl extension) 7047 PosixAlnum (Perl extension). [0-9A-Za-z] 7048 PosixAlpha (Perl extension). [A-Za-z] 7049 PosixBlank (Perl extension). [\t\x20] 7050 PosixCntrl (Perl extension). ASCII control 7051 characters. ACK, BEL, BS, CAN, CR, DC1, 7052 DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT, 7053 ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK, 7054 NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT 7055 PosixDigit (Perl extension). [0-9] 7056 PosixGraph (Perl extension). 7057 [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A- 7058 Z\[\\\]\^_`a-z\{\|\}~] 7059 PosixLower (Perl extension). [a-z] 7060 PosixPrint (Perl extension). [\x20-\x7e] 7061 PosixPunct (Perl extension). 7062 [!\"#\$\%&\'\(\)*+,\-.\/:;<= 7063 >?\@\[\\\]\^_`\{\|\}~] 7064 PosixSpace (Perl extension). [\t\n\cK\f\r\x20] 7065 PosixUpper (Perl extension). [A-Z] 7066 PosixWord (Perl extension). \w, restricted to 7067 ASCII. [0-9A-Z_a-z] 7068 PosixXDigit ASCII_Hex_Digit. (Perl extension). 7069 [0-9A-Fa-f] 7070 Prepended_Concatenation_Mark (Short: PCM). U+0600..0605, U+06DD, 7071 U+070F, U+08E2, U+110BD, U+110CD 7072 Present_In (Short: In). (Perl extension) 7073 Print XPosixPrint. (Perl extension) 7074 Punct General_Category=Punctuation. (Perl 7075 extension). 7076 [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_- 7077 \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf], 7078 U+037E, U+0387, U+055A..055F, 7079 U+0589..058A, U+05BE ... 7080 QMark Quotation_Mark 7081 Quotation_Mark (Short: QMark). [\"\'\xab\xbb], 7082 U+2018..201F, U+2039..203A, U+2E42, 7083 U+300C..300F, U+301D..301F ... 7084 Radical U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5 7085 Regional_Indicator (Short: RI). U+1F1E6..1F1FF 7086 RI Regional_Indicator 7087 SB Sentence_Break 7088 Sc Script; NOT 'sc' meaning 7089 'General_Category=Currency_Symbol' 7090 Scf Simple_Case_Folding 7091 Script (Short: sc) 7092 Script_Extensions (Short: scx) 7093 Scx Script_Extensions 7094 SD Soft_Dotted 7095 Sentence_Break (Short: SB) 7096 Sentence_Terminal (Short: STerm). [!.?], U+0589, 7097 U+061E..061F, U+06D4, U+0700..0702, U+07F9 7098 ... 7099 Sfc Simple_Case_Folding 7100 Simple_Case_Folding (Short: scf) 7101 Simple_Lowercase_Mapping (Short: slc) 7102 Simple_Titlecase_Mapping (Short: stc) 7103 Simple_Uppercase_Mapping (Short: suc) 7104 Slc Simple_Lowercase_Mapping 7105 Soft_Dotted (Short: SD). [i-j], U+012F, U+0249, 7106 U+0268, U+029D, U+02B2 ... 7107 Space White_Space 7108 SpacePerl XPosixSpace. (Perl extension) 7109 Stc Simple_Titlecase_Mapping 7110 STerm Sentence_Terminal 7111 Suc Simple_Uppercase_Mapping 7112 Tc Titlecase_Mapping 7113 Term Terminal_Punctuation 7114 Terminal_Punctuation (Short: Term). [!,.:;?], U+037E, U+0387, 7115 U+0589, U+05C3, U+060C ... 7116 Title Titlecase. (Perl extension) 7117 Titlecase (Short: Title). (Perl extension). (= 7118 \p{Gc=Lt}). U+01C5, U+01C8, U+01CB, 7119 U+01F2, U+1F88..1F8F, U+1F98..1F9F ... 7120 Titlecase_Mapping (Short: tc) 7121 Uc Uppercase_Mapping 7122 UIdeo Unified_Ideograph 7123 Unicode Any. (Perl extension) 7124 Unicode_1_Name (Short: na1) 7125 Unified_Ideograph (Short: UIdeo). U+3400..4DB5, 7126 U+4E00..9FEF, U+FA0E..FA0F, U+FA11, 7127 U+FA13..FA14, U+FA1F ... 7128 Upper Uppercase 7129 Uppercase (Short: Upper). [A-Z\xc0-\xd6\xd8-\xde], 7130 U+0100, U+0102, U+0104, U+0106, U+0108 ... 7131 Uppercase_Mapping (Short: uc) 7132 Variation_Selector (Short: VS). U+180B..180D, U+FE00..FE0F, 7133 U+E0100..E01EF 7134 Vertical_Orientation (Short: vo) 7135 VertSpace (Perl extension). \v. [\n\cK\f\r\x85], 7136 U+2028..2029 7137 Vo Vertical_Orientation 7138 VS Variation_Selector 7139 WB Word_Break 7140 White_Space (Short: WSpace). 7141 [\t\n\cK\f\r\x20\x85\xa0], U+1680, 7142 U+2000..200A, U+2028..2029, U+202F, U+205F 7143 ... 7144 Word XPosixWord. (Perl extension) 7145 Word_Break (Short: WB) 7146 WSpace White_Space 7147 XDigit XPosixXDigit (=Hex_Digit). (Perl 7148 extension) 7149 XID_Continue (Short: XIDC). [0-9A-Z_a- 7150 z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8- 7151 \xff], U+0100..02C1, U+02C6..02D1, 7152 U+02E0..02E4, U+02EC, U+02EE ... 7153 XID_Start (Short: XIDS). [A-Za-z\xaa\xb5\xba\xc0- 7154 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1, 7155 U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE 7156 ... 7157 XIDC XID_Continue 7158 XIDS XID_Start 7159 XPerlSpace XPosixSpace. (Perl extension) 7160 XPosixAlnum (Short: Alnum). (Perl extension). 7161 Alphabetic and (decimal) Numeric. [0-9A- 7162 Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8- 7163 \xff], U+0100..02C1, U+02C6..02D1, 7164 U+02E0..02E4, U+02EC, U+02EE ... 7165 XPosixAlpha Alphabetic. (Perl extension). [A-Za- 7166 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff], 7167 U+0100..02C1, U+02C6..02D1, U+02E0..02E4, 7168 U+02EC, U+02EE ... 7169 XPosixBlank (Short: Blank). (Perl extension). \h, 7170 Horizontal white space. [\t\x20\xa0], 7171 U+1680, U+2000..200A, U+202F, U+205F, 7172 U+3000 7173 XPosixCntrl General_Category=Control (Short: Cntrl). 7174 (Perl extension). Control characters. 7175 [\x00-\x1f\x7f-\x9f] 7176 XPosixDigit General_Category=Decimal_Number (Short: 7177 Digit). (Perl extension). [0-9] + all 7178 other decimal digits. [0-9], 7179 U+0660..0669, U+06F0..06F9, U+07C0..07C9, 7180 U+0966..096F, U+09E6..09EF ... 7181 XPosixGraph (Short: Graph). (Perl extension). 7182 Characters that are graphical. 7183 [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A- 7184 Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff], 7185 U+0100..0377, U+037A..037F, U+0384..038A, 7186 U+038C, U+038E..03A1 ... 7187 XPosixLower Lowercase. (Perl extension). [a- 7188 z\xaa\xb5\xba\xdf-\xf6\xf8-\xff], U+0101, 7189 U+0103, U+0105, U+0107, U+0109 ... 7190 XPosixPrint (Short: Print). (Perl extension). 7191 Characters that are graphical plus space 7192 characters (but no controls). [\x20- 7193 \x7e\xa0-\xff], U+0100..0377, 7194 U+037A..037F, U+0384..038A, U+038C, 7195 U+038E..03A1 ... 7196 XPosixPunct (Perl extension). \p{Punct} + ASCII-range 7197 \p{Symbol}. [!\"#\$\%&\'\(\)*+,\-.\/:;<= 7198 >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6- 7199 \xb7\xbb\xbf], U+037E, U+0387, 7200 U+055A..055F, U+0589..058A, U+05BE ... 7201 XPosixSpace (Perl extension). \s including beyond 7202 ASCII and vertical tab. 7203 [\t\n\cK\f\r\x20\x85\xa0], U+1680, 7204 U+2000..200A, U+2028..2029, U+202F, U+205F 7205 ... 7206 XPosixUpper Uppercase. (Perl extension). [A-Z\xc0- 7207 \xd6\xd8-\xde], U+0100, U+0102, U+0104, 7208 U+0106, U+0108 ... 7209 XPosixWord (Short: Word). (Perl extension). \w, 7210 including beyond ASCII; = \p{Alnum} + \pM 7211 + \p{Pc} + \p{Join_Control}. [0-9A-Z_a- 7212 z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff], 7213 U+0100..02C1, U+02C6..02D1, U+02E0..02E4, 7214 U+02EC, U+02EE ... 7215 XPosixXDigit Hex_Digit (Short: XDigit). (Perl 7216 extension). [0-9A-Fa-f], U+FF10..FF19, 7217 U+FF21..FF26, U+FF41..FF46 7218 7219 7220=head1 Properties accessible through other means 7221 7222Certain properties are accessible also via core function calls. These are: 7223 7224 Lowercase_Mapping lc() and lcfirst() 7225 Titlecase_Mapping ucfirst() 7226 Uppercase_Mapping uc() 7227 7228Also, Case_Folding is accessible through the C</i> modifier in regular 7229expressions, the C<\F> transliteration escape, and the C<L<fc|perlfunc/fc>> 7230operator. 7231 7232And, the Name and Name_Aliases properties are accessible through the C<\N{}> 7233interpolation in double-quoted strings and regular expressions; and functions 7234C<charnames::viacode()>, C<charnames::vianame()>, and 7235C<charnames::string_vianame()> (which require a C<use charnames ();> to be 7236specified. 7237 7238Finally, most properties related to decomposition are accessible via 7239L<Unicode::Normalize>. 7240 7241=head1 Unicode character properties that are NOT accepted by Perl 7242 7243Perl will generate an error for a few character properties in Unicode when 7244used in a regular expression. The non-Unihan ones are listed below, with the 7245reasons they are not accepted, perhaps with work-arounds. The short names for 7246the properties are listed enclosed in (parentheses). 7247As described after the list, an installation can change the defaults and choose 7248to accept any of these. The list is machine generated based on the 7249choices made for the installation that generated this document. 7250 7251 7252=over 4 7253 7254 7255 7256=item I<Expands_On_NFC> (XO_NFC) 7257 7258=item I<Expands_On_NFD> (XO_NFD) 7259 7260=item I<Expands_On_NFKC> (XO_NFKC) 7261 7262=item I<Expands_On_NFKD> (XO_NFKD) 7263 7264Deprecated by Unicode. These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used. For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character. 7265 7266 7267 7268=item I<Extended_Pictographic> (XPG) 7269 7270Not part of the Unicode Character Database 7271 7272 7273 7274=item I<Grapheme_Link> (Gr_Link) 7275 7276Duplicates ccc=vr (Canonical_Combining_Class=Virama) 7277 7278 7279 7280=item I<Jamo_Short_Name> (JSN) 7281 7282=item I<Other_Alphabetic> (OAlpha) 7283 7284=item I<Other_Default_Ignorable_Code_Point> (ODI) 7285 7286=item I<Other_Grapheme_Extend> (OGr_Ext) 7287 7288=item I<Other_ID_Continue> (OIDC) 7289 7290=item I<Other_ID_Start> (OIDS) 7291 7292=item I<Other_Lowercase> (OLower) 7293 7294=item I<Other_Math> (OMath) 7295 7296=item I<Other_Uppercase> (OUpper) 7297 7298Used by Unicode internally for generating other properties and not intended to be used stand-alone 7299 7300 7301 7302=item I<Script=Katakana_Or_Hiragana> (sc=Hrkt) 7303 7304Obsolete. All code points previously matched by this have been moved to "Script=Common". Consider instead using "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or both) 7305 7306 7307 7308=item I<Script_Extensions=Katakana_Or_Hiragana> (scx=Hrkt) 7309 7310All code points that would be matched by this are matched by either "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" 7311 7312=back 7313 7314 7315An installation can choose to allow any of these to be matched by downloading 7316the Unicode database from L<http://www.unicode.org/Public/> to 7317C<$Config{privlib}>/F<unicore/> in the Perl source tree, changing the 7318controlling lists contained in the program 7319C<$Config{privlib}>/F<unicore/mktables> and then re-compiling and installing. 7320(C<%Config> is available from the Config module). 7321 7322Also, perl can be recompiled to operate on an earlier version of the Unicode 7323standard. Further information is at 7324C<$Config{privlib}>/F<unicore/README.perl>. 7325 7326=head1 Other information in the Unicode data base 7327 7328The Unicode data base is delivered in two different formats. The XML version 7329is valid for more modern Unicode releases. The other version is a collection 7330of files. The two are intended to give equivalent information. Perl uses the 7331older form; this allows you to recompile Perl to use early Unicode releases. 7332 7333The only non-character property that Perl currently supports is Named 7334Sequences, in which a sequence of code points 7335is given a name and generally treated as a single entity. (Perl supports 7336these via the C<\N{...}> double-quotish construct, 7337L<charnames/charnames::string_vianame(name)>, and L<Unicode::UCD/namedseq()>. 7338 7339Below is a list of the files in the Unicode data base that Perl doesn't 7340currently use, along with very brief descriptions of their purposes. 7341Some of the names of the files have been shortened from those that Unicode 7342uses, in order to allow them to be distinguishable from similarly named files 7343on file systems for which only the first 8 characters of a name are 7344significant. 7345 7346=over 4 7347 7348 7349 7350 7351=item F<auxiliary/GraphemeBreakTest.html> 7352 7353=item F<auxiliary/LineBreakTest.html> 7354 7355=item F<auxiliary/SentenceBreakTest.html> 7356 7357=item F<auxiliary/WordBreakTest.html> 7358 7359Documentation of validation Tests 7360 7361 7362 7363=item F<BidiCharacterTest.txt> 7364 7365=item F<BidiTest.txt> 7366 7367=item F<NormTest.txt> 7368 7369Validation Tests 7370 7371 7372 7373=item F<CJKRadicals.txt> 7374 7375Maps the kRSUnicode property values to corresponding code points 7376 7377 7378 7379=item F<EmojiSources.txt> 7380 7381Maps certain Unicode code points to their legacy Japanese cell-phone values 7382 7383 7384 7385=item F<extracted/DName.txt> 7386 7387This file adds no new information not already present in other files 7388 7389 7390 7391=item F<Index.txt> 7392 7393Alphabetical index of Unicode characters 7394 7395 7396 7397=item F<NamedSqProv.txt> 7398 7399Named sequences proposed for inclusion in a later version of the Unicode Standard; if you need them now, you can append this file to F<NamedSequences.txt> and recompile perl 7400 7401 7402 7403=item F<NamesList.html> 7404 7405Describes the format and contents of F<NamesList.txt> 7406 7407 7408 7409=item F<NamesList.txt> 7410 7411Annotated list of characters 7412 7413 7414 7415=item F<NormalizationCorrections.txt> 7416 7417Documentation of corrections already incorporated into the Unicode data base 7418 7419 7420 7421=item F<NushuSources.txt> 7422 7423Specifies source material for Nushu characters 7424 7425 7426 7427=item F<ReadMe.txt> 7428 7429Documentation 7430 7431 7432 7433=item F<StandardizedVariants.html> 7434 7435Obsoleted as of Unicode 9.0, but previously provided a visual display of the standard variant sequences derived from F<StandardizedVariants.txt>. 7436 7437 7438 7439=item F<StandardizedVariants.txt> 7440 7441Certain glyph variations for character display are standardized. This lists the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a separate Unicode data base L<http://www.unicode.org/ivd> 7442 7443 7444 7445=item F<TangutSources.txt> 7446 7447Specifies source mappings for Tangut ideographs and components. This data file also includes informative radical-stroke values that are used internally by Unicode 7448 7449 7450 7451=item F<USourceData.txt> 7452 7453Documentation of status and cross reference of proposals for encoding by Unicode of Unihan characters 7454 7455 7456 7457=item F<USourceGlyphs.pdf> 7458 7459Pictures of the characters in F<USourceData.txt> 7460 7461 7462=back 7463 7464=head1 SEE ALSO 7465 7466L<http://www.unicode.org/reports/tr44/> 7467 7468L<perlrecharclass> 7469 7470L<perlunicode> 7471 7472