=begin comment

# !!!!!!! DO NOT EDIT THIS FILE !!!!!!!
# This file is machine-generated by lib/unicore/mktables from the Unicode
# database, Version 15.0.0. Any changes made here will be lost!


To change this file, edit lib/unicore/mktables instead.

=end comment

=head1 NAME

perluniprops - Index of Unicode Version 15.0.0 character properties in Perl

=head1 DESCRIPTION

This document provides information about the portion of the Unicode database
that deals with character properties, that is the portion that is defined on
single code points. (L</Other information in the Unicode data base>
below briefly mentions other data that Unicode provides.)

Perl can provide access to all non-provisional Unicode character properties,
though not all are enabled by default. The omitted ones are the Unihan
properties and certain
deprecated or Unicode-internal properties. (An installation may choose to
recompile Perl's tables to change this. See L</Unicode character
properties that are NOT accepted by Perl>.)

For most purposes, access to Unicode properties from the Perl core is through
regular expression matches, as described in the next section.
For some special purposes, and to access the properties that are not suitable
for regular expression matching, all the Unicode character properties that
Perl handles are accessible via the standard L<Unicode::UCD> module, as
described in the section L</Properties accessible through Unicode::UCD>.

Perl also provides some additional extensions and short-cut synonyms
for Unicode properties.

This document merely lists all available properties and does not attempt to
explain what each property really means. There is a brief description of each
Perl extension; see L<perlunicode/Other Properties> for more information on
these.
There is some detail about Blocks, Scripts, General_Category,
and Bidi_Class in L<perlunicode>, but to find out about the intricacies of the
official Unicode properties, refer to the Unicode standard. A good starting
place is L<http://www.unicode.org/reports/tr44/>.

Note that you can define your own properties; see
L<perlunicode/"User-Defined Character Properties">.

=head1 Properties accessible through C<\p{}> and C<\P{}>

The Perl regular expression C<\p{}> and C<\P{}> constructs give access to
most of the Unicode character properties. The table below shows all these
constructs, both single and compound forms.

B<Compound forms> consist of two components, separated by an equals sign or a
colon. The first component is the property name, and the second component is
the particular value of the property to match against, for example,
C<\p{Script_Extensions: Greek}> and C<\p{Script_Extensions=Greek}> both mean
to match characters whose Script_Extensions property value is Greek.
(C<Script_Extensions> is an improved version of the C<Script> property.)

B<Single forms>, like C<\p{Greek}>, are mostly Perl-defined shortcuts for
their equivalent compound forms. The table shows these equivalences. (In our
example, C<\p{Greek}> is just a shortcut for
C<\p{Script_Extensions=Greek}>.) There are also a few Perl-defined single
forms that are not shortcuts for a compound form. One such is C<\p{Word}>.
These are also listed in the table.

In parsing these constructs, Perl always ignores upper/lower case differences
everywhere within the {braces}. Thus C<\p{Greek}> means the same thing as
C<\p{greek}>. But note that changing the case of the C<"p"> or C<"P"> before
the left brace completely changes the meaning of the construct, from "match"
(for C<\p{}>) to "doesn't match" (for C<\P{}>). Casing in this document is
for improved legibility.
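
As a minimal sketch of the forms just described, the following script checks
one Greek letter against the compound form (with both separators), the single
form, a lower-cased spelling of the name, and the negated C<\P{}> form. The
sample code points and the C<%results> hash are illustrative choices made
here, not something taken from the table.

```perl
use strict;
use warnings;

# Illustrative sample: U+03B1 GREEK SMALL LETTER ALPHA, written as an escape
# so that the source file itself stays pure ASCII.
my $alpha = "\x{3B1}";

my %results = (
    # Compound form, property name and value separated by '=' or ':'
    compound_equals => ($alpha =~ /\p{Script_Extensions=Greek}/  ? 1 : 0),
    compound_colon  => ($alpha =~ /\p{Script_Extensions: Greek}/ ? 1 : 0),

    # Single form, a Perl shortcut for the compound form above
    single_form     => ($alpha =~ /\p{Greek}/ ? 1 : 0),

    # Case inside the braces is ignored
    lowercase_name  => ($alpha =~ /\p{greek}/ ? 1 : 0),

    # Upper-case P negates: "is not Greek"
    negated         => ($alpha =~ /\P{Greek}/ ? 1 : 0),

    # A Latin letter for contrast
    latin_a_greek   => ('a' =~ /\p{Greek}/ ? 1 : 0),
);

print "$_: $results{$_}\n" for sort keys %results;
```

The first four tests are expected to succeed and the last two to fail, since
only the case of the C<"p"> before the brace changes the sense of the match.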

Also, white space, hyphens, and underscores are normally ignored
everywhere between the {braces}, and hence can be freely added or removed
even if the C</x> modifier hasn't been specified on the regular expression.
But in the table below a 'B<T>' at the beginning of an entry
means that tighter (stricter) rules are used for that entry:

=over 4

=over 4

=item Single form (C<\p{name}>) tighter rules:

White space, hyphens, and underscores ARE significant
except for:

=over 4

=item * white space adjacent to a non-word character

=item * underscores separating digits in numbers

=back

That means, for example, that you can freely add or remove white space
adjacent to (but within) the braces without affecting the meaning.

=item Compound form (C<\p{name=value}> or C<\p{name:value}>) tighter rules:

The tighter rules given above for the single form apply to everything to the
right of the colon or equals; the looser rules still apply to everything to
the left.

That means, for example, that you can freely add or remove white space
adjacent to (but within) the braces and the colon or equal sign.

=back

=back

Some properties are considered obsolete by Unicode, but are still available.
There are several varieties of obsolescence:

=over 4

=over 4

=item Stabilized

A property may be stabilized. Such a determination does not indicate
that the property should or should not be used; instead it is a declaration
that the property will not be maintained nor extended for newly encoded
characters. Such properties are marked with an 'B<S>' in the
table.

=item Deprecated

A property may be deprecated, perhaps because its original intent
has been replaced by another property, or because its specification was
somehow defective.
This means that its use is strongly
discouraged, so much so that a warning will be issued if used, unless the
regular expression is in the scope of a C<S<no warnings 'deprecated'>>
statement. A 'B<D>' flags each such entry in the table, and
the entry there for the longest, most descriptive version of the property will
give the reason it is deprecated, and perhaps advice. Perl may issue such a
warning, even for properties that aren't officially deprecated by Unicode,
when there used to be characters or code points that were matched by them, but
no longer. This is to warn you that your program may not work like it did on
earlier Unicode releases.

A deprecated property may be made unavailable in a future Perl version, so it
is best to move away from such properties.

A deprecated property may also be stabilized, but this fact is not shown.

=item Obsolete

Properties marked with an 'B<O>' in the table are considered (plain)
obsolete. Generally this designation is given to properties that Unicode once
used for internal purposes (but not any longer).

=item Discouraged

This is not actually a Unicode-specified obsolescence, but applies to certain
Perl extensions that are present for backwards compatibility, but are
discouraged from being used. These are not obsolete, but their meanings are
not stable. Future Unicode versions could force any of these extensions to be
removed without warning, replaced by another property with the same name that
means something different. An 'B<X>' flags each such entry in the
table. Use the equivalent shown instead.


In particular, matches in the Block property have single forms
defined by Perl that begin with C<"In_">, C<"Is_">, or even with no prefix at
all. Like all B<DISCOURAGED> forms, these are not stable. For example,
C<\p{Block=Deseret}> can currently be written as C<\p{In_Deseret}>,
C<\p{Is_Deseret}>, or C<\p{Deseret}>.
But, a new Unicode version may
come along that would force Perl to change the meaning of one or more of
these, and your program would no longer be correct. Currently there are no
such conflicts with the form that begins C<"In_">, but there are many with the
other two shortcuts, and Unicode continues to define new properties that begin
with C<"In">, so it's quite possible that a conflict will occur in the future.
The compound form is guaranteed to not become obsolete, and its meaning is
clearer anyway. See L<perlunicode/"Blocks"> for more information about this.

User-defined properties must begin with "In" or "Is". These override any
Unicode property of the same name.


=back

=back

The table below has two columns. The left column contains the C<\p{}>
constructs to look up, possibly preceded by the flags mentioned above; and
the right column contains information about them, like a description, or
synonyms. The table shows both the single and compound forms for each
property that has them. If the left column is a short name for a property,
the right column will give its longer, more descriptive name; and if the left
column is the longest name, the right column will show any equivalent shortest
name, in both single and compound forms if applicable.

If braces are not needed to specify a property (e.g., C<\pL>), the left
column contains both forms, with and without braces.

The right column will also caution you if a property means something different
than what might normally be expected.

All single forms are Perl extensions; a few compound forms are as well, and
are noted as such.

Numbers in (parentheses) indicate the total number of Unicode code points
matched by the property. For the entries that give the longest, most
descriptive version of the property, the count is followed by a list of some
of the code points matched by it.
The list includes all the matched
characters in the 0-255 range, enclosed in the familiar [brackets] the same as
a regular expression bracketed character class. Following that, the next few
higher matching ranges are also given. To avoid visual ambiguity, the SPACE
character is represented as C<\x20>.

For emphasis, those properties that match no code points at all are listed as
well in a separate section following the table.

Most properties match the same code points regardless of whether C<"/i">
case-insensitive matching is specified or not. But a few properties are
affected. These are shown with the notation S<C<(/i= I<other_property>)>>
in the second column. Under case-insensitive matching they match the
same code points as the property I<other_property>.

There is no description given for most non-Perl defined properties (see
L<http://www.unicode.org/reports/tr44/> for that).

For compactness, 'B<*>' is used as a wildcard instead of showing all possible
combinations. For example, entries like:

    \p{Gc: *}                   \p{General_Category: *}

mean that 'Gc' is a synonym for 'General_Category', and anything that is valid
for the latter is also valid for the former. Similarly,

    \p{Is_*}                    \p{*}

means that if and only if, for example, C<\p{Foo}> exists, then
C<\p{Is_Foo}> and C<\p{IsFoo}> are also valid and all mean the same thing.
And similarly, C<\p{Foo=Bar}> means the same as C<\p{Is_Foo=Bar}> and
C<\p{IsFoo=Bar}>. "*" here is restricted to something not beginning with an
underscore.

Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'.
And 'No', 'F', and 'False' are all synonyms for 'N'. The table shows 'Y*' and
'N*' to indicate this, and doesn't have separate entries for the other
possibilities.
Note that not all properties which have values 'Yes' and 'No'
are binary. The non-binary ones have all their values spelled out without
using this wild card, and have a C<NOT> clause in their description that
highlights their not being binary. These also require the compound form to
match them, whereas true binary properties have both single and compound
forms available.

Note that all non-essential underscores are removed in the display of the
short names below.

B<Legend summary:>

=over 4

=item Z<>B<*> is a wild-card

=item B<(\d+)> in the info column gives the number of Unicode code points matched
by this property.

=item B<D> means this is deprecated.

=item B<O> means this is obsolete.

=item B<S> means this is stabilized.

=item B<T> means tighter (stricter) name matching applies.

=item B<X> means use of this form is discouraged, and may not be
stable.

=back

   NAME                   INFO

   \p{Adlam}              \p{Script_Extensions=Adlam} (Short:
                            \p{Adlm}; NOT \p{Block=Adlam}) (90)
   \p{Adlm}               \p{Adlam} (= \p{Script_Extensions=Adlam})
                            (NOT \p{Block=Adlam}) (90)
 X \p{Aegean_Numbers}     \p{Block=Aegean_Numbers} (64)
 T \p{Age: 1.1}           \p{Age=V1_1} (33_979)
   \p{Age: V1_1}          Code point's usage introduced in version
                            1.1 (33_979: U+0000..01F5, U+01FA..0217,
                            U+0250..02A8, U+02B0..02DE,
                            U+02E0..02E9, U+0300..0345 ...)
 T \p{Age: 2.0}           \p{Age=V2_0} (144_521)
   \p{Age: V2_0}          Code point's usage was introduced in
                            version 2.0; See also Property
                            'Present_In' (144_521: U+0591..05A1,
                            U+05A3..05AF, U+05C4, U+0F00..0F47,
                            U+0F49..0F69, U+0F71..0F8B ...)
 T \p{Age: 2.1}           \p{Age=V2_1} (2)
   \p{Age: V2_1}          Code point's usage was introduced in
                            version 2.1; See also Property
                            'Present_In' (2: U+20AC, U+FFFC)
 T \p{Age: 3.0}           \p{Age=V3_0} (10_307)
   \p{Age: V3_0}          Code point's usage was introduced in
                            version 3.0; See also Property
                            'Present_In' (10_307: U+01F6..01F9,
                            U+0218..021F, U+0222..0233,
                            U+02A9..02AD, U+02DF, U+02EA..02EE ...)
 T \p{Age: 3.1}           \p{Age=V3_1} (44_978)
   \p{Age: V3_1}          Code point's usage was introduced in
                            version 3.1; See also Property
                            'Present_In' (44_978: U+03F4..03F5,
                            U+FDD0..FDEF, U+10300..1031E,
                            U+10320..10323, U+10330..1034A,
                            U+10400..10425 ...)
 T \p{Age: 3.2}           \p{Age=V3_2} (1016)
   \p{Age: V3_2}          Code point's usage was introduced in
                            version 3.2; See also Property
                            'Present_In' (1016: U+0220, U+034F,
                            U+0363..036F, U+03D8..03D9, U+03F6,
                            U+048A..048B ...)
 T \p{Age: 4.0}           \p{Age=V4_0} (1226)
   \p{Age: V4_0}          Code point's usage was introduced in
                            version 4.0; See also Property
                            'Present_In' (1226: U+0221,
                            U+0234..0236, U+02AE..02AF,
                            U+02EF..02FF, U+0350..0357, U+035D..035F
                            ...)
 T \p{Age: 4.1}           \p{Age=V4_1} (1273)
   \p{Age: V4_1}          Code point's usage was introduced in
                            version 4.1; See also Property
                            'Present_In' (1273: U+0237..0241,
                            U+0358..035C, U+03FC..03FF,
                            U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
 T \p{Age: 5.0}           \p{Age=V5_0} (1369)
   \p{Age: V5_0}          Code point's usage was introduced in
                            version 5.0; See also Property
                            'Present_In' (1369: U+0242..024F,
                            U+037B..037D, U+04CF, U+04FA..04FF,
                            U+0510..0513, U+05BA ...)
 T \p{Age: 5.1}           \p{Age=V5_1} (1624)
   \p{Age: V5_1}          Code point's usage was introduced in
                            version 5.1; See also Property
                            'Present_In' (1624: U+0370..0373,
                            U+0376..0377, U+03CF, U+0487,
                            U+0514..0523, U+0606..060A ...)
 T \p{Age: 5.2}           \p{Age=V5_2} (6648)
   \p{Age: V5_2}          Code point's usage was introduced in
                            version 5.2; See also Property
                            'Present_In' (6648: U+0524..0525,
                            U+0800..082D, U+0830..083E, U+0900,
                            U+094E, U+0955 ...)
 T \p{Age: 6.0}           \p{Age=V6_0} (2088)
   \p{Age: V6_0}          Code point's usage was introduced in
                            version 6.0; See also Property
                            'Present_In' (2088: U+0526..0527,
                            U+0620, U+065F, U+0840..085B, U+085E,
                            U+093A..093B ...)
 T \p{Age: 6.1}           \p{Age=V6_1} (732)
   \p{Age: V6_1}          Code point's usage was introduced in
                            version 6.1; See also Property
                            'Present_In' (732: U+058F, U+0604,
                            U+08A0, U+08A2..08AC, U+08E4..08FE,
                            U+0AF0 ...)
 T \p{Age: 6.2}           \p{Age=V6_2} (1)
   \p{Age: V6_2}          Code point's usage was introduced in
                            version 6.2; See also Property
                            'Present_In' (1: U+20BA)
 T \p{Age: 6.3}           \p{Age=V6_3} (5)
   \p{Age: V6_3}          Code point's usage was introduced in
                            version 6.3; See also Property
                            'Present_In' (5: U+061C, U+2066..2069)
 T \p{Age: 7.0}           \p{Age=V7_0} (2834)
   \p{Age: V7_0}          Code point's usage was introduced in
                            version 7.0; See also Property
                            'Present_In' (2834: U+037F,
                            U+0528..052F, U+058D..058E, U+0605,
                            U+08A1, U+08AD..08B2 ...)
 T \p{Age: 8.0}           \p{Age=V8_0} (7716)
   \p{Age: V8_0}          Code point's usage was introduced in
                            version 8.0; See also Property
                            'Present_In' (7716: U+08B3..08B4,
                            U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
                            ...)
 T \p{Age: 9.0}           \p{Age=V9_0} (7500)
   \p{Age: V9_0}          Code point's usage was introduced in
                            version 9.0; See also Property
                            'Present_In' (7500: U+08B6..08BD,
                            U+08D4..08E2, U+0C80, U+0D4F,
                            U+0D54..0D56, U+0D58..0D5E ...)
 T \p{Age: 10.0}          \p{Age=V10_0} (8518)
   \p{Age: V10_0}         Code point's usage was introduced in
                            version 10.0; See also Property
                            'Present_In' (8518: U+0860..086A,
                            U+09FC..09FD, U+0AFA..0AFF, U+0D00,
                            U+0D3B..0D3C, U+1CF7 ...)
 T \p{Age: 11.0}          \p{Age=V11_0} (684)
   \p{Age: V11_0}         Code point's usage was introduced in
                            version 11.0; See also Property
                            'Present_In' (684: U+0560, U+0588,
                            U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
 T \p{Age: 12.0}          \p{Age=V12_0} (554)
   \p{Age: V12_0}         Code point's usage was introduced in
                            version 12.0; See also Property
                            'Present_In' (554: U+0C77, U+0E86,
                            U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
 T \p{Age: 12.1}          \p{Age=V12_1} (1)
   \p{Age: V12_1}         Code point's usage was introduced in
                            version 12.1; See also Property
                            'Present_In' (1: U+32FF)
 T \p{Age: 13.0}          \p{Age=V13_0} (5930)
   \p{Age: V13_0}         Code point's usage was introduced in
                            version 13.0; See also Property
                            'Present_In' (5930: U+08BE..08C7,
                            U+0B55, U+0D04, U+0D81, U+1ABF..1AC0,
                            U+2B97 ...)
 T \p{Age: 14.0}          \p{Age=V14_0} (838)
   \p{Age: V14_0}         Code point's usage was introduced in
                            version 14.0; See also Property
                            'Present_In' (838: U+061D, U+0870..088E,
                            U+0890..0891, U+0898..089F, U+08B5,
                            U+08C8..08D2 ...)
 T \p{Age: 15.0}          \p{Age=V15_0} (4489)
   \p{Age: V15_0}         Code point's usage was introduced in
                            version 15.0; See also Property
                            'Present_In' (4489: U+0CF3, U+0ECE,
                            U+10EFD..10EFF, U+1123F..11241,
                            U+11B00..11B09, U+11F00..11F10 ...)
   \p{Age: NA}            \p{Age=Unassigned} (825_279 plus all
                            above-Unicode code points)
   \p{Age: Unassigned}    Code point's usage has not been assigned
                            in any Unicode release thus far.
                            (Short: \p{Age=NA}) (825_279 plus all above-Unicode code points:
                            U+0378..0379, U+0380..0383, U+038B,
                            U+038D, U+03A2, U+0530 ...)
   \p{Aghb}               \p{Caucasian_Albanian} (=
                            \p{Script_Extensions=
                            Caucasian_Albanian}) (NOT \p{Block=
                            Caucasian_Albanian}) (53)
   \p{AHex}               \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                            (22)
   \p{AHex: *}            \p{ASCII_Hex_Digit: *}
   \p{Ahom}               \p{Script_Extensions=Ahom} (NOT \p{Block=
                            Ahom}) (65)
 X \p{Alchemical}         \p{Alchemical_Symbols} (= \p{Block=
                            Alchemical_Symbols}) (128)
 X \p{Alchemical_Symbols} \p{Block=Alchemical_Symbols} (Short:
                            \p{InAlchemical}) (128)
   \p{All}                All code points, including those above
                            Unicode. Same as qr/./s (1_114_112 plus
                            all above-Unicode code points:
                            U+0000..infinity)
   \p{Alnum}              \p{XPosixAlnum} (138_445)
   \p{Alpha}              \p{XPosixAlpha} (= \p{Alphabetic=Y})
                            (137_765)
   \p{Alpha: *}           \p{Alphabetic: *}
   \p{Alphabetic}         \p{XPosixAlpha} (= \p{Alphabetic=Y})
                            (137_765)
   \p{Alphabetic: N*}     (Short: \p{Alpha=N}, \P{Alpha}) (976_347
                            plus all above-Unicode code points:
                            [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
                            \xb6-\xb9\xbb-\xbf\xd7\xf7],
                            U+02C2..02C5, U+02D2..02DF,
                            U+02E5..02EB, U+02ED, U+02EF..0344 ...)
   \p{Alphabetic: Y*}     (Short: \p{Alpha=Y}, \p{Alpha}) (137_765:
                            [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
                            \xf8-\xff], U+0100..02C1, U+02C6..02D1,
                            U+02E0..02E4, U+02EC, U+02EE ...)
 X \p{Alphabetic_PF}      \p{Alphabetic_Presentation_Forms} (=
                            \p{Block=Alphabetic_Presentation_Forms})
                            (80)
 X \p{Alphabetic_Presentation_Forms} \p{Block=
                            Alphabetic_Presentation_Forms} (Short:
                            \p{InAlphabeticPF}) (80)
   \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
                            Anatolian_Hieroglyphs} (Short: \p{Hluw};
                            NOT \p{Block=Anatolian_Hieroglyphs})
                            (583)
 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
                            \p{Block=
                            Ancient_Greek_Musical_Notation}) (80)
 X \p{Ancient_Greek_Musical_Notation} \p{Block=
                            Ancient_Greek_Musical_Notation} (Short:
                            \p{InAncientGreekMusic}) (80)
 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
 X \p{Ancient_Symbols}    \p{Block=Ancient_Symbols} (64)
   \p{Any}                All Unicode code points (1_114_112:
                            U+0000..10FFFF)
   \p{Arab}               \p{Arabic} (= \p{Script_Extensions=
                            Arabic}) (NOT \p{Block=Arabic}) (1414)
   \p{Arabic}             \p{Script_Extensions=Arabic} (Short:
                            \p{Arab}; NOT \p{Block=Arabic}) (1414)
 X \p{Arabic_Ext_A}       \p{Arabic_Extended_A} (= \p{Block=
                            Arabic_Extended_A}) (96)
 X \p{Arabic_Ext_B}       \p{Arabic_Extended_B} (= \p{Block=
                            Arabic_Extended_B}) (48)
 X \p{Arabic_Ext_C}       \p{Arabic_Extended_C} (= \p{Block=
                            Arabic_Extended_C}) (64)
 X \p{Arabic_Extended_A}  \p{Block=Arabic_Extended_A} (Short:
                            \p{InArabicExtA}) (96)
 X \p{Arabic_Extended_B}  \p{Block=Arabic_Extended_B} (Short:
                            \p{InArabicExtB}) (48)
 X \p{Arabic_Extended_C}  \p{Block=Arabic_Extended_C} (Short:
                            \p{InArabicExtC}) (64)
 X \p{Arabic_Math}        \p{Arabic_Mathematical_Alphabetic_Symbols}
                            (= \p{Block=
                            Arabic_Mathematical_Alphabetic_Symbols})
                            (256)
 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
                            Arabic_Mathematical_Alphabetic_Symbols}
                            (Short: \p{InArabicMath}) (256)
 X \p{Arabic_PF_A}        \p{Arabic_Presentation_Forms_A} (=
                            \p{Block=Arabic_Presentation_Forms_A})
                            (688)
 X \p{Arabic_PF_B}        \p{Arabic_Presentation_Forms_B} (=
                            \p{Block=Arabic_Presentation_Forms_B})
                            (144)
 X \p{Arabic_Presentation_Forms_A} \p{Block=
                            Arabic_Presentation_Forms_A} (Short:
                            \p{InArabicPFA}) (688)
 X \p{Arabic_Presentation_Forms_B} \p{Block=
                            Arabic_Presentation_Forms_B} (Short:
                            \p{InArabicPFB}) (144)
 X \p{Arabic_Sup}         \p{Arabic_Supplement} (= \p{Block=
                            Arabic_Supplement}) (48)
 X \p{Arabic_Supplement}  \p{Block=Arabic_Supplement} (Short:
                            \p{InArabicSup}) (48)
   \p{Armenian}           \p{Script_Extensions=Armenian} (Short:
                            \p{Armn}; NOT \p{Block=Armenian}) (96)
   \p{Armi}               \p{Imperial_Aramaic} (=
                            \p{Script_Extensions=Imperial_Aramaic})
                            (NOT \p{Block=Imperial_Aramaic}) (31)
   \p{Armn}               \p{Armenian} (= \p{Script_Extensions=
                            Armenian}) (NOT \p{Block=Armenian}) (96)
 X \p{Arrows}             \p{Block=Arrows} (112)
   \p{ASCII}              \p{Block=Basic_Latin} (128)
   \p{ASCII_Hex_Digit}    \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                            (22)
   \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
                            plus all above-Unicode code points:
                            [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
                            \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
                            U+0100..infinity)
   \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
                            Fa-f])
   \p{Assigned}           All assigned code points (288_767:
                            U+0000..0377, U+037A..037F,
                            U+0384..038A, U+038C, U+038E..03A1,
                            U+03A3..052F ...)
   \p{Avestan}            \p{Script_Extensions=Avestan} (Short:
                            \p{Avst}; NOT \p{Block=Avestan}) (61)
   \p{Avst}               \p{Avestan} (= \p{Script_Extensions=
                            Avestan}) (NOT \p{Block=Avestan}) (61)
   \p{Bali}               \p{Balinese} (= \p{Script_Extensions=
                            Balinese}) (NOT \p{Block=Balinese}) (124)
   \p{Balinese}           \p{Script_Extensions=Balinese} (Short:
                            \p{Bali}; NOT \p{Block=Balinese}) (124)
   \p{Bamu}               \p{Bamum} (= \p{Script_Extensions=Bamum})
                            (NOT \p{Block=Bamum}) (657)
   \p{Bamum}              \p{Script_Extensions=Bamum} (Short:
                            \p{Bamu}; NOT \p{Block=Bamum}) (657)
 X \p{Bamum_Sup}          \p{Bamum_Supplement} (= \p{Block=
                            Bamum_Supplement}) (576)
 X \p{Bamum_Supplement}   \p{Block=Bamum_Supplement} (Short:
                            \p{InBamumSup}) (576)
 X \p{Basic_Latin}        \p{ASCII} (= \p{Block=Basic_Latin}) (128)
   \p{Bass}               \p{Bassa_Vah} (= \p{Script_Extensions=
                            Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
                            (36)
   \p{Bassa_Vah}          \p{Script_Extensions=Bassa_Vah} (Short:
                            \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
   \p{Batak}              \p{Script_Extensions=Batak} (Short:
                            \p{Batk}; NOT \p{Block=Batak}) (56)
   \p{Batk}               \p{Batak} (= \p{Script_Extensions=Batak})
                            (NOT \p{Block=Batak}) (56)
   \p{Bc: *}              \p{Bidi_Class: *}
   \p{Beng}               \p{Bengali} (= \p{Script_Extensions=
                            Bengali}) (NOT \p{Block=Bengali}) (113)
   \p{Bengali}            \p{Script_Extensions=Bengali} (Short:
                            \p{Beng}; NOT \p{Block=Bengali}) (113)
   \p{Bhaiksuki}          \p{Script_Extensions=Bhaiksuki} (Short:
                            \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
   \p{Bhks}               \p{Bhaiksuki} (= \p{Script_Extensions=
                            Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
                            (97)
   \p{Bidi_C}             \p{Bidi_Control} (= \p{Bidi_Control=Y})
                            (12)
   \p{Bidi_C: *}          \p{Bidi_Control: *}
   \p{Bidi_Class: AL}     \p{Bidi_Class=Arabic_Letter} (1769)
   \p{Bidi_Class: AN}     \p{Bidi_Class=Arabic_Number} (63)
   \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1769: U+0608,
                            U+060B, U+060D, U+061B..064A,
                            U+066D..066F, U+0671..06D5 ...)
   \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (63:
                            U+0600..0605, U+0660..0669,
                            U+066B..066C, U+06DD, U+0890..0891,
                            U+08E2 ...)
   \p{Bidi_Class: B}      \p{Bidi_Class=Paragraph_Separator} (7)
   \p{Bidi_Class: BN}     \p{Bidi_Class=Boundary_Neutral} (4016)
   \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016: [^\t\n
                            \cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-\xff],
                            U+180E, U+200B..200D, U+2060..2065,
                            U+206A..206F, U+FDD0..FDEF ...)
   \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15: [,.\/:
                            \xa0], U+060C, U+202F, U+2044, U+FE50,
                            U+FE52 ...)
   \p{Bidi_Class: CS}     \p{Bidi_Class=Common_Separator} (15)
   \p{Bidi_Class: EN}     \p{Bidi_Class=European_Number} (168)
   \p{Bidi_Class: ES}     \p{Bidi_Class=European_Separator} (12)
   \p{Bidi_Class: ET}     \p{Bidi_Class=European_Terminator} (92)
   \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (168: [0-9\xb2-
                            \xb3\xb9], U+06F0..06F9, U+2070,
                            U+2074..2079, U+2080..2089, U+2488..249B
                            ...)
   \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
                            U+207A..207B, U+208A..208B, U+2212,
                            U+FB29, U+FE62..FE63 ...)
   \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92: [#\$
                            \%\xa2-\xa5\xb0-\xb1], U+058F,
                            U+0609..060A, U+066A, U+09F2..09F3,
                            U+09FB ...)
   \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
                            U+2068)
   \p{Bidi_Class: FSI}    \p{Bidi_Class=First_Strong_Isolate} (1)
   \p{Bidi_Class: L}      \p{Bidi_Class=Left_To_Right} (1_096_272
                            plus all above-Unicode code points)
   \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_272 plus
                            all above-Unicode code points: [A-Za-z
                            \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                            \xff], U+0100..02B8, U+02BB..02C1,
                            U+02D0..02D1, U+02E0..02E4, U+02EE ...)
   \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
                            U+202A)
   \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
                            U+2066)
   \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
                            U+202D)
   \p{Bidi_Class: LRE}    \p{Bidi_Class=Left_To_Right_Embedding} (1)
   \p{Bidi_Class: LRI}    \p{Bidi_Class=Left_To_Right_Isolate} (1)
   \p{Bidi_Class: LRO}    \p{Bidi_Class=Left_To_Right_Override} (1)
   \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1993:
                            U+0300..036F, U+0483..0489,
                            U+0591..05BD, U+05BF, U+05C1..05C2,
                            U+05C4..05C5 ...)
   \p{Bidi_Class: NSM}    \p{Bidi_Class=Nonspacing_Mark} (1993)
   \p{Bidi_Class: ON}     \p{Bidi_Class=Other_Neutral} (6029)
   \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (6029: [!\"&\'
                            \(\)*;<=>?\@\[\\\]\^_`\{\|\}~\xa1\xa6-
                            \xa9\xab-\xac\xae-\xaf\xb4\xb6-\xb8\xbb-
                            \xbf\xd7\xf7], U+02B9..02BA,
                            U+02C2..02CF, U+02D2..02DF,
                            U+02E5..02ED, U+02EF..02FF ...)
   \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7: [\n\r
                            \x1c-\x1e\x85], U+2029)
   \p{Bidi_Class: PDF}    \p{Bidi_Class=Pop_Directional_Format} (1)
   \p{Bidi_Class: PDI}    \p{Bidi_Class=Pop_Directional_Isolate} (1)
   \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
                            U+202C)
   \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
                            U+2069)
   \p{Bidi_Class: R}      \p{Bidi_Class=Right_To_Left} (3647)
   \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3647: U+0590,
                            U+05BE, U+05C0, U+05C3, U+05C6,
                            U+05C8..05FF ...)
   \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
                            U+202B)
   \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
                            U+2067)
   \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
                            U+202E)
   \p{Bidi_Class: RLE}    \p{Bidi_Class=Right_To_Left_Embedding} (1)
   \p{Bidi_Class: RLI}    \p{Bidi_Class=Right_To_Left_Isolate} (1)
   \p{Bidi_Class: RLO}    \p{Bidi_Class=Right_To_Left_Override} (1)
   \p{Bidi_Class: S}      \p{Bidi_Class=Segment_Separator} (3)
   \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3: [\t\cK
                            \x1f])
   \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
                            U+1680, U+2000..200A, U+2028, U+205F,
                            U+3000)
   \p{Bidi_Class: WS}     \p{Bidi_Class=White_Space} (17)
   \p{Bidi_Control}       \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
   \p{Bidi_Control: N*}   (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
                            plus all above-Unicode code points:
                            U+0000..061B, U+061D..200D,
                            U+2010..2029, U+202F..2065,
                            U+206A..infinity)
   \p{Bidi_Control: Y*}   (Short: \p{BidiC=Y}, \p{BidiC}) (12:
                            U+061C, U+200E..200F, U+202A..202E,
                            U+2066..2069)
   \p{Bidi_M}             \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
                            (553)
   \p{Bidi_M: *}          \p{Bidi_Mirrored: *}
   \p{Bidi_Mirrored}      \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
                            (553)
   \p{Bidi_Mirrored: N*}  (Short: \p{BidiM=N}, \P{BidiM}) (1_113_559
                            plus all above-Unicode code points:
                            [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
                            Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
                            \xff], U+0100..0F39, U+0F3E..169A,
                            U+169D..2038, U+203B..2044, U+2047..207C
                            ...)
   \p{Bidi_Mirrored: Y*}  (Short: \p{BidiM=Y}, \p{BidiM}) (553:
                            [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
                            U+169B..169C, U+2039..203A,
                            U+2045..2046, U+207D..207E ...)
   \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
                            (64)
   \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (64: [\)\]
                            \}], U+0F3B, U+0F3D, U+169C, U+2046,
                            U+207E ...)
  \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
                          (1_113_984 plus all above-Unicode code
                          points)
  \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_984
                          plus all above-Unicode code points:
                          [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=>?
                          \@A-Z\\\^_`a-z\|~\x7f-\xff],
                          U+0100..0F39, U+0F3E..169A,
                          U+169D..2044, U+2047..207C, U+207F..208C
                          ...)
  \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
                          (64)
  \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (64:
                          [\(\[\{], U+0F3A, U+0F3C, U+169B,
                          U+2045, U+207D ...)
  \p{Blank}               \p{XPosixBlank} (18)
  \p{Blk: *}              \p{Block: *}
  \p{Block: Adlam}        (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
                          U+1E900..1E95F)
  \p{Block: Aegean_Numbers} (64: U+10100..1013F)
  \p{Block: Ahom}         (NOT \p{Ahom} NOR \p{Is_Ahom}) (80:
                          U+11700..1174F)
  \p{Block: Alchemical}   \p{Block=Alchemical_Symbols} (128)
  \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
                          U+1F700..1F77F)
  \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
                          (80)
  \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
                          AlphabeticPF}) (80: U+FB00..FB4F)
  \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
                          NOR \p{Is_Anatolian_Hieroglyphs}) (640:
                          U+14400..1467F)
  \p{Block: Ancient_Greek_Music} \p{Block=
                          Ancient_Greek_Musical_Notation} (80)
  \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
                          AncientGreekMusic}) (80: U+1D200..1D24F)
  \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
  \p{Block: Ancient_Symbols} (64: U+10190..101CF)
  \p{Block: Arabic}       (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
                          U+0600..06FF)
  \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
  \p{Block: Arabic_Ext_B} \p{Block=Arabic_Extended_B} (48)
  \p{Block: Arabic_Ext_C} \p{Block=Arabic_Extended_C} (64)
  \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
                          U+08A0..08FF)
  \p{Block: Arabic_Extended_B} (Short: \p{Blk=ArabicExtB}) (48:
                          U+0870..089F)
  \p{Block: Arabic_Extended_C} (Short: \p{Blk=ArabicExtC}) (64:
                          U+10EC0..10EFF)
  \p{Block: Arabic_Math}  \p{Block=
                          Arabic_Mathematical_Alphabetic_Symbols}
                          (256)
  \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
                          ArabicMath}) (256: U+1EE00..1EEFF)
  \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
  \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
  \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
                          (688: U+FB50..FDFF)
  \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
                          (144: U+FE70..FEFF)
  \p{Block: Arabic_Sup}   \p{Block=Arabic_Supplement} (48)
  \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
                          U+0750..077F)
  \p{Block: Armenian}     (NOT \p{Armenian} NOR \p{Is_Armenian})
                          (96: U+0530..058F)
  \p{Block: Arrows}       (112: U+2190..21FF)
  \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
  \p{Block: Avestan}      (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
                          U+10B00..10B3F)
  \p{Block: Balinese}     (NOT \p{Balinese} NOR \p{Is_Balinese})
                          (128: U+1B00..1B7F)
  \p{Block: Bamum}        (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
                          U+A6A0..A6FF)
  \p{Block: Bamum_Sup}    \p{Block=Bamum_Supplement} (576)
  \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
                          U+16800..16A3F)
  \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
  \p{Block: Bassa_Vah}    (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
                          (48: U+16AD0..16AFF)
  \p{Block: Batak}        (NOT \p{Batak} NOR \p{Is_Batak}) (64:
                          U+1BC0..1BFF)
  \p{Block: Bengali}      (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
                          U+0980..09FF)
  \p{Block: Bhaiksuki}    (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
                          (112: U+11C00..11C6F)
  \p{Block: Block_Elements} (32: U+2580..259F)
  \p{Block: Bopomofo}     (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
                          (48: U+3100..312F)
  \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
  \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
                          U+31A0..31BF)
  \p{Block: Box_Drawing}  (128: U+2500..257F)
  \p{Block: Brahmi}       (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
                          U+11000..1107F)
  \p{Block: Braille}      \p{Block=Braille_Patterns} (256)
  \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
                          U+2800..28FF)
  \p{Block: Buginese}     (NOT \p{Buginese} NOR \p{Is_Buginese})
                          (32: U+1A00..1A1F)
  \p{Block: Buhid}        (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
                          U+1740..175F)
  \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
                          (256)
  \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
                          ByzantineMusic}) (256: U+1D000..1D0FF)
  \p{Block: Canadian_Syllabics} \p{Block=
                          Unified_Canadian_Aboriginal_Syllabics}
                          (640)
  \p{Block: Carian}       (NOT \p{Carian} NOR \p{Is_Carian}) (64:
                          U+102A0..102DF)
  \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
                          \p{Is_Caucasian_Albanian}) (64:
                          U+10530..1056F)
  \p{Block: Chakma}       (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
                          U+11100..1114F)
  \p{Block: Cham}         (NOT \p{Cham} NOR \p{Is_Cham}) (96:
                          U+AA00..AA5F)
  \p{Block: Cherokee}     (NOT \p{Cherokee} NOR \p{Is_Cherokee})
                          (96: U+13A0..13FF)
  \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
  \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
                          U+AB70..ABBF)
  \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
  \p{Block: Chorasmian}   (NOT \p{Chorasmian} NOR \p{Is_Chorasmian})
                          (48: U+10FB0..10FDF)
  \p{Block: CJK}          \p{Block=CJK_Unified_Ideographs} (20_992)
  \p{Block: CJK_Compat}   \p{Block=CJK_Compatibility} (256)
  \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
  \p{Block: CJK_Compat_Ideographs} \p{Block=
                          CJK_Compatibility_Ideographs} (512)
  \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
                          CJK_Compatibility_Ideographs_Supplement}
                          (544)
  \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
                          U+3300..33FF)
  \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
                          (32: U+FE30..FE4F)
  \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
                          CJKCompatIdeographs}) (512: U+F900..FAFF)
  \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
                          CJKCompatIdeographsSup}) (544:
                          U+2F800..2FA1F)
  \p{Block: CJK_Ext_A}    \p{Block=
                          CJK_Unified_Ideographs_Extension_A}
                          (6592)
  \p{Block: CJK_Ext_B}    \p{Block=
                          CJK_Unified_Ideographs_Extension_B}
                          (42_720)
  \p{Block: CJK_Ext_C}    \p{Block=
                          CJK_Unified_Ideographs_Extension_C}
                          (4160)
  \p{Block: CJK_Ext_D}    \p{Block=
                          CJK_Unified_Ideographs_Extension_D} (224)
  \p{Block: CJK_Ext_E}    \p{Block=
                          CJK_Unified_Ideographs_Extension_E}
                          (5776)
  \p{Block: CJK_Ext_F}    \p{Block=
                          CJK_Unified_Ideographs_Extension_F}
                          (7488)
  \p{Block: CJK_Ext_G}    \p{Block=
                          CJK_Unified_Ideographs_Extension_G}
                          (4944)
  \p{Block: CJK_Ext_H}    \p{Block=
                          CJK_Unified_Ideographs_Extension_H}
                          (4192)
  \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
  \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
                          (128: U+2E80..2EFF)
  \p{Block: CJK_Strokes}  (48: U+31C0..31EF)
  \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
  \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
                          (64: U+3000..303F)
  \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
                          U+4E00..9FFF)
  \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
                          CJKExtA}) (6592: U+3400..4DBF)
  \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
                          CJKExtB}) (42_720: U+20000..2A6DF)
  \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
                          CJKExtC}) (4160: U+2A700..2B73F)
  \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
                          CJKExtD}) (224: U+2B740..2B81F)
  \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
                          CJKExtE}) (5776: U+2B820..2CEAF)
  \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
                          CJKExtF}) (7488: U+2CEB0..2EBEF)
  \p{Block: CJK_Unified_Ideographs_Extension_G} (Short: \p{Blk=
                          CJKExtG}) (4944: U+30000..3134F)
  \p{Block: CJK_Unified_Ideographs_Extension_H} (Short: \p{Blk=
                          CJKExtH}) (4192: U+31350..323AF)
  \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
                          Diacriticals}) (112: U+0300..036F)
  \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
                          DiacriticalsExt}) (80: U+1AB0..1AFF)
  \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
                          DiacriticalsForSymbols}) (48:
                          U+20D0..20FF)
  \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
                          DiacriticalsSup}) (64: U+1DC0..1DFF)
  \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
                          U+FE20..FE2F)
  \p{Block: Combining_Marks_For_Symbols} \p{Block=
                          Combining_Diacritical_Marks_For_Symbols}
                          (48)
  \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
                          IndicNumberForms}) (16: U+A830..A83F)
  \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
  \p{Block: Control_Pictures} (64: U+2400..243F)
  \p{Block: Coptic}       (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
                          U+2C80..2CFF)
  \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
  \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
  \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
                          U+1D360..1D37F)
  \p{Block: Cuneiform}    (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
                          (1024: U+12000..123FF)
  \p{Block: Cuneiform_Numbers} \p{Block=
                          Cuneiform_Numbers_And_Punctuation} (128)
  \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
                          CuneiformNumbers}) (128: U+12400..1247F)
  \p{Block: Currency_Symbols} (48: U+20A0..20CF)
  \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
  \p{Block: Cypro_Minoan} (NOT \p{Cypro_Minoan} NOR
                          \p{Is_Cypro_Minoan}) (112:
                          U+12F90..12FFF)
  \p{Block: Cyrillic}     (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
                          (256: U+0400..04FF)
  \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
  \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
  \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
  \p{Block: Cyrillic_Ext_D} \p{Block=Cyrillic_Extended_D} (96)
  \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
                          U+2DE0..2DFF)
  \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
                          U+A640..A69F)
  \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
                          U+1C80..1C8F)
  \p{Block: Cyrillic_Extended_D} (Short: \p{Blk=CyrillicExtD}) (96:
                          U+1E030..1E08F)
  \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
  \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
                          U+0500..052F)
  \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
                          (48)
  \p{Block: Deseret}      (80: U+10400..1044F)
  \p{Block: Devanagari}   (NOT \p{Devanagari} NOR \p{Is_Devanagari})
                          (128: U+0900..097F)
  \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
  \p{Block: Devanagari_Ext_A} \p{Block=Devanagari_Extended_A} (96)
  \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
                          U+A8E0..A8FF)
  \p{Block: Devanagari_Extended_A} (Short: \p{Blk=DevanagariExtA})
                          (96: U+11B00..11B5F)
  \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
  \p{Block: Diacriticals_Ext} \p{Block=
                          Combining_Diacritical_Marks_Extended}
                          (80)
  \p{Block: Diacriticals_For_Symbols} \p{Block=
                          Combining_Diacritical_Marks_For_Symbols}
                          (48)
  \p{Block: Diacriticals_Sup} \p{Block=
                          Combining_Diacritical_Marks_Supplement}
                          (64)
  \p{Block: Dingbats}     (192: U+2700..27BF)
  \p{Block: Dives_Akuru}  (NOT \p{Dives_Akuru} NOR
                          \p{Is_Dives_Akuru}) (96: U+11900..1195F)
  \p{Block: Dogra}        (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
                          U+11800..1184F)
  \p{Block: Domino}       \p{Block=Domino_Tiles} (112)
  \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
                          U+1F030..1F09F)
  \p{Block: Duployan}     (NOT \p{Duployan} NOR \p{Is_Duployan})
                          (160: U+1BC00..1BC9F)
  \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
  \p{Block: Egyptian_Hieroglyph_Format_Controls} (48: U+13430..1345F)
  \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
                          \p{Is_Egyptian_Hieroglyphs}) (1072:
                          U+13000..1342F)
  \p{Block: Elbasan}      (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
                          U+10500..1052F)
  \p{Block: Elymaic}      (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
                          U+10FE0..10FFF)
  \p{Block: Emoticons}    (80: U+1F600..1F64F)
  \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
  \p{Block: Enclosed_Alphanum_Sup} \p{Block=
                          Enclosed_Alphanumeric_Supplement} (256)
  \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
                          EnclosedAlphanumSup}) (256:
                          U+1F100..1F1FF)
  \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
                          EnclosedAlphanum}) (160: U+2460..24FF)
  \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
                          (256)
  \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
                          EnclosedCJK}) (256: U+3200..32FF)
  \p{Block: Enclosed_Ideographic_Sup} \p{Block=
                          Enclosed_Ideographic_Supplement} (256)
  \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
                          EnclosedIdeographicSup}) (256:
                          U+1F200..1F2FF)
  \p{Block: Ethiopic}     (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
                          (384: U+1200..137F)
  \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
  \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
  \p{Block: Ethiopic_Ext_B} \p{Block=Ethiopic_Extended_B} (32)
  \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
                          U+2D80..2DDF)
  \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
                          U+AB00..AB2F)
  \p{Block: Ethiopic_Extended_B} (Short: \p{Blk=EthiopicExtB}) (32:
                          U+1E7E0..1E7FF)
  \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
  \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
                          U+1380..139F)
  \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
                          \p{Punct} NOR \p{Is_Punctuation}) (112:
                          U+2000..206F)
  \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
  \p{Block: Geometric_Shapes_Ext} \p{Block=
                          Geometric_Shapes_Extended} (128)
  \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
                          GeometricShapesExt}) (128:
                          U+1F780..1F7FF)
  \p{Block: Georgian}     (NOT \p{Georgian} NOR \p{Is_Georgian})
                          (96: U+10A0..10FF)
  \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
  \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
                          U+1C90..1CBF)
  \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
  \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
                          U+2D00..2D2F)
  \p{Block: Glagolitic}   (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
                          (96: U+2C00..2C5F)
  \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
  \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
                          (48: U+1E000..1E02F)
  \p{Block: Gothic}       (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
                          U+10330..1034F)
  \p{Block: Grantha}      (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
                          U+11300..1137F)
  \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
                          NOR \p{Is_Greek}) (144)
  \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
                          NOR \p{Is_Greek}) (144: U+0370..03FF)
  \p{Block: Greek_Ext}    \p{Block=Greek_Extended} (256)
  \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
                          U+1F00..1FFF)
  \p{Block: Gujarati}     (NOT \p{Gujarati} NOR \p{Is_Gujarati})
                          (128: U+0A80..0AFF)
  \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
                          \p{Is_Gunjala_Gondi}) (80:
                          U+11D60..11DAF)
  \p{Block: Gurmukhi}     (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
                          (128: U+0A00..0A7F)
  \p{Block: Half_And_Full_Forms} \p{Block=
                          Halfwidth_And_Fullwidth_Forms} (240)
  \p{Block: Half_Marks}   \p{Block=Combining_Half_Marks} (16)
  \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
                          HalfAndFullForms}) (240: U+FF00..FFEF)
  \p{Block: Hangul}       \p{Block=Hangul_Syllables} (NOT \p{Hangul}
                          NOR \p{Is_Hangul}) (11_184)
  \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
                          (96: U+3130..318F)
  \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
  \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
                          U+A960..A97F)
  \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
                          U+D7B0..D7FF)
  \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
                          NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
  \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
                          \p{Is_Hanifi_Rohingya}) (64:
                          U+10D00..10D3F)
  \p{Block: Hanunoo}      (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
                          U+1720..173F)
  \p{Block: Hatran}       (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
                          U+108E0..108FF)
  \p{Block: Hebrew}       (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
                          U+0590..05FF)
  \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
                          HighPUSurrogates}) (128: U+DB80..DBFF)
  \p{Block: High_PU_Surrogates} \p{Block=
                          High_Private_Use_Surrogates} (128)
  \p{Block: High_Surrogates} (896: U+D800..DB7F)
  \p{Block: Hiragana}     (NOT \p{Hiragana} NOR \p{Is_Hiragana})
                          (96: U+3040..309F)
  \p{Block: IDC}          \p{Block=
                          Ideographic_Description_Characters} (NOT
                          \p{ID_Continue} NOR \p{Is_IDC}) (16)
  \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
                          NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
                          U+2FF0..2FFF)
  \p{Block: Ideographic_Symbols} \p{Block=
                          Ideographic_Symbols_And_Punctuation} (32)
  \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
                          IdeographicSymbols}) (32: U+16FE0..16FFF)
  \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
                          \p{Is_Imperial_Aramaic}) (32:
                          U+10840..1085F)
  \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
                          (16)
  \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
  \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
                          NOR \p{Is_Inscriptional_Pahlavi}) (32:
                          U+10B60..10B7F)
  \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
                          NOR \p{Is_Inscriptional_Parthian}) (32:
                          U+10B40..10B5F)
  \p{Block: IPA_Ext}      \p{Block=IPA_Extensions} (96)
  \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
                          U+0250..02AF)
  \p{Block: Jamo}         \p{Block=Hangul_Jamo} (256)
  \p{Block: Jamo_Ext_A}   \p{Block=Hangul_Jamo_Extended_A} (32)
  \p{Block: Jamo_Ext_B}   \p{Block=Hangul_Jamo_Extended_B} (80)
  \p{Block: Javanese}     (NOT \p{Javanese} NOR \p{Is_Javanese})
                          (96: U+A980..A9DF)
  \p{Block: Kaithi}       (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
                          U+11080..110CF)
  \p{Block: Kaktovik_Numerals} (32: U+1D2C0..1D2DF)
  \p{Block: Kana_Ext_A}   \p{Block=Kana_Extended_A} (48)
  \p{Block: Kana_Ext_B}   \p{Block=Kana_Extended_B} (16)
  \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
                          U+1B100..1B12F)
  \p{Block: Kana_Extended_B} (Short: \p{Blk=KanaExtB}) (16:
                          U+1AFF0..1AFFF)
  \p{Block: Kana_Sup}     \p{Block=Kana_Supplement} (256)
  \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
                          U+1B000..1B0FF)
  \p{Block: Kanbun}       (16: U+3190..319F)
  \p{Block: Kangxi}       \p{Block=Kangxi_Radicals} (224)
  \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
                          U+2F00..2FDF)
  \p{Block: Kannada}      (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
                          U+0C80..0CFF)
  \p{Block: Katakana}     (NOT \p{Katakana} NOR \p{Is_Katakana})
                          (96: U+30A0..30FF)
  \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
  \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
                          KatakanaExt}) (16: U+31F0..31FF)
  \p{Block: Kawi}         (NOT \p{Kawi} NOR \p{Is_Kawi}) (96:
                          U+11F00..11F5F)
  \p{Block: Kayah_Li}     (48: U+A900..A92F)
  \p{Block: Kharoshthi}   (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
                          (96: U+10A00..10A5F)
  \p{Block: Khitan_Small_Script} (NOT \p{Khitan_Small_Script} NOR
                          \p{Is_Khitan_Small_Script}) (512:
                          U+18B00..18CFF)
  \p{Block: Khmer}        (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
                          U+1780..17FF)
  \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
  \p{Block: Khojki}       (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
                          U+11200..1124F)
  \p{Block: Khudawadi}    (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
                          (80: U+112B0..112FF)
  \p{Block: Lao}          (NOT \p{Lao} NOR \p{Is_Lao}) (128:
                          U+0E80..0EFF)
  \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
  \p{Block: Latin_1_Sup}  \p{Block=Latin_1_Supplement} (128)
  \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
                          \xff])
  \p{Block: Latin_Ext_A}  \p{Block=Latin_Extended_A} (128)
  \p{Block: Latin_Ext_Additional} \p{Block=
                          Latin_Extended_Additional} (256)
  \p{Block: Latin_Ext_B}  \p{Block=Latin_Extended_B} (208)
  \p{Block: Latin_Ext_C}  \p{Block=Latin_Extended_C} (32)
  \p{Block: Latin_Ext_D}  \p{Block=Latin_Extended_D} (224)
  \p{Block: Latin_Ext_E}  \p{Block=Latin_Extended_E} (64)
  \p{Block: Latin_Ext_F}  \p{Block=Latin_Extended_F} (64)
  \p{Block: Latin_Ext_G}  \p{Block=Latin_Extended_G} (256)
  \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
                          U+0100..017F)
  \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
                          LatinExtAdditional}) (256: U+1E00..1EFF)
  \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
                          U+0180..024F)
  \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
                          U+2C60..2C7F)
  \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
                          U+A720..A7FF)
  \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
                          U+AB30..AB6F)
  \p{Block: Latin_Extended_F} (Short: \p{Blk=LatinExtF}) (64:
                          U+10780..107BF)
  \p{Block: Latin_Extended_G} (Short: \p{Blk=LatinExtG}) (256:
                          U+1DF00..1DFFF)
  \p{Block: Lepcha}       (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
                          U+1C00..1C4F)
  \p{Block: Letterlike_Symbols} (80: U+2100..214F)
  \p{Block: Limbu}        (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
                          U+1900..194F)
  \p{Block: Linear_A}     (NOT \p{Linear_A} NOR \p{Is_Linear_A})
                          (384: U+10600..1077F)
  \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
  \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
  \p{Block: Lisu}         (NOT \p{Lisu} NOR \p{Is_Lisu}) (48:
                          U+A4D0..A4FF)
  \p{Block: Lisu_Sup}     \p{Block=Lisu_Supplement} (16)
  \p{Block: Lisu_Supplement} (Short: \p{Blk=LisuSup}) (16:
                          U+11FB0..11FBF)
  \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
  \p{Block: Lycian}       (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
                          U+10280..1029F)
  \p{Block: Lydian}       (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
                          U+10920..1093F)
  \p{Block: Mahajani}     (NOT \p{Mahajani} NOR \p{Is_Mahajani})
                          (48: U+11150..1117F)
  \p{Block: Mahjong}      \p{Block=Mahjong_Tiles} (48)
  \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
                          U+1F000..1F02F)
  \p{Block: Makasar}      (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
                          U+11EE0..11EFF)
  \p{Block: Malayalam}    (NOT \p{Malayalam} NOR \p{Is_Malayalam})
                          (128: U+0D00..0D7F)
  \p{Block: Mandaic}      (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
                          U+0840..085F)
  \p{Block: Manichaean}   (NOT \p{Manichaean} NOR \p{Is_Manichaean})
                          (64: U+10AC0..10AFF)
  \p{Block: Marchen}      (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
                          U+11C70..11CBF)
  \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
                          \p{Is_Masaram_Gondi}) (96:
                          U+11D00..11D5F)
  \p{Block: Math_Alphanum} \p{Block=
                          Mathematical_Alphanumeric_Symbols} (1024)
  \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
  \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
MathAlphanum}) (1024: U+1D400..1D7FF) 1237 \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators}) 1238 (256: U+2200..22FF) 1239 \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF) 1240 \p{Block: Medefaidrin} (NOT \p{Medefaidrin} NOR 1241 \p{Is_Medefaidrin}) (96: U+16E40..16E9F) 1242 \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR 1243 \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF) 1244 \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32) 1245 \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt}) 1246 (32: U+AAE0..AAFF) 1247 \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR 1248 \p{Is_Mende_Kikakui}) (224: 1249 U+1E800..1E8DF) 1250 \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR 1251 \p{Is_Meroitic_Cursive}) (96: 1252 U+109A0..109FF) 1253 \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F) 1254 \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160: 1255 U+16F00..16F9F) 1256 \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows} 1257 (256) 1258 \p{Block: Misc_Math_Symbols_A} \p{Block= 1259 Miscellaneous_Mathematical_Symbols_A} 1260 (48) 1261 \p{Block: Misc_Math_Symbols_B} \p{Block= 1262 Miscellaneous_Mathematical_Symbols_B} 1263 (128) 1264 \p{Block: Misc_Pictographs} \p{Block= 1265 Miscellaneous_Symbols_And_Pictographs} 1266 (768) 1267 \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256) 1268 \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256) 1269 \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk= 1270 MiscMathSymbolsA}) (48: U+27C0..27EF) 1271 \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk= 1272 MiscMathSymbolsB}) (128: U+2980..29FF) 1273 \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols}) 1274 (256: U+2600..26FF) 1275 \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk= 1276 MiscArrows}) (256: U+2B00..2BFF) 1277 \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk= 1278 MiscPictographs}) (768: U+1F300..1F5FF) 1279 \p{Block: 
Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical}) 1280 (256: U+2300..23FF) 1281 \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96: 1282 U+11600..1165F) 1283 \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80) 1284 \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F) 1285 \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian}) 1286 (176: U+1800..18AF) 1287 \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32) 1288 \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32: 1289 U+11660..1167F) 1290 \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48: 1291 U+16A40..16A6F) 1292 \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48: 1293 U+11280..112AF) 1294 \p{Block: Music} \p{Block=Musical_Symbols} (256) 1295 \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256: 1296 U+1D100..1D1FF) 1297 \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160: 1298 U+1000..109F) 1299 \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32) 1300 \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32) 1301 \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32: 1302 U+AA60..AA7F) 1303 \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32: 1304 U+A9E0..A9FF) 1305 \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean}) 1306 (48: U+10880..108AF) 1307 \p{Block: Nag_Mundari} (NOT \p{Nag_Mundari} NOR 1308 \p{Is_Nag_Mundari}) (48: U+1E4D0..1E4FF) 1309 \p{Block: Nandinagari} (NOT \p{Nandinagari} NOR 1310 \p{Is_Nandinagari}) (96: U+119A0..119FF) 1311 \p{Block: NB} \p{Block=No_Block} (820_944 plus all 1312 above-Unicode code points) 1313 \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR 1314 \p{Is_New_Tai_Lue}) (96: U+1980..19DF) 1315 \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128: 1316 U+11400..1147F) 1317 \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64: 1318 U+07C0..07FF) 1319 \p{Block: No_Block} (Short: \p{Blk=NB}) (820_944 plus all 1320 above-Unicode code points: U+2FE0..2FEF, 1321 U+10200..1027F, 
U+103E0..103FF, 1322 U+105C0..105FF, U+107C0..107FF, 1323 U+108B0..108DF ...) 1324 \p{Block: Number_Forms} (64: U+2150..218F) 1325 \p{Block: Nushu} (NOT \p{Nushu} NOR \p{Is_Nushu}) (400: 1326 U+1B170..1B2FF) 1327 \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong} 1328 NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80: 1329 U+1E100..1E14F) 1330 \p{Block: OCR} \p{Block=Optical_Character_Recognition} 1331 (32) 1332 \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32: 1333 U+1680..169F) 1334 \p{Block: Ol_Chiki} (48: U+1C50..1C7F) 1335 \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR 1336 \p{Is_Old_Hungarian}) (128: 1337 U+10C80..10CFF) 1338 \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic}) 1339 (48: U+10300..1032F) 1340 \p{Block: Old_North_Arabian} (32: U+10A80..10A9F) 1341 \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic}) 1342 (48: U+10350..1037F) 1343 \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR 1344 \p{Is_Old_Persian}) (64: U+103A0..103DF) 1345 \p{Block: Old_Sogdian} (NOT \p{Old_Sogdian} NOR 1346 \p{Is_Old_Sogdian}) (48: U+10F00..10F2F) 1347 \p{Block: Old_South_Arabian} (32: U+10A60..10A7F) 1348 \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic}) 1349 (80: U+10C00..10C4F) 1350 \p{Block: Old_Uyghur} (NOT \p{Old_Uyghur} NOR \p{Is_Old_Uyghur}) 1351 (64: U+10F70..10FAF) 1352 \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32: 1353 U+2440..245F) 1354 \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128: 1355 U+0B00..0B7F) 1356 \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F) 1357 \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80: 1358 U+104B0..104FF) 1359 \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48: 1360 U+10480..104AF) 1361 \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F) 1362 \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR 1363 \p{Is_Pahawh_Hmong}) (144: 1364 U+16B00..16B8F) 1365 \p{Block: Palmyrene} (32: U+10860..1087F) 1366 \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} 
NOR 1367 \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF) 1368 \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa}) 1369 (64: U+A840..A87F) 1370 \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48) 1371 \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48: 1372 U+101D0..101FF) 1373 \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician}) 1374 (32: U+10900..1091F) 1375 \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128) 1376 \p{Block: Phonetic_Ext_Sup} \p{Block= 1377 Phonetic_Extensions_Supplement} (64) 1378 \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128: 1379 U+1D00..1D7F) 1380 \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk= 1381 PhoneticExtSup}) (64: U+1D80..1DBF) 1382 \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF) 1383 \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT 1384 \p{Private_Use} NOR \p{Is_Private_Use}) 1385 (6400) 1386 \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT 1387 \p{Private_Use} NOR \p{Is_Private_Use}) 1388 (6400: U+E000..F8FF) 1389 \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR 1390 \p{Is_Psalter_Pahlavi}) (48: 1391 U+10B80..10BAF) 1392 \p{Block: PUA} \p{Block=Private_Use_Area} (NOT 1393 \p{Private_Use} NOR \p{Is_Private_Use}) 1394 (6400) 1395 \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT 1396 \p{Punct} NOR \p{Is_Punctuation}) (112) 1397 \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48: 1398 U+A930..A95F) 1399 \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32) 1400 \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32: 1401 U+10E60..10E7F) 1402 \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96: 1403 U+16A0..16FF) 1404 \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan}) 1405 (64: U+0800..083F) 1406 \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra}) 1407 (96: U+A880..A8DF) 1408 \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96: 1409 U+11180..111DF) 1410 \p{Block: Shavian} (48: U+10450..1047F) 1411 \p{Block: 
Shorthand_Format_Controls} (16: U+1BCA0..1BCAF) 1412 \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128: 1413 U+11580..115FF) 1414 \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128: 1415 U+0D80..0DFF) 1416 \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF) 1417 \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32: 1418 U+FE50..FE6F) 1419 \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32) 1420 \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64) 1421 \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64: 1422 U+1B130..1B16F) 1423 \p{Block: Sogdian} (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64: 1424 U+10F30..10F6F) 1425 \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR 1426 \p{Is_Sora_Sompeng}) (48: U+110D0..110FF) 1427 \p{Block: Soyombo} (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96: 1428 U+11A50..11AAF) 1429 \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk= 1430 ModifierLetters}) (80: U+02B0..02FF) 1431 \p{Block: Specials} (16: U+FFF0..FFFF) 1432 \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese}) 1433 (64: U+1B80..1BBF) 1434 \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16) 1435 \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16: 1436 U+1CC0..1CCF) 1437 \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16) 1438 \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128) 1439 \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256) 1440 \p{Block: Sup_Math_Operators} \p{Block= 1441 Supplemental_Mathematical_Operators} 1442 (256) 1443 \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A} 1444 (65_536) 1445 \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B} 1446 (65_536) 1447 \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128) 1448 \p{Block: Sup_Symbols_And_Pictographs} \p{Block= 1449 Supplemental_Symbols_And_Pictographs} 1450 (256) 1451 \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48) 1452 \p{Block: 
Superscripts_And_Subscripts} (Short: \p{Blk= 1453 SuperAndSub}) (48: U+2070..209F) 1454 \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16: 1455 U+27F0..27FF) 1456 \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128: 1457 U+2900..297F) 1458 \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256: 1459 U+1F800..1F8FF) 1460 \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk= 1461 SupMathOperators}) (256: U+2A00..2AFF) 1462 \p{Block: Supplemental_Punctuation} (Short: \p{Blk= 1463 SupPunctuation}) (128: U+2E00..2E7F) 1464 \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk= 1465 SupSymbolsAndPictographs}) (256: 1466 U+1F900..1F9FF) 1467 \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk= 1468 SupPUAA}) (65_536: U+F0000..FFFFF) 1469 \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk= 1470 SupPUAB}) (65_536: U+100000..10FFFF) 1471 \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF) 1472 \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR 1473 \p{Is_Syloti_Nagri}) (48: U+A800..A82F) 1474 \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block= 1475 Symbols_And_Pictographs_Extended_A} (144) 1476 \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk= 1477 SymbolsAndPictographsExtA}) (144: 1478 U+1FA70..1FAFF) 1479 \p{Block: Symbols_For_Legacy_Computing} (256: U+1FB00..1FBFF) 1480 \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80: 1481 U+0700..074F) 1482 \p{Block: Syriac_Sup} \p{Block=Syriac_Supplement} (16) 1483 \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16: 1484 U+0860..086F) 1485 \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32: 1486 U+1700..171F) 1487 \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa}) 1488 (32: U+1760..177F) 1489 \p{Block: Tags} (128: U+E0000..E007F) 1490 \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48: 1491 U+1950..197F) 1492 \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham}) 1493 (144: U+1A20..1AAF) 1494 \p{Block: 
Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
        (96: U+AA80..AADF)
  \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
  \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
        U+1D300..1D35F)
  \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80:
        U+11680..116CF)
  \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
        U+0B80..0BFF)
  \p{Block: Tamil_Sup} \p{Block=Tamil_Supplement} (64)
  \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
        U+11FC0..11FFF)
  \p{Block: Tangsa} (NOT \p{Tangsa} NOR \p{Is_Tangsa}) (96:
        U+16A70..16ACF)
  \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
        U+17000..187FF)
  \p{Block: Tangut_Components} (768: U+18800..18AFF)
  \p{Block: Tangut_Sup} \p{Block=Tangut_Supplement} (128)
  \p{Block: Tangut_Supplement} (Short: \p{Blk=TangutSup}) (128:
        U+18D00..18D7F)
  \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
        U+0C00..0C7F)
  \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
        U+0780..07BF)
  \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128:
        U+0E00..0E7F)
  \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
        U+0F00..0FFF)
  \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
        (80: U+2D30..2D7F)
  \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
        U+11480..114DF)
  \p{Block: Toto} (NOT \p{Toto} NOR \p{Is_Toto}) (48:
        U+1E290..1E2BF)
  \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
        (128)
  \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
        TransportAndMap}) (128: U+1F680..1F6FF)
  \p{Block: UCAS} \p{Block=
        Unified_Canadian_Aboriginal_Syllabics}
        (640)
  \p{Block: UCAS_Ext} \p{Block=
        Unified_Canadian_Aboriginal_Syllabics_-
        Extended} (80)
  \p{Block: UCAS_Ext_A} \p{Block=
        Unified_Canadian_Aboriginal_Syllabics_-
        Extended_A} (16)
  \p{Block: Ugaritic} (NOT 
\p{Ugaritic} NOR \p{Is_Ugaritic})
        (32: U+10380..1039F)
  \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
        UCAS}) (640: U+1400..167F)
  \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
        \p{Blk=UCASExt}) (80: U+18B0..18FF)
  \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended_A}
        (Short: \p{Blk=UCASExtA}) (16:
        U+11AB0..11ABF)
  \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320:
        U+A500..A63F)
  \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
        \p{Variation_Selector} NOR \p{Is_VS})
        (16: U+FE00..FE0F)
  \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
        (240: U+E0100..E01EF)
  \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
  \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
        U+1CD0..1CFF)
  \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
  \p{Block: Vithkuqi} (NOT \p{Vithkuqi} NOR \p{Is_Vithkuqi})
        (80: U+10570..105BF)
  \p{Block: VS} \p{Block=Variation_Selectors} (NOT
        \p{Variation_Selector} NOR \p{Is_VS})
        (16)
  \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
        (240)
  \p{Block: Wancho} (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
        U+1E2C0..1E2FF)
  \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
        \p{Is_Warang_Citi}) (96: U+118A0..118FF)
  \p{Block: Yezidi} (NOT \p{Yezidi} NOR \p{Is_Yezidi}) (64:
        U+10E80..10EBF)
  \p{Block: Yi_Radicals} (64: U+A490..A4CF)
  \p{Block: Yi_Syllables} (1168: U+A000..A48F)
  \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
  \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
        U+4DC0..4DFF)
  \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
        \p{Is_Zanabazar_Square}) (80:
        U+11A00..11A4F)
  \p{Block: Znamenny_Music} \p{Block=Znamenny_Musical_Notation} (208)
  \p{Block: Znamenny_Musical_Notation} (Short: \p{Blk=
        ZnamennyMusic}) (208: U+1CF00..1CFCF)
X \p{Block_Elements} 
\p{Block=Block_Elements} (32)
  \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions=
        Bopomofo}) (NOT \p{Block=Bopomofo}) (117)
  \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short:
        \p{Bopo}; NOT \p{Block=Bopomofo}) (117)
X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
        Bopomofo_Extended}) (32)
X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
        \p{InBopomofoExt}) (32)
X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
  \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
  \p{Brah} \p{Brahmi} (= \p{Script_Extensions=
        Brahmi}) (NOT \p{Block=Brahmi}) (115)
  \p{Brahmi} \p{Script_Extensions=Brahmi} (Short:
        \p{Brah}; NOT \p{Block=Brahmi}) (115)
  \p{Brai} \p{Braille} (= \p{Script_Extensions=
        Braille}) (256)
  \p{Braille} \p{Script_Extensions=Braille} (Short:
        \p{Brai}) (256)
X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
        \p{InBraille}) (256)
  \p{Bugi} \p{Buginese} (= \p{Script_Extensions=
        Buginese}) (NOT \p{Block=Buginese}) (31)
  \p{Buginese} \p{Script_Extensions=Buginese} (Short:
        \p{Bugi}; NOT \p{Block=Buginese}) (31)
  \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid})
        (NOT \p{Block=Buhid}) (22)
  \p{Buhid} \p{Script_Extensions=Buhid} (Short:
        \p{Buhd}; NOT \p{Block=Buhid}) (22)
X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
        Byzantine_Musical_Symbols}) (256)
X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
        (Short: \p{InByzantineMusic}) (256)
  \p{C} \pC \p{Other} (= \p{General_Category=Other})
        (965_096 plus all above-Unicode code
        points)
  \p{Cakm} \p{Chakma} (= \p{Script_Extensions=
        Chakma}) (NOT \p{Block=Chakma}) (91)
  \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
        (Short: \p{Cans}) (726)
X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
        (= \p{Block=
        Unified_Canadian_Aboriginal_Syllabics})
        (640)
T 
\p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
        Not_Reordered} (1_113_190 plus all
        above-Unicode code points)
T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
        Overlay} (32)
T \p{Canonical_Combining_Class: 6} \p{Canonical_Combining_Class=
        Han_Reading} (2)
T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
        Nukta} (27)
T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
        Kana_Voicing} (2)
T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
        Virama} (65)
T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
        CCC10} (1)
  \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
        U+05B0)
T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
        CCC11} (1)
  \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
        U+05B1)
T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
        CCC12} (1)
  \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
        U+05B2)
T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
        CCC13} (1)
  \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
        U+05B3)
T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
        CCC14} (1)
  \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
        U+05B4)
T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
        CCC15} (1)
  \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
        U+05B5)
T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
        CCC16} (1)
  \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
        U+05B6)
T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
        CCC17} (1)
  \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
        U+05B7)
T \p{Canonical_Combining_Class: 18} 
\p{Canonical_Combining_Class=
        CCC18} (2)
  \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
        U+05B8, U+05C7)
T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
        CCC19} (2)
  \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
        U+05B9..05BA)
T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
        CCC20} (1)
  \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
        U+05BB)
T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
        CCC21} (1)
  \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
        U+05BC)
T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
        CCC22} (1)
  \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
        U+05BD)
T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
        CCC23} (1)
  \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
        U+05BF)
T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
        CCC24} (1)
  \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
        U+05C1)
T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
        CCC25} (1)
  \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
        U+05C2)
T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
        CCC26} (1)
  \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
        U+FB1E)
T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
        CCC27} (2)
  \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
        U+064B, U+08F0)
T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
        CCC28} (2)
  \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
        U+064C, U+08F1)
T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
        CCC29} (2)
  \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) 
(2:
        U+064D, U+08F2)
T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
        CCC30} (2)
  \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
        U+0618, U+064E)
T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
        CCC31} (2)
  \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
        U+0619, U+064F)
T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
        CCC32} (2)
  \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
        U+061A, U+0650)
T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
        CCC33} (1)
  \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
        U+0651)
T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
        CCC34} (1)
  \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
        U+0652)
T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
        CCC35} (1)
  \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
        U+0670)
T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
        CCC36} (1)
  \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
        U+0711)
T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
        CCC84} (1)
  \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
        U+0C55)
T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
        CCC91} (1)
  \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
        U+0C56)
T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
        CCC103} (2)
  \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
        U+0E38..0E39)
T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
        CCC107} (4)
  \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
        U+0E48..0E4B)
T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
        CCC118} (2)
  \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
        U+0EB8..0EB9)
T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
        CCC122} (4)
  \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
        U+0EC8..0ECB)
T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
        CCC129} (1)
  \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
        U+0F71)
T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
        CCC130} (6)
  \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
        U+0F72, U+0F7A..0F7D, U+0F80)
T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
        CCC132} (1)
  \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
        U+0F74)
T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
        CCC133} (0)
  \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
        Attached_Below_Left} (0)
T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
        Attached_Below} (5)
T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
        Attached_Above} (1)
T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
        Attached_Above_Right} (9)
T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
        Below_Left} (2)
T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
        Below} (181)
T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
        Below_Right} (4)
T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
        Left} (2)
T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
        Right} (1)
T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
        Above_Left} (5)
T \p{Canonical_Combining_Class: 230} 
\p{Canonical_Combining_Class=
        Above} (510)
T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
        Above_Right} (7)
T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
        Double_Below} (4)
T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
        Double_Above} (5)
T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
        Iota_Subscript} (1)
  \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
        Above} (510)
  \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (510:
        U+0300..0314, U+033D..0344, U+0346,
        U+034A..034C, U+0350..0352, U+0357 ...)
  \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
        U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
  \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (7:
        U+0315, U+031A, U+0358, U+1DF6, U+302C,
        U+1E4EC..1E4ED)
  \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
        Above_Left} (5)
  \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
        Above_Right} (7)
  \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
        Attached_Above} (1)
  \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
        Attached_Above_Right} (9)
  \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
        Attached_Below} (5)
  \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
        Attached_Below_Left} (0)
  \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
        (1: U+1DCE)
  \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
        \p{Ccc=ATAR}) (9: U+031B, U+0F39,
        U+1D165..1D166, U+1D16E..1D172)
  \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
        (5: U+0321..0322, U+0327..0328, U+1DD0)
  \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
        ATBL}) (0)
  \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
        Below} (181)
  \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (181:
        U+0316..0319, U+031C..0320,
        U+0323..0326, U+0329..0333,
        U+0339..033C, U+0347..0349 ...)
  \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (2:
        U+1DFA, U+302A)
  \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
        U+059A, U+05AD, U+1939, U+302D)
  \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
        Below_Left} (2)
  \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
        Below_Right} (4)
  \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
        Double_Above} (5)
  \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
        Double_Below} (4)
  \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
        (5: U+035D..035E, U+0360..0361, U+1DCD)
  \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
        (4: U+035C, U+035F, U+0362, U+1DFC)
  \p{Canonical_Combining_Class: Han_Reading} (Short: \p{Ccc=HANR})
        (2: U+16FF0..16FF1)
  \p{Canonical_Combining_Class: HANR} \p{Canonical_Combining_Class=
        Han_Reading} (2)
  \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
        (1: U+0345)
  \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
        Iota_Subscript} (1)
  \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
        (2: U+3099..309A)
  \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
        Kana_Voicing} (2)
  \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
        Left} (2)
  \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
        U+302E..302F)
  \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
        Nukta} (27)
  \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
        (1_113_190 plus all above-Unicode code
        points: U+0000..02FF, U+034F,
        U+0370..0482, U+0488..0590, U+05BE,
        U+05C0 ...)
  \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
        Not_Reordered} (1_113_190 plus all
        above-Unicode code points)
  \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (27:
        U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
        U+0C3C ...)
  \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
        Overlay} (32)
  \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
        U+0334..0338, U+1CD4, U+1CE2..1CE8,
        U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
        ...)
  \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
        Right} (1)
  \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
        U+1D16D)
  \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (65:
        U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
        U+0BCD ...)
  \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
        Virama} (65)
  \p{Cans} \p{Canadian_Aboriginal} (=
        \p{Script_Extensions=
        Canadian_Aboriginal}) (726)
  \p{Cari} \p{Carian} (= \p{Script_Extensions=
        Carian}) (NOT \p{Block=Carian}) (49)
  \p{Carian} \p{Script_Extensions=Carian} (Short:
        \p{Cari}; NOT \p{Block=Carian}) (49)
  \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2707)
  \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_405 plus
        all above-Unicode code points: [\x00-
        \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-Z
        \[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-\xac\xae
        \xb0-\xb3\xb5-\xb6\xb9-\xff],
        U+0100..02AF, U+0370..0373,
        U+0376..0379, U+037B..0383, U+0386 ...)
  \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2707: [\'.:\^`
        \xa8\xad\xaf\xb4\xb7-\xb8],
        U+02B0..036F, U+0374..0375, U+037A,
        U+0384..0385, U+0387 ...)
  \p{Cased} \p{Cased=Y} (4526)
  \p{Cased: N*} (Single: \P{Cased}) (1_109_586 plus all
        above-Unicode code points: [\x00-\x20!
        \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
        \^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9
        \xbb-\xbf\xd7\xf7], U+01BB,
        U+01C0..01C3, U+0294, U+02B9..02BF,
        U+02C2..02DF ...)
  \p{Cased: Y*} (Single: \p{Cased}) (4526: [A-Za-z\xaa
        \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
        U+0100..01BA, U+01BC..01BF,
        U+01C4..0293, U+0295..02B8, U+02C0..02C1
        ...)
  \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
        \p{LC}) (4095)
  \p{Category: *} \p{General_Category: *}
  \p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian}
        (Short: \p{Aghb}; NOT \p{Block=
        Caucasian_Albanian}) (53)
  \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
        Control}) (65)
  \p{Ccc: *} \p{Canonical_Combining_Class: *}
  \p{CE} \p{Composition_Exclusion} (=
        \p{Composition_Exclusion=Y}) (81)
  \p{CE: *} \p{Composition_Exclusion: *}
  \p{Cf} \p{Format} (= \p{General_Category=Format})
        (170)
  \p{Chakma} \p{Script_Extensions=Chakma} (Short:
        \p{Cakm}; NOT \p{Block=Chakma}) (91)
  \p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block=
        Cham}) (83)
  \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
        \p{CWCF}) (1506)
  \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
        (1_112_606 plus all above-Unicode code
        points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
        \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
        \xb4\xb6-\xbf\xd7\xe0-\xff], U+0101,
        U+0103, U+0105, U+0107, U+0109 ...)
  \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
        (1506: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
        U+0100, U+0102, U+0104, U+0106, U+0108
        ...)
  \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
        \p{CWCM}) (2927)
  \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
        (1_111_185 plus all above-Unicode code
        points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
        \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xb4
        \xb6-\xbf\xd7\xf7], U+0138, U+018D,
        U+019B, U+01AA..01AB, U+01BA..01BB ...)
  \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
        (2927: [A-Za-z\xb5\xc0-\xd6\xd8-\xf6
        \xf8-\xff], U+0100..0137, U+0139..018C,
        U+018E..019A, U+019C..01A9, U+01AC..01B9
        ...)
  \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
        \p{CWL}) (1433)
  \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
        (1_112_679 plus all above-Unicode code
        points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
        \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
        \xbf\xd7\xdf-\xff], U+0101, U+0103,
        U+0105, U+0107, U+0109 ...)
  \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1433:
        [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
        U+0104, U+0106, U+0108 ...)
  \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
        Y} (Short: \p{CWKCF}) (10_491)
  \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
        \P{CWKCF}) (1_103_621 plus all above-
        Unicode code points: [\x00-\x20!\"#\$
        \%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-
        z\{\|\}~\x7f-\x9f\xa1-\xa7\xa9\xab-\xac
        \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xd7\xe0-
        \xff], U+0101, U+0103, U+0105, U+0107,
        U+0109 ...)
  \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
        \p{CWKCF}) (10_491: [A-Z\xa0\xa8\xaa
        \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
        \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
        U+0106, U+0108 ...)
  \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
        \p{CWT}) (1452)
  \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
        (1_112_660 plus all above-Unicode code
        points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
        \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
        \xb4\xb6-\xde\xf7], U+0100, U+0102,
        U+0104, U+0106, U+0108 ...)
  \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1452:
        [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
        U+0103, U+0105, U+0107, U+0109 ...)
  \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
        \p{CWU}) (1525)
  \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
        (1_112_587 plus all above-Unicode code
        points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
        \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
        \xb4\xb6-\xde\xf7], U+0100, U+0102,
        U+0104, U+0106, U+0108 ...)
  \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1525:
        [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
        U+0103, U+0105, U+0107, U+0109 ...)
  \p{Cher} \p{Cherokee} (= \p{Script_Extensions=
        Cherokee}) (NOT \p{Block=Cherokee}) (172)
  \p{Cherokee} \p{Script_Extensions=Cherokee} (Short:
        \p{Cher}; NOT \p{Block=Cherokee}) (172)
X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
        Cherokee_Supplement}) (80)
X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
        \p{InCherokeeSup}) (80)
X \p{Chess_Symbols} \p{Block=Chess_Symbols} (112)
  \p{Chorasmian} \p{Script_Extensions=Chorasmian} (Short:
        \p{Chrs}; NOT \p{Block=Chorasmian}) (28)
  \p{Chrs} \p{Chorasmian} (= \p{Script_Extensions=
        Chorasmian}) (NOT \p{Block=Chorasmian})
        (28)
  \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
        Y}) (2707)
  \p{CI: *} \p{Case_Ignorable: *}
X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
        CJK_Unified_Ideographs}) (20_992)
X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
        CJK_Compatibility}) (256)
X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
        CJK_Compatibility_Forms}) (32)
X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
        \p{Block=CJK_Compatibility_Ideographs})
        (512)
X \p{CJK_Compat_Ideographs_Sup}
        \p{CJK_Compatibility_Ideographs_-
        Supplement} (= \p{Block=
        CJK_Compatibility_Ideographs_-
        Supplement}) (544)
X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
        \p{InCJKCompat}) (256)
X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
        (Short: \p{InCJKCompatForms}) (32)
X \p{CJK_Compatibility_Ideographs} \p{Block=
        CJK_Compatibility_Ideographs} (Short:
        \p{InCJKCompatIdeographs}) (512)
X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
        CJK_Compatibility_Ideographs_Supplement}
        (Short: \p{InCJKCompatIdeographsSup})
        (544)
X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
        \p{Block=
        CJK_Unified_Ideographs_Extension_A})
        (6592)
X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
        \p{Block=
        CJK_Unified_Ideographs_Extension_B})
        (42_720)
X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
        \p{Block=
        CJK_Unified_Ideographs_Extension_C})
        (4160)
X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
        \p{Block=
        CJK_Unified_Ideographs_Extension_D})
        (224)
X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
        \p{Block=
        CJK_Unified_Ideographs_Extension_E})
        (5776)
X \p{CJK_Ext_F} \p{CJK_Unified_Ideographs_Extension_F} (=
        \p{Block=
        CJK_Unified_Ideographs_Extension_F})
        (7488)
X \p{CJK_Ext_G} \p{CJK_Unified_Ideographs_Extension_G} (=
        \p{Block=
        CJK_Unified_Ideographs_Extension_G})
        (4944)
X \p{CJK_Ext_H} \p{CJK_Unified_Ideographs_Extension_H} (=
        \p{Block=
        CJK_Unified_Ideographs_Extension_H})
        (4192)
X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
        CJK_Radicals_Supplement}) (128)
X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
        (Short: \p{InCJKRadicalsSup}) (128)
X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
        \p{Block=CJK_Symbols_And_Punctuation})
        (64)
X \p{CJK_Symbols_And_Punctuation} 
\p{Block=
        CJK_Symbols_And_Punctuation} (Short:
        \p{InCJKSymbols}) (64)
X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
        (Short: \p{InCJK}) (20_992)
X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
        CJK_Unified_Ideographs_Extension_A}
        (Short: \p{InCJKExtA}) (6592)
X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
        CJK_Unified_Ideographs_Extension_B}
        (Short: \p{InCJKExtB}) (42_720)
X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
        CJK_Unified_Ideographs_Extension_C}
        (Short: \p{InCJKExtC}) (4160)
X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
        CJK_Unified_Ideographs_Extension_D}
        (Short: \p{InCJKExtD}) (224)
X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
        CJK_Unified_Ideographs_Extension_E}
        (Short: \p{InCJKExtE}) (5776)
X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
        CJK_Unified_Ideographs_Extension_F}
        (Short: \p{InCJKExtF}) (7488)
X \p{CJK_Unified_Ideographs_Extension_G} \p{Block=
        CJK_Unified_Ideographs_Extension_G}
        (Short: \p{InCJKExtG}) (4944)
X \p{CJK_Unified_Ideographs_Extension_H} \p{Block=
        CJK_Unified_Ideographs_Extension_H}
        (Short: \p{InCJKExtH}) (4192)
  \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
        (Short: \p{Pe}) (77)
  \p{Cn} \p{Unassigned} (= \p{General_Category=
        Unassigned}) (825_345 plus all above-
        Unicode code points)
  \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
        Control}) (65)
  \p{Co} \p{Private_Use} (= \p{General_Category=
        Private_Use}) (NOT \p{Private_Use_Area})
        (137_468)
X \p{Combining_Diacritical_Marks} \p{Block=
        Combining_Diacritical_Marks} (Short:
        \p{InDiacriticals}) (112)
X \p{Combining_Diacritical_Marks_Extended} \p{Block=
        Combining_Diacritical_Marks_Extended}
        (Short: \p{InDiacriticalsExt}) (80)
X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
        Combining_Diacritical_Marks_For_Symbols}
        (Short: \p{InDiacriticalsForSymbols})
        (48)
X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
        Combining_Diacritical_Marks_Supplement}
        (Short: \p{InDiacriticalsSup}) (64)
X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
        \p{InHalfMarks}) (16)
  \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
        (2450)
X \p{Combining_Marks_For_Symbols}
        \p{Combining_Diacritical_Marks_For_-
        Symbols} (= \p{Block=
        Combining_Diacritical_Marks_For_-
        Symbols}) (48)
  \p{Common} \p{Script_Extensions=Common} (Short:
        \p{Zyyy}) (7873)
X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
        (Short: \p{InIndicNumberForms}) (16)
  \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
        \p{Full_Composition_Exclusion=Y}) (1120)
  \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
        Hangul_Compatibility_Jamo}) (96)
  \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
        \p{CE}) (81)
  \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
        plus all above-Unicode code points:
        U+0000..0957, U+0960..09DB, U+09DE,
        U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
        ...)
  \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
        U+0958..095F, U+09DC..09DD, U+09DF,
        U+0A33, U+0A36, U+0A59..0A5B ...)
  \p{Connector_Punctuation} \p{General_Category=
        Connector_Punctuation} (Short: \p{Pc})
        (10)
  \p{Control} \p{XPosixCntrl} (= \p{General_Category=
        Control}) (65)
X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
  \p{Copt} \p{Coptic} (= \p{Script_Extensions=
        Coptic}) (NOT \p{Block=Coptic}) (165)
  \p{Coptic} \p{Script_Extensions=Coptic} (Short:
        \p{Copt}; NOT \p{Block=Coptic}) (165)
X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
        Counting_Rod_Numerals}) (32)
X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
        \p{InCountingRod}) (32)
  \p{Cpmn} \p{Cypro_Minoan} (= \p{Script_Extensions=
        Cypro_Minoan}) (NOT \p{Block=
        Cypro_Minoan}) (101)
  \p{Cprt} \p{Cypriot} (= \p{Script_Extensions=
        Cypriot}) (112)
  \p{Cs} \p{Surrogate} (= \p{General_Category=
        Surrogate}) (2048)
  \p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
        \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
        \p{Block=
        Cuneiform_Numbers_And_Punctuation}) (128)
X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
        Cuneiform_Numbers_And_Punctuation}
        (Short: \p{InCuneiformNumbers}) (128)
  \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
        (Short: \p{Sc}) (63)
X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
  \p{CWCF} \p{Changes_When_Casefolded} (=
        \p{Changes_When_Casefolded=Y}) (1506)
  \p{CWCF: *} \p{Changes_When_Casefolded: *}
  \p{CWCM} \p{Changes_When_Casemapped} (=
        \p{Changes_When_Casemapped=Y}) (2927)
  \p{CWCM: *} \p{Changes_When_Casemapped: *}
  \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
        \p{Changes_When_NFKC_Casefolded=Y})
        (10_491)
  \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
  \p{CWL} \p{Changes_When_Lowercased} (=
        \p{Changes_When_Lowercased=Y}) 
                            (1433)
    \p{CWL: *}              \p{Changes_When_Lowercased: *}
    \p{CWT}                 \p{Changes_When_Titlecased} (=
                            \p{Changes_When_Titlecased=Y}) (1452)
    \p{CWT: *}              \p{Changes_When_Titlecased: *}
    \p{CWU}                 \p{Changes_When_Uppercased} (=
                            \p{Changes_When_Uppercased=Y}) (1525)
    \p{CWU: *}              \p{Changes_When_Uppercased: *}
    \p{Cypriot}             \p{Script_Extensions=Cypriot} (Short:
                            \p{Cprt}) (112)
  X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
    \p{Cypro_Minoan}        \p{Script_Extensions=Cypro_Minoan} (Short:
                            \p{Cpmn}; NOT \p{Block=Cypro_Minoan})
                            (101)
    \p{Cyrillic}            \p{Script_Extensions=Cyrillic} (Short:
                            \p{Cyrl}; NOT \p{Block=Cyrillic}) (510)
  X \p{Cyrillic_Ext_A}      \p{Cyrillic_Extended_A} (= \p{Block=
                            Cyrillic_Extended_A}) (32)
  X \p{Cyrillic_Ext_B}      \p{Cyrillic_Extended_B} (= \p{Block=
                            Cyrillic_Extended_B}) (96)
  X \p{Cyrillic_Ext_C}      \p{Cyrillic_Extended_C} (= \p{Block=
                            Cyrillic_Extended_C}) (16)
  X \p{Cyrillic_Ext_D}      \p{Cyrillic_Extended_D} (= \p{Block=
                            Cyrillic_Extended_D}) (96)
  X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
                            \p{InCyrillicExtA}) (32)
  X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
                            \p{InCyrillicExtB}) (96)
  X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
                            \p{InCyrillicExtC}) (16)
  X \p{Cyrillic_Extended_D} \p{Block=Cyrillic_Extended_D} (Short:
                            \p{InCyrillicExtD}) (96)
  X \p{Cyrillic_Sup}        \p{Cyrillic_Supplement} (= \p{Block=
                            Cyrillic_Supplement}) (48)
  X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
                            \p{InCyrillicSup}) (48)
  X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
                            Cyrillic_Supplement}) (48)
    \p{Cyrl}                \p{Cyrillic} (= \p{Script_Extensions=
                            Cyrillic}) (NOT \p{Block=Cyrillic}) (510)
    \p{Dash}                \p{Dash=Y} (30)
    \p{Dash: N*}            (Single: \P{Dash}) (1_114_082 plus all
                            above-Unicode code points: [\x00-\x20!
                            \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
                            \[\\\]\^_`a-z\{\|\}~\x7f-\xff],
                            U+0100..0589, U+058B..05BD,
                            U+05BF..13FF, U+1401..1805, U+1807..200F
                            ...)
    \p{Dash: Y*}            (Single: \p{Dash}) (30: [\-], U+058A,
                            U+05BE, U+1400, U+1806, U+2010..2015 ...)
    \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
                            (Short: \p{Pd}) (26)
    \p{Decimal_Number}      \p{XPosixDigit} (= \p{General_Category=
                            Decimal_Number}) (680)
    \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
                            (13_233)
    \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_233:
                            [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd
                            \xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-\xfd
                            \xff], U+0100..010F, U+0112..0125,
                            U+0128..0130, U+0134..0137, U+0139..013E
                            ...)
    \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
                            U+2460..2473, U+24B6..24EA,
                            U+3244..3247, U+3251..327E,
                            U+3280..32BF, U+32D0..32FE ...)
    \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
    \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720: [\xa8
                            \xaf\xb4-\xb5\xb8], U+0132..0133,
                            U+013F..0140, U+0149, U+017F,
                            U+01C4..01CC ...)
    \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
    \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
    \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
                            U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
                            ...)
    \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1194: U+2102,
                            U+210A..2113, U+2115, U+2119..211D,
                            U+2124, U+2128 ...)
    \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
    \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
                            \xbe], U+2150..215F, U+2189)
    \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
    \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
                            U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
                            ...)
    \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
    \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
                            U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
                            ...)
    \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
    \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
                            U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
                            ...)
    \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
    \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
                            U+FF61..FFBE, U+FFC2..FFC7,
                            U+FFCA..FFCF, U+FFD2..FFD7,
                            U+FFDA..FFDC, U+FFE8..FFEE)
    \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
    \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
                            U+0F0C, U+2007, U+2011, U+202F)
    \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
                            Non_Canonical} (Perl extension) (3796)
    \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
                            decompositions (Short: \p{Dt=NonCanon})
                            (Perl extension) (3796: [\xa0\xa8\xaa
                            \xaf\xb2-\xb5\xb8-\xba\xbc-\xbe],
                            U+0132..0133, U+013F..0140, U+0149,
                            U+017F, U+01C4..01CC ...)
    \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_083 plus
                            all above-Unicode code points: [\x00-
                            \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
                            \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-\xdf
                            \xe6\xf0\xf7-\xf8\xfe], U+0110..0111,
                            U+0126..0127, U+0131, U+0138,
                            U+0141..0142 ...)
    \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
                            U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
    \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
    \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
    \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
                            U+32CC..32CF, U+32FF..3357,
                            U+3371..33DF, U+33FF, U+1F130..1F14F ...)
    \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (64: U+1D62..1D6A,
                            U+2080..208E, U+2090..209C, U+2C7C,
                            U+1E051..1E06A)
    \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (249)
    \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (249: [\xaa\xb2-
                            \xb3\xb9-\xba], U+02B0..02B8,
                            U+02E0..02E4, U+10FC, U+1D2C..1D2E,
                            U+1D30..1D3A ...)
    \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
    \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
                            U+30FF, U+FE10..FE19, U+FE30..FE44,
                            U+FE47..FE48)
    \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
                            U+FF01..FF60, U+FFE0..FFE6)
    \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
                            Y} (Short: \p{DI}) (4174)
    \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
                            (1_109_938 plus all above-Unicode code
                            points: [\x00-\xac\xae-\xff],
                            U+0100..034E, U+0350..061B,
                            U+061D..115E, U+1161..17B3, U+17B6..180A
                            ...)
    \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
                            (4174: [\xad], U+034F, U+061C,
                            U+115F..1160, U+17B4..17B5, U+180B..180F
                            ...)
    \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (15)
    \p{Dep: *}              \p{Deprecated: *}
    \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (15)
    \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_097
                            plus all above-Unicode code points:
                            U+0000..0148, U+014A..0672,
                            U+0674..0F76, U+0F78, U+0F7A..17A2,
                            U+17A5..2069 ...)
    \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
                            U+0673, U+0F77, U+0F79, U+17A3..17A4,
                            U+206A..206F ...)
    \p{Deseret}             \p{Script_Extensions=Deseret} (Short:
                            \p{Dsrt}) (80)
    \p{Deva}                \p{Devanagari} (= \p{Script_Extensions=
                            Devanagari}) (NOT \p{Block=Devanagari})
                            (220)
    \p{Devanagari}          \p{Script_Extensions=Devanagari} (Short:
                            \p{Deva}; NOT \p{Block=Devanagari}) (220)
  X \p{Devanagari_Ext}      \p{Devanagari_Extended} (= \p{Block=
                            Devanagari_Extended}) (32)
  X \p{Devanagari_Ext_A}    \p{Devanagari_Extended_A} (= \p{Block=
                            Devanagari_Extended_A}) (96)
  X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
                            \p{InDevanagariExt}) (32)
  X \p{Devanagari_Extended_A} \p{Block=Devanagari_Extended_A} (Short:
                            \p{InDevanagariExtA}) (96)
    \p{DI}                  \p{Default_Ignorable_Code_Point} (=
                            \p{Default_Ignorable_Code_Point=Y})
                            (4174)
    \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
    \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (1144)
    \p{Dia: *}              \p{Diacritic: *}
    \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (1144)
    \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_112_968
                            plus all above-Unicode code points:
                            [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
                            \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
                            U+0100..02AF, U+034F, U+0358..035C,
                            U+0363..0373, U+0376..0379 ...)
    \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (1144: [\^`
                            \xa8\xaf\xb4\xb7-\xb8], U+02B0..034E,
                            U+0350..0357, U+035D..0362,
                            U+0374..0375, U+037A ...)
  X \p{Diacriticals}        \p{Combining_Diacritical_Marks} (=
                            \p{Block=Combining_Diacritical_Marks})
                            (112)
  X \p{Diacriticals_Ext}    \p{Combining_Diacritical_Marks_Extended}
                            (= \p{Block=
                            Combining_Diacritical_Marks_Extended})
                            (80)
  X \p{Diacriticals_For_Symbols}
                            \p{Combining_Diacritical_Marks_For_-
                            Symbols} (= \p{Block=
                            Combining_Diacritical_Marks_For_-
                            Symbols}) (48)
  X \p{Diacriticals_Sup}    \p{Combining_Diacritical_Marks_Supplement}
                            (= \p{Block=
                            Combining_Diacritical_Marks_Supplement})
                            (64)
    \p{Diak}                \p{Dives_Akuru} (= \p{Script_Extensions=
                            Dives_Akuru}) (NOT \p{Block=
                            Dives_Akuru}) (72)
    \p{Digit}               \p{XPosixDigit} (= \p{General_Category=
                            Decimal_Number}) (680)
  X \p{Dingbats}            \p{Block=Dingbats} (192)
    \p{Dives_Akuru}         \p{Script_Extensions=Dives_Akuru} (Short:
                            \p{Diak}; NOT \p{Block=Dives_Akuru}) (72)
    \p{Dogr}                \p{Dogra} (= \p{Script_Extensions=Dogra})
                            (NOT \p{Block=Dogra}) (82)
    \p{Dogra}               \p{Script_Extensions=Dogra} (Short:
                            \p{Dogr}; NOT \p{Block=Dogra}) (82)
  X \p{Domino}              \p{Domino_Tiles} (= \p{Block=
                            Domino_Tiles}) (112)
  X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (Short:
                            \p{InDomino}) (112)
    \p{Dsrt}                \p{Deseret} (= \p{Script_Extensions=
                            Deseret}) (80)
    \p{Dt: *}               \p{Decomposition_Type: *}
    \p{Dupl}                \p{Duployan} (= \p{Script_Extensions=
                            Duployan}) (NOT \p{Block=Duployan}) (147)
    \p{Duployan}            \p{Script_Extensions=Duployan} (Short:
                            \p{Dupl}; NOT \p{Block=Duployan}) (147)
    \p{Ea: *}               \p{East_Asian_Width: *}
  X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
                            (208)
    \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
    \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739: [\xa1
                            \xa4\xa7-\xa8\xaa\xad-\xae\xb0-\xb4\xb6-
                            \xba\xbc-\xbf\xc6\xd0\xd7-\xd8\xde-\xe1
                            \xe6\xe8-\xea\xec-\xed\xf0\xf2-\xf3\xf7-
                            \xfa\xfc\xfe], U+0101, U+0111,
                            U+0113,
                            U+011B, U+0126..0127 ...)
    \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
    \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
                            U+FF01..FF60, U+FFE0..FFE6)
    \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
    \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
                            U+FF61..FFBE, U+FFC2..FFC7,
                            U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
                            ...)
    \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (792_623 plus
                            all above-Unicode code points)
    \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
    \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-\x7e
                            \xa2-\xa3\xa5-\xa6\xac\xaf],
                            U+27E6..27ED, U+2985..2986)
    \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (792_623 plus all
                            above-Unicode code points: [\x00-\x1f
                            \x7f-\xa0\xa9\xab\xb5\xbb\xc0-\xc5\xc7-
                            \xcf\xd1-\xd6\xd9-\xdd\xe2-\xe5\xe7\xeb
                            \xee-\xef\xf1\xf4-\xf6\xfb\xfd\xff],
                            U+00FF..0100, U+0102..0110, U+0112,
                            U+0114..011A, U+011C..0125 ...)
    \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (182_412)
    \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (182_412:
                            U+1100..115F, U+231A..231B,
                            U+2329..232A, U+23E9..23EC, U+23F0,
                            U+23F3 ...)
    \p{EBase}               \p{Emoji_Modifier_Base} (=
                            \p{Emoji_Modifier_Base=Y}) (134)
    \p{EBase: *}            \p{Emoji_Modifier_Base: *}
    \p{EComp}               \p{Emoji_Component} (= \p{Emoji_Component=
                            Y}) (146)
    \p{EComp: *}            \p{Emoji_Component: *}
    \p{Egyp}                \p{Egyptian_Hieroglyphs} (=
                            \p{Script_Extensions=
                            Egyptian_Hieroglyphs}) (NOT \p{Block=
                            Egyptian_Hieroglyphs}) (1110)
  X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
                            Egyptian_Hieroglyph_Format_Controls} (48)
    \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
                            Egyptian_Hieroglyphs} (Short: \p{Egyp};
                            NOT \p{Block=Egyptian_Hieroglyphs})
                            (1110)
    \p{Elba}                \p{Elbasan} (= \p{Script_Extensions=
                            Elbasan}) (NOT \p{Block=Elbasan}) (40)
    \p{Elbasan}             \p{Script_Extensions=Elbasan} (Short:
                            \p{Elba}; NOT \p{Block=Elbasan}) (40)
    \p{Elym}                \p{Elymaic} (= \p{Script_Extensions=
                            Elymaic}) (NOT \p{Block=Elymaic}) (23)
    \p{Elymaic}             \p{Script_Extensions=Elymaic} (Short:
                            \p{Elym}; NOT \p{Block=Elymaic}) (23)
    \p{EMod}                \p{Emoji_Modifier} (= \p{Emoji_Modifier=
                            Y}) (5)
    \p{EMod: *}             \p{Emoji_Modifier: *}
    \p{Emoji}               \p{Emoji=Y} (1424)
    \p{Emoji: N*}           (Single: \P{Emoji}) (1_112_688 plus all
                            above-Unicode code points: [\x00-\x20!
                            \"\$\%&\'\(\)+,\-.\/:;<=>?\@A-Z\[\\\]
                            \^_`a-z\{\|\}~\x7f-\xa8\xaa-\xad\xaf-
                            \xff], U+0100..203B, U+203D..2048,
                            U+204A..2121, U+2123..2138, U+213A..2193
                            ...)
    \p{Emoji: Y*}           (Single: \p{Emoji}) (1424: [#*0-9\xa9
                            \xae], U+203C, U+2049, U+2122, U+2139,
                            U+2194..2199 ...)
    \p{Emoji_Component}     \p{Emoji_Component=Y} (Short: \p{EComp})
                            (146)
    \p{Emoji_Component: N*} (Short: \p{EComp=N}, \P{EComp}) (1_113_966
                            plus all above-Unicode code points:
                            [\x00-\x20!\"\$\%&\'\(\)+,\-.\/:;<=>?
                            \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
                            U+0100..200C, U+200E..20E2,
                            U+20E4..FE0E, U+FE10..1F1E5,
                            U+1F200..1F3FA ...)
    \p{Emoji_Component: Y*} (Short: \p{EComp=Y}, \p{EComp}) (146:
                            [#*0-9], U+200D, U+20E3, U+FE0F,
                            U+1F1E6..1F1FF, U+1F3FB..1F3FF ...)
    \p{Emoji_Modifier}      \p{Emoji_Modifier=Y} (Short: \p{EMod}) (5)
    \p{Emoji_Modifier: N*}  (Short: \p{EMod=N}, \P{EMod}) (1_114_107
                            plus all above-Unicode code points:
                            U+0000..1F3FA, U+1F400..infinity)
    \p{Emoji_Modifier: Y*}  (Short: \p{EMod=Y}, \p{EMod}) (5:
                            U+1F3FB..1F3FF)
    \p{Emoji_Modifier_Base} \p{Emoji_Modifier_Base=Y} (Short:
                            \p{EBase}) (134)
    \p{Emoji_Modifier_Base: N*} (Short: \p{EBase=N}, \P{EBase})
                            (1_113_978 plus all above-Unicode code
                            points: U+0000..261C, U+261E..26F8,
                            U+26FA..2709, U+270E..1F384,
                            U+1F386..1F3C1, U+1F3C5..1F3C6 ...)
    \p{Emoji_Modifier_Base: Y*} (Short: \p{EBase=Y}, \p{EBase}) (134:
                            U+261D, U+26F9, U+270A..270D, U+1F385,
                            U+1F3C2..1F3C4, U+1F3C7 ...)
    \p{Emoji_Presentation}  \p{Emoji_Presentation=Y} (Short:
                            \p{EPres}) (1205)
    \p{Emoji_Presentation: N*} (Short: \p{EPres=N}, \P{EPres})
                            (1_112_907 plus all above-Unicode code
                            points: U+0000..2319, U+231C..23E8,
                            U+23ED..23EF, U+23F1..23F2,
                            U+23F4..25FC, U+25FF..2613 ...)
    \p{Emoji_Presentation: Y*} (Short: \p{EPres=Y}, \p{EPres}) (1205:
                            U+231A..231B, U+23E9..23EC, U+23F0,
                            U+23F3, U+25FD..25FE, U+2614..2615 ...)
  X \p{Emoticons}           \p{Block=Emoticons} (80)
  X \p{Enclosed_Alphanum}   \p{Enclosed_Alphanumerics} (= \p{Block=
                            Enclosed_Alphanumerics}) (160)
  X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
                            \p{Block=
                            Enclosed_Alphanumeric_Supplement}) (256)
  X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
                            Enclosed_Alphanumeric_Supplement}
                            (Short: \p{InEnclosedAlphanumSup}) (256)
  X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
                            (Short: \p{InEnclosedAlphanum}) (160)
  X \p{Enclosed_CJK}        \p{Enclosed_CJK_Letters_And_Months} (=
                            \p{Block=
                            Enclosed_CJK_Letters_And_Months}) (256)
  X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
                            Enclosed_CJK_Letters_And_Months} (Short:
                            \p{InEnclosedCJK}) (256)
  X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
                            (= \p{Block=
                            Enclosed_Ideographic_Supplement}) (256)
  X \p{Enclosed_Ideographic_Supplement} \p{Block=
                            Enclosed_Ideographic_Supplement} (Short:
                            \p{InEnclosedIdeographicSup}) (256)
    \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
                            (Short: \p{Me}) (13)
    \p{EPres}               \p{Emoji_Presentation} (=
                            \p{Emoji_Presentation=Y}) (1205)
    \p{EPres: *}            \p{Emoji_Presentation: *}
    \p{Ethi}                \p{Ethiopic} (= \p{Script_Extensions=
                            Ethiopic}) (NOT \p{Block=Ethiopic}) (523)
    \p{Ethiopic}            \p{Script_Extensions=Ethiopic} (Short:
                            \p{Ethi}; NOT \p{Block=Ethiopic}) (523)
  X \p{Ethiopic_Ext}        \p{Ethiopic_Extended} (= \p{Block=
                            Ethiopic_Extended}) (96)
  X \p{Ethiopic_Ext_A}      \p{Ethiopic_Extended_A} (= \p{Block=
                            Ethiopic_Extended_A}) (48)
  X \p{Ethiopic_Ext_B}      \p{Ethiopic_Extended_B} (= \p{Block=
                            Ethiopic_Extended_B}) (32)
  X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (Short:
                            \p{InEthiopicExt}) (96)
  X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
                            \p{InEthiopicExtA}) (48)
  X \p{Ethiopic_Extended_B}
                            \p{Block=Ethiopic_Extended_B} (Short:
                            \p{InEthiopicExtB}) (32)
  X \p{Ethiopic_Sup}        \p{Ethiopic_Supplement} (= \p{Block=
                            Ethiopic_Supplement}) (32)
  X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
                            \p{InEthiopicSup}) (32)
    \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (50)
    \p{Ext: *}              \p{Extender: *}
    \p{Extended_Pictographic} \p{Extended_Pictographic=Y} (Short:
                            \p{ExtPict}) (3537)
    \p{Extended_Pictographic: N*} (Short: \p{ExtPict=N}, \P{ExtPict})
                            (1_110_575 plus all above-Unicode code
                            points: [\x00-\xa8\xaa-\xad\xaf-\xff],
                            U+0100..203B, U+203D..2048,
                            U+204A..2121, U+2123..2138, U+213A..2193
                            ...)
    \p{Extended_Pictographic: Y*} (Short: \p{ExtPict=Y}, \p{ExtPict})
                            (3537: [\xa9\xae], U+203C, U+2049,
                            U+2122, U+2139, U+2194..2199 ...)
    \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (50)
    \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_062
                            plus all above-Unicode code points:
                            [\x00-\xb6\xb8-\xff], U+0100..02CF,
                            U+02D2..063F, U+0641..07F9,
                            U+07FB..0B54, U+0B56..0E45 ...)
    \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (50: [\xb7],
                            U+02D0..02D1, U+0640, U+07FA, U+0B55,
                            U+0E46 ...)
    \p{ExtPict}             \p{Extended_Pictographic} (=
                            \p{Extended_Pictographic=Y}) (3537)
    \p{ExtPict: *}          \p{Extended_Pictographic: *}
    \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
                            (Short: \p{Pf}) (10)
    \p{Format}              \p{General_Category=Format} (Short:
                            \p{Cf}) (170)
    \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
                            (Short: \p{CompEx}) (1120)
    \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
                            \P{CompEx}) (1_112_992 plus all above-
                            Unicode code points: U+0000..033F,
                            U+0342, U+0345..0373, U+0375..037D,
                            U+037F..0386, U+0388..0957 ...)
    \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
                            \p{CompEx}) (1120: U+0340..0341,
                            U+0343..0344, U+0374, U+037E, U+0387,
                            U+0958..095F ...)
    \p{Gc: *}               \p{General_Category: *}
    \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
    \p{General_Category: C} \p{General_Category=Other} (965_096 plus
                            all above-Unicode code points)
    \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
                            \p{Gc=LC}, \p{LC}) (4095: [A-Za-z\xb5
                            \xc0-\xd6\xd8-\xf6\xf8-\xff],
                            U+0100..01BA, U+01BC..01BF,
                            U+01C4..0293, U+0295..02AF, U+0370..0373
                            ...)
    \p{General_Category: Cc} \p{General_Category=Control} (65)
    \p{General_Category: Cf} \p{General_Category=Format} (170)
    \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
                            (77: [\)\]\}], U+0F3B, U+0F3D, U+169C,
                            U+2046, U+207E ...)
    \p{General_Category: Cn} \p{General_Category=Unassigned} (825_345
                            plus all above-Unicode code points)
    \p{General_Category: Cntrl} \p{General_Category=Control} (65)
    \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
    \p{General_Category: Combining_Mark} \p{General_Category=Mark}
                            (2450)
    \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
                            \p{Pc}) (10: [_], U+203F..2040, U+2054,
                            U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
    \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
                            [\x00-\x1f\x7f-\x9f])
    \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
    \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
                            (63: [\$\xa2-\xa5], U+058F, U+060B,
                            U+07FE..07FF, U+09F2..09F3, U+09FB ...)
    \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
                            (26: [\-], U+058A, U+05BE, U+1400,
                            U+1806, U+2010..2015 ...)
    \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
                            (680: [0-9], U+0660..0669, U+06F0..06F9,
                            U+07C0..07C9, U+0966..096F, U+09E6..09EF
                            ...)
    \p{General_Category: Digit} \p{General_Category=Decimal_Number}
                            (680)
    \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
                            (13: U+0488..0489, U+1ABE, U+20DD..20E0,
                            U+20E2..20E4, U+A670..A672)
    \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
                            (10: [\xbb], U+2019, U+201D, U+203A,
                            U+2E03, U+2E05 ...)
    \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (170:
                            [\xad], U+0600..0605, U+061C, U+06DD,
                            U+070F, U+0890..0891 ...)
    \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
                            \p{Pi}) (12: [\xab], U+2018,
                            U+201B..201C, U+201F, U+2039, U+2E02 ...)
    \p{General_Category: L} \p{General_Category=Letter} (136_104)
  X \p{General_Category: L&} \p{General_Category=Cased_Letter} (4095)
  X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
                            the trailing '_' matters in spite of
                            loose matching rules. (4095)
    \p{General_Category: LC} \p{General_Category=Cased_Letter} (4095)
    \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (136_104:
                            [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
                            \xf8-\xff], U+0100..02C1, U+02C6..02D1,
                            U+02E0..02E4, U+02EC, U+02EE ...)
    \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
                            (236: U+16EE..16F0, U+2160..2182,
                            U+2185..2188, U+3007, U+3021..3029,
                            U+3038..303A ...)
    \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
                            (1: U+2028)
    \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
                            (/i= General_Category=Cased_Letter)
                            (2233)
    \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (397)
    \p{General_Category: Lo} \p{General_Category=Other_Letter}
                            (131_612)
    \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
                            /i= General_Category=Cased_Letter)
                            (2233: [a-z\xb5\xdf-\xf6\xf8-\xff],
                            U+0101, U+0103, U+0105, U+0107, U+0109
                            ...)
    \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
                            (/i= General_Category=Cased_Letter) (31)
    \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
                            (/i= General_Category=Cased_Letter)
                            (1831)
    \p{General_Category: M} \p{General_Category=Mark} (2450)
    \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2450:
                            U+0300..036F, U+0483..0489,
                            U+0591..05BD, U+05BF, U+05C1..05C2,
                            U+05C4..05C5 ...)
    \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
                            [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
                            U+0606..0608, U+2044, U+2052,
                            U+207A..207C ...)
    \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (452)
    \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
    \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
                            (1985)
    \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
                            (397: U+02B0..02C1, U+02C6..02D1,
                            U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
    \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
                            (125: [\^`\xa8\xaf\xb4\xb8],
                            U+02C2..02C5, U+02D2..02DF,
                            U+02E5..02EB, U+02ED, U+02EF..02FF ...)
    \p{General_Category: N} \p{General_Category=Number} (1831)
    \p{General_Category: Nd} \p{General_Category=Decimal_Number} (680)
    \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
    \p{General_Category: No} \p{General_Category=Other_Number} (915)
    \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
                            (1985: U+0300..036F, U+0483..0487,
                            U+0591..05BD, U+05BF, U+05C1..05C2,
                            U+05C4..05C5 ...)
    \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1831: [0-9
                            \xb2-\xb3\xb9\xbc-\xbe], U+0660..0669,
                            U+06F0..06F9, U+07C0..07C9,
                            U+0966..096F, U+09E6..09EF ...)
    \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
                            (79: [\(\[\{], U+0F3A, U+0F3C, U+169B,
                            U+201A, U+201E ...)
    \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (965_096 plus
                            all above-Unicode code points: [\x00-
                            \x1f\x7f-\x9f\xad], U+0378..0379,
                            U+0380..0383, U+038B, U+038D, U+03A2 ...)
    \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
                            (131_612: [\xaa\xba], U+01BB,
                            U+01C0..01C3, U+0294, U+05D0..05EA,
                            U+05EF..05F2 ...)
    \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
                            (915: [\xb2-\xb3\xb9\xbc-\xbe],
                            U+09F4..09F9, U+0B72..0B77,
                            U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
                            ...)
    \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
                            (628: [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7
                            \xb6-\xb7\xbf], U+037E, U+0387,
                            U+055A..055F, U+0589, U+05C0 ...)
    \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
                            (6634: [\xa6\xa9\xae\xb0], U+0482,
                            U+058D..058E, U+060E..060F, U+06DE,
                            U+06E9 ...)
    \p{General_Category: P} \p{General_Category=Punctuation} (842)
    \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
                            \p{Zp}) (1: U+2029)
    \p{General_Category: Pc} \p{General_Category=
                            Connector_Punctuation} (10)
    \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (26)
    \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
                            (77)
    \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
                            (10)
    \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
                            (12)
    \p{General_Category: Po} \p{General_Category=Other_Punctuation}
                            (628)
    \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
                            (137_468: U+E000..F8FF, U+F0000..FFFFD,
                            U+100000..10FFFD)
    \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (79)
    \p{General_Category: Punct} \p{General_Category=Punctuation} (842)
    \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (842:
                            [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_\{\}
                            \xa1\xa7\xab\xb6-\xb7\xbb\xbf], U+037E,
                            U+0387,
                            U+055A..055F, U+0589..058A,
                            U+05BE ...)
    \p{General_Category: S} \p{General_Category=Symbol} (7770)
    \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (63)
    \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
                            [\x20\xa0], U+1680, U+2000..200A,
                            U+2028..2029, U+202F, U+205F ...)
    \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (125)
    \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
    \p{General_Category: So} \p{General_Category=Other_Symbol} (6634)
    \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
                            (17: [\x20\xa0], U+1680, U+2000..200A,
                            U+202F, U+205F, U+3000)
    \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
                            (452: U+0903, U+093B, U+093E..0940,
                            U+0949..094C, U+094E..094F, U+0982..0983
                            ...)
    \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
                            U+D800..DFFF)
    \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7770:
                            [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
                            \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
                            U+02D2..02DF, U+02E5..02EB, U+02ED,
                            U+02EF..02FF ...)
    \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
                            /i= General_Category=Cased_Letter) (31:
                            U+01C5, U+01C8, U+01CB, U+01F2,
                            U+1F88..1F8F, U+1F98..1F9F ...)
    \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
                            (825_345 plus all above-Unicode code
                            points: U+0378..0379, U+0380..0383,
                            U+038B, U+038D, U+03A2, U+0530 ...)
    \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
                            /i= General_Category=Cased_Letter)
                            (1831: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
                            U+0102, U+0104, U+0106, U+0108 ...)
    \p{General_Category: Z} \p{General_Category=Separator} (19)
    \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
    \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
                            (1)
    \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
  X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
                            \p{InPunctuation}) (112)
  X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
  X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
                            \p{Block=Geometric_Shapes_Extended})
                            (128)
  X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
                            (Short: \p{InGeometricShapesExt}) (128)
    \p{Geor}                \p{Georgian} (= \p{Script_Extensions=
                            Georgian}) (NOT \p{Block=Georgian}) (174)
    \p{Georgian}            \p{Script_Extensions=Georgian} (Short:
                            \p{Geor}; NOT \p{Block=Georgian}) (174)
  X \p{Georgian_Ext}        \p{Georgian_Extended} (= \p{Block=
                            Georgian_Extended}) (48)
  X \p{Georgian_Extended}   \p{Block=Georgian_Extended} (Short:
                            \p{InGeorgianExt}) (48)
  X \p{Georgian_Sup}        \p{Georgian_Supplement} (= \p{Block=
                            Georgian_Supplement}) (48)
  X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
                            \p{InGeorgianSup}) (48)
    \p{Glag}                \p{Glagolitic} (= \p{Script_Extensions=
                            Glagolitic}) (NOT \p{Block=Glagolitic})
                            (138)
    \p{Glagolitic}          \p{Script_Extensions=Glagolitic} (Short:
                            \p{Glag}; NOT \p{Block=Glagolitic}) (138)
  X \p{Glagolitic_Sup}      \p{Glagolitic_Supplement} (= \p{Block=
                            Glagolitic_Supplement}) (48)
  X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
                            \p{InGlagoliticSup}) (48)
    \p{Gong}                \p{Gunjala_Gondi} (= \p{Script_Extensions=
                            Gunjala_Gondi}) (NOT \p{Block=
                            Gunjala_Gondi}) (65)
    \p{Gonm}                \p{Masaram_Gondi} (= \p{Script_Extensions=
                            Masaram_Gondi}) (NOT \p{Block=
                            Masaram_Gondi}) (77)
    \p{Goth}                \p{Gothic} (= \p{Script_Extensions=
                            Gothic}) (NOT
                            \p{Block=Gothic}) (27)
    \p{Gothic}              \p{Script_Extensions=Gothic} (Short:
                            \p{Goth}; NOT \p{Block=Gothic}) (27)
    \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
                            (146_986)
    \p{Gr_Base: *}          \p{Grapheme_Base: *}
    \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
                            Y}) (2125)
    \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
    \p{Gran}                \p{Grantha} (= \p{Script_Extensions=
                            Grantha}) (NOT \p{Block=Grantha}) (116)
    \p{Grantha}             \p{Script_Extensions=Grantha} (Short:
                            \p{Gran}; NOT \p{Block=Grantha}) (116)
    \p{Graph}               \p{XPosixGraph} (286_635)
    \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
                            (146_986)
    \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase}) (967_126
                            plus all above-Unicode code points:
                            [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
                            U+0378..0379, U+0380..0383, U+038B,
                            U+038D ...)
    \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase})
                            (146_986: [\x20-\x7e\xa0-\xac\xae-\xff],
                            U+0100..02FF, U+0370..0377,
                            U+037A..037F, U+0384..038A, U+038C ...)
    \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
                            (3893)
    \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3893: [^
                            \n\r\x20-\x7e\xa0-\xac\xae-\xff],
                            U+061C, U+180E, U+200B, U+200E..200F,
                            U+2028..202E ...)
  \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
  \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
  \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
  \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
  \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
        (0)
  \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
        E_Base_GAZ} (0)
  \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
        E_Modifier} (0)
  \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
        (2130)
  \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (2130:
        U+0300..036F, U+0483..0489,
        U+0591..05BD, U+05BF, U+05C1..05C2,
        U+05C4..05C5 ...)
  \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
        Glue_After_Zwj} (0)
  \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
  \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
        U+1100..115F, U+A960..A97C)
  \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
  \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
        U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
        ...)
  \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
        U+AC01..AC1B, U+AC1D..AC37,
        U+AC39..AC53, U+AC55..AC6F,
        U+AC71..AC8B, U+AC8D..ACA7 ...)
  \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_109
        plus all above-Unicode code points:
        [\x20-\x7e\xa0-\xac\xae-\xff],
        U+0100..02FF, U+0370..0482,
        U+048A..0590, U+05BE, U+05C0 ...)
  \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
        (27)
  \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (27:
        U+0600..0605, U+06DD, U+070F,
        U+0890..0891, U+08E2, U+0D4E ...)
  \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
        (26: U+1F1E6..1F1FF)
  \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
        Regional_Indicator} (26)
  \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
        SpacingMark} (395)
  \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (395:
        U+0903, U+093B, U+093E..0940,
        U+0949..094C, U+094E..094F, U+0982..0983
        ...)
  \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
        U+11A8..11FF, U+D7CB..D7FB)
  \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
        U+1160..11A7, U+D7B0..D7C6)
  \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
        (1_096_109 plus all above-Unicode code
        points)
  \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
  \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
        (2125)
  \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_111_987
        plus all above-Unicode code points:
        U+0000..02FF, U+0370..0482,
        U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
  \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (2125:
        U+0300..036F, U+0483..0489,
        U+0591..05BD, U+05BF, U+05C1..05C2,
        U+05C4..05C5 ...)
  \p{Greek} \p{Script_Extensions=Greek} (Short:
        \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
        \p{InGreek}) (144)
X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
        Greek_Extended}) (256)
X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
        \p{InGreekExt}) (256)
  \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek})
        (NOT \p{Greek_And_Coptic}) (522)
  \p{Gujarati} \p{Script_Extensions=Gujarati} (Short:
        \p{Gujr}; NOT \p{Block=Gujarati}) (105)
  \p{Gujr} \p{Gujarati} (= \p{Script_Extensions=
        Gujarati}) (NOT \p{Block=Gujarati}) (105)
  \p{Gunjala_Gondi} \p{Script_Extensions=Gunjala_Gondi}
        (Short: \p{Gong}; NOT \p{Block=
        Gunjala_Gondi}) (65)
  \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short:
        \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
  \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions=
        Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
        \p{Block=Halfwidth_And_Fullwidth_Forms})
        (240)
X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
        Combining_Half_Marks}) (16)
X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
        Halfwidth_And_Fullwidth_Forms} (Short:
        \p{InHalfAndFullForms}) (240)
  \p{Han} \p{Script_Extensions=Han} (98_696)
  \p{Hang} \p{Hangul} (= \p{Script_Extensions=
        Hangul}) (NOT \p{Hangul_Syllables})
        (11_775)
  \p{Hangul} \p{Script_Extensions=Hangul} (Short:
        \p{Hang}; NOT \p{Hangul_Syllables})
        (11_775)
X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
        (Short: \p{InCompatJamo}) (96)
X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
        (256)
X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
        (Short: \p{InJamoExtA}) (32)
X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
        (Short: \p{InJamoExtB}) (80)
  \p{Hangul_Syllable_Type: L}
        \p{Hangul_Syllable_Type=Leading_Jamo}
        (125)
  \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
        U+1100..115F, U+A960..A97C)
  \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
        (399)
  \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
        U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
        U+AC8C ...)
  \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
        LVT_Syllable} (10_773)
  \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
        (10_773: U+AC01..AC1B, U+AC1D..AC37,
        U+AC39..AC53, U+AC55..AC6F,
        U+AC71..AC8B, U+AC8D..ACA7 ...)
  \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
        Not_Applicable} (1_102_583 plus all
        above-Unicode code points)
  \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
        (1_102_583 plus all above-Unicode code
        points: U+0000..10FF, U+1200..A95F,
        U+A97D..ABFF, U+D7A4..D7AF,
        U+D7C7..D7CA, U+D7FC..infinity)
  \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
        (137)
  \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
        U+11A8..11FF, U+D7CB..D7FB)
  \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
        (95)
  \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
        U+1160..11A7, U+D7B0..D7C6)
X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
        \p{InHangul}) (11_184)
  \p{Hani} \p{Han} (= \p{Script_Extensions=Han})
        (98_696)
  \p{Hanifi_Rohingya} \p{Script_Extensions=Hanifi_Rohingya}
        (Short: \p{Rohg}; NOT \p{Block=
        Hanifi_Rohingya}) (55)
  \p{Hano} \p{Hanunoo} (= \p{Script_Extensions=
        Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
  \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short:
        \p{Hano}; NOT \p{Block=Hanunoo}) (23)
  \p{Hatr} \p{Hatran} (= \p{Script_Extensions=
        Hatran}) (NOT \p{Block=Hatran}) (26)
  \p{Hatran} \p{Script_Extensions=Hatran} (Short:
        \p{Hatr}; NOT
        \p{Block=Hatran}) (26)
  \p{Hebr} \p{Hebrew} (= \p{Script_Extensions=
        Hebrew}) (NOT \p{Block=Hebrew}) (134)
  \p{Hebrew} \p{Script_Extensions=Hebrew} (Short:
        \p{Hebr}; NOT \p{Block=Hebrew}) (134)
  \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  \p{Hex: *} \p{Hex_Digit: *}
  \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
        plus all above-Unicode code points:
        [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
        \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
        U+0100..FF0F, U+FF1A..FF20,
        U+FF27..FF40, U+FF47..infinity)
  \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
        f], U+FF10..FF19, U+FF21..FF26,
        U+FF41..FF46)
X \p{High_Private_Use_Surrogates} \p{Block=
        High_Private_Use_Surrogates} (Short:
        \p{InHighPUSurrogates}) (128)
X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
        \p{Block=High_Private_Use_Surrogates})
        (128)
X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
  \p{Hira} \p{Hiragana} (= \p{Script_Extensions=
        Hiragana}) (NOT \p{Block=Hiragana}) (433)
  \p{Hiragana} \p{Script_Extensions=Hiragana} (Short:
        \p{Hira}; NOT \p{Block=Hiragana}) (433)
  \p{Hluw} \p{Anatolian_Hieroglyphs} (=
        \p{Script_Extensions=
        Anatolian_Hieroglyphs}) (NOT \p{Block=
        Anatolian_Hieroglyphs}) (583)
  \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions=
        Pahawh_Hmong}) (NOT \p{Block=
        Pahawh_Hmong}) (127)
  \p{Hmnp} \p{Nyiakeng_Puachue_Hmong} (=
        \p{Script_Extensions=
        Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
        Nyiakeng_Puachue_Hmong}) (71)
  \p{HorizSpace} \p{XPosixBlank} (18)
  \p{Hst: *} \p{Hangul_Syllable_Type: *}
  \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions=
        Old_Hungarian}) (NOT \p{Block=
        Old_Hungarian}) (108)
D \p{Hyphen} \p{Hyphen=Y} (11)
D \p{Hyphen: N*} Supplanted by Line_Break property values;
        see
        www.unicode.org/reports/tr14
        (Single: \P{Hyphen}) (1_114_101 plus all
        above-Unicode code points: [\x00-\x20!
        \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
        \[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-\xff],
        U+0100..0589, U+058B..1805,
        U+1807..200F, U+2012..2E16, U+2E18..30FA
        ...)
D \p{Hyphen: Y*} Supplanted by Line_Break property values;
        see www.unicode.org/reports/tr14
        (Single: \p{Hyphen}) (11: [\-\xad],
        U+058A, U+1806, U+2010..2011, U+2E17,
        U+30FB ...)
  \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
        \p{Ideographic_Description_Characters})
        (139_482)
  \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (974_630 plus
        all above-Unicode code points: [\x00-
        \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
        \[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4\xb6
        \xb8-\xb9\xbb-\xbf\xd7\xf7],
        U+02C2..02C5, U+02D2..02DF,
        U+02E5..02EB, U+02ED, U+02EF..02FF ...)
  \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (139_482:
        [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
        \xd8-\xf6\xf8-\xff], U+0100..02C1,
        U+02C6..02D1, U+02E0..02E4, U+02EC,
        U+02EE ...)
  \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (136_345)
  \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (977_767 plus
        all above-Unicode code points: [\x00-
        \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@
        \[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-
        \xb9\xbb-\xbf\xd7\xf7], U+02C2..02C5,
        U+02D2..02DF, U+02E5..02EB, U+02ED,
        U+02EF..036F ...)
  \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (136_345: [A-
        Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
        \xff], U+0100..02C1, U+02C6..02D1,
        U+02E0..02E4, U+02EC, U+02EE ...)
  \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
        \p{Ideographic_Description_Characters})
        (139_482)
  \p{IDC: *} \p{ID_Continue: *}
  \p{Identifier_Status: Allowed} (112_159: [\'\-.0-9:A-Z_a-z\xb7
        \xc0-\xd6\xd8-\xf6\xf8-\xff],
        U+0100..0131, U+0134..013E,
        U+0141..0148, U+014A..017E, U+018F ...)
  \p{Identifier_Status: Restricted} (1_001_953 plus all above-
        Unicode code points: [\x00-\x20!\"#\$
        \%&\(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x7f-
        \xb6\xb8-\xbf\xd7\xf7], U+0132..0133,
        U+013F..0140, U+0149, U+017F..018E,
        U+0190..019F ...)
  \p{Identifier_Type: Default_Ignorable} (398: [\xad], U+034F,
        U+061C, U+115F..1160, U+17B4..17B5,
        U+180B..180F ...)
  \p{Identifier_Type: Deprecated} (15: U+0149, U+0673, U+0F77,
        U+0F79, U+17A3..17A4, U+206A..206F ...)
  \p{Identifier_Type: Exclusion} (17_241: U+03E2..03EF,
        U+0800..082D, U+0830..083E,
        U+1680..169C, U+16A0..16EA, U+16EE..16F8
        ...)
  \p{Identifier_Type: Inclusion} (17: [\'\-.:\xb7], U+0375, U+058A,
        U+05F3..05F4, U+06FD..06FE, U+0F0B ...)
  \p{Identifier_Type: Limited_Use} (5268: U+0700..070D,
        U+070F..074A, U+074D..074F,
        U+07C0..07FA, U+07FD..07FF, U+0840..085B
        ...)
  \p{Identifier_Type: Not_Character} (964_920 plus all above-Unicode
        code points: [^\t\n\cK\f\r\x20-\x7e\x85
        \xa0-\xff], U+0378..0379, U+0380..0383,
        U+038B, U+038D, U+03A2 ...)
  \p{Identifier_Type: Not_NFKC} (4921: [\xa0\xa8\xaa\xaf\xb2-\xb5
        \xb8-\xba\xbc-\xbe], U+0132..0133,
        U+013F..0140, U+017F, U+01C4..01CC,
        U+01F1..01F3 ...)
  \p{Identifier_Type: Not_XID} (8277: [\t\n\cK\f\r\x20!\"#\$\%&
        \(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x85
        \xa1-\xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6
        \xbb\xbf\xd7\xf7], U+02C2..02C5,
        U+02D2..02D7, U+02DE..02DF,
        U+02E5..02EB, U+02ED ...)
  \p{Identifier_Type: Obsolete} (1627: U+018D, U+01AA..01AB,
        U+01B9..01BB, U+01BE..01BF,
        U+01F6..01F7, U+021C..021D ...)
  \p{Identifier_Type: Recommended} (112_142: [0-9A-Z_a-z\xc0-\xd6
        \xd8-\xf6\xf8-\xff], U+0100..0131,
        U+0134..013E, U+0141..0148,
        U+014A..017E, U+018F ...)
  \p{Identifier_Type: Technical} (1660: U+0180, U+018D,
        U+01AA..01AB, U+01BA..01BB, U+01BE,
        U+01C0..01C3 ...)
  \p{Identifier_Type: Uncommon_Use} (396: U+0181..018C, U+018E,
        U+0190..019F, U+01A2..01A9,
        U+01AC..01AE, U+01B1..01B8 ...)
  \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
        (105_854)
  \p{Ideo: *} \p{Ideographic: *}
  \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
        (105_854)
  \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_008_258
        plus all above-Unicode code points:
        U+0000..3005, U+3008..3020,
        U+302A..3037, U+303B..33FF,
        U+4DC0..4DFF, U+A000..F8FF ...)
  \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (105_854:
        U+3006..3007, U+3021..3029,
        U+3038..303A, U+3400..4DBF,
        U+4E00..9FFF, U+F900..FA6D ...)
X \p{Ideographic_Description_Characters} \p{Block=
        Ideographic_Description_Characters}
        (Short: \p{InIDC}) (16)
X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
        \p{Block=
        Ideographic_Symbols_And_Punctuation})
        (32)
X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
        Ideographic_Symbols_And_Punctuation}
        (Short: \p{InIdeographicSymbols}) (32)
  \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (136_345)
  \p{IDS: *} \p{ID_Start: *}
  \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
        \p{IDSB}) (10)
  \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
        (1_114_102 plus all above-Unicode code
        points: U+0000..2FEF, U+2FF2..2FF3,
        U+2FFC..infinity)
  \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
        U+2FF0..2FF1, U+2FF4..2FFB)
  \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
        \p{IDST}) (2)
  \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
        (1_114_110 plus all above-Unicode code
        points: U+0000..2FF1, U+2FF4..infinity)
  \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
        U+2FF2..2FF3)
  \p{IDSB} \p{IDS_Binary_Operator} (=
        \p{IDS_Binary_Operator=Y}) (10)
  \p{IDSB: *} \p{IDS_Binary_Operator: *}
  \p{IDST} \p{IDS_Trinary_Operator} (=
        \p{IDS_Trinary_Operator=Y}) (2)
  \p{IDST: *} \p{IDS_Trinary_Operator: *}
  \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic}
        (Short: \p{Armi}; NOT \p{Block=
        Imperial_Aramaic}) (31)
  \p{In: *} \p{Present_In: *} (Perl extension)
X \p{In_*} \p{Block: *}
X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
        Common_Indic_Number_Forms}) (16)
  \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
        (356: U+093C, U+0941..0944, U+094D,
        U+0952, U+0956..0957, U+0962..0963 ...)
  \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
        BottomAndLeft}) (1: U+A9BF)
  \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
        BottomAndRight}) (4: U+1B3B, U+A9BE,
        U+A9C0, U+11942)
  \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (66:
        U+093F, U+094E, U+09BF, U+09C7..09C8,
        U+0A3F, U+0ABF ...)
  \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
        LeftAndRight}) (22: U+09CB..09CC,
        U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
        U+0DDC, U+0DDE ...)
  \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_875
        plus all above-Unicode code points:
        U+0000..08FF, U+0904..0939, U+093D,
        U+0950, U+0953..0954, U+0958..0961 ...)
  \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
        Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
        U+10A01, U+10A06)
  \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (295:
        U+0903, U+093B, U+093E, U+0940,
        U+0949..094C, U+094F ...)
  \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (428:
        U+0900..0902, U+093A, U+0945..0948,
        U+0951, U+0955, U+0981 ...)
  \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
        TopAndBottom}) (10: U+0C48, U+0F73,
        U+0F76..0F79, U+0F81, U+1B3C,
        U+1112E..1112F)
  \p{Indic_Positional_Category: Top_And_Bottom_And_Left} (Short:
        \p{InPC=TopAndBottomAndLeft}) (2:
        U+103C, U+1171E)
  \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
        \p{InPC=TopAndBottomAndRight}) (1:
        U+1B3D)
  \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
        TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
        U+1C29, U+114BB, U+115B9)
  \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
        \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
        U+0DDD, U+17BF, U+115BB)
  \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
        TopAndRight}) (13: U+0AC9, U+0B57,
        U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
        U+1925..1926 ...)
  \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
        VisualOrderLeft}) (19: U+0E40..0E44,
        U+0EC0..0EC4, U+19B5..19B7, U+19BA,
        U+AAB5..AAB6, U+AAB9 ...)
X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
  \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
        (17: U+093D, U+09BD, U+0ABD, U+0B3D,
        U+0C3D, U+0CBD ...)
  \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (94:
        U+0900..0902, U+0981..0982, U+09FC,
        U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
  \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
        \p{InSC=BrahmiJoiningNumber}) (20:
        U+11052..11065)
  \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
        CantillationMark}) (58: U+0951..0952,
        U+0A51, U+0AFA, U+0AFC, U+1CD0..1CD2,
        U+1CD4..1CE1 ...)
  \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
        (2241: U+0915..0939, U+0958..095F,
        U+0978..097F, U+0995..09A8,
        U+09AA..09B0, U+09B2 ...)
  \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
        ConsonantDead}) (14: U+09CE, U+0C5D,
        U+0CDD, U+0D54..0D56, U+0D7A..0D7F,
        U+1CF2..1CF3)
  \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
        ConsonantFinal}) (70: U+1930..1931,
        U+1933..1939, U+19C1..19C7,
        U+1A58..1A59, U+1B03, U+1B81 ...)
  \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
        \p{InSC=ConsonantHeadLetter}) (5:
        U+0F88..0F8C)
  \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
        \p{InSC=ConsonantInitialPostfixed}) (1:
        U+1A5A)
  \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
        ConsonantKiller}) (2: U+0E4C, U+17CD)
  \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
        ConsonantMedial}) (31: U+0A75,
        U+0EBC..0EBD, U+103B..103E,
        U+105E..1060, U+1082, U+1A55..1A56 ...)
  \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
        \p{InSC=ConsonantPlaceholder}) (22: [\-
        \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
        U+104E, U+1900 ...)
  \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
        \p{InSC=ConsonantPrecedingRepha}) (4:
        U+0D4E, U+11941, U+11D46, U+11F02)
  \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
        ConsonantPrefixed}) (10: U+111C2..111C3,
        U+1193F, U+11A3A, U+11A84..11A89)
  \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
        ConsonantSubjoined}) (94: U+0F8D..0F97,
        U+0F99..0FBC, U+1929..192B, U+1A57,
        U+1A5B..1A5E, U+1BA1..1BA3 ...)
  \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
        \p{InSC=ConsonantSucceedingRepha}) (1:
        U+17CC)
  \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
        \p{InSC=ConsonantWithStacker}) (8:
        U+0CF1..0CF2, U+1CF5..1CF6,
        U+11003..11004, U+11460..11461)
  \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
        GeminationMark}) (4: U+0A71, U+0AFB,
        U+11237, U+11A98)
  \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
        InvisibleStacker}) (13: U+1039, U+17D2,
        U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
  \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
        U+200D)
  \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
        ModifyingLetter}) (1: U+0B83)
  \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
        NonJoiner}) (1: U+200C)
  \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (32:
        U+093C, U+09BC, U+0A3C, U+0ABC,
        U+0AFD..0AFF, U+0B3C ...)
  \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (501:
        [0-9], U+0966..096F, U+09E6..09EF,
        U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
        ...)
  \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
        NumberJoiner}) (1: U+1107F)
  \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
        (1_109_473 plus all above-Unicode code
        points: [\x00-\x20!\"#\$\%&\'\(\)*+,.
        \/:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
        \x9f\xa1-\xb1\xb4-\xd6\xd8-\xff],
        U+0100..08FF, U+0950, U+0953..0954,
        U+0964..0965, U+0970..0971 ...)
  \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
        PureKiller}) (26: U+0D3B..0D3C, U+0E3A,
        U+0E4E, U+0EBA, U+0F84, U+103A ...)
  \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
        RegisterShifter}) (2: U+17C9..17CA)
  \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
        SyllableModifier}) (26: [\xb2-\xb3],
        U+09FE, U+0ECE, U+0F35, U+0F37, U+0FC6
        ...)
  \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
        ToneLetter}) (7: U+1970..1974, U+AAC0,
        U+AAC2)
  \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
        (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
        U+1063..1064, U+1069..106D, U+1087..108D
        ...)
  \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
        U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
        U+0BCD ...)
  \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
        (36: U+0903, U+0983, U+0A03, U+0A83,
        U+0B03, U+0C03 ...)
  \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
        U+1963..196D, U+A85E..A861, U+A866,
        U+A922..A92A, U+11150..11154)
  \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
        VowelDependent}) (697: U+093A..093B,
        U+093E..094C, U+094E..094F,
        U+0955..0957, U+0962..0963, U+09BE..09C4
        ...)
  \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
        VowelIndependent}) (500: U+0904..0914,
        U+0960..0961, U+0972..0977,
        U+0985..098C, U+098F..0990, U+0993..0994
        ...)
  \p{Inherited} \p{Script_Extensions=Inherited} (Short:
        \p{Zinh}) (586)
  \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
        (Short: \p{Pi}) (12)
  \p{InPC: *} \p{Indic_Positional_Category: *}
  \p{InSC: *} \p{Indic_Syllabic_Category: *}
  \p{Inscriptional_Pahlavi} \p{Script_Extensions=
        Inscriptional_Pahlavi} (Short: \p{Phli};
        NOT \p{Block=Inscriptional_Pahlavi}) (27)
  \p{Inscriptional_Parthian} \p{Script_Extensions=
        Inscriptional_Parthian} (Short:
        \p{Prti}; NOT \p{Block=
        Inscriptional_Parthian}) (30)
X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
        IPA_Extensions}) (96)
X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
        \p{InIPAExt}) (96)
  \p{Is_*} \p{*} (Any exceptions are individually
        noted beginning with the word NOT.) If
        an entry has flag(s) at its beginning,
        like "D", the "Is_" form has the same
        flag(s)
  \p{Ital} \p{Old_Italic} (= \p{Script_Extensions=
        Old_Italic}) (NOT \p{Block=Old_Italic})
        (39)
X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
        (256)
X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
        Hangul_Jamo_Extended_A}) (32)
X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
        Hangul_Jamo_Extended_B}) (80)
  \p{Java} \p{Javanese} (= \p{Script_Extensions=
        Javanese}) (NOT \p{Block=Javanese}) (91)
  \p{Javanese} \p{Script_Extensions=Javanese} (Short:
        \p{Java}; NOT \p{Block=Javanese}) (91)
  \p{Jg: *} \p{Joining_Group: *}
  \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
  \p{Join_C: *} \p{Join_Control: *}
  \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
  \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
        plus all above-Unicode code points:
        U+0000..200B, U+200E..infinity)
  \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2:
        U+200C..200D)
  \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
        U+08BB)
  \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
        U+08BD)
  \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (2:
        U+08BC, U+08C4)
  \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (9: U+0639..063A,
        U+06A0, U+06FC, U+075D..075F, U+08B3,
        U+08C3)
  \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
  \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (29: U+0622..0623,
        U+0625, U+0627, U+0671..0673, U+0675,
        U+0773..0774 ...)
  \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (27: U+0628,
        U+062A..062B, U+066E, U+0679..0680,
        U+0750..0756, U+08A0..08A1 ...)
  \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
  \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
        BurushaskiYehBarree}) (2: U+077A..077B)
  \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15: U+062F..0630,
        U+0688..0690, U+06EE, U+0759..075A,
        U+08AE)
  \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
        U+0715..0716, U+072A, U+072F)
  \p{Joining_Group: E} (Short: \p{Jg=E}) (1: U+0725)
  \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
        U+063D..063F, U+06CC, U+06CE,
        U+0775..0776)
  \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1: U+074F)
  \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10: U+0641,
        U+06A1..06A6, U+0760..0761, U+08A4)
  \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
        U+0724)
  \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (17: U+063B..063C,
        U+06A9, U+06AB, U+06AF..06B4,
        U+0762..0764, U+088D ...)
  \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
        U+072E)
  \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (22: U+062C..062E,
        U+0681..0687, U+06BF, U+0757..0758,
        U+076E..076F, U+0772 ...)
  \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
        HamzaOnHehGoal}) (1: U+06C3)
  \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
        HanifiRohingyaKinnaYa}) (4: U+10D19,
        U+10D1E, U+10D20, U+10D23)
  \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
        HanifiRohingyaPa}) (3: U+10D02, U+10D09,
        U+10D1C)
  \p{Joining_Group: He} (Short: \p{Jg=He}) (1: U+0717)
  \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1: U+0647)
  \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
        U+06C1..06C2)
  \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
  \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6: U+0643,
        U+06AC..06AE, U+077F, U+08B4)
  \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
  \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
  \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
        U+06BE, U+06FF)
  \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (8: U+0644,
        U+06B5..06B8, U+076A, U+08A6, U+08C7)
  \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
  \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
        U+0866)
  \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
        U+0861)
  \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
        U+0868)
  \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
        (1: U+0869)
  \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
        U+0860)
  \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
        U+0864)
  \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
        (1: U+0865)
  \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
        U+0862)
  \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
        U+0867)
  \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
        U+086A)
  \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
        U+0863)
  \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
        ManichaeanAleph}) (1: U+10AC0)
  \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
        (2: U+10AD9..10ADA)
  \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
        (2: U+10AC1..10AC2)
  \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
        ManichaeanDaleth}) (1: U+10AC5)
  \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
        ManichaeanDhamedh}) (1: U+10AD4)
  \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
        (1: U+10AEC)
  \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
        ManichaeanGimel}) (2: U+10AC3..10AC4)
  \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
        (1: U+10ACD)
  \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
        ManichaeanHundred}) (1: U+10AEF)
  \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
        (3: U+10AD0..10AD2)
  \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
        ManichaeanLamedh}) (1: U+10AD3)
  \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
        (1: U+10AD6)
  \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
        (1: U+10AD7)
  \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
        (1: U+10AEB)
  \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
        U+10ADB..10ADC)
  \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
        (3: U+10ADE..10AE0)
  \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
        (1: U+10AE1)
  \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
        ManichaeanSadhe}) (1: U+10ADD)
  \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
        ManichaeanSamekh}) (1: U+10AD8)
  \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
        (1: U+10AE4)
  \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
        (1: U+10AED)
  \p{Joining_Group: Manichaean_Teth} (Short:
        \p{Jg=ManichaeanTeth})
        (1: U+10ACE)
  \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
        ManichaeanThamedh}) (1: U+10AD5)
  \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
        ManichaeanTwenty}) (1: U+10AEE)
  \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
        (1: U+10AC7)
  \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
        (1: U+10ACF)
  \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
        ManichaeanZayin}) (2: U+10AC9..10ACA)
  \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
        U+0765..0766, U+08A7)
  \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1: U+0721)
  \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
        (1_113_762 plus all above-Unicode code
        points: U+0000..061F, U+0621, U+0640,
        U+064B..066D, U+0670, U+0674 ...)
  \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (9: U+0646,
        U+06B9..06BC, U+0767..0769, U+0889)
  \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1: U+0722)
  \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1: U+06BD)
  \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1: U+0726)
  \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (6: U+0642, U+066F,
        U+06A7..06A8, U+08A5, U+08B5)
  \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
  \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19: U+0631..0632,
        U+0691..0699, U+06EF, U+075B,
        U+076B..076C, U+0771 ...)
  \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
        U+0727)
  \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
        U+08AC)
  \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6: U+0635..0636,
        U+069D..069E, U+06FB, U+08AF)
  \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
  \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
        U+069A..069C, U+06FA, U+075C, U+076D,
        U+0770 ...)
3684 \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723) 3685 \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B) 3686 \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1: 3687 U+08B1) 3688 \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA) 3689 \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718) 3690 \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (6: U+0637..0638, 3691 U+069F, U+088B..088C, U+08A3) 3692 \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1: U+072C) 3693 \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3: 3694 U+0629, U+06C0, U+06D5) 3695 \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group= 3696 Hamza_On_Heh_Goal} (1) 3697 \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C) 3698 \p{Joining_Group: Thin_Yeh} (Short: \p{Jg=ThinYeh}) (1: U+0886) 3699 \p{Joining_Group: Vertical_Tail} (Short: \p{Jg=VerticalTail}) (1: 3700 U+088E) 3701 \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16: U+0624, U+0648, 3702 U+0676..0677, U+06C4..06CB, U+06CF, 3703 U+0778..0779 ...) 3704 \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11: U+0620, U+0626, 3705 U+0649..064A, U+0678, U+06D0..06D1, 3706 U+0777 ...) 3707 \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2: 3708 U+06D2..06D3) 3709 \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1: 3710 U+06CD) 3711 \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D) 3712 \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E) 3713 \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719) 3714 \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D) 3715 \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (7) 3716 \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (610) 3717 \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (610: U+0620, 3718 U+0626, U+0628, U+062A..062E, 3719 U+0633..063F, U+0641..0647 ...) 
3720 \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (7: U+0640, 3721 U+07FA, U+0883..0885, U+180A, U+200D) 3722 \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (5) 3723 \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (5: U+A872, 3724 U+10ACD, U+10AD7, U+10D00, U+10FCB) 3725 \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_188 plus 3726 all above-Unicode code points: [\x00- 3727 \xac\xae-\xff], U+0100..02FF, 3728 U+0370..0482, U+048A..0590, U+05BE, 3729 U+05C0 ...) 3730 \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (152) 3731 \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (152: 3732 U+0622..0625, U+0627, U+0629, 3733 U+062F..0632, U+0648, U+0671..0673 ...) 3734 \p{Joining_Type: T} \p{Joining_Type=Transparent} (2150) 3735 \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (2150: [\xad], 3736 U+0300..036F, U+0483..0489, 3737 U+0591..05BD, U+05BF, U+05C1..05C2 ...) 3738 \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_188 3739 plus all above-Unicode code points) 3740 \p{Jt: *} \p{Joining_Type: *} 3741 \p{Kaithi} \p{Script_Extensions=Kaithi} (Short: 3742 \p{Kthi}; NOT \p{Block=Kaithi}) (88) 3743 X \p{Kaktovik_Numerals} \p{Block=Kaktovik_Numerals} (32) 3744 \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions= 3745 Kayah_Li}) (48) 3746 \p{Kana} \p{Katakana} (= \p{Script_Extensions= 3747 Katakana}) (NOT \p{Block=Katakana}) (373) 3748 X \p{Kana_Ext_A} \p{Kana_Extended_A} (= \p{Block= 3749 Kana_Extended_A}) (48) 3750 X \p{Kana_Ext_B} \p{Kana_Extended_B} (= \p{Block= 3751 Kana_Extended_B}) (16) 3752 X \p{Kana_Extended_A} \p{Block=Kana_Extended_A} (Short: 3753 \p{InKanaExtA}) (48) 3754 X \p{Kana_Extended_B} \p{Block=Kana_Extended_B} (Short: 3755 \p{InKanaExtB}) (16) 3756 X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block= 3757 Kana_Supplement}) (256) 3758 X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short: 3759 \p{InKanaSup}) (256) 3760 X \p{Kanbun} \p{Block=Kanbun} (16) 3761 X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block= 3762 Kangxi_Radicals}) (224) 3763 X 
\p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short: 3764 \p{InKangxi}) (224) 3765 \p{Kannada} \p{Script_Extensions=Kannada} (Short: 3766 \p{Knda}; NOT \p{Block=Kannada}) (106) 3767 \p{Katakana} \p{Script_Extensions=Katakana} (Short: 3768 \p{Kana}; NOT \p{Block=Katakana}) (373) 3769 X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (= 3770 \p{Block=Katakana_Phonetic_Extensions}) 3771 (16) 3772 X \p{Katakana_Phonetic_Extensions} \p{Block= 3773 Katakana_Phonetic_Extensions} (Short: 3774 \p{InKatakanaExt}) (16) 3775 \p{Kawi} \p{Script_Extensions=Kawi} (NOT \p{Block= 3776 Kawi}) (86) 3777 \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short: 3778 \p{Kali}) (48) 3779 \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions= 3780 Kharoshthi}) (NOT \p{Block=Kharoshthi}) 3781 (68) 3782 \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short: 3783 \p{Khar}; NOT \p{Block=Kharoshthi}) (68) 3784 \p{Khitan_Small_Script} \p{Script_Extensions=Khitan_Small_Script} 3785 (Short: \p{Kits}; NOT \p{Block= 3786 Khitan_Small_Script}) (471) 3787 \p{Khmer} \p{Script_Extensions=Khmer} (Short: 3788 \p{Khmr}; NOT \p{Block=Khmer}) (146) 3789 X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32) 3790 \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer}) 3791 (NOT \p{Block=Khmer}) (146) 3792 \p{Khoj} \p{Khojki} (= \p{Script_Extensions= 3793 Khojki}) (NOT \p{Block=Khojki}) (85) 3794 \p{Khojki} \p{Script_Extensions=Khojki} (Short: 3795 \p{Khoj}; NOT \p{Block=Khojki}) (85) 3796 \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short: 3797 \p{Sind}; NOT \p{Block=Khudawadi}) (81) 3798 \p{Kits} \p{Khitan_Small_Script} (= 3799 \p{Script_Extensions= 3800 Khitan_Small_Script}) (NOT \p{Block= 3801 Khitan_Small_Script}) (471) 3802 \p{Knda} \p{Kannada} (= \p{Script_Extensions= 3803 Kannada}) (NOT \p{Block=Kannada}) (106) 3804 \p{Kthi} \p{Kaithi} (= \p{Script_Extensions= 3805 Kaithi}) (NOT \p{Block=Kaithi}) (88) 3806 \p{L} \pL \p{Letter} (= \p{General_Category=Letter}) 3807 (136_104) 3808 X \p{L&} \p{Cased_Letter} (= 
\p{General_Category= 3809 Cased_Letter}) (4095) 3810 X \p{L_} \p{Cased_Letter} (= \p{General_Category= 3811 Cased_Letter}) Note the trailing '_' 3812 matters in spite of loose matching 3813 rules. (4095) 3814 \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions= 3815 Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127) 3816 \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block= 3817 Lao}) (83) 3818 \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT 3819 \p{Block=Lao}) (83) 3820 \p{Latin} \p{Script_Extensions=Latin} (Short: 3821 \p{Latn}) (1510) 3822 X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block= 3823 Latin_1_Supplement}) (128) 3824 X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block= 3825 Latin_1_Supplement}) (128) 3826 X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short: 3827 \p{InLatin1}) (128) 3828 X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block= 3829 Latin_Extended_A}) (128) 3830 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (= 3831 \p{Block=Latin_Extended_Additional}) 3832 (256) 3833 X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block= 3834 Latin_Extended_B}) (208) 3835 X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block= 3836 Latin_Extended_C}) (32) 3837 X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block= 3838 Latin_Extended_D}) (224) 3839 X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block= 3840 Latin_Extended_E}) (64) 3841 X \p{Latin_Ext_F} \p{Latin_Extended_F} (= \p{Block= 3842 Latin_Extended_F}) (64) 3843 X \p{Latin_Ext_G} \p{Latin_Extended_G} (= \p{Block= 3844 Latin_Extended_G}) (256) 3845 X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short: 3846 \p{InLatinExtA}) (128) 3847 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional} 3848 (Short: \p{InLatinExtAdditional}) (256) 3849 X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short: 3850 \p{InLatinExtB}) (208) 3851 X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short: 3852 \p{InLatinExtC}) (32) 3853 X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short: 3854 \p{InLatinExtD}) 
(224) 3855 X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short: 3856 \p{InLatinExtE}) (64) 3857 X \p{Latin_Extended_F} \p{Block=Latin_Extended_F} (Short: 3858 \p{InLatinExtF}) (64) 3859 X \p{Latin_Extended_G} \p{Block=Latin_Extended_G} (Short: 3860 \p{InLatinExtG}) (256) 3861 \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin}) 3862 (1510) 3863 \p{Lb: *} \p{Line_Break: *} 3864 \p{LC} \p{Cased_Letter} (= \p{General_Category= 3865 Cased_Letter}) (4095) 3866 \p{Lepc} \p{Lepcha} (= \p{Script_Extensions= 3867 Lepcha}) (NOT \p{Block=Lepcha}) (74) 3868 \p{Lepcha} \p{Script_Extensions=Lepcha} (Short: 3869 \p{Lepc}; NOT \p{Block=Lepcha}) (74) 3870 \p{Letter} \p{General_Category=Letter} (Short: \p{L}) 3871 (136_104) 3872 \p{Letter_Number} \p{General_Category=Letter_Number} (Short: 3873 \p{Nl}) (236) 3874 X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80) 3875 \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu}) 3876 (NOT \p{Block=Limbu}) (69) 3877 \p{Limbu} \p{Script_Extensions=Limbu} (Short: 3878 \p{Limb}; NOT \p{Block=Limbu}) (69) 3879 \p{Lina} \p{Linear_A} (= \p{Script_Extensions= 3880 Linear_A}) (NOT \p{Block=Linear_A}) (386) 3881 \p{Linb} \p{Linear_B} (= \p{Script_Extensions= 3882 Linear_B}) (268) 3883 \p{Line_Break: AI} \p{Line_Break=Ambiguous} (707) 3884 \p{Line_Break: AL} \p{Line_Break=Alphabetic} (22_215) 3885 \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (22_215: [#&*<=>\@A- 3886 Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0- 3887 \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6, 3888 U+02CE..02CF, U+02D1..02D7, U+02DC, 3889 U+02DE ...) 3890 \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-\xa8\xaa 3891 \xb2-\xb3\xb6-\xba\xbc-\xbe\xd7\xf7], 3892 U+02C7, U+02C9..02CB, U+02CD, U+02D0, 3893 U+02D8..02DB ...) 
3894 \p{Line_Break: B2} \p{Line_Break=Break_Both} (3) 3895 \p{Line_Break: BA} \p{Line_Break=Break_After} (249) 3896 \p{Line_Break: BB} \p{Line_Break=Break_Before} (55) 3897 \p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4) 3898 \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (249: [\t\|\xad], 3899 U+058A, U+05BE, U+0964..0965, 3900 U+0E5A..0E5B, U+0F0B ...) 3901 \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (55: [\xb4], 3902 U+02C8, U+02CC, U+02DF, U+0C77, U+0C84 3903 ...) 3904 \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014, 3905 U+2E3A..2E3B) 3906 \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/]) 3907 \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r]) 3908 \p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1) 3909 \p{Line_Break: CJ} \p{Line_Break= 3910 Conditional_Japanese_Starter} (60) 3911 \p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (97) 3912 \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]]) 3913 \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (97: [\}], 3914 U+0F3B, U+0F3D, U+169C, U+2046, U+207E 3915 ...) 3916 \p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2438) 3917 \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2438: [^\t\n 3918 \cK\f\r\x20-\x7e\x85\xa0-\xff], 3919 U+0300..034E, U+0350..035B, 3920 U+0363..036F, U+0483..0489, U+0591..05BD 3921 ...) 3922 \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (758: 3923 U+0E01..0E3A, U+0E40..0E4E, 3924 U+0E81..0E82, U+0E84, U+0E86..0E8A, 3925 U+0E8C..0EA3 ...) 3926 \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ}) 3927 (60: U+3041, U+3043, U+3045, U+3047, 3928 U+3049, U+3063 ...) 3929 \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC) 3930 \p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2) 3931 \p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1) 3932 \p{Line_Break: E_Base} (Short: \p{Lb=EB}) (134: U+261D, U+26F9, 3933 U+270A..270D, U+1F385, U+1F3C2..1F3C4, 3934 U+1F3C7 ...) 
3935 \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF) 3936 \p{Line_Break: EB} \p{Line_Break=E_Base} (134) 3937 \p{Line_Break: EM} \p{Line_Break=E_Modifier} (5) 3938 \p{Line_Break: EX} \p{Line_Break=Exclamation} (40) 3939 \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (40: [!?], U+05C6, 3940 U+061B, U+061D..061F, U+06D4, U+07F9 ...) 3941 \p{Line_Break: GL} \p{Line_Break=Glue} (31) 3942 \p{Line_Break: Glue} (Short: \p{Lb=GL}) (31: [\xa0], U+034F, 3943 U+035C..0362, U+0F08, U+0F0C, U+0F12 ...) 3944 \p{Line_Break: H2} (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C, 3945 U+AC38, U+AC54, U+AC70, U+AC8C ...) 3946 \p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B, 3947 U+AC1D..AC37, U+AC39..AC53, 3948 U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7 3949 ...) 3950 \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75: 3951 U+05D0..05EA, U+05EF..05F2, U+FB1D, 3952 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C 3953 ...) 3954 \p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (75) 3955 \p{Line_Break: HY} \p{Line_Break=Hyphen} (1) 3956 \p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1: [\-]) 3957 \p{Line_Break: ID} \p{Line_Break=Ideographic} (172_465) 3958 \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_465: 3959 U+231A..231B, U+23F0..23F3, 3960 U+2600..2603, U+2614..2615, U+2618, 3961 U+261A..261C ...) 3962 \p{Line_Break: IN} \p{Line_Break=Inseparable} (6) 3963 \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;], 3964 U+037E, U+0589, U+060C..060D, U+07F8, 3965 U+2044 ...) 
3966 \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026, 3967 U+22EF, U+FE19, U+10AF6) 3968 \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6) 3969 \p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13) 3970 \p{Line_Break: JL} (Short: \p{Lb=JL}) (125: U+1100..115F, 3971 U+A960..A97C) 3972 \p{Line_Break: JT} (Short: \p{Lb=JT}) (137: U+11A8..11FF, 3973 U+D7CB..D7FB) 3974 \p{Line_Break: JV} (Short: \p{Lb=JV}) (95: U+1160..11A7, 3975 U+D7B0..D7C6) 3976 \p{Line_Break: LF} \p{Line_Break=Line_Feed} (1) 3977 \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n]) 3978 \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f], 3979 U+2028..2029) 3980 \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85]) 3981 \p{Line_Break: NL} \p{Line_Break=Next_Line} (1) 3982 \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6, 3983 U+203C..203D, U+2047..2049, U+3005, 3984 U+301C, U+303B..303C ...) 3985 \p{Line_Break: NS} \p{Line_Break=Nonstarter} (33) 3986 \p{Line_Break: NU} \p{Line_Break=Numeric} (672) 3987 \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (672: [0-9], 3988 U+0660..0669, U+066B..066C, 3989 U+06F0..06F9, U+07C0..07C9, U+0966..096F 3990 ...) 3991 \p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (94) 3992 \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (94: [\(\[\{ 3993 \xa1\xbf], U+0F3A, U+0F3C, U+169B, 3994 U+201A, U+201E ...) 3995 \p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (38) 3996 \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (38: [\%\xa2 3997 \xb0], U+0609..060B, U+066A, 3998 U+09F2..09F3, U+09F9, U+0D79 ...) 3999 \p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (67) 4000 \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (67: [\$+\\\xa3- 4001 \xa5\xb1], U+058F, U+07FE..07FF, U+09FB, 4002 U+0AF1, U+0BF9 ...) 4003 \p{Line_Break: QU} \p{Line_Break=Quotation} (39) 4004 \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb], 4005 U+2018..2019, U+201B..201D, U+201F, 4006 U+2039..203A, U+275B..2760 ...) 
4007 \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26: 4008 U+1F1E6..1F1FF) 4009 \p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26) 4010 \p{Line_Break: SA} \p{Line_Break=Complex_Context} (758) 4011 D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048) 4012 \p{Line_Break: SP} \p{Line_Break=Space} (1) 4013 \p{Line_Break: Space} (Short: \p{Lb=SP}) (1: [\x20]) 4014 D \p{Line_Break: Surrogate} Surrogates should never appear in well- 4015 formed text, and therefore shouldn't be 4016 the basis for line breaking (Short: 4017 \p{Lb=SG}) (2048: U+D800..DFFF) 4018 \p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1) 4019 \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (900_198 plus all 4020 above-Unicode code points: U+0378..0379, 4021 U+0380..0383, U+038B, U+038D, U+03A2, 4022 U+0530 ...) 4023 \p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2) 4024 \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF) 4025 \p{Line_Break: XX} \p{Line_Break=Unknown} (900_198 plus all 4026 above-Unicode code points) 4027 \p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1) 4028 \p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1: U+200D) 4029 \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B) 4030 \p{Line_Separator} \p{General_Category=Line_Separator} 4031 (Short: \p{Zl}) (1) 4032 \p{Linear_A} \p{Script_Extensions=Linear_A} (Short: 4033 \p{Lina}; NOT \p{Block=Linear_A}) (386) 4034 \p{Linear_B} \p{Script_Extensions=Linear_B} (Short: 4035 \p{Linb}) (268) 4036 X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128) 4037 X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128) 4038 \p{Lisu} \p{Script_Extensions=Lisu} (NOT \p{Block= 4039 Lisu}) (49) 4040 X \p{Lisu_Sup} \p{Lisu_Supplement} (= \p{Block= 4041 Lisu_Supplement}) (16) 4042 X \p{Lisu_Supplement} \p{Block=Lisu_Supplement} (Short: 4043 \p{InLisuSup}) (16) 4044 \p{Ll} \p{Lowercase_Letter} (= 4045 \p{General_Category=Lowercase_Letter}) 4046 (/i= General_Category=Cased_Letter) 4047 (2233) 4048 \p{Lm} \p{Modifier_Letter} (= 4049 
\p{General_Category=Modifier_Letter}) 4050 (397) 4051 \p{Lo} \p{Other_Letter} (= \p{General_Category= 4052 Other_Letter}) (131_612) 4053 \p{LOE} \p{Logical_Order_Exception} (= 4054 \p{Logical_Order_Exception=Y}) (19) 4055 \p{LOE: *} \p{Logical_Order_Exception: *} 4056 \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short: 4057 \p{LOE}) (19) 4058 \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE}) 4059 (1_114_093 plus all above-Unicode code 4060 points: U+0000..0E3F, U+0E45..0EBF, 4061 U+0EC5..19B4, U+19B8..19B9, 4062 U+19BB..AAB4, U+AAB7..AAB8 ...) 4063 \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19: 4064 U+0E40..0E44, U+0EC0..0EC4, 4065 U+19B5..19B7, U+19BA, U+AAB5..AAB6, 4066 U+AAB9 ...) 4067 X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024) 4068 \p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i= 4069 Cased=Yes) (2544) 4070 \p{Lower: *} \p{Lowercase: *} 4071 \p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i= 4072 Cased=Yes) (2544) 4073 \p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased= 4074 No) (1_111_568 plus all above-Unicode 4075 code points: [\x00-\x20!\"#\$\%&\' 4076 \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`\{ 4077 \|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9\xbb- 4078 \xde\xf7], U+0100, U+0102, U+0104, 4079 U+0106, U+0108 ...) 4080 \p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased= 4081 Yes) (2544: [a-z\xaa\xb5\xba\xdf-\xf6 4082 \xf8-\xff], U+0101, U+0103, U+0105, 4083 U+0107, U+0109 ...) 
4084 \p{Lowercase_Letter} \p{General_Category=Lowercase_Letter} 4085 (Short: \p{Ll}; /i= General_Category= 4086 Cased_Letter) (2233) 4087 \p{Lt} \p{Titlecase_Letter} (= 4088 \p{General_Category=Titlecase_Letter}) 4089 (/i= General_Category=Cased_Letter) (31) 4090 \p{Lu} \p{Uppercase_Letter} (= 4091 \p{General_Category=Uppercase_Letter}) 4092 (/i= General_Category=Cased_Letter) 4093 (1831) 4094 \p{Lyci} \p{Lycian} (= \p{Script_Extensions= 4095 Lycian}) (NOT \p{Block=Lycian}) (29) 4096 \p{Lycian} \p{Script_Extensions=Lycian} (Short: 4097 \p{Lyci}; NOT \p{Block=Lycian}) (29) 4098 \p{Lydi} \p{Lydian} (= \p{Script_Extensions= 4099 Lydian}) (NOT \p{Block=Lydian}) (27) 4100 \p{Lydian} \p{Script_Extensions=Lydian} (Short: 4101 \p{Lydi}; NOT \p{Block=Lydian}) (27) 4102 \p{M} \pM \p{Mark} (= \p{General_Category=Mark}) 4103 (2450) 4104 \p{Mahajani} \p{Script_Extensions=Mahajani} (Short: 4105 \p{Mahj}; NOT \p{Block=Mahajani}) (61) 4106 \p{Mahj} \p{Mahajani} (= \p{Script_Extensions= 4107 Mahajani}) (NOT \p{Block=Mahajani}) (61) 4108 X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block= 4109 Mahjong_Tiles}) (48) 4110 X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short: 4111 \p{InMahjong}) (48) 4112 \p{Maka} \p{Makasar} (= \p{Script_Extensions= 4113 Makasar}) (NOT \p{Block=Makasar}) (25) 4114 \p{Makasar} \p{Script_Extensions=Makasar} (Short: 4115 \p{Maka}; NOT \p{Block=Makasar}) (25) 4116 \p{Malayalam} \p{Script_Extensions=Malayalam} (Short: 4117 \p{Mlym}; NOT \p{Block=Malayalam}) (126) 4118 \p{Mand} \p{Mandaic} (= \p{Script_Extensions= 4119 Mandaic}) (NOT \p{Block=Mandaic}) (30) 4120 \p{Mandaic} \p{Script_Extensions=Mandaic} (Short: 4121 \p{Mand}; NOT \p{Block=Mandaic}) (30) 4122 \p{Mani} \p{Manichaean} (= \p{Script_Extensions= 4123 Manichaean}) (NOT \p{Block=Manichaean}) 4124 (52) 4125 \p{Manichaean} \p{Script_Extensions=Manichaean} (Short: 4126 \p{Mani}; NOT \p{Block=Manichaean}) (52) 4127 \p{Marc} \p{Marchen} (= \p{Script_Extensions= 4128 Marchen}) (NOT \p{Block=Marchen}) (68) 4129 
\p{Marchen} \p{Script_Extensions=Marchen} (Short: 4130 \p{Marc}; NOT \p{Block=Marchen}) (68) 4131 \p{Mark} \p{General_Category=Mark} (Short: \p{M}) 4132 (2450) 4133 \p{Masaram_Gondi} \p{Script_Extensions=Masaram_Gondi} 4134 (Short: \p{Gonm}; NOT \p{Block= 4135 Masaram_Gondi}) (77) 4136 \p{Math} \p{Math=Y} (2310) 4137 \p{Math: N*} (Single: \P{Math}) (1_111_802 plus all 4138 above-Unicode code points: [\x00-\x20! 4139 \"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-Z 4140 \[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2- 4141 \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF, 4142 U+03D3..03D4, U+03D6..03EF, 4143 U+03F2..03F3, U+03F7..0605 ...) 4144 \p{Math: Y*} (Single: \p{Math}) (2310: [+<=>\^\|~\xac 4145 \xb1\xd7\xf7], U+03D0..03D2, U+03D5, 4146 U+03F0..03F1, U+03F4..03F6, U+0606..0608 4147 ...) 4148 X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (= 4149 \p{Block= 4150 Mathematical_Alphanumeric_Symbols}) 4151 (1024) 4152 X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block= 4153 Mathematical_Operators}) (256) 4154 \p{Math_Symbol} \p{General_Category=Math_Symbol} (Short: 4155 \p{Sm}) (948) 4156 X \p{Mathematical_Alphanumeric_Symbols} \p{Block= 4157 Mathematical_Alphanumeric_Symbols} 4158 (Short: \p{InMathAlphanum}) (1024) 4159 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators} 4160 (Short: \p{InMathOperators}) (256) 4161 X \p{Mayan_Numerals} \p{Block=Mayan_Numerals} (32) 4162 \p{Mc} \p{Spacing_Mark} (= \p{General_Category= 4163 Spacing_Mark}) (452) 4164 \p{Me} \p{Enclosing_Mark} (= \p{General_Category= 4165 Enclosing_Mark}) (13) 4166 \p{Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short: 4167 \p{Medf}; NOT \p{Block=Medefaidrin}) (91) 4168 \p{Medf} \p{Medefaidrin} (= \p{Script_Extensions= 4169 Medefaidrin}) (NOT \p{Block= 4170 Medefaidrin}) (91) 4171 \p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short: 4172 \p{Mtei}; NOT \p{Block=Meetei_Mayek}) 4173 (79) 4174 X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block= 4175 Meetei_Mayek_Extensions}) (32) 4176 X 
\p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions} 4177 (Short: \p{InMeeteiMayekExt}) (32) 4178 \p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions= 4179 Mende_Kikakui}) (NOT \p{Block= 4180 Mende_Kikakui}) (213) 4181 \p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui} 4182 (Short: \p{Mend}; NOT \p{Block= 4183 Mende_Kikakui}) (213) 4184 \p{Merc} \p{Meroitic_Cursive} (= 4185 \p{Script_Extensions=Meroitic_Cursive}) 4186 (NOT \p{Block=Meroitic_Cursive}) (90) 4187 \p{Mero} \p{Meroitic_Hieroglyphs} (= 4188 \p{Script_Extensions= 4189 Meroitic_Hieroglyphs}) (32) 4190 \p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive} 4191 (Short: \p{Merc}; NOT \p{Block= 4192 Meroitic_Cursive}) (90) 4193 \p{Meroitic_Hieroglyphs} \p{Script_Extensions= 4194 Meroitic_Hieroglyphs} (Short: \p{Mero}) 4195 (32) 4196 \p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block= 4197 Miao}) (149) 4198 X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (= 4199 \p{Block= 4200 Miscellaneous_Symbols_And_Arrows}) (256) 4201 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A} 4202 (= \p{Block= 4203 Miscellaneous_Mathematical_Symbols_A}) 4204 (48) 4205 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B} 4206 (= \p{Block= 4207 Miscellaneous_Mathematical_Symbols_B}) 4208 (128) 4209 X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs} 4210 (= \p{Block= 4211 Miscellaneous_Symbols_And_Pictographs}) 4212 (768) 4213 X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block= 4214 Miscellaneous_Symbols}) (256) 4215 X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block= 4216 Miscellaneous_Technical}) (256) 4217 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block= 4218 Miscellaneous_Mathematical_Symbols_A} 4219 (Short: \p{InMiscMathSymbolsA}) (48) 4220 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block= 4221 Miscellaneous_Mathematical_Symbols_B} 4222 (Short: \p{InMiscMathSymbolsB}) (128) 4223 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} 
(Short: 4224 \p{InMiscSymbols}) (256) 4225 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block= 4226 Miscellaneous_Symbols_And_Arrows} 4227 (Short: \p{InMiscArrows}) (256) 4228 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block= 4229 Miscellaneous_Symbols_And_Pictographs} 4230 (Short: \p{InMiscPictographs}) (768) 4231 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical} 4232 (Short: \p{InMiscTechnical}) (256) 4233 \p{Mlym} \p{Malayalam} (= \p{Script_Extensions= 4234 Malayalam}) (NOT \p{Block=Malayalam}) 4235 (126) 4236 \p{Mn} \p{Nonspacing_Mark} (= 4237 \p{General_Category=Nonspacing_Mark}) 4238 (1985) 4239 \p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block= 4240 Modi}) (89) 4241 \p{Modifier_Letter} \p{General_Category=Modifier_Letter} 4242 (Short: \p{Lm}) (397) 4243 X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block= 4244 Spacing_Modifier_Letters}) (80) 4245 \p{Modifier_Symbol} \p{General_Category=Modifier_Symbol} 4246 (Short: \p{Sk}) (125) 4247 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32) 4248 \p{Mong} \p{Mongolian} (= \p{Script_Extensions= 4249 Mongolian}) (NOT \p{Block=Mongolian}) 4250 (172) 4251 \p{Mongolian} \p{Script_Extensions=Mongolian} (Short: 4252 \p{Mong}; NOT \p{Block=Mongolian}) (172) 4253 X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block= 4254 Mongolian_Supplement}) (32) 4255 X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short: 4256 \p{InMongolianSup}) (32) 4257 \p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block= 4258 Mro}) (43) 4259 \p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT 4260 \p{Block=Mro}) (43) 4261 \p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions= 4262 Meetei_Mayek}) (NOT \p{Block= 4263 Meetei_Mayek}) (79) 4264 \p{Mult} \p{Multani} (= \p{Script_Extensions= 4265 Multani}) (NOT \p{Block=Multani}) (48) 4266 \p{Multani} \p{Script_Extensions=Multani} (Short: 4267 \p{Mult}; NOT \p{Block=Multani}) (48) 4268 X \p{Music} \p{Musical_Symbols} (= \p{Block= 4269 Musical_Symbols}) (256) 
4270 X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short: 4271 \p{InMusic}) (256) 4272 \p{Myanmar} \p{Script_Extensions=Myanmar} (Short: 4273 \p{Mymr}; NOT \p{Block=Myanmar}) (224) 4274 X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block= 4275 Myanmar_Extended_A}) (32) 4276 X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block= 4277 Myanmar_Extended_B}) (32) 4278 X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short: 4279 \p{InMyanmarExtA}) (32) 4280 X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short: 4281 \p{InMyanmarExtB}) (32) 4282 \p{Mymr} \p{Myanmar} (= \p{Script_Extensions= 4283 Myanmar}) (NOT \p{Block=Myanmar}) (224) 4284 \p{N} \pN \p{Number} (= \p{General_Category=Number}) 4285 (1831) 4286 \p{Na=*} \p{Name=*} 4287 \p{Nabataean} \p{Script_Extensions=Nabataean} (Short: 4288 \p{Nbat}; NOT \p{Block=Nabataean}) (40) 4289 \p{Nag_Mundari} \p{Script_Extensions=Nag_Mundari} (Short: 4290 \p{Nagm}; NOT \p{Block=Nag_Mundari}) (42) 4291 \p{Nagm} \p{Nag_Mundari} (= \p{Script_Extensions= 4292 Nag_Mundari}) (NOT \p{Block= 4293 Nag_Mundari}) (42) 4294 \p{Name=*} Combination of Name and Name_Alias 4295 properties; has special loose matching 4296 rules, for which see Unicode UAX #44 4297 \p{Nand} \p{Nandinagari} (= \p{Script_Extensions= 4298 Nandinagari}) (NOT \p{Block= 4299 Nandinagari}) (86) 4300 \p{Nandinagari} \p{Script_Extensions=Nandinagari} (Short: 4301 \p{Nand}; NOT \p{Block=Nandinagari}) (86) 4302 \p{Narb} \p{Old_North_Arabian} (= 4303 \p{Script_Extensions=Old_North_Arabian}) 4304 (32) 4305 X \p{NB} \p{No_Block} (= \p{Block=No_Block}) 4306 (820_944 plus all above-Unicode code 4307 points) 4308 \p{Nbat} \p{Nabataean} (= \p{Script_Extensions= 4309 Nabataean}) (NOT \p{Block=Nabataean}) 4310 (40) 4311 \p{NChar} \p{Noncharacter_Code_Point} (= 4312 \p{Noncharacter_Code_Point=Y}) (66) 4313 \p{NChar: *} \p{Noncharacter_Code_Point: *} 4314 \p{Nd} \p{XPosixDigit} (= \p{General_Category= 4315 Decimal_Number}) (680) 4316 \p{New_Tai_Lue} 
\p{Script_Extensions=New_Tai_Lue} (Short: 4317 \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83) 4318 \p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block= 4319 Newa}) (97) 4320 \p{NFC_QC: *} \p{NFC_Quick_Check: *} 4321 \p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (111) 4322 \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (111: 4323 U+0300..0304, U+0306..030C, U+030F, 4324 U+0311, U+0313..0314, U+031B ...) 4325 \p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT 4326 \P{NFC_Quick_Check} NOR \P{NFC_QC}) 4327 (1120) 4328 \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT 4329 \P{NFC_Quick_Check} NOR \P{NFC_QC}) 4330 (1120: U+0340..0341, U+0343..0344, 4331 U+0374, U+037E, U+0387, U+0958..095F ...) 4332 \p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT 4333 \p{NFC_Quick_Check} NOR \p{NFC_QC}) 4334 (1_112_881 plus all above-Unicode code 4335 points) 4336 \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT 4337 \p{NFC_Quick_Check} NOR \p{NFC_QC}) 4338 (1_112_881 plus all above-Unicode code 4339 points: U+0000..02FF, U+0305, 4340 U+030D..030E, U+0310, U+0312, 4341 U+0315..031A ...) 4342 \p{NFD_QC: *} \p{NFD_Quick_Check: *} 4343 \p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT 4344 \P{NFD_Quick_Check} NOR \P{NFD_QC}) 4345 (13_233) 4346 \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT 4347 \P{NFD_Quick_Check} NOR \P{NFD_QC}) 4348 (13_233: [\xc0-\xc5\xc7-\xcf\xd1-\xd6 4349 \xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-\xf6 4350 \xf9-\xfd\xff], U+0100..010F, 4351 U+0112..0125, U+0128..0130, 4352 U+0134..0137, U+0139..013E ...) 4353 \p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT 4354 \p{NFD_Quick_Check} NOR \p{NFD_QC}) 4355 (1_100_879 plus all above-Unicode code 4356 points) 4357 \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT 4358 \p{NFD_Quick_Check} NOR \p{NFD_QC}) 4359 (1_100_879 plus all above-Unicode code 4360 points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde- 4361 \xdf\xe6\xf0\xf7-\xf8\xfe], 4362 U+0110..0111, U+0126..0127, 4363 U+0131..0133, U+0138, U+013F..0142 ...) 
4364 \p{NFKC_QC: *} \p{NFKC_Quick_Check: *} 4365 \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (111) 4366 \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (111: 4367 U+0300..0304, U+0306..030C, U+030F, 4368 U+0311, U+0313..0314, U+031B ...) 4369 \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT 4370 \P{NFKC_Quick_Check} NOR \P{NFKC_QC}) 4371 (4928) 4372 \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT 4373 \P{NFKC_Quick_Check} NOR \P{NFKC_QC}) 4374 (4928: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8- 4375 \xba\xbc-\xbe], U+0132..0133, 4376 U+013F..0140, U+0149, U+017F, 4377 U+01C4..01CC ...) 4378 \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT 4379 \p{NFKC_Quick_Check} NOR \p{NFKC_QC}) 4380 (1_109_073 plus all above-Unicode code 4381 points) 4382 \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT 4383 \p{NFKC_Quick_Check} NOR \p{NFKC_QC}) 4384 (1_109_073 plus all above-Unicode code 4385 points: [\x00-\x9f\xa1-\xa7\xa9\xab- 4386 \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff], 4387 U+0100..0131, U+0134..013E, 4388 U+0141..0148, U+014A..017E, U+0180..01C3 4389 ...) 4390 \p{NFKD_QC: *} \p{NFKD_Quick_Check: *} 4391 \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT 4392 \P{NFKD_Quick_Check} NOR \P{NFKD_QC}) 4393 (17_029) 4394 \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT 4395 \P{NFKD_Quick_Check} NOR \P{NFKD_QC}) 4396 (17_029: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8- 4397 \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1- 4398 \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1- 4399 \xf6\xf9-\xfd\xff], U+0100..010F, 4400 U+0112..0125, U+0128..0130, 4401 U+0132..0137, U+0139..0140 ...) 
  \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
                            \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
                            (1_097_083 plus all above-Unicode code
                            points)
  \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
                            \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
                            (1_097_083 plus all above-Unicode code
                            points: [\x00-\x9f\xa1-\xa7\xa9\xab-
                            \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xc6\xd0
                            \xd7-\xd8\xde-\xdf\xe6\xf0\xf7-\xf8
                            \xfe], U+0110..0111, U+0126..0127,
                            U+0131, U+0138, U+0141..0142 ...)
  \p{Nko}                 \p{Script_Extensions=Nko} (NOT \p{Block=
                            NKo}) (67)
  \p{Nkoo}                \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
                            \p{Block=NKo}) (67)
  \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
                            Letter_Number}) (236)
  \p{No}                  \p{Other_Number} (= \p{General_Category=
                            Other_Number}) (915)
X \p{No_Block}            \p{Block=No_Block} (Short: \p{InNB})
                            (820_944 plus all above-Unicode code
                            points)
  \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
                            \p{NChar}) (66)
  \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
                            (1_114_046 plus all above-Unicode code
                            points: U+0000..FDCF, U+FDF0..FFFD,
                            U+10000..1FFFD, U+20000..2FFFD,
                            U+30000..3FFFD, U+40000..4FFFD ...)
  \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
                            (66: U+FDD0..FDEF, U+FFFE..FFFF,
                            U+1FFFE..1FFFF, U+2FFFE..2FFFF,
                            U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
  \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
                            (Short: \p{Mn}) (1985)
  \p{Nshu}                \p{Nushu} (= \p{Script_Extensions=Nushu})
                            (NOT \p{Block=Nushu}) (397)
  \p{Nt: *}               \p{Numeric_Type: *}
  \p{Number}              \p{General_Category=Number} (Short: \p{N})
                            (1831)
X \p{Number_Forms}        \p{Block=Number_Forms} (64)
  \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (680)
  \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (680: [0-9],
                            U+0660..0669, U+06F0..06F9,
                            U+07C0..07C9, U+0966..096F, U+09E6..09EF
                            ...)
  \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (128)
  \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
                            U+1369..1371, U+19DA, U+2070,
                            U+2074..2079, U+2080..2089 ...)
  \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_200 plus all
                            above-Unicode code points: [\x00-\x20!
                            \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
                            \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
                            \xbb\xbf-\xff], U+0100..065F,
                            U+066A..06EF, U+06FA..07BF,
                            U+07CA..0965, U+0970..09E5 ...)
  \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (1104)
  \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1104: [\xbc-\xbe],
                            U+09F4..09F9, U+0B72..0B77,
                            U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
                            ...)
T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (87: [0], U+0660,
                            U+06F0, U+07C0, U+0966, U+09E6 ...)
T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
                            U+11FD4)
T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
                            U+0D76, U+A833, U+11FC9..11FCA)
T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
                            U+11FCB)
T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1: U+2151)
T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
                            U+0D77, U+215B, U+A834, U+11FCC ...)
T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1: U+2150)
T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
                            U+12461, U+1ED3D)
T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
                            U+0D78, U+A835, U+11FCE)
T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
                            U+11FCF)
T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
                            U+0B72, U+0D73, U+A830, U+10140 ...)
T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
                            U+10E7D, U+1245A, U+1245D, U+12465)
T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1: U+215C)
T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1: U+2156)
T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
                            U+0D74, U+0F2A, U+2CFD, U+A831 ...)
T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1: U+2157)
T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1: U+215D)
T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
                            U+109FD, U+10E7E, U+1245B, U+1245E ...)
T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
                            U+0B74, U+0D75, U+A832, U+10178 ...)
T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1: U+2158)
T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
                            U+1245C)
T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1: U+215E)
T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (144: [1\xb9], U+0661,
                            U+06F1, U+07C1, U+0967, U+09E7 ...)
T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1: U+0F2B)
T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (143: [2\xb2], U+0662,
                            U+06F2, U+07C2, U+0968, U+09E8 ...)
T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1: U+0F2C)
T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (144: [3\xb3], U+0663,
                            U+06F3, U+07C3, U+0969, U+09E9 ...)
T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1: U+0F2D)
T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (135: [4], U+0664,
                            U+06F4, U+07C4, U+096A, U+09EA ...)
T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1: U+0F2E)
T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (133: [5], U+0665,
                            U+06F5, U+07C5, U+096B, U+09EB ...)
T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (117: [6], U+0666,
                            U+06F6, U+07C6, U+096C, U+09EC ...)
T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (116: [7], U+0667,
                            U+06F7, U+07C7, U+096D, U+09ED ...)
T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (112: [8], U+0668,
                            U+06F8, U+07C8, U+096E, U+09EE ...)
T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (116: [9], U+0669,
                            U+06F9, U+07C9, U+096F, U+09EF ...)
T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (63: U+0BF0, U+0D70,
                            U+1372, U+2169, U+2179, U+2469 ...)
T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (9: U+216A, U+217A,
                            U+246A, U+247E, U+2492, U+24EB ...)
T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (9: U+216B, U+217B,
                            U+246B, U+247F, U+2493, U+24EC ...)
T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (7: U+246C, U+2480,
                            U+2494, U+24ED, U+16E8D, U+1D2CD ...)
T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (7: U+246D, U+2481,
                            U+2495, U+24EE, U+16E8E, U+1D2CE ...)
T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (7: U+246E, U+2482,
                            U+2496, U+24EF, U+16E8F, U+1D2CF ...)
T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (8: U+09F9, U+246F,
                            U+2483, U+2497, U+24F0, U+16E90 ...)
T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (8: U+16EE, U+2470,
                            U+2484, U+2498, U+24F1, U+16E91 ...)
T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (8: U+16EF, U+2471,
                            U+2485, U+2499, U+24F2, U+16E92 ...)
T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (8: U+16F0, U+2472,
                            U+2486, U+249A, U+24F3, U+16E93 ...)
T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (36: U+1373, U+2473,
                            U+2487, U+249B, U+24F4, U+3039 ...)
T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1: U+3251)
T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1: U+3252)
T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1: U+3253)
T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1: U+3254)
T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1: U+3255)
T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1: U+3256)
T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1: U+3257)
T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1: U+3258)
T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1: U+3259)
T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (19: U+1374, U+303A,
                            U+324A, U+325A, U+5345, U+10112 ...)
T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1: U+325B)
T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1: U+325C)
T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1: U+325D)
T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1: U+325E)
T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1: U+325F)
T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1: U+32B1)
T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1: U+32B2)
T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1: U+32B3)
T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1: U+32B4)
T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (18: U+1375, U+324B,
                            U+32B5, U+534C, U+10113, U+102ED ...)
T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1: U+32B6)
T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1: U+32B7)
T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1: U+32B8)
T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1: U+32B9)
T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1: U+32BA)
T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1: U+32BB)
T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1: U+32BC)
T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1: U+32BD)
T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1: U+32BE)
T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (29: U+1376, U+216C,
                            U+217C, U+2186, U+324C, U+32BF ...)
T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (13: U+1377, U+324D,
                            U+10115, U+102EF, U+109CE, U+10E6E ...)
T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (13: U+1378, U+324E,
                            U+10116, U+102F0, U+109CF, U+10E6F ...)
T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (12: U+1379, U+324F,
                            U+10117, U+102F1, U+10E70, U+11062 ...)
T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (12: U+137A, U+10118,
                            U+102F2, U+10341, U+10E71, U+11063 ...)
T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (35: U+0BF1, U+0D71,
                            U+137B, U+216D, U+217D, U+4F70 ...)
T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
                            U+109D3, U+10E73, U+1EC84, U+1ED14)
T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
                            U+102F5, U+109D4, U+10E74, U+1EC85 ...)
T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
                            U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (16: U+216E, U+217E,
                            U+1011D, U+10145, U+1014C, U+10153 ...)
T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
                            U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
                            U+109D8, U+10E78, U+1EC89, U+1ED19)
T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (6: U+10120, U+102FA,
                            U+109D9, U+10E79, U+1EC8A, U+1ED1A)
T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (7: U+10121, U+102FB,
                            U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
                            U+216F, U+217F..2180, U+4EDF, U+5343 ...)
T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
                            U+1EC8D, U+1ED1D, U+1ED3A)
T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
                            U+1EC8E, U+1ED1E)
T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
                            U+1EC8F, U+1ED1F)
T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
                            U+10146, U+1014E, U+10172, U+109DF ...)
T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
                            U+1EC91, U+1ED21)
T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
                            U+1EC92, U+1ED22)
T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
                            U+1EC93, U+1ED23)
T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
                            U+1EC94, U+1ED24)
T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
                            U+137C, U+2182, U+4E07, U+842C, U+1012B,
                            U+10155 ...)
T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
                            U+1012C, U+109E5, U+1EC96, U+1ED26)
T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
                            U+1012D, U+109E6, U+1EC97, U+1ED27)
T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
                            U+1012E, U+109E7, U+1EC98, U+1ED28)
T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
                            U+2187, U+1012F, U+10147, U+10156,
                            U+109E8, U+1EC99 ...)
T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
                            U+10130, U+109E9, U+1EC9A, U+1ED2A)
T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
                            U+10131, U+109EA, U+1EC9B, U+1ED2B)
T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
                            U+10132, U+109EB, U+1EC9C, U+1ED2C)
T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
                            U+10133, U+109EC, U+1EC9D, U+1ED2D)
T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
                            U+2188, U+109ED, U+1EC9E, U+1ECA0,
                            U+1ECB4)
T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
                            U+109EE, U+1EC9F)
T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
                            U+12432)
T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
                            U+109EF)
T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
                            U+109F0)
T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
                            U+12433)
T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
                            U+109F1)
T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
                            U+109F2)
T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
                            U+109F3)
T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
                            U+109F4)
T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
                            U+109F5)
T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
                            U+16B5E)
T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
                            (1: U+1ECA1)
T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
                            (1: U+1ECA2)
T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
                            (3: U+4EBF, U+5104, U+16B5F)
T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
                            10000000000}) (1: U+16B60)
T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
                            1000000000000})
                            (2: U+5146, U+16B61)
  \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_200 plus all
                            above-Unicode code points: [\x00-\x20!
                            \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
                            \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
                            \xbb\xbf-\xff], U+0100..065F,
                            U+066A..06EF, U+06FA..07BF,
                            U+07CA..0965, U+0970..09E5 ...)
  \p{Nushu}               \p{Script_Extensions=Nushu} (Short:
                            \p{Nshu}; NOT \p{Block=Nushu}) (397)
  \p{Nv: *}               \p{Numeric_Value: *}
  \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
                            Nyiakeng_Puachue_Hmong} (Short:
                            \p{Hmnp}; NOT \p{Block=
                            Nyiakeng_Puachue_Hmong}) (71)
X \p{OCR}                 \p{Optical_Character_Recognition} (=
                            \p{Block=Optical_Character_Recognition})
                            (32)
  \p{Ogam}                \p{Ogham} (= \p{Script_Extensions=Ogham})
                            (NOT \p{Block=Ogham}) (29)
  \p{Ogham}               \p{Script_Extensions=Ogham} (Short:
                            \p{Ogam}; NOT \p{Block=Ogham}) (29)
  \p{Ol_Chiki}            \p{Script_Extensions=Ol_Chiki} (Short:
                            \p{Olck}) (48)
  \p{Olck}                \p{Ol_Chiki} (= \p{Script_Extensions=
                            Ol_Chiki}) (48)
  \p{Old_Hungarian}       \p{Script_Extensions=Old_Hungarian}
                            (Short: \p{Hung}; NOT \p{Block=
                            Old_Hungarian}) (108)
  \p{Old_Italic}          \p{Script_Extensions=Old_Italic} (Short:
                            \p{Ital}; NOT \p{Block=Old_Italic}) (39)
  \p{Old_North_Arabian}   \p{Script_Extensions=Old_North_Arabian}
                            (Short: \p{Narb}) (32)
  \p{Old_Permic}          \p{Script_Extensions=Old_Permic} (Short:
                            \p{Perm}; NOT \p{Block=Old_Permic}) (44)
  \p{Old_Persian}         \p{Script_Extensions=Old_Persian} (Short:
                            \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
  \p{Old_Sogdian}         \p{Script_Extensions=Old_Sogdian} (Short:
                            \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
  \p{Old_South_Arabian}   \p{Script_Extensions=Old_South_Arabian}
                            (Short: \p{Sarb}) (32)
  \p{Old_Turkic}          \p{Script_Extensions=Old_Turkic} (Short:
                            \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
  \p{Old_Uyghur}          \p{Script_Extensions=Old_Uyghur} (Short:
                            \p{Ougr}; NOT
                            \p{Block=Old_Uyghur}) (28)
  \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
                            (Short: \p{Ps}) (79)
X \p{Optical_Character_Recognition} \p{Block=
                            Optical_Character_Recognition} (Short:
                            \p{InOCR}) (32)
  \p{Oriya}               \p{Script_Extensions=Oriya} (Short:
                            \p{Orya}; NOT \p{Block=Oriya}) (97)
  \p{Orkh}                \p{Old_Turkic} (= \p{Script_Extensions=
                            Old_Turkic}) (NOT \p{Block=Old_Turkic})
                            (73)
X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
  \p{Orya}                \p{Oriya} (= \p{Script_Extensions=Oriya})
                            (NOT \p{Block=Oriya}) (97)
  \p{Osage}               \p{Script_Extensions=Osage} (Short:
                            \p{Osge}; NOT \p{Block=Osage}) (72)
  \p{Osge}                \p{Osage} (= \p{Script_Extensions=Osage})
                            (NOT \p{Block=Osage}) (72)
  \p{Osma}                \p{Osmanya} (= \p{Script_Extensions=
                            Osmanya}) (NOT \p{Block=Osmanya}) (40)
  \p{Osmanya}             \p{Script_Extensions=Osmanya} (Short:
                            \p{Osma}; NOT \p{Block=Osmanya}) (40)
  \p{Other}               \p{General_Category=Other} (Short: \p{C})
                            (965_096 plus all above-Unicode code
                            points)
  \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
                            \p{Lo}) (131_612)
  \p{Other_Number}        \p{General_Category=Other_Number} (Short:
                            \p{No}) (915)
  \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
                            (Short: \p{Po}) (628)
  \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
                            \p{So}) (6634)
X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
  \p{Ougr}                \p{Old_Uyghur} (= \p{Script_Extensions=
                            Old_Uyghur}) (NOT \p{Block=Old_Uyghur})
                            (28)
  \p{P}                   \pP \p{Punct} (= \p{General_Category=
                            Punctuation}) (NOT
                            \p{General_Punctuation}) (842)
  \p{Pahawh_Hmong}        \p{Script_Extensions=Pahawh_Hmong} (Short:
                            \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
                            (127)
  \p{Palm}                \p{Palmyrene} (= \p{Script_Extensions=
                            Palmyrene}) (32)
  \p{Palmyrene}           \p{Script_Extensions=Palmyrene} (Short:
                            \p{Palm}) (32)
  \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
                            (Short: \p{Zp}) (1)
  \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
                            Y}) (2760)
  \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
  \p{Pat_WS}              \p{Pattern_White_Space} (=
                            \p{Pattern_White_Space=Y}) (11)
  \p{Pat_WS: *}           \p{Pattern_White_Space: *}
  \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
                            (2760)
  \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
                            (1_111_352 plus all above-Unicode code
                            points: [\x00-\x200-9A-Z_a-z\x7f-\xa0
                            \xa8\xaa\xad\xaf\xb2-\xb5\xb7-\xba\xbc-
                            \xbe\xc0-\xd6\xd8-\xf6\xf8-\xff],
                            U+0100..200F, U+2028..202F,
                            U+203F..2040, U+2054, U+205F..218F ...)
  \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
                            [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
                            \^`\{\|\}~\xa1-\xa7\xa9\xab-\xac\xae
                            \xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
                            U+2010..2027, U+2030..203E,
                            U+2041..2053, U+2055..205E, U+2190..245F
                            ...)
  \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
                            \p{PatWS}) (11)
  \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
                            (1_114_101 plus all above-Unicode code
                            points: [^\t\n\cK\f\r\x20\x85],
                            U+0100..200D, U+2010..2027,
                            U+202A..infinity)
  \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
                            [\t\n\cK\f\r\x20\x85], U+200E..200F,
                            U+2028..2029)
  \p{Pau_Cin_Hau}         \p{Script_Extensions=Pau_Cin_Hau} (Short:
                            \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
  \p{Pauc}                \p{Pau_Cin_Hau} (= \p{Script_Extensions=
                            Pau_Cin_Hau}) (NOT \p{Block=
                            Pau_Cin_Hau}) (57)
  \p{Pc}                  \p{Connector_Punctuation} (=
                            \p{General_Category=
                            Connector_Punctuation}) (10)
  \p{PCM}                 \p{Prepended_Concatenation_Mark} (=
                            \p{Prepended_Concatenation_Mark=Y}) (13)
  \p{PCM: *}              \p{Prepended_Concatenation_Mark: *}
  \p{Pd}                  \p{Dash_Punctuation} (=
                            \p{General_Category=Dash_Punctuation})
                            (26)
  \p{Pe}
                          \p{Close_Punctuation} (=
                            \p{General_Category=Close_Punctuation})
                            (77)
  \p{PerlSpace}           \p{PosixSpace} (6)
  \p{PerlWord}            \p{PosixWord} (63)
  \p{Perm}                \p{Old_Permic} (= \p{Script_Extensions=
                            Old_Permic}) (NOT \p{Block=Old_Permic})
                            (44)
  \p{Pf}                  \p{Final_Punctuation} (=
                            \p{General_Category=Final_Punctuation})
                            (10)
  \p{Phag}                \p{Phags_Pa} (= \p{Script_Extensions=
                            Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
  \p{Phags_Pa}            \p{Script_Extensions=Phags_Pa} (Short:
                            \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
X \p{Phaistos}            \p{Phaistos_Disc} (= \p{Block=
                            Phaistos_Disc}) (48)
X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (Short:
                            \p{InPhaistos}) (48)
  \p{Phli}                \p{Inscriptional_Pahlavi} (=
                            \p{Script_Extensions=
                            Inscriptional_Pahlavi}) (NOT \p{Block=
                            Inscriptional_Pahlavi}) (27)
  \p{Phlp}                \p{Psalter_Pahlavi} (=
                            \p{Script_Extensions=Psalter_Pahlavi})
                            (NOT \p{Block=Psalter_Pahlavi}) (30)
  \p{Phnx}                \p{Phoenician} (= \p{Script_Extensions=
                            Phoenician}) (NOT \p{Block=Phoenician})
                            (29)
  \p{Phoenician}          \p{Script_Extensions=Phoenician} (Short:
                            \p{Phnx}; NOT \p{Block=Phoenician}) (29)
X \p{Phonetic_Ext}        \p{Phonetic_Extensions} (= \p{Block=
                            Phonetic_Extensions}) (128)
X \p{Phonetic_Ext_Sup}    \p{Phonetic_Extensions_Supplement} (=
                            \p{Block=
                            Phonetic_Extensions_Supplement}) (64)
X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
                            \p{InPhoneticExt}) (128)
X \p{Phonetic_Extensions_Supplement} \p{Block=
                            Phonetic_Extensions_Supplement} (Short:
                            \p{InPhoneticExtSup}) (64)
  \p{Pi}                  \p{Initial_Punctuation} (=
                            \p{General_Category=
                            Initial_Punctuation}) (12)
X \p{Playing_Cards}       \p{Block=Playing_Cards} (96)
  \p{Plrd}                \p{Miao} (= \p{Script_Extensions=Miao})
                            (NOT \p{Block=Miao}) (149)
  \p{Po}                  \p{Other_Punctuation} (=
                            \p{General_Category=Other_Punctuation})
                            (628)
  \p{PosixAlnum}
                          (62: [0-9A-Za-z])
  \p{PosixAlpha}          (52: [A-Za-z])
  \p{PosixBlank}          (2: [\t\x20])
  \p{PosixCntrl}          ASCII control characters (33: ACK, BEL,
                            BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
                            DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
                            FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
                            SOH, STX, SUB, SYN, US, VT)
  \p{PosixDigit}          (10: [0-9])
  \p{PosixGraph}          (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                            Z\[\\\]\^_`a-z\{\|\}~])
  \p{PosixLower}          (/i= PosixAlpha) (26: [a-z])
  \p{PosixPrint}          (95: [\x20-\x7e])
  \p{PosixPunct}          (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
                            \[\\\]\^_`\{\|\}~])
  \p{PosixSpace}          (Short: \p{PerlSpace}) (6: [\t\n\cK\f\r
                            \x20])
  \p{PosixUpper}          (/i= PosixAlpha) (26: [A-Z])
  \p{PosixWord}           \w, restricted to ASCII (Short:
                            \p{PerlWord}) (63: [0-9A-Z_a-z])
  \p{PosixXDigit}         \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
                            (22)
  \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
                            Y} (Short: \p{PCM}) (13)
  \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
                            (1_114_099 plus all above-Unicode code
                            points: U+0000..05FF, U+0606..06DC,
                            U+06DE..070E, U+0710..088F,
                            U+0892..08E1, U+08E3..110BC ...)
  \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
                            (13: U+0600..0605, U+06DD, U+070F,
                            U+0890..0891, U+08E2, U+110BD ...)
T \p{Present_In: 1.1}     \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
                            extension) (33_979)
  \p{Present_In: V1_1}    \p{Present_In=1.1} (= \p{Age=V1_1}) (Perl
                            extension) (33_979)
T \p{Present_In: 2.0}     Code point's usage introduced in version
                            2.0 or earlier (Short: \p{In=2.0}) (Perl
                            extension) (178_500: U+0000..01F5,
                            U+01FA..0217, U+0250..02A8,
                            U+02B0..02DE, U+02E0..02E9, U+0300..0345
                            ...)
  \p{Present_In: V2_0}    \p{Present_In=2.0} (Perl extension)
                            (178_500)
T \p{Present_In: 2.1}     Code point's usage introduced in version
                            2.1 or earlier (Short: \p{In=2.1}) (Perl
                            extension) (178_502: U+0000..01F5,
                            U+01FA..0217, U+0250..02A8,
                            U+02B0..02DE, U+02E0..02E9, U+0300..0345
                            ...)
  \p{Present_In: V2_1}    \p{Present_In=2.1} (Perl extension)
                            (178_502)
T \p{Present_In: 3.0}     Code point's usage introduced in version
                            3.0 or earlier (Short: \p{In=3.0}) (Perl
                            extension) (188_809: U+0000..021F,
                            U+0222..0233, U+0250..02AD,
                            U+02B0..02EE, U+0300..034E, U+0360..0362
                            ...)
  \p{Present_In: V3_0}    \p{Present_In=3.0} (Perl extension)
                            (188_809)
T \p{Present_In: 3.1}     Code point's usage introduced in version
                            3.1 or earlier (Short: \p{In=3.1}) (Perl
                            extension) (233_787: U+0000..021F,
                            U+0222..0233, U+0250..02AD,
                            U+02B0..02EE, U+0300..034E, U+0360..0362
                            ...)
  \p{Present_In: V3_1}    \p{Present_In=3.1} (Perl extension)
                            (233_787)
T \p{Present_In: 3.2}     Code point's usage introduced in version
                            3.2 or earlier (Short: \p{In=3.2}) (Perl
                            extension) (234_803: U+0000..0220,
                            U+0222..0233, U+0250..02AD,
                            U+02B0..02EE, U+0300..034F, U+0360..036F
                            ...)
  \p{Present_In: V3_2}    \p{Present_In=3.2} (Perl extension)
                            (234_803)
T \p{Present_In: 4.0}     Code point's usage introduced in version
                            4.0 or earlier (Short: \p{In=4.0}) (Perl
                            extension) (236_029: U+0000..0236,
                            U+0250..0357, U+035D..036F,
                            U+0374..0375, U+037A, U+037E ...)
  \p{Present_In: V4_0}    \p{Present_In=4.0} (Perl extension)
                            (236_029)
T \p{Present_In: 4.1}     Code point's usage introduced in version
                            4.1 or earlier (Short: \p{In=4.1}) (Perl
                            extension) (237_302: U+0000..0241,
                            U+0250..036F, U+0374..0375, U+037A,
                            U+037E, U+0384..038A ...)
  \p{Present_In: V4_1}    \p{Present_In=4.1} (Perl extension)
                            (237_302)
T \p{Present_In: 5.0}     Code point's usage introduced in version
                            5.0 or earlier (Short: \p{In=5.0}) (Perl
                            extension) (238_671: U+0000..036F,
                            U+0374..0375, U+037A..037E,
                            U+0384..038A, U+038C, U+038E..03A1 ...)
  \p{Present_In: V5_0}    \p{Present_In=5.0} (Perl extension)
                            (238_671)
T \p{Present_In: 5.1}     Code point's usage introduced in version
                            5.1 or earlier (Short: \p{In=5.1}) (Perl
                            extension) (240_295: U+0000..0377,
                            U+037A..037E, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..0523 ...)
  \p{Present_In: V5_1}    \p{Present_In=5.1} (Perl extension)
                            (240_295)
T \p{Present_In: 5.2}     Code point's usage introduced in version
                            5.2 or earlier (Short: \p{In=5.2}) (Perl
                            extension) (246_943: U+0000..0377,
                            U+037A..037E, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..0525 ...)
  \p{Present_In: V5_2}    \p{Present_In=5.2} (Perl extension)
                            (246_943)
T \p{Present_In: 6.0}     Code point's usage introduced in version
                            6.0 or earlier (Short: \p{In=6.0}) (Perl
                            extension) (249_031: U+0000..0377,
                            U+037A..037E, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..0527 ...)
  \p{Present_In: V6_0}    \p{Present_In=6.0} (Perl extension)
                            (249_031)
T \p{Present_In: 6.1}     Code point's usage introduced in version
                            6.1 or earlier (Short: \p{In=6.1}) (Perl
                            extension) (249_763: U+0000..0377,
                            U+037A..037E, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..0527 ...)
  \p{Present_In: V6_1}    \p{Present_In=6.1} (Perl extension)
                            (249_763)
T \p{Present_In: 6.2}     Code point's usage introduced in version
                            6.2 or earlier (Short: \p{In=6.2}) (Perl
                            extension) (249_764: U+0000..0377,
                            U+037A..037E, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..0527 ...)
  \p{Present_In: V6_2}    \p{Present_In=6.2} (Perl extension)
                            (249_764)
T \p{Present_In: 6.3}     Code point's usage introduced in version
                            6.3 or earlier (Short: \p{In=6.3}) (Perl
                            extension) (249_769: U+0000..0377,
                            U+037A..037E, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..0527 ...)
  \p{Present_In: V6_3}    \p{Present_In=6.3} (Perl extension)
                            (249_769)
T \p{Present_In: 7.0}     Code point's usage introduced in version
                            7.0 or earlier (Short: \p{In=7.0}) (Perl
                            extension) (252_603: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V7_0}    \p{Present_In=7.0} (Perl extension)
                            (252_603)
T \p{Present_In: 8.0}     Code point's usage introduced in version
                            8.0 or earlier (Short: \p{In=8.0}) (Perl
                            extension) (260_319: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V8_0}    \p{Present_In=8.0} (Perl extension)
                            (260_319)
T \p{Present_In: 9.0}     Code point's usage introduced in version
                            9.0 or earlier (Short: \p{In=9.0}) (Perl
                            extension) (267_819: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V9_0}    \p{Present_In=9.0} (Perl extension)
                            (267_819)
T \p{Present_In: 10.0}    Code point's usage introduced in version
                            10.0 or earlier (Short: \p{In=10.0})
                            (Perl extension) (276_337: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V10_0}   \p{Present_In=10.0} (Perl extension)
                            (276_337)
T \p{Present_In: 11.0}    Code point's usage introduced in version
                            11.0 or earlier (Short: \p{In=11.0})
                            (Perl extension) (277_021: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V11_0}   \p{Present_In=11.0} (Perl extension)
                            (277_021)
T \p{Present_In: 12.0}    Code point's usage introduced in version
                            12.0 or earlier (Short: \p{In=12.0})
                            (Perl extension) (277_575: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V12_0}   \p{Present_In=12.0} (Perl extension)
                            (277_575)
T \p{Present_In: 12.1}    Code point's usage introduced in version
                            12.1 or earlier (Short: \p{In=12.1})
                            (Perl extension) (277_576: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V12_1}   \p{Present_In=12.1} (Perl extension)
                            (277_576)
T \p{Present_In: 13.0}    Code point's usage introduced in version
                            13.0 or earlier (Short: \p{In=13.0})
                            (Perl extension) (283_506: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V13_0}   \p{Present_In=13.0} (Perl extension)
                            (283_506)
T \p{Present_In: 14.0}    Code point's usage introduced in version
                            14.0 or earlier (Short: \p{In=14.0})
                            (Perl extension) (284_344: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V14_0}   \p{Present_In=14.0} (Perl extension)
                            (284_344)
T \p{Present_In: 15.0}    Code point's usage introduced in version
                            15.0 or earlier (Short: \p{In=15.0})
                            (Perl extension) (288_833: U+0000..0377,
                            U+037A..037F, U+0384..038A, U+038C,
                            U+038E..03A1, U+03A3..052F ...)
  \p{Present_In: V15_0}   \p{Present_In=15.0} (Perl extension)
                            (288_833)
  \p{Present_In: NA}      \p{Present_In=Unassigned} (= \p{Age=
                            Unassigned}) (Perl extension) (825_279
                            plus all above-Unicode code points)
  \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=NA})
                            (Perl extension) (825_279 plus all
                            above-Unicode code points)
  \p{Print}               \p{XPosixPrint} (286_652)
  \p{Private_Use}         \p{General_Category=Private_Use} (Short:
                            \p{Co}; NOT \p{Private_Use_Area})
                            (137_468)
X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
                            \p{InPUA}) (6400)
  \p{Prti}                \p{Inscriptional_Parthian} (=
                            \p{Script_Extensions=
                            Inscriptional_Parthian}) (NOT \p{Block=
                            Inscriptional_Parthian}) (30)
  \p{Ps}                  \p{Open_Punctuation} (=
                            \p{General_Category=Open_Punctuation})
                            (79)
  \p{Psalter_Pahlavi}     \p{Script_Extensions=Psalter_Pahlavi}
                            (Short: \p{Phlp}; NOT \p{Block=
                            Psalter_Pahlavi}) (30)
X \p{PUA}                 \p{Private_Use_Area} (= \p{Block=
                            Private_Use_Area}) (6400)
  \p{Punct}               \p{General_Category=Punctuation} (Short:
                            \p{P}; NOT \p{General_Punctuation}) (842)
  \p{Punctuation}         \p{Punct} (= \p{General_Category=
                            Punctuation}) (NOT
                            \p{General_Punctuation}) (842)
  \p{Qaac}                \p{Coptic} (= \p{Script_Extensions=
                            Coptic}) (NOT \p{Block=Coptic}) (165)
  \p{Qaai}                \p{Inherited} (= \p{Script_Extensions=
                            Inherited}) (586)
  \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
                            Y}) (30)
  \p{QMark: *}            \p{Quotation_Mark: *}
  \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
                            (30)
  \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_082
                            plus all above-Unicode code points:
                            [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=>?
                            \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xaa\xac-
                            \xba\xbc-\xff], U+0100..2017,
                            U+2020..2038, U+203B..2E41,
                            U+2E43..300B, U+3010..301C ...)
5137 \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30: [\" 5138 \'\xab\xbb], U+2018..201F, U+2039..203A, 5139 U+2E42, U+300C..300F, U+301D..301F ...) 5140 \p{Radical} \p{Radical=Y} (329) 5141 \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all 5142 above-Unicode code points: U+0000..2E7F, 5143 U+2E9A, U+2EF4..2EFF, U+2FD6..infinity) 5144 \p{Radical: Y*} (Single: \p{Radical}) (329: U+2E80..2E99, 5145 U+2E9B..2EF3, U+2F00..2FD5) 5146 \p{Regional_Indicator} \p{Regional_Indicator=Y} (Short: \p{RI}) 5147 (26) 5148 \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086 5149 plus all above-Unicode code points: 5150 U+0000..1F1E5, U+1F200..infinity) 5151 \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26: 5152 U+1F1E6..1F1FF) 5153 \p{Rejang} \p{Script_Extensions=Rejang} (Short: 5154 \p{Rjng}; NOT \p{Block=Rejang}) (37) 5155 \p{RI} \p{Regional_Indicator} (= 5156 \p{Regional_Indicator=Y}) (26) 5157 \p{RI: *} \p{Regional_Indicator: *} 5158 \p{Rjng} \p{Rejang} (= \p{Script_Extensions= 5159 Rejang}) (NOT \p{Block=Rejang}) (37) 5160 \p{Rohg} \p{Hanifi_Rohingya} (= 5161 \p{Script_Extensions=Hanifi_Rohingya}) 5162 (NOT \p{Block=Hanifi_Rohingya}) (55) 5163 X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block= 5164 Rumi_Numeral_Symbols}) (32) 5165 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short: 5166 \p{InRumi}) (32) 5167 \p{Runic} \p{Script_Extensions=Runic} (Short: 5168 \p{Runr}; NOT \p{Block=Runic}) (86) 5169 \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic}) 5170 (NOT \p{Block=Runic}) (86) 5171 \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol}) 5172 (7770) 5173 \p{Samaritan} \p{Script_Extensions=Samaritan} (Short: 5174 \p{Samr}; NOT \p{Block=Samaritan}) (61) 5175 \p{Samr} \p{Samaritan} (= \p{Script_Extensions= 5176 Samaritan}) (NOT \p{Block=Samaritan}) 5177 (61) 5178 \p{Sarb} \p{Old_South_Arabian} (= 5179 \p{Script_Extensions=Old_South_Arabian}) 5180 (32) 5181 \p{Saur} \p{Saurashtra} (= \p{Script_Extensions= 5182 Saurashtra}) (NOT 
\p{Block=Saurashtra}) 5183 (82) 5184 \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short: 5185 \p{Saur}; NOT \p{Block=Saurashtra}) (82) 5186 \p{SB: *} \p{Sentence_Break: *} 5187 \p{Sc} \p{Currency_Symbol} (= 5188 \p{General_Category=Currency_Symbol}) 5189 (63) 5190 \p{Sc: *} \p{Script: *} 5191 \p{Script: Adlam} (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B, 5192 U+1E950..1E959, U+1E95E..1E95F) 5193 \p{Script: Adlm} \p{Script=Adlam} (88) 5194 \p{Script: Aghb} \p{Script=Caucasian_Albanian} (= 5195 \p{Script_Extensions= 5196 Caucasian_Albanian}) (53) 5197 \p{Script: Ahom} \p{Script_Extensions=Ahom} (Short: \p{Sc= 5198 Ahom}, \p{Ahom}) (65) 5199 \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions= 5200 Anatolian_Hieroglyphs} (Short: \p{Sc= 5201 Hluw}, \p{Hluw}) (583) 5202 \p{Script: Arab} \p{Script=Arabic} (1368) 5203 \p{Script: Arabic} (Short: \p{Sc=Arab}) (1368: U+0600..0604, 5204 U+0606..060B, U+060D..061A, 5205 U+061C..061E, U+0620..063F, U+0641..064A 5206 ...) 5207 \p{Script: Armenian} \p{Script_Extensions=Armenian} (Short: 5208 \p{Sc=Armn}, \p{Armn}) (96) 5209 \p{Script: Armi} \p{Script=Imperial_Aramaic} (= 5210 \p{Script_Extensions=Imperial_Aramaic}) 5211 (31) 5212 \p{Script: Armn} \p{Script=Armenian} (= 5213 \p{Script_Extensions=Armenian}) (96) 5214 \p{Script: Avestan} \p{Script_Extensions=Avestan} (Short: 5215 \p{Sc=Avst}, \p{Avst}) (61) 5216 \p{Script: Avst} \p{Script=Avestan} (= 5217 \p{Script_Extensions=Avestan}) (61) 5218 \p{Script: Bali} \p{Script=Balinese} (= 5219 \p{Script_Extensions=Balinese}) (124) 5220 \p{Script: Balinese} \p{Script_Extensions=Balinese} (Short: 5221 \p{Sc=Bali}, \p{Bali}) (124) 5222 \p{Script: Bamu} \p{Script=Bamum} (= \p{Script_Extensions= 5223 Bamum}) (657) 5224 \p{Script: Bamum} \p{Script_Extensions=Bamum} (Short: \p{Sc= 5225 Bamu}, \p{Bamu}) (657) 5226 \p{Script: Bass} \p{Script=Bassa_Vah} (= 5227 \p{Script_Extensions=Bassa_Vah}) (36) 5228 \p{Script: Bassa_Vah} \p{Script_Extensions=Bassa_Vah} (Short: 5229 \p{Sc=Bass}, \p{Bass}) 
(36) 5230 \p{Script: Batak} \p{Script_Extensions=Batak} (Short: \p{Sc= 5231 Batk}, \p{Batk}) (56) 5232 \p{Script: Batk} \p{Script=Batak} (= \p{Script_Extensions= 5233 Batak}) (56) 5234 \p{Script: Beng} \p{Script=Bengali} (96) 5235 \p{Script: Bengali} (Short: \p{Sc=Beng}) (96: U+0980..0983, 5236 U+0985..098C, U+098F..0990, 5237 U+0993..09A8, U+09AA..09B0, U+09B2 ...) 5238 \p{Script: Bhaiksuki} \p{Script_Extensions=Bhaiksuki} (Short: 5239 \p{Sc=Bhks}, \p{Bhks}) (97) 5240 \p{Script: Bhks} \p{Script=Bhaiksuki} (= 5241 \p{Script_Extensions=Bhaiksuki}) (97) 5242 \p{Script: Bopo} \p{Script=Bopomofo} (77) 5243 \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (77: U+02EA..02EB, 5244 U+3105..312F, U+31A0..31BF) 5245 \p{Script: Brah} \p{Script=Brahmi} (= \p{Script_Extensions= 5246 Brahmi}) (115) 5247 \p{Script: Brahmi} \p{Script_Extensions=Brahmi} (Short: 5248 \p{Sc=Brah}, \p{Brah}) (115) 5249 \p{Script: Brai} \p{Script=Braille} (= 5250 \p{Script_Extensions=Braille}) (256) 5251 \p{Script: Braille} \p{Script_Extensions=Braille} (Short: 5252 \p{Sc=Brai}, \p{Brai}) (256) 5253 \p{Script: Bugi} \p{Script=Buginese} (30) 5254 \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B, 5255 U+1A1E..1A1F) 5256 \p{Script: Buhd} \p{Script=Buhid} (20) 5257 \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20: U+1740..1753) 5258 \p{Script: Cakm} \p{Script=Chakma} (71) 5259 \p{Script: Canadian_Aboriginal} \p{Script_Extensions= 5260 Canadian_Aboriginal} (Short: \p{Sc= 5261 Cans}, \p{Cans}) (726) 5262 \p{Script: Cans} \p{Script=Canadian_Aboriginal} (= 5263 \p{Script_Extensions= 5264 Canadian_Aboriginal}) (726) 5265 \p{Script: Cari} \p{Script=Carian} (= \p{Script_Extensions= 5266 Carian}) (49) 5267 \p{Script: Carian} \p{Script_Extensions=Carian} (Short: 5268 \p{Sc=Cari}, \p{Cari}) (49) 5269 \p{Script: Caucasian_Albanian} \p{Script_Extensions= 5270 Caucasian_Albanian} (Short: \p{Sc=Aghb}, 5271 \p{Aghb}) (53) 5272 \p{Script: Chakma} (Short: \p{Sc=Cakm}) (71: U+11100..11134, 5273 U+11136..11147) 5274 \p{Script: 
Cham} \p{Script_Extensions=Cham} (Short: \p{Sc= 5275 Cham}, \p{Cham}) (83) 5276 \p{Script: Cher} \p{Script=Cherokee} (= 5277 \p{Script_Extensions=Cherokee}) (172) 5278 \p{Script: Cherokee} \p{Script_Extensions=Cherokee} (Short: 5279 \p{Sc=Cher}, \p{Cher}) (172) 5280 \p{Script: Chorasmian} \p{Script_Extensions=Chorasmian} (Short: 5281 \p{Sc=Chrs}, \p{Chrs}) (28) 5282 \p{Script: Chrs} \p{Script=Chorasmian} (= 5283 \p{Script_Extensions=Chorasmian}) (28) 5284 \p{Script: Common} (Short: \p{Sc=Zyyy}) (8301: [\x00-\x20! 5285 \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\] 5286 \^_`\{\|\}~\x7f-\xa9\xab-\xb9\xbb-\xbf 5287 \xd7\xf7], U+02B9..02DF, U+02E5..02E9, 5288 U+02EC..02FF, U+0374, U+037E ...) 5289 \p{Script: Copt} \p{Script=Coptic} (137) 5290 \p{Script: Coptic} (Short: \p{Sc=Copt}) (137: U+03E2..03EF, 5291 U+2C80..2CF3, U+2CF9..2CFF) 5292 \p{Script: Cpmn} \p{Script=Cypro_Minoan} (99) 5293 \p{Script: Cprt} \p{Script=Cypriot} (55) 5294 \p{Script: Cuneiform} \p{Script_Extensions=Cuneiform} (Short: 5295 \p{Sc=Xsux}, \p{Xsux}) (1234) 5296 \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55: U+10800..10805, 5297 U+10808, U+1080A..10835, U+10837..10838, 5298 U+1083C, U+1083F) 5299 \p{Script: Cypro_Minoan} (Short: \p{Sc=Cpmn}) (99: U+12F90..12FF2) 5300 \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (506: U+0400..0484, 5301 U+0487..052F, U+1C80..1C88, U+1D2B, 5302 U+1D78, U+2DE0..2DFF ...) 
  \p{Script: Cyrl}        \p{Script=Cyrillic} (506)
  \p{Script: Deseret}     \p{Script_Extensions=Deseret} (Short:
                            \p{Sc=Dsrt}, \p{Dsrt}) (80)
  \p{Script: Deva}        \p{Script=Devanagari} (164)
  \p{Script: Devanagari}  (Short: \p{Sc=Deva}) (164: U+0900..0950,
                            U+0955..0963, U+0966..097F,
                            U+A8E0..A8FF, U+11B00..11B09)
  \p{Script: Diak}        \p{Script=Dives_Akuru} (=
                            \p{Script_Extensions=Dives_Akuru}) (72)
  \p{Script: Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
                            \p{Sc=Diak}, \p{Diak}) (72)
  \p{Script: Dogr}        \p{Script=Dogra} (60)
  \p{Script: Dogra}       (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
  \p{Script: Dsrt}        \p{Script=Deseret} (=
                            \p{Script_Extensions=Deseret}) (80)
  \p{Script: Dupl}        \p{Script=Duployan} (143)
  \p{Script: Duployan}    (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
                            U+1BC70..1BC7C, U+1BC80..1BC88,
                            U+1BC90..1BC99, U+1BC9C..1BC9F)
  \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (=
                            \p{Script_Extensions=
                            Egyptian_Hieroglyphs}) (1110)
  \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
                            Egyptian_Hieroglyphs} (Short: \p{Sc=
                            Egyp}, \p{Egyp}) (1110)
  \p{Script: Elba}        \p{Script=Elbasan} (=
                            \p{Script_Extensions=Elbasan}) (40)
  \p{Script: Elbasan}     \p{Script_Extensions=Elbasan} (Short:
                            \p{Sc=Elba}, \p{Elba}) (40)
  \p{Script: Elym}        \p{Script=Elymaic} (=
                            \p{Script_Extensions=Elymaic}) (23)
  \p{Script: Elymaic}     \p{Script_Extensions=Elymaic} (Short:
                            \p{Sc=Elym}, \p{Elym}) (23)
  \p{Script: Ethi}        \p{Script=Ethiopic} (=
                            \p{Script_Extensions=Ethiopic}) (523)
  \p{Script: Ethiopic}    \p{Script_Extensions=Ethiopic} (Short:
                            \p{Sc=Ethi}, \p{Ethi}) (523)
  \p{Script: Geor}        \p{Script=Georgian} (173)
  \p{Script: Georgian}    (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
                            U+10C7, U+10CD, U+10D0..10FA,
                            U+10FC..10FF, U+1C90..1CBA ...)
  \p{Script: Glag}        \p{Script=Glagolitic} (134)
  \p{Script: Glagolitic}  (Short: \p{Sc=Glag}) (134: U+2C00..2C5F,
                            U+1E000..1E006, U+1E008..1E018,
                            U+1E01B..1E021, U+1E023..1E024,
                            U+1E026..1E02A)
  \p{Script: Gong}        \p{Script=Gunjala_Gondi} (63)
  \p{Script: Gonm}        \p{Script=Masaram_Gondi} (75)
  \p{Script: Goth}        \p{Script=Gothic} (= \p{Script_Extensions=
                            Gothic}) (27)
  \p{Script: Gothic}      \p{Script_Extensions=Gothic} (Short:
                            \p{Sc=Goth}, \p{Goth}) (27)
  \p{Script: Gran}        \p{Script=Grantha} (85)
  \p{Script: Grantha}     (Short: \p{Sc=Gran}) (85: U+11300..11303,
                            U+11305..1130C, U+1130F..11310,
                            U+11313..11328, U+1132A..11330,
                            U+11332..11333 ...)
  \p{Script: Greek}       (Short: \p{Sc=Grek}) (518: U+0370..0373,
                            U+0375..0377, U+037A..037D, U+037F,
                            U+0384, U+0386 ...)
  \p{Script: Grek}        \p{Script=Greek} (518)
  \p{Script: Gujarati}    (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
                            U+0A85..0A8D, U+0A8F..0A91,
                            U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
                            ...)
  \p{Script: Gujr}        \p{Script=Gujarati} (91)
  \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
                            U+11D60..11D65, U+11D67..11D68,
                            U+11D6A..11D8E, U+11D90..11D91,
                            U+11D93..11D98, U+11DA0..11DA9)
  \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
                            U+0A05..0A0A, U+0A0F..0A10,
                            U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
                            ...)
  \p{Script: Guru}        \p{Script=Gurmukhi} (80)
  \p{Script: Han}         (Short: \p{Sc=Han}) (98_408: U+2E80..2E99,
                            U+2E9B..2EF3, U+2F00..2FD5, U+3005,
                            U+3007, U+3021..3029 ...)
  \p{Script: Hang}        \p{Script=Hangul} (11_739)
  \p{Script: Hangul}      (Short: \p{Sc=Hang}) (11_739:
                            U+1100..11FF, U+302E..302F,
                            U+3131..318E, U+3200..321E,
                            U+3260..327E, U+A960..A97C ...)
  \p{Script: Hani}        \p{Script=Han} (98_408)
  \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
                            U+10D00..10D27, U+10D30..10D39)
  \p{Script: Hano}        \p{Script=Hanunoo} (21)
  \p{Script: Hanunoo}     (Short: \p{Sc=Hano}) (21: U+1720..1734)
  \p{Script: Hatr}        \p{Script=Hatran} (= \p{Script_Extensions=
                            Hatran}) (26)
  \p{Script: Hatran}      \p{Script_Extensions=Hatran} (Short:
                            \p{Sc=Hatr}, \p{Hatr}) (26)
  \p{Script: Hebr}        \p{Script=Hebrew} (= \p{Script_Extensions=
                            Hebrew}) (134)
  \p{Script: Hebrew}      \p{Script_Extensions=Hebrew} (Short:
                            \p{Sc=Hebr}, \p{Hebr}) (134)
  \p{Script: Hira}        \p{Script=Hiragana} (381)
  \p{Script: Hiragana}    (Short: \p{Sc=Hira}) (381: U+3041..3096,
                            U+309D..309F, U+1B001..1B11F, U+1B132,
                            U+1B150..1B152, U+1F200)
  \p{Script: Hluw}        \p{Script=Anatolian_Hieroglyphs} (=
                            \p{Script_Extensions=
                            Anatolian_Hieroglyphs}) (583)
  \p{Script: Hmng}        \p{Script=Pahawh_Hmong} (=
                            \p{Script_Extensions=Pahawh_Hmong}) (127)
  \p{Script: Hmnp}        \p{Script=Nyiakeng_Puachue_Hmong} (=
                            \p{Script_Extensions=
                            Nyiakeng_Puachue_Hmong}) (71)
  \p{Script: Hung}        \p{Script=Old_Hungarian} (=
                            \p{Script_Extensions=Old_Hungarian})
                            (108)
  \p{Script: Imperial_Aramaic} \p{Script_Extensions=
                            Imperial_Aramaic} (Short: \p{Sc=Armi},
                            \p{Armi}) (31)
  \p{Script: Inherited}   (Short: \p{Sc=Zinh}) (657: U+0300..036F,
                            U+0485..0486, U+064B..0655, U+0670,
                            U+0951..0954, U+1AB0..1ACE ...)
  \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
                            Inscriptional_Pahlavi} (Short: \p{Sc=
                            Phli}, \p{Phli}) (27)
  \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
                            Inscriptional_Parthian} (Short: \p{Sc=
                            Prti}, \p{Prti}) (30)
  \p{Script: Ital}        \p{Script=Old_Italic} (=
                            \p{Script_Extensions=Old_Italic}) (39)
  \p{Script: Java}        \p{Script=Javanese} (90)
  \p{Script: Javanese}    (Short: \p{Sc=Java}) (90: U+A980..A9CD,
                            U+A9D0..A9D9, U+A9DE..A9DF)
  \p{Script: Kaithi}      (Short: \p{Sc=Kthi}) (68: U+11080..110C2,
                            U+110CD)
  \p{Script: Kali}        \p{Script=Kayah_Li} (47)
  \p{Script: Kana}        \p{Script=Katakana} (321)
  \p{Script: Kannada}     (Short: \p{Sc=Knda}) (91: U+0C80..0C8C,
                            U+0C8E..0C90, U+0C92..0CA8,
                            U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
                            ...)
  \p{Script: Katakana}    (Short: \p{Sc=Kana}) (321: U+30A1..30FA,
                            U+30FD..30FF, U+31F0..31FF,
                            U+32D0..32FE, U+3300..3357, U+FF66..FF6F
                            ...)
  \p{Script: Kawi}        \p{Script_Extensions=Kawi} (Short: \p{Sc=
                            Kawi}, \p{Kawi}) (86)
  \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}) (47: U+A900..A92D,
                            U+A92F)
  \p{Script: Khar}        \p{Script=Kharoshthi} (=
                            \p{Script_Extensions=Kharoshthi}) (68)
  \p{Script: Kharoshthi}  \p{Script_Extensions=Kharoshthi} (Short:
                            \p{Sc=Khar}, \p{Khar}) (68)
  \p{Script: Khitan_Small_Script} \p{Script_Extensions=
                            Khitan_Small_Script} (Short: \p{Sc=
                            Kits}, \p{Kits}) (471)
  \p{Script: Khmer}       \p{Script_Extensions=Khmer} (Short: \p{Sc=
                            Khmr}, \p{Khmr}) (146)
  \p{Script: Khmr}        \p{Script=Khmer} (= \p{Script_Extensions=
                            Khmer}) (146)
  \p{Script: Khoj}        \p{Script=Khojki} (65)
  \p{Script: Khojki}      (Short: \p{Sc=Khoj}) (65: U+11200..11211,
                            U+11213..11241)
  \p{Script: Khudawadi}   (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
                            U+112F0..112F9)
  \p{Script: Kits}        \p{Script=Khitan_Small_Script} (=
                            \p{Script_Extensions=
                            Khitan_Small_Script}) (471)
  \p{Script: Knda}        \p{Script=Kannada} (91)
  \p{Script: Kthi}        \p{Script=Kaithi} (68)
  \p{Script: Lana}        \p{Script=Tai_Tham} (=
                            \p{Script_Extensions=Tai_Tham}) (127)
  \p{Script: Lao}         \p{Script_Extensions=Lao} (Short: \p{Sc=
                            Lao}, \p{Lao}) (83)
  \p{Script: Laoo}        \p{Script=Lao} (= \p{Script_Extensions=
                            Lao}) (83)
  \p{Script: Latin}       (Short: \p{Sc=Latn}) (1481: [A-Za-z\xaa
                            \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                            U+0100..02B8, U+02E0..02E4,
                            U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
                            ...)
  \p{Script: Latn}        \p{Script=Latin} (1481)
  \p{Script: Lepc}        \p{Script=Lepcha} (= \p{Script_Extensions=
                            Lepcha}) (74)
  \p{Script: Lepcha}      \p{Script_Extensions=Lepcha} (Short:
                            \p{Sc=Lepc}, \p{Lepc}) (74)
  \p{Script: Limb}        \p{Script=Limbu} (68)
  \p{Script: Limbu}       (Short: \p{Sc=Limb}) (68: U+1900..191E,
                            U+1920..192B, U+1930..193B, U+1940,
                            U+1944..194F)
  \p{Script: Lina}        \p{Script=Linear_A} (341)
  \p{Script: Linb}        \p{Script=Linear_B} (211)
  \p{Script: Linear_A}    (Short: \p{Sc=Lina}) (341: U+10600..10736,
                            U+10740..10755, U+10760..10767)
  \p{Script: Linear_B}    (Short: \p{Sc=Linb}) (211: U+10000..1000B,
                            U+1000D..10026, U+10028..1003A,
                            U+1003C..1003D, U+1003F..1004D,
                            U+10050..1005D ...)
  \p{Script: Lisu}        \p{Script_Extensions=Lisu} (Short: \p{Sc=
                            Lisu}, \p{Lisu}) (49)
  \p{Script: Lyci}        \p{Script=Lycian} (= \p{Script_Extensions=
                            Lycian}) (29)
  \p{Script: Lycian}      \p{Script_Extensions=Lycian} (Short:
                            \p{Sc=Lyci}, \p{Lyci}) (29)
  \p{Script: Lydi}        \p{Script=Lydian} (= \p{Script_Extensions=
                            Lydian}) (27)
  \p{Script: Lydian}      \p{Script_Extensions=Lydian} (Short:
                            \p{Sc=Lydi}, \p{Lydi}) (27)
  \p{Script: Mahajani}    (Short: \p{Sc=Mahj}) (39: U+11150..11176)
  \p{Script: Mahj}        \p{Script=Mahajani} (39)
  \p{Script: Maka}        \p{Script=Makasar} (=
                            \p{Script_Extensions=Makasar}) (25)
  \p{Script: Makasar}     \p{Script_Extensions=Makasar} (Short:
                            \p{Sc=Maka}, \p{Maka}) (25)
  \p{Script: Malayalam}   (Short: \p{Sc=Mlym}) (118: U+0D00..0D0C,
                            U+0D0E..0D10, U+0D12..0D44,
                            U+0D46..0D48, U+0D4A..0D4F, U+0D54..0D63
                            ...)
  \p{Script: Mand}        \p{Script=Mandaic} (29)
  \p{Script: Mandaic}     (Short: \p{Sc=Mand}) (29: U+0840..085B,
                            U+085E)
  \p{Script: Mani}        \p{Script=Manichaean} (51)
  \p{Script: Manichaean}  (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
                            U+10AEB..10AF6)
  \p{Script: Marc}        \p{Script=Marchen} (=
                            \p{Script_Extensions=Marchen}) (68)
  \p{Script: Marchen}     \p{Script_Extensions=Marchen} (Short:
                            \p{Sc=Marc}, \p{Marc}) (68)
  \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
                            U+11D00..11D06, U+11D08..11D09,
                            U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
                            U+11D3F..11D47 ...)
5530 \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short: 5531 \p{Sc=Medf}, \p{Medf}) (91) 5532 \p{Script: Medf} \p{Script=Medefaidrin} (= 5533 \p{Script_Extensions=Medefaidrin}) (91) 5534 \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} 5535 (Short: \p{Sc=Mtei}, \p{Mtei}) (79) 5536 \p{Script: Mend} \p{Script=Mende_Kikakui} (= 5537 \p{Script_Extensions=Mende_Kikakui}) 5538 (213) 5539 \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui} 5540 (Short: \p{Sc=Mend}, \p{Mend}) (213) 5541 \p{Script: Merc} \p{Script=Meroitic_Cursive} (= 5542 \p{Script_Extensions=Meroitic_Cursive}) 5543 (90) 5544 \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (= 5545 \p{Script_Extensions= 5546 Meroitic_Hieroglyphs}) (32) 5547 \p{Script: Meroitic_Cursive} \p{Script_Extensions= 5548 Meroitic_Cursive} (Short: \p{Sc=Merc}, 5549 \p{Merc}) (90) 5550 \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions= 5551 Meroitic_Hieroglyphs} (Short: \p{Sc= 5552 Mero}, \p{Mero}) (32) 5553 \p{Script: Miao} \p{Script_Extensions=Miao} (Short: \p{Sc= 5554 Miao}, \p{Miao}) (149) 5555 \p{Script: Mlym} \p{Script=Malayalam} (118) 5556 \p{Script: Modi} (Short: \p{Sc=Modi}) (79: U+11600..11644, 5557 U+11650..11659) 5558 \p{Script: Mong} \p{Script=Mongolian} (168) 5559 \p{Script: Mongolian} (Short: \p{Sc=Mong}) (168: U+1800..1801, 5560 U+1804, U+1806..1819, U+1820..1878, 5561 U+1880..18AA, U+11660..1166C) 5562 \p{Script: Mro} \p{Script_Extensions=Mro} (Short: \p{Sc= 5563 Mro}, \p{Mro}) (43) 5564 \p{Script: Mroo} \p{Script=Mro} (= \p{Script_Extensions= 5565 Mro}) (43) 5566 \p{Script: Mtei} \p{Script=Meetei_Mayek} (= 5567 \p{Script_Extensions=Meetei_Mayek}) (79) 5568 \p{Script: Mult} \p{Script=Multani} (38) 5569 \p{Script: Multani} (Short: \p{Sc=Mult}) (38: U+11280..11286, 5570 U+11288, U+1128A..1128D, U+1128F..1129D, 5571 U+1129F..112A9) 5572 \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223: U+1000..109F, 5573 U+A9E0..A9FE, U+AA60..AA7F) 5574 \p{Script: Mymr} \p{Script=Myanmar} (223) 5575 
\p{Script: Nabataean} \p{Script_Extensions=Nabataean} (Short: 5576 \p{Sc=Nbat}, \p{Nbat}) (40) 5577 \p{Script: Nag_Mundari} \p{Script_Extensions=Nag_Mundari} (Short: 5578 \p{Sc=Nagm}, \p{Nagm}) (42) 5579 \p{Script: Nagm} \p{Script=Nag_Mundari} (= 5580 \p{Script_Extensions=Nag_Mundari}) (42) 5581 \p{Script: Nand} \p{Script=Nandinagari} (65) 5582 \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7, 5583 U+119AA..119D7, U+119DA..119E4) 5584 \p{Script: Narb} \p{Script=Old_North_Arabian} (= 5585 \p{Script_Extensions=Old_North_Arabian}) 5586 (32) 5587 \p{Script: Nbat} \p{Script=Nabataean} (= 5588 \p{Script_Extensions=Nabataean}) (40) 5589 \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short: 5590 \p{Sc=Talu}, \p{Talu}) (83) 5591 \p{Script: Newa} \p{Script_Extensions=Newa} (Short: \p{Sc= 5592 Newa}, \p{Newa}) (97) 5593 \p{Script: Nko} (Short: \p{Sc=Nko}) (62: U+07C0..07FA, 5594 U+07FD..07FF) 5595 \p{Script: Nkoo} \p{Script=Nko} (62) 5596 \p{Script: Nshu} \p{Script=Nushu} (= \p{Script_Extensions= 5597 Nushu}) (397) 5598 \p{Script: Nushu} \p{Script_Extensions=Nushu} (Short: \p{Sc= 5599 Nshu}, \p{Nshu}) (397) 5600 \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions= 5601 Nyiakeng_Puachue_Hmong} (Short: \p{Sc= 5602 Hmnp}, \p{Hmnp}) (71) 5603 \p{Script: Ogam} \p{Script=Ogham} (= \p{Script_Extensions= 5604 Ogham}) (29) 5605 \p{Script: Ogham} \p{Script_Extensions=Ogham} (Short: \p{Sc= 5606 Ogam}, \p{Ogam}) (29) 5607 \p{Script: Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short: 5608 \p{Sc=Olck}, \p{Olck}) (48) 5609 \p{Script: Olck} \p{Script=Ol_Chiki} (= 5610 \p{Script_Extensions=Ol_Chiki}) (48) 5611 \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian} 5612 (Short: \p{Sc=Hung}, \p{Hung}) (108) 5613 \p{Script: Old_Italic} \p{Script_Extensions=Old_Italic} (Short: 5614 \p{Sc=Ital}, \p{Ital}) (39) 5615 \p{Script: Old_North_Arabian} \p{Script_Extensions= 5616 Old_North_Arabian} (Short: \p{Sc=Narb}, 5617 \p{Narb}) (32) 5618 \p{Script: Old_Permic} 
(Short: \p{Sc=Perm}) (43: U+10350..1037A) 5619 \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short: 5620 \p{Sc=Xpeo}, \p{Xpeo}) (50) 5621 \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short: 5622 \p{Sc=Sogo}, \p{Sogo}) (40) 5623 \p{Script: Old_South_Arabian} \p{Script_Extensions= 5624 Old_South_Arabian} (Short: \p{Sc=Sarb}, 5625 \p{Sarb}) (32) 5626 \p{Script: Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short: 5627 \p{Sc=Orkh}, \p{Orkh}) (73) 5628 \p{Script: Old_Uyghur} (Short: \p{Sc=Ougr}) (26: U+10F70..10F89) 5629 \p{Script: Oriya} (Short: \p{Sc=Orya}) (91: U+0B01..0B03, 5630 U+0B05..0B0C, U+0B0F..0B10, 5631 U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33 5632 ...) 5633 \p{Script: Orkh} \p{Script=Old_Turkic} (= 5634 \p{Script_Extensions=Old_Turkic}) (73) 5635 \p{Script: Orya} \p{Script=Oriya} (91) 5636 \p{Script: Osage} \p{Script_Extensions=Osage} (Short: \p{Sc= 5637 Osge}, \p{Osge}) (72) 5638 \p{Script: Osge} \p{Script=Osage} (= \p{Script_Extensions= 5639 Osage}) (72) 5640 \p{Script: Osma} \p{Script=Osmanya} (= 5641 \p{Script_Extensions=Osmanya}) (40) 5642 \p{Script: Osmanya} \p{Script_Extensions=Osmanya} (Short: 5643 \p{Sc=Osma}, \p{Osma}) (40) 5644 \p{Script: Ougr} \p{Script=Old_Uyghur} (26) 5645 \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} 5646 (Short: \p{Sc=Hmng}, \p{Hmng}) (127) 5647 \p{Script: Palm} \p{Script=Palmyrene} (= 5648 \p{Script_Extensions=Palmyrene}) (32) 5649 \p{Script: Palmyrene} \p{Script_Extensions=Palmyrene} (Short: 5650 \p{Sc=Palm}, \p{Palm}) (32) 5651 \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short: 5652 \p{Sc=Pauc}, \p{Pauc}) (57) 5653 \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (= 5654 \p{Script_Extensions=Pau_Cin_Hau}) (57) 5655 \p{Script: Perm} \p{Script=Old_Permic} (43) 5656 \p{Script: Phag} \p{Script=Phags_Pa} (56) 5657 \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56: U+A840..A877) 5658 \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (= 5659 \p{Script_Extensions= 5660 
Inscriptional_Pahlavi}) (27) 5661 \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29) 5662 \p{Script: Phnx} \p{Script=Phoenician} (= 5663 \p{Script_Extensions=Phoenician}) (29) 5664 \p{Script: Phoenician} \p{Script_Extensions=Phoenician} (Short: 5665 \p{Sc=Phnx}, \p{Phnx}) (29) 5666 \p{Script: Plrd} \p{Script=Miao} (= \p{Script_Extensions= 5667 Miao}) (149) 5668 \p{Script: Prti} \p{Script=Inscriptional_Parthian} (= 5669 \p{Script_Extensions= 5670 Inscriptional_Parthian}) (30) 5671 \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29: 5672 U+10B80..10B91, U+10B99..10B9C, 5673 U+10BA9..10BAF) 5674 \p{Script: Qaac} \p{Script=Coptic} (137) 5675 \p{Script: Qaai} \p{Script=Inherited} (657) 5676 \p{Script: Rejang} \p{Script_Extensions=Rejang} (Short: 5677 \p{Sc=Rjng}, \p{Rjng}) (37) 5678 \p{Script: Rjng} \p{Script=Rejang} (= \p{Script_Extensions= 5679 Rejang}) (37) 5680 \p{Script: Rohg} \p{Script=Hanifi_Rohingya} (50) 5681 \p{Script: Runic} \p{Script_Extensions=Runic} (Short: \p{Sc= 5682 Runr}, \p{Runr}) (86) 5683 \p{Script: Runr} \p{Script=Runic} (= \p{Script_Extensions= 5684 Runic}) (86) 5685 \p{Script: Samaritan} \p{Script_Extensions=Samaritan} (Short: 5686 \p{Sc=Samr}, \p{Samr}) (61) 5687 \p{Script: Samr} \p{Script=Samaritan} (= 5688 \p{Script_Extensions=Samaritan}) (61) 5689 \p{Script: Sarb} \p{Script=Old_South_Arabian} (= 5690 \p{Script_Extensions=Old_South_Arabian}) 5691 (32) 5692 \p{Script: Saur} \p{Script=Saurashtra} (= 5693 \p{Script_Extensions=Saurashtra}) (82) 5694 \p{Script: Saurashtra} \p{Script_Extensions=Saurashtra} (Short: 5695 \p{Sc=Saur}, \p{Saur}) (82) 5696 \p{Script: Sgnw} \p{Script=SignWriting} (= 5697 \p{Script_Extensions=SignWriting}) (672) 5698 \p{Script: Sharada} (Short: \p{Sc=Shrd}) (96: U+11180..111DF) 5699 \p{Script: Shavian} \p{Script_Extensions=Shavian} (Short: 5700 \p{Sc=Shaw}, \p{Shaw}) (48) 5701 \p{Script: Shaw} \p{Script=Shavian} (= 5702 \p{Script_Extensions=Shavian}) (48) 5703 \p{Script: Shrd} \p{Script=Sharada} (96) 5704 \p{Script: Sidd} 
\p{Script=Siddham} (= 5705 \p{Script_Extensions=Siddham}) (92) 5706 \p{Script: Siddham} \p{Script_Extensions=Siddham} (Short: 5707 \p{Sc=Sidd}, \p{Sidd}) (92) 5708 \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short: 5709 \p{Sc=Sgnw}, \p{Sgnw}) (672) 5710 \p{Script: Sind} \p{Script=Khudawadi} (69) 5711 \p{Script: Sinh} \p{Script=Sinhala} (111) 5712 \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (111: U+0D81..0D83, 5713 U+0D85..0D96, U+0D9A..0DB1, 5714 U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...) 5715 \p{Script: Sogd} \p{Script=Sogdian} (42) 5716 \p{Script: Sogdian} (Short: \p{Sc=Sogd}) (42: U+10F30..10F59) 5717 \p{Script: Sogo} \p{Script=Old_Sogdian} (= 5718 \p{Script_Extensions=Old_Sogdian}) (40) 5719 \p{Script: Sora} \p{Script=Sora_Sompeng} (= 5720 \p{Script_Extensions=Sora_Sompeng}) (35) 5721 \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} 5722 (Short: \p{Sc=Sora}, \p{Sora}) (35) 5723 \p{Script: Soyo} \p{Script=Soyombo} (= 5724 \p{Script_Extensions=Soyombo}) (83) 5725 \p{Script: Soyombo} \p{Script_Extensions=Soyombo} (Short: 5726 \p{Sc=Soyo}, \p{Soyo}) (83) 5727 \p{Script: Sund} \p{Script=Sundanese} (= 5728 \p{Script_Extensions=Sundanese}) (72) 5729 \p{Script: Sundanese} \p{Script_Extensions=Sundanese} (Short: 5730 \p{Sc=Sund}, \p{Sund}) (72) 5731 \p{Script: Sylo} \p{Script=Syloti_Nagri} (45) 5732 \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (45: U+A800..A82C) 5733 \p{Script: Syrc} \p{Script=Syriac} (88) 5734 \p{Script: Syriac} (Short: \p{Sc=Syrc}) (88: U+0700..070D, 5735 U+070F..074A, U+074D..074F, U+0860..086A) 5736 \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (23: U+1700..1715, 5737 U+171F) 5738 \p{Script: Tagb} \p{Script=Tagbanwa} (18) 5739 \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18: U+1760..176C, 5740 U+176E..1770, U+1772..1773) 5741 \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35: U+1950..196D, 5742 U+1970..1974) 5743 \p{Script: Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short: 5744 \p{Sc=Lana}, \p{Lana}) (127) 5745 \p{Script: Tai_Viet} 
\p{Script_Extensions=Tai_Viet} (Short: 5746 \p{Sc=Tavt}, \p{Tavt}) (72) 5747 \p{Script: Takr} \p{Script=Takri} (68) 5748 \p{Script: Takri} (Short: \p{Sc=Takr}) (68: U+11680..116B9, 5749 U+116C0..116C9) 5750 \p{Script: Tale} \p{Script=Tai_Le} (35) 5751 \p{Script: Talu} \p{Script=New_Tai_Lue} (= 5752 \p{Script_Extensions=New_Tai_Lue}) (83) 5753 \p{Script: Tamil} (Short: \p{Sc=Taml}) (123: U+0B82..0B83, 5754 U+0B85..0B8A, U+0B8E..0B90, 5755 U+0B92..0B95, U+0B99..0B9A, U+0B9C ...) 5756 \p{Script: Taml} \p{Script=Tamil} (123) 5757 \p{Script: Tang} \p{Script=Tangut} (= \p{Script_Extensions= 5758 Tangut}) (6914) 5759 \p{Script: Tangsa} \p{Script_Extensions=Tangsa} (Short: 5760 \p{Sc=Tnsa}, \p{Tnsa}) (89) 5761 \p{Script: Tangut} \p{Script_Extensions=Tangut} (Short: 5762 \p{Sc=Tang}, \p{Tang}) (6914) 5763 \p{Script: Tavt} \p{Script=Tai_Viet} (= 5764 \p{Script_Extensions=Tai_Viet}) (72) 5765 \p{Script: Telu} \p{Script=Telugu} (100) 5766 \p{Script: Telugu} (Short: \p{Sc=Telu}) (100: U+0C00..0C0C, 5767 U+0C0E..0C10, U+0C12..0C28, 5768 U+0C2A..0C39, U+0C3C..0C44, U+0C46..0C48 5769 ...) 
5770 \p{Script: Tfng} \p{Script=Tifinagh} (= 5771 \p{Script_Extensions=Tifinagh}) (59) 5772 \p{Script: Tglg} \p{Script=Tagalog} (23) 5773 \p{Script: Thaa} \p{Script=Thaana} (50) 5774 \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50: U+0780..07B1) 5775 \p{Script: Thai} \p{Script_Extensions=Thai} (Short: \p{Sc= 5776 Thai}, \p{Thai}) (86) 5777 \p{Script: Tibetan} \p{Script_Extensions=Tibetan} (Short: 5778 \p{Sc=Tibt}, \p{Tibt}) (207) 5779 \p{Script: Tibt} \p{Script=Tibetan} (= 5780 \p{Script_Extensions=Tibetan}) (207) 5781 \p{Script: Tifinagh} \p{Script_Extensions=Tifinagh} (Short: 5782 \p{Sc=Tfng}, \p{Tfng}) (59) 5783 \p{Script: Tirh} \p{Script=Tirhuta} (82) 5784 \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82: U+11480..114C7, 5785 U+114D0..114D9) 5786 \p{Script: Tnsa} \p{Script=Tangsa} (= \p{Script_Extensions= 5787 Tangsa}) (89) 5788 \p{Script: Toto} \p{Script_Extensions=Toto} (Short: \p{Sc= 5789 Toto}, \p{Toto}) (31) 5790 \p{Script: Ugar} \p{Script=Ugaritic} (= 5791 \p{Script_Extensions=Ugaritic}) (31) 5792 \p{Script: Ugaritic} \p{Script_Extensions=Ugaritic} (Short: 5793 \p{Sc=Ugar}, \p{Ugar}) (31) 5794 \p{Script: Unknown} \p{Script_Extensions=Unknown} (Short: 5795 \p{Sc=Zzzz}, \p{Zzzz}) (964_861 plus all 5796 above-Unicode code points) 5797 \p{Script: Vai} \p{Script_Extensions=Vai} (Short: \p{Sc= 5798 Vai}, \p{Vai}) (300) 5799 \p{Script: Vaii} \p{Script=Vai} (= \p{Script_Extensions= 5800 Vai}) (300) 5801 \p{Script: Vith} \p{Script=Vithkuqi} (= 5802 \p{Script_Extensions=Vithkuqi}) (70) 5803 \p{Script: Vithkuqi} \p{Script_Extensions=Vithkuqi} (Short: 5804 \p{Sc=Vith}, \p{Vith}) (70) 5805 \p{Script: Wancho} \p{Script_Extensions=Wancho} (Short: 5806 \p{Sc=Wcho}, \p{Wcho}) (59) 5807 \p{Script: Wara} \p{Script=Warang_Citi} (= 5808 \p{Script_Extensions=Warang_Citi}) (84) 5809 \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short: 5810 \p{Sc=Wara}, \p{Wara}) (84) 5811 \p{Script: Wcho} \p{Script=Wancho} (= \p{Script_Extensions= 5812 Wancho}) (59) 5813 \p{Script: Xpeo} 
                          \p{Script=Old_Persian} (=
                            \p{Script_Extensions=Old_Persian}) (50)
  \p{Script: Xsux}        \p{Script=Cuneiform} (=
                            \p{Script_Extensions=Cuneiform}) (1234)
  \p{Script: Yezi}        \p{Script=Yezidi} (47)
  \p{Script: Yezidi}      (Short: \p{Sc=Yezi}) (47: U+10E80..10EA9,
                            U+10EAB..10EAD, U+10EB0..10EB1)
  \p{Script: Yi}          (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
                            U+A490..A4C6)
  \p{Script: Yiii}        \p{Script=Yi} (1220)
  \p{Script: Zanabazar_Square} \p{Script_Extensions=
                            Zanabazar_Square} (Short: \p{Sc=Zanb},
                            \p{Zanb}) (72)
  \p{Script: Zanb}        \p{Script=Zanabazar_Square} (=
                            \p{Script_Extensions=Zanabazar_Square})
                            (72)
  \p{Script: Zinh}        \p{Script=Inherited} (657)
  \p{Script: Zyyy}        \p{Script=Common} (8301)
  \p{Script: Zzzz}        \p{Script=Unknown} (=
                            \p{Script_Extensions=Unknown}) (964_861
                            plus all above-Unicode code points)
  \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (90:
                            U+061F, U+0640, U+1E900..1E94B,
                            U+1E950..1E959, U+1E95E..1E95F)
  \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (90)
  \p{Script_Extensions: Aghb} \p{Script_Extensions=
                            Caucasian_Albanian} (53)
  \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (65:
                            U+11700..1171A, U+1171D..1172B,
                            U+11730..11746)
  \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
                            \p{Hluw}) (583: U+14400..14646)
  \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1414)
  \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
                            (1414: U+0600..0604, U+0606..06DC,
                            U+06DE..06FF, U+0750..077F,
                            U+0870..088E, U+0890..0891 ...)
  \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
                            (96: U+0531..0556, U+0559..058A,
                            U+058D..058F, U+FB13..FB17)
  \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
                            (31)
  \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
  \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
                            (61: U+10B00..10B35, U+10B39..10B3F)
  \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
  \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (124)
  \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
                            (124: U+1B00..1B4C, U+1B50..1B7E)
  \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
  \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
                            U+A6A0..A6F7, U+16800..16A38)
  \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
  \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
                            (36: U+16AD0..16AED, U+16AF0..16AF5)
  \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
                            U+1BC0..1BF3, U+1BFC..1BFF)
  \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
  \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
  \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
                            (113: U+0951..0952, U+0964..0965,
                            U+0980..0983, U+0985..098C,
                            U+098F..0990, U+0993..09A8 ...)
  \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
                            (97: U+11C00..11C08, U+11C0A..11C36,
                            U+11C38..11C45, U+11C50..11C6C)
  \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
  \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (117)
  \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
                            (117: U+02EA..02EB, U+3001..3003,
                            U+3008..3011, U+3013..301F,
                            U+302A..302D, U+3030 ...)
  \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (115)
  \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
                            (115: U+11000..1104D, U+11052..11075,
                            U+1107F)
  \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
  \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
                            (256: U+2800..28FF)
  \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
  \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
                            (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
  \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
  \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
                            U+1735..1736, U+1740..1753)
  \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (91)
  \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
                            \p{Cans}) (726: U+1400..167F,
                            U+18B0..18F5, U+11AB0..11ABF)
  \p{Script_Extensions: Cans} \p{Script_Extensions=
                            Canadian_Aboriginal} (726)
  \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
  \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
                            U+102A0..102D0)
  \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
                            \p{Aghb}) (53: U+10530..10563, U+1056F)
  \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (91:
                            U+09E6..09EF, U+1040..1049,
                            U+11100..11134, U+11136..11147)
  \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
                            U+AA00..AA36, U+AA40..AA4D,
                            U+AA50..AA59, U+AA5C..AA5F)
  \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
  \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
                            (172: U+13A0..13F5, U+13F8..13FD,
                            U+AB70..ABBF)
  \p{Script_Extensions: Chorasmian} (Short: \p{Scx=Chrs}, \p{Chrs})
                            (28: U+10FB0..10FCB)
  \p{Script_Extensions: Chrs} \p{Script_Extensions=Chorasmian} (28)
  \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
                            (7873:
                            [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
                            \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xa9
                            \xab-\xb9\xbb-\xbf\xd7\xf7],
                            U+02B9..02DF, U+02E5..02E9,
                            U+02EC..02FF, U+0374, U+037E ...)
  \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
  \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
                            (165: U+03E2..03EF, U+2C80..2CF3,
                            U+2CF9..2CFF, U+102E0..102FB)
  \p{Script_Extensions: Cpmn} \p{Script_Extensions=Cypro_Minoan}
                            (101)
  \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
  \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
                            (1234: U+12000..12399, U+12400..1246E,
                            U+12470..12474, U+12480..12543)
  \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
                            (112: U+10100..10102, U+10107..10133,
                            U+10137..1013F, U+10800..10805, U+10808,
                            U+1080A..10835 ...)
  \p{Script_Extensions: Cypro_Minoan} (Short: \p{Scx=Cpmn},
                            \p{Cpmn}) (101: U+10100..10101,
                            U+12F90..12FF2)
  \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
                            (510: U+0400..052F, U+1C80..1C88,
                            U+1D2B, U+1D78, U+1DF8, U+2DE0..2DFF ...)
  \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (510)
  \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
                            (80: U+10400..1044F)
  \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (220)
  \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
                            (220: U+0900..0952, U+0955..097F,
                            U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
                            U+A830..A839 ...)
  \p{Script_Extensions: Diak} \p{Script_Extensions=Dives_Akuru} (72)
  \p{Script_Extensions: Dives_Akuru} (Short: \p{Scx=Diak}, \p{Diak})
                            (72: U+11900..11906, U+11909,
                            U+1190C..11913, U+11915..11916,
                            U+11918..11935, U+11937..11938 ...)
  \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
  \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
                            U+0964..096F, U+A830..A839,
                            U+11800..1183B)
  \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
  \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
  \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
                            (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
                            U+1BC80..1BC88, U+1BC90..1BC99,
                            U+1BC9C..1BCA3)
  \p{Script_Extensions: Egyp} \p{Script_Extensions=
                            Egyptian_Hieroglyphs} (1110)
  \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
                            \p{Egyp}) (1110: U+13000..13455)
  \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
  \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
                            (40: U+10500..10527)
  \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
  \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
                            (23: U+10FE0..10FF6)
  \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (523)
  \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
                            (523: U+1200..1248, U+124A..124D,
                            U+1250..1256, U+1258, U+125A..125D,
                            U+1260..1288 ...)
  \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (174)
  \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
                            (174: U+10A0..10C5, U+10C7, U+10CD,
                            U+10D0..10FF, U+1C90..1CBA, U+1CBD..1CBF
                            ...)
  \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (138)
  \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
                            (138: U+0484, U+0487, U+2C00..2C5F,
                            U+2E43, U+A66F, U+1E000..1E006 ...)
  \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
                            (65)
  \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
                            (77)
  \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
  \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
                            U+10330..1034A)
  \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
  \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
                            (116: U+0951..0952, U+0964..0965,
                            U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
                            U+1CF2..1CF4 ...)
  \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
                            U+0342, U+0345, U+0370..0373,
                            U+0375..0377, U+037A..037D, U+037F ...)
  \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
  \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
                            (105: U+0951..0952, U+0964..0965,
                            U+0A81..0A83, U+0A85..0A8D,
                            U+0A8F..0A91, U+0A93..0AA8 ...)
  \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
  \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
                            \p{Gong}) (65: U+0964..0965,
                            U+11D60..11D65, U+11D67..11D68,
                            U+11D6A..11D8E, U+11D90..11D91,
                            U+11D93..11D98 ...)
  \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
                            (94: U+0951..0952, U+0964..0965,
                            U+0A01..0A03, U+0A05..0A0A,
                            U+0A0F..0A10, U+0A13..0A28 ...)
  \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
  \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (98_696:
                            U+2E80..2E99, U+2E9B..2EF3,
                            U+2F00..2FD5, U+3001..3003,
                            U+3005..3011, U+3013..301F ...)
  \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
  \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
                            (11_775: U+1100..11FF, U+3001..3003,
                            U+3008..3011, U+3013..301F,
                            U+302E..3030, U+3037 ...)
  \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (98_696)
  \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
                            \p{Rohg}) (55: U+060C, U+061B, U+061F,
                            U+0640, U+06D4, U+10D00..10D27 ...)
  \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
  \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
                            (23: U+1720..1736)
  \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
  \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
                            U+108E0..108F2, U+108F4..108F5,
                            U+108FB..108FF)
  \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
  \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
                            (134: U+0591..05C7, U+05D0..05EA,
                            U+05EF..05F4, U+FB1D..FB36,
                            U+FB38..FB3C, U+FB3E ...)
  \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (433)
  \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
                            (433: U+3001..3003, U+3008..3011,
                            U+3013..301F, U+3030..3035, U+3037,
                            U+303C..303D ...)
  \p{Script_Extensions: Hluw} \p{Script_Extensions=
                            Anatolian_Hieroglyphs} (583)
  \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
                            (127)
  \p{Script_Extensions: Hmnp} \p{Script_Extensions=
                            Nyiakeng_Puachue_Hmong} (71)
  \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
                            (108)
  \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
                            \p{Armi}) (31: U+10840..10855,
                            U+10857..1085F)
  \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
                            (586: U+0300..0341, U+0343..0344,
                            U+0346..0362, U+0953..0954,
                            U+1AB0..1ACE, U+1DC2..1DF7 ...)
  \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
                            \p{Phli}) (27: U+10B60..10B72,
                            U+10B78..10B7F)
  \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
                            Prti}, \p{Prti}) (30: U+10B40..10B55,
                            U+10B58..10B5F)
  \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
  \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
  \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
                            (91: U+A980..A9CD, U+A9CF..A9D9,
                            U+A9DE..A9DF)
  \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (88:
                            U+0966..096F, U+A830..A839,
                            U+11080..110C2, U+110CD)
  \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
  \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (373)
  \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
                            (106: U+0951..0952, U+0964..0965,
                            U+0C80..0C8C, U+0C8E..0C90,
                            U+0C92..0CA8, U+0CAA..0CB3 ...)
  \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
                            (373: U+3001..3003, U+3008..3011,
                            U+3013..301F, U+3030..3035, U+3037,
                            U+303C..303D ...)
  \p{Script_Extensions: Kawi} (Short: \p{Scx=Kawi}, \p{Kawi}) (86:
                            U+11F00..11F10, U+11F12..11F3A,
                            U+11F3E..11F59)
  \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
                            (48: U+A900..A92F)
  \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
  \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
                            (68: U+10A00..10A03, U+10A05..10A06,
                            U+10A0C..10A13, U+10A15..10A17,
                            U+10A19..10A35, U+10A38..10A3A ...)
  \p{Script_Extensions: Khitan_Small_Script} (Short: \p{Scx=Kits},
                            \p{Kits}) (471: U+16FE4, U+18B00..18CD5)
  \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
                            U+1780..17DD, U+17E0..17E9,
                            U+17F0..17F9, U+19E0..19FF)
  \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
  \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (85)
  \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (85:
                            U+0AE6..0AEF, U+A830..A839,
                            U+11200..11211, U+11213..11241)
  \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
                            (81: U+0964..0965, U+A830..A839,
                            U+112B0..112EA, U+112F0..112F9)
  \p{Script_Extensions: Kits} \p{Script_Extensions=
                            Khitan_Small_Script} (471)
  \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (106)
  \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (88)
  \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
  \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (83:
                            U+0E81..0E82, U+0E84, U+0E86..0E8A,
                            U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
  \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (83)
  \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
                            (1510: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
                            \xf6\xf8-\xff], U+0100..02B8,
                            U+02E0..02E4, U+0363..036F,
                            U+0485..0486, U+0951..0952 ...)
  \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1510)
  \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
  \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
                            U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
  \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
  \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
                            U+0965, U+1900..191E, U+1920..192B,
                            U+1930..193B, U+1940, U+1944..194F)
  \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
  \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
  \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
                            (386: U+10107..10133, U+10600..10736,
                            U+10740..10755, U+10760..10767)
  \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
                            (268: U+10000..1000B, U+1000D..10026,
                            U+10028..1003A, U+1003C..1003D,
                            U+1003F..1004D, U+10050..1005D ...)
  \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (49:
                            U+A4D0..A4FF, U+11FB0)
  \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
  \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
                            U+10280..1029C)
  \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
  \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
                            U+10920..10939, U+1093F)
  \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
                            (61: U+0964..096F, U+A830..A839,
                            U+11150..11176)
  \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
  \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
  \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
                            (25: U+11EE0..11EF8)
  \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
                            (126: U+0951..0952, U+0964..0965,
                            U+0D00..0D0C, U+0D0E..0D10,
                            U+0D12..0D44, U+0D46..0D48 ...)
  \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
  \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
                            (30: U+0640, U+0840..085B, U+085E)
  \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
  \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
                            (52: U+0640, U+10AC0..10AE6,
                            U+10AEB..10AF6)
  \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
  \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
                            (68: U+11C70..11C8F, U+11C92..11CA7,
                            U+11CA9..11CB6)
  \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
                            \p{Gonm}) (77: U+0964..0965,
                            U+11D00..11D06, U+11D08..11D09,
                            U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
                            ...)
  \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
                            (91: U+16E40..16E9A)
  \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
  \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
                            \p{Mtei}) (79: U+AAE0..AAF6,
                            U+ABC0..ABED, U+ABF0..ABF9)
  \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
                            (213)
  \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
                            \p{Mend}) (213: U+1E800..1E8C4,
                            U+1E8C7..1E8D6)
  \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
                            (90)
  \p{Script_Extensions: Mero} \p{Script_Extensions=
                            Meroitic_Hieroglyphs} (32)
  \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
                            \p{Merc}) (90: U+109A0..109B7,
                            U+109BC..109CF, U+109D2..109FF)
  \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
                            \p{Mero}) (32: U+10980..1099F)
  \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
                            U+16F00..16F4A, U+16F4F..16F87,
                            U+16F8F..16F9F)
  \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (126)
  \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
                            U+A830..A839, U+11600..11644,
                            U+11650..11659)
  \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (172)
  \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
                            (172: U+1800..1819, U+1820..1878,
                            U+1880..18AA, U+202F, U+11660..1166C)
  \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
                            U+16A40..16A5E, U+16A60..16A69,
                            U+16A6E..16A6F)
  \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
  \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
  \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
  \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
                            (48: U+0A66..0A6F, U+11280..11286,
                            U+11288, U+1128A..1128D, U+1128F..1129D,
                            U+1129F..112A9)
  \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
                            (224: U+1000..109F, U+A92E,
                            U+A9E0..A9FE, U+AA60..AA7F)
  \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
  \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
                            (40: U+10880..1089E, U+108A7..108AF)
  \p{Script_Extensions: Nag_Mundari} (Short: \p{Scx=Nagm}, \p{Nagm})
                            (42: U+1E4D0..1E4F9)
  \p{Script_Extensions: Nagm} \p{Script_Extensions=Nag_Mundari} (42)
  \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
  \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
                            (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
                            U+1CF2, U+1CFA, U+A830..A835 ...)
  \p{Script_Extensions: Narb} \p{Script_Extensions=
                            Old_North_Arabian} (32)
  \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
  \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
                            (83: U+1980..19AB, U+19B0..19C9,
                            U+19D0..19DA, U+19DE..19DF)
  \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (97:
                            U+11400..1145B, U+1145D..11461)
  \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (67:
                            U+060C, U+061B, U+061F, U+07C0..07FA,
                            U+07FD..07FF, U+FD3E..FD3F)
  \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (67)
  \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
  \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
                            U+16FE1, U+1B170..1B2FB)
  \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
                            Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
                            U+1E130..1E13D, U+1E140..1E149,
                            U+1E14E..1E14F)
  \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
  \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
                            U+1680..169C)
  \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
                            (48: U+1C50..1C7F)
  \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
  \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
                            \p{Hung}) (108: U+10C80..10CB2,
                            U+10CC0..10CF2, U+10CFA..10CFF)
  \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
                            (39: U+10300..10323, U+1032D..1032F)
  \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
                            \p{Narb}) (32: U+10A80..10A9F)
  \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
                            (44: U+0483, U+10350..1037A)
  \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
                            (50: U+103A0..103C3, U+103C8..103D5)
  \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
                            (40: U+10F00..10F27)
  \p{Script_Extensions: Old_South_Arabian} (Short:
                            \p{Scx=Sarb}, \p{Sarb}) (32: U+10A60..10A7F)
  \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
                            (73: U+10C00..10C48)
  \p{Script_Extensions: Old_Uyghur} (Short: \p{Scx=Ougr}, \p{Ougr})
                            (28: U+0640, U+10AF2, U+10F70..10F89)
  \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (97:
                            U+0951..0952, U+0964..0965,
                            U+0B01..0B03, U+0B05..0B0C,
                            U+0B0F..0B10, U+0B13..0B28 ...)
  \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
  \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (97)
  \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
                            U+104B0..104D3, U+104D8..104FB)
  \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
  \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
  \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
                            (40: U+10480..1049D, U+104A0..104A9)
  \p{Script_Extensions: Ougr} \p{Script_Extensions=Old_Uyghur} (28)
  \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
                            \p{Hmng}) (127: U+16B00..16B45,
                            U+16B50..16B59, U+16B5B..16B61,
                            U+16B63..16B77, U+16B7D..16B8F)
  \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
  \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
                            (32: U+10860..1087F)
  \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
                            (57: U+11AC0..11AF8)
  \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
  \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
  \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
  \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
                            (59: U+1802..1803, U+1805, U+A840..A877)
  \p{Script_Extensions: Phli} \p{Script_Extensions=
                            Inscriptional_Pahlavi} (27)
  \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
                            (30)
  \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
  \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
                            (29: U+10900..1091B, U+1091F)
  \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
  \p{Script_Extensions: Prti} \p{Script_Extensions=
                            Inscriptional_Parthian} (30)
  \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
                            \p{Phlp}) (30: U+0640, U+10B80..10B91,
                            U+10B99..10B9C, U+10BA9..10BAF)
  \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
  \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (586)
  \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
                            U+A930..A953, U+A95F)
  \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
  \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
                            (55)
  \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
                            U+16A0..16EA, U+16EE..16F8)
  \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
  \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
                            (61: U+0800..082D, U+0830..083E)
  \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
  \p{Script_Extensions: Sarb} \p{Script_Extensions=
                            Old_South_Arabian} (32)
  \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
  \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
                            (82: U+A880..A8C5, U+A8CE..A8D9)
  \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
  \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
                            (102: U+0951, U+1CD7, U+1CD9,
                            U+1CDC..1CDD, U+1CE0, U+11180..111DF)
  \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
                            (48: U+10450..1047F)
  \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
  \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (102)
  \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
  \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
                            (92: U+11580..115B5, U+115B8..115DD)
  \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
                            (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
                            U+1DAA1..1DAAF)
  \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
  \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (113)
  \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
                            (113: U+0964..0965, U+0D81..0D83,
                            U+0D85..0D96, U+0D9A..0DB1,
                            U+0DB3..0DBB, U+0DBD ...)
  \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
  \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
                            (43: U+0640, U+10F30..10F59)
  \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
  \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
  \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
                            \p{Sora}) (35: U+110D0..110E8,
                            U+110F0..110F9)
  \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
  \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
                            (83: U+11A50..11AA2)
  \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
  \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
                            (72: U+1B80..1BBF, U+1CC0..1CC7)
  \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (57)
  \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
                            \p{Sylo}) (57: U+0964..0965,
                            U+09E6..09EF, U+A800..A82C)
  \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (107)
  \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
                            (107: U+060C, U+061B..061C, U+061F,
                            U+0640, U+064B..0655, U+0670 ...)
  \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
                            (25: U+1700..1715, U+171F, U+1735..1736)
  \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
  \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
                            (20: U+1735..1736, U+1760..176C,
                            U+176E..1770, U+1772..1773)
  \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
                            U+1040..1049, U+1950..196D, U+1970..1974)
  \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
                            (127: U+1A20..1A5E, U+1A60..1A7C,
                            U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
  \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
                            (72: U+AA80..AAC2, U+AADB..AADF)
  \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (80)
  \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (80:
                            U+0964..0965, U+A830..A839,
                            U+11680..116B9, U+116C0..116C9)
  \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
  \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
  \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
                            U+0951..0952, U+0964..0965,
                            U+0B82..0B83, U+0B85..0B8A,
                            U+0B8E..0B90, U+0B92..0B95 ...)
  \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
  \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6914)
  \p{Script_Extensions: Tangsa} (Short: \p{Scx=Tnsa}, \p{Tnsa}) (89:
                            U+16A70..16ABE, U+16AC0..16AC9)
  \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
                            (6914: U+16FE0, U+17000..187F7,
                            U+18800..18AFF, U+18D00..18D08)
  \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
  \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (106)
  \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
                            (106: U+0951..0952, U+0964..0965,
                            U+0C00..0C0C, U+0C0E..0C10,
                            U+0C12..0C28, U+0C2A..0C39 ...)
  \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
  \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (25)
  \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
  \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
                            U+060C, U+061B..061C, U+061F,
                            U+0660..0669, U+0780..07B1, U+FDF2 ...)
  \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
                            U+0E01..0E3A, U+0E40..0E5B)
  \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
                            (207: U+0F00..0F47, U+0F49..0F6C,
                            U+0F71..0F97, U+0F99..0FBC,
                            U+0FBE..0FCC, U+0FCE..0FD4 ...)
  \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
  \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
                            (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
  \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
  \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
                            (97: U+0951..0952, U+0964..0965, U+1CF2,
                            U+A830..A839, U+11480..114C7,
                            U+114D0..114D9)
  \p{Script_Extensions: Tnsa} \p{Script_Extensions=Tangsa} (89)
  \p{Script_Extensions: Toto} (Short: \p{Scx=Toto}, \p{Toto}) (31:
                            U+1E290..1E2AE)
  \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
  \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
                            (31: U+10380..1039D, U+1039F)
  \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
                            (964_861 plus all above-Unicode code
                            points: U+0378..0379, U+0380..0383,
                            U+038B, U+038D, U+03A2, U+0530 ...)
  \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
                            U+A500..A62B)
  \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
  \p{Script_Extensions: Vith} \p{Script_Extensions=Vithkuqi} (70)
  \p{Script_Extensions: Vithkuqi} (Short: \p{Scx=Vith}, \p{Vith})
                            (70: U+10570..1057A, U+1057C..1058A,
                            U+1058C..10592, U+10594..10595,
                            U+10597..105A1, U+105A3..105B1 ...)
6456 \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59: 6457 U+1E2C0..1E2F9, U+1E2FF) 6458 \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84) 6459 \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara}) 6460 (84: U+118A0..118F2, U+118FF) 6461 \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59) 6462 \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50) 6463 \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234) 6464 \p{Script_Extensions: Yezi} \p{Script_Extensions=Yezidi} (60) 6465 \p{Script_Extensions: Yezidi} (Short: \p{Scx=Yezi}, \p{Yezi}) (60: 6466 U+060C, U+061B, U+061F, U+0660..0669, 6467 U+10E80..10EA9, U+10EAB..10EAD ...) 6468 \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246: 6469 U+3001..3002, U+3008..3011, 6470 U+3014..301B, U+30FB, U+A000..A48C, 6471 U+A490..A4C6 ...) 6472 \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246) 6473 \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb}, 6474 \p{Zanb}) (72: U+11A00..11A47) 6475 \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square} 6476 (72) 6477 \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (586) 6478 \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7873) 6479 \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (964_861 6480 plus all above-Unicode code points) 6481 \p{Scx: *} \p{Script_Extensions: *} 6482 \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (50) 6483 \p{SD: *} \p{Soft_Dotted: *} 6484 \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4) 6485 \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024, 6486 U+FE52, U+FF0E) 6487 \p{Sentence_Break: CL} \p{Sentence_Break=Close} (195) 6488 \p{Sentence_Break: Close} (Short: \p{SB=CL}) (195: [\"\'\(\)\[\] 6489 \{\}\xab\xbb], U+0F3A..0F3D, 6490 U+169B..169C, U+2018..201F, 6491 U+2039..203A, U+2045..2046 ...) 
6492 \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1: [\r]) 6493 \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2550) 6494 \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2550: U+0300..036F, 6495 U+0483..0489, U+0591..05BD, U+05BF, 6496 U+05C1..05C2, U+05C4..05C5 ...) 6497 \p{Sentence_Break: FO} \p{Sentence_Break=Format} (72) 6498 \p{Sentence_Break: Format} (Short: \p{SB=FO}) (72: [\xad], 6499 U+0600..0605, U+061C, U+06DD, U+070F, 6500 U+0890..0891 ...) 6501 \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (132_036) 6502 \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1: [\n]) 6503 \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2497) 6504 \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2497: [a-z\xaa\xb5 6505 \xba\xdf-\xf6\xf8-\xff], U+0101, U+0103, 6506 U+0105, U+0107, U+0109 ...) 6507 \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (682) 6508 \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (682: [0-9], 6509 U+0660..0669, U+066B..066C, 6510 U+06F0..06F9, U+07C0..07C9, U+0966..096F 6511 ...) 6512 \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (132_036: U+01BB, 6513 U+01C0..01C3, U+0294, U+02B9..02BF, 6514 U+02C6..02D1, U+02EC ...) 6515 \p{Sentence_Break: Other} (Short: \p{SB=XX}) (973_938 plus all 6516 above-Unicode code points: [^\t\n\cK\f 6517 \r\x20!\"\'\(\),\-.0-9:?A-Z\[\]a-z\{\} 6518 \x85\xa0\xaa-\xab\xad\xb5\xba-\xbb\xc0- 6519 \xd6\xd8-\xf6\xf8-\xff], U+02C2..02C5, 6520 U+02D2..02DF, U+02E5..02EB, U+02ED, 6521 U+02EF..02FF ...) 6522 \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26) 6523 \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:], 6524 U+055D, U+060C..060D, U+07F8, U+1802, 6525 U+1808 ...) 
6526 \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3) 6527 \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85], 6528 U+2028..2029) 6529 \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0], 6530 U+1680, U+2000..200A, U+202F, U+205F, 6531 U+3000) 6532 \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (151) 6533 \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (151: [!?], U+0589, 6534 U+061D..061F, U+06D4, U+0700..0702, 6535 U+07F9 ...) 6536 \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1936) 6537 \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1936: [A-Z\xc0-\xd6 6538 \xd8-\xde], U+0100, U+0102, U+0104, 6539 U+0106, U+0108 ...) 6540 \p{Sentence_Break: XX} \p{Sentence_Break=Other} (973_938 plus all 6541 above-Unicode code points) 6542 \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm}) 6543 (154) 6544 \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm}) 6545 (1_113_958 plus all above-Unicode code 6546 points: [\x00-\x20\"#\$\%&\'\(\)*+,\- 6547 \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f- 6548 \xff], U+0100..0588, U+058A..061C, 6549 U+0620..06D3, U+06D5..06FF, U+0703..07F8 6550 ...) 6551 \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (154: 6552 [!.?], U+0589, U+061D..061F, U+06D4, 6553 U+0700..0702, U+07F9 ...) 
6554 \p{Separator} \p{General_Category=Separator} (Short: 6555 \p{Z}) (19) 6556 \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions= 6557 SignWriting}) (672) 6558 \p{Sharada} \p{Script_Extensions=Sharada} (Short: 6559 \p{Shrd}; NOT \p{Block=Sharada}) (102) 6560 \p{Shavian} \p{Script_Extensions=Shavian} (Short: 6561 \p{Shaw}) (48) 6562 \p{Shaw} \p{Shavian} (= \p{Script_Extensions= 6563 Shavian}) (48) 6564 X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls} 6565 (16) 6566 \p{Shrd} \p{Sharada} (= \p{Script_Extensions= 6567 Sharada}) (NOT \p{Block=Sharada}) (102) 6568 \p{Sidd} \p{Siddham} (= \p{Script_Extensions= 6569 Siddham}) (NOT \p{Block=Siddham}) (92) 6570 \p{Siddham} \p{Script_Extensions=Siddham} (Short: 6571 \p{Sidd}; NOT \p{Block=Siddham}) (92) 6572 \p{SignWriting} \p{Script_Extensions=SignWriting} (Short: 6573 \p{Sgnw}) (672) 6574 \p{Sind} \p{Khudawadi} (= \p{Script_Extensions= 6575 Khudawadi}) (NOT \p{Block=Khudawadi}) 6576 (81) 6577 \p{Sinh} \p{Sinhala} (= \p{Script_Extensions= 6578 Sinhala}) (NOT \p{Block=Sinhala}) (113) 6579 \p{Sinhala} \p{Script_Extensions=Sinhala} (Short: 6580 \p{Sinh}; NOT \p{Block=Sinhala}) (113) 6581 X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32) 6582 \p{Sk} \p{Modifier_Symbol} (= 6583 \p{General_Category=Modifier_Symbol}) 6584 (125) 6585 \p{Sm} \p{Math_Symbol} (= \p{General_Category= 6586 Math_Symbol}) (948) 6587 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short: 6588 \p{InSmallForms}) (32) 6589 X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block= 6590 Small_Form_Variants}) (32) 6591 X \p{Small_Kana_Ext} \p{Small_Kana_Extension} (= \p{Block= 6592 Small_Kana_Extension}) (64) 6593 X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short: 6594 \p{InSmallKanaExt}) (64) 6595 \p{So} \p{Other_Symbol} (= \p{General_Category= 6596 Other_Symbol}) (6634) 6597 \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (50) 6598 \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_062 plus 6599 all 
above-Unicode code points: [\x00- 6600 \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A- 6601 Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff], 6602 U+0100..012E, U+0130..0248, 6603 U+024A..0267, U+0269..029C, U+029E..02B1 6604 ...) 6605 \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (50: [i-j], 6606 U+012F, U+0249, U+0268, U+029D, U+02B2 6607 ...) 6608 \p{Sogd} \p{Sogdian} (= \p{Script_Extensions= 6609 Sogdian}) (NOT \p{Block=Sogdian}) (43) 6610 \p{Sogdian} \p{Script_Extensions=Sogdian} (Short: 6611 \p{Sogd}; NOT \p{Block=Sogdian}) (43) 6612 \p{Sogo} \p{Old_Sogdian} (= \p{Script_Extensions= 6613 Old_Sogdian}) (NOT \p{Block= 6614 Old_Sogdian}) (40) 6615 \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions= 6616 Sora_Sompeng}) (NOT \p{Block= 6617 Sora_Sompeng}) (35) 6618 \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short: 6619 \p{Sora}; NOT \p{Block=Sora_Sompeng}) 6620 (35) 6621 \p{Soyo} \p{Soyombo} (= \p{Script_Extensions= 6622 Soyombo}) (NOT \p{Block=Soyombo}) (83) 6623 \p{Soyombo} \p{Script_Extensions=Soyombo} (Short: 6624 \p{Soyo}; NOT \p{Block=Soyombo}) (83) 6625 \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25) 6626 \p{Space: *} \p{White_Space: *} 6627 \p{Space_Separator} \p{General_Category=Space_Separator} 6628 (Short: \p{Zs}) (17) 6629 \p{SpacePerl} \p{XPosixSpace} (25) 6630 \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short: 6631 \p{Mc}) (452) 6632 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters} 6633 (Short: \p{InModifierLetters}) (80) 6634 X \p{Specials} \p{Block=Specials} (16) 6635 \p{STerm} \p{Sentence_Terminal} (= 6636 \p{Sentence_Terminal=Y}) (154) 6637 \p{STerm: *} \p{Sentence_Terminal: *} 6638 \p{Sund} \p{Sundanese} (= \p{Script_Extensions= 6639 Sundanese}) (NOT \p{Block=Sundanese}) 6640 (72) 6641 \p{Sundanese} \p{Script_Extensions=Sundanese} (Short: 6642 \p{Sund}; NOT \p{Block=Sundanese}) (72) 6643 X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block= 6644 Sundanese_Supplement}) (16) 6645 X \p{Sundanese_Supplement} 
\p{Block=Sundanese_Supplement} (Short: 6646 \p{InSundaneseSup}) (16) 6647 X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block= 6648 Supplemental_Arrows_A}) (16) 6649 X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block= 6650 Supplemental_Arrows_B}) (128) 6651 X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block= 6652 Supplemental_Arrows_C}) (256) 6653 X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (= 6654 \p{Block= 6655 Supplemental_Mathematical_Operators}) 6656 (256) 6657 X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (= 6658 \p{Block= 6659 Supplementary_Private_Use_Area_A}) 6660 (65_536) 6661 X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (= 6662 \p{Block= 6663 Supplementary_Private_Use_Area_B}) 6664 (65_536) 6665 X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block= 6666 Supplemental_Punctuation}) (128) 6667 X \p{Sup_Symbols_And_Pictographs} 6668 \p{Supplemental_Symbols_And_Pictographs} 6669 (= \p{Block= 6670 Supplemental_Symbols_And_Pictographs}) 6671 (256) 6672 X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (= 6673 \p{Block=Superscripts_And_Subscripts}) 6674 (48) 6675 X \p{Superscripts_And_Subscripts} \p{Block= 6676 Superscripts_And_Subscripts} (Short: 6677 \p{InSuperAndSub}) (48) 6678 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short: 6679 \p{InSupArrowsA}) (16) 6680 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short: 6681 \p{InSupArrowsB}) (128) 6682 X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short: 6683 \p{InSupArrowsC}) (256) 6684 X \p{Supplemental_Mathematical_Operators} \p{Block= 6685 Supplemental_Mathematical_Operators} 6686 (Short: \p{InSupMathOperators}) (256) 6687 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation} 6688 (Short: \p{InSupPunctuation}) (128) 6689 X \p{Supplemental_Symbols_And_Pictographs} \p{Block= 6690 Supplemental_Symbols_And_Pictographs} 6691 (Short: \p{InSupSymbolsAndPictographs}) 6692 (256) 6693 X 
\p{Supplementary_Private_Use_Area_A} \p{Block= 6694 Supplementary_Private_Use_Area_A} 6695 (Short: \p{InSupPUAA}) (65_536) 6696 X \p{Supplementary_Private_Use_Area_B} \p{Block= 6697 Supplementary_Private_Use_Area_B} 6698 (Short: \p{InSupPUAB}) (65_536) 6699 \p{Surrogate} \p{General_Category=Surrogate} (Short: 6700 \p{Cs}) (2048) 6701 X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688) 6702 \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions= 6703 Syloti_Nagri}) (NOT \p{Block= 6704 Syloti_Nagri}) (57) 6705 \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short: 6706 \p{Sylo}; NOT \p{Block=Syloti_Nagri}) 6707 (57) 6708 \p{Symbol} \p{General_Category=Symbol} (Short: \p{S}) 6709 (7770) 6710 X \p{Symbols_And_Pictographs_Ext_A} 6711 \p{Symbols_And_Pictographs_Extended_A} 6712 (= \p{Block= 6713 Symbols_And_Pictographs_Extended_A}) 6714 (144) 6715 X \p{Symbols_And_Pictographs_Extended_A} \p{Block= 6716 Symbols_And_Pictographs_Extended_A} (144) 6717 X \p{Symbols_For_Legacy_Computing} \p{Block= 6718 Symbols_For_Legacy_Computing} (256) 6719 \p{Syrc} \p{Syriac} (= \p{Script_Extensions= 6720 Syriac}) (NOT \p{Block=Syriac}) (107) 6721 \p{Syriac} \p{Script_Extensions=Syriac} (Short: 6722 \p{Syrc}; NOT \p{Block=Syriac}) (107) 6723 X \p{Syriac_Sup} \p{Syriac_Supplement} (= \p{Block= 6724 Syriac_Supplement}) (16) 6725 X \p{Syriac_Supplement} \p{Block=Syriac_Supplement} (Short: 6726 \p{InSyriacSup}) (16) 6727 \p{Tagalog} \p{Script_Extensions=Tagalog} (Short: 6728 \p{Tglg}; NOT \p{Block=Tagalog}) (25) 6729 \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions= 6730 Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20) 6731 \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short: 6732 \p{Tagb}; NOT \p{Block=Tagbanwa}) (20) 6733 X \p{Tags} \p{Block=Tags} (128) 6734 \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short: 6735 \p{Tale}; NOT \p{Block=Tai_Le}) (45) 6736 \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short: 6737 \p{Lana}; NOT \p{Block=Tai_Tham}) (127) 6738 \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} 
(Short: 6739 \p{Tavt}; NOT \p{Block=Tai_Viet}) (72) 6740 X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block= 6741 Tai_Xuan_Jing_Symbols}) (96) 6742 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short: 6743 \p{InTaiXuanJing}) (96) 6744 \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri}) 6745 (NOT \p{Block=Takri}) (80) 6746 \p{Takri} \p{Script_Extensions=Takri} (Short: 6747 \p{Takr}; NOT \p{Block=Takri}) (80) 6748 \p{Tale} \p{Tai_Le} (= \p{Script_Extensions= 6749 Tai_Le}) (NOT \p{Block=Tai_Le}) (45) 6750 \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions= 6751 New_Tai_Lue}) (NOT \p{Block= 6752 New_Tai_Lue}) (83) 6753 \p{Tamil} \p{Script_Extensions=Tamil} (Short: 6754 \p{Taml}; NOT \p{Block=Tamil}) (133) 6755 X \p{Tamil_Sup} \p{Tamil_Supplement} (= \p{Block= 6756 Tamil_Supplement}) (64) 6757 X \p{Tamil_Supplement} \p{Block=Tamil_Supplement} (Short: 6758 \p{InTamilSup}) (64) 6759 \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil}) 6760 (NOT \p{Block=Tamil}) (133) 6761 \p{Tang} \p{Tangut} (= \p{Script_Extensions= 6762 Tangut}) (NOT \p{Block=Tangut}) (6914) 6763 \p{Tangsa} \p{Script_Extensions=Tangsa} (Short: 6764 \p{Tnsa}; NOT \p{Block=Tangsa}) (89) 6765 \p{Tangut} \p{Script_Extensions=Tangut} (Short: 6766 \p{Tang}; NOT \p{Block=Tangut}) (6914) 6767 X \p{Tangut_Components} \p{Block=Tangut_Components} (768) 6768 X \p{Tangut_Sup} \p{Tangut_Supplement} (= \p{Block= 6769 Tangut_Supplement}) (128) 6770 X \p{Tangut_Supplement} \p{Block=Tangut_Supplement} (Short: 6771 \p{InTangutSup}) (128) 6772 \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions= 6773 Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72) 6774 \p{Telu} \p{Telugu} (= \p{Script_Extensions= 6775 Telugu}) (NOT \p{Block=Telugu}) (106) 6776 \p{Telugu} \p{Script_Extensions=Telugu} (Short: 6777 \p{Telu}; NOT \p{Block=Telugu}) (106) 6778 \p{Term} \p{Terminal_Punctuation} (= 6779 \p{Terminal_Punctuation=Y}) (278) 6780 \p{Term: *} \p{Terminal_Punctuation: *} 6781 \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short: 
6782 \p{Term}) (278) 6783 \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term}) 6784 (1_113_834 plus all above-Unicode code 6785 points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0- 6786 9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff], 6787 U+0100..037D, U+037F..0386, 6788 U+0388..0588, U+058A..05C2, U+05C4..060B 6789 ...) 6790 \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (278: 6791 [!,.:;?], U+037E, U+0387, U+0589, 6792 U+05C3, U+060C ...) 6793 \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions= 6794 Tifinagh}) (NOT \p{Block=Tifinagh}) (59) 6795 \p{Tglg} \p{Tagalog} (= \p{Script_Extensions= 6796 Tagalog}) (NOT \p{Block=Tagalog}) (25) 6797 \p{Thaa} \p{Thaana} (= \p{Script_Extensions= 6798 Thaana}) (NOT \p{Block=Thaana}) (66) 6799 \p{Thaana} \p{Script_Extensions=Thaana} (Short: 6800 \p{Thaa}; NOT \p{Block=Thaana}) (66) 6801 \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block= 6802 Thai}) (86) 6803 \p{Tibetan} \p{Script_Extensions=Tibetan} (Short: 6804 \p{Tibt}; NOT \p{Block=Tibetan}) (207) 6805 \p{Tibt} \p{Tibetan} (= \p{Script_Extensions= 6806 Tibetan}) (NOT \p{Block=Tibetan}) (207) 6807 \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short: 6808 \p{Tfng}; NOT \p{Block=Tifinagh}) (59) 6809 \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions= 6810 Tirhuta}) (NOT \p{Block=Tirhuta}) (97) 6811 \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short: 6812 \p{Tirh}; NOT \p{Block=Tirhuta}) (97) 6813 \p{Title} \p{Titlecase} (/i= Cased=Yes) (31) 6814 \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i= 6815 Cased=Yes) (31: U+01C5, U+01C8, U+01CB, 6816 U+01F2, U+1F88..1F8F, U+1F98..1F9F ...) 
6817 \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter} 6818 (Short: \p{Lt}; /i= General_Category= 6819 Cased_Letter) (31) 6820 \p{Tnsa} \p{Tangsa} (= \p{Script_Extensions= 6821 Tangsa}) (NOT \p{Block=Tangsa}) (89) 6822 \p{Toto} \p{Script_Extensions=Toto} (NOT \p{Block= 6823 Toto}) (31) 6824 X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block= 6825 Transport_And_Map_Symbols}) (128) 6826 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols} 6827 (Short: \p{InTransportAndMap}) (128) 6828 X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics} 6829 (= \p{Block= 6830 Unified_Canadian_Aboriginal_Syllabics}) 6831 (640) 6832 X \p{UCAS_Ext} \p{Unified_Canadian_Aboriginal_Syllabics_- 6833 Extended} (= \p{Block= 6834 Unified_Canadian_Aboriginal_Syllabics_- 6835 Extended}) (80) 6836 X \p{UCAS_Ext_A} \p{Unified_Canadian_Aboriginal_Syllabics_- 6837 Extended_A} (= \p{Block= 6838 Unified_Canadian_Aboriginal_Syllabics_- 6839 Extended_A}) (16) 6840 \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions= 6841 Ugaritic}) (NOT \p{Block=Ugaritic}) (31) 6842 \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short: 6843 \p{Ugar}; NOT \p{Block=Ugaritic}) (31) 6844 \p{UIdeo} \p{Unified_Ideograph} (= 6845 \p{Unified_Ideograph=Y}) (97_058) 6846 \p{UIdeo: *} \p{Unified_Ideograph: *} 6847 \p{Unassigned} \p{General_Category=Unassigned} (Short: 6848 \p{Cn}) (825_345 plus all above-Unicode 6849 code points) 6850 \p{Unicode} \p{Any} (1_114_112) 6851 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block= 6852 Unified_Canadian_Aboriginal_Syllabics} 6853 (Short: \p{InUCAS}) (640) 6854 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block= 6855 Unified_Canadian_Aboriginal_Syllabics_- 6856 Extended} (Short: \p{InUCASExt}) (80) 6857 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended_A} \p{Block= 6858 Unified_Canadian_Aboriginal_Syllabics_- 6859 Extended_A} (Short: \p{InUCASExtA}) (16) 6860 \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo}) 6861 (97_058) 6862 
\p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo}) 6863 (1_017_054 plus all above-Unicode code 6864 points: U+0000..33FF, U+4DC0..4DFF, 6865 U+A000..FA0D, U+FA10, U+FA12, 6866 U+FA15..FA1E ...) 6867 \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (97_058: 6868 U+3400..4DBF, U+4E00..9FFF, 6869 U+FA0E..FA0F, U+FA11, U+FA13..FA14, 6870 U+FA1F ...) 6871 \p{Unknown} \p{Script_Extensions=Unknown} (Short: 6872 \p{Zzzz}) (964_861 plus all above- 6873 Unicode code points) 6874 \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i= 6875 Cased=Yes) (1951) 6876 \p{Upper: *} \p{Uppercase: *} 6877 \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i= 6878 Cased=Yes) (1951) 6879 \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased= 6880 No) (1_112_161 plus all above-Unicode 6881 code points: [\x00-\x20!\"#\$\%&\' 6882 \(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-z\{ 6883 \|\}~\x7f-\xbf\xd7\xdf-\xff], U+0101, 6884 U+0103, U+0105, U+0107, U+0109 ...) 6885 \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased= 6886 Yes) (1951: [A-Z\xc0-\xd6\xd8-\xde], 6887 U+0100, U+0102, U+0104, U+0106, U+0108 6888 ...) 
6889 \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter} 6890 (Short: \p{Lu}; /i= General_Category= 6891 Cased_Letter) (1831) 6892 \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block= 6893 Vai}) (300) 6894 \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT 6895 \p{Block=Vai}) (300) 6896 \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS}; 6897 NOT \p{Variation_Selectors}) (260) 6898 \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_852 6899 plus all above-Unicode code points: 6900 U+0000..180A, U+180E, U+1810..FDFF, 6901 U+FE10..E00FF, U+E01F0..infinity) 6902 \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (260: 6903 U+180B..180D, U+180F, U+FE00..FE0F, 6904 U+E0100..E01EF) 6905 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short: 6906 \p{InVS}) (16) 6907 X \p{Variation_Selectors_Supplement} \p{Block= 6908 Variation_Selectors_Supplement} (Short: 6909 \p{InVSSup}) (240) 6910 X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block= 6911 Vedic_Extensions}) (48) 6912 X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short: 6913 \p{InVedicExt}) (48) 6914 X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16) 6915 \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated} 6916 (786_609 plus all above-Unicode code 6917 points) 6918 \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (786_609 plus 6919 all above-Unicode code points: [\x00- 6920 \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf- 6921 \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9, 6922 U+02EC..10FF, U+1200..1400, 6923 U+1680..18AF, U+1900..2015 ...) 6924 \p{Vertical_Orientation: Tr} \p{Vertical_Orientation= 6925 Transformed_Rotated} (47) 6926 \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr}) 6927 (47: U+2329..232A, U+3008..3011, 6928 U+3014..301F, U+3030, U+30A0, U+30FC ...) 6929 \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu}) 6930 (148: U+3001..3002, U+3041, U+3043, 6931 U+3045, U+3047, U+3049 ...) 
6932 \p{Vertical_Orientation: Tu} \p{Vertical_Orientation= 6933 Transformed_Upright} (148) 6934 \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright} 6935 (327_308) 6936 \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (327_308: 6937 [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7], 6938 U+02EA..02EB, U+1100..11FF, 6939 U+1401..167F, U+18B0..18FF, U+2016 ...) 6940 \p{VertSpace} \v (7: [\n\cK\f\r\x85], U+2028..2029) 6941 \p{Vith} \p{Vithkuqi} (= \p{Script_Extensions= 6942 Vithkuqi}) (NOT \p{Block=Vithkuqi}) (70) 6943 \p{Vithkuqi} \p{Script_Extensions=Vithkuqi} (Short: 6944 \p{Vith}; NOT \p{Block=Vithkuqi}) (70) 6945 \p{Vo: *} \p{Vertical_Orientation: *} 6946 \p{VS} \p{Variation_Selector} (= 6947 \p{Variation_Selector=Y}) (NOT 6948 \p{Variation_Selectors}) (260) 6949 \p{VS: *} \p{Variation_Selector: *} 6950 X \p{VS_Sup} \p{Variation_Selectors_Supplement} (= 6951 \p{Block= 6952 Variation_Selectors_Supplement}) (240) 6953 \p{Wancho} \p{Script_Extensions=Wancho} (Short: 6954 \p{Wcho}; NOT \p{Block=Wancho}) (59) 6955 \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions= 6956 Warang_Citi}) (NOT \p{Block= 6957 Warang_Citi}) (84) 6958 \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short: 6959 \p{Wara}; NOT \p{Block=Warang_Citi}) (84) 6960 \p{WB: *} \p{Word_Break: *} 6961 \p{Wcho} \p{Wancho} (= \p{Script_Extensions= 6962 Wancho}) (NOT \p{Block=Wancho}) (59) 6963 \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25) 6964 \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087 6965 plus all above-Unicode code points: [^ 6966 \t\n\cK\f\r\x20\x85\xa0], U+0100..167F, 6967 U+1681..1FFF, U+200B..2027, 6968 U+202A..202E, U+2030..205E ...) 6969 \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25: [\t 6970 \n\cK\f\r\x20\x85\xa0], U+1680, 6971 U+2000..200A, U+2028..2029, U+202F, 6972 U+205F ...) 
6973 \p{Word} \p{XPosixWord} (139_612) 6974 \p{Word_Break: ALetter} (Short: \p{WB=LE}) (29_489: [A-Za-z\xaa 6975 \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff], 6976 U+0100..02D7, U+02DE..02FF, 6977 U+0370..0374, U+0376..0377, U+037A..037D 6978 ...) 6979 \p{Word_Break: CR} (Short: \p{WB=CR}) (1: [\r]) 6980 \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"]) 6981 \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1) 6982 \p{Word_Break: E_Base} (Short: \p{WB=EB}) (0) 6983 \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0) 6984 \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0) 6985 \p{Word_Break: EB} \p{Word_Break=E_Base} (0) 6986 \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (0) 6987 \p{Word_Break: EM} \p{Word_Break=E_Modifier} (0) 6988 \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11) 6989 \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2554: 6990 U+0300..036F, U+0483..0489, 6991 U+0591..05BD, U+05BF, U+05C1..05C2, 6992 U+05C4..05C5 ...) 6993 \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F, 6994 U+203F..2040, U+2054, U+FE33..FE34, 6995 U+FE4D..FE4F ...) 6996 \p{Word_Break: FO} \p{Word_Break=Format} (71) 6997 \p{Word_Break: Format} (Short: \p{WB=FO}) (71: [\xad], 6998 U+0600..0605, U+061C, U+06DD, U+070F, 6999 U+0890..0891 ...) 7000 \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (0) 7001 \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0) 7002 \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75: 7003 U+05D0..05EA, U+05EF..05F2, U+FB1D, 7004 U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C 7005 ...) 7006 \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (75) 7007 \p{Word_Break: KA} \p{Word_Break=Katakana} (331) 7008 \p{Word_Break: Katakana} (Short: \p{WB=KA}) (331: U+3031..3035, 7009 U+309B..309C, U+30A0..30FA, 7010 U+30FC..30FF, U+31F0..31FF, U+32D0..32FE 7011 ...) 
7012 \p{Word_Break: LE} \p{Word_Break=ALetter} (29_489) 7013 \p{Word_Break: LF} (Short: \p{WB=LF}) (1: [\n]) 7014 \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7) 7015 \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9: [:\xb7], U+0387, 7016 U+055F, U+05F4, U+2027, U+FE13 ...) 7017 \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15: [,;], U+037E, 7018 U+0589, U+060C..060D, U+066C, U+07F8 ...) 7019 \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.], 7020 U+2018..2019, U+2024, U+FE52, U+FF07, 7021 U+FF0E) 7022 \p{Word_Break: ML} \p{Word_Break=MidLetter} (9) 7023 \p{Word_Break: MN} \p{Word_Break=MidNum} (15) 7024 \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85], 7025 U+2028..2029) 7026 \p{Word_Break: NL} \p{Word_Break=Newline} (5) 7027 \p{Word_Break: NU} \p{Word_Break=Numeric} (681) 7028 \p{Word_Break: Numeric} (Short: \p{WB=NU}) (681: [0-9], 7029 U+0660..0669, U+066B, U+06F0..06F9, 7030 U+07C0..07C9, U+0966..096F ...) 7031 \p{Word_Break: Other} (Short: \p{WB=XX}) (1_080_819 plus all 7032 above-Unicode code points: [^\n\cK\f\r 7033 \x20\"\',.0-9:;A-Z_a-z\x85\xaa\xad\xb5 7034 \xb7\xba\xc0-\xd6\xd8-\xf6\xf8-\xff], 7035 U+02D8..02DD, U+0375, U+0378..0379, 7036 U+0380..0385, U+038B ...) 
7037 \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26: 7038 U+1F1E6..1F1FF) 7039 \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26) 7040 \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\']) 7041 \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1) 7042 \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20], 7043 U+1680, U+2000..2006, U+2008..200A, 7044 U+205F, U+3000) 7045 \p{Word_Break: XX} \p{Word_Break=Other} (1_080_819 plus all 7046 above-Unicode code points) 7047 \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1: U+200D) 7048 \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25) 7049 \p{WSpace: *} \p{White_Space: *} 7050 \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44) 7051 \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC}) 7052 (139_463) 7053 \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (974_649 7054 plus all above-Unicode code points: 7055 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>? 7056 \@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4 7057 \xb6\xb8-\xb9\xbb-\xbf\xd7\xf7], 7058 U+02C2..02C5, U+02D2..02DF, 7059 U+02E5..02EB, U+02ED, U+02EF..02FF ...) 7060 \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (139_463: 7061 [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6 7062 \xd8-\xf6\xf8-\xff], U+0100..02C1, 7063 U+02C6..02D1, U+02E0..02E4, U+02EC, 7064 U+02EE ...) 7065 \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (136_322) 7066 \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (977_790 7067 plus all above-Unicode code points: 7068 [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<= 7069 >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4 7070 \xb6-\xb9\xbb-\xbf\xd7\xf7], 7071 U+02C2..02C5, U+02D2..02DF, 7072 U+02E5..02EB, U+02ED, U+02EF..036F ...) 7073 \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (136_322: 7074 [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6 7075 \xf8-\xff], U+0100..02C1, U+02C6..02D1, 7076 U+02E0..02E4, U+02EC, U+02EE ...) 
7077 \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y}) 7078 (139_463) 7079 \p{XIDC: *} \p{XID_Continue: *} 7080 \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (136_322) 7081 \p{XIDS: *} \p{XID_Start: *} 7082 \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions= 7083 Old_Persian}) (NOT \p{Block= 7084 Old_Persian}) (50) 7085 \p{XPerlSpace} \p{XPosixSpace} (25) 7086 \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short: 7087 \p{Alnum}) (138_445: [0-9A-Za-z\xaa\xb5 7088 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff], 7089 U+0100..02C1, U+02C6..02D1, 7090 U+02E0..02E4, U+02EC, U+02EE ...) 7091 \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha}) 7092 (137_765) 7093 \p{XPosixBlank} \h, Horizontal white space (Short: 7094 \p{Blank}) (18: [\t\x20\xa0], U+1680, 7095 U+2000..200A, U+202F, U+205F, U+3000) 7096 \p{XPosixCntrl} \p{General_Category=Control} Control 7097 characters (Short: \p{Cc}) (65) 7098 \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9] 7099 + all other decimal digits (Short: 7100 \p{Nd}) (680) 7101 \p{XPosixGraph} Characters that are graphical (Short: 7102 \p{Graph}) (286_635: [!\"#\$\%&\' 7103 \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`a-z 7104 \{\|\}~\xa1-\xff], U+0100..0377, 7105 U+037A..037F, U+0384..038A, U+038C, 7106 U+038E..03A1 ...) 7107 \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i= 7108 Cased=Yes) (2544) 7109 \p{XPosixPrint} Characters that are graphical plus space 7110 characters (but no controls) (Short: 7111 \p{Print}) (286_652: [\x20-\x7e\xa0- 7112 \xff], U+0100..0377, U+037A..037F, 7113 U+0384..038A, U+038C, U+038E..03A1 ...) 7114 \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (851: 7115 [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\] 7116 \^_`\{\|\}~\xa1\xa7\xab\xb6-\xb7\xbb 7117 \xbf], U+037E, U+0387, U+055A..055F, 7118 U+0589..058A, U+05BE ...) 7119 \p{XPosixSpace} \s including beyond ASCII and vertical tab 7120 (Short: \p{SpacePerl}) (25: [\t\n\cK\f 7121 \r\x20\x85\xa0], U+1680, U+2000..200A, 7122 U+2028..2029, U+202F, U+205F ...) 
7123 \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i= 7124 Cased=Yes) (1951) 7125 \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} + 7126 \pM + \p{Pc} + \p{Join_Control} (Short: 7127 \p{Word}) (139_612: [0-9A-Z_a-z\xaa\xb5 7128 \xba\xc0-\xd6\xd8-\xf6\xf8-\xff], 7129 U+0100..02C1, U+02C6..02D1, 7130 U+02E0..02E4, U+02EC, U+02EE ...) 7131 \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44) 7132 \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions= 7133 Cuneiform}) (NOT \p{Block=Cuneiform}) 7134 (1234) 7135 \p{Yezi} \p{Yezidi} (= \p{Script_Extensions= 7136 Yezidi}) (NOT \p{Block=Yezidi}) (60) 7137 \p{Yezidi} \p{Script_Extensions=Yezidi} (Short: 7138 \p{Yezi}; NOT \p{Block=Yezidi}) (60) 7139 \p{Yi} \p{Script_Extensions=Yi} (1246) 7140 X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64) 7141 X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168) 7142 \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246) 7143 X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block= 7144 Yijing_Hexagram_Symbols}) (64) 7145 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols} 7146 (Short: \p{InYijing}) (64) 7147 \p{Z} \pZ \p{Separator} (= \p{General_Category= 7148 Separator}) (19) 7149 \p{Zanabazar_Square} \p{Script_Extensions=Zanabazar_Square} 7150 (Short: \p{Zanb}; NOT \p{Block= 7151 Zanabazar_Square}) (72) 7152 \p{Zanb} \p{Zanabazar_Square} (= 7153 \p{Script_Extensions=Zanabazar_Square}) 7154 (NOT \p{Block=Zanabazar_Square}) (72) 7155 \p{Zinh} \p{Inherited} (= \p{Script_Extensions= 7156 Inherited}) (586) 7157 \p{Zl} \p{Line_Separator} (= \p{General_Category= 7158 Line_Separator}) (1) 7159 X \p{Znamenny_Music} \p{Znamenny_Musical_Notation} (= \p{Block= 7160 Znamenny_Musical_Notation}) (208) 7161 X \p{Znamenny_Musical_Notation} \p{Block=Znamenny_Musical_Notation} 7162 (Short: \p{InZnamennyMusic}) (208) 7163 \p{Zp} \p{Paragraph_Separator} (= 7164 \p{General_Category= 7165 Paragraph_Separator}) (1) 7166 \p{Zs} \p{Space_Separator} (= 7167 \p{General_Category=Space_Separator}) 7168 (17) 
    \p{Zyyy}                \p{Common} (= \p{Script_Extensions=
                              Common}) (7873)
    \p{Zzzz}                \p{Unknown} (= \p{Script_Extensions=
                              Unknown}) (964_861 plus all above-
                              Unicode code points)

=head2 Legal C<\p{}> and C<\P{}> constructs that match no characters

Unicode has some property-value pairs that currently don't match anything.
This generally happens either because they are obsolete, or because they
exist for symmetry with other forms but no language that uses them has yet
been encoded. In this version of Unicode, the following match zero code
points:

=over 4

=item \p{Canonical_Combining_Class=Attached_Below_Left}

=item \p{Canonical_Combining_Class=CCC133}

=item \p{Grapheme_Cluster_Break=E_Base}

=item \p{Grapheme_Cluster_Break=E_Base_GAZ}

=item \p{Grapheme_Cluster_Break=E_Modifier}

=item \p{Grapheme_Cluster_Break=Glue_After_Zwj}

=item \p{Word_Break=E_Base}

=item \p{Word_Break=E_Base_GAZ}

=item \p{Word_Break=E_Modifier}

=item \p{Word_Break=Glue_After_Zwj}

=back

=head1 Properties accessible through Unicode::UCD

The value of any Unicode (not including Perl extensions) character
property mentioned above for any single code point is available through
L<Unicode::UCD/charprop()>. L<Unicode::UCD/charprops_all()> returns the
values of all the Unicode properties for a given code point.

Besides these, all the Unicode character properties mentioned above
(except for those marked as for internal use by Perl) are also
accessible by L<Unicode::UCD/prop_invlist()>.

Due to their nature, not all Unicode character properties are suitable for
regular expression matches or for C<prop_invlist()>.
The remaining
non-provisional, non-internal ones are accessible via
L<Unicode::UCD/prop_invmap()> (except for those that this Perl installation
hasn't included; see L<below for which those are|/Unicode character properties
that are NOT accepted by Perl>).

For compatibility with other parts of Perl, all the single forms given in the
table in the L<section above|/Properties accessible through \p{} and \P{}>
are recognized. However, there are some ambiguities between some Perl
extensions and the Unicode properties, all of which are silently resolved in
favor of the official Unicode property. To avoid surprises, you should only
use C<prop_invmap()> for forms listed in the table below, which omits the
non-recommended ones. The affected forms are the Perl single-form equivalents
of Unicode properties. For example, C<\p{sc}> is a single-form equivalent of
C<\p{gc=sc}>, but C<prop_invmap()> treats C<sc> as the C<Script> property,
whose short name is C<sc>. The table indicates the current ambiguities in the
INFO column, beginning with the word C<"NOT">.

The standard Unicode properties listed below are documented in
L<http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is documented in
L<Unicode::UCD/prop_invmap()>. The other Perl extensions are described in
L<perlunicode/Other Properties>.

The first column in the table is a name for the property; the second column is
an alternative name, if any, plus possibly some annotations. The alternative
name is the property's full name, unless that would simply repeat the first
column, in which case the second column indicates the property's short name
(if different). The annotations are given only in the entry for the full
name. The annotations for binary properties include a list of the first few
ranges that the property matches. To avoid any ambiguity, the SPACE character
is represented as C<\x20>.
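
The three access routes named in this section (C<charprop()>,
C<prop_invlist()>, and C<prop_invmap()>) can be sketched as follows. This is
only an illustration, not a definitive recipe: it assumes a Perl whose
L<Unicode::UCD> exports C<charprop()> (believed to be 5.24 or later), and the
helpers C<in_invlist()> and C<invmap_value()> are hypothetical names invented
for this example; they are not part of the module.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop prop_invlist prop_invmap);

# Hypothetical helper (not part of Unicode::UCD): membership test against
# an inversion list. The list holds the code points at which membership
# flips, so an odd number of boundaries at or below $cp means "inside".
sub in_invlist {
    my ($cp, @invlist) = @_;
    my $flips = grep { $_ <= $cp } @invlist;
    return $flips % 2 == 1;
}

# Hypothetical helper: value of the range that contains $cp, given the two
# parallel arrays returned by prop_invmap().
sub invmap_value {
    my ($cp, $starts, $values) = @_;
    my $i = -1;
    for my $start (@$starts) {
        last if $start > $cp;
        $i++;
    }
    return $i < 0 ? undef : $values->[$i];
}

# 1. The value of one property for one code point.
my $script = charprop(ord("A"), "Script");

# 2. An inversion list for a binary property.
my @ahex = prop_invlist("ASCII_Hex_Digit");

# 3. An inversion map: range starts, one value per range, a format code
#    describing the values, and the default for unlisted code points.
my ($starts, $values, $format, $default) = prop_invmap("General_Category");

printf "Script of 'A': %s\n", $script;
printf "'f' is an ASCII hex digit: %s\n",
    in_invlist(ord("f"), @ahex) ? "yes" : "no";
printf "'g' is an ASCII hex digit: %s\n",
    in_invlist(ord("g"), @ahex) ? "yes" : "no";
printf "General_Category of 'A' (format '%s'): %s\n",
    $format, invmap_value(ord("A"), $starts, $values);
```

The linear scans keep the helpers short. For repeated lookups, a binary
search over the same arrays would be the more usual choice.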

If a property is obsolete, etc., the entry will be flagged with the same
characters used in the table in the L<section above|/Properties accessible
through \p{} and \P{}>, like B<D> or B<S>.

    NAME                          INFO

    Age
    AHex                          ASCII_Hex_Digit
    All                           (Perl extension). All code points,
                                  including those above Unicode. Same as
                                  qr/./s. U+0000..infinity
    Alnum                         XPosixAlnum. (Perl extension)
    Alpha                         Alphabetic
    Alphabetic                    (Short: Alpha). [A-Za-z\xaa\xb5\xba\xc0-
                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
                                  ...
    Any                           (Perl extension). All Unicode code
                                  points. U+0000..10FFFF
    ASCII                         Block=Basic_Latin. (Perl extension).
                                  [\x00-\x7f]
    ASCII_Hex_Digit               (Short: AHex). [0-9A-Fa-f]
    Assigned                      (Perl extension). All assigned code
                                  points. U+0000..0377, U+037A..037F,
                                  U+0384..038A, U+038C, U+038E..03A1,
                                  U+03A3..052F ...
    Bc                            Bidi_Class
    Bidi_C                        Bidi_Control
    Bidi_Class                    (Short: bc)
    Bidi_Control                  (Short: Bidi_C). U+061C, U+200E..200F,
                                  U+202A..202E, U+2066..2069
    Bidi_M                        Bidi_Mirrored
    Bidi_Mirrored                 (Short: Bidi_M). [\(\)<>\[\]\{\}\xab
                                  \xbb], U+0F3A..0F3D, U+169B..169C,
                                  U+2039..203A, U+2045..2046, U+207D..207E
                                  ...
    Bidi_Mirroring_Glyph          (Short: bmg)
    Bidi_Paired_Bracket           (Short: bpb)
    Bidi_Paired_Bracket_Type      (Short: bpt)
    Blank                         XPosixBlank. (Perl extension)
    Blk                           Block
    Block                         (Short: blk)
    Bmg                           Bidi_Mirroring_Glyph
    Bpb                           Bidi_Paired_Bracket
    Bpt                           Bidi_Paired_Bracket_Type
    Canonical_Combining_Class     (Short: ccc)
    Case_Folding                  (Short: cf)
    Case_Ignorable                (Short: CI). [\'.:\^`\xa8\xad\xaf\xb4
                                  \xb7-\xb8], U+02B0..036F, U+0374..0375,
                                  U+037A, U+0384..0385, U+0387 ...
    Cased                         [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                                  \xff], U+0100..01BA, U+01BC..01BF,
                                  U+01C4..0293, U+0295..02B8, U+02C0..02C1
                                  ...
    Category                      General_Category
    Ccc                           Canonical_Combining_Class
    CE                            Composition_Exclusion
    Cf                            Case_Folding; NOT 'cf' meaning
                                  'General_Category=Format'
    Changes_When_Casefolded       (Short: CWCF). [A-Z\xb5\xc0-\xd6\xd8-
                                  \xdf], U+0100, U+0102, U+0104, U+0106,
                                  U+0108 ...
    Changes_When_Casemapped       (Short: CWCM). [A-Za-z\xb5\xc0-\xd6\xd8-
                                  \xf6\xf8-\xff], U+0100..0137,
                                  U+0139..018C, U+018E..019A, U+019C..01A9,
                                  U+01AC..01B9 ...
    Changes_When_Lowercased       (Short: CWL). [A-Z\xc0-\xd6\xd8-\xde],
                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
    Changes_When_NFKC_Casefolded  (Short: CWKCF). [A-Z\xa0\xa8\xaa
                                  \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
                                  \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
                                  U+0106, U+0108 ...
    Changes_When_Titlecased       (Short: CWT). [a-z\xb5\xdf-\xf6\xf8-
                                  \xff], U+0101, U+0103, U+0105, U+0107,
                                  U+0109 ...
    Changes_When_Uppercased       (Short: CWU). [a-z\xb5\xdf-\xf6\xf8-
                                  \xff], U+0101, U+0103, U+0105, U+0107,
                                  U+0109 ...
    CI                            Case_Ignorable
    Cntrl                         XPosixCntrl (=General_Category=Control).
                                  (Perl extension)
    Comp_Ex                       Full_Composition_Exclusion
    Composition_Exclusion         (Short: CE). U+0958..095F, U+09DC..09DD,
                                  U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
    CWCF                          Changes_When_Casefolded
    CWCM                          Changes_When_Casemapped
    CWKCF                         Changes_When_NFKC_Casefolded
    CWL                           Changes_When_Lowercased
    CWT                           Changes_When_Titlecased
    CWU                           Changes_When_Uppercased
    Dash                          [\-], U+058A, U+05BE, U+1400, U+1806,
                                  U+2010..2015 ...
    Decomposition_Mapping         (Short: dm)
    Decomposition_Type            (Short: dt)
    Default_Ignorable_Code_Point  (Short: DI). [\xad], U+034F, U+061C,
                                  U+115F..1160, U+17B4..17B5, U+180B..180F
                                  ...
    Dep                           Deprecated
    Deprecated                    (Short: Dep). U+0149, U+0673, U+0F77,
                                  U+0F79, U+17A3..17A4, U+206A..206F ...
    DI                            Default_Ignorable_Code_Point
    Dia                           Diacritic
    Diacritic                     (Short: Dia).
                                  [\^`\xa8\xaf\xb4\xb7-\xb8],
                                  U+02B0..034E, U+0350..0357, U+035D..0362,
                                  U+0374..0375, U+037A ...
    Digit                         XPosixDigit (=General_Category=
                                  Decimal_Number). (Perl extension)
    Dm                            Decomposition_Mapping
    Dt                            Decomposition_Type
    Ea                            East_Asian_Width
    East_Asian_Width              (Short: ea)
    EBase                         Emoji_Modifier_Base
    EComp                         Emoji_Component
    EMod                          Emoji_Modifier
    Emoji                         [#*0-9\xa9\xae], U+203C, U+2049, U+2122,
                                  U+2139, U+2194..2199 ...
    Emoji_Component               (Short: EComp). [#*0-9], U+200D, U+20E3,
                                  U+FE0F, U+1F1E6..1F1FF, U+1F3FB..1F3FF ...
    Emoji_Modifier                (Short: EMod). U+1F3FB..1F3FF
    Emoji_Modifier_Base           (Short: EBase). U+261D, U+26F9,
                                  U+270A..270D, U+1F385, U+1F3C2..1F3C4,
                                  U+1F3C7 ...
    Emoji_Presentation            (Short: EPres). U+231A..231B,
                                  U+23E9..23EC, U+23F0, U+23F3,
                                  U+25FD..25FE, U+2614..2615 ...
    EPres                         Emoji_Presentation
    EqUIdeo                       Equivalent_Unified_Ideograph
    Equivalent_Unified_Ideograph  (Short: EqUIdeo)
    Ext                           Extender
    Extended_Pictographic         (Short: ExtPict). [\xa9\xae], U+203C,
                                  U+2049, U+2122, U+2139, U+2194..2199 ...
    Extender                      (Short: Ext). [\xb7], U+02D0..02D1,
                                  U+0640, U+07FA, U+0B55, U+0E46 ...
    ExtPict                       Extended_Pictographic
    Full_Composition_Exclusion    (Short: Comp_Ex). U+0340..0341,
                                  U+0343..0344, U+0374, U+037E, U+0387,
                                  U+0958..095F ...
    Gc                            General_Category
    GCB                           Grapheme_Cluster_Break
    General_Category              (Short: gc)
    Gr_Base                       Grapheme_Base
    Gr_Ext                        Grapheme_Extend
    Graph                         XPosixGraph. (Perl extension)
    Grapheme_Base                 (Short: Gr_Base). [\x20-\x7e\xa0-\xac
                                  \xae-\xff], U+0100..02FF, U+0370..0377,
                                  U+037A..037F, U+0384..038A, U+038C ...
    Grapheme_Cluster_Break        (Short: GCB)
    Grapheme_Extend               (Short: Gr_Ext). U+0300..036F,
                                  U+0483..0489, U+0591..05BD, U+05BF,
                                  U+05C1..05C2, U+05C4..05C5 ...
    Hangul_Syllable_Type          (Short: hst)
    Hex                           Hex_Digit
    Hex_Digit                     (Short: Hex).
                                  [0-9A-Fa-f], U+FF10..FF19,
                                  U+FF21..FF26, U+FF41..FF46
    HorizSpace                    XPosixBlank. (Perl extension)
    Hst                           Hangul_Syllable_Type
  D Hyphen                        [\-\xad], U+058A, U+1806, U+2010..2011,
                                  U+2E17, U+30FB ... Supplanted by
                                  Line_Break property values; see
                                  www.unicode.org/reports/tr14
    ID_Continue                   (Short: IDC). [0-9A-Z_a-z\xaa\xb5\xb7
                                  \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
                                  U+02EC, U+02EE ...
    ID_Start                      (Short: IDS). [A-Za-z\xaa\xb5\xba\xc0-
                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
                                  ...
    IDC                           ID_Continue
    Identifier_Status
    Identifier_Type
    Ideo                          Ideographic
    Ideographic                   (Short: Ideo). U+3006..3007,
                                  U+3021..3029, U+3038..303A, U+3400..4DBF,
                                  U+4E00..9FFF, U+F900..FA6D ...
    IDS                           ID_Start
    IDS_Binary_Operator           (Short: IDSB). U+2FF0..2FF1, U+2FF4..2FFB
    IDS_Trinary_Operator          (Short: IDST). U+2FF2..2FF3
    IDSB                          IDS_Binary_Operator
    IDST                          IDS_Trinary_Operator
    In                            Present_In. (Perl extension)
    Indic_Positional_Category     (Short: InPC)
    Indic_Syllabic_Category       (Short: InSC)
    InPC                          Indic_Positional_Category
    InSC                          Indic_Syllabic_Category
    Isc                           ISO_Comment; NOT 'isc' meaning
                                  'General_Category=Other'
    ISO_Comment                   (Short: isc)
    Jg                            Joining_Group
    Join_C                        Join_Control
    Join_Control                  (Short: Join_C). U+200C..200D
    Joining_Group                 (Short: jg)
    Joining_Type                  (Short: jt)
    Jt                            Joining_Type
    Lb                            Line_Break
    Lc                            Lowercase_Mapping; NOT 'lc' meaning
                                  'General_Category=Cased_Letter'
    Line_Break                    (Short: lb)
    LOE                           Logical_Order_Exception
    Logical_Order_Exception       (Short: LOE). U+0E40..0E44, U+0EC0..0EC4,
                                  U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
                                  ...
    Lower                         Lowercase
    Lowercase                     (Short: Lower). [a-z\xaa\xb5\xba\xdf-
                                  \xf6\xf8-\xff], U+0101, U+0103, U+0105,
                                  U+0107, U+0109 ...
    Lowercase_Mapping             (Short: lc)
    Math                          [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
                                  U+03D5, U+03F0..03F1, U+03F4..03F6,
                                  U+0606..0608 ...
    Na                            Name
    Na1                           Unicode_1_Name
    Name                          (Short: na)
    Name_Alias
    NChar                         Noncharacter_Code_Point
    NFC_QC                        NFC_Quick_Check
    NFC_Quick_Check               (Short: NFC_QC)
    NFD_QC                        NFD_Quick_Check
    NFD_Quick_Check               (Short: NFD_QC)
    NFKC_Casefold                 (Short: NFKC_CF)
    NFKC_CF                       NFKC_Casefold
    NFKC_QC                       NFKC_Quick_Check
    NFKC_Quick_Check              (Short: NFKC_QC)
    NFKD_QC                       NFKD_Quick_Check
    NFKD_Quick_Check              (Short: NFKD_QC)
    Noncharacter_Code_Point       (Short: NChar). U+FDD0..FDEF,
                                  U+FFFE..FFFF, U+1FFFE..1FFFF,
                                  U+2FFFE..2FFFF, U+3FFFE..3FFFF,
                                  U+4FFFE..4FFFF ...
    Nt                            Numeric_Type
    Numeric_Type                  (Short: nt)
    Numeric_Value                 (Short: nv)
    Nv                            Numeric_Value
    Pat_Syn                       Pattern_Syntax
    Pat_WS                        Pattern_White_Space
    Pattern_Syntax                (Short: Pat_Syn). [!\"#\$\%&\'\(\)*+,\-.
                                  \/:;<=>?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9
                                  \xab-\xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7
                                  \xf7], U+2010..2027, U+2030..203E,
                                  U+2041..2053, U+2055..205E, U+2190..245F
                                  ...
    Pattern_White_Space           (Short: Pat_WS). [\t\n\cK\f\r\x20\x85],
                                  U+200E..200F, U+2028..2029
    PCM                           Prepended_Concatenation_Mark
    Perl_Decimal_Digit            (Perl extension)
    PerlSpace                     PosixSpace. (Perl extension)
    PerlWord                      PosixWord. (Perl extension)
    PosixAlnum                    (Perl extension). [0-9A-Za-z]
    PosixAlpha                    (Perl extension). [A-Za-z]
    PosixBlank                    (Perl extension). [\t\x20]
    PosixCntrl                    (Perl extension). ASCII control
                                  characters. ACK, BEL, BS, CAN, CR, DC1,
                                  DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
                                  ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
                                  NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
    PosixDigit                    (Perl extension). [0-9]
    PosixGraph                    (Perl extension). [!\"#\$\%&\'\(\)*+,\-.
                                  \/0-9:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~]
    PosixLower                    (Perl extension). [a-z]
    PosixPrint                    (Perl extension).
                                  [\x20-\x7e]
    PosixPunct                    (Perl extension). [!\"#\$\%&\'\(\)*+,\-.
                                  \/:;<=>?\@\[\\\]\^_`\{\|\}~]
    PosixSpace                    (Perl extension). [\t\n\cK\f\r\x20]
    PosixUpper                    (Perl extension). [A-Z]
    PosixWord                     (Perl extension). \w, restricted to
                                  ASCII. [0-9A-Z_a-z]
    PosixXDigit                   ASCII_Hex_Digit. (Perl extension).
                                  [0-9A-Fa-f]
    Prepended_Concatenation_Mark  (Short: PCM). U+0600..0605, U+06DD,
                                  U+070F, U+0890..0891, U+08E2, U+110BD ...
    Present_In                    (Short: In). (Perl extension)
    Print                         XPosixPrint. (Perl extension)
    Punct                         General_Category=Punctuation. (Perl
                                  extension). [!\"#\%&\'\(\)*,\-.\/:;?\@
                                  \[\\\]_\{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
                                  U+037E, U+0387, U+055A..055F,
                                  U+0589..058A, U+05BE ...
    QMark                         Quotation_Mark
    Quotation_Mark                (Short: QMark). [\"\'\xab\xbb],
                                  U+2018..201F, U+2039..203A, U+2E42,
                                  U+300C..300F, U+301D..301F ...
    Radical                       U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
    Regional_Indicator            (Short: RI). U+1F1E6..1F1FF
    RI                            Regional_Indicator
    SB                            Sentence_Break
    Sc                            Script; NOT 'sc' meaning
                                  'General_Category=Currency_Symbol'
    Scf                           Simple_Case_Folding
    Script                        (Short: sc)
    Script_Extensions             (Short: scx)
    Scx                           Script_Extensions
    SD                            Soft_Dotted
    Sentence_Break                (Short: SB)
    Sentence_Terminal             (Short: STerm). [!.?], U+0589,
                                  U+061D..061F, U+06D4, U+0700..0702, U+07F9
                                  ...
    Sfc                           Simple_Case_Folding
    Simple_Case_Folding           (Short: scf)
    Simple_Lowercase_Mapping      (Short: slc)
    Simple_Titlecase_Mapping      (Short: stc)
    Simple_Uppercase_Mapping      (Short: suc)
    Slc                           Simple_Lowercase_Mapping
    Soft_Dotted                   (Short: SD). [i-j], U+012F, U+0249,
                                  U+0268, U+029D, U+02B2 ...
    Space                         White_Space
    SpacePerl                     XPosixSpace. (Perl extension)
    Stc                           Simple_Titlecase_Mapping
    STerm                         Sentence_Terminal
    Suc                           Simple_Uppercase_Mapping
    Tc                            Titlecase_Mapping
    Term                          Terminal_Punctuation
    Terminal_Punctuation          (Short: Term).
                                  [!,.:;?], U+037E, U+0387,
                                  U+0589, U+05C3, U+060C ...
    Title                         Titlecase. (Perl extension)
    Titlecase                     (Short: Title). (Perl extension). (=
                                  \p{Gc=Lt}). U+01C5, U+01C8, U+01CB,
                                  U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
    Titlecase_Mapping             (Short: tc)
    Uc                            Uppercase_Mapping
    UIdeo                         Unified_Ideograph
    Unicode                       Any. (Perl extension)
    Unicode_1_Name                (Short: na1)
    Unified_Ideograph             (Short: UIdeo). U+3400..4DBF,
                                  U+4E00..9FFF, U+FA0E..FA0F, U+FA11,
                                  U+FA13..FA14, U+FA1F ...
    Upper                         Uppercase
    Uppercase                     (Short: Upper). [A-Z\xc0-\xd6\xd8-\xde],
                                  U+0100, U+0102, U+0104, U+0106, U+0108 ...
    Uppercase_Mapping             (Short: uc)
    Variation_Selector            (Short: VS). U+180B..180D, U+180F,
                                  U+FE00..FE0F, U+E0100..E01EF
    Vertical_Orientation          (Short: vo)
    VertSpace                     (Perl extension). \v. [\n\cK\f\r\x85],
                                  U+2028..2029
    Vo                            Vertical_Orientation
    VS                            Variation_Selector
    WB                            Word_Break
    White_Space                   (Short: WSpace). [\t\n\cK\f\r\x20\x85
                                  \xa0], U+1680, U+2000..200A, U+2028..2029,
                                  U+202F, U+205F ...
    Word                          XPosixWord. (Perl extension)
    Word_Break                    (Short: WB)
    WSpace                        White_Space
    XDigit                        XPosixXDigit (=Hex_Digit). (Perl
                                  extension)
    XID_Continue                  (Short: XIDC). [0-9A-Z_a-z\xaa\xb5\xb7
                                  \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
                                  U+02EC, U+02EE ...
    XID_Start                     (Short: XIDS). [A-Za-z\xaa\xb5\xba\xc0-
                                  \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
                                  U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
                                  ...
    XIDC                          XID_Continue
    XIDS                          XID_Start
    XPerlSpace                    XPosixSpace. (Perl extension)
    XPosixAlnum                   (Short: Alnum). (Perl extension).
                                  Alphabetic and (decimal) Numeric. [0-9A-
                                  Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                                  \xff], U+0100..02C1, U+02C6..02D1,
                                  U+02E0..02E4, U+02EC, U+02EE ...
    XPosixAlpha                   Alphabetic. (Perl extension).
                                  [A-Za-z
                                  \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
                                  U+02EC, U+02EE ...
    XPosixBlank                   (Short: Blank). (Perl extension). \h,
                                  Horizontal white space. [\t\x20\xa0],
                                  U+1680, U+2000..200A, U+202F, U+205F,
                                  U+3000
    XPosixCntrl                   General_Category=Control (Short: Cntrl).
                                  (Perl extension). Control characters.
                                  [\x00-\x1f\x7f-\x9f]
    XPosixDigit                   General_Category=Decimal_Number (Short:
                                  Digit). (Perl extension). [0-9] + all
                                  other decimal digits. [0-9],
                                  U+0660..0669, U+06F0..06F9, U+07C0..07C9,
                                  U+0966..096F, U+09E6..09EF ...
    XPosixGraph                   (Short: Graph). (Perl extension).
                                  Characters that are graphical. [!\"#\$
                                  \%&\'\(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]
                                  \^_`a-z\{\|\}~\xa1-\xff], U+0100..0377,
                                  U+037A..037F, U+0384..038A, U+038C,
                                  U+038E..03A1 ...
    XPosixLower                   Lowercase. (Perl extension). [a-z\xaa
                                  \xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
                                  U+0103, U+0105, U+0107, U+0109 ...
    XPosixPrint                   (Short: Print). (Perl extension).
                                  Characters that are graphical plus space
                                  characters (but no controls). [\x20-\x7e
                                  \xa0-\xff], U+0100..0377, U+037A..037F,
                                  U+0384..038A, U+038C, U+038E..03A1 ...
    XPosixPunct                   (Perl extension). \p{Punct} + ASCII-range
                                  \p{Symbol}. [!\"#\$\%&\'\(\)*+,\-.\/:;<=
                                  >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
                                  \xb7\xbb\xbf], U+037E, U+0387,
                                  U+055A..055F, U+0589..058A, U+05BE ...
    XPosixSpace                   (Perl extension). \s including beyond
                                  ASCII and vertical tab. [\t\n\cK\f\r\x20
                                  \x85\xa0], U+1680, U+2000..200A,
                                  U+2028..2029, U+202F, U+205F ...
    XPosixUpper                   Uppercase. (Perl extension). [A-Z\xc0-
                                  \xd6\xd8-\xde], U+0100, U+0102, U+0104,
                                  U+0106, U+0108 ...
    XPosixWord                    (Short: Word). (Perl extension). \w,
                                  including beyond ASCII; = \p{Alnum} + \pM
                                  + \p{Pc} + \p{Join_Control}.
                                  [0-9A-Z_a-z
                                  \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
                                  U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
                                  U+02EC, U+02EE ...
    XPosixXDigit                  Hex_Digit (Short: XDigit). (Perl
                                  extension). [0-9A-Fa-f], U+FF10..FF19,
                                  U+FF21..FF26, U+FF41..FF46

=head1 Properties accessible through other means

Certain properties are also accessible via core function calls. These are:

    Lowercase_Mapping          lc() and lcfirst()
    Titlecase_Mapping          ucfirst()
    Uppercase_Mapping          uc()

Also, Case_Folding is accessible through the C</i> modifier in regular
expressions, the C<\F> transliteration escape, and the C<L<fc|perlfunc/fc>>
operator.

Besides being able to say C<\p{Name=...}>, the Name and Name_Alias
properties are accessible through the C<\N{}> interpolation in double-quoted
strings and regular expressions, and through the functions
C<charnames::viacode()>, C<charnames::vianame()>, and
C<charnames::string_vianame()> (which require C<use charnames ();> to be
specified).

Finally, most properties related to decomposition are accessible via
L<Unicode::Normalize>.

=head1 Unicode character properties that are NOT accepted by Perl

Perl will generate an error for a few character properties in Unicode when
used in a regular expression. The non-Unihan ones are listed below, with the
reasons they are not accepted, perhaps with work-arounds. The short names for
the properties are listed enclosed in (parentheses).
As described after the list, an installation can change the defaults and choose
to accept any of these. The list is machine generated based on the
choices made for the installation that generated this document.

=over 4

=item I<Expands_On_NFC> (XO_NFC)

=item I<Expands_On_NFD> (XO_NFD)

=item I<Expands_On_NFKC> (XO_NFKC)

=item I<Expands_On_NFKD> (XO_NFKD)

Deprecated by Unicode.
These are characters that expand to more than one
character in the specified normalization form, but whether they actually take
up more bytes or not depends on the encoding being used. For example, a UTF-8
encoded character may expand to a different number of bytes than a UTF-32
encoded character.

=item I<Grapheme_Link> (Gr_Link)

Duplicates ccc=vr (Canonical_Combining_Class=Virama)

=item I<Jamo_Short_Name> (JSN)

=item I<Other_Alphabetic> (OAlpha)

=item I<Other_Default_Ignorable_Code_Point> (ODI)

=item I<Other_Grapheme_Extend> (OGr_Ext)

=item I<Other_ID_Continue> (OIDC)

=item I<Other_ID_Start> (OIDS)

=item I<Other_Lowercase> (OLower)

=item I<Other_Math> (OMath)

=item I<Other_Uppercase> (OUpper)

Used by Unicode internally for generating other properties and not intended
to be used stand-alone

=item I<Script=Katakana_Or_Hiragana> (sc=Hrkt)

Obsolete. All code points previously matched by this have been moved to
"Script=Common". Consider instead using "Script_Extensions=Katakana" or
"Script_Extensions=Hiragana" (or both)

=item I<Script_Extensions=Katakana_Or_Hiragana> (scx=Hrkt)

All code points that would be matched by this are matched by either
"Script_Extensions=Katakana" or "Script_Extensions=Hiragana"

=back

An installation can choose to allow any of these to be matched by downloading
the Unicode database from L<http://www.unicode.org/Public/> to
C<$Config{privlib}>/F<unicore/> in the Perl source tree, changing the
controlling lists contained in the program
C<$Config{privlib}>/F<unicore/mktables> and then re-compiling and installing.
(C<%Config> is available from the Config module.)

Also, perl can be recompiled to operate on an earlier version of the Unicode
standard.
Further information is at
C<$Config{privlib}>/F<unicore/README.perl>.

=head1 Other information in the Unicode data base

The Unicode data base is delivered in two different formats. The XML version
is valid for more modern Unicode releases. The other version is a collection
of files. The two are intended to give equivalent information. Perl uses the
older form; this allows you to recompile Perl to use early Unicode releases.

The only non-character property that Perl currently supports is Named
Sequences, in which a sequence of code points
is given a name and generally treated as a single entity. (Perl supports
these via the C<\N{...}> double-quotish construct,
L<charnames/charnames::string_vianame(name)>, and L<Unicode::UCD/namedseq()>.)

Below is a list of the files in the Unicode data base that Perl doesn't
currently use, along with very brief descriptions of their purposes.
Some of the names of the files have been shortened from those that Unicode
uses, in order to allow them to be distinguishable from similarly named files
on file systems for which only the first 8 characters of a name are
significant.

=over 4

=item F<auxiliary/GraphemeBreakTest.html>

=item F<auxiliary/LineBreakTest.html>

=item F<auxiliary/SentenceBreakTest.html>

=item F<auxiliary/WordBreakTest.html>

Documentation of validation Tests

=item F<BidiCharacterTest.txt>

=item F<BidiTest.txt>

Validation Tests

=item F<CJKRadicals.txt>

Maps the kRSUnicode property values to corresponding code points

=item F<confusables.txt>

=item F<confusablesSummary.txt>

=item F<intentional.txt>

Currently unused by Perl

=item F<emoji/ReadMe.txt>

=item F<ReadMe.txt>

Documentation

=item F<EmojiSources.txt>

Maps certain Unicode code points to their legacy Japanese cell-phone values

=item F<extracted/DName.txt>

This file adds no new information not already present in other files

=item F<Index.txt>

Alphabetical index of Unicode characters

=item F<NamedSqProv.txt>

Named sequences proposed for inclusion in a later version of the Unicode
Standard; if you need them now, you can append this file to
F<NamedSequences.txt> and recompile perl

=item F<NamesList.html>

Describes the format and contents of F<NamesList.txt>

=item F<NamesList.txt>

Annotated list of characters

=item F<NormalizationCorrections.txt>

Documentation of corrections already incorporated into the Unicode data base

=item F<NushuSources.txt>

Specifies source material for Nushu characters

=item F<StandardizedVariants.html>

Obsoleted as of Unicode 9.0, but previously provided a visual display of the
standard variant sequences derived from F<StandardizedVariants.txt>.

=item F<StandardizedVariants.txt>

Certain glyph variations for character display are standardized. This lists
the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a
separate Unicode data base L<http://www.unicode.org/ivd>

=item F<TangutSources.txt>

Specifies source mappings for Tangut ideographs and components. This data
file also includes informative radical-stroke values that are used internally
by Unicode

=item F<USourceData.txt>

Documentation of status and cross reference of proposals for encoding by
Unicode of Unihan characters

=item F<USourceGlyphs.pdf>

Pictures of the characters in F<USourceData.txt>

=back

=head1 SEE ALSO

L<http://www.unicode.org/reports/tr44/>

L<perlrecharclass>

L<perlunicode>