=begin comment

# !!!!!!!   DO NOT EDIT THIS FILE   !!!!!!!
# This file is machine-generated by lib/unicore/mktables from the Unicode
# database, Version 12.1.0.  Any changes made here will be lost!


To change this file, edit lib/unicore/mktables instead.

=end comment

=head1 NAME

perluniprops - Index of Unicode Version 12.1.0 character properties in Perl

=head1 DESCRIPTION

This document provides information about the portion of the Unicode database
that deals with character properties, that is, the portion that is defined on
single code points.  (L</Other information in the Unicode data base>
below briefly mentions other data that Unicode provides.)

Perl can provide access to all non-provisional Unicode character properties,
though not all are enabled by default.  The omitted ones are the Unihan
properties (accessible via the CPAN module L<Unicode::Unihan>) and certain
deprecated or Unicode-internal properties.  (An installation may choose to
recompile Perl's tables to change this.  See L</Unicode character
properties that are NOT accepted by Perl>.)

For most purposes, access to Unicode properties from the Perl core is through
regular expression matches, as described in the next section.
For some special purposes, and to access the properties that are not suitable
for regular expression matching, all the Unicode character properties that
Perl handles are accessible via the standard L<Unicode::UCD> module, as
described in the section L</Properties accessible through Unicode::UCD>.

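As a minimal sketch of the L<Unicode::UCD> route, the following looks up one
code point with C<charinfo()>.  The choice of U+03B1 and of the fields printed
is purely illustrative; the module offers many other functions that are not
shown here.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo);

# charinfo() returns a hash reference describing a single code point.
my $info = charinfo(0x03B1);

# 'code' is the code point in hex, 'name' its character name, and
# 'category' its General_Category short value.
printf "U+%s %s (General_Category %s)\n",
    $info->{code}, $info->{name}, $info->{category};
```
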
Perl also provides some additional extensions and short-cut synonyms
for Unicode properties.

This document merely lists all available properties and does not attempt to
explain what each property really means.  There is a brief description of each
Perl extension; see L<perlunicode/Other Properties> for more information on
these.  There is some detail about Blocks, Scripts, General_Category,
and Bidi_Class in L<perlunicode>, but to find out about the intricacies of the
official Unicode properties, refer to the Unicode standard.  A good starting
place is L<http://www.unicode.org/reports/tr44/>.

Note that you can define your own properties; see
L<perlunicode/"User-Defined Character Properties">.

=head1 Properties accessible through C<\p{}> and C<\P{}>

The Perl regular expression C<\p{}> and C<\P{}> constructs give access to
most of the Unicode character properties.  The table below shows all these
constructs, both single and compound forms.

B<Compound forms> consist of two components, separated by an equals sign or a
colon.  The first component is the property name, and the second component is
the particular value of the property to match against, for example,
C<\p{Script_Extensions: Greek}> and C<\p{Script_Extensions=Greek}> both mean
to match characters whose Script_Extensions property value is Greek.
(C<Script_Extensions> is an improved version of the C<Script> property.)

B<Single forms>, like C<\p{Greek}>, are mostly Perl-defined shortcuts for
their equivalent compound forms.  The table shows these equivalences.  (In our
example, C<\p{Greek}> is just a shortcut for
C<\p{Script_Extensions=Greek}>.)  There are also a few Perl-defined single
forms that are not shortcuts for a compound form.  One such is C<\p{Word}>.
These are also listed in the table.

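The equivalence of the forms described above can be checked with a short
script.  This is only a sketch: the test character (U+03B1) and the variable
names are illustrative, and it assumes a Perl recent enough to know the
C<Script_Extensions> property.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Two spellings of the compound form, then the single-form shortcut.
my %matched = (
    colon    => ($alpha =~ /\p{Script_Extensions: Greek}/ ? 1 : 0),
    equals   => ($alpha =~ /\p{Script_Extensions=Greek}/  ? 1 : 0),
    shortcut => ($alpha =~ /\p{Greek}/                    ? 1 : 0),
);

# \p{Word} is a single form that is not a shortcut for a compound form.
my $underscore_is_word = "_" =~ /\p{Word}/ ? 1 : 0;

print "$_: $matched{$_}\n" for sort keys %matched;
print "underscore matches \\p{Word}: $underscore_is_word\n";
```
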
In parsing these constructs, Perl always ignores upper/lower case differences
everywhere within the {braces}.  Thus C<\p{Greek}> means the same thing as
C<\p{greek}>.  But note that changing the case of the C<"p"> or C<"P"> before
the left brace completely changes the meaning of the construct, from "match"
(for C<\p{}>) to "doesn't match" (for C<\P{}>).  Casing in this document is
for improved legibility.

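A small sketch of the casing rules just described; the sample characters and
variable names are illustrative only.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Case inside the braces does not matter.
my $same_inside = (   $alpha =~ /\p{Greek}/
                   && $alpha =~ /\p{greek}/
                   && $alpha =~ /\p{GREEK}/ ) ? 1 : 0;

# Case of the letter before the brace does: \P{} is the complement of \p{}.
my $alpha_negated = $alpha =~ /\P{Greek}/ ? 1 : 0;
my $latin_negated = "a"    =~ /\P{Greek}/ ? 1 : 0;

print "inside-brace case ignored: $same_inside\n";
print "alpha matches \\P{Greek}: $alpha_negated\n";
print "'a' matches \\P{Greek}: $latin_negated\n";
```
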
Also, white space, hyphens, and underscores are normally ignored
everywhere between the {braces}, and hence can be freely added or removed
even if the C</x> modifier hasn't been specified on the regular expression.
But in the table below a 'B<T>' at the beginning of an entry
means that tighter (stricter) rules are used for that entry:

=over 4

=over 4

=item Single form (C<\p{name}>) tighter rules:

White space, hyphens, and underscores ARE significant
except for:

=over 4

=item * white space adjacent to a non-word character

=item * underscores separating digits in numbers

=back

That means, for example, that you can freely add or remove white space
adjacent to (but within) the braces without affecting the meaning.

=item Compound form (C<\p{name=value}> or C<\p{name:value}>) tighter rules:

The tighter rules given above for the single form apply to everything to the
right of the colon or equals; the looser rules still apply to everything to
the left.

That means, for example, that you can freely add or remove white space
adjacent to (but within) the braces and the colon or equal sign.

=back

=back

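The following sketch exercises the loose rules on a property name and value,
and two spellings of a 'B<T>' entry taken from the table (C<\p{Age: 2.1}> and
its equivalent C<\p{Age=V2_1}>).  The particular loose spellings and the test
characters are illustrative choices, not an exhaustive list of what is
accepted.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";     # GREEK SMALL LETTER ALPHA
my $euro  = "\x{20AC}";    # EURO SIGN, introduced in Unicode 2.1

# Loose rules: case, white space, hyphens, and underscores are ignored.
my @loose = (
    qr/\p{Script_Extensions=Greek}/,
    qr/\p{ScriptExtensions=Greek}/,
    qr/\p{ script-extensions = greek }/,
);
my $loose_ok = grep { $alpha =~ $_ } @loose;    # number of patterns that match

# Tighter rules apply to the value of Age; white space next to the
# colon is still permitted.
my @tight = (
    qr/\p{Age=V2_1}/,
    qr/\p{Age: 2.1}/,
);
my $tight_ok = grep { $euro =~ $_ } @tight;

print "loose spellings matching: $loose_ok of ", scalar(@loose), "\n";
print "tight spellings matching: $tight_ok of ", scalar(@tight), "\n";
```
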
Some properties are considered obsolete by Unicode, but are still available.
There are several varieties of obsolescence:

=over 4

=over 4

=item Stabilized

A property may be stabilized.  Such a determination does not indicate
that the property should or should not be used; instead it is a declaration
that the property will not be maintained nor extended for newly encoded
characters.  Such properties are marked with an 'B<S>' in the
table.

=item Deprecated

A property may be deprecated, perhaps because its original intent
has been replaced by another property, or because its specification was
somehow defective.  This means that its use is strongly
discouraged, so much so that a warning will be issued if it is used, unless the
regular expression is in the scope of a C<S<no warnings 'deprecated'>>
statement.  A 'B<D>' flags each such entry in the table, and
the entry there for the longest, most descriptive version of the property will
give the reason it is deprecated, and perhaps advice.  Perl may issue such a
warning, even for properties that aren't officially deprecated by Unicode,
when there used to be characters or code points that were matched by them, but
no longer.  This is to warn you that your program may not work like it did on
earlier Unicode releases.

Deprecated properties may be made unavailable in a future Perl version, so it
is best to move away from them.

A deprecated property may also be stabilized, but this fact is not shown.

=item Obsolete

Properties marked with an 'B<O>' in the table are considered (plain)
obsolete.  Generally this designation is given to properties that Unicode once
used for internal purposes (but not any longer).

=item Discouraged

This is not actually a Unicode-specified obsolescence, but applies to certain
Perl extensions that are present for backwards compatibility, but are
discouraged from being used.  These are not obsolete, but their meanings are
not stable.  Future Unicode versions could force any of these extensions to be
removed without warning and replaced by another property with the same name
that means something different.  An 'B<X>' flags each such entry in the
table.  Use the equivalent shown instead.

In particular, matches in the Block property have single forms
defined by Perl that begin with C<"In_">, C<"Is_">, or even with no prefix at
all.  Like all B<DISCOURAGED> forms, these are not stable.  For example,
C<\p{Block=Deseret}> can currently be written as C<\p{In_Deseret}>,
C<\p{Is_Deseret}>, or C<\p{Deseret}>.  But a new Unicode version may
come along that would force Perl to change the meaning of one or more of
these, and your program would no longer be correct.  Currently there are no
such conflicts with the form that begins C<"In_">, but there are many with the
other two shortcuts, and Unicode continues to define new properties that begin
with C<"In">, so it's quite possible that a conflict will occur in the future.
The compound form is guaranteed to not become obsolete, and its meaning is
clearer anyway.  See L<perlunicode/"Blocks"> for more information about this.

=back

=back

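To illustrate why the compound Block form is the safer spelling, the sketch
below compares it with an C<"In_"> shortcut, and then shows a prefix-less name
(C<\p{Arabic}>) that the table lists as a script rather than a block.  The
sample code points are illustrative; U+0750 is assumed to lie in the
Arabic_Supplement block, outside the Arabic block proper.

```perl
use strict;
use warnings;

my $deseret = "\x{10400}";    # first code point of the Deseret block
my $beh     = "\x{0750}";     # an Arabic letter in the Arabic_Supplement block

# The compound form and the discouraged In_ shortcut currently agree.
my $compound = $deseret =~ /\p{Block=Deseret}/ ? 1 : 0;
my $in_form  = $deseret =~ /\p{In_Deseret}/    ? 1 : 0;

# Without a prefix, \p{Arabic} means the script, not the block.
my $script_arabic = $beh =~ /\p{Arabic}/                  ? 1 : 0;
my $block_arabic  = $beh =~ /\p{Block=Arabic}/            ? 1 : 0;
my $block_sup     = $beh =~ /\p{Block=Arabic_Supplement}/ ? 1 : 0;

print "Block=Deseret: $compound, In_Deseret: $in_form\n";
print "U+0750 script Arabic: $script_arabic, block Arabic: $block_arabic, ",
      "block Arabic_Supplement: $block_sup\n";
```
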
The table below has two columns.  The left column contains the C<\p{}>
constructs to look up, possibly preceded by the flags mentioned above; and
the right column contains information about them, like a description, or
synonyms.  The table shows both the single and compound forms for each
property that has them.  If the left column is a short name for a property,
the right column will give its longer, more descriptive name; and if the left
column is the longest name, the right column will show any equivalent shortest
name, in both single and compound forms if applicable.

If braces are not needed to specify a property (e.g., C<\pL>), the left
column contains both forms, with and without braces.

The right column will also caution you if a property means something different
than what might normally be expected.

All single forms are Perl extensions; a few compound forms are as well, and
are noted as such.

Numbers in (parentheses) indicate the total number of Unicode code points
matched by the property.  For the entries that give the longest, most
descriptive version of the property, the count is followed by a list of some
of the code points matched by it.  The list includes all the matched
characters in the 0-255 range, enclosed in the familiar [brackets] the same as
a regular expression bracketed character class.  Following that, the next few
higher matching ranges are also given.  To avoid visual ambiguity, the SPACE
character is represented as C<\x20>.

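The counts in the table can be reproduced by brute force.  The sketch below
walks every Unicode code point and tallies those matching
C<\p{ASCII_Hex_Digit}>, an entry the table lists as C<(22: [0-9A-Fa-f])>.  It
is a straightforward but slow illustration, not how the table itself is
generated, and the variable names are made up for this example.

```perl
use strict;
use warnings;

my $count  = 0;     # total code points matched
my $latin1 = '';    # matched characters in the 0-255 range

for my $cp (0 .. 0x10FFFF) {
    next unless chr($cp) =~ /\p{ASCII_Hex_Digit}/;
    $count++;
    $latin1 .= chr($cp) if $cp <= 0xFF;
}

print "matched $count code points; in 0-255: $latin1\n";
```
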
For emphasis, those properties that match no code points at all are listed as
well in a separate section following the table.

Most properties match the same code points regardless of whether C<"/i">
case-insensitive matching is specified or not.  But a few properties are
affected.  These are shown with the notation S<C<(/i= I<other_property>)>>
in the second column.  Under case-insensitive matching they match the
same code points as the property I<other_property>.

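As a hedged illustration, C<\p{Lu}> is assumed here to be one of the affected
properties: L<perlunicode> describes it as matching all cased letters under
C</i>.  The sketch checks that behaviour with a lowercase letter and a digit.

```perl
use strict;
use warnings;

# Without /i, a lowercase letter is not an uppercase letter.
my $plain = "a" =~ /\p{Lu}/  ? 1 : 0;

# With /i, \p{Lu} is assumed to behave like the cased-letter property.
my $fold  = "a" =~ /\p{Lu}/i ? 1 : 0;

# A digit is not a cased letter, so /i makes no difference for it.
my $digit = "1" =~ /\p{Lu}/i ? 1 : 0;

print "a ~ Lu: $plain, a ~ Lu under /i: $fold, 1 ~ Lu under /i: $digit\n";
```
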
There is no description given for most non-Perl defined properties (See
L<http://www.unicode.org/reports/tr44/> for that).

For compactness, 'B<*>' is used as a wildcard instead of showing all possible
combinations.  For example, entries like:

 \p{Gc: *}                                  \p{General_Category: *}

mean that 'Gc' is a synonym for 'General_Category', and anything that is valid
for the latter is also valid for the former.  Similarly,

 \p{Is_*}                                   \p{*}

means that if and only if, for example, C<\p{Foo}> exists, then
C<\p{Is_Foo}> and C<\p{IsFoo}> are also valid and all mean the same thing.
And similarly, C<\p{Foo=Bar}> means the same as C<\p{Is_Foo=Bar}> and
C<\p{IsFoo=Bar}>.  "*" here is restricted to something not beginning with an
underscore.

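The two wildcard entries above can be tried out as follows.  This is a small
sketch with illustrative sample characters; C<Lu> and C<Uppercase_Letter> are
assumed to be the short and long names of the same General_Category value.

```perl
use strict;
use warnings;

my $upper = "A";
my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# 'Gc' is a synonym for 'General_Category'.
my $gc_ok = grep { $upper =~ $_ } (
    qr/\p{General_Category: Lu}/,
    qr/\p{Gc: Lu}/,
    qr/\p{Gc=Uppercase_Letter}/,
);

# Since \p{Greek} exists, so do \p{Is_Greek} and \p{IsGreek}.
my $is_ok = grep { $alpha =~ $_ } (
    qr/\p{Greek}/,
    qr/\p{Is_Greek}/,
    qr/\p{IsGreek}/,
);

print "Gc spellings matching: $gc_ok, Is_ spellings matching: $is_ok\n";
```
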
Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'.
And 'No', 'F', and 'False' are all synonyms for 'N'.  The table shows 'Y*' and
'N*' to indicate this, and doesn't have separate entries for the other
possibilities.  Note that not all properties which have values 'Yes' and 'No'
are binary.  The non-binary ones have all their values spelled out without
using this wild card, and a C<NOT> clause in their description highlights
their not being binary.  They also require the compound form to match them,
whereas true binary properties have both single and compound forms available.

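For a binary property such as C<Alphabetic>, the synonyms can be checked like
this.  The sketch uses illustrative sample characters and a handful of the
possible spellings.

```perl
use strict;
use warnings;

# Spellings that should all mean "Alphabetic is true".
my @yes = (
    qr/\p{Alphabetic=Y}/,
    qr/\p{Alphabetic=Yes}/,
    qr/\p{Alpha=T}/,
    qr/\p{Alpha: True}/,
    qr/\p{Alpha}/,
);

# Spellings that should all mean "Alphabetic is false".
my @no = (
    qr/\p{Alphabetic=N}/,
    qr/\p{Alpha=No}/,
    qr/\p{Alpha=F}/,
    qr/\p{Alpha: False}/,
    qr/\P{Alpha}/,
);

my $letter_yes = grep { "A" =~ $_ } @yes;    # expect every pattern to match
my $letter_no  = grep { "A" =~ $_ } @no;     # expect none to match
my $digit_yes  = grep { "1" =~ $_ } @yes;    # expect none to match
my $digit_no   = grep { "1" =~ $_ } @no;     # expect every pattern to match

print "A: yes=$letter_yes no=$letter_no; 1: yes=$digit_yes no=$digit_no\n";
```
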
Note that all non-essential underscores are removed in the display of the
short names below.

B<Legend summary:>

=over 4

=item Z<>B<*> is a wild-card

=item B<(\d+)> in the info column gives the number of Unicode code points matched
by this property.

=item B<D> means this is deprecated.

=item B<O> means this is obsolete.

=item B<S> means this is stabilized.

=item B<T> means tighter (stricter) name matching applies.

=item B<X> means use of this form is discouraged, and may not be
stable.

=back

       NAME                           INFO

   \p{Adlam}               \p{Script_Extensions=Adlam} (Short:
                             \p{Adlm}; NOT \p{Block=Adlam}) (89)
   \p{Adlm}                \p{Adlam} (= \p{Script_Extensions=Adlam})
                             (NOT \p{Block=Adlam}) (89)
 X \p{Aegean_Numbers}      \p{Block=Aegean_Numbers} (64)
 T \p{Age: 1.1}            \p{Age=V1_1} (33_979)
   \p{Age: V1_1}           Code point's usage introduced in version
                             1.1 (33_979: U+0000..01F5, U+01FA..0217,
                             U+0250..02A8, U+02B0..02DE,
                             U+02E0..02E9, U+0300..0345 ...)
 T \p{Age: 2.0}            \p{Age=V2_0} (144_521)
   \p{Age: V2_0}           Code point's usage was introduced in
                             version 2.0; See also Property
                             'Present_In' (144_521: U+0591..05A1,
                             U+05A3..05AF, U+05C4, U+0F00..0F47,
                             U+0F49..0F69, U+0F71..0F8B ...)
 T \p{Age: 2.1}            \p{Age=V2_1} (2)
   \p{Age: V2_1}           Code point's usage was introduced in
                             version 2.1; See also Property
                             'Present_In' (2: U+20AC, U+FFFC)
 T \p{Age: 3.0}            \p{Age=V3_0} (10_307)
   \p{Age: V3_0}           Code point's usage was introduced in
                             version 3.0; See also Property
                             'Present_In' (10_307: U+01F6..01F9,
                             U+0218..021F, U+0222..0233,
                             U+02A9..02AD, U+02DF, U+02EA..02EE ...)
 T \p{Age: 3.1}            \p{Age=V3_1} (44_978)
   \p{Age: V3_1}           Code point's usage was introduced in
                             version 3.1; See also Property
                             'Present_In' (44_978: U+03F4..03F5,
                             U+FDD0..FDEF, U+10300..1031E,
                             U+10320..10323, U+10330..1034A,
                             U+10400..10425 ...)
 T \p{Age: 3.2}            \p{Age=V3_2} (1016)
   \p{Age: V3_2}           Code point's usage was introduced in
                             version 3.2; See also Property
                             'Present_In' (1016: U+0220, U+034F,
                             U+0363..036F, U+03D8..03D9, U+03F6,
                             U+048A..048B ...)
 T \p{Age: 4.0}            \p{Age=V4_0} (1226)
   \p{Age: V4_0}           Code point's usage was introduced in
                             version 4.0; See also Property
                             'Present_In' (1226: U+0221,
                             U+0234..0236, U+02AE..02AF,
                             U+02EF..02FF, U+0350..0357, U+035D..035F
                             ...)
 T \p{Age: 4.1}            \p{Age=V4_1} (1273)
   \p{Age: V4_1}           Code point's usage was introduced in
                             version 4.1; See also Property
                             'Present_In' (1273: U+0237..0241,
                             U+0358..035C, U+03FC..03FF,
                             U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
 T \p{Age: 5.0}            \p{Age=V5_0} (1369)
   \p{Age: V5_0}           Code point's usage was introduced in
                             version 5.0; See also Property
                             'Present_In' (1369: U+0242..024F,
                             U+037B..037D, U+04CF, U+04FA..04FF,
                             U+0510..0513, U+05BA ...)
 T \p{Age: 5.1}            \p{Age=V5_1} (1624)
   \p{Age: V5_1}           Code point's usage was introduced in
                             version 5.1; See also Property
                             'Present_In' (1624: U+0370..0373,
                             U+0376..0377, U+03CF, U+0487,
                             U+0514..0523, U+0606..060A ...)
 T \p{Age: 5.2}            \p{Age=V5_2} (6648)
   \p{Age: V5_2}           Code point's usage was introduced in
                             version 5.2; See also Property
                             'Present_In' (6648: U+0524..0525,
                             U+0800..082D, U+0830..083E, U+0900,
                             U+094E, U+0955 ...)
 T \p{Age: 6.0}            \p{Age=V6_0} (2088)
   \p{Age: V6_0}           Code point's usage was introduced in
                             version 6.0; See also Property
                             'Present_In' (2088: U+0526..0527,
                             U+0620, U+065F, U+0840..085B, U+085E,
                             U+093A..093B ...)
 T \p{Age: 6.1}            \p{Age=V6_1} (732)
   \p{Age: V6_1}           Code point's usage was introduced in
                             version 6.1; See also Property
                             'Present_In' (732: U+058F, U+0604,
                             U+08A0, U+08A2..08AC, U+08E4..08FE,
                             U+0AF0 ...)
 T \p{Age: 6.2}            \p{Age=V6_2} (1)
   \p{Age: V6_2}           Code point's usage was introduced in
                             version 6.2; See also Property
                             'Present_In' (1: U+20BA)
 T \p{Age: 6.3}            \p{Age=V6_3} (5)
   \p{Age: V6_3}           Code point's usage was introduced in
                             version 6.3; See also Property
                             'Present_In' (5: U+061C, U+2066..2069)
 T \p{Age: 7.0}            \p{Age=V7_0} (2834)
   \p{Age: V7_0}           Code point's usage was introduced in
                             version 7.0; See also Property
                             'Present_In' (2834: U+037F,
                             U+0528..052F, U+058D..058E, U+0605,
                             U+08A1, U+08AD..08B2 ...)
 T \p{Age: 8.0}            \p{Age=V8_0} (7716)
   \p{Age: V8_0}           Code point's usage was introduced in
                             version 8.0; See also Property
                             'Present_In' (7716: U+08B3..08B4,
                             U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
                             ...)
 T \p{Age: 9.0}            \p{Age=V9_0} (7500)
   \p{Age: V9_0}           Code point's usage was introduced in
                             version 9.0; See also Property
                             'Present_In' (7500: U+08B6..08BD,
                             U+08D4..08E2, U+0C80, U+0D4F,
                             U+0D54..0D56, U+0D58..0D5E ...)
 T \p{Age: 10.0}           \p{Age=V10_0} (8518)
   \p{Age: V10_0}          Code point's usage was introduced in
                             version 10.0; See also Property
                             'Present_In' (8518: U+0860..086A,
                             U+09FC..09FD, U+0AFA..0AFF, U+0D00,
                             U+0D3B..0D3C, U+1CF7 ...)
 T \p{Age: 11.0}           \p{Age=V11_0} (684)
   \p{Age: V11_0}          Code point's usage was introduced in
                             version 11.0; See also Property
                             'Present_In' (684: U+0560, U+0588,
                             U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
 T \p{Age: 12.0}           \p{Age=V12_0} (554)
   \p{Age: V12_0}          Code point's usage was introduced in
                             version 12.0; See also Property
                             'Present_In' (554: U+0C77, U+0E86,
                             U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
 T \p{Age: 12.1}           \p{Age=V12_1} (1)
   \p{Age: V12_1}          Code point's usage was introduced in
                             version 12.1; See also Property
                             'Present_In' (1: U+32FF)
   \p{Age: NA}             \p{Age=Unassigned} (836_536 plus all
                             above-Unicode code points)
   \p{Age: Unassigned}     Code point's usage has not been assigned
                             in any Unicode release thus far. (Short:
                             \p{Age=NA}) (836_536 plus all above-
                             Unicode code points: U+0378..0379,
                             U+0380..0383, U+038B, U+038D, U+03A2,
                             U+0530 ...)
   \p{Aghb}                \p{Caucasian_Albanian} (=
                             \p{Script_Extensions=
                             Caucasian_Albanian}) (NOT \p{Block=
                             Caucasian_Albanian}) (53)
   \p{AHex}                \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                             (22)
   \p{AHex: *}             \p{ASCII_Hex_Digit: *}
   \p{Ahom}                \p{Script_Extensions=Ahom} (NOT \p{Block=
                             Ahom}) (58)
 X \p{Alchemical}          \p{Alchemical_Symbols} (= \p{Block=
                             Alchemical_Symbols}) (128)
 X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
                             \p{InAlchemical}) (128)
   \p{All}                 All code points, including those above
                             Unicode.  Same as qr/./s (1_114_112 plus
                             all above-Unicode code points:
                             U+0000..infinity)
   \p{Alnum}               \p{XPosixAlnum} (127_886)
   \p{Alpha}               \p{XPosixAlpha} (= \p{Alphabetic=Y})
                             (127_256)
   \p{Alpha: *}            \p{Alphabetic: *}
   \p{Alphabetic}          \p{XPosixAlpha} (= \p{Alphabetic=Y})
                             (127_256)
   \p{Alphabetic: N*}      (Short: \p{Alpha=N}, \P{Alpha}) (986_856
                             plus all above-Unicode code points:
                             [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                             >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
                             \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
                             U+02C2..02C5, U+02D2..02DF,
                             U+02E5..02EB, U+02ED, U+02EF..0344 ...)
   \p{Alphabetic: Y*}      (Short: \p{Alpha=Y}, \p{Alpha}) (127_256:
                             [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
                             \xf6\xf8-\xff], U+0100..02C1,
                             U+02C6..02D1, U+02E0..02E4, U+02EC,
                             U+02EE ...)
 X \p{Alphabetic_PF}       \p{Alphabetic_Presentation_Forms} (=
                             \p{Block=Alphabetic_Presentation_Forms})
                             (80)
 X \p{Alphabetic_Presentation_Forms} \p{Block=
                             Alphabetic_Presentation_Forms} (Short:
                             \p{InAlphabeticPF}) (80)
   \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
                             Anatolian_Hieroglyphs} (Short: \p{Hluw};
                             NOT \p{Block=Anatolian_Hieroglyphs})
                             (583)
 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
                             \p{Block=
                             Ancient_Greek_Musical_Notation}) (80)
 X \p{Ancient_Greek_Musical_Notation} \p{Block=
                             Ancient_Greek_Musical_Notation} (Short:
                             \p{InAncientGreekMusic}) (80)
 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
 X \p{Ancient_Symbols}     \p{Block=Ancient_Symbols} (64)
   \p{Any}                 All Unicode code points (1_114_112:
                             U+0000..10FFFF)
   \p{Arab}                \p{Arabic} (= \p{Script_Extensions=
                             Arabic}) (NOT \p{Block=Arabic}) (1325)
   \p{Arabic}              \p{Script_Extensions=Arabic} (Short:
                             \p{Arab}; NOT \p{Block=Arabic}) (1325)
 X \p{Arabic_Ext_A}        \p{Arabic_Extended_A} (= \p{Block=
                             Arabic_Extended_A}) (96)
 X \p{Arabic_Extended_A}   \p{Block=Arabic_Extended_A} (Short:
                             \p{InArabicExtA}) (96)
 X \p{Arabic_Math}         \p{Arabic_Mathematical_Alphabetic_Symbols}
                             (= \p{Block=
                             Arabic_Mathematical_Alphabetic_Symbols})
                             (256)
 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
                             Arabic_Mathematical_Alphabetic_Symbols}
                             (Short: \p{InArabicMath}) (256)
 X \p{Arabic_PF_A}         \p{Arabic_Presentation_Forms_A} (=
                             \p{Block=Arabic_Presentation_Forms_A})
                             (688)
 X \p{Arabic_PF_B}         \p{Arabic_Presentation_Forms_B} (=
                             \p{Block=Arabic_Presentation_Forms_B})
                             (144)
 X \p{Arabic_Presentation_Forms_A} \p{Block=
                             Arabic_Presentation_Forms_A} (Short:
                             \p{InArabicPFA}) (688)
 X \p{Arabic_Presentation_Forms_B} \p{Block=
                             Arabic_Presentation_Forms_B} (Short:
                             \p{InArabicPFB}) (144)
 X \p{Arabic_Sup}          \p{Arabic_Supplement} (= \p{Block=
                             Arabic_Supplement}) (48)
 X \p{Arabic_Supplement}   \p{Block=Arabic_Supplement} (Short:
                             \p{InArabicSup}) (48)
   \p{Armenian}            \p{Script_Extensions=Armenian} (Short:
                             \p{Armn}; NOT \p{Block=Armenian}) (96)
   \p{Armi}                \p{Imperial_Aramaic} (=
                             \p{Script_Extensions=Imperial_Aramaic})
                             (NOT \p{Block=Imperial_Aramaic}) (31)
   \p{Armn}                \p{Armenian} (= \p{Script_Extensions=
                             Armenian}) (NOT \p{Block=Armenian}) (96)
 X \p{Arrows}              \p{Block=Arrows} (112)
   \p{ASCII}               \p{Block=Basic_Latin} (128)
   \p{ASCII_Hex_Digit}     \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
                             (22)
   \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
                             plus all above-Unicode code points:
                             [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
                             >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
                             U+0100..infinity)
   \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
                             Fa-f])
   \p{Assigned}            All assigned code points (277_510:
                             U+0000..0377, U+037A..037F,
                             U+0384..038A, U+038C, U+038E..03A1,
                             U+03A3..052F ...)
   \p{Avestan}             \p{Script_Extensions=Avestan} (Short:
                             \p{Avst}; NOT \p{Block=Avestan}) (61)
   \p{Avst}                \p{Avestan} (= \p{Script_Extensions=
                             Avestan}) (NOT \p{Block=Avestan}) (61)
   \p{Bali}                \p{Balinese} (= \p{Script_Extensions=
                             Balinese}) (NOT \p{Block=Balinese}) (121)
   \p{Balinese}            \p{Script_Extensions=Balinese} (Short:
                             \p{Bali}; NOT \p{Block=Balinese}) (121)
   \p{Bamu}                \p{Bamum} (= \p{Script_Extensions=Bamum})
                             (NOT \p{Block=Bamum}) (657)
   \p{Bamum}               \p{Script_Extensions=Bamum} (Short:
                             \p{Bamu}; NOT \p{Block=Bamum}) (657)
 X \p{Bamum_Sup}           \p{Bamum_Supplement} (= \p{Block=
                             Bamum_Supplement}) (576)
 X \p{Bamum_Supplement}    \p{Block=Bamum_Supplement} (Short:
                             \p{InBamumSup}) (576)
 X \p{Basic_Latin}         \p{ASCII} (= \p{Block=Basic_Latin}) (128)
   \p{Bass}                \p{Bassa_Vah} (= \p{Script_Extensions=
                             Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
                             (36)
   \p{Bassa_Vah}           \p{Script_Extensions=Bassa_Vah} (Short:
                             \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
   \p{Batak}               \p{Script_Extensions=Batak} (Short:
                             \p{Batk}; NOT \p{Block=Batak}) (56)
   \p{Batk}                \p{Batak} (= \p{Script_Extensions=Batak})
                             (NOT \p{Block=Batak}) (56)
   \p{Bc: *}               \p{Bidi_Class: *}
   \p{Beng}                \p{Bengali} (= \p{Script_Extensions=
                             Bengali}) (NOT \p{Block=Bengali}) (113)
   \p{Bengali}             \p{Script_Extensions=Bengali} (Short:
                             \p{Beng}; NOT \p{Block=Bengali}) (113)
   \p{Bhaiksuki}           \p{Script_Extensions=Bhaiksuki} (Short:
                             \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
   \p{Bhks}                \p{Bhaiksuki} (= \p{Script_Extensions=
                             Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
                             (97)
   \p{Bidi_C}              \p{Bidi_Control} (= \p{Bidi_Control=Y})
                             (12)
   \p{Bidi_C: *}           \p{Bidi_Control: *}
   \p{Bidi_Class: AL}      \p{Bidi_Class=Arabic_Letter} (1698)
   \p{Bidi_Class: AN}      \p{Bidi_Class=Arabic_Number} (61)
   \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1698: U+0608,
                             U+060B, U+060D, U+061B..064A,
                             U+066D..066F, U+0671..06D5 ...)
   \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (61:
                             U+0600..0605, U+0660..0669,
                             U+066B..066C, U+06DD, U+08E2,
                             U+10D30..10D39 ...)
   \p{Bidi_Class: B}       \p{Bidi_Class=Paragraph_Separator} (7)
   \p{Bidi_Class: BN}      \p{Bidi_Class=Boundary_Neutral} (4016)
   \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016:
                             [^\t\n\cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-
                             \xff], U+180E, U+200B..200D,
                             U+2060..2065, U+206A..206F, U+FDD0..FDEF
                             ...)
   \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15:
                             [,.\/:\xa0], U+060C, U+202F, U+2044,
                             U+FE50, U+FE52 ...)
   \p{Bidi_Class: CS}      \p{Bidi_Class=Common_Separator} (15)
   \p{Bidi_Class: EN}      \p{Bidi_Class=European_Number} (158)
   \p{Bidi_Class: ES}      \p{Bidi_Class=European_Separator} (12)
   \p{Bidi_Class: ET}      \p{Bidi_Class=European_Terminator} (92)
   \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (158: [0-9\xb2-
                             \xb3\xb9], U+06F0..06F9, U+2070,
                             U+2074..2079, U+2080..2089, U+2488..249B
                             ...)
   \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
                             U+207A..207B, U+208A..208B, U+2212,
                             U+FB29, U+FE62..FE63 ...)
   \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92:
                             [#\$\%\xa2-\xa5\xb0-\xb1], U+058F,
                             U+0609..060A, U+066A, U+09F2..09F3,
                             U+09FB ...)
   \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
                             U+2068)
   \p{Bidi_Class: FSI}     \p{Bidi_Class=First_Strong_Isolate} (1)
   \p{Bidi_Class: L}       \p{Bidi_Class=Left_To_Right} (1_096_767
                             plus all above-Unicode code points)
   \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_767 plus
                             all above-Unicode code points: [A-Za-
                             z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                             \xff], U+0100..02B8, U+02BB..02C1,
                             U+02D0..02D1, U+02E0..02E4, U+02EE ...)
   \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
                             U+202A)
   \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
                             U+2066)
   \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
                             U+202D)
   \p{Bidi_Class: LRE}     \p{Bidi_Class=Left_To_Right_Embedding} (1)
   \p{Bidi_Class: LRI}     \p{Bidi_Class=Left_To_Right_Isolate} (1)
   \p{Bidi_Class: LRO}     \p{Bidi_Class=Left_To_Right_Override} (1)
   \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1834:
                             U+0300..036F, U+0483..0489,
                             U+0591..05BD, U+05BF, U+05C1..05C2,
                             U+05C4..05C5 ...)
   \p{Bidi_Class: NSM}     \p{Bidi_Class=Nonspacing_Mark} (1834)
   \p{Bidi_Class: ON}      \p{Bidi_Class=Other_Neutral} (5658)
   \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (5658:
                             [!\"&\'\(\)*;<=
                             >?\@\[\\\]\^_`\{\|\}~\xa1\xa6-\xa9\xab-
623                             \xac\xae-\xaf\xb4\xb6-\xb8\xbb-
624                             \xbf\xd7\xf7], U+02B9..02BA,
625                             U+02C2..02CF, U+02D2..02DF,
626                             U+02E5..02ED, U+02EF..02FF ...)
627   \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7:
628                             [\n\r\x1c-\x1e\x85], U+2029)
629   \p{Bidi_Class: PDF}     \p{Bidi_Class=Pop_Directional_Format} (1)
630   \p{Bidi_Class: PDI}     \p{Bidi_Class=Pop_Directional_Isolate} (1)
631   \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
632                             U+202C)
633   \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
634                             U+2069)
635   \p{Bidi_Class: R}       \p{Bidi_Class=Right_To_Left} (3765)
636   \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3765: U+0590,
637                             U+05BE, U+05C0, U+05C3, U+05C6,
638                             U+05C8..05FF ...)
639   \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
640                             U+202B)
641   \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
642                             U+2067)
643   \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
644                             U+202E)
645   \p{Bidi_Class: RLE}     \p{Bidi_Class=Right_To_Left_Embedding} (1)
646   \p{Bidi_Class: RLI}     \p{Bidi_Class=Right_To_Left_Isolate} (1)
647   \p{Bidi_Class: RLO}     \p{Bidi_Class=Right_To_Left_Override} (1)
648   \p{Bidi_Class: S}       \p{Bidi_Class=Segment_Separator} (3)
649   \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3:
650                             [\t\cK\x1f])
651   \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
652                             U+1680, U+2000..200A, U+2028, U+205F,
653                             U+3000)
654   \p{Bidi_Class: WS}      \p{Bidi_Class=White_Space} (17)
655   \p{Bidi_Control}        \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
656   \p{Bidi_Control: N*}    (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
657                             plus all above-Unicode code points:
658                             U+0000..061B, U+061D..200D,
659                             U+2010..2029, U+202F..2065,
660                             U+206A..infinity)
661   \p{Bidi_Control: Y*}    (Short: \p{BidiC=Y}, \p{BidiC}) (12:
662                             U+061C, U+200E..200F, U+202A..202E,
663                             U+2066..2069)
664   \p{Bidi_M}              \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
665                             (545)
666   \p{Bidi_M: *}           \p{Bidi_Mirrored: *}
667   \p{Bidi_Mirrored}       \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
668                             (545)
669   \p{Bidi_Mirrored: N*}   (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567
670                             plus all above-Unicode code points:
671                             [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
672                             Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
673                             \xff], U+0100..0F39, U+0F3E..169A,
674                             U+169D..2038, U+203B..2044, U+2047..207C
675                             ...)
676   \p{Bidi_Mirrored: Y*}   (Short: \p{BidiM=Y}, \p{BidiM}) (545:
677                             [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
678                             U+169B..169C, U+2039..203A,
679                             U+2045..2046, U+207D..207E ...)
680   \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
681                             (60)
682   \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (60:
683                             [\)\]\}], U+0F3B, U+0F3D, U+169C,
684                             U+2046, U+207E ...)
685   \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
686                             (1_113_992 plus all above-Unicode code
687                             points)
688   \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_992
689                             plus all above-Unicode code points:
690                             [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=
691                             >?\@A-Z\\\^_`a-z\|~\x7f-\xff],
692                             U+0100..0F39, U+0F3E..169A,
693                             U+169D..2044, U+2047..207C, U+207F..208C
694                             ...)
695   \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
696                             (60)
697   \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (60:
698                             [\(\[\{], U+0F3A, U+0F3C, U+169B,
699                             U+2045, U+207D ...)
700   \p{Blank}               \p{XPosixBlank} (18)
701   \p{Blk: *}              \p{Block: *}
702   \p{Block: Adlam}        (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
703                             U+1E900..1E95F)
704   \p{Block: Aegean_Numbers} (64: U+10100..1013F)
705   \p{Block: Ahom}         (NOT \p{Ahom} NOR \p{Is_Ahom}) (64:
706                             U+11700..1173F)
707   \p{Block: Alchemical}   \p{Block=Alchemical_Symbols} (128)
708   \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
709                             U+1F700..1F77F)
710   \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
711                             (80)
712   \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
713                             AlphabeticPF}) (80: U+FB00..FB4F)
714   \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
715                             NOR \p{Is_Anatolian_Hieroglyphs}) (640:
716                             U+14400..1467F)
717   \p{Block: Ancient_Greek_Music} \p{Block=
718                             Ancient_Greek_Musical_Notation} (80)
719   \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
720                             AncientGreekMusic}) (80: U+1D200..1D24F)
721   \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
722   \p{Block: Ancient_Symbols} (64: U+10190..101CF)
723   \p{Block: Arabic}       (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
724                             U+0600..06FF)
725   \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
726   \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
727                             U+08A0..08FF)
728   \p{Block: Arabic_Math}  \p{Block=
729                             Arabic_Mathematical_Alphabetic_Symbols}
730                             (256)
731   \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
732                             ArabicMath}) (256: U+1EE00..1EEFF)
733   \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
734   \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
735   \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
736                             (688: U+FB50..FDFF)
737   \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
738                             (144: U+FE70..FEFF)
739   \p{Block: Arabic_Sup}   \p{Block=Arabic_Supplement} (48)
740   \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
741                             U+0750..077F)
742   \p{Block: Armenian}     (NOT \p{Armenian} NOR \p{Is_Armenian})
743                             (96: U+0530..058F)
744   \p{Block: Arrows}       (112: U+2190..21FF)
745   \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
746   \p{Block: Avestan}      (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
747                             U+10B00..10B3F)
748   \p{Block: Balinese}     (NOT \p{Balinese} NOR \p{Is_Balinese})
749                             (128: U+1B00..1B7F)
750   \p{Block: Bamum}        (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
751                             U+A6A0..A6FF)
752   \p{Block: Bamum_Sup}    \p{Block=Bamum_Supplement} (576)
753   \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
754                             U+16800..16A3F)
755   \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
756   \p{Block: Bassa_Vah}    (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
757                             (48: U+16AD0..16AFF)
758   \p{Block: Batak}        (NOT \p{Batak} NOR \p{Is_Batak}) (64:
759                             U+1BC0..1BFF)
760   \p{Block: Bengali}      (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
761                             U+0980..09FF)
762   \p{Block: Bhaiksuki}    (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
763                             (112: U+11C00..11C6F)
764   \p{Block: Block_Elements} (32: U+2580..259F)
765   \p{Block: Bopomofo}     (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
766                             (48: U+3100..312F)
767   \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
768   \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
769                             U+31A0..31BF)
770   \p{Block: Box_Drawing}  (128: U+2500..257F)
771   \p{Block: Brahmi}       (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
772                             U+11000..1107F)
773   \p{Block: Braille}      \p{Block=Braille_Patterns} (256)
774   \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
775                             U+2800..28FF)
776   \p{Block: Buginese}     (NOT \p{Buginese} NOR \p{Is_Buginese})
777                             (32: U+1A00..1A1F)
778   \p{Block: Buhid}        (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
779                             U+1740..175F)
780   \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
781                             (256)
782   \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
783                             ByzantineMusic}) (256: U+1D000..1D0FF)
784   \p{Block: Canadian_Syllabics} \p{Block=
785                             Unified_Canadian_Aboriginal_Syllabics}
786                             (640)
787   \p{Block: Carian}       (NOT \p{Carian} NOR \p{Is_Carian}) (64:
788                             U+102A0..102DF)
789   \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
790                             \p{Is_Caucasian_Albanian}) (64:
791                             U+10530..1056F)
792   \p{Block: Chakma}       (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
793                             U+11100..1114F)
794   \p{Block: Cham}         (NOT \p{Cham} NOR \p{Is_Cham}) (96:
795                             U+AA00..AA5F)
796   \p{Block: Cherokee}     (NOT \p{Cherokee} NOR \p{Is_Cherokee})
797                             (96: U+13A0..13FF)
798   \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
799   \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
800                             U+AB70..ABBF)
801   \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
802   \p{Block: CJK}          \p{Block=CJK_Unified_Ideographs} (20_992)
803   \p{Block: CJK_Compat}   \p{Block=CJK_Compatibility} (256)
804   \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
805   \p{Block: CJK_Compat_Ideographs} \p{Block=
806                             CJK_Compatibility_Ideographs} (512)
807   \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
808                             CJK_Compatibility_Ideographs_Supplement}
809                             (544)
810   \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
811                             U+3300..33FF)
812   \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
813                             (32: U+FE30..FE4F)
814   \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
815                             CJKCompatIdeographs}) (512: U+F900..FAFF)
816   \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
817                             CJKCompatIdeographsSup}) (544:
818                             U+2F800..2FA1F)
819   \p{Block: CJK_Ext_A}    \p{Block=
820                             CJK_Unified_Ideographs_Extension_A}
821                             (6592)
822   \p{Block: CJK_Ext_B}    \p{Block=
823                             CJK_Unified_Ideographs_Extension_B}
824                             (42_720)
825   \p{Block: CJK_Ext_C}    \p{Block=
826                             CJK_Unified_Ideographs_Extension_C}
827                             (4160)
828   \p{Block: CJK_Ext_D}    \p{Block=
829                             CJK_Unified_Ideographs_Extension_D} (224)
830   \p{Block: CJK_Ext_E}    \p{Block=
831                             CJK_Unified_Ideographs_Extension_E}
832                             (5776)
833   \p{Block: CJK_Ext_F}    \p{Block=
834                             CJK_Unified_Ideographs_Extension_F}
835                             (7488)
836   \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
837   \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
838                             (128: U+2E80..2EFF)
839   \p{Block: CJK_Strokes}  (48: U+31C0..31EF)
840   \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
841   \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
842                             (64: U+3000..303F)
843   \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
844                             U+4E00..9FFF)
845   \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
846                             CJKExtA}) (6592: U+3400..4DBF)
847   \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
848                             CJKExtB}) (42_720: U+20000..2A6DF)
849   \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
850                             CJKExtC}) (4160: U+2A700..2B73F)
851   \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
852                             CJKExtD}) (224: U+2B740..2B81F)
853   \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
854                             CJKExtE}) (5776: U+2B820..2CEAF)
855   \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
856                             CJKExtF}) (7488: U+2CEB0..2EBEF)
857   \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
858                             Diacriticals}) (112: U+0300..036F)
859   \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
860                             DiacriticalsExt}) (80: U+1AB0..1AFF)
861   \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
862                             DiacriticalsForSymbols}) (48:
863                             U+20D0..20FF)
864   \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
865                             DiacriticalsSup}) (64: U+1DC0..1DFF)
866   \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
867                             U+FE20..FE2F)
868   \p{Block: Combining_Marks_For_Symbols} \p{Block=
869                             Combining_Diacritical_Marks_For_Symbols}
870                             (48)
871   \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
872                             IndicNumberForms}) (16: U+A830..A83F)
873   \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
874   \p{Block: Control_Pictures} (64: U+2400..243F)
875   \p{Block: Coptic}       (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
876                             U+2C80..2CFF)
877   \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
878   \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
879   \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
880                             U+1D360..1D37F)
881   \p{Block: Cuneiform}    (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
882                             (1024: U+12000..123FF)
883   \p{Block: Cuneiform_Numbers} \p{Block=
884                             Cuneiform_Numbers_And_Punctuation} (128)
885   \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
886                             CuneiformNumbers}) (128: U+12400..1247F)
887   \p{Block: Currency_Symbols} (48: U+20A0..20CF)
888   \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
889   \p{Block: Cyrillic}     (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
890                             (256: U+0400..04FF)
891   \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
892   \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
893   \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
894   \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
895                             U+2DE0..2DFF)
896   \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
897                             U+A640..A69F)
898   \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
899                             U+1C80..1C8F)
900   \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
901   \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
902                             U+0500..052F)
903   \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
904                             (48)
905   \p{Block: Deseret}      (80: U+10400..1044F)
906   \p{Block: Devanagari}   (NOT \p{Devanagari} NOR \p{Is_Devanagari})
907                             (128: U+0900..097F)
908   \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
909   \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
910                             U+A8E0..A8FF)
911   \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
912   \p{Block: Diacriticals_Ext} \p{Block=
913                             Combining_Diacritical_Marks_Extended}
914                             (80)
915   \p{Block: Diacriticals_For_Symbols} \p{Block=
916                             Combining_Diacritical_Marks_For_Symbols}
917                             (48)
918   \p{Block: Diacriticals_Sup} \p{Block=
919                             Combining_Diacritical_Marks_Supplement}
920                             (64)
921   \p{Block: Dingbats}     (192: U+2700..27BF)
922   \p{Block: Dogra}        (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
923                             U+11800..1184F)
924   \p{Block: Domino}       \p{Block=Domino_Tiles} (112)
925   \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
926                             U+1F030..1F09F)
927   \p{Block: Duployan}     (NOT \p{Duployan} NOR \p{Is_Duployan})
928                             (160: U+1BC00..1BC9F)
929   \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
930   \p{Block: Egyptian_Hieroglyph_Format_Controls} (16: U+13430..1343F)
931   \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
932                             \p{Is_Egyptian_Hieroglyphs}) (1072:
933                             U+13000..1342F)
934   \p{Block: Elbasan}      (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
935                             U+10500..1052F)
936   \p{Block: Elymaic}      (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
937                             U+10FE0..10FFF)
938   \p{Block: Emoticons}    (80: U+1F600..1F64F)
939   \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
940   \p{Block: Enclosed_Alphanum_Sup} \p{Block=
941                             Enclosed_Alphanumeric_Supplement} (256)
942   \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
943                             EnclosedAlphanumSup}) (256:
944                             U+1F100..1F1FF)
945   \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
946                             EnclosedAlphanum}) (160: U+2460..24FF)
947   \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
948                             (256)
949   \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
950                             EnclosedCJK}) (256: U+3200..32FF)
951   \p{Block: Enclosed_Ideographic_Sup} \p{Block=
952                             Enclosed_Ideographic_Supplement} (256)
953   \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
954                             EnclosedIdeographicSup}) (256:
955                             U+1F200..1F2FF)
956   \p{Block: Ethiopic}     (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
957                             (384: U+1200..137F)
958   \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
959   \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
960   \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
961                             U+2D80..2DDF)
962   \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
963                             U+AB00..AB2F)
964   \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
965   \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
966                             U+1380..139F)
967   \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
968                             \p{Punct} NOR \p{Is_Punctuation}) (112:
969                             U+2000..206F)
970   \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
971   \p{Block: Geometric_Shapes_Ext} \p{Block=
972                             Geometric_Shapes_Extended} (128)
973   \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
974                             GeometricShapesExt}) (128:
975                             U+1F780..1F7FF)
976   \p{Block: Georgian}     (NOT \p{Georgian} NOR \p{Is_Georgian})
977                             (96: U+10A0..10FF)
978   \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
979   \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
980                             U+1C90..1CBF)
981   \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
982   \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
983                             U+2D00..2D2F)
984   \p{Block: Glagolitic}   (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
985                             (96: U+2C00..2C5F)
986   \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
987   \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
988                             (48: U+1E000..1E02F)
989   \p{Block: Gothic}       (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
990                             U+10330..1034F)
991   \p{Block: Grantha}      (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
992                             U+11300..1137F)
993   \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
994                             NOR \p{Is_Greek}) (144)
995   \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
996                             NOR \p{Is_Greek}) (144: U+0370..03FF)
997   \p{Block: Greek_Ext}    \p{Block=Greek_Extended} (256)
998   \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
999                             U+1F00..1FFF)
1000   \p{Block: Gujarati}     (NOT \p{Gujarati} NOR \p{Is_Gujarati})
1001                             (128: U+0A80..0AFF)
1002   \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
1003                             \p{Is_Gunjala_Gondi}) (80:
1004                             U+11D60..11DAF)
1005   \p{Block: Gurmukhi}     (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
1006                             (128: U+0A00..0A7F)
1007   \p{Block: Half_And_Full_Forms} \p{Block=
1008                             Halfwidth_And_Fullwidth_Forms} (240)
1009   \p{Block: Half_Marks}   \p{Block=Combining_Half_Marks} (16)
1010   \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
1011                             HalfAndFullForms}) (240: U+FF00..FFEF)
1012   \p{Block: Hangul}       \p{Block=Hangul_Syllables} (NOT \p{Hangul}
1013                             NOR \p{Is_Hangul}) (11_184)
1014   \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
1015                             (96: U+3130..318F)
1016   \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
1017   \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
1018                             U+A960..A97F)
1019   \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
1020                             U+D7B0..D7FF)
1021   \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
1022                             NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
1023   \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
1024                             \p{Is_Hanifi_Rohingya}) (64:
1025                             U+10D00..10D3F)
1026   \p{Block: Hanunoo}      (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
1027                             U+1720..173F)
1028   \p{Block: Hatran}       (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
1029                             U+108E0..108FF)
1030   \p{Block: Hebrew}       (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
1031                             U+0590..05FF)
1032   \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
1033                             HighPUSurrogates}) (128: U+DB80..DBFF)
1034   \p{Block: High_PU_Surrogates} \p{Block=
1035                             High_Private_Use_Surrogates} (128)
1036   \p{Block: High_Surrogates} (896: U+D800..DB7F)
1037   \p{Block: Hiragana}     (NOT \p{Hiragana} NOR \p{Is_Hiragana})
1038                             (96: U+3040..309F)
1039   \p{Block: IDC}          \p{Block=
1040                             Ideographic_Description_Characters} (NOT
1041                             \p{ID_Continue} NOR \p{Is_IDC}) (16)
1042   \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
1043                             NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
1044                             U+2FF0..2FFF)
1045   \p{Block: Ideographic_Symbols} \p{Block=
1046                             Ideographic_Symbols_And_Punctuation} (32)
1047   \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
1048                             IdeographicSymbols}) (32: U+16FE0..16FFF)
1049   \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
1050                             \p{Is_Imperial_Aramaic}) (32:
1051                             U+10840..1085F)
1052   \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1053                             (16)
1054   \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
1055   \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
1056                             NOR \p{Is_Inscriptional_Pahlavi}) (32:
1057                             U+10B60..10B7F)
1058   \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
1059                             NOR \p{Is_Inscriptional_Parthian}) (32:
1060                             U+10B40..10B5F)
1061   \p{Block: IPA_Ext}      \p{Block=IPA_Extensions} (96)
1062   \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1063                             U+0250..02AF)
1064   \p{Block: Jamo}         \p{Block=Hangul_Jamo} (256)
1065   \p{Block: Jamo_Ext_A}   \p{Block=Hangul_Jamo_Extended_A} (32)
1066   \p{Block: Jamo_Ext_B}   \p{Block=Hangul_Jamo_Extended_B} (80)
1067   \p{Block: Javanese}     (NOT \p{Javanese} NOR \p{Is_Javanese})
1068                             (96: U+A980..A9DF)
1069   \p{Block: Kaithi}       (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1070                             U+11080..110CF)
1071   \p{Block: Kana_Ext_A}   \p{Block=Kana_Extended_A} (48)
1072   \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1073                             U+1B100..1B12F)
1074   \p{Block: Kana_Sup}     \p{Block=Kana_Supplement} (256)
1075   \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1076                             U+1B000..1B0FF)
1077   \p{Block: Kanbun}       (16: U+3190..319F)
1078   \p{Block: Kangxi}       \p{Block=Kangxi_Radicals} (224)
1079   \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1080                             U+2F00..2FDF)
1081   \p{Block: Kannada}      (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1082                             U+0C80..0CFF)
1083   \p{Block: Katakana}     (NOT \p{Katakana} NOR \p{Is_Katakana})
1084                             (96: U+30A0..30FF)
1085   \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
  \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
                            KatakanaExt}) (16: U+31F0..31FF)
  \p{Block: Kayah_Li}     (48: U+A900..A92F)
  \p{Block: Kharoshthi}   (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
                            (96: U+10A00..10A5F)
  \p{Block: Khmer}        (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
                            U+1780..17FF)
  \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
  \p{Block: Khojki}       (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
                            U+11200..1124F)
  \p{Block: Khudawadi}    (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
                            (80: U+112B0..112FF)
  \p{Block: Lao}          (NOT \p{Lao} NOR \p{Is_Lao}) (128:
                            U+0E80..0EFF)
  \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
  \p{Block: Latin_1_Sup}  \p{Block=Latin_1_Supplement} (128)
  \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
                            \xff])
  \p{Block: Latin_Ext_A}  \p{Block=Latin_Extended_A} (128)
  \p{Block: Latin_Ext_Additional} \p{Block=
                            Latin_Extended_Additional} (256)
  \p{Block: Latin_Ext_B}  \p{Block=Latin_Extended_B} (208)
  \p{Block: Latin_Ext_C}  \p{Block=Latin_Extended_C} (32)
  \p{Block: Latin_Ext_D}  \p{Block=Latin_Extended_D} (224)
  \p{Block: Latin_Ext_E}  \p{Block=Latin_Extended_E} (64)
  \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
                            U+0100..017F)
  \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
                            LatinExtAdditional}) (256: U+1E00..1EFF)
  \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
                            U+0180..024F)
  \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
                            U+2C60..2C7F)
  \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
                            U+A720..A7FF)
  \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
                            U+AB30..AB6F)
  \p{Block: Lepcha}       (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
                            U+1C00..1C4F)
  \p{Block: Letterlike_Symbols} (80: U+2100..214F)
  \p{Block: Limbu}        (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
                            U+1900..194F)
  \p{Block: Linear_A}     (NOT \p{Linear_A} NOR \p{Is_Linear_A})
                            (384: U+10600..1077F)
  \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
  \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
  \p{Block: Lisu}         (48: U+A4D0..A4FF)
  \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
  \p{Block: Lycian}       (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
                            U+10280..1029F)
  \p{Block: Lydian}       (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
                            U+10920..1093F)
  \p{Block: Mahajani}     (NOT \p{Mahajani} NOR \p{Is_Mahajani})
                            (48: U+11150..1117F)
  \p{Block: Mahjong}      \p{Block=Mahjong_Tiles} (48)
  \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
                            U+1F000..1F02F)
  \p{Block: Makasar}      (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
                            U+11EE0..11EFF)
  \p{Block: Malayalam}    (NOT \p{Malayalam} NOR \p{Is_Malayalam})
                            (128: U+0D00..0D7F)
  \p{Block: Mandaic}      (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
                            U+0840..085F)
  \p{Block: Manichaean}   (NOT \p{Manichaean} NOR \p{Is_Manichaean})
                            (64: U+10AC0..10AFF)
  \p{Block: Marchen}      (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
                            U+11C70..11CBF)
  \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
                            \p{Is_Masaram_Gondi}) (96:
                            U+11D00..11D5F)
  \p{Block: Math_Alphanum} \p{Block=
                            Mathematical_Alphanumeric_Symbols} (1024)
  \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
  \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
                            MathAlphanum}) (1024: U+1D400..1D7FF)
  \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
                            (256: U+2200..22FF)
  \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
  \p{Block: Medefaidrin}  (NOT \p{Medefaidrin} NOR
                            \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
  \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
                            \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
  \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
  \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
                            (32: U+AAE0..AAFF)
  \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
                            \p{Is_Mende_Kikakui}) (224:
                            U+1E800..1E8DF)
  \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
                            \p{Is_Meroitic_Cursive}) (96:
                            U+109A0..109FF)
  \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
  \p{Block: Miao}         (NOT \p{Miao} NOR \p{Is_Miao}) (160:
                            U+16F00..16F9F)
  \p{Block: Misc_Arrows}  \p{Block=Miscellaneous_Symbols_And_Arrows}
                            (256)
  \p{Block: Misc_Math_Symbols_A} \p{Block=
                            Miscellaneous_Mathematical_Symbols_A}
                            (48)
  \p{Block: Misc_Math_Symbols_B} \p{Block=
                            Miscellaneous_Mathematical_Symbols_B}
                            (128)
  \p{Block: Misc_Pictographs} \p{Block=
                            Miscellaneous_Symbols_And_Pictographs}
                            (768)
  \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
  \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
  \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
                            MiscMathSymbolsA}) (48: U+27C0..27EF)
  \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
                            MiscMathSymbolsB}) (128: U+2980..29FF)
  \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
                            (256: U+2600..26FF)
  \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
                            MiscArrows}) (256: U+2B00..2BFF)
  \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
                            MiscPictographs}) (768: U+1F300..1F5FF)
  \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
                            (256: U+2300..23FF)
  \p{Block: Modi}         (NOT \p{Modi} NOR \p{Is_Modi}) (96:
                            U+11600..1165F)
  \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
  \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
  \p{Block: Mongolian}    (NOT \p{Mongolian} NOR \p{Is_Mongolian})
                            (176: U+1800..18AF)
  \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
  \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
                            U+11660..1167F)
  \p{Block: Mro}          (NOT \p{Mro} NOR \p{Is_Mro}) (48:
                            U+16A40..16A6F)
  \p{Block: Multani}      (NOT \p{Multani} NOR \p{Is_Multani}) (48:
                            U+11280..112AF)
  \p{Block: Music}        \p{Block=Musical_Symbols} (256)
  \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
                            U+1D100..1D1FF)
  \p{Block: Myanmar}      (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
                            U+1000..109F)
  \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
  \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
  \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
                            U+AA60..AA7F)
  \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
                            U+A9E0..A9FF)
  \p{Block: Nabataean}    (NOT \p{Nabataean} NOR \p{Is_Nabataean})
                            (48: U+10880..108AF)
  \p{Block: Nandinagari}  (NOT \p{Nandinagari} NOR
                            \p{Is_Nandinagari}) (96: U+119A0..119FF)
  \p{Block: NB}           \p{Block=No_Block} (832_720 plus all
                            above-Unicode code points)
  \p{Block: New_Tai_Lue}  (NOT \p{New_Tai_Lue} NOR
                            \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
  \p{Block: Newa}         (NOT \p{Newa} NOR \p{Is_Newa}) (128:
                            U+11400..1147F)
  \p{Block: NKo}          (NOT \p{Nko} NOR \p{Is_NKo}) (64:
                            U+07C0..07FF)
  \p{Block: No_Block}     (Short: \p{Blk=NB}) (832_720 plus all
                            above-Unicode code points: U+0870..089F,
                            U+2FE0..2FEF, U+10200..1027F,
                            U+103E0..103FF, U+10570..105FF,
                            U+10780..107FF ...)
  \p{Block: Number_Forms} (64: U+2150..218F)
  \p{Block: Nushu}        (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
                            U+1B170..1B2FF)
  \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
                            NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
                            U+1E100..1E14F)
  \p{Block: OCR}          \p{Block=Optical_Character_Recognition}
                            (32)
  \p{Block: Ogham}        (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
                            U+1680..169F)
  \p{Block: Ol_Chiki}     (48: U+1C50..1C7F)
  \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
                            \p{Is_Old_Hungarian}) (128:
                            U+10C80..10CFF)
  \p{Block: Old_Italic}   (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
                            (48: U+10300..1032F)
  \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
  \p{Block: Old_Permic}   (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
                            (48: U+10350..1037F)
  \p{Block: Old_Persian}  (NOT \p{Old_Persian} NOR
                            \p{Is_Old_Persian}) (64: U+103A0..103DF)
  \p{Block: Old_Sogdian}  (NOT \p{Old_Sogdian} NOR
                            \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
  \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
  \p{Block: Old_Turkic}   (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
                            (80: U+10C00..10C4F)
  \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
                            U+2440..245F)
  \p{Block: Oriya}        (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
                            U+0B00..0B7F)
  \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
  \p{Block: Osage}        (NOT \p{Osage} NOR \p{Is_Osage}) (80:
                            U+104B0..104FF)
  \p{Block: Osmanya}      (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
                            U+10480..104AF)
  \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
  \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
                            \p{Is_Pahawh_Hmong}) (144:
                            U+16B00..16B8F)
  \p{Block: Palmyrene}    (32: U+10860..1087F)
  \p{Block: Pau_Cin_Hau}  (NOT \p{Pau_Cin_Hau} NOR
                            \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
  \p{Block: Phags_Pa}     (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
                            (64: U+A840..A87F)
  \p{Block: Phaistos}     \p{Block=Phaistos_Disc} (48)
  \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
                            U+101D0..101FF)
  \p{Block: Phoenician}   (NOT \p{Phoenician} NOR \p{Is_Phoenician})
                            (32: U+10900..1091F)
  \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
  \p{Block: Phonetic_Ext_Sup} \p{Block=
                            Phonetic_Extensions_Supplement} (64)
  \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
                            U+1D00..1D7F)
  \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
                            PhoneticExtSup}) (64: U+1D80..1DBF)
  \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
  \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT
                            \p{Private_Use} NOR \p{Is_Private_Use})
                            (6400)
  \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
                            \p{Private_Use} NOR \p{Is_Private_Use})
                            (6400: U+E000..F8FF)
  \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
                            \p{Is_Psalter_Pahlavi}) (48:
                            U+10B80..10BAF)
  \p{Block: PUA}          \p{Block=Private_Use_Area} (NOT
                            \p{Private_Use} NOR \p{Is_Private_Use})
                            (6400)
  \p{Block: Punctuation}  \p{Block=General_Punctuation} (NOT
                            \p{Punct} NOR \p{Is_Punctuation}) (112)
  \p{Block: Rejang}       (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
                            U+A930..A95F)
  \p{Block: Rumi}         \p{Block=Rumi_Numeral_Symbols} (32)
  \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
                            U+10E60..10E7F)
  \p{Block: Runic}        (NOT \p{Runic} NOR \p{Is_Runic}) (96:
                            U+16A0..16FF)
  \p{Block: Samaritan}    (NOT \p{Samaritan} NOR \p{Is_Samaritan})
                            (64: U+0800..083F)
  \p{Block: Saurashtra}   (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
                            (96: U+A880..A8DF)
  \p{Block: Sharada}      (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
                            U+11180..111DF)
  \p{Block: Shavian}      (48: U+10450..1047F)
  \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
  \p{Block: Siddham}      (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
                            U+11580..115FF)
  \p{Block: Sinhala}      (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
                            U+0D80..0DFF)
  \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
  \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
                            U+FE50..FE6F)
  \p{Block: Small_Forms}  \p{Block=Small_Form_Variants} (32)
  \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
  \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
                            U+1B130..1B16F)
  \p{Block: Sogdian}      (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
                            U+10F30..10F6F)
  \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
                            \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
  \p{Block: Soyombo}      (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
                            U+11A50..11AAF)
  \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
                            ModifierLetters}) (80: U+02B0..02FF)
  \p{Block: Specials}     (16: U+FFF0..FFFF)
  \p{Block: Sundanese}    (NOT \p{Sundanese} NOR \p{Is_Sundanese})
                            (64: U+1B80..1BBF)
  \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
  \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
                            U+1CC0..1CCF)
  \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
  \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
  \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
  \p{Block: Sup_Math_Operators} \p{Block=
                            Supplemental_Mathematical_Operators}
                            (256)
  \p{Block: Sup_PUA_A}    \p{Block=Supplementary_Private_Use_Area_A}
                            (65_536)
  \p{Block: Sup_PUA_B}    \p{Block=Supplementary_Private_Use_Area_B}
                            (65_536)
  \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
  \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
                            Supplemental_Symbols_And_Pictographs}
                            (256)
  \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
  \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
                            SuperAndSub}) (48: U+2070..209F)
  \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
                            U+27F0..27FF)
  \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
                            U+2900..297F)
  \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
                            U+1F800..1F8FF)
  \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
                            SupMathOperators}) (256: U+2A00..2AFF)
  \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
                            SupPunctuation}) (128: U+2E00..2E7F)
  \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
                            SupSymbolsAndPictographs}) (256:
                            U+1F900..1F9FF)
  \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
                            SupPUAA}) (65_536: U+F0000..FFFFF)
  \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
                            SupPUAB}) (65_536: U+100000..10FFFF)
  \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
  \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
                            \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
  \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
                            Symbols_And_Pictographs_Extended_A} (144)
  \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
                            SymbolsAndPictographsExtA}) (144:
                            U+1FA70..1FAFF)
  \p{Block: Syriac}       (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
                            U+0700..074F)
  \p{Block: Syriac_Sup}   \p{Block=Syriac_Supplement} (16)
  \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
                            U+0860..086F)
  \p{Block: Tagalog}      (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
                            U+1700..171F)
  \p{Block: Tagbanwa}     (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
                            (32: U+1760..177F)
  \p{Block: Tags}         (128: U+E0000..E007F)
  \p{Block: Tai_Le}       (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
                            U+1950..197F)
  \p{Block: Tai_Tham}     (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
                            (144: U+1A20..1AAF)
  \p{Block: Tai_Viet}     (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
                            (96: U+AA80..AADF)
  \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
  \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
                            U+1D300..1D35F)
  \p{Block: Takri}        (NOT \p{Takri} NOR \p{Is_Takri}) (80:
                            U+11680..116CF)
  \p{Block: Tamil}        (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
                            U+0B80..0BFF)
  \p{Block: Tamil_Sup}    \p{Block=Tamil_Supplement} (64)
  \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
                            U+11FC0..11FFF)
  \p{Block: Tangut}       (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
                            U+17000..187FF)
  \p{Block: Tangut_Components} (768: U+18800..18AFF)
  \p{Block: Telugu}       (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
                            U+0C00..0C7F)
  \p{Block: Thaana}       (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
                            U+0780..07BF)
  \p{Block: Thai}         (NOT \p{Thai} NOR \p{Is_Thai}) (128:
                            U+0E00..0E7F)
  \p{Block: Tibetan}      (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
                            U+0F00..0FFF)
  \p{Block: Tifinagh}     (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
                            (80: U+2D30..2D7F)
  \p{Block: Tirhuta}      (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
                            U+11480..114DF)
  \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
                            (128)
  \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
                            TransportAndMap}) (128: U+1F680..1F6FF)
  \p{Block: UCAS}         \p{Block=
                            Unified_Canadian_Aboriginal_Syllabics}
                            (640)
  \p{Block: UCAS_Ext}     \p{Block=
                            Unified_Canadian_Aboriginal_Syllabics_-
                            Extended} (80)
  \p{Block: Ugaritic}     (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
                            (32: U+10380..1039F)
  \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
                            UCAS}) (640: U+1400..167F)
  \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
                            \p{Blk=UCASExt}) (80: U+18B0..18FF)
  \p{Block: Vai}          (NOT \p{Vai} NOR \p{Is_Vai}) (320:
                            U+A500..A63F)
  \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
                            \p{Variation_Selector} NOR \p{Is_VS})
                            (16: U+FE00..FE0F)
  \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
                            (240: U+E0100..E01EF)
  \p{Block: Vedic_Ext}    \p{Block=Vedic_Extensions} (48)
  \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
                            U+1CD0..1CFF)
  \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
  \p{Block: VS}           \p{Block=Variation_Selectors} (NOT
                            \p{Variation_Selector} NOR \p{Is_VS})
                            (16)
  \p{Block: VS_Sup}       \p{Block=Variation_Selectors_Supplement}
                            (240)
  \p{Block: Wancho}       (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
                            U+1E2C0..1E2FF)
  \p{Block: Warang_Citi}  (NOT \p{Warang_Citi} NOR
                            \p{Is_Warang_Citi}) (96: U+118A0..118FF)
  \p{Block: Yi_Radicals}  (64: U+A490..A4CF)
  \p{Block: Yi_Syllables} (1168: U+A000..A48F)
  \p{Block: Yijing}       \p{Block=Yijing_Hexagram_Symbols} (64)
  \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
                            U+4DC0..4DFF)
  \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
                            \p{Is_Zanabazar_Square}) (80:
                            U+11A00..11A4F)
X \p{Block_Elements}      \p{Block=Block_Elements} (32)
  \p{Bopo}                \p{Bopomofo} (= \p{Script_Extensions=
                            Bopomofo}) (NOT \p{Block=Bopomofo}) (112)
  \p{Bopomofo}            \p{Script_Extensions=Bopomofo} (Short:
                            \p{Bopo}; NOT \p{Block=Bopomofo}) (112)
X \p{Bopomofo_Ext}        \p{Bopomofo_Extended} (= \p{Block=
                            Bopomofo_Extended}) (32)
X \p{Bopomofo_Extended}   \p{Block=Bopomofo_Extended} (Short:
                            \p{InBopomofoExt}) (32)
X \p{Box_Drawing}         \p{Block=Box_Drawing} (128)
  \p{Bpt: *}              \p{Bidi_Paired_Bracket_Type: *}
  \p{Brah}                \p{Brahmi} (= \p{Script_Extensions=
                            Brahmi}) (NOT \p{Block=Brahmi}) (109)
  \p{Brahmi}              \p{Script_Extensions=Brahmi} (Short:
                            \p{Brah}; NOT \p{Block=Brahmi}) (109)
  \p{Brai}                \p{Braille} (= \p{Script_Extensions=
                            Braille}) (256)
  \p{Braille}             \p{Script_Extensions=Braille} (Short:
                            \p{Brai}) (256)
X \p{Braille_Patterns}    \p{Block=Braille_Patterns} (Short:
                            \p{InBraille}) (256)
  \p{Bugi}                \p{Buginese} (= \p{Script_Extensions=
                            Buginese}) (NOT \p{Block=Buginese}) (31)
  \p{Buginese}            \p{Script_Extensions=Buginese} (Short:
                            \p{Bugi}; NOT \p{Block=Buginese}) (31)
  \p{Buhd}                \p{Buhid} (= \p{Script_Extensions=Buhid})
                            (NOT \p{Block=Buhid}) (22)
  \p{Buhid}               \p{Script_Extensions=Buhid} (Short:
                            \p{Buhd}; NOT \p{Block=Buhid}) (22)
X \p{Byzantine_Music}     \p{Byzantine_Musical_Symbols} (= \p{Block=
                            Byzantine_Musical_Symbols}) (256)
X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
                            (Short: \p{InByzantineMusic}) (256)
  \p{C} \pC               \p{Other} (= \p{General_Category=Other})
                            (976_344 plus all above-Unicode code
                            points)
  \p{Cakm}                \p{Chakma} (= \p{Script_Extensions=
                            Chakma}) (NOT \p{Block=Chakma}) (90)
  \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
                            (Short: \p{Cans}) (710)
X \p{Canadian_Syllabics}  \p{Unified_Canadian_Aboriginal_Syllabics}
                            (= \p{Block=
                            Unified_Canadian_Aboriginal_Syllabics})
                            (640)
T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
                            Not_Reordered} (1_113_250 plus all
                            above-Unicode code points)
T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
                            Overlay} (32)
T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
                            Nukta} (25)
T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
                            Kana_Voicing} (2)
T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
                            Virama} (58)
T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
                            CCC10} (1)
  \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
                            U+05B0)
T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
                            CCC11} (1)
  \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
                            U+05B1)
T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
                            CCC12} (1)
  \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
                            U+05B2)
T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
                            CCC13} (1)
  \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
                            U+05B3)
T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
                            CCC14} (1)
  \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
                            U+05B4)
T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
                            CCC15} (1)
  \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
                            U+05B5)
T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
                            CCC16} (1)
  \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
                            U+05B6)
T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
                            CCC17} (1)
  \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
                            U+05B7)
T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
                            CCC18} (2)
  \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
                            U+05B8, U+05C7)
T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
                            CCC19} (2)
  \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
                            U+05B9..05BA)
T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
                            CCC20} (1)
  \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
                            U+05BB)
T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
                            CCC21} (1)
  \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
                            U+05BC)
T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
                            CCC22} (1)
  \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
                            U+05BD)
T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
                            CCC23} (1)
  \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
                            U+05BF)
T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
                            CCC24} (1)
  \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
                            U+05C1)
T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
                            CCC25} (1)
  \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
                            U+05C2)
T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
                            CCC26} (1)
  \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
                            U+FB1E)
T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
                            CCC27} (2)
  \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
                            U+064B, U+08F0)
T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
                            CCC28} (2)
  \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
                            U+064C, U+08F1)
T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
                            CCC29} (2)
  \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
                            U+064D, U+08F2)
T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
                            CCC30} (2)
  \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
                            U+0618, U+064E)
T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
                            CCC31} (2)
  \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
                            U+0619, U+064F)
T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
                            CCC32} (2)
  \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
                            U+061A, U+0650)
T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
                            CCC33} (1)
  \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
                            U+0651)
T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
                            CCC34} (1)
  \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
                            U+0652)
T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
                            CCC35} (1)
  \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
                            U+0670)
T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
                            CCC36} (1)
  \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
                            U+0711)
T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
                            CCC84} (1)
  \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
                            U+0C55)
T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
                            CCC91} (1)
  \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
                            U+0C56)
T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
                            CCC103} (2)
  \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
                            U+0E38..0E39)
T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
                            CCC107} (4)
  \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
                            U+0E48..0E4B)
T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
                            CCC118} (2)
  \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
                            U+0EB8..0EB9)
T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
                            CCC122} (4)
  \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
                            U+0EC8..0ECB)
T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
                            CCC129} (1)
  \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
                            U+0F71)
T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
                            CCC130} (6)
  \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
                            U+0F72, U+0F7A..0F7D, U+0F80)
T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
                            CCC132} (1)
  \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
                            U+0F74)
T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1683 T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1684                             CCC133} (0)
1685   \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
                            Attached_Below_Left} (0)
T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
                            Attached_Below} (5)
T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
                            Attached_Above} (1)
T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
                            Attached_Above_Right} (9)
T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
                            Below_Left} (1)
T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
                            Below} (163)
T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
                            Below_Right} (4)
T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
                            Left} (2)
T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
                            Right} (1)
T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
                            Above_Left} (5)
T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
                            Above} (482)
T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
                            Above_Right} (5)
T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
                            Double_Below} (4)
T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
                            Double_Above} (5)
T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
                            Iota_Subscript} (1)
  \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
                            Above} (482)
  \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (482:
                            U+0300..0314, U+033D..0344, U+0346,
                            U+034A..034C, U+0350..0352, U+0357 ...)
  \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
                            U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
  \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (5:
                            U+0315, U+031A, U+0358, U+1DF6, U+302C)
  \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
                            Above_Left} (5)
  \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
                            Above_Right} (5)
  \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
                            Attached_Above} (1)
  \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
                            Attached_Above_Right} (9)
  \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
                            Attached_Below} (5)
  \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
                            Attached_Below_Left} (0)
  \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
                            (1: U+1DCE)
  \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
                            \p{Ccc=ATAR}) (9: U+031B, U+0F39,
                            U+1D165..1D166, U+1D16E..1D172)
  \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
                            (5: U+0321..0322, U+0327..0328, U+1DD0)
  \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
                            ATBL}) (0)
  \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
                            Below} (163)
  \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (163:
                            U+0316..0319, U+031C..0320,
                            U+0323..0326, U+0329..0333,
                            U+0339..033C, U+0347..0349 ...)
  \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1:
                            U+302A)
  \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
                            U+059A, U+05AD, U+1939, U+302D)
  \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
                            Below_Left} (1)
  \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
                            Below_Right} (4)
  \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
                            Double_Above} (5)
  \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
                            Double_Below} (4)
  \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
                            (5: U+035D..035E, U+0360..0361, U+1DCD)
  \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
                            (4: U+035C, U+035F, U+0362, U+1DFC)
  \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
                            (1: U+0345)
  \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
                            Iota_Subscript} (1)
  \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
                            (2: U+3099..309A)
  \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
                            Kana_Voicing} (2)
  \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
                            Left} (2)
  \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
                            U+302E..302F)
  \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
                            Nukta} (25)
  \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
                            (1_113_250 plus all above-Unicode code
                            points: U+0000..02FF, U+034F,
                            U+0370..0482, U+0488..0590, U+05BE,
                            U+05C0 ...)
  \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
                            Not_Reordered} (1_113_250 plus all
                            above-Unicode code points)
  \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (25:
                            U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
                            U+0CBC ...)
  \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
                            Overlay} (32)
  \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
                            U+0334..0338, U+1CD4, U+1CE2..1CE8,
                            U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
                            ...)
  \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
                            Right} (1)
  \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
                            U+1D16D)
  \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (58:
                            U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
                            U+0BCD ...)
  \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
                            Virama} (58)
  \p{Cans}                \p{Canadian_Aboriginal} (=
                            \p{Script_Extensions=
                            Canadian_Aboriginal}) (710)
  \p{Cari}                \p{Carian} (= \p{Script_Extensions=
                            Carian}) (NOT \p{Block=Carian}) (49)
  \p{Carian}              \p{Script_Extensions=Carian} (Short:
                            \p{Cari}; NOT \p{Block=Carian}) (49)
  \p{Case_Ignorable}      \p{Case_Ignorable=Y} (Short: \p{CI}) (2396)
  \p{Case_Ignorable: N*}  (Short: \p{CI=N}, \P{CI}) (1_111_716 plus
                            all above-Unicode code points: [\x00-
                            \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-
                            Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
                            \xac\xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
                            U+0100..02AF, U+0370..0373,
                            U+0376..0379, U+037B..0383, U+0386 ...)
  \p{Case_Ignorable: Y*}  (Short: \p{CI=Y}, \p{CI}) (2396:
                            [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
                            U+02B0..036F, U+0374..0375, U+037A,
                            U+0384..0385, U+0387 ...)
  \p{Cased}               \p{Cased=Y} (4279)
  \p{Cased: N*}           (Single: \P{Cased}) (1_109_833 plus all
                            above-Unicode code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
                            \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7], U+01BB,
                            U+01C0..01C3, U+0294, U+02B9..02BF,
                            U+02C2..02DF ...)
  \p{Cased: Y*}           (Single: \p{Cased}) (4279: [A-Za-
                            z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                            \xff], U+0100..01BA, U+01BC..01BF,
                            U+01C4..0293, U+0295..02B8, U+02C0..02C1
                            ...)
  \p{Cased_Letter}        \p{General_Category=Cased_Letter} (Short:
                            \p{LC}) (3970)
  \p{Category: *}         \p{General_Category: *}
  \p{Caucasian_Albanian}  \p{Script_Extensions=Caucasian_Albanian}
                            (Short: \p{Aghb}; NOT \p{Block=
                            Caucasian_Albanian}) (53)
  \p{Cc}                  \p{XPosixCntrl} (= \p{General_Category=
                            Control}) (65)
  \p{Ccc: *}              \p{Canonical_Combining_Class: *}
  \p{CE}                  \p{Composition_Exclusion} (=
                            \p{Composition_Exclusion=Y}) (81)
  \p{CE: *}               \p{Composition_Exclusion: *}
  \p{Cf}                  \p{Format} (= \p{General_Category=Format})
                            (161)
  \p{Chakma}              \p{Script_Extensions=Chakma} (Short:
                            \p{Cakm}; NOT \p{Block=Chakma}) (90)
  \p{Cham}                \p{Script_Extensions=Cham} (NOT \p{Block=
                            Cham}) (83)
  \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
                            \p{CWCF}) (1463)
  \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
                            (1_112_649 plus all above-Unicode code
                            points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\xb4\xb6-
                            \xbf\xd7\xe0-\xff], U+0101, U+0103,
                            U+0105, U+0107, U+0109 ...)
  \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
                            (1463: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
                            U+0100, U+0102, U+0104, U+0106, U+0108
                            ...)
  \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
                            \p{CWCM}) (2841)
  \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
                            (1_111_271 plus all above-Unicode code
                            points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
                            \xbf\xd7\xf7], U+0138, U+018D, U+019B,
                            U+01AA..01AB, U+01BA..01BB ...)
  \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
                            (2841: [A-Za-z\xb5\xc0-\xd6\xd8-
                            \xf6\xf8-\xff], U+0100..0137,
                            U+0139..018C, U+018E..019A,
                            U+019C..01A9, U+01AC..01B9 ...)
  \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
                            \p{CWL}) (1390)
  \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
                            (1_112_722 plus all above-Unicode code
                            points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
                            \xbf\xd7\xdf-\xff], U+0101, U+0103,
                            U+0105, U+0107, U+0109 ...)
  \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1390:
                            [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
                            U+0104, U+0106, U+0108 ...)
  \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
                            Y} (Short: \p{CWKCF}) (10_315)
  \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
                            \P{CWKCF}) (1_103_797 plus all above-
                            Unicode code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
                            \xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6-
                            \xb7\xbb\xbf\xd7\xe0-\xff], U+0101,
                            U+0103, U+0105, U+0107, U+0109 ...)
  \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
                            \p{CWKCF}) (10_315: [A-
                            Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
                            \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf],
                            U+0100, U+0102, U+0104, U+0106, U+0108
                            ...)
  \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
                            \p{CWT}) (1409)
  \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
                            (1_112_703 plus all above-Unicode code
                            points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                            Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
                            \xde\xf7], U+0100, U+0102, U+0104,
                            U+0106, U+0108 ...)
  \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1409:
                            [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
                            U+0103, U+0105, U+0107, U+0109 ...)
  \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
                            \p{CWU}) (1482)
  \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
                            (1_112_630 plus all above-Unicode code
                            points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                            Z\[\\\]\^_`\{\|\}~\x7f-\xb4\xb6-
                            \xde\xf7], U+0100, U+0102, U+0104,
                            U+0106, U+0108 ...)
  \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1482:
                            [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
                            U+0103, U+0105, U+0107, U+0109 ...)
  \p{Cher}                \p{Cherokee} (= \p{Script_Extensions=
                            Cherokee}) (NOT \p{Block=Cherokee}) (172)
  \p{Cherokee}            \p{Script_Extensions=Cherokee} (Short:
                            \p{Cher}; NOT \p{Block=Cherokee}) (172)
X \p{Cherokee_Sup}        \p{Cherokee_Supplement} (= \p{Block=
                            Cherokee_Supplement}) (80)
X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
                            \p{InCherokeeSup}) (80)
X \p{Chess_Symbols}       \p{Block=Chess_Symbols} (112)
  \p{CI}                  \p{Case_Ignorable} (= \p{Case_Ignorable=
                            Y}) (2396)
  \p{CI: *}               \p{Case_Ignorable: *}
X \p{CJK}                 \p{CJK_Unified_Ideographs} (= \p{Block=
                            CJK_Unified_Ideographs}) (20_992)
X \p{CJK_Compat}          \p{CJK_Compatibility} (= \p{Block=
                            CJK_Compatibility}) (256)
X \p{CJK_Compat_Forms}    \p{CJK_Compatibility_Forms} (= \p{Block=
                            CJK_Compatibility_Forms}) (32)
X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
                            \p{Block=CJK_Compatibility_Ideographs})
                            (512)
X \p{CJK_Compat_Ideographs_Sup}
                            \p{CJK_Compatibility_Ideographs_-
                            Supplement} (= \p{Block=
                            CJK_Compatibility_Ideographs_-
                            Supplement}) (544)
X \p{CJK_Compatibility}   \p{Block=CJK_Compatibility} (Short:
                            \p{InCJKCompat}) (256)
X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
                            (Short: \p{InCJKCompatForms}) (32)
X \p{CJK_Compatibility_Ideographs} \p{Block=
                            CJK_Compatibility_Ideographs} (Short:
                            \p{InCJKCompatIdeographs}) (512)
X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
                            CJK_Compatibility_Ideographs_Supplement}
                            (Short: \p{InCJKCompatIdeographsSup})
                            (544)
X \p{CJK_Ext_A}           \p{CJK_Unified_Ideographs_Extension_A} (=
                            \p{Block=
                            CJK_Unified_Ideographs_Extension_A})
                            (6592)
X \p{CJK_Ext_B}           \p{CJK_Unified_Ideographs_Extension_B} (=
                            \p{Block=
                            CJK_Unified_Ideographs_Extension_B})
                            (42_720)
X \p{CJK_Ext_C}           \p{CJK_Unified_Ideographs_Extension_C} (=
                            \p{Block=
                            CJK_Unified_Ideographs_Extension_C})
                            (4160)
X \p{CJK_Ext_D}           \p{CJK_Unified_Ideographs_Extension_D} (=
                            \p{Block=
                            CJK_Unified_Ideographs_Extension_D})
                            (224)
X \p{CJK_Ext_E}           \p{CJK_Unified_Ideographs_Extension_E} (=
                            \p{Block=
                            CJK_Unified_Ideographs_Extension_E})
                            (5776)
X \p{CJK_Ext_F}           \p{CJK_Unified_Ideographs_Extension_F} (=
                            \p{Block=
                            CJK_Unified_Ideographs_Extension_F})
                            (7488)
X \p{CJK_Radicals_Sup}    \p{CJK_Radicals_Supplement} (= \p{Block=
                            CJK_Radicals_Supplement}) (128)
X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
                            (Short: \p{InCJKRadicalsSup}) (128)
X \p{CJK_Strokes}         \p{Block=CJK_Strokes} (48)
X \p{CJK_Symbols}         \p{CJK_Symbols_And_Punctuation} (=
                            \p{Block=CJK_Symbols_And_Punctuation})
                            (64)
X \p{CJK_Symbols_And_Punctuation} \p{Block=
                            CJK_Symbols_And_Punctuation} (Short:
                            \p{InCJKSymbols}) (64)
X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
                            (Short: \p{InCJK}) (20_992)
X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
                            CJK_Unified_Ideographs_Extension_A}
                            (Short: \p{InCJKExtA}) (6592)
X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
                            CJK_Unified_Ideographs_Extension_B}
                            (Short: \p{InCJKExtB}) (42_720)
X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
                            CJK_Unified_Ideographs_Extension_C}
                            (Short: \p{InCJKExtC}) (4160)
X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
                            CJK_Unified_Ideographs_Extension_D}
                            (Short: \p{InCJKExtD}) (224)
X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
                            CJK_Unified_Ideographs_Extension_E}
                            (Short: \p{InCJKExtE}) (5776)
X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
                            CJK_Unified_Ideographs_Extension_F}
                            (Short: \p{InCJKExtF}) (7488)
  \p{Close_Punctuation}   \p{General_Category=Close_Punctuation}
                            (Short: \p{Pe}) (73)
  \p{Cn}                  \p{Unassigned} (= \p{General_Category=
                            Unassigned}) (836_602 plus all above-
                            Unicode code points)
  \p{Cntrl}               \p{XPosixCntrl} (= \p{General_Category=
                            Control}) (65)
  \p{Co}                  \p{Private_Use} (= \p{General_Category=
                            Private_Use}) (NOT \p{Private_Use_Area})
                            (137_468)
X \p{Combining_Diacritical_Marks} \p{Block=
                            Combining_Diacritical_Marks} (Short:
                            \p{InDiacriticals}) (112)
X \p{Combining_Diacritical_Marks_Extended} \p{Block=
                            Combining_Diacritical_Marks_Extended}
                            (Short: \p{InDiacriticalsExt}) (80)
X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
                            Combining_Diacritical_Marks_For_Symbols}
                            (Short: \p{InDiacriticalsForSymbols})
                            (48)
X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
                            Combining_Diacritical_Marks_Supplement}
                            (Short: \p{InDiacriticalsSup}) (64)
X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
                            \p{InHalfMarks}) (16)
  \p{Combining_Mark}      \p{Mark} (= \p{General_Category=Mark})
                            (2268)
X \p{Combining_Marks_For_Symbols}
                            \p{Combining_Diacritical_Marks_For_-
                            Symbols} (= \p{Block=
                            Combining_Diacritical_Marks_For_-
                            Symbols}) (48)
  \p{Common}              \p{Script_Extensions=Common} (Short:
                            \p{Zyyy}) (7386)
X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
                            (Short: \p{InIndicNumberForms}) (16)
  \p{Comp_Ex}             \p{Full_Composition_Exclusion} (=
                            \p{Full_Composition_Exclusion=Y}) (1120)
  \p{Comp_Ex: *}          \p{Full_Composition_Exclusion: *}
X \p{Compat_Jamo}         \p{Hangul_Compatibility_Jamo} (= \p{Block=
                            Hangul_Compatibility_Jamo}) (96)
  \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
                            \p{CE}) (81)
  \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
                            plus all above-Unicode code points:
                            U+0000..0957, U+0960..09DB, U+09DE,
                            U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
                            ...)
  \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
                            U+0958..095F, U+09DC..09DD, U+09DF,
                            U+0A33, U+0A36, U+0A59..0A5B ...)
  \p{Connector_Punctuation} \p{General_Category=
                            Connector_Punctuation} (Short: \p{Pc})
                            (10)
  \p{Control}             \p{XPosixCntrl} (= \p{General_Category=
                            Control}) (65)
X \p{Control_Pictures}    \p{Block=Control_Pictures} (64)
  \p{Copt}                \p{Coptic} (= \p{Script_Extensions=
                            Coptic}) (NOT \p{Block=Coptic}) (165)
  \p{Coptic}              \p{Script_Extensions=Coptic} (Short:
                            \p{Copt}; NOT \p{Block=Coptic}) (165)
X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
X \p{Counting_Rod}        \p{Counting_Rod_Numerals} (= \p{Block=
                            Counting_Rod_Numerals}) (32)
X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
                            \p{InCountingRod}) (32)
  \p{Cprt}                \p{Cypriot} (= \p{Script_Extensions=
                            Cypriot}) (112)
  \p{Cs}                  \p{Surrogate} (= \p{General_Category=
                            Surrogate}) (2048)
  \p{Cuneiform}           \p{Script_Extensions=Cuneiform} (Short:
                            \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
X \p{Cuneiform_Numbers}   \p{Cuneiform_Numbers_And_Punctuation} (=
                            \p{Block=
                            Cuneiform_Numbers_And_Punctuation}) (128)
X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
                            Cuneiform_Numbers_And_Punctuation}
                            (Short: \p{InCuneiformNumbers}) (128)
  \p{Currency_Symbol}     \p{General_Category=Currency_Symbol}
                            (Short: \p{Sc}) (62)
X \p{Currency_Symbols}    \p{Block=Currency_Symbols} (48)
  \p{CWCF}                \p{Changes_When_Casefolded} (=
                            \p{Changes_When_Casefolded=Y}) (1463)
  \p{CWCF: *}             \p{Changes_When_Casefolded: *}
  \p{CWCM}                \p{Changes_When_Casemapped} (=
                            \p{Changes_When_Casemapped=Y}) (2841)
  \p{CWCM: *}             \p{Changes_When_Casemapped: *}
  \p{CWKCF}               \p{Changes_When_NFKC_Casefolded} (=
                            \p{Changes_When_NFKC_Casefolded=Y})
                            (10_315)
  \p{CWKCF: *}            \p{Changes_When_NFKC_Casefolded: *}
  \p{CWL}                 \p{Changes_When_Lowercased} (=
                            \p{Changes_When_Lowercased=Y}) (1390)
  \p{CWL: *}              \p{Changes_When_Lowercased: *}
  \p{CWT}                 \p{Changes_When_Titlecased} (=
                            \p{Changes_When_Titlecased=Y}) (1409)
  \p{CWT: *}              \p{Changes_When_Titlecased: *}
  \p{CWU}                 \p{Changes_When_Uppercased} (=
                            \p{Changes_When_Uppercased=Y}) (1482)
  \p{CWU: *}              \p{Changes_When_Uppercased: *}
  \p{Cypriot}             \p{Script_Extensions=Cypriot} (Short:
                            \p{Cprt}) (112)
X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
  \p{Cyrillic}            \p{Script_Extensions=Cyrillic} (Short:
                            \p{Cyrl}; NOT \p{Block=Cyrillic}) (446)
X \p{Cyrillic_Ext_A}      \p{Cyrillic_Extended_A} (= \p{Block=
                            Cyrillic_Extended_A}) (32)
X \p{Cyrillic_Ext_B}      \p{Cyrillic_Extended_B} (= \p{Block=
                            Cyrillic_Extended_B}) (96)
X \p{Cyrillic_Ext_C}      \p{Cyrillic_Extended_C} (= \p{Block=
                            Cyrillic_Extended_C}) (16)
X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
                            \p{InCyrillicExtA}) (32)
X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
                            \p{InCyrillicExtB}) (96)
X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
                            \p{InCyrillicExtC}) (16)
X \p{Cyrillic_Sup}        \p{Cyrillic_Supplement} (= \p{Block=
                            Cyrillic_Supplement}) (48)
X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
                            \p{InCyrillicSup}) (48)
X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
                            Cyrillic_Supplement}) (48)
  \p{Cyrl}                \p{Cyrillic} (= \p{Script_Extensions=
                            Cyrillic}) (NOT \p{Block=Cyrillic}) (446)
2154   \p{Dash}                \p{Dash=Y} (28)
2155   \p{Dash: N*}            (Single: \P{Dash}) (1_114_084 plus all
2156                             above-Unicode code points: [\x00-
2157                             \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
2158                             Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2159                             U+0100..0589, U+058B..05BD,
2160                             U+05BF..13FF, U+1401..1805, U+1807..200F
2161                             ...)
2162   \p{Dash: Y*}            (Single: \p{Dash}) (28: [\-], U+058A,
2163                             U+05BE, U+1400, U+1806, U+2010..2015 ...)
2164   \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
2165                             (Short: \p{Pd}) (24)
2166   \p{Decimal_Number}      \p{XPosixDigit} (= \p{General_Category=
2167                             Decimal_Number}) (630)
2168   \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2169                             (13_232)
2170   \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232:
2171                             [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-
2172                             \xdd\xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-
2173                             \xfd\xff], U+0100..010F, U+0112..0125,
2174                             U+0128..0130, U+0134..0137, U+0139..013E
2175                             ...)
2176   \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2177                             U+2460..2473, U+24B6..24EA,
2178                             U+3244..3247, U+3251..327E,
2179                             U+3280..32BF, U+32D0..32FE ...)
2180   \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2181   \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720:
2182                             [\xa8\xaf\xb4-\xb5\xb8], U+0132..0133,
2183                             U+013F..0140, U+0149, U+017F,
2184                             U+01C4..01CC ...)
2185   \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2186   \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2187   \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2188                             U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2189                             ...)
  \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184: U+2102,
                            U+210A..2113, U+2115, U+2119..211D,
                            U+2124, U+2128 ...)
  \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
  \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
                            \xbe], U+2150..215F, U+2189)
  \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
  \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
                            U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
                            ...)
  \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
  \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
                            U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
                            ...)
  \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
  \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
                            U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
                            ...)
  \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
  \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
                            U+FF61..FFBE, U+FFC2..FFC7,
                            U+FFCA..FFCF, U+FFD2..FFD7,
                            U+FFDA..FFDC, U+FFE8..FFEE)
  \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
  \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
                            U+0F0C, U+2007, U+2011, U+202F)
  \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
                            Non_Canonical} (Perl extension) (3664)
  \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
                            decompositions (Short: \p{Dt=NonCanon})
                            (Perl extension) (3664:
                            [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-\xba\xbc-
                            \xbe], U+0132..0133, U+013F..0140,
                            U+0149, U+017F, U+01C4..01CC ...)
  \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_216 plus
                            all above-Unicode code points: [\x00-
                            \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
                            \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
                            \xdf\xe6\xf0\xf7-\xf8\xfe],
                            U+0110..0111, U+0126..0127, U+0131,
                            U+0138, U+0141..0142 ...)
  \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
                            U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
  \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
  \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
  \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
                            U+32CC..32CF, U+32FF..3357,
                            U+3371..33DF, U+33FF, U+1F130..1F14F ...)
  \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38: U+1D62..1D6A,
                            U+2080..208E, U+2090..209C, U+2C7C)
  \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (153)
  \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (153: [\xaa\xb2-
                            \xb3\xb9-\xba], U+02B0..02B8,
                            U+02E0..02E4, U+10FC, U+1D2C..1D2E,
                            U+1D30..1D3A ...)
  \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
  \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
                            U+30FF, U+FE10..FE19, U+FE30..FE44,
                            U+FE47..FE48)
  \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
                            U+FF01..FF60, U+FFE0..FFE6)
  \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
                            Y} (Short: \p{DI}) (4173)
  \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
                            (1_109_939 plus all above-Unicode code
                            points: [\x00-\xac\xae-\xff],
                            U+0100..034E, U+0350..061B,
                            U+061D..115E, U+1161..17B3, U+17B6..180A
                            ...)
  \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
                            (4173: [\xad], U+034F, U+061C,
                            U+115F..1160, U+17B4..17B5, U+180B..180E
                            ...)
  \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (15)
  \p{Dep: *}              \p{Deprecated: *}
  \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (15)
  \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_097
                            plus all above-Unicode code points:
                            U+0000..0148, U+014A..0672,
                            U+0674..0F76, U+0F78, U+0F7A..17A2,
                            U+17A5..2069 ...)
  \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
                            U+0673, U+0F77, U+0F79, U+17A3..17A4,
                            U+206A..206F ...)
  \p{Deseret}             \p{Script_Extensions=Deseret} (Short:
                            \p{Dsrt}) (80)
  \p{Deva}                \p{Devanagari} (= \p{Script_Extensions=
                            Devanagari}) (NOT \p{Block=Devanagari})
                            (210)
  \p{Devanagari}          \p{Script_Extensions=Devanagari} (Short:
                            \p{Deva}; NOT \p{Block=Devanagari}) (210)
X \p{Devanagari_Ext}      \p{Devanagari_Extended} (= \p{Block=
                            Devanagari_Extended}) (32)
X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
                            \p{InDevanagariExt}) (32)
  \p{DI}                  \p{Default_Ignorable_Code_Point} (=
                            \p{Default_Ignorable_Code_Point=Y})
                            (4173)
  \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
  \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (873)
  \p{Dia: *}              \p{Diacritic: *}
  \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (873)
  \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_113_239
                            plus all above-Unicode code points:
                            [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
                            \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
                            U+0100..02AF, U+034F, U+0358..035C,
                            U+0363..0373, U+0376..0379 ...)
  \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (873:
                            [\^`\xa8\xaf\xb4\xb7-\xb8],
                            U+02B0..034E, U+0350..0357,
                            U+035D..0362, U+0374..0375, U+037A ...)
X \p{Diacriticals}        \p{Combining_Diacritical_Marks} (=
                            \p{Block=Combining_Diacritical_Marks})
                            (112)
X \p{Diacriticals_Ext}    \p{Combining_Diacritical_Marks_Extended}
                            (= \p{Block=
                            Combining_Diacritical_Marks_Extended})
                            (80)
X \p{Diacriticals_For_Symbols}
                            \p{Combining_Diacritical_Marks_For_-
                            Symbols} (= \p{Block=
                            Combining_Diacritical_Marks_For_-
                            Symbols}) (48)
X \p{Diacriticals_Sup}    \p{Combining_Diacritical_Marks_Supplement}
                            (= \p{Block=
                            Combining_Diacritical_Marks_Supplement})
                            (64)
  \p{Digit}               \p{XPosixDigit} (= \p{General_Category=
                            Decimal_Number}) (630)
X \p{Dingbats}            \p{Block=Dingbats} (192)
  \p{Dogr}                \p{Dogra} (= \p{Script_Extensions=Dogra})
                            (NOT \p{Block=Dogra}) (82)
  \p{Dogra}               \p{Script_Extensions=Dogra} (Short:
                            \p{Dogr}; NOT \p{Block=Dogra}) (82)
X \p{Domino}              \p{Domino_Tiles} (= \p{Block=
                            Domino_Tiles}) (112)
X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (Short:
                            \p{InDomino}) (112)
  \p{Dsrt}                \p{Deseret} (= \p{Script_Extensions=
                            Deseret}) (80)
  \p{Dt: *}               \p{Decomposition_Type: *}
  \p{Dupl}                \p{Duployan} (= \p{Script_Extensions=
                            Duployan}) (NOT \p{Block=Duployan}) (147)
  \p{Duployan}            \p{Script_Extensions=Duployan} (Short:
                            \p{Dupl}; NOT \p{Block=Duployan}) (147)
  \p{Ea: *}               \p{East_Asian_Width: *}
X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
                            (208)
  \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
  \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739:
                            [\xa1\xa4\xa7-\xa8\xaa\xad-\xae\xb0-
                            \xb4\xb6-\xba\xbc-\xbf\xc6\xd0\xd7-
                            \xd8\xde-\xe1\xe6\xe8-\xea\xec-
                            \xed\xf0\xf2-\xf3\xf7-\xfa\xfc\xfe],
                            U+0101, U+0111, U+0113, U+011B,
                            U+0126..0127 ...)
  \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
  \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
                            U+FF01..FF60, U+FFE0..FFE6)
  \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
  \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
                            U+FF61..FFBE, U+FFC2..FFC7,
                            U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
                            ...)
  \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (793_252 plus
                            all above-Unicode code points)
  \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
  \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-
                            \x7e\xa2-\xa3\xa5-\xa6\xac\xaf],
                            U+27E6..27ED, U+2985..2986)
  \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (793_252 plus all
                            above-Unicode code points: [\x00-
                            \x1f\x7f-\xa0\xa9\xab\xb5\xbb\xc0-
                            \xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd\xe2-
                            \xe5\xe7\xeb\xee-\xef\xf1\xf4-
                            \xf6\xfb\xfd\xff], U+00FF..0100,
                            U+0102..0110, U+0112, U+0114..011A,
                            U+011C..0125 ...)
  \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (181_783)
  \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (181_783:
                            U+1100..115F, U+231A..231B,
                            U+2329..232A, U+23E9..23EC, U+23F0,
                            U+23F3 ...)
  \p{Egyp}                \p{Egyptian_Hieroglyphs} (=
                            \p{Script_Extensions=
                            Egyptian_Hieroglyphs}) (NOT \p{Block=
                            Egyptian_Hieroglyphs}) (1080)
X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
                            Egyptian_Hieroglyph_Format_Controls} (16)
  \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
                            Egyptian_Hieroglyphs} (Short: \p{Egyp};
                            NOT \p{Block=Egyptian_Hieroglyphs})
                            (1080)
  \p{Elba}                \p{Elbasan} (= \p{Script_Extensions=
                            Elbasan}) (NOT \p{Block=Elbasan}) (40)
  \p{Elbasan}             \p{Script_Extensions=Elbasan} (Short:
                            \p{Elba}; NOT \p{Block=Elbasan}) (40)
  \p{Elym}                \p{Elymaic} (= \p{Script_Extensions=
                            Elymaic}) (NOT \p{Block=Elymaic}) (23)
  \p{Elymaic}             \p{Script_Extensions=Elymaic} (Short:
                            \p{Elym}; NOT \p{Block=Elymaic}) (23)
X \p{Emoticons}           \p{Block=Emoticons} (80)
X \p{Enclosed_Alphanum}   \p{Enclosed_Alphanumerics} (= \p{Block=
                            Enclosed_Alphanumerics}) (160)
X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
                            \p{Block=
                            Enclosed_Alphanumeric_Supplement}) (256)
X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
                            Enclosed_Alphanumeric_Supplement}
                            (Short: \p{InEnclosedAlphanumSup}) (256)
X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
                            (Short: \p{InEnclosedAlphanum}) (160)
X \p{Enclosed_CJK}        \p{Enclosed_CJK_Letters_And_Months} (=
                            \p{Block=
                            Enclosed_CJK_Letters_And_Months}) (256)
X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
                            Enclosed_CJK_Letters_And_Months} (Short:
                            \p{InEnclosedCJK}) (256)
X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
                            (= \p{Block=
                            Enclosed_Ideographic_Supplement}) (256)
X \p{Enclosed_Ideographic_Supplement} \p{Block=
                            Enclosed_Ideographic_Supplement} (Short:
                            \p{InEnclosedIdeographicSup}) (256)
  \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
                            (Short: \p{Me}) (13)
  \p{Ethi}                \p{Ethiopic} (= \p{Script_Extensions=
                            Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
  \p{Ethiopic}            \p{Script_Extensions=Ethiopic} (Short:
                            \p{Ethi}; NOT \p{Block=Ethiopic}) (495)
X \p{Ethiopic_Ext}        \p{Ethiopic_Extended} (= \p{Block=
                            Ethiopic_Extended}) (96)
X \p{Ethiopic_Ext_A}      \p{Ethiopic_Extended_A} (= \p{Block=
                            Ethiopic_Extended_A}) (48)
X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (Short:
                            \p{InEthiopicExt}) (96)
X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
                            \p{InEthiopicExtA}) (48)
X \p{Ethiopic_Sup}        \p{Ethiopic_Supplement} (= \p{Block=
                            Ethiopic_Supplement}) (32)
X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
                            \p{InEthiopicSup}) (32)
  \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (47)
  \p{Ext: *}              \p{Extender: *}
  \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (47)
  \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_065
                            plus all above-Unicode code points:
                            [\x00-\xb6\xb8-\xff], U+0100..02CF,
                            U+02D2..063F, U+0641..07F9,
                            U+07FB..0E45, U+0E47..0EC5 ...)
  \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (47: [\xb7],
                            U+02D0..02D1, U+0640, U+07FA, U+0E46,
                            U+0EC6 ...)
  \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
                            (Short: \p{Pf}) (10)
  \p{Format}              \p{General_Category=Format} (Short:
                            \p{Cf}) (161)
  \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
                            (Short: \p{CompEx}) (1120)
  \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
                            \P{CompEx}) (1_112_992 plus all above-
                            Unicode code points: U+0000..033F,
                            U+0342, U+0345..0373, U+0375..037D,
                            U+037F..0386, U+0388..0957 ...)
  \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
                            \p{CompEx}) (1120: U+0340..0341,
                            U+0343..0344, U+0374, U+037E, U+0387,
                            U+0958..095F ...)
  \p{Gc: *}               \p{General_Category: *}
  \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
  \p{General_Category: C} \p{General_Category=Other} (976_344 plus
                            all above-Unicode code points)
  \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
                            \p{Gc=LC}, \p{LC}) (3970: [A-Za-
                            z\xb5\xc0-\xd6\xd8-\xf6\xf8-\xff],
                            U+0100..01BA, U+01BC..01BF,
                            U+01C4..0293, U+0295..02AF, U+0370..0373
                            ...)
  \p{General_Category: Cc} \p{General_Category=Control} (65)
  \p{General_Category: Cf} \p{General_Category=Format} (161)
  \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
                            (73: [\)\]\}], U+0F3B, U+0F3D, U+169C,
                            U+2046, U+207E ...)
  \p{General_Category: Cn} \p{General_Category=Unassigned} (836_602
                            plus all above-Unicode code points)
  \p{General_Category: Cntrl} \p{General_Category=Control} (65)
  \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
  \p{General_Category: Combining_Mark} \p{General_Category=Mark}
                            (2268)
  \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
                            \p{Pc}) (10: [_], U+203F..2040, U+2054,
                            U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
  \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
                            [\x00-\x1f\x7f-\x9f])
  \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
  \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
                            (62: [\$\xa2-\xa5], U+058F, U+060B,
                            U+07FE..07FF, U+09F2..09F3, U+09FB ...)
  \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
                            (24: [\-], U+058A, U+05BE, U+1400,
                            U+1806, U+2010..2015 ...)
  \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
                            (630: [0-9], U+0660..0669, U+06F0..06F9,
                            U+07C0..07C9, U+0966..096F, U+09E6..09EF
                            ...)
  \p{General_Category: Digit} \p{General_Category=Decimal_Number}
                            (630)
  \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
                            (13: U+0488..0489, U+1ABE, U+20DD..20E0,
                            U+20E2..20E4, U+A670..A672)
  \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
                            (10: [\xbb], U+2019, U+201D, U+203A,
                            U+2E03, U+2E05 ...)
  \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (161:
                            [\xad], U+0600..0605, U+061C, U+06DD,
                            U+070F, U+08E2 ...)
  \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
                            \p{Pi}) (12: [\xab], U+2018,
                            U+201B..201C, U+201F, U+2039, U+2E02 ...)
  \p{General_Category: L} \p{General_Category=Letter} (125_643)
X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3970)
X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
                            the trailing '_' matters in spite of
                            loose matching rules. (3970)
  \p{General_Category: LC} \p{General_Category=Cased_Letter} (3970)
  \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (125_643:
                            [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
                            \xf6\xf8-\xff], U+0100..02C1,
                            U+02C6..02D1, U+02E0..02E4, U+02EC,
                            U+02EE ...)
  \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
                            (236: U+16EE..16F0, U+2160..2182,
                            U+2185..2188, U+3007, U+3021..3029,
                            U+3038..303A ...)
  \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
                            (1: U+2028)
  \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
                            (/i= General_Category=Cased_Letter)
                            (2151)
  \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (259)
  \p{General_Category: Lo} \p{General_Category=Other_Letter}
                            (121_414)
  \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
                            /i= General_Category=Cased_Letter)
                            (2151: [a-z\xb5\xdf-\xf6\xf8-\xff],
                            U+0101, U+0103, U+0105, U+0107, U+0109
                            ...)
  \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
                            (/i= General_Category=Cased_Letter) (31)
  \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
                            (/i= General_Category=Cased_Letter)
                            (1788)
  \p{General_Category: M} \p{General_Category=Mark} (2268)
  \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2268:
                            U+0300..036F, U+0483..0489,
                            U+0591..05BD, U+05BF, U+05C1..05C2,
                            U+05C4..05C5 ...)
  \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
                            [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
                            U+0606..0608, U+2044, U+2052,
                            U+207A..207C ...)
  \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (429)
  \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
  \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
                            (1826)
  \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
                            (259: U+02B0..02C1, U+02C6..02D1,
                            U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
  \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
                            (121: [\^`\xa8\xaf\xb4\xb8],
                            U+02C2..02C5, U+02D2..02DF,
                            U+02E5..02EB, U+02ED, U+02EF..02FF ...)
  \p{General_Category: N} \p{General_Category=Number} (1754)
  \p{General_Category: Nd} \p{General_Category=Decimal_Number} (630)
  \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
  \p{General_Category: No} \p{General_Category=Other_Number} (888)
  \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
                            (1826: U+0300..036F, U+0483..0487,
                            U+0591..05BD, U+05BF, U+05C1..05C2,
                            U+05C4..05C5 ...)
  \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1754:
                            [0-9\xb2-\xb3\xb9\xbc-\xbe],
                            U+0660..0669, U+06F0..06F9,
                            U+07C0..07C9, U+0966..096F, U+09E6..09EF
                            ...)
  \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
                            (75: [\(\[\{], U+0F3A, U+0F3C, U+169B,
                            U+201A, U+201E ...)
  \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (976_344 plus
                            all above-Unicode code points: [\x00-
                            \x1f\x7f-\x9f\xad], U+0378..0379,
                            U+0380..0383, U+038B, U+038D, U+03A2 ...)
  \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
                            (121_414: [\xaa\xba], U+01BB,
                            U+01C0..01C3, U+0294, U+05D0..05EA,
                            U+05EF..05F2 ...)
  \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
                            (888: [\xb2-\xb3\xb9\xbc-\xbe],
                            U+09F4..09F9, U+0B72..0B77,
                            U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
                            ...)
  \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
                            (588:
                            [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7\xb6-
                            \xb7\xbf], U+037E, U+0387, U+055A..055F,
                            U+0589, U+05C0 ...)
  \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
                            (6161: [\xa6\xa9\xae\xb0], U+0482,
                            U+058D..058E, U+060E..060F, U+06DE,
                            U+06E9 ...)
  \p{General_Category: P} \p{General_Category=Punctuation} (792)
  \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
                            \p{Zp}) (1: U+2029)
  \p{General_Category: Pc} \p{General_Category=
                            Connector_Punctuation} (10)
  \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
  \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
                            (73)
  \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
                            (10)
  \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
                            (12)
  \p{General_Category: Po} \p{General_Category=Other_Punctuation}
                            (588)
  \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
                            (137_468: U+E000..F8FF, U+F0000..FFFFD,
                            U+100000..10FFFD)
  \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
  \p{General_Category: Punct} \p{General_Category=Punctuation} (792)
  \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (792:
                            [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
                            \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
                            U+037E, U+0387, U+055A..055F,
                            U+0589..058A, U+05BE ...)
  \p{General_Category: S} \p{General_Category=Symbol} (7292)
  \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (62)
  \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
                            [\x20\xa0], U+1680, U+2000..200A,
                            U+2028..2029, U+202F, U+205F ...)
  \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
  \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
  \p{General_Category: So} \p{General_Category=Other_Symbol} (6161)
  \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
                            (17: [\x20\xa0], U+1680, U+2000..200A,
                            U+202F, U+205F, U+3000)
  \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
                            (429: U+0903, U+093B, U+093E..0940,
                            U+0949..094C, U+094E..094F, U+0982..0983
                            ...)
  \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
                            U+D800..DFFF)
  \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7292:
                            [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
                            \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
                            U+02D2..02DF, U+02E5..02EB, U+02ED,
                            U+02EF..02FF ...)
2648   \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
2649                             /i= General_Category=Cased_Letter) (31:
2650                             U+01C5, U+01C8, U+01CB, U+01F2,
2651                             U+1F88..1F8F, U+1F98..1F9F ...)
2652   \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
2653                             (836_602 plus all above-Unicode code
2654                             points: U+0378..0379, U+0380..0383,
2655                             U+038B, U+038D, U+03A2, U+0530 ...)
2656   \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
2657                             /i= General_Category=Cased_Letter)
2658                             (1788: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
2659                             U+0102, U+0104, U+0106, U+0108 ...)
2660   \p{General_Category: Z} \p{General_Category=Separator} (19)
2661   \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
2662   \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
2663                             (1)
2664   \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
                            \p{InPunctuation}) (112)
X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
                            \p{Block=Geometric_Shapes_Extended})
                            (128)
X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
                            (Short: \p{InGeometricShapesExt}) (128)
  \p{Geor}                \p{Georgian} (= \p{Script_Extensions=
                            Georgian}) (NOT \p{Block=Georgian}) (175)
  \p{Georgian}            \p{Script_Extensions=Georgian} (Short:
                            \p{Geor}; NOT \p{Block=Georgian}) (175)
X \p{Georgian_Ext}        \p{Georgian_Extended} (= \p{Block=
                            Georgian_Extended}) (48)
X \p{Georgian_Extended}   \p{Block=Georgian_Extended} (Short:
                            \p{InGeorgianExt}) (48)
X \p{Georgian_Sup}        \p{Georgian_Supplement} (= \p{Block=
                            Georgian_Supplement}) (48)
X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
                            \p{InGeorgianSup}) (48)
  \p{Glag}                \p{Glagolitic} (= \p{Script_Extensions=
                            Glagolitic}) (NOT \p{Block=Glagolitic})
                            (136)
  \p{Glagolitic}          \p{Script_Extensions=Glagolitic} (Short:
                            \p{Glag}; NOT \p{Block=Glagolitic}) (136)
X \p{Glagolitic_Sup}      \p{Glagolitic_Supplement} (= \p{Block=
                            Glagolitic_Supplement}) (48)
X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
                            \p{InGlagoliticSup}) (48)
  \p{Gong}                \p{Gunjala_Gondi} (= \p{Script_Extensions=
                            Gunjala_Gondi}) (NOT \p{Block=
                            Gunjala_Gondi}) (65)
  \p{Gonm}                \p{Masaram_Gondi} (= \p{Script_Extensions=
                            Masaram_Gondi}) (NOT \p{Block=
                            Masaram_Gondi}) (77)
  \p{Goth}                \p{Gothic} (= \p{Script_Extensions=
                            Gothic}) (NOT \p{Block=Gothic}) (27)
  \p{Gothic}              \p{Script_Extensions=Gothic} (Short:
                            \p{Goth}; NOT \p{Block=Gothic}) (27)
  \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
                            (135_898)
  \p{Gr_Base: *}          \p{Grapheme_Base: *}
  \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
                            Y}) (1965)
  \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
  \p{Gran}                \p{Grantha} (= \p{Script_Extensions=
                            Grantha}) (NOT \p{Block=Grantha}) (116)
  \p{Grantha}             \p{Script_Extensions=Grantha} (Short:
                            \p{Gran}; NOT \p{Block=Grantha}) (116)
  \p{Graph}               \p{XPosixGraph} (275_378)
  \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
                            (135_898)
  \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase}) (978_214
                            plus all above-Unicode code points:
                            [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
                            U+0378..0379, U+0380..0383, U+038B,
                            U+038D ...)
  \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase})
                            (135_898: [\x20-\x7e\xa0-\xac\xae-\xff],
                            U+0100..02FF, U+0370..0377,
                            U+037A..037F, U+0384..038A, U+038C ...)
  \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
                            (3886)
  \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3886:
                            [^\n\r\x20-\x7e\xa0-\xac\xae-\xff],
                            U+061C, U+180E, U+200B, U+200E..200F,
                            U+2028..202E ...)
  \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
  \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
  \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
  \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
  \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
                            (0)
  \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
                            E_Base_GAZ} (0)
  \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
                            E_Modifier} (0)
  \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
                            (1970)
  \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1970:
                            U+0300..036F, U+0483..0489,
                            U+0591..05BD, U+05BF, U+05C1..05C2,
                            U+05C4..05C5 ...)
  \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
                            Glue_After_Zwj} (0)
  \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
  \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
                            U+1100..115F, U+A960..A97C)
  \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
  \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
                            U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
                            ...)
  \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
                            U+AC01..AC1B, U+AC1D..AC37,
                            U+AC39..AC53, U+AC55..AC6F,
                            U+AC71..AC8B, U+AC8D..ACA7 ...)
  \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_301
                            plus all above-Unicode code points:
                            [\x20-\x7e\xa0-\xac\xae-\xff],
                            U+0100..02FF, U+0370..0482,
                            U+048A..0590, U+05BE, U+05C0 ...)
  \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
                            (22)
  \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (22:
                            U+0600..0605, U+06DD, U+070F, U+08E2,
                            U+0D4E, U+110BD ...)
  \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
                            (26: U+1F1E6..1F1FF)
  \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
                            Regional_Indicator} (26)
  \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
                            SpacingMark} (375)
  \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (375:
                            U+0903, U+093B, U+093E..0940,
                            U+0949..094C, U+094E..094F, U+0982..0983
                            ...)
  \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
                            U+11A8..11FF, U+D7CB..D7FB)
  \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
                            U+1160..11A7, U+D7B0..D7C6)
  \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
                            (1_096_301 plus all above-Unicode code
                            points)
  \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
  \p{Grapheme_Extend}     \p{Grapheme_Extend=Y} (Short: \p{GrExt})
                            (1965)
  \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_147
                            plus all above-Unicode code points:
                            U+0000..02FF, U+0370..0482,
                            U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
  \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1965:
                            U+0300..036F, U+0483..0489,
                            U+0591..05BD, U+05BF, U+05C1..05C2,
                            U+05C4..05C5 ...)
  \p{Greek}               \p{Script_Extensions=Greek} (Short:
                            \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
X \p{Greek_And_Coptic}    \p{Block=Greek_And_Coptic} (Short:
                            \p{InGreek}) (144)
X \p{Greek_Ext}           \p{Greek_Extended} (= \p{Block=
                            Greek_Extended}) (256)
X \p{Greek_Extended}      \p{Block=Greek_Extended} (Short:
                            \p{InGreekExt}) (256)
  \p{Grek}                \p{Greek} (= \p{Script_Extensions=Greek})
                            (NOT \p{Greek_And_Coptic}) (522)
  \p{Gujarati}            \p{Script_Extensions=Gujarati} (Short:
                            \p{Gujr}; NOT \p{Block=Gujarati}) (105)
  \p{Gujr}                \p{Gujarati} (= \p{Script_Extensions=
                            Gujarati}) (NOT \p{Block=Gujarati}) (105)
  \p{Gunjala_Gondi}       \p{Script_Extensions=Gunjala_Gondi}
                            (Short: \p{Gong}; NOT \p{Block=
                            Gunjala_Gondi}) (65)
  \p{Gurmukhi}            \p{Script_Extensions=Gurmukhi} (Short:
                            \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
  \p{Guru}                \p{Gurmukhi} (= \p{Script_Extensions=
                            Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
                            \p{Block=Halfwidth_And_Fullwidth_Forms})
                            (240)
X \p{Half_Marks}          \p{Combining_Half_Marks} (= \p{Block=
                            Combining_Half_Marks}) (16)
X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
                            Halfwidth_And_Fullwidth_Forms} (Short:
                            \p{InHalfAndFullForms}) (240)
  \p{Han}                 \p{Script_Extensions=Han} (89_513)
  \p{Hang}                \p{Hangul} (= \p{Script_Extensions=
                            Hangul}) (NOT \p{Hangul_Syllables})
                            (11_775)
  \p{Hangul}              \p{Script_Extensions=Hangul} (Short:
                            \p{Hang}; NOT \p{Hangul_Syllables})
                            (11_775)
X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
                            (Short: \p{InCompatJamo}) (96)
X \p{Hangul_Jamo}         \p{Block=Hangul_Jamo} (Short: \p{InJamo})
                            (256)
X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
                            (Short: \p{InJamoExtA}) (32)
X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
                            (Short: \p{InJamoExtB}) (80)
  \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
                            (125)
  \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
                            U+1100..115F, U+A960..A97C)
  \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
                            (399)
  \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
                            U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
                            U+AC8C ...)
  \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
                            LVT_Syllable} (10_773)
  \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
                            (10_773: U+AC01..AC1B, U+AC1D..AC37,
                            U+AC39..AC53, U+AC55..AC6F,
                            U+AC71..AC8B, U+AC8D..ACA7 ...)
  \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
                            Not_Applicable} (1_102_583 plus all
                            above-Unicode code points)
  \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
                            (1_102_583 plus all above-Unicode code
                            points: U+0000..10FF, U+1200..A95F,
                            U+A97D..ABFF, U+D7A4..D7AF,
                            U+D7C7..D7CA, U+D7FC..infinity)
  \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
                            (137)
  \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
                            U+11A8..11FF, U+D7CB..D7FB)
  \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
                            (95)
  \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
                            U+1160..11A7, U+D7B0..D7C6)
X \p{Hangul_Syllables}    \p{Block=Hangul_Syllables} (Short:
                            \p{InHangul}) (11_184)
  \p{Hani}                \p{Han} (= \p{Script_Extensions=Han})
                            (89_513)
  \p{Hanifi_Rohingya}     \p{Script_Extensions=Hanifi_Rohingya}
                            (Short: \p{Rohg}; NOT \p{Block=
                            Hanifi_Rohingya}) (55)
  \p{Hano}                \p{Hanunoo} (= \p{Script_Extensions=
                            Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
  \p{Hanunoo}             \p{Script_Extensions=Hanunoo} (Short:
                            \p{Hano}; NOT \p{Block=Hanunoo}) (23)
  \p{Hatr}                \p{Hatran} (= \p{Script_Extensions=
                            Hatran}) (NOT \p{Block=Hatran}) (26)
  \p{Hatran}              \p{Script_Extensions=Hatran} (Short:
                            \p{Hatr}; NOT \p{Block=Hatran}) (26)
  \p{Hebr}                \p{Hebrew} (= \p{Script_Extensions=
                            Hebrew}) (NOT \p{Block=Hebrew}) (134)
  \p{Hebrew}              \p{Script_Extensions=Hebrew} (Short:
                            \p{Hebr}; NOT \p{Block=Hebrew}) (134)
  \p{Hex}                 \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  \p{Hex: *}              \p{Hex_Digit: *}
  \p{Hex_Digit}           \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  \p{Hex_Digit: N*}       (Short: \p{Hex=N}, \P{Hex}) (1_114_068
                            plus all above-Unicode code points:
                            [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
                            >?\@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
                            U+0100..FF0F, U+FF1A..FF20,
                            U+FF27..FF40, U+FF47..infinity)
  \p{Hex_Digit: Y*}       (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
                            f], U+FF10..FF19, U+FF21..FF26,
                            U+FF41..FF46)
X \p{High_Private_Use_Surrogates} \p{Block=
                            High_Private_Use_Surrogates} (Short:
                            \p{InHighPUSurrogates}) (128)
X \p{High_PU_Surrogates}  \p{High_Private_Use_Surrogates} (=
                            \p{Block=High_Private_Use_Surrogates})
                            (128)
X \p{High_Surrogates}     \p{Block=High_Surrogates} (896)
  \p{Hira}                \p{Hiragana} (= \p{Script_Extensions=
                            Hiragana}) (NOT \p{Block=Hiragana}) (431)
  \p{Hiragana}            \p{Script_Extensions=Hiragana} (Short:
                            \p{Hira}; NOT \p{Block=Hiragana}) (431)
  \p{Hluw}                \p{Anatolian_Hieroglyphs} (=
                            \p{Script_Extensions=
                            Anatolian_Hieroglyphs}) (NOT \p{Block=
                            Anatolian_Hieroglyphs}) (583)
  \p{Hmng}                \p{Pahawh_Hmong} (= \p{Script_Extensions=
                            Pahawh_Hmong}) (NOT \p{Block=
                            Pahawh_Hmong}) (127)
  \p{Hmnp}                \p{Nyiakeng_Puachue_Hmong} (=
                            \p{Script_Extensions=
                            Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
                            Nyiakeng_Puachue_Hmong}) (71)
  \p{HorizSpace}          \p{XPosixBlank} (18)
  \p{Hst: *}              \p{Hangul_Syllable_Type: *}
  \p{Hung}                \p{Old_Hungarian} (= \p{Script_Extensions=
                            Old_Hungarian}) (NOT \p{Block=
                            Old_Hungarian}) (108)
D \p{Hyphen}              \p{Hyphen=Y} (11)
D \p{Hyphen: N*}          Supplanted by Line_Break property values;
                            see www.unicode.org/reports/tr14
                            (Single: \P{Hyphen}) (1_114_101 plus all
                            above-Unicode code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-
                            Z\[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-
                            \xff], U+0100..0589, U+058B..1805,
                            U+1807..200F, U+2012..2E16, U+2E18..30FA
                            ...)
D \p{Hyphen: Y*}          Supplanted by Line_Break property values;
                            see www.unicode.org/reports/tr14
                            (Single: \p{Hyphen}) (11: [\-\xad],
                            U+058A, U+1806, U+2010..2011, U+2E17,
                            U+30FB ...)
  \p{ID_Continue}         \p{ID_Continue=Y} (Short: \p{IDC}; NOT
                            \p{Ideographic_Description_Characters})
                            (128_789)
  \p{ID_Continue: N*}     (Short: \p{IDC=N}, \P{IDC}) (985_323 plus
                            all above-Unicode code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
                            >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
                            \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
                            U+02C2..02C5, U+02D2..02DF,
                            U+02E5..02EB, U+02ED, U+02EF..02FF ...)
  \p{ID_Continue: Y*}     (Short: \p{IDC=Y}, \p{IDC}) (128_789:
                            [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
                            \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
                            U+02C6..02D1, U+02E0..02E4, U+02EC,
                            U+02EE ...)
  \p{ID_Start}            \p{ID_Start=Y} (Short: \p{IDS}) (125_884)
  \p{ID_Start: N*}        (Short: \p{IDS=N}, \P{IDS}) (988_228 plus
                            all above-Unicode code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
                            \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
                            U+02C2..02C5, U+02D2..02DF,
                            U+02E5..02EB, U+02ED, U+02EF..036F ...)
  \p{ID_Start: Y*}        (Short: \p{IDS=Y}, \p{IDS}) (125_884: [A-
                            Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                            \xff], U+0100..02C1, U+02C6..02D1,
                            U+02E0..02E4, U+02EC, U+02EE ...)
  \p{IDC}                 \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
                            \p{Ideographic_Description_Characters})
                            (128_789)
  \p{IDC: *}              \p{ID_Continue: *}
  \p{Ideo}                \p{Ideographic} (= \p{Ideographic=Y})
                            (96_190)
  \p{Ideo: *}             \p{Ideographic: *}
  \p{Ideographic}         \p{Ideographic=Y} (Short: \p{Ideo})
                            (96_190)
  \p{Ideographic: N*}     (Short: \p{Ideo=N}, \P{Ideo}) (1_017_922
                            plus all above-Unicode code points:
                            U+0000..3005, U+3008..3020,
                            U+302A..3037, U+303B..33FF,
                            U+4DB6..4DFF, U+9FF0..F8FF ...)
  \p{Ideographic: Y*}     (Short: \p{Ideo=Y}, \p{Ideo}) (96_190:
                            U+3006..3007, U+3021..3029,
                            U+3038..303A, U+3400..4DB5,
                            U+4E00..9FEF, U+F900..FA6D ...)
X \p{Ideographic_Description_Characters} \p{Block=
                            Ideographic_Description_Characters}
                            (Short: \p{InIDC}) (16)
X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
                            \p{Block=
                            Ideographic_Symbols_And_Punctuation})
                            (32)
X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
                            Ideographic_Symbols_And_Punctuation}
                            (Short: \p{InIdeographicSymbols}) (32)
  \p{IDS}                 \p{ID_Start} (= \p{ID_Start=Y}) (125_884)
  \p{IDS: *}              \p{ID_Start: *}
  \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
                            \p{IDSB}) (10)
  \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
                            (1_114_102 plus all above-Unicode code
                            points: U+0000..2FEF, U+2FF2..2FF3,
                            U+2FFC..infinity)
  \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
                            U+2FF0..2FF1, U+2FF4..2FFB)
  \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
                            \p{IDST}) (2)
  \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
                            (1_114_110 plus all above-Unicode code
                            points: U+0000..2FF1, U+2FF4..infinity)
  \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
                            U+2FF2..2FF3)
  \p{IDSB}                \p{IDS_Binary_Operator} (=
                            \p{IDS_Binary_Operator=Y}) (10)
  \p{IDSB: *}             \p{IDS_Binary_Operator: *}
  \p{IDST}                \p{IDS_Trinary_Operator} (=
                            \p{IDS_Trinary_Operator=Y}) (2)
  \p{IDST: *}             \p{IDS_Trinary_Operator: *}
  \p{Imperial_Aramaic}    \p{Script_Extensions=Imperial_Aramaic}
                            (Short: \p{Armi}; NOT \p{Block=
                            Imperial_Aramaic}) (31)
  \p{In: *}               \p{Present_In: *} (Perl extension)
X \p{In_*}                \p{Block: *}
X \p{Indic_Number_Forms}  \p{Common_Indic_Number_Forms} (= \p{Block=
                            Common_Indic_Number_Forms}) (16)
  \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
                            (349: U+093C, U+0941..0944, U+094D,
                            U+0952, U+0956..0957, U+0962..0963 ...)
  \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
                            BottomAndLeft}) (1: U+A9BF)
  \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
                            BottomAndRight}) (2: U+1B3B, U+A9C0)
  \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (61:
                            U+093F, U+094E, U+09BF, U+09C7..09C8,
                            U+0A3F, U+0ABF ...)
  \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
                            LeftAndRight}) (21: U+09CB..09CC,
                            U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
                            U+0DDC, U+0DDE ...)
  \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_936
                            plus all above-Unicode code points:
                            U+0000..08FF, U+0904..0939, U+093D,
                            U+0950, U+0958..0961, U+0964..0980 ...)
  \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
                            Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
                            U+10A01, U+10A06)
  \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (281:
                            U+0903, U+093B, U+093E, U+0940,
                            U+0949..094C, U+094F ...)
  \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (398:
                            U+0900..0902, U+093A, U+0945..0948,
                            U+0951, U+0953..0955, U+0981 ...)
  \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
                            TopAndBottom}) (10: U+0C48, U+0F73,
                            U+0F76..0F79, U+0F81, U+1B3C,
                            U+1112E..1112F)
  \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
                            \p{InPC=TopAndBottomAndRight}) (1:
                            U+1B3D)
  \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
                            TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
                            U+1C29, U+114BB, U+115B9)
  \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
                            \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
                            U+0DDD, U+17BF, U+115BB)
  \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
                            TopAndRight}) (13: U+0AC9, U+0B57,
                            U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
                            U+1925..1926 ...)
  \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
                            VisualOrderLeft}) (19: U+0E40..0E44,
                            U+0EC0..0EC4, U+19B5..19B7, U+19BA,
                            U+AAB5..AAB6, U+AAB9 ...)
X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
  \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
                            (17: U+093D, U+09BD, U+0ABD, U+0B3D,
                            U+0C3D, U+0CBD ...)
  \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (86:
                            U+0900..0902, U+0981..0982, U+09FC,
                            U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
  \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
                            \p{InSC=BrahmiJoiningNumber}) (20:
                            U+11052..11065)
  \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
                            CantillationMark}) (59: U+0951..0952,
                            U+0A51, U+0AFA..0AFC, U+1CD0..1CD2,
                            U+1CD4..1CE1, U+1CF4 ...)
  \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
                            (2160: U+0915..0939, U+0958..095F,
                            U+0978..097F, U+0995..09A8,
                            U+09AA..09B0, U+09B2 ...)
  \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
                            ConsonantDead}) (12: U+09CE,
                            U+0D54..0D56, U+0D7A..0D7F, U+1CF2..1CF3)
  \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
                            ConsonantFinal}) (67: U+1930..1931,
                            U+1933..1939, U+19C1..19C7,
                            U+1A58..1A59, U+1BBE..1BBF, U+1BF0..1BF1
                            ...)
  \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
                            \p{InSC=ConsonantHeadLetter}) (5:
                            U+0F88..0F8C)
  \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
                            \p{InSC=ConsonantInitialPostfixed}) (1:
                            U+1A5A)
  \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
                            ConsonantKiller}) (2: U+0E4C, U+17CD)
  \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
                            ConsonantMedial}) (29: U+0A75,
                            U+0EBC..0EBD, U+103B..103E,
                            U+105E..1060, U+1082, U+1A55..1A56 ...)
  \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
                            \p{InSC=ConsonantPlaceholder}) (22: [\-
                            \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
                            U+104E, U+1900 ...)
  \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
                            \p{InSC=ConsonantPrecedingRepha}) (2:
                            U+0D4E, U+11D46)
  \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
                            ConsonantPrefixed}) (9: U+111C2..111C3,
                            U+11A3A, U+11A84..11A89)
  \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
                            ConsonantSubjoined}) (94: U+0F8D..0F97,
                            U+0F99..0FBC, U+1929..192B, U+1A57,
                            U+1A5B..1A5E, U+1BA1..1BA3 ...)
  \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
                            \p{InSC=ConsonantSucceedingRepha}) (4:
                            U+17CC, U+1B03, U+1B81, U+A982)
  \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
                            \p{InSC=ConsonantWithStacker}) (6:
                            U+0CF1..0CF2, U+1CF5..1CF6,
                            U+11003..11004)
  \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
                            GeminationMark}) (3: U+0A71, U+11237,
                            U+11A98)
  \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
                            InvisibleStacker}) (11: U+1039, U+17D2,
                            U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
  \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
                            U+200D)
  \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
                            ModifyingLetter}) (1: U+0B83)
  \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
                            NonJoiner}) (1: U+200C)
  \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (30:
                            U+093C, U+09BC, U+0A3C, U+0ABC,
                            U+0AFD..0AFF, U+0B3C ...)
  \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (481:
                            [0-9], U+0966..096F, U+09E6..09EF,
                            U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
                            ...)
  \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
                            NumberJoiner}) (1: U+1107F)
  \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
                            (1_109_650 plus all above-Unicode code
                            points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,.\/:;<=>?\@A-
                            Z\[\\\]\^_`a-z\{\|\}~\x7f-\x9f\xa1-
                            \xb1\xb4-\xd6\xd8-\xff], U+0100..08FF,
                            U+0950, U+0953..0954, U+0964..0965,
                            U+0970..0971 ...)
  \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
                            PureKiller}) (21: U+0D3B..0D3C, U+0E3A,
                            U+0E4E, U+0EBA, U+0F84, U+103A ...)
  \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
                            RegisterShifter}) (2: U+17C9..17CA)
  \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
                            SyllableModifier}) (25: [\xb2-\xb3],
                            U+09FE, U+0F35, U+0F37, U+0FC6, U+17CB
                            ...)
  \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
                            ToneLetter}) (7: U+1970..1974, U+AAC0,
                            U+AAC2)
  \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
                            (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
                            U+1063..1064, U+1069..106D, U+1087..108D
                            ...)
  \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
                            U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3186                             U+0BCD ...)
3187   \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3188                             (35: U+0903, U+0983, U+0A03, U+0A83,
3189                             U+0B03, U+0C03 ...)
3190   \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3191                             U+1963..196D, U+A85E..A861, U+A866,
3192                             U+A922..A92A, U+11150..11154)
3193   \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3194                             VowelDependent}) (673: U+093A..093B,
3195                             U+093E..094C, U+094E..094F,
3196                             U+0955..0957, U+0962..0963, U+09BE..09C4
3197                             ...)
3198   \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3199                             VowelIndependent}) (476: U+0904..0914,
3200                             U+0960..0961, U+0972..0977,
3201                             U+0985..098C, U+098F..0990, U+0993..0994
3202                             ...)
   \p{Inherited}           \p{Script_Extensions=Inherited} (Short:
                             \p{Zinh}) (502)
   \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
                             (Short: \p{Pi}) (12)
   \p{InPC: *}             \p{Indic_Positional_Category: *}
   \p{InSC: *}             \p{Indic_Syllabic_Category: *}
   \p{Inscriptional_Pahlavi} \p{Script_Extensions=
                             Inscriptional_Pahlavi} (Short: \p{Phli};
                             NOT \p{Block=Inscriptional_Pahlavi}) (27)
   \p{Inscriptional_Parthian} \p{Script_Extensions=
                             Inscriptional_Parthian} (Short:
                             \p{Prti}; NOT \p{Block=
                             Inscriptional_Parthian}) (30)
 X \p{IPA_Ext}             \p{IPA_Extensions} (= \p{Block=
                             IPA_Extensions}) (96)
 X \p{IPA_Extensions}      \p{Block=IPA_Extensions} (Short:
                             \p{InIPAExt}) (96)
   \p{Is_*}                \p{*} (Any exceptions are individually
                             noted beginning with the word NOT.) If
                             an entry has flag(s) at its beginning,
                             like "D", the "Is_" form has the same
                             flag(s)
   \p{Ital}                \p{Old_Italic} (= \p{Script_Extensions=
                             Old_Italic}) (NOT \p{Block=Old_Italic})
                             (39)
 X \p{Jamo}                \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
                             (256)
 X \p{Jamo_Ext_A}          \p{Hangul_Jamo_Extended_A} (= \p{Block=
                             Hangul_Jamo_Extended_A}) (32)
 X \p{Jamo_Ext_B}          \p{Hangul_Jamo_Extended_B} (= \p{Block=
                             Hangul_Jamo_Extended_B}) (80)
   \p{Java}                \p{Javanese} (= \p{Script_Extensions=
                             Javanese}) (NOT \p{Block=Javanese}) (91)
   \p{Javanese}            \p{Script_Extensions=Javanese} (Short:
                             \p{Java}; NOT \p{Block=Javanese}) (91)
   \p{Jg: *}               \p{Joining_Group: *}
   \p{Join_C}              \p{Join_Control} (= \p{Join_Control=Y}) (2)
   \p{Join_C: *}           \p{Join_Control: *}
   \p{Join_Control}        \p{Join_Control=Y} (Short: \p{JoinC}) (2)
   \p{Join_Control: N*}    (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
                             plus all above-Unicode code points:
                             U+0000..200B, U+200E..infinity)
   \p{Join_Control: Y*}    (Short: \p{JoinC=Y}, \p{JoinC}) (2:
                             U+200C..200D)
   \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
                             U+08BB)
   \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
                             U+08BD)
   \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1:
                             U+08BC)
   \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (8: U+0639..063A,
                             U+06A0, U+06FC, U+075D..075F, U+08B3)
   \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
   \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10: U+0622..0623,
                             U+0625, U+0627, U+0671..0673, U+0675,
                             U+0773..0774)
   \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (24: U+0628,
                             U+062A..062B, U+066E, U+0679..0680,
                             U+0750..0756, U+08A0..08A1 ...)
   \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
   \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
                             BurushaskiYehBarree}) (2: U+077A..077B)
   \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (15: U+062F..0630,
                             U+0688..0690, U+06EE, U+0759..075A,
                             U+08AE)
   \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
                             U+0715..0716, U+072A, U+072F)
   \p{Joining_Group: E}    (Short: \p{Jg=E}) (1: U+0725)
   \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
                             U+063D..063F, U+06CC, U+06CE,
                             U+0775..0776)
   \p{Joining_Group: Fe}   (Short: \p{Jg=Fe}) (1: U+074F)
   \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (10: U+0641,
                             U+06A1..06A6, U+0760..0761, U+08A4)
   \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
                             U+0724)
   \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (14: U+063B..063C,
                             U+06A9, U+06AB, U+06AF..06B4,
                             U+0762..0764, U+08B0)
   \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
                             U+072E)
   \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (18: U+062C..062E,
                             U+0681..0687, U+06BF, U+0757..0758,
                             U+076E..076F, U+0772 ...)
   \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
                             HamzaOnHehGoal}) (1: U+06C3)
   \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
                             HanifiRohingyaKinnaYa}) (4: U+10D19,
                             U+10D1E, U+10D20, U+10D23)
   \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
                             HanifiRohingyaPa}) (3: U+10D02, U+10D09,
                             U+10D1C)
   \p{Joining_Group: He}   (Short: \p{Jg=He}) (1: U+0717)
   \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1: U+0647)
   \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
                             U+06C1..06C2)
   \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
   \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (6: U+0643,
                             U+06AC..06AE, U+077F, U+08B4)
   \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
   \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
   \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
                             U+06BE, U+06FF)
   \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (7: U+0644,
                             U+06B5..06B8, U+076A, U+08A6)
   \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
   \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
                             U+0866)
   \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
                             U+0861)
   \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
                             U+0868)
   \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
                             (1: U+0869)
   \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
                             U+0860)
   \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
                             U+0864)
   \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
                             (1: U+0865)
   \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
                             U+0862)
   \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
                             U+0867)
   \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
                             U+086A)
   \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
                             U+0863)
   \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
                             ManichaeanAleph}) (1: U+10AC0)
   \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
                             (2: U+10AD9..10ADA)
   \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
                             (2: U+10AC1..10AC2)
   \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
                             ManichaeanDaleth}) (1: U+10AC5)
   \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
                             ManichaeanDhamedh}) (1: U+10AD4)
   \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
                             (1: U+10AEC)
   \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
                             ManichaeanGimel}) (2: U+10AC3..10AC4)
   \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
                             (1: U+10ACD)
   \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
                             ManichaeanHundred}) (1: U+10AEF)
   \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
                             (3: U+10AD0..10AD2)
   \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
                             ManichaeanLamedh}) (1: U+10AD3)
   \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
                             (1: U+10AD6)
   \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
                             (1: U+10AD7)
   \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
                             (1: U+10AEB)
   \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
                             U+10ADB..10ADC)
   \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
                             (3: U+10ADE..10AE0)
   \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
                             (1: U+10AE1)
   \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
                             ManichaeanSadhe}) (1: U+10ADD)
   \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
                             ManichaeanSamekh}) (1: U+10AD8)
   \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
                             (1: U+10AE4)
   \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
                             (1: U+10AED)
   \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
                             (1: U+10ACE)
   \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
                             ManichaeanThamedh}) (1: U+10AD5)
   \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
                             ManichaeanTwenty}) (1: U+10AEE)
   \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
                             (1: U+10AC7)
   \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
                             (1: U+10ACF)
   \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
                             ManichaeanZayin}) (2: U+10AC9..10ACA)
   \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
                             U+0765..0766, U+08A7)
   \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1: U+0721)
   \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
                             (1_113_800 plus all above-Unicode code
                             points: U+0000..061F, U+0621, U+0640,
                             U+064B..066D, U+0670, U+0674 ...)
   \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8: U+0646,
                             U+06B9..06BC, U+0767..0769)
   \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1: U+0722)
   \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1: U+06BD)
   \p{Joining_Group: Pe}   (Short: \p{Jg=Pe}) (1: U+0726)
   \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (5: U+0642, U+066F,
                             U+06A7..06A8, U+08A5)
   \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
   \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (19: U+0631..0632,
                             U+0691..0699, U+06EF, U+075B,
                             U+076B..076C, U+0771 ...)
   \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
                             U+0727)
   \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
                             U+08AC)
   \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (6: U+0635..0636,
                             U+069D..069E, U+06FB, U+08AF)
   \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
   \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
                             U+069A..069C, U+06FA, U+075C, U+076D,
                             U+0770 ...)
   \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
   \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
   \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
                             U+08B1)
   \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
   \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
   \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (4: U+0637..0638,
                             U+069F, U+08A3)
   \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1: U+072C)
   \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
                             U+0629, U+06C0, U+06D5)
   \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
                             Hamza_On_Heh_Goal} (1)
   \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
   \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
                             U+0676..0677, U+06C4..06CB, U+06CF,
                             U+0778..0779 ...)
   \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
                             U+0649..064A, U+0678, U+06D0..06D1,
                             U+0777 ...)
   \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
                             U+06D2..06D3)
   \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
                             U+06CD)
   \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
   \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
   \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
   \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
   \p{Joining_Type: C}     \p{Joining_Type=Join_Causing} (4)
   \p{Joining_Type: D}     \p{Joining_Type=Dual_Joining} (565)
   \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (565: U+0620,
                             U+0626, U+0628, U+062A..062E,
                             U+0633..063F, U+0641..0647 ...)
   \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4: U+0640,
                             U+07FA, U+180A, U+200D)
   \p{Joining_Type: L}     \p{Joining_Type=Left_Joining} (4)
   \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (4: U+A872,
                             U+10ACD, U+10AD7, U+10D00)
   \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_437 plus
                             all above-Unicode code points: [\x00-
                             \xac\xae-\xff], U+0100..02FF,
                             U+0370..0482, U+048A..0590, U+05BE,
                             U+05C0 ...)
   \p{Joining_Type: R}     \p{Joining_Type=Right_Joining} (118)
   \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (118:
                             U+0622..0625, U+0627, U+0629,
                             U+062F..0632, U+0648, U+0671..0673 ...)
   \p{Joining_Type: T}     \p{Joining_Type=Transparent} (1984)
   \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1984: [\xad],
                             U+0300..036F, U+0483..0489,
                             U+0591..05BD, U+05BF, U+05C1..05C2 ...)
   \p{Joining_Type: U}     \p{Joining_Type=Non_Joining} (1_111_437
                             plus all above-Unicode code points)
   \p{Jt: *}               \p{Joining_Type: *}
   \p{Kaithi}              \p{Script_Extensions=Kaithi} (Short:
                             \p{Kthi}; NOT \p{Block=Kaithi}) (87)
   \p{Kali}                \p{Kayah_Li} (= \p{Script_Extensions=
                             Kayah_Li}) (48)
   \p{Kana}                \p{Katakana} (= \p{Script_Extensions=
                             Katakana}) (NOT \p{Block=Katakana}) (356)
 X \p{Kana_Ext_A}          \p{Kana_Extended_A} (= \p{Block=
                             Kana_Extended_A}) (48)
 X \p{Kana_Extended_A}     \p{Block=Kana_Extended_A} (Short:
                             \p{InKanaExtA}) (48)
 X \p{Kana_Sup}            \p{Kana_Supplement} (= \p{Block=
                             Kana_Supplement}) (256)
 X \p{Kana_Supplement}     \p{Block=Kana_Supplement} (Short:
                             \p{InKanaSup}) (256)
 X \p{Kanbun}              \p{Block=Kanbun} (16)
 X \p{Kangxi}              \p{Kangxi_Radicals} (= \p{Block=
                             Kangxi_Radicals}) (224)
 X \p{Kangxi_Radicals}     \p{Block=Kangxi_Radicals} (Short:
                             \p{InKangxi}) (224)
   \p{Kannada}             \p{Script_Extensions=Kannada} (Short:
                             \p{Knda}; NOT \p{Block=Kannada}) (104)
   \p{Katakana}            \p{Script_Extensions=Katakana} (Short:
                             \p{Kana}; NOT \p{Block=Katakana}) (356)
 X \p{Katakana_Ext}        \p{Katakana_Phonetic_Extensions} (=
                             \p{Block=Katakana_Phonetic_Extensions})
                             (16)
 X \p{Katakana_Phonetic_Extensions} \p{Block=
                             Katakana_Phonetic_Extensions} (Short:
                             \p{InKatakanaExt}) (16)
   \p{Kayah_Li}            \p{Script_Extensions=Kayah_Li} (Short:
                             \p{Kali}) (48)
   \p{Khar}                \p{Kharoshthi} (= \p{Script_Extensions=
                             Kharoshthi}) (NOT \p{Block=Kharoshthi})
                             (68)
   \p{Kharoshthi}          \p{Script_Extensions=Kharoshthi} (Short:
                             \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
   \p{Khmer}               \p{Script_Extensions=Khmer} (Short:
                             \p{Khmr}; NOT \p{Block=Khmer}) (146)
 X \p{Khmer_Symbols}       \p{Block=Khmer_Symbols} (32)
   \p{Khmr}                \p{Khmer} (= \p{Script_Extensions=Khmer})
                             (NOT \p{Block=Khmer}) (146)
   \p{Khoj}                \p{Khojki} (= \p{Script_Extensions=
                             Khojki}) (NOT \p{Block=Khojki}) (82)
   \p{Khojki}              \p{Script_Extensions=Khojki} (Short:
                             \p{Khoj}; NOT \p{Block=Khojki}) (82)
   \p{Khudawadi}           \p{Script_Extensions=Khudawadi} (Short:
                             \p{Sind}; NOT \p{Block=Khudawadi}) (81)
   \p{Knda}                \p{Kannada} (= \p{Script_Extensions=
                             Kannada}) (NOT \p{Block=Kannada}) (104)
   \p{Kthi}                \p{Kaithi} (= \p{Script_Extensions=
                             Kaithi}) (NOT \p{Block=Kaithi}) (87)
   \p{L} \pL               \p{Letter} (= \p{General_Category=Letter})
                             (125_643)
 X \p{L&}                  \p{Cased_Letter} (= \p{General_Category=
                             Cased_Letter}) (3970)
 X \p{L_}                  \p{Cased_Letter} (= \p{General_Category=
                             Cased_Letter}) Note the trailing '_'
                             matters in spite of loose matching
                             rules. (3970)
   \p{Lana}                \p{Tai_Tham} (= \p{Script_Extensions=
                             Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
   \p{Lao}                 \p{Script_Extensions=Lao} (NOT \p{Block=
                             Lao}) (82)
   \p{Laoo}                \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
                             \p{Block=Lao}) (82)
   \p{Latin}               \p{Script_Extensions=Latin} (Short:
                             \p{Latn}) (1387)
 X \p{Latin_1}             \p{Latin_1_Supplement} (= \p{Block=
                             Latin_1_Supplement}) (128)
 X \p{Latin_1_Sup}         \p{Latin_1_Supplement} (= \p{Block=
                             Latin_1_Supplement}) (128)
 X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
                             \p{InLatin1}) (128)
 X \p{Latin_Ext_A}         \p{Latin_Extended_A} (= \p{Block=
                             Latin_Extended_A}) (128)
 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
                             \p{Block=Latin_Extended_Additional})
                             (256)
 X \p{Latin_Ext_B}         \p{Latin_Extended_B} (= \p{Block=
                             Latin_Extended_B}) (208)
 X \p{Latin_Ext_C}         \p{Latin_Extended_C} (= \p{Block=
                             Latin_Extended_C}) (32)
 X \p{Latin_Ext_D}         \p{Latin_Extended_D} (= \p{Block=
                             Latin_Extended_D}) (224)
 X \p{Latin_Ext_E}         \p{Latin_Extended_E} (= \p{Block=
                             Latin_Extended_E}) (64)
 X \p{Latin_Extended_A}    \p{Block=Latin_Extended_A} (Short:
                             \p{InLatinExtA}) (128)
 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
                             (Short: \p{InLatinExtAdditional}) (256)
 X \p{Latin_Extended_B}    \p{Block=Latin_Extended_B} (Short:
                             \p{InLatinExtB}) (208)
 X \p{Latin_Extended_C}    \p{Block=Latin_Extended_C} (Short:
                             \p{InLatinExtC}) (32)
 X \p{Latin_Extended_D}    \p{Block=Latin_Extended_D} (Short:
                             \p{InLatinExtD}) (224)
 X \p{Latin_Extended_E}    \p{Block=Latin_Extended_E} (Short:
                             \p{InLatinExtE}) (64)
   \p{Latn}                \p{Latin} (= \p{Script_Extensions=Latin})
                             (1387)
   \p{Lb: *}               \p{Line_Break: *}
   \p{LC}                  \p{Cased_Letter} (= \p{General_Category=
                             Cased_Letter}) (3970)
   \p{Lepc}                \p{Lepcha} (= \p{Script_Extensions=
                             Lepcha}) (NOT \p{Block=Lepcha}) (74)
   \p{Lepcha}              \p{Script_Extensions=Lepcha} (Short:
                             \p{Lepc}; NOT \p{Block=Lepcha}) (74)
   \p{Letter}              \p{General_Category=Letter} (Short: \p{L})
                             (125_643)
   \p{Letter_Number}       \p{General_Category=Letter_Number} (Short:
                             \p{Nl}) (236)
 X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
   \p{Limb}                \p{Limbu} (= \p{Script_Extensions=Limbu})
                             (NOT \p{Block=Limbu}) (69)
   \p{Limbu}               \p{Script_Extensions=Limbu} (Short:
                             \p{Limb}; NOT \p{Block=Limbu}) (69)
   \p{Lina}                \p{Linear_A} (= \p{Script_Extensions=
                             Linear_A}) (NOT \p{Block=Linear_A}) (386)
   \p{Linb}                \p{Linear_B} (= \p{Script_Extensions=
                             Linear_B}) (268)
   \p{Line_Break: AI}      \p{Line_Break=Ambiguous} (707)
   \p{Line_Break: AL}      \p{Line_Break=Alphabetic} (20_582)
   \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (20_582: [#&*<=>\@A-
                             Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
                             \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
                             U+02CE..02CF, U+02D1..02D7, U+02DC,
                             U+02DE ...)
   \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-
                             \xa8\xaa\xb2-\xb3\xb6-\xba\xbc-
                             \xbe\xd7\xf7], U+02C7, U+02C9..02CB,
                             U+02CD, U+02D0, U+02D8..02DB ...)
   \p{Line_Break: B2}      \p{Line_Break=Break_Both} (3)
   \p{Line_Break: BA}      \p{Line_Break=Break_After} (239)
   \p{Line_Break: BB}      \p{Line_Break=Break_Before} (45)
   \p{Line_Break: BK}      \p{Line_Break=Mandatory_Break} (4)
   \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (239: [\t\|\xad],
                             U+058A, U+05BE, U+0964..0965,
                             U+0E5A..0E5B, U+0F0B ...)
   \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (45: [\xb4],
                             U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
                             ...)
   \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
                             U+2E3A..2E3B)
   \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
   \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
   \p{Line_Break: CB}      \p{Line_Break=Contingent_Break} (1)
   \p{Line_Break: CJ}      \p{Line_Break=
                             Conditional_Japanese_Starter} (58)
   \p{Line_Break: CL}      \p{Line_Break=Close_Punctuation} (91)
   \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
   \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (91: [\}],
                             U+0F3B, U+0F3D, U+169C, U+2046, U+207E
                             ...)
   \p{Line_Break: CM}      \p{Line_Break=Combining_Mark} (2260)
   \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2260:
                             [^\t\n\cK\f\r\x20-\x7e\x85\xa0-\xff],
                             U+0300..034E, U+0350..035B,
                             U+0363..036F, U+0483..0489, U+0591..05BD
                             ...)
   \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (750:
                             U+0E01..0E3A, U+0E40..0E4E,
                             U+0E81..0E82, U+0E84, U+0E86..0E8A,
                             U+0E8C..0EA3 ...)
   \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
                             (58: U+3041, U+3043, U+3045, U+3047,
                             U+3049, U+3063 ...)
   \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
   \p{Line_Break: CP}      \p{Line_Break=Close_Parenthesis} (2)
   \p{Line_Break: CR}      \p{Line_Break=Carriage_Return} (1)
   \p{Line_Break: E_Base}  (Short: \p{Lb=EB}) (120: U+261D, U+26F9,
                             U+270A..270D, U+1F385, U+1F3C2..1F3C4,
                             U+1F3C7 ...)
   \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
   \p{Line_Break: EB}      \p{Line_Break=E_Base} (120)
   \p{Line_Break: EM}      \p{Line_Break=E_Modifier} (5)
   \p{Line_Break: EX}      \p{Line_Break=Exclamation} (37)
   \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37: [!?], U+05C6,
                             U+061B, U+061E..061F, U+06D4, U+07F9 ...)
   \p{Line_Break: GL}      \p{Line_Break=Glue} (25)
   \p{Line_Break: Glue}    (Short: \p{Lb=GL}) (25: [\xa0], U+034F,
                             U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
   \p{Line_Break: H2}      (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
                             U+AC38, U+AC54, U+AC70, U+AC8C ...)
   \p{Line_Break: H3}      (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
                             U+AC1D..AC37, U+AC39..AC53,
                             U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
                             ...)
   \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
                             U+05D0..05EA, U+05EF..05F2, U+FB1D,
                             U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
                             ...)
   \p{Line_Break: HL}      \p{Line_Break=Hebrew_Letter} (75)
   \p{Line_Break: HY}      \p{Line_Break=Hyphen} (1)
   \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1: [\-])
   \p{Line_Break: ID}      \p{Line_Break=Ideographic} (172_693)
   \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_693:
                             U+231A..231B, U+23F0..23F3,
                             U+2600..2603, U+2614..2615, U+2618,
                             U+261A..261C ...)
   \p{Line_Break: IN}      \p{Line_Break=Inseparable} (6)
   \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
                             U+037E, U+0589, U+060C..060D, U+07F8,
                             U+2044 ...)
   \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
                             U+22EF, U+FE19, U+10AF6)
   \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
   \p{Line_Break: IS}      \p{Line_Break=Infix_Numeric} (13)
   \p{Line_Break: JL}      (Short: \p{Lb=JL}) (125: U+1100..115F,
                             U+A960..A97C)
   \p{Line_Break: JT}      (Short: \p{Lb=JT}) (137: U+11A8..11FF,
                             U+D7CB..D7FB)
   \p{Line_Break: JV}      (Short: \p{Lb=JV}) (95: U+1160..11A7,
                             U+D7B0..D7C6)
   \p{Line_Break: LF}      \p{Line_Break=Line_Feed} (1)
   \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
   \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
                             U+2028..2029)
   \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
   \p{Line_Break: NL}      \p{Line_Break=Next_Line} (1)
   \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
                             U+203C..203D, U+2047..2049, U+3005,
                             U+301C, U+303B..303C ...)
   \p{Line_Break: NS}      \p{Line_Break=Nonstarter} (33)
   \p{Line_Break: NU}      \p{Line_Break=Numeric} (622)
   \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (622: [0-9],
                             U+0660..0669, U+066B..066C,
                             U+06F0..06F9, U+07C0..07C9, U+0966..096F
                             ...)
   \p{Line_Break: OP}      \p{Line_Break=Open_Punctuation} (88)
   \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (88:
3697                             [\(\[\{\xa1\xbf], U+0F3A, U+0F3C,
3698                             U+169B, U+201A, U+201E ...)
3699   \p{Line_Break: PO}      \p{Line_Break=Postfix_Numeric} (36)
3700   \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (36:
3701                             [\%\xa2\xb0], U+0609..060B, U+066A,
3702                             U+09F2..09F3, U+09F9, U+0D79 ...)
3703   \p{Line_Break: PR}      \p{Line_Break=Prefix_Numeric} (68)
3704   \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (68: [\$+\\\xa3-
3705                             \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
3706                             U+0AF1, U+0BF9 ...)
3707   \p{Line_Break: QU}      \p{Line_Break=Quotation} (39)
3708   \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
3709                             U+2018..2019, U+201B..201D, U+201F,
3710                             U+2039..203A, U+275B..2760 ...)
3711   \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
3712                             U+1F1E6..1F1FF)
3713   \p{Line_Break: RI}      \p{Line_Break=Regional_Indicator} (26)
3714   \p{Line_Break: SA}      \p{Line_Break=Complex_Context} (750)
3715 D \p{Line_Break: SG}      \p{Line_Break=Surrogate} (2048)
3716   \p{Line_Break: SP}      \p{Line_Break=Space} (1)
3717   \p{Line_Break: Space}   (Short: \p{Lb=SP}) (1: [\x20])
3718 D \p{Line_Break: Surrogate} Surrogates should never appear in well-
3719                             formed text, and therefore shouldn't be
3720                             the basis for line breaking (Short:
3721                             \p{Lb=SG}) (2048: U+D800..DFFF)
3722   \p{Line_Break: SY}      \p{Line_Break=Break_Symbols} (1)
3723   \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (901_897 plus all
3724                             above-Unicode code points: U+0378..0379,
3725                             U+0380..0383, U+038B, U+038D, U+03A2,
3726                             U+0530 ...)
3727   \p{Line_Break: WJ}      \p{Line_Break=Word_Joiner} (2)
3728   \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
3729   \p{Line_Break: XX}      \p{Line_Break=Unknown} (901_897 plus all
3730                             above-Unicode code points)
3731   \p{Line_Break: ZW}      \p{Line_Break=ZWSpace} (1)
3732   \p{Line_Break: ZWJ}     (Short: \p{Lb=ZWJ}) (1: U+200D)
3733   \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
  \p{Line_Separator}      \p{General_Category=Line_Separator}
                            (Short: \p{Zl}) (1)
  \p{Linear_A}            \p{Script_Extensions=Linear_A} (Short:
                            \p{Lina}; NOT \p{Block=Linear_A}) (386)
  \p{Linear_B}            \p{Script_Extensions=Linear_B} (Short:
                            \p{Linb}) (268)
X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
  \p{Lisu}                \p{Script_Extensions=Lisu} (48)
  \p{Ll}                  \p{Lowercase_Letter} (=
                            \p{General_Category=Lowercase_Letter})
                            (/i= General_Category=Cased_Letter)
                            (2151)
  \p{Lm}                  \p{Modifier_Letter} (=
                            \p{General_Category=Modifier_Letter})
                            (259)
  \p{Lo}                  \p{Other_Letter} (= \p{General_Category=
                            Other_Letter}) (121_414)
  \p{LOE}                 \p{Logical_Order_Exception} (=
                            \p{Logical_Order_Exception=Y}) (19)
  \p{LOE: *}              \p{Logical_Order_Exception: *}
  \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
                            \p{LOE}) (19)
  \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
                            (1_114_093 plus all above-Unicode code
                            points: U+0000..0E3F, U+0E45..0EBF,
                            U+0EC5..19B4, U+19B8..19B9,
                            U+19BB..AAB4, U+AAB7..AAB8 ...)
  \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
                            U+0E40..0E44, U+0EC0..0EC4,
                            U+19B5..19B7, U+19BA, U+AAB5..AAB6,
                            U+AAB9 ...)
X \p{Low_Surrogates}      \p{Block=Low_Surrogates} (1024)
  \p{Lower}               \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
                            Cased=Yes) (2340)
  \p{Lower: *}            \p{Lowercase: *}
  \p{Lowercase}           \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
                            Cased=Yes) (2340)
  \p{Lowercase: N*}       (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
                            No) (1_111_772 plus all above-Unicode
                            code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                            Z\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
                            \xb4\xb6-\xb9\xbb-\xde\xf7], U+0100,
                            U+0102, U+0104, U+0106, U+0108 ...)
  \p{Lowercase: Y*}       (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
                            Yes) (2340: [a-z\xaa\xb5\xba\xdf-
                            \xf6\xf8-\xff], U+0101, U+0103, U+0105,
                            U+0107, U+0109 ...)
  \p{Lowercase_Letter}    \p{General_Category=Lowercase_Letter}
                            (Short: \p{Ll}; /i= General_Category=
                            Cased_Letter) (2151)
  \p{Lt}                  \p{Titlecase_Letter} (=
                            \p{General_Category=Titlecase_Letter})
                            (/i= General_Category=Cased_Letter) (31)
  \p{Lu}                  \p{Uppercase_Letter} (=
                            \p{General_Category=Uppercase_Letter})
                            (/i= General_Category=Cased_Letter)
                            (1788)
  \p{Lyci}                \p{Lycian} (= \p{Script_Extensions=
                            Lycian}) (NOT \p{Block=Lycian}) (29)
  \p{Lycian}              \p{Script_Extensions=Lycian} (Short:
                            \p{Lyci}; NOT \p{Block=Lycian}) (29)
  \p{Lydi}                \p{Lydian} (= \p{Script_Extensions=
                            Lydian}) (NOT \p{Block=Lydian}) (27)
  \p{Lydian}              \p{Script_Extensions=Lydian} (Short:
                            \p{Lydi}; NOT \p{Block=Lydian}) (27)
  \p{M} \pM               \p{Mark} (= \p{General_Category=Mark})
                            (2268)
  \p{Mahajani}            \p{Script_Extensions=Mahajani} (Short:
                            \p{Mahj}; NOT \p{Block=Mahajani}) (61)
  \p{Mahj}                \p{Mahajani} (= \p{Script_Extensions=
                            Mahajani}) (NOT \p{Block=Mahajani}) (61)
X \p{Mahjong}             \p{Mahjong_Tiles} (= \p{Block=
                            Mahjong_Tiles}) (48)
X \p{Mahjong_Tiles}       \p{Block=Mahjong_Tiles} (Short:
                            \p{InMahjong}) (48)
  \p{Maka}                \p{Makasar} (= \p{Script_Extensions=
                            Makasar}) (NOT \p{Block=Makasar}) (25)
  \p{Makasar}             \p{Script_Extensions=Makasar} (Short:
                            \p{Maka}; NOT \p{Block=Makasar}) (25)
  \p{Malayalam}           \p{Script_Extensions=Malayalam} (Short:
                            \p{Mlym}; NOT \p{Block=Malayalam}) (125)
  \p{Mand}                \p{Mandaic} (= \p{Script_Extensions=
                            Mandaic}) (NOT \p{Block=Mandaic}) (30)
  \p{Mandaic}             \p{Script_Extensions=Mandaic} (Short:
                            \p{Mand}; NOT \p{Block=Mandaic}) (30)
  \p{Mani}                \p{Manichaean} (= \p{Script_Extensions=
                            Manichaean}) (NOT \p{Block=Manichaean})
                            (52)
  \p{Manichaean}          \p{Script_Extensions=Manichaean} (Short:
                            \p{Mani}; NOT \p{Block=Manichaean}) (52)
  \p{Marc}                \p{Marchen} (= \p{Script_Extensions=
                            Marchen}) (NOT \p{Block=Marchen}) (68)
  \p{Marchen}             \p{Script_Extensions=Marchen} (Short:
                            \p{Marc}; NOT \p{Block=Marchen}) (68)
  \p{Mark}                \p{General_Category=Mark} (Short: \p{M})
                            (2268)
  \p{Masaram_Gondi}       \p{Script_Extensions=Masaram_Gondi}
                            (Short: \p{Gonm}; NOT \p{Block=
                            Masaram_Gondi}) (77)
  \p{Math}                \p{Math=Y} (2310)
  \p{Math: N*}            (Single: \P{Math}) (1_111_802 plus all
                            above-Unicode code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-
                            Z\[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
                            \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
                            U+03D3..03D4, U+03D6..03EF,
                            U+03F2..03F3, U+03F7..0605 ...)
  \p{Math: Y*}            (Single: \p{Math}) (2310: [+<=
                            >\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
                            U+03D5, U+03F0..03F1, U+03F4..03F6,
                            U+0606..0608 ...)
X \p{Math_Alphanum}       \p{Mathematical_Alphanumeric_Symbols} (=
                            \p{Block=
                            Mathematical_Alphanumeric_Symbols})
                            (1024)
X \p{Math_Operators}      \p{Mathematical_Operators} (= \p{Block=
                            Mathematical_Operators}) (256)
  \p{Math_Symbol}         \p{General_Category=Math_Symbol} (Short:
                            \p{Sm}) (948)
X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
                            Mathematical_Alphanumeric_Symbols}
                            (Short: \p{InMathAlphanum}) (1024)
X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
                            (Short: \p{InMathOperators}) (256)
X \p{Mayan_Numerals}      \p{Block=Mayan_Numerals} (32)
  \p{Mc}                  \p{Spacing_Mark} (= \p{General_Category=
                            Spacing_Mark}) (429)
  \p{Me}                  \p{Enclosing_Mark} (= \p{General_Category=
                            Enclosing_Mark}) (13)
  \p{Medefaidrin}         \p{Script_Extensions=Medefaidrin} (Short:
                            \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
  \p{Medf}                \p{Medefaidrin} (= \p{Script_Extensions=
                            Medefaidrin}) (NOT \p{Block=
                            Medefaidrin}) (91)
  \p{Meetei_Mayek}        \p{Script_Extensions=Meetei_Mayek} (Short:
                            \p{Mtei}; NOT \p{Block=Meetei_Mayek})
                            (79)
X \p{Meetei_Mayek_Ext}    \p{Meetei_Mayek_Extensions} (= \p{Block=
                            Meetei_Mayek_Extensions}) (32)
X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
                            (Short: \p{InMeeteiMayekExt}) (32)
  \p{Mend}                \p{Mende_Kikakui} (= \p{Script_Extensions=
                            Mende_Kikakui}) (NOT \p{Block=
                            Mende_Kikakui}) (213)
  \p{Mende_Kikakui}       \p{Script_Extensions=Mende_Kikakui}
                            (Short: \p{Mend}; NOT \p{Block=
                            Mende_Kikakui}) (213)
  \p{Merc}                \p{Meroitic_Cursive} (=
                            \p{Script_Extensions=Meroitic_Cursive})
                            (NOT \p{Block=Meroitic_Cursive}) (90)
  \p{Mero}                \p{Meroitic_Hieroglyphs} (=
                            \p{Script_Extensions=
                            Meroitic_Hieroglyphs}) (32)
  \p{Meroitic_Cursive}    \p{Script_Extensions=Meroitic_Cursive}
                            (Short: \p{Merc}; NOT \p{Block=
                            Meroitic_Cursive}) (90)
  \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
                            Meroitic_Hieroglyphs} (Short: \p{Mero})
                            (32)
  \p{Miao}                \p{Script_Extensions=Miao} (NOT \p{Block=
                            Miao}) (149)
X \p{Misc_Arrows}         \p{Miscellaneous_Symbols_And_Arrows} (=
                            \p{Block=
                            Miscellaneous_Symbols_And_Arrows}) (256)
X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
                            (= \p{Block=
                            Miscellaneous_Mathematical_Symbols_A})
                            (48)
X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
                            (= \p{Block=
                            Miscellaneous_Mathematical_Symbols_B})
                            (128)
X \p{Misc_Pictographs}    \p{Miscellaneous_Symbols_And_Pictographs}
                            (= \p{Block=
                            Miscellaneous_Symbols_And_Pictographs})
                            (768)
X \p{Misc_Symbols}        \p{Miscellaneous_Symbols} (= \p{Block=
                            Miscellaneous_Symbols}) (256)
X \p{Misc_Technical}      \p{Miscellaneous_Technical} (= \p{Block=
                            Miscellaneous_Technical}) (256)
X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
                            Miscellaneous_Mathematical_Symbols_A}
                            (Short: \p{InMiscMathSymbolsA}) (48)
X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
                            Miscellaneous_Mathematical_Symbols_B}
                            (Short: \p{InMiscMathSymbolsB}) (128)
X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
                            \p{InMiscSymbols}) (256)
X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
                            Miscellaneous_Symbols_And_Arrows}
                            (Short: \p{InMiscArrows}) (256)
X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
                            Miscellaneous_Symbols_And_Pictographs}
                            (Short: \p{InMiscPictographs}) (768)
X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
                            (Short: \p{InMiscTechnical}) (256)
  \p{Mlym}                \p{Malayalam} (= \p{Script_Extensions=
                            Malayalam}) (NOT \p{Block=Malayalam})
                            (125)
  \p{Mn}                  \p{Nonspacing_Mark} (=
                            \p{General_Category=Nonspacing_Mark})
                            (1826)
  \p{Modi}                \p{Script_Extensions=Modi} (NOT \p{Block=
                            Modi}) (89)
  \p{Modifier_Letter}     \p{General_Category=Modifier_Letter}
                            (Short: \p{Lm}) (259)
X \p{Modifier_Letters}    \p{Spacing_Modifier_Letters} (= \p{Block=
                            Spacing_Modifier_Letters}) (80)
  \p{Modifier_Symbol}     \p{General_Category=Modifier_Symbol}
                            (Short: \p{Sk}) (121)
X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
  \p{Mong}                \p{Mongolian} (= \p{Script_Extensions=
                            Mongolian}) (NOT \p{Block=Mongolian})
                            (171)
  \p{Mongolian}           \p{Script_Extensions=Mongolian} (Short:
                            \p{Mong}; NOT \p{Block=Mongolian}) (171)
X \p{Mongolian_Sup}       \p{Mongolian_Supplement} (= \p{Block=
                            Mongolian_Supplement}) (32)
X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
                            \p{InMongolianSup}) (32)
  \p{Mro}                 \p{Script_Extensions=Mro} (NOT \p{Block=
                            Mro}) (43)
  \p{Mroo}                \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
                            \p{Block=Mro}) (43)
  \p{Mtei}                \p{Meetei_Mayek} (= \p{Script_Extensions=
                            Meetei_Mayek}) (NOT \p{Block=
                            Meetei_Mayek}) (79)
  \p{Mult}                \p{Multani} (= \p{Script_Extensions=
                            Multani}) (NOT \p{Block=Multani}) (48)
  \p{Multani}             \p{Script_Extensions=Multani} (Short:
                            \p{Mult}; NOT \p{Block=Multani}) (48)
X \p{Music}               \p{Musical_Symbols} (= \p{Block=
                            Musical_Symbols}) (256)
X \p{Musical_Symbols}     \p{Block=Musical_Symbols} (Short:
                            \p{InMusic}) (256)
  \p{Myanmar}             \p{Script_Extensions=Myanmar} (Short:
                            \p{Mymr}; NOT \p{Block=Myanmar}) (224)
X \p{Myanmar_Ext_A}       \p{Myanmar_Extended_A} (= \p{Block=
                            Myanmar_Extended_A}) (32)
X \p{Myanmar_Ext_B}       \p{Myanmar_Extended_B} (= \p{Block=
                            Myanmar_Extended_B}) (32)
X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (Short:
                            \p{InMyanmarExtA}) (32)
X \p{Myanmar_Extended_B}  \p{Block=Myanmar_Extended_B} (Short:
                            \p{InMyanmarExtB}) (32)
  \p{Mymr}                \p{Myanmar} (= \p{Script_Extensions=
                            Myanmar}) (NOT \p{Block=Myanmar}) (224)
  \p{N} \pN               \p{Number} (= \p{General_Category=Number})
                            (1754)
  \p{Nabataean}           \p{Script_Extensions=Nabataean} (Short:
                            \p{Nbat}; NOT \p{Block=Nabataean}) (40)
  \p{Nand}                \p{Nandinagari} (= \p{Script_Extensions=
                            Nandinagari}) (NOT \p{Block=
                            Nandinagari}) (86)
  \p{Nandinagari}         \p{Script_Extensions=Nandinagari} (Short:
                            \p{Nand}; NOT \p{Block=Nandinagari}) (86)
  \p{Narb}                \p{Old_North_Arabian} (=
                            \p{Script_Extensions=Old_North_Arabian})
                            (32)
X \p{NB}                  \p{No_Block} (= \p{Block=No_Block})
                            (832_720 plus all above-Unicode code
                            points)
  \p{Nbat}                \p{Nabataean} (= \p{Script_Extensions=
                            Nabataean}) (NOT \p{Block=Nabataean})
                            (40)
  \p{NChar}               \p{Noncharacter_Code_Point} (=
                            \p{Noncharacter_Code_Point=Y}) (66)
  \p{NChar: *}            \p{Noncharacter_Code_Point: *}
  \p{Nd}                  \p{XPosixDigit} (= \p{General_Category=
                            Decimal_Number}) (630)
  \p{New_Tai_Lue}         \p{Script_Extensions=New_Tai_Lue} (Short:
                            \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
  \p{Newa}                \p{Script_Extensions=Newa} (NOT \p{Block=
                            Newa}) (94)
  \p{NFC_QC: *}           \p{NFC_Quick_Check: *}
  \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (110)
  \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110:
                            U+0300..0304, U+0306..030C, U+030F,
                            U+0311, U+0313..0314, U+031B ...)
  \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
                            \P{NFC_Quick_Check} NOR \P{NFC_QC})
                            (1120)
  \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
                            \P{NFC_Quick_Check} NOR \P{NFC_QC})
                            (1120: U+0340..0341, U+0343..0344,
                            U+0374, U+037E, U+0387, U+0958..095F ...)
  \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
                            \p{NFC_Quick_Check} NOR \p{NFC_QC})
                            (1_112_882 plus all above-Unicode code
                            points)
  \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
                            \p{NFC_Quick_Check} NOR \p{NFC_QC})
                            (1_112_882 plus all above-Unicode code
                            points: U+0000..02FF, U+0305,
                            U+030D..030E, U+0310, U+0312,
                            U+0315..031A ...)
  \p{NFD_QC: *}           \p{NFD_Quick_Check: *}
  \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
                            \P{NFD_Quick_Check} NOR \P{NFD_QC})
                            (13_232)
  \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
                            \P{NFD_Quick_Check} NOR \P{NFD_QC})
                            (13_232: [\xc0-\xc5\xc7-\xcf\xd1-
                            \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
                            \xf6\xf9-\xfd\xff], U+0100..010F,
                            U+0112..0125, U+0128..0130,
                            U+0134..0137, U+0139..013E ...)
  \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
                            \p{NFD_Quick_Check} NOR \p{NFD_QC})
                            (1_100_880 plus all above-Unicode code
                            points)
  \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
                            \p{NFD_Quick_Check} NOR \p{NFD_QC})
                            (1_100_880 plus all above-Unicode code
                            points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
                            \xdf\xe6\xf0\xf7-\xf8\xfe],
                            U+0110..0111, U+0126..0127,
                            U+0131..0133, U+0138, U+013F..0142 ...)
  \p{NFKC_QC: *}          \p{NFKC_Quick_Check: *}
  \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
  \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110:
                            U+0300..0304, U+0306..030C, U+030F,
                            U+0311, U+0313..0314, U+031B ...)
  \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
                            \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
                            (4796)
  \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
                            \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
                            (4796: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
                            \xba\xbc-\xbe], U+0132..0133,
                            U+013F..0140, U+0149, U+017F,
                            U+01C4..01CC ...)
  \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
                            \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
                            (1_109_206 plus all above-Unicode code
                            points)
  \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
                            \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
                            (1_109_206 plus all above-Unicode code
                            points: [\x00-\x9f\xa1-\xa7\xa9\xab-
                            \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
                            U+0100..0131, U+0134..013E,
                            U+0141..0148, U+014A..017E, U+0180..01C3
                            ...)
  \p{NFKD_QC: *}          \p{NFKD_Quick_Check: *}
  \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
                            \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
                            (16_896)
  \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
                            \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
                            (16_896: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
                            \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
                            \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
                            \xf6\xf9-\xfd\xff], U+0100..010F,
                            U+0112..0125, U+0128..0130,
                            U+0132..0137, U+0139..0140 ...)
  \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
                            \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
                            (1_097_216 plus all above-Unicode code
                            points)
  \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
                            \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
                            (1_097_216 plus all above-Unicode code
                            points: [\x00-\x9f\xa1-\xa7\xa9\xab-
                            \xae\xb0-\xb1\xb6-
                            \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-
                            \xdf\xe6\xf0\xf7-\xf8\xfe],
                            U+0110..0111, U+0126..0127, U+0131,
                            U+0138, U+0141..0142 ...)
  \p{Nko}                 \p{Script_Extensions=Nko} (NOT \p{Block=
                            NKo}) (62)
  \p{Nkoo}                \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
                            \p{Block=NKo}) (62)
  \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
                            Letter_Number}) (236)
  \p{No}                  \p{Other_Number} (= \p{General_Category=
                            Other_Number}) (888)
X \p{No_Block}            \p{Block=No_Block} (Short: \p{InNB})
                            (832_720 plus all above-Unicode code
                            points)
  \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
                            \p{NChar}) (66)
  \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
                            (1_114_046 plus all above-Unicode code
                            points: U+0000..FDCF, U+FDF0..FFFD,
                            U+10000..1FFFD, U+20000..2FFFD,
                            U+30000..3FFFD, U+40000..4FFFD ...)
  \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
                            (66: U+FDD0..FDEF, U+FFFE..FFFF,
                            U+1FFFE..1FFFF, U+2FFFE..2FFFF,
                            U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
  \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
                            (Short: \p{Mn}) (1826)
  \p{Nshu}                \p{Nushu} (= \p{Script_Extensions=Nushu})
                            (NOT \p{Block=Nushu}) (397)
  \p{Nt: *}               \p{Numeric_Type: *}
  \p{Number}              \p{General_Category=Number} (Short: \p{N})
                            (1754)
X \p{Number_Forms}        \p{Block=Number_Forms} (64)
  \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (630)
  \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (630: [0-9],
                            U+0660..0669, U+06F0..06F9,
                            U+07C0..07C9, U+0966..096F, U+09E6..09EF
                            ...)
  \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (128)
  \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
                            U+1369..1371, U+19DA, U+2070,
                            U+2074..2079, U+2080..2089 ...)
  \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_277 plus all
                            above-Unicode code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
                            Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
                            \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
                            U+066A..06EF, U+06FA..07BF,
                            U+07CA..0965, U+0970..09E5 ...)
  \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (1077)
  \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1077: [\xbc-\xbe],
                            U+09F4..09F9, U+0B72..0B77,
                            U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
                            ...)
 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
 T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (81: [0], U+0660,
                             U+06F0, U+07C0, U+0966, U+09E6 ...)
 T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
                             U+11FD4)
 T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
 T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
 T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
 T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
 T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
 T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
 T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
 T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
                             U+0D76, U+A833, U+11FC9..11FCA)
 T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
                             U+11FCB)
 T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1: U+2151)
 T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
                             U+0D77, U+215B, U+A834, U+11FCC ...)
 T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1: U+2150)
 T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
 T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
                             U+12461, U+1ED3D)
 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
                             U+0D78, U+A835, U+11FCE)
 T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
                             U+11FCF)
 T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
                             U+0B72, U+0D73, U+A830, U+10140 ...)
 T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
                             U+10E7D, U+1245A, U+1245D, U+12465)
 T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1: U+215C)
 T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1: U+2156)
 T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
 T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
                             U+0D74, U+0F2A, U+2CFD, U+A831 ...)
 T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
 T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1: U+2157)
 T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1: U+215D)
 T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
                             U+109FD, U+10E7E, U+1245B, U+1245E ...)
 T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
                             U+0B74, U+0D75, U+A832, U+10178 ...)
 T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1: U+2158)
 T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
                             U+1245C)
 T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1: U+215E)
 T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
 T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (137: [1\xb9], U+0661,
                             U+06F1, U+07C1, U+0967, U+09E7 ...)
 T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1: U+0F2B)
 T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (136: [2\xb2], U+0662,
                             U+06F2, U+07C2, U+0968, U+09E8 ...)
 T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1: U+0F2C)
 T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (137: [3\xb3], U+0663,
                             U+06F3, U+07C3, U+0969, U+09E9 ...)
 T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1: U+0F2D)
 T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (128: [4], U+0664,
                             U+06F4, U+07C4, U+096A, U+09EA ...)
 T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1: U+0F2E)
 T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (127: [5], U+0665,
                             U+06F5, U+07C5, U+096B, U+09EB ...)
 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
 T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (111: [6], U+0666,
                             U+06F6, U+07C6, U+096C, U+09EC ...)
 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
 T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (110: [7], U+0667,
                             U+06F7, U+07C7, U+096D, U+09ED ...)
 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
 T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (106: [8], U+0668,
                             U+06F8, U+07C8, U+096E, U+09EE ...)
 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
 T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (110: [9], U+0669,
                             U+06F9, U+07C9, U+096F, U+09EF ...)
 T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (61: U+0BF0, U+0D70,
                             U+1372, U+2169, U+2179, U+2469 ...)
 T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (8: U+216A, U+217A,
                             U+246A, U+247E, U+2492, U+24EB ...)
 T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (8: U+216B, U+217B,
                             U+246B, U+247F, U+2493, U+24EC ...)
 T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (6: U+246C, U+2480,
                             U+2494, U+24ED, U+16E8D, U+1D2ED)
 T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (6: U+246D, U+2481,
                             U+2495, U+24EE, U+16E8E, U+1D2EE)
 T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (6: U+246E, U+2482,
                             U+2496, U+24EF, U+16E8F, U+1D2EF)
 T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (7: U+09F9, U+246F,
                             U+2483, U+2497, U+24F0, U+16E90 ...)
 T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (7: U+16EE, U+2470,
                             U+2484, U+2498, U+24F1, U+16E91 ...)
 T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (7: U+16EF, U+2471,
                             U+2485, U+2499, U+24F2, U+16E92 ...)
 T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (7: U+16F0, U+2472,
                             U+2486, U+249A, U+24F3, U+16E93 ...)
 T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (35: U+1373, U+2473,
                             U+2487, U+249B, U+24F4, U+3039 ...)
 T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1: U+3251)
 T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1: U+3252)
 T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1: U+3253)
 T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1: U+3254)
 T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1: U+3255)
 T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1: U+3256)
 T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1: U+3257)
 T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1: U+3258)
 T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1: U+3259)
 T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (19: U+1374, U+303A,
                             U+324A, U+325A, U+5345, U+10112 ...)
 T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1: U+325B)
 T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1: U+325C)
 T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1: U+325D)
 T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1: U+325E)
 T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1: U+325F)
 T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1: U+32B1)
 T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1: U+32B2)
 T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1: U+32B3)
 T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1: U+32B4)
 T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (18: U+1375, U+324B,
                             U+32B5, U+534C, U+10113, U+102ED ...)
 T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1: U+32B6)
 T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1: U+32B7)
 T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1: U+32B8)
 T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1: U+32B9)
 T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1: U+32BA)
 T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1: U+32BB)
 T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1: U+32BC)
 T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1: U+32BD)
 T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1: U+32BE)
 T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (29: U+1376, U+216C,
                             U+217C, U+2186, U+324C, U+32BF ...)
 T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (13: U+1377, U+324D,
                             U+10115, U+102EF, U+109CE, U+10E6E ...)
 T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (13: U+1378, U+324E,
                             U+10116, U+102F0, U+109CF, U+10E6F ...)
 T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (12: U+1379, U+324F,
                             U+10117, U+102F1, U+10E70, U+11062 ...)
 T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (12: U+137A, U+10118,
                             U+102F2, U+10341, U+10E71, U+11063 ...)
 T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (34: U+0BF1, U+0D71,
                             U+137B, U+216D, U+217D, U+4F70 ...)
 T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
                             U+109D3, U+10E73, U+1EC84, U+1ED14)
 T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
                             U+102F5, U+109D4, U+10E74, U+1EC85 ...)
 T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
                             U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
 T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (16: U+216E, U+217E,
                             U+1011D, U+10145, U+1014C, U+10153 ...)
 T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
                             U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
 T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
                             U+109D8, U+10E78, U+1EC89, U+1ED19)
 T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (6: U+10120, U+102FA,
                             U+109D9, U+10E79, U+1EC8A, U+1ED1A)
 T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (7: U+10121, U+102FB,
                             U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
                             U+216F, U+217F..2180, U+4EDF, U+5343 ...)
 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
                             U+1EC8D, U+1ED1D, U+1ED3A)
 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
                             U+1EC8E, U+1ED1E)
 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
                             U+1EC8F, U+1ED1F)
 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
                             U+10146, U+1014E, U+10172, U+109DF ...)
 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
                             U+1EC91, U+1ED21)
 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
                             U+1EC92, U+1ED22)
 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
                             U+1EC93, U+1ED23)
 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
                             U+1EC94, U+1ED24)
 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
                             U+137C, U+2182, U+4E07, U+842C, U+1012B,
                             U+10155 ...)
 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
                             U+1012C, U+109E5, U+1EC96, U+1ED26)
 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
                             U+1012D, U+109E6, U+1EC97, U+1ED27)
 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
                             U+1012E, U+109E7, U+1EC98, U+1ED28)
 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
                             U+2187, U+1012F, U+10147, U+10156,
                             U+109E8, U+1EC99 ...)
 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
                             U+10130, U+109E9, U+1EC9A, U+1ED2A)
 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
                             U+10131, U+109EA, U+1EC9B, U+1ED2B)
 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
                             U+10132, U+109EB, U+1EC9C, U+1ED2C)
 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
                             U+10133, U+109EC, U+1EC9D, U+1ED2D)
 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
                             U+2188, U+109ED, U+1EC9E, U+1ECA0,
                             U+1ECB4)
 T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
                             U+109EE, U+1EC9F)
 T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
                             U+12432)
 T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
                             U+109EF)
 T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
                             U+109F0)
 T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
                             U+12433)
 T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
                             U+109F1)
 T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
                             U+109F2)
 T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
                             U+109F3)
 T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
                             U+109F4)
 T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
                             U+109F5)
 T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
                             U+16B5E)
 T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
                             (1: U+1ECA1)
 T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
                             (1: U+1ECA2)
 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
                             (3: U+4EBF, U+5104, U+16B5F)
 T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
                             10000000000}) (1: U+16B60)
 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
                             1000000000000}) (2: U+5146, U+16B61)
   \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_277 plus all
                             above-Unicode code points: [\x00-
                             \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-
                             Z\[\\\]\^_`a-z\{\|\}~\x7f-\xb1\xb4-
                             \xb8\xba-\xbb\xbf-\xff], U+0100..065F,
                             U+066A..06EF, U+06FA..07BF,
                             U+07CA..0965, U+0970..09E5 ...)
   \p{Nushu}               \p{Script_Extensions=Nushu} (Short:
                             \p{Nshu}; NOT \p{Block=Nushu}) (397)
   \p{Nv: *}               \p{Numeric_Value: *}
   \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
                             Nyiakeng_Puachue_Hmong} (Short:
                             \p{Hmnp}; NOT \p{Block=
                             Nyiakeng_Puachue_Hmong}) (71)
 X \p{OCR}                 \p{Optical_Character_Recognition} (=
                             \p{Block=Optical_Character_Recognition})
                             (32)
   \p{Ogam}                \p{Ogham} (= \p{Script_Extensions=Ogham})
                             (NOT \p{Block=Ogham}) (29)
   \p{Ogham}               \p{Script_Extensions=Ogham} (Short:
                             \p{Ogam}; NOT \p{Block=Ogham}) (29)
   \p{Ol_Chiki}            \p{Script_Extensions=Ol_Chiki} (Short:
                             \p{Olck}) (48)
   \p{Olck}                \p{Ol_Chiki} (= \p{Script_Extensions=
                             Ol_Chiki}) (48)
   \p{Old_Hungarian}       \p{Script_Extensions=Old_Hungarian}
                             (Short: \p{Hung}; NOT \p{Block=
                             Old_Hungarian}) (108)
   \p{Old_Italic}          \p{Script_Extensions=Old_Italic} (Short:
                             \p{Ital}; NOT \p{Block=Old_Italic}) (39)
   \p{Old_North_Arabian}   \p{Script_Extensions=Old_North_Arabian}
                             (Short: \p{Narb}) (32)
   \p{Old_Permic}          \p{Script_Extensions=Old_Permic} (Short:
                             \p{Perm}; NOT \p{Block=Old_Permic}) (44)
   \p{Old_Persian}         \p{Script_Extensions=Old_Persian} (Short:
                             \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
   \p{Old_Sogdian}         \p{Script_Extensions=Old_Sogdian} (Short:
                             \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
   \p{Old_South_Arabian}   \p{Script_Extensions=Old_South_Arabian}
                             (Short: \p{Sarb}) (32)
   \p{Old_Turkic}          \p{Script_Extensions=Old_Turkic} (Short:
                             \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
   \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
                             (Short: \p{Ps}) (75)
 X \p{Optical_Character_Recognition} \p{Block=
                             Optical_Character_Recognition} (Short:
                             \p{InOCR}) (32)
   \p{Oriya}               \p{Script_Extensions=Oriya} (Short:
                             \p{Orya}; NOT \p{Block=Oriya}) (96)
   \p{Orkh}                \p{Old_Turkic} (= \p{Script_Extensions=
                             Old_Turkic}) (NOT \p{Block=Old_Turkic})
                             (73)
 X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
   \p{Orya}                \p{Oriya} (= \p{Script_Extensions=Oriya})
                             (NOT \p{Block=Oriya}) (96)
   \p{Osage}               \p{Script_Extensions=Osage} (Short:
                             \p{Osge}; NOT \p{Block=Osage}) (72)
   \p{Osge}                \p{Osage} (= \p{Script_Extensions=Osage})
                             (NOT \p{Block=Osage}) (72)
   \p{Osma}                \p{Osmanya} (= \p{Script_Extensions=
                             Osmanya}) (NOT \p{Block=Osmanya}) (40)
   \p{Osmanya}             \p{Script_Extensions=Osmanya} (Short:
                             \p{Osma}; NOT \p{Block=Osmanya}) (40)
   \p{Other}               \p{General_Category=Other} (Short: \p{C})
                             (976_344 plus all above-Unicode code
                             points)
   \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
                             \p{Lo}) (121_414)
   \p{Other_Number}        \p{General_Category=Other_Number} (Short:
                             \p{No}) (888)
   \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
                             (Short: \p{Po}) (588)
   \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
                             \p{So}) (6161)
 X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
   \p{P} \pP               \p{Punct} (= \p{General_Category=
                             Punctuation}) (NOT
                             \p{General_Punctuation}) (792)
   \p{Pahawh_Hmong}        \p{Script_Extensions=Pahawh_Hmong} (Short:
                             \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
                             (127)
   \p{Palm}                \p{Palmyrene} (= \p{Script_Extensions=
                             Palmyrene}) (32)
   \p{Palmyrene}           \p{Script_Extensions=Palmyrene} (Short:
                             \p{Palm}) (32)
   \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
                             (Short: \p{Zp}) (1)
   \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
                             Y}) (2760)
   \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
   \p{Pat_WS}              \p{Pattern_White_Space} (=
                             \p{Pattern_White_Space=Y}) (11)
   \p{Pat_WS: *}           \p{Pattern_White_Space: *}
   \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
                             (2760)
   \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
                             (1_111_352 plus all above-Unicode code
                             points: [\x00-\x200-9A-Z_a-z\x7f-
                             \xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb7-
                             \xba\xbc-\xbe\xc0-\xd6\xd8-\xf6\xf8-
                             \xff], U+0100..200F, U+2028..202F,
                             U+203F..2040, U+2054, U+205F..218F ...)
   \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
                             [!\"#\$\%&\'\(\)*+,\-.\/:;<=
                             >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
                             \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
                             U+2010..2027, U+2030..203E,
                             U+2041..2053, U+2055..205E, U+2190..245F
                             ...)
   \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
                             \p{PatWS}) (11)
   \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
                             (1_114_101 plus all above-Unicode code
                             points: [^\t\n\cK\f\r\x20\x85],
                             U+0100..200D, U+2010..2027,
                             U+202A..infinity)
   \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
                             [\t\n\cK\f\r\x20\x85], U+200E..200F,
                             U+2028..2029)
   \p{Pau_Cin_Hau}         \p{Script_Extensions=Pau_Cin_Hau} (Short:
                             \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
   \p{Pauc}                \p{Pau_Cin_Hau} (= \p{Script_Extensions=
                             Pau_Cin_Hau}) (NOT \p{Block=
                             Pau_Cin_Hau}) (57)
   \p{Pc}                  \p{Connector_Punctuation} (=
                             \p{General_Category=
                             Connector_Punctuation}) (10)
   \p{PCM}                 \p{Prepended_Concatenation_Mark} (=
                             \p{Prepended_Concatenation_Mark=Y}) (11)
   \p{PCM: *}              \p{Prepended_Concatenation_Mark: *}
   \p{Pd}                  \p{Dash_Punctuation} (=
                             \p{General_Category=Dash_Punctuation})
                             (24)
   \p{Pe}                  \p{Close_Punctuation} (=
                             \p{General_Category=Close_Punctuation})
                             (73)
   \p{PerlSpace}           \p{PosixSpace} (6)
   \p{PerlWord}            \p{PosixWord} (63)
   \p{Perm}                \p{Old_Permic} (= \p{Script_Extensions=
                             Old_Permic}) (NOT \p{Block=Old_Permic})
                             (44)
   \p{Pf}                  \p{Final_Punctuation} (=
                             \p{General_Category=Final_Punctuation})
                             (10)
   \p{Phag}                \p{Phags_Pa} (= \p{Script_Extensions=
                             Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
   \p{Phags_Pa}            \p{Script_Extensions=Phags_Pa} (Short:
                             \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
 X \p{Phaistos}            \p{Phaistos_Disc} (= \p{Block=
                             Phaistos_Disc}) (48)
 X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (Short:
                             \p{InPhaistos}) (48)
   \p{Phli}                \p{Inscriptional_Pahlavi} (=
                             \p{Script_Extensions=
                             Inscriptional_Pahlavi}) (NOT \p{Block=
                             Inscriptional_Pahlavi}) (27)
   \p{Phlp}                \p{Psalter_Pahlavi} (=
                             \p{Script_Extensions=Psalter_Pahlavi})
                             (NOT \p{Block=Psalter_Pahlavi}) (30)
   \p{Phnx}                \p{Phoenician} (= \p{Script_Extensions=
                             Phoenician}) (NOT \p{Block=Phoenician})
                             (29)
   \p{Phoenician}          \p{Script_Extensions=Phoenician} (Short:
                             \p{Phnx}; NOT \p{Block=Phoenician}) (29)
 X \p{Phonetic_Ext}        \p{Phonetic_Extensions} (= \p{Block=
                             Phonetic_Extensions}) (128)
 X \p{Phonetic_Ext_Sup}    \p{Phonetic_Extensions_Supplement} (=
                             \p{Block=
                             Phonetic_Extensions_Supplement}) (64)
 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
                             \p{InPhoneticExt}) (128)
 X \p{Phonetic_Extensions_Supplement} \p{Block=
                             Phonetic_Extensions_Supplement} (Short:
                             \p{InPhoneticExtSup}) (64)
   \p{Pi}                  \p{Initial_Punctuation} (=
                             \p{General_Category=
                             Initial_Punctuation}) (12)
 X \p{Playing_Cards}       \p{Block=Playing_Cards} (96)
   \p{Plrd}                \p{Miao} (= \p{Script_Extensions=Miao})
                             (NOT \p{Block=Miao}) (149)
   \p{Po}                  \p{Other_Punctuation} (=
                             \p{General_Category=Other_Punctuation})
                             (588)
   \p{PosixAlnum}          (62: [0-9A-Za-z])
   \p{PosixAlpha}          (52: [A-Za-z])
   \p{PosixBlank}          (2: [\t\x20])
   \p{PosixCntrl}          ASCII control characters (33: ACK, BEL,
                             BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
                             DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
                             FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
                             SOH, STX, SUB, SYN, US, VT)
   \p{PosixDigit}          (10: [0-9])
   \p{PosixGraph}          (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                             Z\[\\\]\^_`a-z\{\|\}~])
   \p{PosixLower}          (/i= PosixAlpha) (26: [a-z])
   \p{PosixPrint}          (95: [\x20-\x7e])
   \p{PosixPunct}          (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=
                             >?\@\[\\\]\^_`\{\|\}~])
   \p{PosixSpace}          (Short: \p{PerlSpace}) (6:
                             [\t\n\cK\f\r\x20])
   \p{PosixUpper}          (/i= PosixAlpha) (26: [A-Z])
   \p{PosixWord}           \w, restricted to ASCII (Short:
                             \p{PerlWord}) (63: [0-9A-Z_a-z])
   \p{PosixXDigit}         \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
                             (22)
   \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
                             Y} (Short: \p{PCM}) (11)
   \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
                             (1_114_101 plus all above-Unicode code
                             points: U+0000..05FF, U+0606..06DC,
                             U+06DE..070E, U+0710..08E1,
                             U+08E3..110BC, U+110BE..110CC ...)
   \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
                             (11: U+0600..0605, U+06DD, U+070F,
                             U+08E2, U+110BD, U+110CD)
 T \p{Present_In: 1.1}     \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
                             extension) (33_979)
 T \p{Present_In: 2.0}     Code point's usage introduced in version
                             2.0 or earlier (Short: \p{In=2.0}) (Perl
                             extension) (178_500: U+0000..01F5,
                             U+01FA..0217, U+0250..02A8,
                             U+02B0..02DE, U+02E0..02E9, U+0300..0345
                             ...)
   \p{Present_In: V2_0}    \p{Present_In=2.0} (Perl extension)
                             (178_500)
 T \p{Present_In: 2.1}     Code point's usage introduced in version
                             2.1 or earlier (Short: \p{In=2.1}) (Perl
                             extension) (178_502: U+0000..01F5,
                             U+01FA..0217, U+0250..02A8,
                             U+02B0..02DE, U+02E0..02E9, U+0300..0345
                             ...)
   \p{Present_In: V2_1}    \p{Present_In=2.1} (Perl extension)
                             (178_502)
 T \p{Present_In: 3.0}     Code point's usage introduced in version
                             3.0 or earlier (Short: \p{In=3.0}) (Perl
                             extension) (188_809: U+0000..021F,
                             U+0222..0233, U+0250..02AD,
                             U+02B0..02EE, U+0300..034E, U+0360..0362
                             ...)
   \p{Present_In: V3_0}    \p{Present_In=3.0} (Perl extension)
                             (188_809)
 T \p{Present_In: 3.1}     Code point's usage introduced in version
                             3.1 or earlier (Short: \p{In=3.1}) (Perl
                             extension) (233_787: U+0000..021F,
                             U+0222..0233, U+0250..02AD,
                             U+02B0..02EE, U+0300..034E, U+0360..0362
                             ...)
   \p{Present_In: V3_1}    \p{Present_In=3.1} (Perl extension)
                             (233_787)
 T \p{Present_In: 3.2}     Code point's usage introduced in version
                             3.2 or earlier (Short: \p{In=3.2}) (Perl
                             extension) (234_803: U+0000..0220,
                             U+0222..0233, U+0250..02AD,
                             U+02B0..02EE, U+0300..034F, U+0360..036F
                             ...)
   \p{Present_In: V3_2}    \p{Present_In=3.2} (Perl extension)
                             (234_803)
 T \p{Present_In: 4.0}     Code point's usage introduced in version
                             4.0 or earlier (Short: \p{In=4.0}) (Perl
                             extension) (236_029: U+0000..0236,
                             U+0250..0357, U+035D..036F,
                             U+0374..0375, U+037A, U+037E ...)
   \p{Present_In: V4_0}    \p{Present_In=4.0} (Perl extension)
                             (236_029)
 T \p{Present_In: 4.1}     Code point's usage introduced in version
                             4.1 or earlier (Short: \p{In=4.1}) (Perl
                             extension) (237_302: U+0000..0241,
                             U+0250..036F, U+0374..0375, U+037A,
                             U+037E, U+0384..038A ...)
   \p{Present_In: V4_1}    \p{Present_In=4.1} (Perl extension)
                             (237_302)
 T \p{Present_In: 5.0}     Code point's usage introduced in version
                             5.0 or earlier (Short: \p{In=5.0}) (Perl
                             extension) (238_671: U+0000..036F,
                             U+0374..0375, U+037A..037E,
                             U+0384..038A, U+038C, U+038E..03A1 ...)
   \p{Present_In: V5_0}    \p{Present_In=5.0} (Perl extension)
                             (238_671)
 T \p{Present_In: 5.1}     Code point's usage introduced in version
                             5.1 or earlier (Short: \p{In=5.1}) (Perl
                             extension) (240_295: U+0000..0377,
                             U+037A..037E, U+0384..038A, U+038C,
                             U+038E..03A1, U+03A3..0523 ...)
   \p{Present_In: V5_1}    \p{Present_In=5.1} (Perl extension)
                             (240_295)
 T \p{Present_In: 5.2}     Code point's usage introduced in version
                             5.2 or earlier (Short: \p{In=5.2}) (Perl
                             extension) (246_943: U+0000..0377,
                             U+037A..037E, U+0384..038A, U+038C,
                             U+038E..03A1, U+03A3..0525 ...)
   \p{Present_In: V5_2}    \p{Present_In=5.2} (Perl extension)
                             (246_943)
 T \p{Present_In: 6.0}     Code point's usage introduced in version
                             6.0 or earlier (Short: \p{In=6.0}) (Perl
                             extension) (249_031: U+0000..0377,
                             U+037A..037E, U+0384..038A, U+038C,
                             U+038E..03A1, U+03A3..0527 ...)
   \p{Present_In: V6_0}    \p{Present_In=6.0} (Perl extension)
                             (249_031)
 T \p{Present_In: 6.1}     Code point's usage introduced in version
                             6.1 or earlier (Short: \p{In=6.1}) (Perl
                             extension) (249_763: U+0000..0377,
                             U+037A..037E, U+0384..038A, U+038C,
                             U+038E..03A1, U+03A3..0527 ...)
   \p{Present_In: V6_1}    \p{Present_In=6.1} (Perl extension)
                             (249_763)
 T \p{Present_In: 6.2}     Code point's usage introduced in version
                             6.2 or earlier (Short: \p{In=6.2}) (Perl
                             extension) (249_764: U+0000..0377,
                             U+037A..037E, U+0384..038A, U+038C,
                             U+038E..03A1, U+03A3..0527 ...)
4697   \p{Present_In: V6_2}    \p{Present_In=6.2} (Perl extension)
4698                             (249_764)
4699 T \p{Present_In: 6.3}     Code point's usage introduced in version
4700                             6.3 or earlier (Short: \p{In=6.3}) (Perl
4701                             extension) (249_769: U+0000..0377,
4702                             U+037A..037E, U+0384..038A, U+038C,
4703                             U+038E..03A1, U+03A3..0527 ...)
4704   \p{Present_In: V6_3}    \p{Present_In=6.3} (Perl extension)
4705                             (249_769)
4706 T \p{Present_In: 7.0}     Code point's usage introduced in version
4707                             7.0 or earlier (Short: \p{In=7.0}) (Perl
4708                             extension) (252_603: U+0000..0377,
4709                             U+037A..037F, U+0384..038A, U+038C,
4710                             U+038E..03A1, U+03A3..052F ...)
4711   \p{Present_In: V7_0}    \p{Present_In=7.0} (Perl extension)
4712                             (252_603)
4713 T \p{Present_In: 8.0}     Code point's usage introduced in version
4714                             8.0 or earlier (Short: \p{In=8.0}) (Perl
4715                             extension) (260_319: U+0000..0377,
4716                             U+037A..037F, U+0384..038A, U+038C,
4717                             U+038E..03A1, U+03A3..052F ...)
4718   \p{Present_In: V8_0}    \p{Present_In=8.0} (Perl extension)
4719                             (260_319)
4720 T \p{Present_In: 9.0}     Code point's usage introduced in version
4721                             9.0 or earlier (Short: \p{In=9.0}) (Perl
4722                             extension) (267_819: U+0000..0377,
4723                             U+037A..037F, U+0384..038A, U+038C,
4724                             U+038E..03A1, U+03A3..052F ...)
4725   \p{Present_In: V9_0}    \p{Present_In=9.0} (Perl extension)
4726                             (267_819)
4727 T \p{Present_In: 10.0}    Code point's usage introduced in version
4728                             10.0 or earlier (Short: \p{In=10.0})
4729                             (Perl extension) (276_337: U+0000..0377,
4730                             U+037A..037F, U+0384..038A, U+038C,
4731                             U+038E..03A1, U+03A3..052F ...)
4732   \p{Present_In: V10_0}   \p{Present_In=10.0} (Perl extension)
4733                             (276_337)
4734 T \p{Present_In: 11.0}    Code point's usage introduced in version
4735                             11.0 or earlier (Short: \p{In=11.0})
4736                             (Perl extension) (277_021: U+0000..0377,
4737                             U+037A..037F, U+0384..038A, U+038C,
4738                             U+038E..03A1, U+03A3..052F ...)
4739   \p{Present_In: V11_0}   \p{Present_In=11.0} (Perl extension)
4740                             (277_021)
4741 T \p{Present_In: 12.0}    Code point's usage introduced in version
4742                             12.0 or earlier (Short: \p{In=12.0})
4743                             (Perl extension) (277_575: U+0000..0377,
4744                             U+037A..037F, U+0384..038A, U+038C,
4745                             U+038E..03A1, U+03A3..052F ...)
4746   \p{Present_In: V12_0}   \p{Present_In=12.0} (Perl extension)
4747                             (277_575)
4748 T \p{Present_In: 12.1}    Code point's usage introduced in version
4749                             12.1 or earlier (Short: \p{In=12.1})
4750                             (Perl extension) (277_576: U+0000..0377,
4751                             U+037A..037F, U+0384..038A, U+038C,
4752                             U+038E..03A1, U+03A3..052F ...)
4753   \p{Present_In: V12_1}   \p{Present_In=12.1} (Perl extension)
4754                             (277_576)
4755   \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
4756                             Unassigned}) (Perl extension) (836_536
4757                             plus all above-Unicode code points)
   \p{Print}               \p{XPosixPrint} (275_395)
   \p{Private_Use}         \p{General_Category=Private_Use} (Short:
                             \p{Co}; NOT \p{Private_Use_Area})
                             (137_468)
 X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
                             \p{InPUA}) (6400)
   \p{Prti}                \p{Inscriptional_Parthian} (=
                             \p{Script_Extensions=
                             Inscriptional_Parthian}) (NOT \p{Block=
                             Inscriptional_Parthian}) (30)
   \p{Ps}                  \p{Open_Punctuation} (=
                             \p{General_Category=Open_Punctuation})
                             (75)
   \p{Psalter_Pahlavi}     \p{Script_Extensions=Psalter_Pahlavi}
                             (Short: \p{Phlp}; NOT \p{Block=
                             Psalter_Pahlavi}) (30)
 X \p{PUA}                 \p{Private_Use_Area} (= \p{Block=
                             Private_Use_Area}) (6400)
   \p{Punct}               \p{General_Category=Punctuation} (Short:
                             \p{P}; NOT \p{General_Punctuation}) (792)
   \p{Punctuation}         \p{Punct} (= \p{General_Category=
                             Punctuation}) (NOT
                             \p{General_Punctuation}) (792)
   \p{Qaac}                \p{Coptic} (= \p{Script_Extensions=
                             Coptic}) (NOT \p{Block=Coptic}) (165)
   \p{Qaai}                \p{Inherited} (= \p{Script_Extensions=
                             Inherited}) (502)
   \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
                             Y}) (30)
   \p{QMark: *}            \p{Quotation_Mark: *}
   \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
                             (30)
   \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_082
                             plus all above-Unicode code points:
                             [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=
                             >?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
                             \xaa\xac-\xba\xbc-\xff], U+0100..2017,
                             U+2020..2038, U+203B..2E41,
                             U+2E43..300B, U+3010..301C ...)
   \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (30:
                             [\"\'\xab\xbb], U+2018..201F,
                             U+2039..203A, U+2E42, U+300C..300F,
                             U+301D..301F ...)
   \p{Radical}             \p{Radical=Y} (329)
   \p{Radical: N*}         (Single: \P{Radical}) (1_113_783 plus all
                             above-Unicode code points: U+0000..2E7F,
                             U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
   \p{Radical: Y*}         (Single: \p{Radical}) (329: U+2E80..2E99,
                             U+2E9B..2EF3, U+2F00..2FD5)
   \p{Regional_Indicator}  \p{Regional_Indicator=Y} (Short: \p{RI})
                             (26)
   \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
                             plus all above-Unicode code points:
                             U+0000..1F1E5, U+1F200..infinity)
   \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
                             U+1F1E6..1F1FF)
   \p{Rejang}              \p{Script_Extensions=Rejang} (Short:
                             \p{Rjng}; NOT \p{Block=Rejang}) (37)
   \p{RI}                  \p{Regional_Indicator} (=
                             \p{Regional_Indicator=Y}) (26)
   \p{RI: *}               \p{Regional_Indicator: *}
   \p{Rjng}                \p{Rejang} (= \p{Script_Extensions=
                             Rejang}) (NOT \p{Block=Rejang}) (37)
   \p{Rohg}                \p{Hanifi_Rohingya} (=
                             \p{Script_Extensions=Hanifi_Rohingya})
                             (NOT \p{Block=Hanifi_Rohingya}) (55)
 X \p{Rumi}                \p{Rumi_Numeral_Symbols} (= \p{Block=
                             Rumi_Numeral_Symbols}) (32)
 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
                             \p{InRumi}) (32)
   \p{Runic}               \p{Script_Extensions=Runic} (Short:
                             \p{Runr}; NOT \p{Block=Runic}) (86)
   \p{Runr}                \p{Runic} (= \p{Script_Extensions=Runic})
                             (NOT \p{Block=Runic}) (86)
   \p{S} \pS               \p{Symbol} (= \p{General_Category=Symbol})
                             (7292)
   \p{Samaritan}           \p{Script_Extensions=Samaritan} (Short:
                             \p{Samr}; NOT \p{Block=Samaritan}) (61)
   \p{Samr}                \p{Samaritan} (= \p{Script_Extensions=
                             Samaritan}) (NOT \p{Block=Samaritan})
                             (61)
   \p{Sarb}                \p{Old_South_Arabian} (=
                             \p{Script_Extensions=Old_South_Arabian})
                             (32)
   \p{Saur}                \p{Saurashtra} (= \p{Script_Extensions=
                             Saurashtra}) (NOT \p{Block=Saurashtra})
                             (82)
   \p{Saurashtra}          \p{Script_Extensions=Saurashtra} (Short:
                             \p{Saur}; NOT \p{Block=Saurashtra}) (82)
   \p{SB: *}               \p{Sentence_Break: *}
   \p{Sc}                  \p{Currency_Symbol} (=
                             \p{General_Category=Currency_Symbol})
                             (62)
   \p{Sc: *}               \p{Script: *}
   \p{Script: Adlam}       (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
                             U+1E950..1E959, U+1E95E..1E95F)
   \p{Script: Adlm}        \p{Script=Adlam} (88)
   \p{Script: Aghb}        \p{Script=Caucasian_Albanian} (=
                             \p{Script_Extensions=
                             Caucasian_Albanian}) (53)
   \p{Script: Ahom}        \p{Script_Extensions=Ahom} (Short: \p{Sc=
                             Ahom}, \p{Ahom}) (58)
   \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
                             Anatolian_Hieroglyphs} (Short: \p{Sc=
                             Hluw}, \p{Hluw}) (583)
   \p{Script: Arab}        \p{Script=Arabic} (1281)
   \p{Script: Arabic}      (Short: \p{Sc=Arab}) (1281: U+0600..0604,
                             U+0606..060B, U+060D..061A, U+061C,
                             U+061E, U+0620..063F ...)
   \p{Script: Armenian}    (Short: \p{Sc=Armn}) (95: U+0531..0556,
                             U+0559..0588, U+058A, U+058D..058F,
                             U+FB13..FB17)
   \p{Script: Armi}        \p{Script=Imperial_Aramaic} (=
                             \p{Script_Extensions=Imperial_Aramaic})
                             (31)
   \p{Script: Armn}        \p{Script=Armenian} (95)
   \p{Script: Avestan}     \p{Script_Extensions=Avestan} (Short:
                             \p{Sc=Avst}, \p{Avst}) (61)
   \p{Script: Avst}        \p{Script=Avestan} (=
                             \p{Script_Extensions=Avestan}) (61)
   \p{Script: Bali}        \p{Script=Balinese} (=
                             \p{Script_Extensions=Balinese}) (121)
   \p{Script: Balinese}    \p{Script_Extensions=Balinese} (Short:
                             \p{Sc=Bali}, \p{Bali}) (121)
   \p{Script: Bamu}        \p{Script=Bamum} (= \p{Script_Extensions=
                             Bamum}) (657)
   \p{Script: Bamum}       \p{Script_Extensions=Bamum} (Short: \p{Sc=
                             Bamu}, \p{Bamu}) (657)
   \p{Script: Bass}        \p{Script=Bassa_Vah} (=
                             \p{Script_Extensions=Bassa_Vah}) (36)
   \p{Script: Bassa_Vah}   \p{Script_Extensions=Bassa_Vah} (Short:
                             \p{Sc=Bass}, \p{Bass}) (36)
   \p{Script: Batak}       \p{Script_Extensions=Batak} (Short: \p{Sc=
                             Batk}, \p{Batk}) (56)
   \p{Script: Batk}        \p{Script=Batak} (= \p{Script_Extensions=
                             Batak}) (56)
   \p{Script: Beng}        \p{Script=Bengali} (96)
   \p{Script: Bengali}     (Short: \p{Sc=Beng}) (96: U+0980..0983,
                             U+0985..098C, U+098F..0990,
                             U+0993..09A8, U+09AA..09B0, U+09B2 ...)
   \p{Script: Bhaiksuki}   \p{Script_Extensions=Bhaiksuki} (Short:
                             \p{Sc=Bhks}, \p{Bhks}) (97)
   \p{Script: Bhks}        \p{Script=Bhaiksuki} (=
                             \p{Script_Extensions=Bhaiksuki}) (97)
   \p{Script: Bopo}        \p{Script=Bopomofo} (72)
   \p{Script: Bopomofo}    (Short: \p{Sc=Bopo}) (72: U+02EA..02EB,
                             U+3105..312F, U+31A0..31BA)
   \p{Script: Brah}        \p{Script=Brahmi} (= \p{Script_Extensions=
                             Brahmi}) (109)
   \p{Script: Brahmi}      \p{Script_Extensions=Brahmi} (Short:
                             \p{Sc=Brah}, \p{Brah}) (109)
   \p{Script: Brai}        \p{Script=Braille} (=
                             \p{Script_Extensions=Braille}) (256)
   \p{Script: Braille}     \p{Script_Extensions=Braille} (Short:
                             \p{Sc=Brai}, \p{Brai}) (256)
   \p{Script: Bugi}        \p{Script=Buginese} (30)
   \p{Script: Buginese}    (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
                             U+1A1E..1A1F)
   \p{Script: Buhd}        \p{Script=Buhid} (20)
   \p{Script: Buhid}       (Short: \p{Sc=Buhd}) (20: U+1740..1753)
   \p{Script: Cakm}        \p{Script=Chakma} (70)
   \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
                             Canadian_Aboriginal} (Short: \p{Sc=
                             Cans}, \p{Cans}) (710)
   \p{Script: Cans}        \p{Script=Canadian_Aboriginal} (=
                             \p{Script_Extensions=
                             Canadian_Aboriginal}) (710)
   \p{Script: Cari}        \p{Script=Carian} (= \p{Script_Extensions=
                             Carian}) (49)
   \p{Script: Carian}      \p{Script_Extensions=Carian} (Short:
                             \p{Sc=Cari}, \p{Cari}) (49)
   \p{Script: Caucasian_Albanian} \p{Script_Extensions=
                             Caucasian_Albanian} (Short: \p{Sc=Aghb},
                             \p{Aghb}) (53)
   \p{Script: Chakma}      (Short: \p{Sc=Cakm}) (70: U+11100..11134,
                             U+11136..11146)
   \p{Script: Cham}        \p{Script_Extensions=Cham} (Short: \p{Sc=
                             Cham}, \p{Cham}) (83)
   \p{Script: Cher}        \p{Script=Cherokee} (=
                             \p{Script_Extensions=Cherokee}) (172)
   \p{Script: Cherokee}    \p{Script_Extensions=Cherokee} (Short:
                             \p{Sc=Cher}, \p{Cher}) (172)
   \p{Script: Common}      (Short: \p{Sc=Zyyy}) (7805: [\x00-
                             \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                             >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
                             \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
                             U+02E5..02E9, U+02EC..02FF, U+0374,
                             U+037E ...)
   \p{Script: Copt}        \p{Script=Coptic} (137)
   \p{Script: Coptic}      (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
                             U+2C80..2CF3, U+2CF9..2CFF)
   \p{Script: Cprt}        \p{Script=Cypriot} (55)
   \p{Script: Cuneiform}   \p{Script_Extensions=Cuneiform} (Short:
                             \p{Sc=Xsux}, \p{Xsux}) (1234)
   \p{Script: Cypriot}     (Short: \p{Sc=Cprt}) (55: U+10800..10805,
                             U+10808, U+1080A..10835, U+10837..10838,
                             U+1083C, U+1083F)
   \p{Script: Cyrillic}    (Short: \p{Sc=Cyrl}) (443: U+0400..0484,
                             U+0487..052F, U+1C80..1C88, U+1D2B,
                             U+1D78, U+2DE0..2DFF ...)
   \p{Script: Cyrl}        \p{Script=Cyrillic} (443)
   \p{Script: Deseret}     \p{Script_Extensions=Deseret} (Short:
                             \p{Sc=Dsrt}, \p{Dsrt}) (80)
   \p{Script: Deva}        \p{Script=Devanagari} (154)
   \p{Script: Devanagari}  (Short: \p{Sc=Deva}) (154: U+0900..0950,
                             U+0955..0963, U+0966..097F, U+A8E0..A8FF)
   \p{Script: Dogr}        \p{Script=Dogra} (60)
   \p{Script: Dogra}       (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
   \p{Script: Dsrt}        \p{Script=Deseret} (=
                             \p{Script_Extensions=Deseret}) (80)
   \p{Script: Dupl}        \p{Script=Duployan} (143)
   \p{Script: Duployan}    (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
                             U+1BC70..1BC7C, U+1BC80..1BC88,
                             U+1BC90..1BC99, U+1BC9C..1BC9F)
   \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (=
                             \p{Script_Extensions=
                             Egyptian_Hieroglyphs}) (1080)
   \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
                             Egyptian_Hieroglyphs} (Short: \p{Sc=
                             Egyp}, \p{Egyp}) (1080)
   \p{Script: Elba}        \p{Script=Elbasan} (=
                             \p{Script_Extensions=Elbasan}) (40)
   \p{Script: Elbasan}     \p{Script_Extensions=Elbasan} (Short:
                             \p{Sc=Elba}, \p{Elba}) (40)
   \p{Script: Elym}        \p{Script=Elymaic} (=
                             \p{Script_Extensions=Elymaic}) (23)
   \p{Script: Elymaic}     \p{Script_Extensions=Elymaic} (Short:
                             \p{Sc=Elym}, \p{Elym}) (23)
   \p{Script: Ethi}        \p{Script=Ethiopic} (=
                             \p{Script_Extensions=Ethiopic}) (495)
   \p{Script: Ethiopic}    \p{Script_Extensions=Ethiopic} (Short:
                             \p{Sc=Ethi}, \p{Ethi}) (495)
   \p{Script: Geor}        \p{Script=Georgian} (173)
   \p{Script: Georgian}    (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
                             U+10C7, U+10CD, U+10D0..10FA,
                             U+10FC..10FF, U+1C90..1CBA ...)
   \p{Script: Glag}        \p{Script=Glagolitic} (132)
   \p{Script: Glagolitic}  (Short: \p{Sc=Glag}) (132: U+2C00..2C2E,
                             U+2C30..2C5E, U+1E000..1E006,
                             U+1E008..1E018, U+1E01B..1E021,
                             U+1E023..1E024 ...)
   \p{Script: Gong}        \p{Script=Gunjala_Gondi} (63)
   \p{Script: Gonm}        \p{Script=Masaram_Gondi} (75)
   \p{Script: Goth}        \p{Script=Gothic} (= \p{Script_Extensions=
                             Gothic}) (27)
   \p{Script: Gothic}      \p{Script_Extensions=Gothic} (Short:
                             \p{Sc=Goth}, \p{Goth}) (27)
   \p{Script: Gran}        \p{Script=Grantha} (85)
   \p{Script: Grantha}     (Short: \p{Sc=Gran}) (85: U+11300..11303,
                             U+11305..1130C, U+1130F..11310,
                             U+11313..11328, U+1132A..11330,
                             U+11332..11333 ...)
   \p{Script: Greek}       (Short: \p{Sc=Grek}) (518: U+0370..0373,
                             U+0375..0377, U+037A..037D, U+037F,
                             U+0384, U+0386 ...)
   \p{Script: Grek}        \p{Script=Greek} (518)
   \p{Script: Gujarati}    (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
                             U+0A85..0A8D, U+0A8F..0A91,
                             U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
                             ...)
   \p{Script: Gujr}        \p{Script=Gujarati} (91)
   \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
                             U+11D60..11D65, U+11D67..11D68,
                             U+11D6A..11D8E, U+11D90..11D91,
                             U+11D93..11D98, U+11DA0..11DA9)
   \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
                             U+0A05..0A0A, U+0A0F..0A10,
                             U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
                             ...)
   \p{Script: Guru}        \p{Script=Gurmukhi} (80)
   \p{Script: Han}         (Short: \p{Sc=Han}) (89_233: U+2E80..2E99,
                             U+2E9B..2EF3, U+2F00..2FD5, U+3005,
                             U+3007, U+3021..3029 ...)
   \p{Script: Hang}        \p{Script=Hangul} (11_739)
   \p{Script: Hangul}      (Short: \p{Sc=Hang}) (11_739:
                             U+1100..11FF, U+302E..302F,
                             U+3131..318E, U+3200..321E,
                             U+3260..327E, U+A960..A97C ...)
   \p{Script: Hani}        \p{Script=Han} (89_233)
5037   \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
5038                             U+10D00..10D27, U+10D30..10D39)
5039   \p{Script: Hano}        \p{Script=Hanunoo} (21)
5040   \p{Script: Hanunoo}     (Short: \p{Sc=Hano}) (21: U+1720..1734)
5041   \p{Script: Hatr}        \p{Script=Hatran} (= \p{Script_Extensions=
5042                             Hatran}) (26)
5043   \p{Script: Hatran}      \p{Script_Extensions=Hatran} (Short:
5044                             \p{Sc=Hatr}, \p{Hatr}) (26)
5045   \p{Script: Hebr}        \p{Script=Hebrew} (= \p{Script_Extensions=
5046                             Hebrew}) (134)
5047   \p{Script: Hebrew}      \p{Script_Extensions=Hebrew} (Short:
5048                             \p{Sc=Hebr}, \p{Hebr}) (134)
5049   \p{Script: Hira}        \p{Script=Hiragana} (379)
5050   \p{Script: Hiragana}    (Short: \p{Sc=Hira}) (379: U+3041..3096,
5051                             U+309D..309F, U+1B001..1B11E,
5052                             U+1B150..1B152, U+1F200)
5053   \p{Script: Hluw}        \p{Script=Anatolian_Hieroglyphs} (=
5054                             \p{Script_Extensions=
5055                             Anatolian_Hieroglyphs}) (583)
5056   \p{Script: Hmng}        \p{Script=Pahawh_Hmong} (=
5057                             \p{Script_Extensions=Pahawh_Hmong}) (127)
5058   \p{Script: Hmnp}        \p{Script=Nyiakeng_Puachue_Hmong} (=
5059                             \p{Script_Extensions=
5060                             Nyiakeng_Puachue_Hmong}) (71)
5061   \p{Script: Hung}        \p{Script=Old_Hungarian} (=
5062                             \p{Script_Extensions=Old_Hungarian})
5063                             (108)
5064   \p{Script: Imperial_Aramaic} \p{Script_Extensions=
5065                             Imperial_Aramaic} (Short: \p{Sc=Armi},
5066                             \p{Armi}) (31)
5067   \p{Script: Inherited}   (Short: \p{Sc=Zinh}) (571: U+0300..036F,
5068                             U+0485..0486, U+064B..0655, U+0670,
5069                             U+0951..0954, U+1AB0..1ABE ...)
5070   \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
5071                             Inscriptional_Pahlavi} (Short: \p{Sc=
5072                             Phli}, \p{Phli}) (27)
5073   \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
5074                             Inscriptional_Parthian} (Short: \p{Sc=
5075                             Prti}, \p{Prti}) (30)
5076   \p{Script: Ital}        \p{Script=Old_Italic} (=
5077                             \p{Script_Extensions=Old_Italic}) (39)
5078   \p{Script: Java}        \p{Script=Javanese} (90)
5079   \p{Script: Javanese}    (Short: \p{Sc=Java}) (90: U+A980..A9CD,
5080                             U+A9D0..A9D9, U+A9DE..A9DF)
5081   \p{Script: Kaithi}      (Short: \p{Sc=Kthi}) (67: U+11080..110C1,
5082                             U+110CD)
5083   \p{Script: Kali}        \p{Script=Kayah_Li} (47)
5084   \p{Script: Kana}        \p{Script=Katakana} (304)
5085   \p{Script: Kannada}     (Short: \p{Sc=Knda}) (89: U+0C80..0C8C,
5086                             U+0C8E..0C90, U+0C92..0CA8,
5087                             U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
5088                             ...)
5089   \p{Script: Katakana}    (Short: \p{Sc=Kana}) (304: U+30A1..30FA,
5090                             U+30FD..30FF, U+31F0..31FF,
5091                             U+32D0..32FE, U+3300..3357, U+FF66..FF6F
5092                             ...)
5093   \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}) (47: U+A900..A92D,
5094                             U+A92F)
5095   \p{Script: Khar}        \p{Script=Kharoshthi} (=
5096                             \p{Script_Extensions=Kharoshthi}) (68)
5097   \p{Script: Kharoshthi}  \p{Script_Extensions=Kharoshthi} (Short:
5098                             \p{Sc=Khar}, \p{Khar}) (68)
5099   \p{Script: Khmer}       \p{Script_Extensions=Khmer} (Short: \p{Sc=
5100                             Khmr}, \p{Khmr}) (146)
5101   \p{Script: Khmr}        \p{Script=Khmer} (= \p{Script_Extensions=
5102                             Khmer}) (146)
5103   \p{Script: Khoj}        \p{Script=Khojki} (62)
5104   \p{Script: Khojki}      (Short: \p{Sc=Khoj}) (62: U+11200..11211,
5105                             U+11213..1123E)
5106   \p{Script: Khudawadi}   (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
5107                             U+112F0..112F9)
5108   \p{Script: Knda}        \p{Script=Kannada} (89)
5109   \p{Script: Kthi}        \p{Script=Kaithi} (67)
5110   \p{Script: Lana}        \p{Script=Tai_Tham} (=
5111                             \p{Script_Extensions=Tai_Tham}) (127)
5112   \p{Script: Lao}         \p{Script_Extensions=Lao} (Short: \p{Sc=
5113                             Lao}, \p{Lao}) (82)
5114   \p{Script: Laoo}        \p{Script=Lao} (= \p{Script_Extensions=
5115                             Lao}) (82)
5116   \p{Script: Latin}       (Short: \p{Sc=Latn}) (1366: [A-Za-
5117                             z\xaa\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
5118                             U+0100..02B8, U+02E0..02E4,
5119                             U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
5120                             ...)
5121   \p{Script: Latn}        \p{Script=Latin} (1366)
5122   \p{Script: Lepc}        \p{Script=Lepcha} (= \p{Script_Extensions=
5123                             Lepcha}) (74)
5124   \p{Script: Lepcha}      \p{Script_Extensions=Lepcha} (Short:
5125                             \p{Sc=Lepc}, \p{Lepc}) (74)
5126   \p{Script: Limb}        \p{Script=Limbu} (68)
5127   \p{Script: Limbu}       (Short: \p{Sc=Limb}) (68: U+1900..191E,
5128                             U+1920..192B, U+1930..193B, U+1940,
5129                             U+1944..194F)
5130   \p{Script: Lina}        \p{Script=Linear_A} (341)
5131   \p{Script: Linb}        \p{Script=Linear_B} (211)
5132   \p{Script: Linear_A}    (Short: \p{Sc=Lina}) (341: U+10600..10736,
5133                             U+10740..10755, U+10760..10767)
5134   \p{Script: Linear_B}    (Short: \p{Sc=Linb}) (211: U+10000..1000B,
5135                             U+1000D..10026, U+10028..1003A,
5136                             U+1003C..1003D, U+1003F..1004D,
5137                             U+10050..1005D ...)
5138   \p{Script: Lisu}        \p{Script_Extensions=Lisu} (Short: \p{Sc=
5139                             Lisu}, \p{Lisu}) (48)
5140   \p{Script: Lyci}        \p{Script=Lycian} (= \p{Script_Extensions=
5141                             Lycian}) (29)
  \p{Script: Lycian}      \p{Script_Extensions=Lycian} (Short:
                            \p{Sc=Lyci}, \p{Lyci}) (29)
  \p{Script: Lydi}        \p{Script=Lydian} (= \p{Script_Extensions=
                            Lydian}) (27)
  \p{Script: Lydian}      \p{Script_Extensions=Lydian} (Short:
                            \p{Sc=Lydi}, \p{Lydi}) (27)
  \p{Script: Mahajani}    (Short: \p{Sc=Mahj}) (39: U+11150..11176)
  \p{Script: Mahj}        \p{Script=Mahajani} (39)
  \p{Script: Maka}        \p{Script=Makasar} (=
                            \p{Script_Extensions=Makasar}) (25)
  \p{Script: Makasar}     \p{Script_Extensions=Makasar} (Short:
                            \p{Sc=Maka}, \p{Maka}) (25)
  \p{Script: Malayalam}   (Short: \p{Sc=Mlym}) (117: U+0D00..0D03,
                            U+0D05..0D0C, U+0D0E..0D10,
                            U+0D12..0D44, U+0D46..0D48, U+0D4A..0D4F
                            ...)
  \p{Script: Mand}        \p{Script=Mandaic} (29)
  \p{Script: Mandaic}     (Short: \p{Sc=Mand}) (29: U+0840..085B,
                            U+085E)
  \p{Script: Mani}        \p{Script=Manichaean} (51)
  \p{Script: Manichaean}  (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
                            U+10AEB..10AF6)
  \p{Script: Marc}        \p{Script=Marchen} (=
                            \p{Script_Extensions=Marchen}) (68)
  \p{Script: Marchen}     \p{Script_Extensions=Marchen} (Short:
                            \p{Sc=Marc}, \p{Marc}) (68)
  \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
                            U+11D00..11D06, U+11D08..11D09,
                            U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
                            U+11D3F..11D47 ...)
  \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
                            \p{Sc=Medf}, \p{Medf}) (91)
  \p{Script: Medf}        \p{Script=Medefaidrin} (=
                            \p{Script_Extensions=Medefaidrin}) (91)
  \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
                            (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
  \p{Script: Mend}        \p{Script=Mende_Kikakui} (=
                            \p{Script_Extensions=Mende_Kikakui})
                            (213)
  \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
                            (Short: \p{Sc=Mend}, \p{Mend}) (213)
  \p{Script: Merc}        \p{Script=Meroitic_Cursive} (=
                            \p{Script_Extensions=Meroitic_Cursive})
                            (90)
  \p{Script: Mero}        \p{Script=Meroitic_Hieroglyphs} (=
                            \p{Script_Extensions=
                            Meroitic_Hieroglyphs}) (32)
  \p{Script: Meroitic_Cursive} \p{Script_Extensions=
                            Meroitic_Cursive} (Short: \p{Sc=Merc},
                            \p{Merc}) (90)
  \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
                            Meroitic_Hieroglyphs} (Short: \p{Sc=
                            Mero}, \p{Mero}) (32)
  \p{Script: Miao}        \p{Script_Extensions=Miao} (Short: \p{Sc=
                            Miao}, \p{Miao}) (149)
  \p{Script: Mlym}        \p{Script=Malayalam} (117)
  \p{Script: Modi}        (Short: \p{Sc=Modi}) (79: U+11600..11644,
                            U+11650..11659)
  \p{Script: Mong}        \p{Script=Mongolian} (167)
  \p{Script: Mongolian}   (Short: \p{Sc=Mong}) (167: U+1800..1801,
                            U+1804, U+1806..180E, U+1810..1819,
                            U+1820..1878, U+1880..18AA ...)
  \p{Script: Mro}         \p{Script_Extensions=Mro} (Short: \p{Sc=
                            Mro}, \p{Mro}) (43)
  \p{Script: Mroo}        \p{Script=Mro} (= \p{Script_Extensions=
                            Mro}) (43)
  \p{Script: Mtei}        \p{Script=Meetei_Mayek} (=
                            \p{Script_Extensions=Meetei_Mayek}) (79)
  \p{Script: Mult}        \p{Script=Multani} (38)
  \p{Script: Multani}     (Short: \p{Sc=Mult}) (38: U+11280..11286,
                            U+11288, U+1128A..1128D, U+1128F..1129D,
                            U+1129F..112A9)
  \p{Script: Myanmar}     (Short: \p{Sc=Mymr}) (223: U+1000..109F,
                            U+A9E0..A9FE, U+AA60..AA7F)
  \p{Script: Mymr}        \p{Script=Myanmar} (223)
  \p{Script: Nabataean}   \p{Script_Extensions=Nabataean} (Short:
                            \p{Sc=Nbat}, \p{Nbat}) (40)
  \p{Script: Nand}        \p{Script=Nandinagari} (65)
  \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
                            U+119AA..119D7, U+119DA..119E4)
  \p{Script: Narb}        \p{Script=Old_North_Arabian} (=
                            \p{Script_Extensions=Old_North_Arabian})
                            (32)
  \p{Script: Nbat}        \p{Script=Nabataean} (=
                            \p{Script_Extensions=Nabataean}) (40)
  \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
                            \p{Sc=Talu}, \p{Talu}) (83)
  \p{Script: Newa}        \p{Script_Extensions=Newa} (Short: \p{Sc=
                            Newa}, \p{Newa}) (94)
  \p{Script: Nko}         \p{Script_Extensions=Nko} (Short: \p{Sc=
                            Nko}, \p{Nko}) (62)
  \p{Script: Nkoo}        \p{Script=Nko} (= \p{Script_Extensions=
                            Nko}) (62)
  \p{Script: Nshu}        \p{Script=Nushu} (= \p{Script_Extensions=
                            Nushu}) (397)
  \p{Script: Nushu}       \p{Script_Extensions=Nushu} (Short: \p{Sc=
                            Nshu}, \p{Nshu}) (397)
  \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
                            Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
                            Hmnp}, \p{Hmnp}) (71)
  \p{Script: Ogam}        \p{Script=Ogham} (= \p{Script_Extensions=
                            Ogham}) (29)
  \p{Script: Ogham}       \p{Script_Extensions=Ogham} (Short: \p{Sc=
                            Ogam}, \p{Ogam}) (29)
  \p{Script: Ol_Chiki}    \p{Script_Extensions=Ol_Chiki} (Short:
                            \p{Sc=Olck}, \p{Olck}) (48)
  \p{Script: Olck}        \p{Script=Ol_Chiki} (=
                            \p{Script_Extensions=Ol_Chiki}) (48)
  \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
                            (Short: \p{Sc=Hung}, \p{Hung}) (108)
  \p{Script: Old_Italic}  \p{Script_Extensions=Old_Italic} (Short:
                            \p{Sc=Ital}, \p{Ital}) (39)
  \p{Script: Old_North_Arabian} \p{Script_Extensions=
                            Old_North_Arabian} (Short: \p{Sc=Narb},
                            \p{Narb}) (32)
  \p{Script: Old_Permic}  (Short: \p{Sc=Perm}) (43: U+10350..1037A)
  \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
                            \p{Sc=Xpeo}, \p{Xpeo}) (50)
  \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
                            \p{Sc=Sogo}, \p{Sogo}) (40)
  \p{Script: Old_South_Arabian} \p{Script_Extensions=
                            Old_South_Arabian} (Short: \p{Sc=Sarb},
                            \p{Sarb}) (32)
  \p{Script: Old_Turkic}  \p{Script_Extensions=Old_Turkic} (Short:
                            \p{Sc=Orkh}, \p{Orkh}) (73)
  \p{Script: Oriya}       (Short: \p{Sc=Orya}) (90: U+0B01..0B03,
                            U+0B05..0B0C, U+0B0F..0B10,
                            U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
                            ...)
  \p{Script: Orkh}        \p{Script=Old_Turkic} (=
                            \p{Script_Extensions=Old_Turkic}) (73)
  \p{Script: Orya}        \p{Script=Oriya} (90)
  \p{Script: Osage}       \p{Script_Extensions=Osage} (Short: \p{Sc=
                            Osge}, \p{Osge}) (72)
  \p{Script: Osge}        \p{Script=Osage} (= \p{Script_Extensions=
                            Osage}) (72)
  \p{Script: Osma}        \p{Script=Osmanya} (=
                            \p{Script_Extensions=Osmanya}) (40)
  \p{Script: Osmanya}     \p{Script_Extensions=Osmanya} (Short:
                            \p{Sc=Osma}, \p{Osma}) (40)
  \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
                            (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
  \p{Script: Palm}        \p{Script=Palmyrene} (=
                            \p{Script_Extensions=Palmyrene}) (32)
  \p{Script: Palmyrene}   \p{Script_Extensions=Palmyrene} (Short:
                            \p{Sc=Palm}, \p{Palm}) (32)
  \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
                            \p{Sc=Pauc}, \p{Pauc}) (57)
  \p{Script: Pauc}        \p{Script=Pau_Cin_Hau} (=
                            \p{Script_Extensions=Pau_Cin_Hau}) (57)
  \p{Script: Perm}        \p{Script=Old_Permic} (43)
  \p{Script: Phag}        \p{Script=Phags_Pa} (56)
  \p{Script: Phags_Pa}    (Short: \p{Sc=Phag}) (56: U+A840..A877)
  \p{Script: Phli}        \p{Script=Inscriptional_Pahlavi} (=
                            \p{Script_Extensions=
                            Inscriptional_Pahlavi}) (27)
  \p{Script: Phlp}        \p{Script=Psalter_Pahlavi} (29)
  \p{Script: Phnx}        \p{Script=Phoenician} (=
                            \p{Script_Extensions=Phoenician}) (29)
  \p{Script: Phoenician}  \p{Script_Extensions=Phoenician} (Short:
                            \p{Sc=Phnx}, \p{Phnx}) (29)
  \p{Script: Plrd}        \p{Script=Miao} (= \p{Script_Extensions=
                            Miao}) (149)
  \p{Script: Prti}        \p{Script=Inscriptional_Parthian} (=
                            \p{Script_Extensions=
                            Inscriptional_Parthian}) (30)
  \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
                            U+10B80..10B91, U+10B99..10B9C,
                            U+10BA9..10BAF)
  \p{Script: Qaac}        \p{Script=Coptic} (137)
  \p{Script: Qaai}        \p{Script=Inherited} (571)
  \p{Script: Rejang}      \p{Script_Extensions=Rejang} (Short:
                            \p{Sc=Rjng}, \p{Rjng}) (37)
  \p{Script: Rjng}        \p{Script=Rejang} (= \p{Script_Extensions=
                            Rejang}) (37)
  \p{Script: Rohg}        \p{Script=Hanifi_Rohingya} (50)
  \p{Script: Runic}       \p{Script_Extensions=Runic} (Short: \p{Sc=
                            Runr}, \p{Runr}) (86)
  \p{Script: Runr}        \p{Script=Runic} (= \p{Script_Extensions=
                            Runic}) (86)
  \p{Script: Samaritan}   \p{Script_Extensions=Samaritan} (Short:
                            \p{Sc=Samr}, \p{Samr}) (61)
  \p{Script: Samr}        \p{Script=Samaritan} (=
                            \p{Script_Extensions=Samaritan}) (61)
  \p{Script: Sarb}        \p{Script=Old_South_Arabian} (=
                            \p{Script_Extensions=Old_South_Arabian})
                            (32)
  \p{Script: Saur}        \p{Script=Saurashtra} (=
                            \p{Script_Extensions=Saurashtra}) (82)
  \p{Script: Saurashtra}  \p{Script_Extensions=Saurashtra} (Short:
                            \p{Sc=Saur}, \p{Saur}) (82)
  \p{Script: Sgnw}        \p{Script=SignWriting} (=
                            \p{Script_Extensions=SignWriting}) (672)
  \p{Script: Sharada}     (Short: \p{Sc=Shrd}) (94: U+11180..111CD,
                            U+111D0..111DF)
  \p{Script: Shavian}     \p{Script_Extensions=Shavian} (Short:
                            \p{Sc=Shaw}, \p{Shaw}) (48)
  \p{Script: Shaw}        \p{Script=Shavian} (=
                            \p{Script_Extensions=Shavian}) (48)
  \p{Script: Shrd}        \p{Script=Sharada} (94)
  \p{Script: Sidd}        \p{Script=Siddham} (=
                            \p{Script_Extensions=Siddham}) (92)
  \p{Script: Siddham}     \p{Script_Extensions=Siddham} (Short:
                            \p{Sc=Sidd}, \p{Sidd}) (92)
  \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
                            \p{Sc=Sgnw}, \p{Sgnw}) (672)
  \p{Script: Sind}        \p{Script=Khudawadi} (69)
  \p{Script: Sinh}        \p{Script=Sinhala} (110)
  \p{Script: Sinhala}     (Short: \p{Sc=Sinh}) (110: U+0D82..0D83,
                            U+0D85..0D96, U+0D9A..0DB1,
                            U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
  \p{Script: Sogd}        \p{Script=Sogdian} (42)
  \p{Script: Sogdian}     (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
  \p{Script: Sogo}        \p{Script=Old_Sogdian} (=
                            \p{Script_Extensions=Old_Sogdian}) (40)
  \p{Script: Sora}        \p{Script=Sora_Sompeng} (=
                            \p{Script_Extensions=Sora_Sompeng}) (35)
  \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
                            (Short: \p{Sc=Sora}, \p{Sora}) (35)
  \p{Script: Soyo}        \p{Script=Soyombo} (=
                            \p{Script_Extensions=Soyombo}) (83)
  \p{Script: Soyombo}     \p{Script_Extensions=Soyombo} (Short:
                            \p{Sc=Soyo}, \p{Soyo}) (83)
  \p{Script: Sund}        \p{Script=Sundanese} (=
                            \p{Script_Extensions=Sundanese}) (72)
  \p{Script: Sundanese}   \p{Script_Extensions=Sundanese} (Short:
                            \p{Sc=Sund}, \p{Sund}) (72)
  \p{Script: Sylo}        \p{Script=Syloti_Nagri} (44)
  \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44: U+A800..A82B)
  \p{Script: Syrc}        \p{Script=Syriac} (88)
  \p{Script: Syriac}      (Short: \p{Sc=Syrc}) (88: U+0700..070D,
                            U+070F..074A, U+074D..074F, U+0860..086A)
  \p{Script: Tagalog}     (Short: \p{Sc=Tglg}) (20: U+1700..170C,
                            U+170E..1714)
  \p{Script: Tagb}        \p{Script=Tagbanwa} (18)
  \p{Script: Tagbanwa}    (Short: \p{Sc=Tagb}) (18: U+1760..176C,
                            U+176E..1770, U+1772..1773)
  \p{Script: Tai_Le}      (Short: \p{Sc=Tale}) (35: U+1950..196D,
                            U+1970..1974)
  \p{Script: Tai_Tham}    \p{Script_Extensions=Tai_Tham} (Short:
                            \p{Sc=Lana}, \p{Lana}) (127)
  \p{Script: Tai_Viet}    \p{Script_Extensions=Tai_Viet} (Short:
                            \p{Sc=Tavt}, \p{Tavt}) (72)
  \p{Script: Takr}        \p{Script=Takri} (67)
  \p{Script: Takri}       (Short: \p{Sc=Takr}) (67: U+11680..116B8,
                            U+116C0..116C9)
  \p{Script: Tale}        \p{Script=Tai_Le} (35)
  \p{Script: Talu}        \p{Script=New_Tai_Lue} (=
                            \p{Script_Extensions=New_Tai_Lue}) (83)
  \p{Script: Tamil}       (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
                            U+0B85..0B8A, U+0B8E..0B90,
                            U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
  \p{Script: Taml}        \p{Script=Tamil} (123)
  \p{Script: Tang}        \p{Script=Tangut} (= \p{Script_Extensions=
                            Tangut}) (6892)
  \p{Script: Tangut}      \p{Script_Extensions=Tangut} (Short:
                            \p{Sc=Tang}, \p{Tang}) (6892)
  \p{Script: Tavt}        \p{Script=Tai_Viet} (=
                            \p{Script_Extensions=Tai_Viet}) (72)
  \p{Script: Telu}        \p{Script=Telugu} (98)
  \p{Script: Telugu}      (Short: \p{Sc=Telu}) (98: U+0C00..0C0C,
                            U+0C0E..0C10, U+0C12..0C28,
                            U+0C2A..0C39, U+0C3D..0C44, U+0C46..0C48
                            ...)
  \p{Script: Tfng}        \p{Script=Tifinagh} (=
                            \p{Script_Extensions=Tifinagh}) (59)
  \p{Script: Tglg}        \p{Script=Tagalog} (20)
  \p{Script: Thaa}        \p{Script=Thaana} (50)
  \p{Script: Thaana}      (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
  \p{Script: Thai}        \p{Script_Extensions=Thai} (Short: \p{Sc=
                            Thai}, \p{Thai}) (86)
  \p{Script: Tibetan}     \p{Script_Extensions=Tibetan} (Short:
                            \p{Sc=Tibt}, \p{Tibt}) (207)
  \p{Script: Tibt}        \p{Script=Tibetan} (=
                            \p{Script_Extensions=Tibetan}) (207)
  \p{Script: Tifinagh}    \p{Script_Extensions=Tifinagh} (Short:
                            \p{Sc=Tfng}, \p{Tfng}) (59)
  \p{Script: Tirh}        \p{Script=Tirhuta} (82)
  \p{Script: Tirhuta}     (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
                            U+114D0..114D9)
  \p{Script: Ugar}        \p{Script=Ugaritic} (=
                            \p{Script_Extensions=Ugaritic}) (31)
  \p{Script: Ugaritic}    \p{Script_Extensions=Ugaritic} (Short:
                            \p{Sc=Ugar}, \p{Ugar}) (31)
  \p{Script: Unknown}     \p{Script_Extensions=Unknown} (Short:
                            \p{Sc=Zzzz}, \p{Zzzz}) (976_118 plus all
                            above-Unicode code points)
  \p{Script: Vai}         \p{Script_Extensions=Vai} (Short: \p{Sc=
                            Vai}, \p{Vai}) (300)
  \p{Script: Vaii}        \p{Script=Vai} (= \p{Script_Extensions=
                            Vai}) (300)
  \p{Script: Wancho}      \p{Script_Extensions=Wancho} (Short:
                            \p{Sc=Wcho}, \p{Wcho}) (59)
  \p{Script: Wara}        \p{Script=Warang_Citi} (=
                            \p{Script_Extensions=Warang_Citi}) (84)
  \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
                            \p{Sc=Wara}, \p{Wara}) (84)
  \p{Script: Wcho}        \p{Script=Wancho} (= \p{Script_Extensions=
                            Wancho}) (59)
  \p{Script: Xpeo}        \p{Script=Old_Persian} (=
                            \p{Script_Extensions=Old_Persian}) (50)
  \p{Script: Xsux}        \p{Script=Cuneiform} (=
                            \p{Script_Extensions=Cuneiform}) (1234)
  \p{Script: Yi}          (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
                            U+A490..A4C6)
  \p{Script: Yiii}        \p{Script=Yi} (1220)
  \p{Script: Zanabazar_Square} \p{Script_Extensions=
                            Zanabazar_Square} (Short: \p{Sc=Zanb},
                            \p{Zanb}) (72)
  \p{Script: Zanb}        \p{Script=Zanabazar_Square} (=
                            \p{Script_Extensions=Zanabazar_Square})
                            (72)
  \p{Script: Zinh}        \p{Script=Inherited} (571)
  \p{Script: Zyyy}        \p{Script=Common} (7805)
  \p{Script: Zzzz}        \p{Script=Unknown} (=
                            \p{Script_Extensions=Unknown}) (976_118
                            plus all above-Unicode code points)
  \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (89:
                            U+0640, U+1E900..1E94B, U+1E950..1E959,
                            U+1E95E..1E95F)
  \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (89)
  \p{Script_Extensions: Aghb} \p{Script_Extensions=
                            Caucasian_Albanian} (53)
  \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (58:
                            U+11700..1171A, U+1171D..1172B,
                            U+11730..1173F)
  \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
                            \p{Hluw}) (583: U+14400..14646)
  \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1325)
  \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
                            (1325: U+0600..0604, U+0606..061C,
                            U+061E..06DC, U+06DE..06FF,
                            U+0750..077F, U+08A0..08B4 ...)
  \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
                            (96: U+0531..0556, U+0559..058A,
                            U+058D..058F, U+FB13..FB17)
  \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
                            (31)
  \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
  \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
                            (61: U+10B00..10B35, U+10B39..10B3F)
  \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
  \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
  \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
                            (121: U+1B00..1B4B, U+1B50..1B7C)
  \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
  \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
                            U+A6A0..A6F7, U+16800..16A38)
  \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
  \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
                            (36: U+16AD0..16AED, U+16AF0..16AF5)
  \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
                            U+1BC0..1BF3, U+1BFC..1BFF)
  \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
  \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
  \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
                            (113: U+0951..0952, U+0964..0965,
                            U+0980..0983, U+0985..098C,
                            U+098F..0990, U+0993..09A8 ...)
  \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
                            (97: U+11C00..11C08, U+11C0A..11C36,
                            U+11C38..11C45, U+11C50..11C6C)
  \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
  \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (112)
  \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
                            (112: U+02EA..02EB, U+3001..3003,
                            U+3008..3011, U+3013..301F,
                            U+302A..302D, U+3030 ...)
  \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
  \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
                            (109: U+11000..1104D, U+11052..1106F,
                            U+1107F)
  \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
  \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
                            (256: U+2800..28FF)
  \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
  \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
                            (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
  \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
  \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
                            U+1735..1736, U+1740..1753)
  \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (90)
  \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
                            \p{Cans}) (710: U+1400..167F,
                            U+18B0..18F5)
  \p{Script_Extensions: Cans} \p{Script_Extensions=
                            Canadian_Aboriginal} (710)
  \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
  \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
                            U+102A0..102D0)
  \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
                            \p{Aghb}) (53: U+10530..10563, U+1056F)
  \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (90:
                            U+09E6..09EF, U+1040..1049,
                            U+11100..11134, U+11136..11146)
  \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
                            U+AA00..AA36, U+AA40..AA4D,
                            U+AA50..AA59, U+AA5C..AA5F)
  \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
  \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
                            (172: U+13A0..13F5, U+13F8..13FD,
                            U+AB70..ABBF)
  \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
                            (7386: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
                            \xb9\xbb-\xbf\xd7\xf7], U+02B9..02DF,
                            U+02E5..02E9, U+02EC..02FF, U+0374,
                            U+037E ...)
  \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
  \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
                            (165: U+03E2..03EF, U+2C80..2CF3,
                            U+2CF9..2CFF, U+102E0..102FB)
  \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
  \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
                            (1234: U+12000..12399, U+12400..1246E,
                            U+12470..12474, U+12480..12543)
  \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
                            (112: U+10100..10102, U+10107..10133,
                            U+10137..1013F, U+10800..10805, U+10808,
                            U+1080A..10835 ...)
  \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
                            (446: U+0400..052F, U+1C80..1C88,
                            U+1D2B, U+1D78, U+2DE0..2DFF, U+2E43 ...)
  \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446)
  \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
                            (80: U+10400..1044F)
  \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
  \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
                            (210: U+0900..0952, U+0955..097F,
                            U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
                            U+A830..A839 ...)
  \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
  \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
                            U+0964..096F, U+A830..A839,
                            U+11800..1183B)
  \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
  \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
  \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
                            (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
                            U+1BC80..1BC88, U+1BC90..1BC99,
                            U+1BC9C..1BCA3)
  \p{Script_Extensions: Egyp} \p{Script_Extensions=
                            Egyptian_Hieroglyphs} (1080)
  \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
                            \p{Egyp}) (1080: U+13000..1342E,
                            U+13430..13438)
  \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
  \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
                            (40: U+10500..10527)
  \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
  \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
                            (23: U+10FE0..10FF6)
  \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
  \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
                            (495: U+1200..1248, U+124A..124D,
                            U+1250..1256, U+1258, U+125A..125D,
                            U+1260..1288 ...)
  \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (175)
  \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
                            (175: U+0589, U+10A0..10C5, U+10C7,
                            U+10CD, U+10D0..10FF, U+1C90..1CBA ...)
  \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
  \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
                            (136: U+0484, U+0487, U+2C00..2C2E,
                            U+2C30..2C5E, U+2E43, U+A66F ...)
  \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
                            (65)
  \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
                            (77)
  \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
  \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
                            U+10330..1034A)
  \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
  \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
                            (116: U+0951..0952, U+0964..0965,
                            U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
                            U+1CF2..1CF4 ...)
  \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
                            U+0342, U+0345, U+0370..0373,
                            U+0375..0377, U+037A..037D, U+037F ...)
  \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
  \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
                            (105: U+0951..0952, U+0964..0965,
                            U+0A81..0A83, U+0A85..0A8D,
                            U+0A8F..0A91, U+0A93..0AA8 ...)
  \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
  \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
                            \p{Gong}) (65: U+0964..0965,
                            U+11D60..11D65, U+11D67..11D68,
                            U+11D6A..11D8E, U+11D90..11D91,
                            U+11D93..11D98 ...)
  \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
                            (94: U+0951..0952, U+0964..0965,
                            U+0A01..0A03, U+0A05..0A0A,
                            U+0A0F..0A10, U+0A13..0A28 ...)
  \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
  \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (89_513:
                            U+2E80..2E99, U+2E9B..2EF3,
                            U+2F00..2FD5, U+3001..3003,
                            U+3005..3011, U+3013..301F ...)
  \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
  \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
                            (11_775: U+1100..11FF, U+3001..3003,
                            U+3008..3011, U+3013..301F,
                            U+302E..3030, U+3037 ...)
  \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (89_513)
  \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
                            \p{Rohg}) (55: U+060C, U+061B, U+061F,
                            U+0640, U+06D4, U+10D00..10D27 ...)
  \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
  \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
                            (23: U+1720..1736)
  \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
  \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
                            U+108E0..108F2, U+108F4..108F5,
                            U+108FB..108FF)
  \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
  \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
                            (134: U+0591..05C7, U+05D0..05EA,
                            U+05EF..05F4, U+FB1D..FB36,
                            U+FB38..FB3C, U+FB3E ...)
  \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (431)
  \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
                            (431: U+3001..3003, U+3008..3011,
                            U+3013..301F, U+3030..3035, U+3037,
                            U+303C..303D ...)
  \p{Script_Extensions: Hluw} \p{Script_Extensions=
                            Anatolian_Hieroglyphs} (583)
  \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
                            (127)
  \p{Script_Extensions: Hmnp} \p{Script_Extensions=
                            Nyiakeng_Puachue_Hmong} (71)
  \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
                            (108)
  \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
                            \p{Armi}) (31: U+10840..10855,
                            U+10857..1085F)
  \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
                            (502: U+0300..0341, U+0343..0344,
                            U+0346..0362, U+0953..0954,
                            U+1AB0..1ABE, U+1DC2..1DF9 ...)
  \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
                            \p{Phli}) (27: U+10B60..10B72,
                            U+10B78..10B7F)
  \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
                            Prti}, \p{Prti}) (30: U+10B40..10B55,
                            U+10B58..10B5F)
  \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
  \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
  \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
                            (91: U+A980..A9CD, U+A9CF..A9D9,
                            U+A9DE..A9DF)
  \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (87:
                            U+0966..096F, U+A830..A839,
                            U+11080..110C1, U+110CD)
  \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
  \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (356)
  \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
                            (104: U+0951..0952, U+0964..0965,
                            U+0C80..0C8C, U+0C8E..0C90,
                            U+0C92..0CA8, U+0CAA..0CB3 ...)
  \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
                            (356: U+3001..3003, U+3008..3011,
                            U+3013..301F, U+3030..3035, U+3037,
5707                             U+303C..303D ...)
5708   \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
5709                             (48: U+A900..A92F)
5710   \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
5711   \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
5712                             (68: U+10A00..10A03, U+10A05..10A06,
5713                             U+10A0C..10A13, U+10A15..10A17,
5714                             U+10A19..10A35, U+10A38..10A3A ...)
5715   \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
5716                             U+1780..17DD, U+17E0..17E9,
5717                             U+17F0..17F9, U+19E0..19FF)
5718   \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
5719   \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (82)
5720   \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (82:
5721                             U+0AE6..0AEF, U+A830..A839,
5722                             U+11200..11211, U+11213..1123E)
5723   \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
5724                             (81: U+0964..0965, U+A830..A839,
5725                             U+112B0..112EA, U+112F0..112F9)
5726   \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (104)
5727   \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (87)
5728   \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
5729   \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (82:
5730                             U+0E81..0E82, U+0E84, U+0E86..0E8A,
5731                             U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
5732   \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (82)
  \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
                            (1387: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
                            \xf6\xf8-\xff], U+0100..02B8,
                            U+02E0..02E4, U+0363..036F,
                            U+0485..0486, U+0951..0952 ...)
  \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1387)
  \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
  \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
                            U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
  \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
  \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
                            U+0965, U+1900..191E, U+1920..192B,
                            U+1930..193B, U+1940, U+1944..194F)
  \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
  \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
  \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
                            (386: U+10107..10133, U+10600..10736,
                            U+10740..10755, U+10760..10767)
  \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
                            (268: U+10000..1000B, U+1000D..10026,
                            U+10028..1003A, U+1003C..1003D,
                            U+1003F..1004D, U+10050..1005D ...)
  \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48:
                            U+A4D0..A4FF)
  \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
  \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
                            U+10280..1029C)
  \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
  \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
                            U+10920..10939, U+1093F)
  \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
                            (61: U+0964..096F, U+A830..A839,
                            U+11150..11176)
  \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
  \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
  \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
                            (25: U+11EE0..11EF8)
  \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
                            (125: U+0951..0952, U+0964..0965,
                            U+0D00..0D03, U+0D05..0D0C,
                            U+0D0E..0D10, U+0D12..0D44 ...)
  \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
  \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
                            (30: U+0640, U+0840..085B, U+085E)
  \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
  \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
                            (52: U+0640, U+10AC0..10AE6,
                            U+10AEB..10AF6)
  \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
  \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
                            (68: U+11C70..11C8F, U+11C92..11CA7,
                            U+11CA9..11CB6)
  \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
                            \p{Gonm}) (77: U+0964..0965,
                            U+11D00..11D06, U+11D08..11D09,
                            U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
                            ...)
  \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
                            (91: U+16E40..16E9A)
  \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
  \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
                            \p{Mtei}) (79: U+AAE0..AAF6,
                            U+ABC0..ABED, U+ABF0..ABF9)
  \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
                            (213)
  \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
                            \p{Mend}) (213: U+1E800..1E8C4,
                            U+1E8C7..1E8D6)
  \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
                            (90)
  \p{Script_Extensions: Mero} \p{Script_Extensions=
                            Meroitic_Hieroglyphs} (32)
  \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
                            \p{Merc}) (90: U+109A0..109B7,
                            U+109BC..109CF, U+109D2..109FF)
  \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
                            \p{Mero}) (32: U+10980..1099F)
  \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
                            U+16F00..16F4A, U+16F4F..16F87,
                            U+16F8F..16F9F)
  \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (125)
  \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
                            U+A830..A839, U+11600..11644,
                            U+11650..11659)
  \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (171)
  \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
                            (171: U+1800..180E, U+1810..1819,
                            U+1820..1878, U+1880..18AA, U+202F,
                            U+11660..1166C)
  \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
                            U+16A40..16A5E, U+16A60..16A69,
                            U+16A6E..16A6F)
  \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
  \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
  \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
  \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
                            (48: U+0A66..0A6F, U+11280..11286,
                            U+11288, U+1128A..1128D, U+1128F..1129D,
                            U+1129F..112A9)
  \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
                            (224: U+1000..109F, U+A92E,
                            U+A9E0..A9FE, U+AA60..AA7F)
  \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
  \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
                            (40: U+10880..1089E, U+108A7..108AF)
  \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
  \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
                            (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
                            U+1CF2, U+1CFA, U+A830..A835 ...)
  \p{Script_Extensions: Narb} \p{Script_Extensions=
                            Old_North_Arabian} (32)
  \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
  \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
                            (83: U+1980..19AB, U+19B0..19C9,
                            U+19D0..19DA, U+19DE..19DF)
  \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (94:
                            U+11400..11459, U+1145B, U+1145D..1145F)
  \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (62:
                            U+07C0..07FA, U+07FD..07FF)
  \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (62)
  \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
  \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
                            U+16FE1, U+1B170..1B2FB)
  \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
                            Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
                            U+1E130..1E13D, U+1E140..1E149,
                            U+1E14E..1E14F)
  \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
  \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
                            U+1680..169C)
  \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
                            (48: U+1C50..1C7F)
  \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
  \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
                            \p{Hung}) (108: U+10C80..10CB2,
                            U+10CC0..10CF2, U+10CFA..10CFF)
  \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
                            (39: U+10300..10323, U+1032D..1032F)
  \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
                            \p{Narb}) (32: U+10A80..10A9F)
  \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
                            (44: U+0483, U+10350..1037A)
  \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
                            (50: U+103A0..103C3, U+103C8..103D5)
  \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
                            (40: U+10F00..10F27)
  \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
                            \p{Sarb}) (32: U+10A60..10A7F)
  \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
                            (73: U+10C00..10C48)
  \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (96:
                            U+0951..0952, U+0964..0965,
                            U+0B01..0B03, U+0B05..0B0C,
                            U+0B0F..0B10, U+0B13..0B28 ...)
  \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
  \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (96)
  \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
                            U+104B0..104D3, U+104D8..104FB)
  \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
  \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
  \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
                            (40: U+10480..1049D, U+104A0..104A9)
  \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
                            \p{Hmng}) (127: U+16B00..16B45,
                            U+16B50..16B59, U+16B5B..16B61,
                            U+16B63..16B77, U+16B7D..16B8F)
  \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
  \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
                            (32: U+10860..1087F)
  \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
                            (57: U+11AC0..11AF8)
  \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
  \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
  \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
  \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
                            (59: U+1802..1803, U+1805, U+A840..A877)
  \p{Script_Extensions: Phli} \p{Script_Extensions=
                            Inscriptional_Pahlavi} (27)
  \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
                            (30)
  \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
  \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
                            (29: U+10900..1091B, U+1091F)
  \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
  \p{Script_Extensions: Prti} \p{Script_Extensions=
                            Inscriptional_Parthian} (30)
  \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
                            \p{Phlp}) (30: U+0640, U+10B80..10B91,
                            U+10B99..10B9C, U+10BA9..10BAF)
  \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
  \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (502)
  \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
                            U+A930..A953, U+A95F)
  \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
  \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
                            (55)
  \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
                            U+16A0..16EA, U+16EE..16F8)
  \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
  \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
                            (61: U+0800..082D, U+0830..083E)
  \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
  \p{Script_Extensions: Sarb} \p{Script_Extensions=
                            Old_South_Arabian} (32)
  \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
  \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
                            (82: U+A880..A8C5, U+A8CE..A8D9)
  \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
  \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
                            (100: U+0951, U+1CD7, U+1CD9,
                            U+1CDC..1CDD, U+1CE0, U+11180..111CD ...)
  \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
                            (48: U+10450..1047F)
  \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
  \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
  \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
  \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
                            (92: U+11580..115B5, U+115B8..115DD)
  \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
                            (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
                            U+1DAA1..1DAAF)
  \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
  \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
  \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
                            (112: U+0964..0965, U+0D82..0D83,
                            U+0D85..0D96, U+0D9A..0DB1,
                            U+0DB3..0DBB, U+0DBD ...)
  \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
  \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
                            (43: U+0640, U+10F30..10F59)
  \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
  \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
  \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
                            \p{Sora}) (35: U+110D0..110E8,
                            U+110F0..110F9)
  \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
  \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
                            (83: U+11A50..11AA2)
  \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
  \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
                            (72: U+1B80..1BBF, U+1CC0..1CC7)
  \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
  \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
                            \p{Sylo}) (56: U+0964..0965,
                            U+09E6..09EF, U+A800..A82B)
  \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (105)
  \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
                            (105: U+060C, U+061B..061C, U+061F,
                            U+0640, U+064B..0655, U+0670 ...)
  \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
                            (22: U+1700..170C, U+170E..1714,
                            U+1735..1736)
  \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
  \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
                            (20: U+1735..1736, U+1760..176C,
                            U+176E..1770, U+1772..1773)
  \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
                            U+1040..1049, U+1950..196D, U+1970..1974)
  \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
                            (127: U+1A20..1A5E, U+1A60..1A7C,
                            U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
  \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
                            (72: U+AA80..AAC2, U+AADB..AADF)
  \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (79)
  \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (79:
                            U+0964..0965, U+A830..A839,
                            U+11680..116B8, U+116C0..116C9)
  \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
  \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
  \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
                            U+0951..0952, U+0964..0965,
                            U+0B82..0B83, U+0B85..0B8A,
                            U+0B8E..0B90, U+0B92..0B95 ...)
  \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
  \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6892)
  \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
                            (6892: U+16FE0, U+17000..187F7,
                            U+18800..18AF2)
  \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
  \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (104)
  \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
                            (104: U+0951..0952, U+0964..0965,
                            U+0C00..0C0C, U+0C0E..0C10,
                            U+0C12..0C28, U+0C2A..0C39 ...)
  \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
  \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
  \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
  \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
                            U+060C, U+061B..061C, U+061F,
                            U+0660..0669, U+0780..07B1, U+FDF2 ...)
  \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
                            U+0E01..0E3A, U+0E40..0E5B)
  \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
                            (207: U+0F00..0F47, U+0F49..0F6C,
                            U+0F71..0F97, U+0F99..0FBC,
                            U+0FBE..0FCC, U+0FCE..0FD4 ...)
  \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
  \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
                            (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
  \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
  \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
                            (97: U+0951..0952, U+0964..0965, U+1CF2,
                            U+A830..A839, U+11480..114C7,
                            U+114D0..114D9)
  \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
  \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
                            (31: U+10380..1039D, U+1039F)
  \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
                            (976_118 plus all above-Unicode code
                            points: U+0378..0379, U+0380..0383,
                            U+038B, U+038D, U+03A2, U+0530 ...)
  \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
                            U+A500..A62B)
  \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
  \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
                            U+1E2C0..1E2F9, U+1E2FF)
  \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
  \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
                            (84: U+118A0..118F2, U+118FF)
  \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
  \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
  \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
  \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
                            U+3001..3002, U+3008..3011,
                            U+3014..301B, U+30FB, U+A000..A48C,
                            U+A490..A4C6 ...)
  \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
  \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
                            \p{Zanb}) (72: U+11A00..11A47)
  \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
                            (72)
  \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (502)
  \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7386)
  \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (976_118
                            plus all above-Unicode code points)
6068   \p{Scx: *}              \p{Script_Extensions: *}
6069   \p{SD}                  \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
6070   \p{SD: *}               \p{Soft_Dotted: *}
6071   \p{Sentence_Break: AT}  \p{Sentence_Break=ATerm} (4)
6072   \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
6073                             U+FE52, U+FF0E)
6074   \p{Sentence_Break: CL}  \p{Sentence_Break=Close} (187)
6075   \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187:
6076                             [\"\'\(\)\[\]\{\}\xab\xbb],
6077                             U+0F3A..0F3D, U+169B..169C,
6078                             U+2018..201F, U+2039..203A, U+2045..2046
6079                             ...)
6080   \p{Sentence_Break: CR}  (Short: \p{SB=CR}) (1: [\r])
6081   \p{Sentence_Break: EX}  \p{Sentence_Break=Extend} (2368)
6082   \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2368: U+0300..036F,
6083                             U+0483..0489, U+0591..05BD, U+05BF,
6084                             U+05C1..05C2, U+05C4..05C5 ...)
6085   \p{Sentence_Break: FO}  \p{Sentence_Break=Format} (63)
6086   \p{Sentence_Break: Format} (Short: \p{SB=FO}) (63: [\xad],
6087                             U+0600..0605, U+061C, U+06DD, U+070F,
6088                             U+08E2 ...)
6089   \p{Sentence_Break: LE}  \p{Sentence_Break=OLetter} (121_822)
6090   \p{Sentence_Break: LF}  (Short: \p{SB=LF}) (1: [\n])
6091   \p{Sentence_Break: LO}  \p{Sentence_Break=Lower} (2293)
6092   \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2293: [a-
6093                             z\xaa\xb5\xba\xdf-\xf6\xf8-\xff],
6094                             U+0101, U+0103, U+0105, U+0107, U+0109
6095                             ...)
6096   \p{Sentence_Break: NU}  \p{Sentence_Break=Numeric} (632)
6097   \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (632: [0-9],
6098                             U+0660..0669, U+066B..066C,
6099                             U+06F0..06F9, U+07C0..07C9, U+0966..096F
6100                             ...)
6101   \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (121_822: U+01BB,
6102                             U+01C0..01C3, U+0294, U+02B9..02BF,
6103                             U+02C6..02D1, U+02EC ...)
6104   \p{Sentence_Break: Other} (Short: \p{SB=XX}) (984_661 plus all
6105                             above-Unicode code points:
6106                             [^\t\n\cK\f\r\x20!\"\'\(\),\-.0-9:?A-
6107                             Z\[\]a-z\{\}\x85\xa0\xaa-
6108                             \xab\xad\xb5\xba-\xbb\xc0-\xd6\xd8-
6109                             \xf6\xf8-\xff], U+02C2..02C5,
6110                             U+02D2..02DF, U+02E5..02EB, U+02ED,
6111                             U+02EF..02FF ...)
6112   \p{Sentence_Break: SC}  \p{Sentence_Break=SContinue} (26)
6113   \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
6114                             U+055D, U+060C..060D, U+07F8, U+1802,
6115                             U+1808 ...)
6116   \p{Sentence_Break: SE}  \p{Sentence_Break=Sep} (3)
6117   \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
6118                             U+2028..2029)
6119   \p{Sentence_Break: Sp}  (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
6120                             U+1680, U+2000..200A, U+202F, U+205F,
6121                             U+3000)
6122   \p{Sentence_Break: ST}  \p{Sentence_Break=STerm} (138)
6123   \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (138: [!?], U+0589,
6124                             U+061E..061F, U+06D4, U+0700..0702,
6125                             U+07F9 ...)
6126   \p{Sentence_Break: UP}  \p{Sentence_Break=Upper} (1893)
6127   \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1893: [A-Z\xc0-
6128                             \xd6\xd8-\xde], U+0100, U+0102, U+0104,
6129                             U+0106, U+0108 ...)
6130   \p{Sentence_Break: XX}  \p{Sentence_Break=Other} (984_661 plus all
                            above-Unicode code points)
  \p{Sentence_Terminal}   \p{Sentence_Terminal=Y} (Short: \p{STerm})
                            (141)
  \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
                            (1_113_971 plus all above-Unicode code
                            points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
                            \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
                            \xff], U+0100..0588, U+058A..061D,
                            U+0620..06D3, U+06D5..06FF, U+0703..07F8
                            ...)
  \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (141:
                            [!.?], U+0589, U+061E..061F, U+06D4,
                            U+0700..0702, U+07F9 ...)
  \p{Separator}           \p{General_Category=Separator} (Short:
                            \p{Z}) (19)
  \p{Sgnw}                \p{SignWriting} (= \p{Script_Extensions=
                            SignWriting}) (672)
  \p{Sharada}             \p{Script_Extensions=Sharada} (Short:
                            \p{Shrd}; NOT \p{Block=Sharada}) (100)
  \p{Shavian}             \p{Script_Extensions=Shavian} (Short:
                            \p{Shaw}) (48)
  \p{Shaw}                \p{Shavian} (= \p{Script_Extensions=
                            Shavian}) (48)
X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
                            (16)
  \p{Shrd}                \p{Sharada} (= \p{Script_Extensions=
                            Sharada}) (NOT \p{Block=Sharada}) (100)
  \p{Sidd}                \p{Siddham} (= \p{Script_Extensions=
                            Siddham}) (NOT \p{Block=Siddham}) (92)
  \p{Siddham}             \p{Script_Extensions=Siddham} (Short:
                            \p{Sidd}; NOT \p{Block=Siddham}) (92)
  \p{SignWriting}         \p{Script_Extensions=SignWriting} (Short:
                            \p{Sgnw}) (672)
  \p{Sind}                \p{Khudawadi} (= \p{Script_Extensions=
                            Khudawadi}) (NOT \p{Block=Khudawadi})
                            (81)
  \p{Sinh}                \p{Sinhala} (= \p{Script_Extensions=
                            Sinhala}) (NOT \p{Block=Sinhala}) (112)
  \p{Sinhala}             \p{Script_Extensions=Sinhala} (Short:
                            \p{Sinh}; NOT \p{Block=Sinhala}) (112)
X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
  \p{Sk}                  \p{Modifier_Symbol} (=
                            \p{General_Category=Modifier_Symbol})
                            (121)
  \p{Sm}                  \p{Math_Symbol} (= \p{General_Category=
                            Math_Symbol}) (948)
X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
                            \p{InSmallForms}) (32)
X \p{Small_Forms}         \p{Small_Form_Variants} (= \p{Block=
                            Small_Form_Variants}) (32)
X \p{Small_Kana_Ext}      \p{Small_Kana_Extension} (= \p{Block=
                            Small_Kana_Extension}) (64)
X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
                            \p{InSmallKanaExt}) (64)
  \p{So}                  \p{Other_Symbol} (= \p{General_Category=
                            Other_Symbol}) (6161)
  \p{Soft_Dotted}         \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
  \p{Soft_Dotted: N*}     (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
                            all above-Unicode code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                            Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
                            U+0100..012E, U+0130..0248,
                            U+024A..0267, U+0269..029C, U+029E..02B1
                            ...)
  \p{Soft_Dotted: Y*}     (Short: \p{SD=Y}, \p{SD}) (46: [i-j],
                            U+012F, U+0249, U+0268, U+029D, U+02B2
                            ...)
  \p{Sogd}                \p{Sogdian} (= \p{Script_Extensions=
                            Sogdian}) (NOT \p{Block=Sogdian}) (43)
  \p{Sogdian}             \p{Script_Extensions=Sogdian} (Short:
                            \p{Sogd}; NOT \p{Block=Sogdian}) (43)
  \p{Sogo}                \p{Old_Sogdian} (= \p{Script_Extensions=
                            Old_Sogdian}) (NOT \p{Block=
                            Old_Sogdian}) (40)
  \p{Sora}                \p{Sora_Sompeng} (= \p{Script_Extensions=
                            Sora_Sompeng}) (NOT \p{Block=
                            Sora_Sompeng}) (35)
  \p{Sora_Sompeng}        \p{Script_Extensions=Sora_Sompeng} (Short:
                            \p{Sora}; NOT \p{Block=Sora_Sompeng})
                            (35)
  \p{Soyo}                \p{Soyombo} (= \p{Script_Extensions=
                            Soyombo}) (NOT \p{Block=Soyombo}) (83)
  \p{Soyombo}             \p{Script_Extensions=Soyombo} (Short:
                            \p{Soyo}; NOT \p{Block=Soyombo}) (83)
  \p{Space}               \p{White_Space} (= \p{White_Space=Y}) (25)
  \p{Space: *}            \p{White_Space: *}
  \p{Space_Separator}     \p{General_Category=Space_Separator}
                            (Short: \p{Zs}) (17)
  \p{SpacePerl}           \p{XPosixSpace} (25)
  \p{Spacing_Mark}        \p{General_Category=Spacing_Mark} (Short:
                            \p{Mc}) (429)
X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
                            (Short: \p{InModifierLetters}) (80)
X \p{Specials}            \p{Block=Specials} (16)
  \p{STerm}               \p{Sentence_Terminal} (=
                            \p{Sentence_Terminal=Y}) (141)
  \p{STerm: *}            \p{Sentence_Terminal: *}
  \p{Sund}                \p{Sundanese} (= \p{Script_Extensions=
                            Sundanese}) (NOT \p{Block=Sundanese})
                            (72)
  \p{Sundanese}           \p{Script_Extensions=Sundanese} (Short:
                            \p{Sund}; NOT \p{Block=Sundanese}) (72)
X \p{Sundanese_Sup}       \p{Sundanese_Supplement} (= \p{Block=
                            Sundanese_Supplement}) (16)
X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
                            \p{InSundaneseSup}) (16)
X \p{Sup_Arrows_A}        \p{Supplemental_Arrows_A} (= \p{Block=
                            Supplemental_Arrows_A}) (16)
X \p{Sup_Arrows_B}        \p{Supplemental_Arrows_B} (= \p{Block=
                            Supplemental_Arrows_B}) (128)
X \p{Sup_Arrows_C}        \p{Supplemental_Arrows_C} (= \p{Block=
                            Supplemental_Arrows_C}) (256)
X \p{Sup_Math_Operators}  \p{Supplemental_Mathematical_Operators} (=
                            \p{Block=
                            Supplemental_Mathematical_Operators})
                            (256)
X \p{Sup_PUA_A}           \p{Supplementary_Private_Use_Area_A} (=
                            \p{Block=
                            Supplementary_Private_Use_Area_A})
                            (65_536)
X \p{Sup_PUA_B}           \p{Supplementary_Private_Use_Area_B} (=
                            \p{Block=
                            Supplementary_Private_Use_Area_B})
                            (65_536)
X \p{Sup_Punctuation}     \p{Supplemental_Punctuation} (= \p{Block=
                            Supplemental_Punctuation}) (128)
X \p{Sup_Symbols_And_Pictographs}
                            \p{Supplemental_Symbols_And_Pictographs}
                            (= \p{Block=
                            Supplemental_Symbols_And_Pictographs})
                            (256)
X \p{Super_And_Sub}       \p{Superscripts_And_Subscripts} (=
                            \p{Block=Superscripts_And_Subscripts})
                            (48)
X \p{Superscripts_And_Subscripts} \p{Block=
                            Superscripts_And_Subscripts} (Short:
                            \p{InSuperAndSub}) (48)
X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
                            \p{InSupArrowsA}) (16)
X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
                            \p{InSupArrowsB}) (128)
X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
                            \p{InSupArrowsC}) (256)
X \p{Supplemental_Mathematical_Operators} \p{Block=
                            Supplemental_Mathematical_Operators}
                            (Short: \p{InSupMathOperators}) (256)
X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
                            (Short: \p{InSupPunctuation}) (128)
X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
                            Supplemental_Symbols_And_Pictographs}
                            (Short: \p{InSupSymbolsAndPictographs})
                            (256)
X \p{Supplementary_Private_Use_Area_A} \p{Block=
                            Supplementary_Private_Use_Area_A}
                            (Short: \p{InSupPUAA}) (65_536)
X \p{Supplementary_Private_Use_Area_B} \p{Block=
                            Supplementary_Private_Use_Area_B}
                            (Short: \p{InSupPUAB}) (65_536)
  \p{Surrogate}           \p{General_Category=Surrogate} (Short:
                            \p{Cs}) (2048)
X \p{Sutton_SignWriting}  \p{Block=Sutton_SignWriting} (688)
  \p{Sylo}                \p{Syloti_Nagri} (= \p{Script_Extensions=
                            Syloti_Nagri}) (NOT \p{Block=
                            Syloti_Nagri}) (56)
  \p{Syloti_Nagri}        \p{Script_Extensions=Syloti_Nagri} (Short:
                            \p{Sylo}; NOT \p{Block=Syloti_Nagri})
                            (56)
  \p{Symbol}              \p{General_Category=Symbol} (Short: \p{S})
                            (7292)
X \p{Symbols_And_Pictographs_Ext_A}
                            \p{Symbols_And_Pictographs_Extended_A}
                            (= \p{Block=
                            Symbols_And_Pictographs_Extended_A})
                            (144)
X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
                            Symbols_And_Pictographs_Extended_A} (144)
  \p{Syrc}                \p{Syriac} (= \p{Script_Extensions=
                            Syriac}) (NOT \p{Block=Syriac}) (105)
  \p{Syriac}              \p{Script_Extensions=Syriac} (Short:
                            \p{Syrc}; NOT \p{Block=Syriac}) (105)
X \p{Syriac_Sup}          \p{Syriac_Supplement} (= \p{Block=
                            Syriac_Supplement}) (16)
X \p{Syriac_Supplement}   \p{Block=Syriac_Supplement} (Short:
                            \p{InSyriacSup}) (16)
  \p{Tagalog}             \p{Script_Extensions=Tagalog} (Short:
                            \p{Tglg}; NOT \p{Block=Tagalog}) (22)
  \p{Tagb}                \p{Tagbanwa} (= \p{Script_Extensions=
                            Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
  \p{Tagbanwa}            \p{Script_Extensions=Tagbanwa} (Short:
                            \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
X \p{Tags}                \p{Block=Tags} (128)
  \p{Tai_Le}              \p{Script_Extensions=Tai_Le} (Short:
                            \p{Tale}; NOT \p{Block=Tai_Le}) (45)
  \p{Tai_Tham}            \p{Script_Extensions=Tai_Tham} (Short:
                            \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
  \p{Tai_Viet}            \p{Script_Extensions=Tai_Viet} (Short:
                            \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
X \p{Tai_Xuan_Jing}       \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
                            Tai_Xuan_Jing_Symbols}) (96)
X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
                            \p{InTaiXuanJing}) (96)
  \p{Takr}                \p{Takri} (= \p{Script_Extensions=Takri})
                            (NOT \p{Block=Takri}) (79)
  \p{Takri}               \p{Script_Extensions=Takri} (Short:
                            \p{Takr}; NOT \p{Block=Takri}) (79)
  \p{Tale}                \p{Tai_Le} (= \p{Script_Extensions=
                            Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
  \p{Talu}                \p{New_Tai_Lue} (= \p{Script_Extensions=
                            New_Tai_Lue}) (NOT \p{Block=
                            New_Tai_Lue}) (83)
  \p{Tamil}               \p{Script_Extensions=Tamil} (Short:
                            \p{Taml}; NOT \p{Block=Tamil}) (133)
X \p{Tamil_Sup}           \p{Tamil_Supplement} (= \p{Block=
                            Tamil_Supplement}) (64)
X \p{Tamil_Supplement}    \p{Block=Tamil_Supplement} (Short:
                            \p{InTamilSup}) (64)
  \p{Taml}                \p{Tamil} (= \p{Script_Extensions=Tamil})
                            (NOT \p{Block=Tamil}) (133)
  \p{Tang}                \p{Tangut} (= \p{Script_Extensions=
                            Tangut}) (NOT \p{Block=Tangut}) (6892)
  \p{Tangut}              \p{Script_Extensions=Tangut} (Short:
                            \p{Tang}; NOT \p{Block=Tangut}) (6892)
X \p{Tangut_Components}   \p{Block=Tangut_Components} (768)
  \p{Tavt}                \p{Tai_Viet} (= \p{Script_Extensions=
                            Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
  \p{Telu}                \p{Telugu} (= \p{Script_Extensions=
                            Telugu}) (NOT \p{Block=Telugu}) (104)
  \p{Telugu}              \p{Script_Extensions=Telugu} (Short:
                            \p{Telu}; NOT \p{Block=Telugu}) (104)
  \p{Term}                \p{Terminal_Punctuation} (=
                            \p{Terminal_Punctuation=Y}) (264)
  \p{Term: *}             \p{Terminal_Punctuation: *}
  \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
                            \p{Term}) (264)
  \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
                            (1_113_848 plus all above-Unicode code
                            points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
                            9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
                            U+0100..037D, U+037F..0386,
                            U+0388..0588, U+058A..05C2, U+05C4..060B
                            ...)
  \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (264:
                            [!,.:;?], U+037E, U+0387, U+0589,
                            U+05C3, U+060C ...)
  \p{Tfng}                \p{Tifinagh} (= \p{Script_Extensions=
                            Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
  \p{Tglg}                \p{Tagalog} (= \p{Script_Extensions=
                            Tagalog}) (NOT \p{Block=Tagalog}) (22)
  \p{Thaa}                \p{Thaana} (= \p{Script_Extensions=
                            Thaana}) (NOT \p{Block=Thaana}) (66)
  \p{Thaana}              \p{Script_Extensions=Thaana} (Short:
                            \p{Thaa}; NOT \p{Block=Thaana}) (66)
  \p{Thai}                \p{Script_Extensions=Thai} (NOT \p{Block=
                            Thai}) (86)
  \p{Tibetan}             \p{Script_Extensions=Tibetan} (Short:
                            \p{Tibt}; NOT \p{Block=Tibetan}) (207)
  \p{Tibt}                \p{Tibetan} (= \p{Script_Extensions=
                            Tibetan}) (NOT \p{Block=Tibetan}) (207)
  \p{Tifinagh}            \p{Script_Extensions=Tifinagh} (Short:
                            \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
  \p{Tirh}                \p{Tirhuta} (= \p{Script_Extensions=
                            Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
  \p{Tirhuta}             \p{Script_Extensions=Tirhuta} (Short:
                            \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
  \p{Title}               \p{Titlecase} (/i= Cased=Yes) (31)
  \p{Titlecase}           (= \p{Gc=Lt}) (Short: \p{Title}; /i=
                            Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
                            U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
  \p{Titlecase_Letter}    \p{General_Category=Titlecase_Letter}
                            (Short: \p{Lt}; /i= General_Category=
                            Cased_Letter) (31)
X \p{Transport_And_Map}   \p{Transport_And_Map_Symbols} (= \p{Block=
                            Transport_And_Map_Symbols}) (128)
X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
                            (Short: \p{InTransportAndMap}) (128)
X \p{UCAS}                \p{Unified_Canadian_Aboriginal_Syllabics}
                            (= \p{Block=
                            Unified_Canadian_Aboriginal_Syllabics})
                            (640)
X \p{UCAS_Ext}            \p{Unified_Canadian_Aboriginal_Syllabics_-
                            Extended} (= \p{Block=
                            Unified_Canadian_Aboriginal_Syllabics_-
                            Extended}) (80)
  \p{Ugar}                \p{Ugaritic} (= \p{Script_Extensions=
                            Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
  \p{Ugaritic}            \p{Script_Extensions=Ugaritic} (Short:
                            \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
  \p{UIdeo}               \p{Unified_Ideograph} (=
                            \p{Unified_Ideograph=Y}) (87_887)
  \p{UIdeo: *}            \p{Unified_Ideograph: *}
  \p{Unassigned}          \p{General_Category=Unassigned} (Short:
                            \p{Cn}) (836_602 plus all above-Unicode
                            code points)
  \p{Unicode}             \p{Any} (1_114_112)
X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
                            Unified_Canadian_Aboriginal_Syllabics}
                            (Short: \p{InUCAS}) (640)
X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
                            Unified_Canadian_Aboriginal_Syllabics_-
                            Extended} (Short: \p{InUCASExt}) (80)
  \p{Unified_Ideograph}   \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
                            (87_887)
  \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
                            (1_026_225 plus all above-Unicode code
                            points: U+0000..33FF, U+4DB6..4DFF,
                            U+9FF0..FA0D, U+FA10, U+FA12,
                            U+FA15..FA1E ...)
  \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (87_887:
                            U+3400..4DB5, U+4E00..9FEF,
                            U+FA0E..FA0F, U+FA11, U+FA13..FA14,
                            U+FA1F ...)
  \p{Unknown}             \p{Script_Extensions=Unknown} (Short:
                            \p{Zzzz}) (976_118 plus all above-
                            Unicode code points)
  \p{Upper}               \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
                            Cased=Yes) (1908)
  \p{Upper: *}            \p{Uppercase: *}
  \p{Uppercase}           \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
                            Cased=Yes) (1908)
  \p{Uppercase: N*}       (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
                            No) (1_112_204 plus all above-Unicode
                            code points: [\x00-
                            \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`a-z\{\|\}~\x7f-
                            \xbf\xd7\xdf-\xff], U+0101, U+0103,
                            U+0105, U+0107, U+0109 ...)
  \p{Uppercase: Y*}       (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
                            Yes) (1908: [A-Z\xc0-\xd6\xd8-\xde],
                            U+0100, U+0102, U+0104, U+0106, U+0108
                            ...)
  \p{Uppercase_Letter}    \p{General_Category=Uppercase_Letter}
                            (Short: \p{Lu}; /i= General_Category=
                            Cased_Letter) (1788)
  \p{Vai}                 \p{Script_Extensions=Vai} (NOT \p{Block=
                            Vai}) (300)
  \p{Vaii}                \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
                            \p{Block=Vai}) (300)
  \p{Variation_Selector}  \p{Variation_Selector=Y} (Short: \p{VS};
                            NOT \p{Variation_Selectors}) (259)
  \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
                            plus all above-Unicode code points:
                            U+0000..180A, U+180E..FDFF,
                            U+FE10..E00FF, U+E01F0..infinity)
  \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259:
                            U+180B..180D, U+FE00..FE0F,
                            U+E0100..E01EF)
X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
                            \p{InVS}) (16)
X \p{Variation_Selectors_Supplement} \p{Block=
                            Variation_Selectors_Supplement} (Short:
                            \p{InVSSup}) (240)
X \p{Vedic_Ext}           \p{Vedic_Extensions} (= \p{Block=
                            Vedic_Extensions}) (48)
X \p{Vedic_Extensions}    \p{Block=Vedic_Extensions} (Short:
                            \p{InVedicExt}) (48)
X \p{Vertical_Forms}      \p{Block=Vertical_Forms} (16)
  \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
                            (787_620 plus all above-Unicode code
                            points)
  \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (787_620 plus
                            all above-Unicode code points: [\x00-
                            \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
                            \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
                            U+02EC..10FF, U+1200..1400,
                            U+1680..18AF, U+1900..2015 ...)
  \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
                            Transformed_Rotated} (47)
  \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
                            (47: U+2329..232A, U+3008..3011,
                            U+3014..301F, U+3030, U+30A0, U+30FC ...)
  \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
                            (148: U+3001..3002, U+3041, U+3043,
                            U+3045, U+3047, U+3049 ...)
  \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
                            Transformed_Upright} (148)
  \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
                            (326_297)
  \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (326_297:
                            [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
                            U+02EA..02EB, U+1100..11FF,
                            U+1401..167F, U+18B0..18FF, U+2016 ...)
  \p{VertSpace}           \v (7: [\n\cK\f\r\x85], U+2028..2029)
  \p{Vo: *}               \p{Vertical_Orientation: *}
  \p{VS}                  \p{Variation_Selector} (=
                            \p{Variation_Selector=Y}) (NOT
                            \p{Variation_Selectors}) (259)
  \p{VS: *}               \p{Variation_Selector: *}
X \p{VS_Sup}              \p{Variation_Selectors_Supplement} (=
                            \p{Block=
                            Variation_Selectors_Supplement}) (240)
  \p{Wancho}              \p{Script_Extensions=Wancho} (Short:
                            \p{Wcho}; NOT \p{Block=Wancho}) (59)
  \p{Wara}                \p{Warang_Citi} (= \p{Script_Extensions=
                            Warang_Citi}) (NOT \p{Block=
                            Warang_Citi}) (84)
  \p{Warang_Citi}         \p{Script_Extensions=Warang_Citi} (Short:
                            \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
  \p{WB: *}               \p{Word_Break: *}
  \p{Wcho}                \p{Wancho} (= \p{Script_Extensions=
                            Wancho}) (NOT \p{Block=Wancho}) (59)
  \p{White_Space}         \p{White_Space=Y} (Short: \p{Space}) (25)
  \p{White_Space: N*}     (Short: \p{Space=N}, \P{Space}) (1_114_087
                            plus all above-Unicode code points:
                            [^\t\n\cK\f\r\x20\x85\xa0],
                            U+0100..167F, U+1681..1FFF,
                            U+200B..2027, U+202A..202E, U+2030..205E
                            ...)
  \p{White_Space: Y*}     (Short: \p{Space=Y}, \p{Space}) (25:
                            [\t\n\cK\f\r\x20\x85\xa0], U+1680,
                            U+2000..200A, U+2028..2029, U+202F,
                            U+205F ...)
  \p{Word}                \p{XPosixWord} (128_919)
  \p{Word_Break: ALetter} (Short: \p{WB=LE}) (28_693: [A-Za-
                            z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                            \xff], U+0100..02D7, U+02DE..02E4,
                            U+02EC..02FF, U+0370..0374, U+0376..0377
                            ...)
  \p{Word_Break: CR}      (Short: \p{WB=CR}) (1: [\r])
  \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
  \p{Word_Break: DQ}      \p{Word_Break=Double_Quote} (1)
  \p{Word_Break: E_Base}  (Short: \p{WB=EB}) (0)
  \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
  \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
  \p{Word_Break: EB}      \p{Word_Break=E_Base} (0)
  \p{Word_Break: EBG}     \p{Word_Break=E_Base_GAZ} (0)
  \p{Word_Break: EM}      \p{Word_Break=E_Modifier} (0)
  \p{Word_Break: EX}      \p{Word_Break=ExtendNumLet} (11)
  \p{Word_Break: Extend}  (Short: \p{WB=Extend}) (2372:
                            U+0300..036F, U+0483..0489,
                            U+0591..05BD, U+05BF, U+05C1..05C2,
                            U+05C4..05C5 ...)
  \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
                            U+203F..2040, U+2054, U+FE33..FE34,
                            U+FE4D..FE4F ...)
  \p{Word_Break: FO}      \p{Word_Break=Format} (62)
  \p{Word_Break: Format}  (Short: \p{WB=FO}) (62: [\xad],
                            U+0600..0605, U+061C, U+06DD, U+070F,
                            U+08E2 ...)
  \p{Word_Break: GAZ}     \p{Word_Break=Glue_After_Zwj} (0)
  \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
  \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
                            U+05D0..05EA, U+05EF..05F2, U+FB1D,
                            U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
                            ...)
  \p{Word_Break: HL}      \p{Word_Break=Hebrew_Letter} (75)
  \p{Word_Break: KA}      \p{Word_Break=Katakana} (314)
  \p{Word_Break: Katakana} (Short: \p{WB=KA}) (314: U+3031..3035,
                            U+309B..309C, U+30A0..30FA,
                            U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
                            ...)
  \p{Word_Break: LE}      \p{Word_Break=ALetter} (28_693)
  \p{Word_Break: LF}      (Short: \p{WB=LF}) (1: [\n])
  \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (7)
  \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (8: [:\xb7], U+0387,
                            U+05F4, U+2027, U+FE13, U+FE55 ...)
  \p{Word_Break: MidNum}  (Short: \p{WB=MN}) (15: [,;], U+037E,
                            U+0589, U+060C..060D, U+066C, U+07F8 ...)
  \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
                            U+2018..2019, U+2024, U+FE52, U+FF07,
                            U+FF0E)
  \p{Word_Break: ML}      \p{Word_Break=MidLetter} (8)
  \p{Word_Break: MN}      \p{Word_Break=MidNum} (15)
  \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
                            U+2028..2029)
  \p{Word_Break: NL}      \p{Word_Break=Newline} (5)
  \p{Word_Break: NU}      \p{Word_Break=Numeric} (631)
  \p{Word_Break: Numeric} (Short: \p{WB=NU}) (631: [0-9],
                            U+0660..0669, U+066B, U+06F0..06F9,
                            U+07C0..07C9, U+0966..096F ...)
  \p{Word_Break: Other}   (Short: \p{WB=XX}) (1_081_874 plus all
                            above-Unicode code points:
                            [^\n\cK\f\r\x20\"\',.0-9:;A-Z_a-
                            z\x85\xaa\xad\xb5\xb7\xba\xc0-\xd6\xd8-
                            \xf6\xf8-\xff], U+02D8..02DD,
                            U+02E5..02EB, U+0375, U+0378..0379,
                            U+0380..0385 ...)
  \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
                            U+1F1E6..1F1FF)
  \p{Word_Break: RI}      \p{Word_Break=Regional_Indicator} (26)
  \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
  \p{Word_Break: SQ}      \p{Word_Break=Single_Quote} (1)
  \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
                            U+1680, U+2000..2006, U+2008..200A,
                            U+205F, U+3000)
  \p{Word_Break: XX}      \p{Word_Break=Other} (1_081_874 plus all
                            above-Unicode code points)
  \p{Word_Break: ZWJ}     (Short: \p{WB=ZWJ}) (1: U+200D)
  \p{WSpace}              \p{White_Space} (= \p{White_Space=Y}) (25)
  \p{WSpace: *}           \p{White_Space: *}
  \p{XDigit}              \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  \p{XID_Continue}        \p{XID_Continue=Y} (Short: \p{XIDC})
                            (128_770)
  \p{XID_Continue: N*}    (Short: \p{XIDC=N}, \P{XIDC}) (985_342
                            plus all above-Unicode code points:
                            [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=
                            >?\@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-
                            \xb4\xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
                            U+02C2..02C5, U+02D2..02DF,
                            U+02E5..02EB, U+02ED, U+02EF..02FF ...)
  \p{XID_Continue: Y*}    (Short: \p{XIDC=Y}, \p{XIDC}) (128_770:
                            [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-
                            \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
                            U+02C6..02D1, U+02E0..02E4, U+02EC,
                            U+02EE ...)
  \p{XID_Start}           \p{XID_Start=Y} (Short: \p{XIDS}) (125_861)
  \p{XID_Start: N*}       (Short: \p{XIDS=N}, \P{XIDS}) (988_251
                            plus all above-Unicode code points:
                            [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
                            >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-
                            \xb4\xb6-\xb9\xbb-\xbf\xd7\xf7],
                            U+02C2..02C5, U+02D2..02DF,
                            U+02E5..02EB, U+02ED, U+02EF..036F ...)
  \p{XID_Start: Y*}       (Short: \p{XIDS=Y}, \p{XIDS}) (125_861:
                            [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-
                            \xf6\xf8-\xff], U+0100..02C1,
                            U+02C6..02D1, U+02E0..02E4, U+02EC,
                            U+02EE ...)
  \p{XIDC}                \p{XID_Continue} (= \p{XID_Continue=Y})
                            (128_770)
  \p{XIDC: *}             \p{XID_Continue: *}
  \p{XIDS}                \p{XID_Start} (= \p{XID_Start=Y}) (125_861)
  \p{XIDS: *}             \p{XID_Start: *}
  \p{Xpeo}                \p{Old_Persian} (= \p{Script_Extensions=
                            Old_Persian}) (NOT \p{Block=
                            Old_Persian}) (50)
  \p{XPerlSpace}          \p{XPosixSpace} (25)
  \p{XPosixAlnum}         Alphabetic and (decimal) Numeric (Short:
                            \p{Alnum}) (127_886: [0-9A-Za-
                            z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
                            \xff], U+0100..02C1, U+02C6..02D1,
                            U+02E0..02E4, U+02EC, U+02EE ...)
  \p{XPosixAlpha}         \p{Alphabetic=Y} (Short: \p{Alpha})
                            (127_256)
  \p{XPosixBlank}         \h, Horizontal white space (Short:
                            \p{Blank}) (18: [\t\x20\xa0], U+1680,
                            U+2000..200A, U+202F, U+205F, U+3000)
  \p{XPosixCntrl}         \p{General_Category=Control} Control
                            characters (Short: \p{Cc}) (65)
  \p{XPosixDigit}         \p{General_Category=Decimal_Number} [0-9]
                            + all other decimal digits (Short:
                            \p{Nd}) (630)
  \p{XPosixGraph}         Characters that are graphical (Short:
                            \p{Graph}) (275_378:
                            [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
                            Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
                            U+0100..0377, U+037A..037F,
                            U+0384..038A, U+038C, U+038E..03A1 ...)
  \p{XPosixLower}         \p{Lowercase=Y} (Short: \p{Lower}; /i=
                            Cased=Yes) (2340)
  \p{XPosixPrint}         Characters that are graphical plus space
                            characters (but no controls) (Short:
                            \p{Print}) (275_395: [\x20-\x7e\xa0-
                            \xff], U+0100..0377, U+037A..037F,
                            U+0384..038A, U+038C, U+038E..03A1 ...)
  \p{XPosixPunct}         \p{Punct} + ASCII-range \p{Symbol} (801:
                            [!\"#\$\%&\'\(\)*+,\-.\/:;<=
                            >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
                            \xb7\xbb\xbf], U+037E, U+0387,
                            U+055A..055F, U+0589..058A, U+05BE ...)
  \p{XPosixSpace}         \s including beyond ASCII and vertical tab
                            (Short: \p{SpacePerl}) (25:
                            [\t\n\cK\f\r\x20\x85\xa0], U+1680,
                            U+2000..200A, U+2028..2029, U+202F,
                            U+205F ...)
  \p{XPosixUpper}         \p{Uppercase=Y} (Short: \p{Upper}; /i=
                            Cased=Yes) (1908)
6697   \p{XPosixWord}          \w, including beyond ASCII; = \p{Alnum} +
6698                             \pM + \p{Pc} + \p{Join_Control} (Short:
6699                             \p{Word}) (128_919: [0-9A-Z_a-
6700                             z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6701                             \xff], U+0100..02C1, U+02C6..02D1,
6702                             U+02E0..02E4, U+02EC, U+02EE ...)
6703   \p{XPosixXDigit}        \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
6704   \p{Xsux}                \p{Cuneiform} (= \p{Script_Extensions=
6705                             Cuneiform}) (NOT \p{Block=Cuneiform})
6706                             (1234)
6707   \p{Yi}                  \p{Script_Extensions=Yi} (1246)
6708 X \p{Yi_Radicals}         \p{Block=Yi_Radicals} (64)
6709 X \p{Yi_Syllables}        \p{Block=Yi_Syllables} (1168)
6710   \p{Yiii}                \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
6711 X \p{Yijing}              \p{Yijing_Hexagram_Symbols} (= \p{Block=
6712                             Yijing_Hexagram_Symbols}) (64)
6713 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
6714                             (Short: \p{InYijing}) (64)
6715   \p{Z} \pZ               \p{Separator} (= \p{General_Category=
6716                             Separator}) (19)
6717   \p{Zanabazar_Square}    \p{Script_Extensions=Zanabazar_Square}
6718                             (Short: \p{Zanb}; NOT \p{Block=
6719                             Zanabazar_Square}) (72)
6720   \p{Zanb}                \p{Zanabazar_Square} (=
6721                             \p{Script_Extensions=Zanabazar_Square})
6722                             (NOT \p{Block=Zanabazar_Square}) (72)
6723   \p{Zinh}                \p{Inherited} (= \p{Script_Extensions=
6724                             Inherited}) (502)
6725   \p{Zl}                  \p{Line_Separator} (= \p{General_Category=
6726                             Line_Separator}) (1)
6727   \p{Zp}                  \p{Paragraph_Separator} (=
6728                             \p{General_Category=
6729                             Paragraph_Separator}) (1)
6730   \p{Zs}                  \p{Space_Separator} (=
6731                             \p{General_Category=Space_Separator})
6732                             (17)
6733   \p{Zyyy}                \p{Common} (= \p{Script_Extensions=
6734                             Common}) (7386)
6735   \p{Zzzz}                \p{Unknown} (= \p{Script_Extensions=
6736                             Unknown}) (976_118 plus all above-
6737                             Unicode code points)
6738
6739
6740
=head2 Legal C<\p{}> and C<\P{}> constructs that match no characters

Unicode has some property-value pairs that currently don't match anything.
This generally happens either because they are obsolete, or because they exist
for symmetry with other forms but no language that uses them has yet been
encoded.  In this version of Unicode, the following match zero code points:

=over 4

=item \p{Canonical_Combining_Class=Attached_Below_Left}

=item \p{Canonical_Combining_Class=CCC133}

=item \p{Grapheme_Cluster_Break=E_Base}

=item \p{Grapheme_Cluster_Break=E_Base_GAZ}

=item \p{Grapheme_Cluster_Break=E_Modifier}

=item \p{Grapheme_Cluster_Break=Glue_After_Zwj}

=item \p{Word_Break=E_Base}

=item \p{Word_Break=E_Base_GAZ}

=item \p{Word_Break=E_Modifier}

=item \p{Word_Break=Glue_After_Zwj}

=back
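As a rough illustration of how such a construct can be checked, the sketch
below counts the code points a property-value pair matches by walking its
inversion list.  C<count_matches()> is a hypothetical helper, not part of any
module, and the result for a given pair may differ on a Perl built with
another Unicode version.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

# Hypothetical helper: count the Unicode code points matched by a
# property-value pair by summing the lengths of the matching ranges in
# its inversion list.
sub count_matches {
    my ($property) = @_;
    my @invlist = prop_invlist($property);
    my $count = 0;
    for (my $i = 0; $i < @invlist; $i += 2) {
        # A final range with no end is counted up to U+10FFFF only.
        my $end = $i + 1 < @invlist ? $invlist[$i + 1] : 0x110000;
        $count += $end - $invlist[$i];
    }
    return $count;
}

for my $property ('Canonical_Combining_Class=Attached_Below_Left',
                  'Canonical_Combining_Class=Above')
{
    printf "%-46s %d\n", $property, count_matches($property);
}
```

An empty inversion list means the construct is legal but matches nothing.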
6771
6772
6773
=head1 Properties accessible through Unicode::UCD

The value of any Unicode character property mentioned above (Perl extensions
excluded) for any single code point is available through
L<Unicode::UCD/charprop()>.  L<Unicode::UCD/charprops_all()> returns the
values of all the Unicode properties for a given code point.
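A minimal sketch of both calls follows, assuming a Perl recent enough for
C<Unicode::UCD> to provide C<charprop()> and C<charprops_all()>.  The code
point and the property names are arbitrary choices for illustration.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop charprops_all);

my $code_point = ord 'A';    # U+0041

# One property of one code point.
my $script = charprop($code_point, 'Script');
print "Script of U+0041: $script\n";

# Every Unicode property of that code point, as a hash reference keyed
# by property name.
my $all = charprops_all($code_point);
for my $name (sort keys %$all) {
    next unless $name =~ /^(?:Age|Block|Name)$/;
    print "$name = $all->{$name}\n";
}
```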

Besides these, all the Unicode character properties mentioned above
(except for those marked as for internal use by Perl) are also
accessible by L<Unicode::UCD/prop_invlist()>.
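The following sketch shows one way to use the returned inversion list,
together with C<search_invlist()> from the same module.  C<is_ahex()> is a
hypothetical helper name chosen for this example.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist search_invlist);

# An inversion list holds the starting code points of alternating ranges:
# even-indexed elements start ranges that match the property, odd-indexed
# elements start ranges that do not.
my @ahex = prop_invlist('ASCII_Hex_Digit');
printf "Range starts: %s\n", join ', ', map { sprintf 'U+%04X', $_ } @ahex;

# Hypothetical helper: true if the code point is an ASCII hex digit.
sub is_ahex {
    my ($code_point) = @_;
    my $index = search_invlist(\@ahex, $code_point);
    # An even index means the code point falls in a matching range.
    return defined $index && $index % 2 == 0;
}

print "f is a hex digit\n"     if is_ahex(ord 'f');
print "g is not a hex digit\n" if !is_ahex(ord 'g');
```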

Due to their nature, not all Unicode character properties are suitable for
regular expression matches, nor for C<prop_invlist()>.  The remaining
non-provisional, non-internal ones are accessible via
L<Unicode::UCD/prop_invmap()> (except for those that this Perl installation
hasn't included; see L<below for which those are|/Unicode character properties
that are NOT accepted by Perl>).

For compatibility with other parts of Perl, all the single forms given in the
table in the L<section above|/Properties accessible through \p{} and \P{}>
are recognized.  However, there are some ambiguities between Perl extensions
and Unicode properties, all of which are silently resolved in favor of the
official Unicode property.  To avoid surprises, you should only use
C<prop_invmap()> for forms listed in the table below, which omits the
non-recommended ones.  The affected forms are the Perl single-form equivalents
of Unicode properties.  For example, C<\p{sc}> is a single-form equivalent of
C<\p{gc=sc}>, but C<prop_invmap()> treats C<sc> as the C<Script> property,
whose short name is C<sc>.  The table indicates the current ambiguities in the
INFO column, beginning with the word C<"NOT">.
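The C<sc> ambiguity can be observed with a short sketch like the one below.
C<script_of()> is a hypothetical helper, and the exact strings stored in the
map are whatever this Perl's tables contain.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap search_invlist);

# In a regular expression, \p{Sc} means General_Category=Currency_Symbol.
print "DOLLAR SIGN matches \\p{Sc}\n" if '$' =~ /\p{Sc}/;

# prop_invmap() instead takes 'sc' to be the Script property.
my ($list_ref, $map_ref, $format, $default) = prop_invmap('sc');

# Hypothetical helper: look up the mapped value for one code point.
sub script_of {
    my ($code_point) = @_;
    my $index = search_invlist($list_ref, $code_point);
    return defined $index ? $map_ref->[$index] : $default;
}

printf "U+0024 maps to %s\n", script_of(0x24);
printf "U+0041 maps to %s\n", script_of(0x41);
```

The values printed are script names, not general categories, which is the
silent resolution in favor of the official Unicode property.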

The standard Unicode properties listed below are documented in
L<http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is documented in
L<Unicode::UCD/prop_invmap()>.  The other Perl extensions are in
L<perlunicode/Other Properties>.

The first column in the table is a name for the property; the second column is
an alternative name, if any, plus possibly some annotations.  The alternative
name is the property's full name, unless that would simply repeat the first
column, in which case the second column indicates the property's short name
(if different).  The annotations are given only in the entry for the full
name.  The annotations for binary properties include a list of the first few
ranges that the property matches.  To avoid any ambiguity, the SPACE character
is represented as C<\x20>.

If a property is obsolete, etc., the entry will be flagged with the same
characters used in the table in the L<section above|/Properties accessible
through \p{} and \P{}>, like B<D> or B<S>.
6821
6822   NAME                      INFO
6823
6824   Age
6825   AHex                    ASCII_Hex_Digit
6826   All                     (Perl extension).  All code points,
6827                           including those above Unicode.  Same as
6828                           qr/./s.  U+0000..infinity
6829   Alnum                   XPosixAlnum.  (Perl extension)
6830   Alpha                   Alphabetic
6831   Alphabetic              (Short: Alpha).  [A-Za-z\xaa\xb5\xba\xc0-
6832                           \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6833                           U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6834                           ...
6835   Any                     (Perl extension).  All Unicode code
6836                           points.  U+0000..10FFFF
6837   ASCII                   Block=Basic_Latin.  (Perl extension).
6838                           [\x00-\x7f]
6839   ASCII_Hex_Digit         (Short: AHex).  [0-9A-Fa-f]
6840   Assigned                (Perl extension).  All assigned code
6841                           points.  U+0000..0377, U+037A..037F,
6842                           U+0384..038A, U+038C, U+038E..03A1,
6843                           U+03A3..052F ...
6844   Bc                      Bidi_Class
6845   Bidi_C                  Bidi_Control
6846   Bidi_Class              (Short: bc)
6847   Bidi_Control            (Short: Bidi_C).  U+061C, U+200E..200F,
6848                           U+202A..202E, U+2066..2069
6849   Bidi_M                  Bidi_Mirrored
6850   Bidi_Mirrored           (Short: Bidi_M).
6851                           [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
6852                           U+169B..169C, U+2039..203A, U+2045..2046,
6853                           U+207D..207E ...
6854   Bidi_Mirroring_Glyph    (Short: bmg)
6855   Bidi_Paired_Bracket     (Short: bpb)
6856   Bidi_Paired_Bracket_Type (Short: bpt)
6857   Blank                   XPosixBlank.  (Perl extension)
6858   Blk                     Block
6859   Block                   (Short: blk)
6860   Bmg                     Bidi_Mirroring_Glyph
6861   Bpb                     Bidi_Paired_Bracket
6862   Bpt                     Bidi_Paired_Bracket_Type
6863   Canonical_Combining_Class (Short: ccc)
6864   Case_Folding            (Short: cf)
6865   Case_Ignorable          (Short: CI).
6866                           [\'.:\^`\xa8\xad\xaf\xb4\xb7-\xb8],
6867                           U+02B0..036F, U+0374..0375, U+037A,
6868                           U+0384..0385, U+0387 ...
6869   Cased                   [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
6870                           \xff], U+0100..01BA, U+01BC..01BF,
6871                           U+01C4..0293, U+0295..02B8, U+02C0..02C1
6872                           ...
6873   Category                General_Category
6874   Ccc                     Canonical_Combining_Class
6875   CE                      Composition_Exclusion
6876   Cf                      Case_Folding; NOT 'cf' meaning
6877                           'General_Category=Format'
6878   Changes_When_Casefolded (Short: CWCF).  [A-Z\xb5\xc0-\xd6\xd8-
6879                           \xdf], U+0100, U+0102, U+0104, U+0106,
6880                           U+0108 ...
6881   Changes_When_Casemapped (Short: CWCM).  [A-Za-z\xb5\xc0-\xd6\xd8-
6882                           \xf6\xf8-\xff], U+0100..0137,
6883                           U+0139..018C, U+018E..019A, U+019C..01A9,
6884                           U+01AC..01B9 ...
6885   Changes_When_Lowercased (Short: CWL).  [A-Z\xc0-\xd6\xd8-\xde],
6886                           U+0100, U+0102, U+0104, U+0106, U+0108 ...
6887   Changes_When_NFKC_Casefolded (Short: CWKCF).  [A-
6888                           Z\xa0\xa8\xaa\xad\xaf\xb2-\xb5\xb8-
6889                           \xba\xbc-\xbe\xc0-\xd6\xd8-\xdf], U+0100,
6890                           U+0102, U+0104, U+0106, U+0108 ...
6891   Changes_When_Titlecased (Short: CWT).  [a-z\xb5\xdf-\xf6\xf8-
6892                           \xff], U+0101, U+0103, U+0105, U+0107,
6893                           U+0109 ...
6894   Changes_When_Uppercased (Short: CWU).  [a-z\xb5\xdf-\xf6\xf8-
6895                           \xff], U+0101, U+0103, U+0105, U+0107,
6896                           U+0109 ...
6897   CI                      Case_Ignorable
6898   Cntrl                   XPosixCntrl (=General_Category=Control).
6899                           (Perl extension)
6900   Comp_Ex                 Full_Composition_Exclusion
6901   Composition_Exclusion   (Short: CE).  U+0958..095F, U+09DC..09DD,
6902                           U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
6903   CWCF                    Changes_When_Casefolded
6904   CWCM                    Changes_When_Casemapped
6905   CWKCF                   Changes_When_NFKC_Casefolded
6906   CWL                     Changes_When_Lowercased
6907   CWT                     Changes_When_Titlecased
6908   CWU                     Changes_When_Uppercased
6909   Dash                    [\-], U+058A, U+05BE, U+1400, U+1806,
6910                           U+2010..2015 ...
6911   Decomposition_Mapping   (Short: dm)
6912   Decomposition_Type      (Short: dt)
6913   Default_Ignorable_Code_Point (Short: DI).  [\xad], U+034F, U+061C,
6914                           U+115F..1160, U+17B4..17B5, U+180B..180E
6915                           ...
6916   Dep                     Deprecated
6917   Deprecated              (Short: Dep).  U+0149, U+0673, U+0F77,
6918                           U+0F79, U+17A3..17A4, U+206A..206F ...
6919   DI                      Default_Ignorable_Code_Point
6920   Dia                     Diacritic
6921   Diacritic               (Short: Dia).  [\^`\xa8\xaf\xb4\xb7-\xb8],
6922                           U+02B0..034E, U+0350..0357, U+035D..0362,
6923                           U+0374..0375, U+037A ...
6924   Digit                   XPosixDigit (=General_Category=
6925                           Decimal_Number).  (Perl extension)
6926   Dm                      Decomposition_Mapping
6927   Dt                      Decomposition_Type
6928   Ea                      East_Asian_Width
6929   East_Asian_Width        (Short: ea)
6930   EqUIdeo                 Equivalent_Unified_Ideograph
6931   Equivalent_Unified_Ideograph (Short: EqUIdeo)
6932   Ext                     Extender
6933   Extender                (Short: Ext).  [\xb7], U+02D0..02D1,
6934                           U+0640, U+07FA, U+0E46, U+0EC6 ...
6935   Full_Composition_Exclusion (Short: Comp_Ex).  U+0340..0341,
6936                           U+0343..0344, U+0374, U+037E, U+0387,
6937                           U+0958..095F ...
6938   Gc                      General_Category
6939   GCB                     Grapheme_Cluster_Break
6940   General_Category        (Short: gc)
6941   Gr_Base                 Grapheme_Base
6942   Gr_Ext                  Grapheme_Extend
6943   Graph                   XPosixGraph.  (Perl extension)
6944   Grapheme_Base           (Short: Gr_Base).  [\x20-\x7e\xa0-
6945                           \xac\xae-\xff], U+0100..02FF,
6946                           U+0370..0377, U+037A..037F, U+0384..038A,
6947                           U+038C ...
6948   Grapheme_Cluster_Break  (Short: GCB)
6949   Grapheme_Extend         (Short: Gr_Ext).  U+0300..036F,
6950                           U+0483..0489, U+0591..05BD, U+05BF,
6951                           U+05C1..05C2, U+05C4..05C5 ...
6952   Hangul_Syllable_Type    (Short: hst)
6953   Hex                     Hex_Digit
6954   Hex_Digit               (Short: Hex).  [0-9A-Fa-f], U+FF10..FF19,
6955                           U+FF21..FF26, U+FF41..FF46
6956   HorizSpace              XPosixBlank.  (Perl extension)
6957   Hst                     Hangul_Syllable_Type
6958 D Hyphen                  [\-\xad], U+058A, U+1806, U+2010..2011,
6959                           U+2E17, U+30FB ...  Supplanted by
6960                           Line_Break property values; see
6961                           www.unicode.org/reports/tr14
6962   ID_Continue             (Short: IDC).  [0-9A-Z_a-
6963                           z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
6964                           \xff], U+0100..02C1, U+02C6..02D1,
6965                           U+02E0..02E4, U+02EC, U+02EE ...
6966   ID_Start                (Short: IDS).  [A-Za-z\xaa\xb5\xba\xc0-
6967                           \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
6968                           U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
6969                           ...
6970   IDC                     ID_Continue
6971   Ideo                    Ideographic
6972   Ideographic             (Short: Ideo).  U+3006..3007,
6973                           U+3021..3029, U+3038..303A, U+3400..4DB5,
6974                           U+4E00..9FEF, U+F900..FA6D ...
6975   IDS                     ID_Start
6976   IDS_Binary_Operator     (Short: IDSB).  U+2FF0..2FF1, U+2FF4..2FFB
6977   IDS_Trinary_Operator    (Short: IDST).  U+2FF2..2FF3
6978   IDSB                    IDS_Binary_Operator
6979   IDST                    IDS_Trinary_Operator
6980   In                      Present_In.  (Perl extension)
6981   Indic_Positional_Category (Short: InPC)
6982   Indic_Syllabic_Category (Short: InSC)
6983   InPC                    Indic_Positional_Category
6984   InSC                    Indic_Syllabic_Category
6985   Isc                     ISO_Comment; NOT 'isc' meaning
6986                           'General_Category=Other'
6987   ISO_Comment             (Short: isc)
6988   Jg                      Joining_Group
6989   Join_C                  Join_Control
6990   Join_Control            (Short: Join_C).  U+200C..200D
6991   Joining_Group           (Short: jg)
6992   Joining_Type            (Short: jt)
6993   Jt                      Joining_Type
6994   Lb                      Line_Break
6995   Lc                      Lowercase_Mapping; NOT 'lc' meaning
6996                           'General_Category=Cased_Letter'
6997   Line_Break              (Short: lb)
6998   LOE                     Logical_Order_Exception
6999   Logical_Order_Exception (Short: LOE).  U+0E40..0E44, U+0EC0..0EC4,
7000                           U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
7001                           ...
7002   Lower                   Lowercase
7003   Lowercase               (Short: Lower).  [a-z\xaa\xb5\xba\xdf-
7004                           \xf6\xf8-\xff], U+0101, U+0103, U+0105,
7005                           U+0107, U+0109 ...
7006   Lowercase_Mapping       (Short: lc)
7007   Math                    [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
7008                           U+03D5, U+03F0..03F1, U+03F4..03F6,
7009                           U+0606..0608 ...
7010   Na                      Name
7011   Na1                     Unicode_1_Name
7012   Name                    (Short: na)
7013   Name_Alias
7014   NChar                   Noncharacter_Code_Point
7015   NFC_QC                  NFC_Quick_Check
7016   NFC_Quick_Check         (Short: NFC_QC)
7017   NFD_QC                  NFD_Quick_Check
7018   NFD_Quick_Check         (Short: NFD_QC)
7019   NFKC_Casefold           (Short: NFKC_CF)
7020   NFKC_CF                 NFKC_Casefold
7021   NFKC_QC                 NFKC_Quick_Check
7022   NFKC_Quick_Check        (Short: NFKC_QC)
7023   NFKD_QC                 NFKD_Quick_Check
7024   NFKD_Quick_Check        (Short: NFKD_QC)
7025   Noncharacter_Code_Point (Short: NChar).  U+FDD0..FDEF,
7026                           U+FFFE..FFFF, U+1FFFE..1FFFF,
7027                           U+2FFFE..2FFFF, U+3FFFE..3FFFF,
7028                           U+4FFFE..4FFFF ...
7029   Nt                      Numeric_Type
7030   Numeric_Type            (Short: nt)
7031   Numeric_Value           (Short: nv)
7032   Nv                      Numeric_Value
7033   Pat_Syn                 Pattern_Syntax
7034   Pat_WS                  Pattern_White_Space
7035   Pattern_Syntax          (Short: Pat_Syn).
7036                           [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7037                           >?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9\xab-
7038                           \xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
7039                           U+2010..2027, U+2030..203E, U+2041..2053,
7040                           U+2055..205E, U+2190..245F ...
7041   Pattern_White_Space     (Short: Pat_WS).  [\t\n\cK\f\r\x20\x85],
7042                           U+200E..200F, U+2028..2029
7043   PCM                     Prepended_Concatenation_Mark
7044   Perl_Decimal_Digit      (Perl extension)
7045   PerlSpace               PosixSpace.  (Perl extension)
7046   PerlWord                PosixWord.  (Perl extension)
7047   PosixAlnum              (Perl extension).  [0-9A-Za-z]
7048   PosixAlpha              (Perl extension).  [A-Za-z]
7049   PosixBlank              (Perl extension).  [\t\x20]
7050   PosixCntrl              (Perl extension).  ASCII control
7051                           characters.  ACK, BEL, BS, CAN, CR, DC1,
7052                           DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
7053                           ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
7054                           NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
7055   PosixDigit              (Perl extension).  [0-9]
7056   PosixGraph              (Perl extension).
7057                           [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
7058                           Z\[\\\]\^_`a-z\{\|\}~]
7059   PosixLower              (Perl extension).  [a-z]
7060   PosixPrint              (Perl extension).  [\x20-\x7e]
7061   PosixPunct              (Perl extension).
7062                           [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7063                           >?\@\[\\\]\^_`\{\|\}~]
7064   PosixSpace              (Perl extension).  [\t\n\cK\f\r\x20]
7065   PosixUpper              (Perl extension).  [A-Z]
7066   PosixWord               (Perl extension).  \w, restricted to
7067                           ASCII.  [0-9A-Z_a-z]
7068   PosixXDigit             ASCII_Hex_Digit.  (Perl extension).
7069                           [0-9A-Fa-f]
7070   Prepended_Concatenation_Mark (Short: PCM).  U+0600..0605, U+06DD,
7071                           U+070F, U+08E2, U+110BD, U+110CD
7072   Present_In              (Short: In).  (Perl extension)
7073   Print                   XPosixPrint.  (Perl extension)
7074   Punct                   General_Category=Punctuation.  (Perl
7075                           extension).
7076                           [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_-
7077                           \{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
7078                           U+037E, U+0387, U+055A..055F,
7079                           U+0589..058A, U+05BE ...
7080   QMark                   Quotation_Mark
7081   Quotation_Mark          (Short: QMark).  [\"\'\xab\xbb],
7082                           U+2018..201F, U+2039..203A, U+2E42,
7083                           U+300C..300F, U+301D..301F ...
7084   Radical                 U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
7085   Regional_Indicator      (Short: RI).  U+1F1E6..1F1FF
7086   RI                      Regional_Indicator
7087   SB                      Sentence_Break
7088   Sc                      Script; NOT 'sc' meaning
7089                           'General_Category=Currency_Symbol'
7090   Scf                     Simple_Case_Folding
7091   Script                  (Short: sc)
7092   Script_Extensions       (Short: scx)
7093   Scx                     Script_Extensions
7094   SD                      Soft_Dotted
7095   Sentence_Break          (Short: SB)
7096   Sentence_Terminal       (Short: STerm).  [!.?], U+0589,
7097                           U+061E..061F, U+06D4, U+0700..0702, U+07F9
7098                           ...
7099   Sfc                     Simple_Case_Folding
7100   Simple_Case_Folding     (Short: scf)
7101   Simple_Lowercase_Mapping (Short: slc)
7102   Simple_Titlecase_Mapping (Short: stc)
7103   Simple_Uppercase_Mapping (Short: suc)
7104   Slc                     Simple_Lowercase_Mapping
7105   Soft_Dotted             (Short: SD).  [i-j], U+012F, U+0249,
7106                           U+0268, U+029D, U+02B2 ...
7107   Space                   White_Space
7108   SpacePerl               XPosixSpace.  (Perl extension)
7109   Stc                     Simple_Titlecase_Mapping
7110   STerm                   Sentence_Terminal
7111   Suc                     Simple_Uppercase_Mapping
7112   Tc                      Titlecase_Mapping
7113   Term                    Terminal_Punctuation
7114   Terminal_Punctuation    (Short: Term).  [!,.:;?], U+037E, U+0387,
7115                           U+0589, U+05C3, U+060C ...
7116   Title                   Titlecase.  (Perl extension)
7117   Titlecase               (Short: Title).  (Perl extension).  (=
7118                           \p{Gc=Lt}).  U+01C5, U+01C8, U+01CB,
7119                           U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
7120   Titlecase_Mapping       (Short: tc)
7121   Uc                      Uppercase_Mapping
7122   UIdeo                   Unified_Ideograph
7123   Unicode                 Any.  (Perl extension)
7124   Unicode_1_Name          (Short: na1)
7125   Unified_Ideograph       (Short: UIdeo).  U+3400..4DB5,
7126                           U+4E00..9FEF, U+FA0E..FA0F, U+FA11,
7127                           U+FA13..FA14, U+FA1F ...
7128   Upper                   Uppercase
7129   Uppercase               (Short: Upper).  [A-Z\xc0-\xd6\xd8-\xde],
7130                           U+0100, U+0102, U+0104, U+0106, U+0108 ...
7131   Uppercase_Mapping       (Short: uc)
7132   Variation_Selector      (Short: VS).  U+180B..180D, U+FE00..FE0F,
7133                           U+E0100..E01EF
7134   Vertical_Orientation    (Short: vo)
7135   VertSpace               (Perl extension).  \v.  [\n\cK\f\r\x85],
7136                           U+2028..2029
7137   Vo                      Vertical_Orientation
7138   VS                      Variation_Selector
7139   WB                      Word_Break
7140   White_Space             (Short: WSpace).
7141                           [\t\n\cK\f\r\x20\x85\xa0], U+1680,
7142                           U+2000..200A, U+2028..2029, U+202F, U+205F
7143                           ...
7144   Word                    XPosixWord.  (Perl extension)
7145   Word_Break              (Short: WB)
7146   WSpace                  White_Space
7147   XDigit                  XPosixXDigit (=Hex_Digit).  (Perl
7148                           extension)
7149   XID_Continue            (Short: XIDC).  [0-9A-Z_a-
7150                           z\xaa\xb5\xb7\xba\xc0-\xd6\xd8-\xf6\xf8-
7151                           \xff], U+0100..02C1, U+02C6..02D1,
7152                           U+02E0..02E4, U+02EC, U+02EE ...
7153   XID_Start               (Short: XIDS).  [A-Za-z\xaa\xb5\xba\xc0-
7154                           \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7155                           U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7156                           ...
7157   XIDC                    XID_Continue
7158   XIDS                    XID_Start
7159   XPerlSpace              XPosixSpace.  (Perl extension)
7160   XPosixAlnum             (Short: Alnum).  (Perl extension).
7161                           Alphabetic and (decimal) Numeric.  [0-9A-
7162                           Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7163                           \xff], U+0100..02C1, U+02C6..02D1,
7164                           U+02E0..02E4, U+02EC, U+02EE ...
7165   XPosixAlpha             Alphabetic.  (Perl extension).  [A-Za-
7166                           z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7167                           U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7168                           U+02EC, U+02EE ...
7169   XPosixBlank             (Short: Blank).  (Perl extension).  \h,
7170                           Horizontal white space.  [\t\x20\xa0],
7171                           U+1680, U+2000..200A, U+202F, U+205F,
7172                           U+3000
7173   XPosixCntrl             General_Category=Control  (Short: Cntrl).
7174                           (Perl extension).  Control characters.
7175                           [\x00-\x1f\x7f-\x9f]
7176   XPosixDigit             General_Category=Decimal_Number  (Short:
7177                           Digit).  (Perl extension).  [0-9] + all
7178                           other decimal digits.  [0-9],
7179                           U+0660..0669, U+06F0..06F9, U+07C0..07C9,
7180                           U+0966..096F, U+09E6..09EF ...
7181   XPosixGraph             (Short: Graph).  (Perl extension).
7182                           Characters that are graphical.
7183                           [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
7184                           Z\[\\\]\^_`a-z\{\|\}~\xa1-\xff],
7185                           U+0100..0377, U+037A..037F, U+0384..038A,
7186                           U+038C, U+038E..03A1 ...
7187   XPosixLower             Lowercase.  (Perl extension).  [a-
7188                           z\xaa\xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
7189                           U+0103, U+0105, U+0107, U+0109 ...
7190   XPosixPrint             (Short: Print).  (Perl extension).
7191                           Characters that are graphical plus space
7192                           characters (but no controls).  [\x20-
7193                           \x7e\xa0-\xff], U+0100..0377,
7194                           U+037A..037F, U+0384..038A, U+038C,
7195                           U+038E..03A1 ...
7196   XPosixPunct             (Perl extension).  \p{Punct} + ASCII-range
7197                           \p{Symbol}.  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7198                           >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
7199                           \xb7\xbb\xbf], U+037E, U+0387,
7200                           U+055A..055F, U+0589..058A, U+05BE ...
7201   XPosixSpace             (Perl extension).  \s including beyond
7202                           ASCII and vertical tab.
7203                           [\t\n\cK\f\r\x20\x85\xa0], U+1680,
7204                           U+2000..200A, U+2028..2029, U+202F, U+205F
7205                           ...
7206   XPosixUpper             Uppercase.  (Perl extension).  [A-Z\xc0-
7207                           \xd6\xd8-\xde], U+0100, U+0102, U+0104,
7208                           U+0106, U+0108 ...
7209   XPosixWord              (Short: Word).  (Perl extension).  \w,
7210                           including beyond ASCII; = \p{Alnum} + \pM
7211                           + \p{Pc} + \p{Join_Control}.  [0-9A-Z_a-
7212                           z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7213                           U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7214                           U+02EC, U+02EE ...
7215   XPosixXDigit            Hex_Digit  (Short: XDigit).  (Perl
7216                           extension).  [0-9A-Fa-f], U+FF10..FF19,
7217                           U+FF21..FF26, U+FF41..FF46
7218
7219
=head1 Properties accessible through other means

Certain properties are also accessible via core function calls.  These are:

 Lowercase_Mapping          lc() and lcfirst()
 Titlecase_Mapping          ucfirst()
 Uppercase_Mapping          uc()

Also, Case_Folding is accessible through the C</i> modifier in regular
expressions, the C<\F> escape in interpolating strings, and the
C<L<fc|perlfunc/fc>> operator.
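As a small sketch of these mappings, the code below uses a letter whose
uppercase and titlecase forms differ.  The code points are arbitrary
examples, and C<fc> and C<\F> are assumed to be enabled through the C<fc>
feature.

```perl
use strict;
use warnings;
use feature 'fc';

# U+01C6 LATIN SMALL LETTER DZ WITH CARON has distinct uppercase
# (U+01C4) and titlecase (U+01C5) forms.
my $dz = "\x{1C6}";
printf "uc:      U+%04X\n", ord(uc $dz);          # Uppercase_Mapping
printf "ucfirst: U+%04X\n", ord(ucfirst $dz);     # Titlecase_Mapping
printf "lc:      U+%04X\n", ord(lc "\x{1C4}");    # Lowercase_Mapping

# Case_Folding: U+017F LATIN SMALL LETTER LONG S folds to "s".
my $long_s = "\x{17F}";
my $folded = fc($long_s);
print "fc and \\F agree\n" if $folded eq "\F$long_s\E";
```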

And, the Name and Name_Alias properties are accessible through the C<\N{}>
interpolation in double-quoted strings and regular expressions, and through
the functions C<charnames::viacode()>, C<charnames::vianame()>, and
C<charnames::string_vianame()> (which require a C<use charnames ();> to be
specified).
7237
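A minimal sketch of these name lookups follows.  It loads C<charnames> with
the C<:full> tag so that C<\N{}> and the functions are both available; the
sample character is an arbitrary choice made for this example.

```perl
use strict;
use warnings;
use charnames ':full';    # "use charnames ();" suffices for the functions alone

# Name -> character, resolved when the string literal is compiled.
my $char = "\N{GREEK SMALL LETTER ALPHA}";

# Code point -> name.
my $name = charnames::viacode(0x03B1);

# Name -> code point (a number).
my $codepoint = charnames::vianame('GREEK SMALL LETTER ALPHA');

# Name -> string containing the character.
my $string = charnames::string_vianame('GREEK SMALL LETTER ALPHA');

printf "viacode: %s\nvianame: U+%04X\nsame string: %s\n",
    $name, $codepoint, ($string eq $char ? 'yes' : 'no');
```
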

Finally, most properties related to decomposition are accessible via
L<Unicode::Normalize>.

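As a small illustration of decomposition through L<Unicode::Normalize>, the
sketch below decomposes one precomposed character with C<NFD()> and
recomposes it with C<NFC()>.  The sample character (U+1E69) is an assumption
of this example, picked because it decomposes into three code points.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC);

# LATIN SMALL LETTER S WITH DOT BELOW AND DOT ABOVE
my $composed = "\x{1E69}";

# Canonical decomposition: base letter followed by its combining marks.
my $decomposed = NFD($composed);
my @codepoints = map { sprintf('U+%04X', ord $_) } split //, $decomposed;

# Canonical composition restores the original character.
my $round_trip = NFC($decomposed) eq $composed ? 1 : 0;

print "NFD: @codepoints\n";
print "NFC round trip: ", ($round_trip ? 'ok' : 'differs'), "\n";
```
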

=head1 Unicode character properties that are NOT accepted by Perl

Perl will generate an error for a few character properties in Unicode when
they are used in a regular expression.  The non-Unihan ones are listed below,
with the reasons they are not accepted, and work-arounds where they exist.
The short names for the properties are listed enclosed in (parentheses).
As described after the list, an installation can change the defaults and choose
to accept any of these.  The list is machine generated based on the
choices made for the installation that generated this document.

=over 4

=item I<Expands_On_NFC> (XO_NFC)

=item I<Expands_On_NFD> (XO_NFD)

=item I<Expands_On_NFKC> (XO_NFKC)

=item I<Expands_On_NFKD> (XO_NFKD)

Deprecated by Unicode.  These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used.  For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character.

=item I<Extended_Pictographic> (XPG)

Not part of the Unicode Character Database.

=item I<Grapheme_Link> (Gr_Link)

Duplicates ccc=vr (Canonical_Combining_Class=Virama).

=item I<Jamo_Short_Name> (JSN)

=item I<Other_Alphabetic> (OAlpha)

=item I<Other_Default_Ignorable_Code_Point> (ODI)

=item I<Other_Grapheme_Extend> (OGr_Ext)

=item I<Other_ID_Continue> (OIDC)

=item I<Other_ID_Start> (OIDS)

=item I<Other_Lowercase> (OLower)

=item I<Other_Math> (OMath)

=item I<Other_Uppercase> (OUpper)

Used by Unicode internally for generating other properties and not intended to be used stand-alone.

=item I<Script=Katakana_Or_Hiragana> (sc=Hrkt)

Obsolete.  All code points previously matched by this have been moved to "Script=Common".  Consider instead using "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or both).

=item I<Script_Extensions=Katakana_Or_Hiragana> (scx=Hrkt)

All code points that would be matched by this are matched by either "Script_Extensions=Katakana" or "Script_Extensions=Hiragana".

=back

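The following sketch shows one way to probe whether a given installation
accepts a property, and tries two of the work-arounds named in the list.
The helper C<property_accepted()> is hypothetical (it is not part of Perl),
and the result printed for C<Other_Alphabetic> depends on how the
installation's tables were built.

```perl
use strict;
use warnings;

# Hypothetical helper: compile \p{NAME} at run time inside eval and
# report whether Perl accepted the property.
sub property_accepted {
    my ($prop) = @_;
    my $pattern = '\p{' . $prop . '}';
    my $ok = eval {
        my $re      = qr/$pattern/;
        my $matched = 'A' =~ $re;    # forces the property lookup
        1;
    };
    return $ok ? 1 : 0;
}

for my $prop (qw(Alphabetic Other_Alphabetic)) {
    printf "%-18s %s\n", $prop,
        property_accepted($prop) ? 'accepted' : 'rejected';
}

# Work-around for Grapheme_Link: test the combining class instead.
my $is_virama = "\x{094D}" =~ /\p{Canonical_Combining_Class=Virama}/ ? 1 : 0;

# Work-around for Katakana_Or_Hiragana: accept either script extension.
my $kana_or_hira =
    qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;
my $katakana_ok = "\x{30A2}" =~ $kana_or_hira ? 1 : 0;
my $hiragana_ok = "\x{3042}" =~ $kana_or_hira ? 1 : 0;

print "virama: $is_virama  katakana: $katakana_ok  hiragana: $hiragana_ok\n";
```
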

An installation can choose to allow any of these to be matched by downloading
the Unicode database from L<http://www.unicode.org/Public/> to
C<$Config{privlib}>/F<unicore/> in the Perl source tree, changing the
controlling lists contained in the program
C<$Config{privlib}>/F<unicore/mktables>, and then re-compiling and installing.
(C<%Config> is available from the Config module.)

Also, perl can be recompiled to operate on an earlier version of the Unicode
standard.  Further information is at
C<$Config{privlib}>/F<unicore/README.perl>.

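As a sketch of how to locate these paths on a particular installation, the
code below expands C<$Config{privlib}> and reports the Unicode version the
running perl was built with.  Whether F<mktables> and F<README.perl> are
actually present depends on how perl was installed, so the code only reports
what it finds.

```perl
use strict;
use warnings;
use Config;            # exports %Config
use File::Spec;
use Unicode::UCD ();

my $unicore_dir = File::Spec->catdir($Config{privlib}, 'unicore');
my $mktables    = File::Spec->catfile($unicore_dir, 'mktables');
my $readme      = File::Spec->catfile($unicore_dir, 'README.perl');

# Version of the Unicode standard this perl's tables were generated from.
my $unicode_version = Unicode::UCD::UnicodeVersion();

print "unicore directory: $unicore_dir\n";
print "Unicode version:   $unicode_version\n";
for my $file ($mktables, $readme) {
    print "$file: ", (-e $file ? 'present' : 'not installed here'), "\n";
}
```
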

=head1 Other information in the Unicode data base

The Unicode data base is delivered in two different formats.  The XML version
is valid for more modern Unicode releases.  The other version is a collection
of files.  The two are intended to give equivalent information.  Perl uses the
older form; this allows you to recompile Perl to use early Unicode releases.

The only non-character property that Perl currently supports is Named
Sequences, in which a sequence of code points
is given a name and generally treated as a single entity.  (Perl supports
these via the C<\N{...}> double-quotish construct,
L<charnames/charnames::string_vianame(name)>, and L<Unicode::UCD/namedseq()>.)

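The three access routes for Named Sequences can be compared with a short
sketch.  The sequence name used here, C<KATAKANA LETTER AINU P>, is an
example chosen for illustration and assumes it is present in the
F<NamedSequences.txt> that the running perl was built from.

```perl
use strict;
use warnings;
use charnames ':full';
use Unicode::UCD qw(namedseq);

my $name = 'KATAKANA LETTER AINU P';

# \N{...} in a double-quotish string.
my $literal = "\N{KATAKANA LETTER AINU P}";

# charnames::string_vianame() returns the whole sequence as a string.
my $from_charnames = charnames::string_vianame($name);

# namedseq(): a string in scalar context, code points in list context.
my $from_ucd   = namedseq($name);
my @codepoints = namedseq($name);

printf "%s = %s (%d code points)\n",
    $name, join(' ', map { sprintf('U+%04X', $_) } @codepoints),
    length $from_ucd;
```

All three routes should yield the same multi-character string, which is why
C<charnames::vianame()>, which returns a single code point, is not suitable
for named sequences.
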

Below is a list of the files in the Unicode data base that Perl doesn't
currently use, along with very brief descriptions of their purposes.
Some of the names of the files have been shortened from those that Unicode
uses, in order to allow them to be distinguishable from similarly named files
on file systems for which only the first 8 characters of a name are
significant.

=over 4

=item F<auxiliary/GraphemeBreakTest.html>

=item F<auxiliary/LineBreakTest.html>

=item F<auxiliary/SentenceBreakTest.html>

=item F<auxiliary/WordBreakTest.html>

Documentation of validation tests.

=item F<BidiCharacterTest.txt>

=item F<BidiTest.txt>

=item F<NormTest.txt>

Validation tests.

=item F<CJKRadicals.txt>

Maps the kRSUnicode property values to corresponding code points.

=item F<EmojiSources.txt>

Maps certain Unicode code points to their legacy Japanese cell-phone values.

=item F<extracted/DName.txt>

This file adds no information not already present in other files.

=item F<Index.txt>

Alphabetical index of Unicode characters.

=item F<NamedSqProv.txt>

Named sequences proposed for inclusion in a later version of the Unicode Standard; if you need them now, you can append this file to F<NamedSequences.txt> and recompile perl.

=item F<NamesList.html>

Describes the format and contents of F<NamesList.txt>.

=item F<NamesList.txt>

Annotated list of characters.

=item F<NormalizationCorrections.txt>

Documentation of corrections already incorporated into the Unicode data base.

=item F<NushuSources.txt>

Specifies source material for Nushu characters.

=item F<ReadMe.txt>

Documentation.

=item F<StandardizedVariants.html>

Obsoleted as of Unicode 9.0, but previously provided a visual display of the standard variant sequences derived from F<StandardizedVariants.txt>.

=item F<StandardizedVariants.txt>

Certain glyph variations for character display are standardized.  This lists the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a separate Unicode data base L<http://www.unicode.org/ivd>.

=item F<TangutSources.txt>

Specifies source mappings for Tangut ideographs and components.  This data file also includes informative radical-stroke values that are used internally by Unicode.

=item F<USourceData.txt>

Documentation of status and cross reference of proposals for encoding by Unicode of Unihan characters.

=item F<USourceGlyphs.pdf>

Pictures of the characters in F<USourceData.txt>.

=back


=head1 SEE ALSO

L<http://www.unicode.org/reports/tr44/>

L<perlrecharclass>

L<perlunicode>
