xref: /openbsd/gnu/usr.bin/perl/pod/perluniprops.pod (revision fd823929)
1=begin comment
2
3# !!!!!!!   DO NOT EDIT THIS FILE   !!!!!!!
4# This file is machine-generated by lib/unicore/mktables from the Unicode
5# database, Version 15.0.0.  Any changes made here will be lost!
6
7
8To change this file, edit lib/unicore/mktables instead.
9
10=end comment
11
12=head1 NAME
13
14perluniprops - Index of Unicode Version 15.0.0 character properties in Perl
15
16=head1 DESCRIPTION
17
18This document provides information about the portion of the Unicode database
19that deals with character properties, that is the portion that is defined on
20single code points.  (L</Other information in the Unicode data base>
21below briefly mentions other data that Unicode provides.)
22
23Perl can provide access to all non-provisional Unicode character properties,
24though not all are enabled by default.  The omitted ones are the Unihan
25properties and certain
26deprecated or Unicode-internal properties.  (An installation may choose to
27recompile Perl's tables to change this.  See L</Unicode character
28properties that are NOT accepted by Perl>.)
29
30For most purposes, access to Unicode properties from the Perl core is through
31regular expression matches, as described in the next section.
32For some special purposes, and to access the properties that are not suitable
33for regular expression matching, all the Unicode character properties that
34Perl handles are accessible via the standard L<Unicode::UCD> module, as
35described in the section L</Properties accessible through Unicode::UCD>.
36
37Perl also provides some additional extensions and short-cut synonyms
38for Unicode properties.
39
40This document merely lists all available properties and does not attempt to
41explain what each property really means.  There is a brief description of each
42Perl extension; see L<perlunicode/Other Properties> for more information on
43these.  There is some detail about Blocks, Scripts, General_Category,
44and Bidi_Class in L<perlunicode>, but to find out about the intricacies of the
45official Unicode properties, refer to the Unicode standard.  A good starting
46place is L<http://www.unicode.org/reports/tr44/>.
47
48Note that you can define your own properties; see
49L<perlunicode/"User-Defined Character Properties">.
50
51=head1 Properties accessible through C<\p{}> and C<\P{}>
52
53The Perl regular expression C<\p{}> and C<\P{}> constructs give access to
54most of the Unicode character properties.  The table below shows all these
55constructs, both single and compound forms.
56
57B<Compound forms> consist of two components, separated by an equals sign or a
58colon.  The first component is the property name, and the second component is
59the particular value of the property to match against, for example,
60C<\p{Script_Extensions: Greek}> and C<\p{Script_Extensions=Greek}> both mean
61to match characters whose Script_Extensions property value is Greek.
62(C<Script_Extensions> is an improved version of the C<Script> property.)
63
64B<Single forms>, like C<\p{Greek}>, are mostly Perl-defined shortcuts for
65their equivalent compound forms.  The table shows these equivalences.  (In our
66example, C<\p{Greek}> is a just a shortcut for
67C<\p{Script_Extensions=Greek}>).  There are also a few Perl-defined single
68forms that are not shortcuts for a compound form.  One such is C<\p{Word}>.
69These are also listed in the table.
70
71In parsing these constructs, Perl always ignores Upper/lower case differences
72everywhere within the {braces}.  Thus C<\p{Greek}> means the same thing as
73C<\p{greek}>.  But note that changing the case of the C<"p"> or C<"P"> before
74the left brace completely changes the meaning of the construct, from "match"
75(for C<\p{}>) to "doesn't match" (for C<\P{}>).  Casing in this document is
76for improved legibility.
77
78Also, white space, hyphens, and underscores are normally ignored
79everywhere between the {braces}, and hence can be freely added or removed
80even if the C</x> modifier hasn't been specified on the regular expression.
81But in the table below a 'B<T>' at the beginning of an entry
82means that tighter (stricter) rules are used for that entry:
83
84=over 4
85
86=over 4
87
88=item Single form (C<\p{name}>) tighter rules:
89
90White space, hyphens, and underscores ARE significant
91except for:
92
93=over 4
94
95=item * white space adjacent to a non-word character
96
97=item * underscores separating digits in numbers
98
99=back
100
101That means, for example, that you can freely add or remove white space
102adjacent to (but within) the braces without affecting the meaning.
103
104=item Compound form (C<\p{name=value}> or C<\p{name:value}>) tighter rules:
105
106The tighter rules given above for the single form apply to everything to the
107right of the colon or equals; the looser rules still apply to everything to
108the left.
109
110That means, for example, that you can freely add or remove white space
111adjacent to (but within) the braces and the colon or equal sign.
112
113=back
114
115=back
116
117Some properties are considered obsolete by Unicode, but still available.
118There are several varieties of obsolescence:
119
120=over 4
121
122=over 4
123
124=item Stabilized
125
126A property may be stabilized.  Such a determination does not indicate
127that the property should or should not be used; instead it is a declaration
128that the property will not be maintained nor extended for newly encoded
129characters.  Such properties are marked with an 'B<S>' in the
130table.
131
132=item Deprecated
133
134A property may be deprecated, perhaps because its original intent
135has been replaced by another property, or because its specification was
136somehow defective.  This means that its use is strongly
137discouraged, so much so that a warning will be issued if used, unless the
138regular expression is in the scope of a C<S<no warnings 'deprecated'>>
139statement.  A 'B<D>' flags each such entry in the table, and
140the entry there for the longest, most descriptive version of the property will
141give the reason it is deprecated, and perhaps advice.  Perl may issue such a
142warning, even for properties that aren't officially deprecated by Unicode,
143when there used to be characters or code points that were matched by them, but
144no longer.  This is to warn you that your program may not work like it did on
145earlier Unicode releases.
146
147A deprecated property may be made unavailable in a future Perl version, so it
148is best to move away from them.
149
150A deprecated property may also be stabilized, but this fact is not shown.
151
152=item Obsolete
153
154Properties marked with an 'B<O>' in the table are considered (plain)
155obsolete.  Generally this designation is given to properties that Unicode once
156used for internal purposes (but not any longer).
157
158=item Discouraged
159
160This is not actually a Unicode-specified obsolescence, but applies to certain
161Perl extensions that are present for backwards compatibility, but are
162discouraged from being used.  These are not obsolete, but their meanings are
163not stable.  Future Unicode versions could force any of these extensions to be
164removed without warning, replaced by another property with the same name that
165means something different.  An 'B<X>' flags each such entry in the
166table.  Use the equivalent shown instead.
167
168
169In particular, matches in the Block property have single forms
170defined by Perl that begin with C<"In_">, C<"Is_>, or even with no prefix at
171all,  Like all B<DISCOURAGED> forms, these are not stable.  For example,
172C<\p{Block=Deseret}> can currently be written as C<\p{In_Deseret}>,
173C<\p{Is_Deseret}>, or C<\p{Deseret}>.  But, a new Unicode version may
174come along that would force Perl to change the meaning of one or more of
175these, and your program would no longer be correct.  Currently there are no
176such conflicts with the form that begins C<"In_">, but there are many with the
177other two shortcuts, and Unicode continues to define new properties that begin
178with C<"In">, so it's quite possible that a conflict will occur in the future.
179The compound form is guaranteed to not become obsolete, and its meaning is
180clearer anyway.  See L<perlunicode/"Blocks"> for more information about this.
181
182User-defined properties must begin with "In" or "Is".  These override any
183Unicode property of the same name.
184
185
186=back
187
188=back
189
190The table below has two columns.  The left column contains the C<\p{}>
191constructs to look up, possibly preceded by the flags mentioned above; and
192the right column contains information about them, like a description, or
193synonyms.  The table shows both the single and compound forms for each
194property that has them.  If the left column is a short name for a property,
195the right column will give its longer, more descriptive name; and if the left
196column is the longest name, the right column will show any equivalent shortest
197name, in both single and compound forms if applicable.
198
199If braces are not needed to specify a property (e.g., C<\pL>), the left
200column contains both forms, with and without braces.
201
202The right column will also caution you if a property means something different
203than what might normally be expected.
204
205All single forms are Perl extensions; a few compound forms are as well, and
206are noted as such.
207
208Numbers in (parentheses) indicate the total number of Unicode code points
209matched by the property.  For the entries that give the longest, most
210descriptive version of the property, the count is followed by a list of some
211of the code points matched by it.  The list includes all the matched
212characters in the 0-255 range, enclosed in the familiar [brackets] the same as
213a regular expression bracketed character class.  Following that, the next few
214higher matching ranges are also given.  To avoid visual ambiguity, the SPACE
215character is represented as C<\x20>.
216
217For emphasis, those properties that match no code points at all are listed as
218well in a separate section following the table.
219
220Most properties match the same code points regardless of whether C<"/i">
221case-insensitive matching is specified or not.  But a few properties are
222affected.  These are shown with the notation S<C<(/i= I<other_property>)>>
223in the second column.  Under case-insensitive matching they match the
224same code pode points as the property I<other_property>.
225
226There is no description given for most non-Perl defined properties (See
227L<http://www.unicode.org/reports/tr44/> for that).
228
229For compactness, 'B<*>' is used as a wildcard instead of showing all possible
230combinations.  For example, entries like:
231
232 \p{Gc: *}                                  \p{General_Category: *}
233
234mean that 'Gc' is a synonym for 'General_Category', and anything that is valid
235for the latter is also valid for the former.  Similarly,
236
237 \p{Is_*}                                   \p{*}
238
239means that if and only if, for example, C<\p{Foo}> exists, then
240C<\p{Is_Foo}> and C<\p{IsFoo}> are also valid and all mean the same thing.
241And similarly, C<\p{Foo=Bar}> means the same as C<\p{Is_Foo=Bar}> and
242C<\p{IsFoo=Bar}>.  "*" here is restricted to something not beginning with an
243underscore.
244
245Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'.
246And 'No', 'F', and 'False' are all synonyms for 'N'.  The table shows 'Y*' and
247'N*' to indicate this, and doesn't have separate entries for the other
248possibilities.  Note that not all properties which have values 'Yes' and 'No'
249are binary, and they have all their values spelled out without using this wild
250card, and a C<NOT> clause in their description that highlights their not being
251binary.  These also require the compound form to match them, whereas true
252binary properties have both single and compound forms available.
253
254Note that all non-essential underscores are removed in the display of the
255short names below.
256
257B<Legend summary:>
258
259=over 4
260
261=item Z<>B<*> is a wild-card
262
263=item B<(\d+)> in the info column gives the number of Unicode code points matched
264by this property.
265
266=item B<D> means this is deprecated.
267
268=item B<O> means this is obsolete.
269
270=item B<S> means this is stabilized.
271
272=item B<T> means tighter (stricter) name matching applies.
273
274=item B<X> means use of this form is discouraged, and may not be
275stable.
276
277=back
278
279       NAME                           INFO
280
281   \p{Adlam}               \p{Script_Extensions=Adlam} (Short:
282                             \p{Adlm}; NOT \p{Block=Adlam}) (90)
283   \p{Adlm}                \p{Adlam} (= \p{Script_Extensions=Adlam})
284                             (NOT \p{Block=Adlam}) (90)
285 X \p{Aegean_Numbers}      \p{Block=Aegean_Numbers} (64)
286 T \p{Age: 1.1}            \p{Age=V1_1} (33_979)
287   \p{Age: V1_1}           Code point's usage introduced in version
288                             1.1 (33_979: U+0000..01F5, U+01FA..0217,
289                             U+0250..02A8, U+02B0..02DE,
290                             U+02E0..02E9, U+0300..0345 ...)
291 T \p{Age: 2.0}            \p{Age=V2_0} (144_521)
292   \p{Age: V2_0}           Code point's usage was introduced in
293                             version 2.0; See also Property
294                             'Present_In' (144_521: U+0591..05A1,
295                             U+05A3..05AF, U+05C4, U+0F00..0F47,
296                             U+0F49..0F69, U+0F71..0F8B ...)
297 T \p{Age: 2.1}            \p{Age=V2_1} (2)
298   \p{Age: V2_1}           Code point's usage was introduced in
299                             version 2.1; See also Property
300                             'Present_In' (2: U+20AC, U+FFFC)
301 T \p{Age: 3.0}            \p{Age=V3_0} (10_307)
302   \p{Age: V3_0}           Code point's usage was introduced in
303                             version 3.0; See also Property
304                             'Present_In' (10_307: U+01F6..01F9,
305                             U+0218..021F, U+0222..0233,
306                             U+02A9..02AD, U+02DF, U+02EA..02EE ...)
307 T \p{Age: 3.1}            \p{Age=V3_1} (44_978)
308   \p{Age: V3_1}           Code point's usage was introduced in
309                             version 3.1; See also Property
310                             'Present_In' (44_978: U+03F4..03F5,
311                             U+FDD0..FDEF, U+10300..1031E,
312                             U+10320..10323, U+10330..1034A,
313                             U+10400..10425 ...)
314 T \p{Age: 3.2}            \p{Age=V3_2} (1016)
315   \p{Age: V3_2}           Code point's usage was introduced in
316                             version 3.2; See also Property
317                             'Present_In' (1016: U+0220, U+034F,
318                             U+0363..036F, U+03D8..03D9, U+03F6,
319                             U+048A..048B ...)
320 T \p{Age: 4.0}            \p{Age=V4_0} (1226)
321   \p{Age: V4_0}           Code point's usage was introduced in
322                             version 4.0; See also Property
323                             'Present_In' (1226: U+0221,
324                             U+0234..0236, U+02AE..02AF,
325                             U+02EF..02FF, U+0350..0357, U+035D..035F
326                             ...)
327 T \p{Age: 4.1}            \p{Age=V4_1} (1273)
328   \p{Age: V4_1}           Code point's usage was introduced in
329                             version 4.1; See also Property
330                             'Present_In' (1273: U+0237..0241,
331                             U+0358..035C, U+03FC..03FF,
332                             U+04F6..04F7, U+05A2, U+05C5..05C7 ...)
333 T \p{Age: 5.0}            \p{Age=V5_0} (1369)
334   \p{Age: V5_0}           Code point's usage was introduced in
335                             version 5.0; See also Property
336                             'Present_In' (1369: U+0242..024F,
337                             U+037B..037D, U+04CF, U+04FA..04FF,
338                             U+0510..0513, U+05BA ...)
339 T \p{Age: 5.1}            \p{Age=V5_1} (1624)
340   \p{Age: V5_1}           Code point's usage was introduced in
341                             version 5.1; See also Property
342                             'Present_In' (1624: U+0370..0373,
343                             U+0376..0377, U+03CF, U+0487,
344                             U+0514..0523, U+0606..060A ...)
345 T \p{Age: 5.2}            \p{Age=V5_2} (6648)
346   \p{Age: V5_2}           Code point's usage was introduced in
347                             version 5.2; See also Property
348                             'Present_In' (6648: U+0524..0525,
349                             U+0800..082D, U+0830..083E, U+0900,
350                             U+094E, U+0955 ...)
351 T \p{Age: 6.0}            \p{Age=V6_0} (2088)
352   \p{Age: V6_0}           Code point's usage was introduced in
353                             version 6.0; See also Property
354                             'Present_In' (2088: U+0526..0527,
355                             U+0620, U+065F, U+0840..085B, U+085E,
356                             U+093A..093B ...)
357 T \p{Age: 6.1}            \p{Age=V6_1} (732)
358   \p{Age: V6_1}           Code point's usage was introduced in
359                             version 6.1; See also Property
360                             'Present_In' (732: U+058F, U+0604,
361                             U+08A0, U+08A2..08AC, U+08E4..08FE,
362                             U+0AF0 ...)
363 T \p{Age: 6.2}            \p{Age=V6_2} (1)
364   \p{Age: V6_2}           Code point's usage was introduced in
365                             version 6.2; See also Property
366                             'Present_In' (1: U+20BA)
367 T \p{Age: 6.3}            \p{Age=V6_3} (5)
368   \p{Age: V6_3}           Code point's usage was introduced in
369                             version 6.3; See also Property
370                             'Present_In' (5: U+061C, U+2066..2069)
371 T \p{Age: 7.0}            \p{Age=V7_0} (2834)
372   \p{Age: V7_0}           Code point's usage was introduced in
373                             version 7.0; See also Property
374                             'Present_In' (2834: U+037F,
375                             U+0528..052F, U+058D..058E, U+0605,
376                             U+08A1, U+08AD..08B2 ...)
377 T \p{Age: 8.0}            \p{Age=V8_0} (7716)
378   \p{Age: V8_0}           Code point's usage was introduced in
379                             version 8.0; See also Property
380                             'Present_In' (7716: U+08B3..08B4,
381                             U+08E3, U+0AF9, U+0C5A, U+0D5F, U+13F5
382                             ...)
383 T \p{Age: 9.0}            \p{Age=V9_0} (7500)
384   \p{Age: V9_0}           Code point's usage was introduced in
385                             version 9.0; See also Property
386                             'Present_In' (7500: U+08B6..08BD,
387                             U+08D4..08E2, U+0C80, U+0D4F,
388                             U+0D54..0D56, U+0D58..0D5E ...)
389 T \p{Age: 10.0}           \p{Age=V10_0} (8518)
390   \p{Age: V10_0}          Code point's usage was introduced in
391                             version 10.0; See also Property
392                             'Present_In' (8518: U+0860..086A,
393                             U+09FC..09FD, U+0AFA..0AFF, U+0D00,
394                             U+0D3B..0D3C, U+1CF7 ...)
395 T \p{Age: 11.0}           \p{Age=V11_0} (684)
396   \p{Age: V11_0}          Code point's usage was introduced in
397                             version 11.0; See also Property
398                             'Present_In' (684: U+0560, U+0588,
399                             U+05EF, U+07FD..07FF, U+08D3, U+09FE ...)
400 T \p{Age: 12.0}           \p{Age=V12_0} (554)
401   \p{Age: V12_0}          Code point's usage was introduced in
402                             version 12.0; See also Property
403                             'Present_In' (554: U+0C77, U+0E86,
404                             U+0E89, U+0E8C, U+0E8E..0E93, U+0E98 ...)
405 T \p{Age: 12.1}           \p{Age=V12_1} (1)
406   \p{Age: V12_1}          Code point's usage was introduced in
407                             version 12.1; See also Property
408                             'Present_In' (1: U+32FF)
409 T \p{Age: 13.0}           \p{Age=V13_0} (5930)
410   \p{Age: V13_0}          Code point's usage was introduced in
411                             version 13.0; See also Property
412                             'Present_In' (5930: U+08BE..08C7,
413                             U+0B55, U+0D04, U+0D81, U+1ABF..1AC0,
414                             U+2B97 ...)
415 T \p{Age: 14.0}           \p{Age=V14_0} (838)
416   \p{Age: V14_0}          Code point's usage was introduced in
417                             version 14.0; See also Property
418                             'Present_In' (838: U+061D, U+0870..088E,
419                             U+0890..0891, U+0898..089F, U+08B5,
420                             U+08C8..08D2 ...)
421 T \p{Age: 15.0}           \p{Age=V15_0} (4489)
422   \p{Age: V15_0}          Code point's usage was introduced in
423                             version 15.0; See also Property
424                             'Present_In' (4489: U+0CF3, U+0ECE,
425                             U+10EFD..10EFF, U+1123F..11241,
426                             U+11B00..11B09, U+11F00..11F10 ...)
427   \p{Age: NA}             \p{Age=Unassigned} (825_279 plus all
428                             above-Unicode code points)
429   \p{Age: Unassigned}     Code point's usage has not been assigned
430                             in any Unicode release thus far.
431 (Short: \p{Age=NA}) (825_279 plus all above-Unicode code points:
432                             U+0378..0379, U+0380..0383, U+038B,
433                             U+038D, U+03A2, U+0530 ...)
434   \p{Aghb}                \p{Caucasian_Albanian} (=
435                             \p{Script_Extensions=
436                             Caucasian_Albanian}) (NOT \p{Block=
437                             Caucasian_Albanian}) (53)
438   \p{AHex}                \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
439                             (22)
440   \p{AHex: *}             \p{ASCII_Hex_Digit: *}
441   \p{Ahom}                \p{Script_Extensions=Ahom} (NOT \p{Block=
442                             Ahom}) (65)
443 X \p{Alchemical}          \p{Alchemical_Symbols} (= \p{Block=
444                             Alchemical_Symbols}) (128)
445 X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short:
446                             \p{InAlchemical}) (128)
447   \p{All}                 All code points, including those above
448                             Unicode.  Same as qr/./s (1_114_112 plus
449                             all above-Unicode code points:
450                             U+0000..infinity)
451   \p{Alnum}               \p{XPosixAlnum} (138_445)
452   \p{Alpha}               \p{XPosixAlpha} (= \p{Alphabetic=Y})
453                             (137_765)
454   \p{Alpha: *}            \p{Alphabetic: *}
455   \p{Alphabetic}          \p{XPosixAlpha} (= \p{Alphabetic=Y})
456                             (137_765)
457   \p{Alphabetic: N*}      (Short: \p{Alpha=N}, \P{Alpha}) (976_347
458                             plus all above-Unicode code points:
459                             [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
460                             >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
461                             \xb6-\xb9\xbb-\xbf\xd7\xf7],
462                             U+02C2..02C5, U+02D2..02DF,
463                             U+02E5..02EB, U+02ED, U+02EF..0344 ...)
464   \p{Alphabetic: Y*}      (Short: \p{Alpha=Y}, \p{Alpha}) (137_765:
465                             [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
466                             \xf8-\xff], U+0100..02C1, U+02C6..02D1,
467                             U+02E0..02E4, U+02EC, U+02EE ...)
468 X \p{Alphabetic_PF}       \p{Alphabetic_Presentation_Forms} (=
469                             \p{Block=Alphabetic_Presentation_Forms})
470                             (80)
471 X \p{Alphabetic_Presentation_Forms} \p{Block=
472                             Alphabetic_Presentation_Forms} (Short:
473                             \p{InAlphabeticPF}) (80)
474   \p{Anatolian_Hieroglyphs} \p{Script_Extensions=
475                             Anatolian_Hieroglyphs} (Short: \p{Hluw};
476                             NOT \p{Block=Anatolian_Hieroglyphs})
477                             (583)
478 X \p{Ancient_Greek_Music} \p{Ancient_Greek_Musical_Notation} (=
479                             \p{Block=
480                             Ancient_Greek_Musical_Notation}) (80)
481 X \p{Ancient_Greek_Musical_Notation} \p{Block=
482                             Ancient_Greek_Musical_Notation} (Short:
483                             \p{InAncientGreekMusic}) (80)
484 X \p{Ancient_Greek_Numbers} \p{Block=Ancient_Greek_Numbers} (80)
485 X \p{Ancient_Symbols}     \p{Block=Ancient_Symbols} (64)
486   \p{Any}                 All Unicode code points (1_114_112:
487                             U+0000..10FFFF)
488   \p{Arab}                \p{Arabic} (= \p{Script_Extensions=
489                             Arabic}) (NOT \p{Block=Arabic}) (1414)
490   \p{Arabic}              \p{Script_Extensions=Arabic} (Short:
491                             \p{Arab}; NOT \p{Block=Arabic}) (1414)
492 X \p{Arabic_Ext_A}        \p{Arabic_Extended_A} (= \p{Block=
493                             Arabic_Extended_A}) (96)
494 X \p{Arabic_Ext_B}        \p{Arabic_Extended_B} (= \p{Block=
495                             Arabic_Extended_B}) (48)
496 X \p{Arabic_Ext_C}        \p{Arabic_Extended_C} (= \p{Block=
497                             Arabic_Extended_C}) (64)
498 X \p{Arabic_Extended_A}   \p{Block=Arabic_Extended_A} (Short:
499                             \p{InArabicExtA}) (96)
500 X \p{Arabic_Extended_B}   \p{Block=Arabic_Extended_B} (Short:
501                             \p{InArabicExtB}) (48)
502 X \p{Arabic_Extended_C}   \p{Block=Arabic_Extended_C} (Short:
503                             \p{InArabicExtC}) (64)
504 X \p{Arabic_Math}         \p{Arabic_Mathematical_Alphabetic_Symbols}
505                             (= \p{Block=
506                             Arabic_Mathematical_Alphabetic_Symbols})
507                             (256)
508 X \p{Arabic_Mathematical_Alphabetic_Symbols} \p{Block=
509                             Arabic_Mathematical_Alphabetic_Symbols}
510                             (Short: \p{InArabicMath}) (256)
511 X \p{Arabic_PF_A}         \p{Arabic_Presentation_Forms_A} (=
512                             \p{Block=Arabic_Presentation_Forms_A})
513                             (688)
514 X \p{Arabic_PF_B}         \p{Arabic_Presentation_Forms_B} (=
515                             \p{Block=Arabic_Presentation_Forms_B})
516                             (144)
517 X \p{Arabic_Presentation_Forms_A} \p{Block=
518                             Arabic_Presentation_Forms_A} (Short:
519                             \p{InArabicPFA}) (688)
520 X \p{Arabic_Presentation_Forms_B} \p{Block=
521                             Arabic_Presentation_Forms_B} (Short:
522                             \p{InArabicPFB}) (144)
523 X \p{Arabic_Sup}          \p{Arabic_Supplement} (= \p{Block=
524                             Arabic_Supplement}) (48)
525 X \p{Arabic_Supplement}   \p{Block=Arabic_Supplement} (Short:
526                             \p{InArabicSup}) (48)
527   \p{Armenian}            \p{Script_Extensions=Armenian} (Short:
528                             \p{Armn}; NOT \p{Block=Armenian}) (96)
529   \p{Armi}                \p{Imperial_Aramaic} (=
530                             \p{Script_Extensions=Imperial_Aramaic})
531                             (NOT \p{Block=Imperial_Aramaic}) (31)
532   \p{Armn}                \p{Armenian} (= \p{Script_Extensions=
533                             Armenian}) (NOT \p{Block=Armenian}) (96)
534 X \p{Arrows}              \p{Block=Arrows} (112)
535   \p{ASCII}               \p{Block=Basic_Latin} (128)
536   \p{ASCII_Hex_Digit}     \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y})
537                             (22)
538   \p{ASCII_Hex_Digit: N*} (Short: \p{AHex=N}, \P{AHex}) (1_114_090
539                             plus all above-Unicode code points:
540                             [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
541                             \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
542                             U+0100..infinity)
543   \p{ASCII_Hex_Digit: Y*} (Short: \p{AHex=Y}, \p{AHex}) (22: [0-9A-
544                             Fa-f])
545   \p{Assigned}            All assigned code points (288_767:
546                             U+0000..0377, U+037A..037F,
547                             U+0384..038A, U+038C, U+038E..03A1,
548                             U+03A3..052F ...)
549   \p{Avestan}             \p{Script_Extensions=Avestan} (Short:
550                             \p{Avst}; NOT \p{Block=Avestan}) (61)
551   \p{Avst}                \p{Avestan} (= \p{Script_Extensions=
552                             Avestan}) (NOT \p{Block=Avestan}) (61)
553   \p{Bali}                \p{Balinese} (= \p{Script_Extensions=
554                             Balinese}) (NOT \p{Block=Balinese}) (124)
555   \p{Balinese}            \p{Script_Extensions=Balinese} (Short:
556                             \p{Bali}; NOT \p{Block=Balinese}) (124)
557   \p{Bamu}                \p{Bamum} (= \p{Script_Extensions=Bamum})
558                             (NOT \p{Block=Bamum}) (657)
559   \p{Bamum}               \p{Script_Extensions=Bamum} (Short:
560                             \p{Bamu}; NOT \p{Block=Bamum}) (657)
561 X \p{Bamum_Sup}           \p{Bamum_Supplement} (= \p{Block=
562                             Bamum_Supplement}) (576)
563 X \p{Bamum_Supplement}    \p{Block=Bamum_Supplement} (Short:
564                             \p{InBamumSup}) (576)
565 X \p{Basic_Latin}         \p{ASCII} (= \p{Block=Basic_Latin}) (128)
566   \p{Bass}                \p{Bassa_Vah} (= \p{Script_Extensions=
567                             Bassa_Vah}) (NOT \p{Block=Bassa_Vah})
568                             (36)
569   \p{Bassa_Vah}           \p{Script_Extensions=Bassa_Vah} (Short:
570                             \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
571   \p{Batak}               \p{Script_Extensions=Batak} (Short:
572                             \p{Batk}; NOT \p{Block=Batak}) (56)
573   \p{Batk}                \p{Batak} (= \p{Script_Extensions=Batak})
574                             (NOT \p{Block=Batak}) (56)
575   \p{Bc: *}               \p{Bidi_Class: *}
576   \p{Beng}                \p{Bengali} (= \p{Script_Extensions=
577                             Bengali}) (NOT \p{Block=Bengali}) (113)
578   \p{Bengali}             \p{Script_Extensions=Bengali} (Short:
579                             \p{Beng}; NOT \p{Block=Bengali}) (113)
580   \p{Bhaiksuki}           \p{Script_Extensions=Bhaiksuki} (Short:
581                             \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
582   \p{Bhks}                \p{Bhaiksuki} (= \p{Script_Extensions=
583                             Bhaiksuki}) (NOT \p{Block=Bhaiksuki})
584                             (97)
585   \p{Bidi_C}              \p{Bidi_Control} (= \p{Bidi_Control=Y})
586                             (12)
587   \p{Bidi_C: *}           \p{Bidi_Control: *}
588   \p{Bidi_Class: AL}      \p{Bidi_Class=Arabic_Letter} (1769)
589   \p{Bidi_Class: AN}      \p{Bidi_Class=Arabic_Number} (63)
590   \p{Bidi_Class: Arabic_Letter} (Short: \p{Bc=AL}) (1769: U+0608,
591                             U+060B, U+060D, U+061B..064A,
592                             U+066D..066F, U+0671..06D5 ...)
593   \p{Bidi_Class: Arabic_Number} (Short: \p{Bc=AN}) (63:
594                             U+0600..0605, U+0660..0669,
595                             U+066B..066C, U+06DD, U+0890..0891,
596                             U+08E2 ...)
597   \p{Bidi_Class: B}       \p{Bidi_Class=Paragraph_Separator} (7)
598   \p{Bidi_Class: BN}      \p{Bidi_Class=Boundary_Neutral} (4016)
599   \p{Bidi_Class: Boundary_Neutral} (Short: \p{Bc=BN}) (4016: [^\t\n
600                             \cK\f\r\x1c-\x7e\x85\xa0-\xac\xae-\xff],
601                             U+180E, U+200B..200D, U+2060..2065,
602                             U+206A..206F, U+FDD0..FDEF ...)
603   \p{Bidi_Class: Common_Separator} (Short: \p{Bc=CS}) (15: [,.\/:
604                             \xa0], U+060C, U+202F, U+2044, U+FE50,
605                             U+FE52 ...)
606   \p{Bidi_Class: CS}      \p{Bidi_Class=Common_Separator} (15)
607   \p{Bidi_Class: EN}      \p{Bidi_Class=European_Number} (168)
608   \p{Bidi_Class: ES}      \p{Bidi_Class=European_Separator} (12)
609   \p{Bidi_Class: ET}      \p{Bidi_Class=European_Terminator} (92)
610   \p{Bidi_Class: European_Number} (Short: \p{Bc=EN}) (168: [0-9\xb2-
611                             \xb3\xb9], U+06F0..06F9, U+2070,
612                             U+2074..2079, U+2080..2089, U+2488..249B
613                             ...)
614   \p{Bidi_Class: European_Separator} (Short: \p{Bc=ES}) (12: [+\-],
615                             U+207A..207B, U+208A..208B, U+2212,
616                             U+FB29, U+FE62..FE63 ...)
617   \p{Bidi_Class: European_Terminator} (Short: \p{Bc=ET}) (92: [#\$
618                             \%\xa2-\xa5\xb0-\xb1], U+058F,
619                             U+0609..060A, U+066A, U+09F2..09F3,
620                             U+09FB ...)
621   \p{Bidi_Class: First_Strong_Isolate} (Short: \p{Bc=FSI}) (1:
622                             U+2068)
623   \p{Bidi_Class: FSI}     \p{Bidi_Class=First_Strong_Isolate} (1)
624   \p{Bidi_Class: L}       \p{Bidi_Class=Left_To_Right} (1_096_272
625                             plus all above-Unicode code points)
626   \p{Bidi_Class: Left_To_Right} (Short: \p{Bc=L}) (1_096_272 plus
627                             all above-Unicode code points: [A-Za-z
628                             \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
629                             \xff], U+0100..02B8, U+02BB..02C1,
630                             U+02D0..02D1, U+02E0..02E4, U+02EE ...)
631   \p{Bidi_Class: Left_To_Right_Embedding} (Short: \p{Bc=LRE}) (1:
632                             U+202A)
633   \p{Bidi_Class: Left_To_Right_Isolate} (Short: \p{Bc=LRI}) (1:
634                             U+2066)
635   \p{Bidi_Class: Left_To_Right_Override} (Short: \p{Bc=LRO}) (1:
636                             U+202D)
637   \p{Bidi_Class: LRE}     \p{Bidi_Class=Left_To_Right_Embedding} (1)
638   \p{Bidi_Class: LRI}     \p{Bidi_Class=Left_To_Right_Isolate} (1)
639   \p{Bidi_Class: LRO}     \p{Bidi_Class=Left_To_Right_Override} (1)
640   \p{Bidi_Class: Nonspacing_Mark} (Short: \p{Bc=NSM}) (1993:
641                             U+0300..036F, U+0483..0489,
642                             U+0591..05BD, U+05BF, U+05C1..05C2,
643                             U+05C4..05C5 ...)
644   \p{Bidi_Class: NSM}     \p{Bidi_Class=Nonspacing_Mark} (1993)
645   \p{Bidi_Class: ON}      \p{Bidi_Class=Other_Neutral} (6029)
646   \p{Bidi_Class: Other_Neutral} (Short: \p{Bc=ON}) (6029: [!\"&\'
647                             \(\)*;<=>?\@\[\\\]\^_`\{\|\}~\xa1\xa6-
648                             \xa9\xab-\xac\xae-\xaf\xb4\xb6-\xb8\xbb-
649                             \xbf\xd7\xf7], U+02B9..02BA,
650                             U+02C2..02CF, U+02D2..02DF,
651                             U+02E5..02ED, U+02EF..02FF ...)
652   \p{Bidi_Class: Paragraph_Separator} (Short: \p{Bc=B}) (7: [\n\r
653                             \x1c-\x1e\x85], U+2029)
654   \p{Bidi_Class: PDF}     \p{Bidi_Class=Pop_Directional_Format} (1)
655   \p{Bidi_Class: PDI}     \p{Bidi_Class=Pop_Directional_Isolate} (1)
656   \p{Bidi_Class: Pop_Directional_Format} (Short: \p{Bc=PDF}) (1:
657                             U+202C)
658   \p{Bidi_Class: Pop_Directional_Isolate} (Short: \p{Bc=PDI}) (1:
659                             U+2069)
660   \p{Bidi_Class: R}       \p{Bidi_Class=Right_To_Left} (3647)
661   \p{Bidi_Class: Right_To_Left} (Short: \p{Bc=R}) (3647: U+0590,
662                             U+05BE, U+05C0, U+05C3, U+05C6,
663                             U+05C8..05FF ...)
664   \p{Bidi_Class: Right_To_Left_Embedding} (Short: \p{Bc=RLE}) (1:
665                             U+202B)
666   \p{Bidi_Class: Right_To_Left_Isolate} (Short: \p{Bc=RLI}) (1:
667                             U+2067)
668   \p{Bidi_Class: Right_To_Left_Override} (Short: \p{Bc=RLO}) (1:
669                             U+202E)
670   \p{Bidi_Class: RLE}     \p{Bidi_Class=Right_To_Left_Embedding} (1)
671   \p{Bidi_Class: RLI}     \p{Bidi_Class=Right_To_Left_Isolate} (1)
672   \p{Bidi_Class: RLO}     \p{Bidi_Class=Right_To_Left_Override} (1)
673   \p{Bidi_Class: S}       \p{Bidi_Class=Segment_Separator} (3)
674   \p{Bidi_Class: Segment_Separator} (Short: \p{Bc=S}) (3: [\t\cK
675                             \x1f])
676   \p{Bidi_Class: White_Space} (Short: \p{Bc=WS}) (17: [\f\x20],
677                             U+1680, U+2000..200A, U+2028, U+205F,
678                             U+3000)
679   \p{Bidi_Class: WS}      \p{Bidi_Class=White_Space} (17)
680   \p{Bidi_Control}        \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
681   \p{Bidi_Control: N*}    (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100
682                             plus all above-Unicode code points:
683                             U+0000..061B, U+061D..200D,
684                             U+2010..2029, U+202F..2065,
685                             U+206A..infinity)
686   \p{Bidi_Control: Y*}    (Short: \p{BidiC=Y}, \p{BidiC}) (12:
687                             U+061C, U+200E..200F, U+202A..202E,
688                             U+2066..2069)
689   \p{Bidi_M}              \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y})
690                             (553)
691   \p{Bidi_M: *}           \p{Bidi_Mirrored: *}
692   \p{Bidi_Mirrored}       \p{Bidi_Mirrored=Y} (Short: \p{BidiM})
693                             (553)
694   \p{Bidi_Mirrored: N*}   (Short: \p{BidiM=N}, \P{BidiM}) (1_113_559
695                             plus all above-Unicode code points:
696                             [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;=?\@A-
697                             Z\\\^_`a-z\|~\x7f-\xaa\xac-\xba\xbc-
698                             \xff], U+0100..0F39, U+0F3E..169A,
699                             U+169D..2038, U+203B..2044, U+2047..207C
700                             ...)
701   \p{Bidi_Mirrored: Y*}   (Short: \p{BidiM=Y}, \p{BidiM}) (553:
702                             [\(\)<>\[\]\{\}\xab\xbb], U+0F3A..0F3D,
703                             U+169B..169C, U+2039..203A,
704                             U+2045..2046, U+207D..207E ...)
705   \p{Bidi_Paired_Bracket_Type: C} \p{Bidi_Paired_Bracket_Type=Close}
706                             (64)
707   \p{Bidi_Paired_Bracket_Type: Close} (Short: \p{Bpt=C}) (64: [\)\]
708                             \}], U+0F3B, U+0F3D, U+169C, U+2046,
709                             U+207E ...)
710   \p{Bidi_Paired_Bracket_Type: N} \p{Bidi_Paired_Bracket_Type=None}
711                             (1_113_984 plus all above-Unicode code
712                             points)
713   \p{Bidi_Paired_Bracket_Type: None} (Short: \p{Bpt=N}) (1_113_984
714                             plus all above-Unicode code points:
715                             [\x00-\x20!\"#\$\%&\'*+,\-.\/0-9:;<=>?
716                             \@A-Z\\\^_`a-z\|~\x7f-\xff],
717                             U+0100..0F39, U+0F3E..169A,
718                             U+169D..2044, U+2047..207C, U+207F..208C
719                             ...)
720   \p{Bidi_Paired_Bracket_Type: O} \p{Bidi_Paired_Bracket_Type=Open}
721                             (64)
722   \p{Bidi_Paired_Bracket_Type: Open} (Short: \p{Bpt=O}) (64:
723                             [\(\[\{], U+0F3A, U+0F3C, U+169B,
724                             U+2045, U+207D ...)
725   \p{Blank}               \p{XPosixBlank} (18)
726   \p{Blk: *}              \p{Block: *}
727   \p{Block: Adlam}        (NOT \p{Adlam} NOR \p{Is_Adlam}) (96:
728                             U+1E900..1E95F)
729   \p{Block: Aegean_Numbers} (64: U+10100..1013F)
730   \p{Block: Ahom}         (NOT \p{Ahom} NOR \p{Is_Ahom}) (80:
731                             U+11700..1174F)
732   \p{Block: Alchemical}   \p{Block=Alchemical_Symbols} (128)
733   \p{Block: Alchemical_Symbols} (Short: \p{Blk=Alchemical}) (128:
734                             U+1F700..1F77F)
735   \p{Block: Alphabetic_PF} \p{Block=Alphabetic_Presentation_Forms}
736                             (80)
737   \p{Block: Alphabetic_Presentation_Forms} (Short: \p{Blk=
738                             AlphabeticPF}) (80: U+FB00..FB4F)
739   \p{Block: Anatolian_Hieroglyphs} (NOT \p{Anatolian_Hieroglyphs}
740                             NOR \p{Is_Anatolian_Hieroglyphs}) (640:
741                             U+14400..1467F)
742   \p{Block: Ancient_Greek_Music} \p{Block=
743                             Ancient_Greek_Musical_Notation} (80)
744   \p{Block: Ancient_Greek_Musical_Notation} (Short: \p{Blk=
745                             AncientGreekMusic}) (80: U+1D200..1D24F)
746   \p{Block: Ancient_Greek_Numbers} (80: U+10140..1018F)
747   \p{Block: Ancient_Symbols} (64: U+10190..101CF)
748   \p{Block: Arabic}       (NOT \p{Arabic} NOR \p{Is_Arabic}) (256:
749                             U+0600..06FF)
750   \p{Block: Arabic_Ext_A} \p{Block=Arabic_Extended_A} (96)
751   \p{Block: Arabic_Ext_B} \p{Block=Arabic_Extended_B} (48)
752   \p{Block: Arabic_Ext_C} \p{Block=Arabic_Extended_C} (64)
753   \p{Block: Arabic_Extended_A} (Short: \p{Blk=ArabicExtA}) (96:
754                             U+08A0..08FF)
755   \p{Block: Arabic_Extended_B} (Short: \p{Blk=ArabicExtB}) (48:
756                             U+0870..089F)
757   \p{Block: Arabic_Extended_C} (Short: \p{Blk=ArabicExtC}) (64:
758                             U+10EC0..10EFF)
759   \p{Block: Arabic_Math}  \p{Block=
760                             Arabic_Mathematical_Alphabetic_Symbols}
761                             (256)
762   \p{Block: Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{Blk=
763                             ArabicMath}) (256: U+1EE00..1EEFF)
764   \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
765   \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
766   \p{Block: Arabic_Presentation_Forms_A} (Short: \p{Blk=ArabicPFA})
767                             (688: U+FB50..FDFF)
768   \p{Block: Arabic_Presentation_Forms_B} (Short: \p{Blk=ArabicPFB})
769                             (144: U+FE70..FEFF)
770   \p{Block: Arabic_Sup}   \p{Block=Arabic_Supplement} (48)
771   \p{Block: Arabic_Supplement} (Short: \p{Blk=ArabicSup}) (48:
772                             U+0750..077F)
773   \p{Block: Armenian}     (NOT \p{Armenian} NOR \p{Is_Armenian})
774                             (96: U+0530..058F)
775   \p{Block: Arrows}       (112: U+2190..21FF)
776   \p{Block: ASCII}        \p{Block=Basic_Latin} (128)
777   \p{Block: Avestan}      (NOT \p{Avestan} NOR \p{Is_Avestan}) (64:
778                             U+10B00..10B3F)
779   \p{Block: Balinese}     (NOT \p{Balinese} NOR \p{Is_Balinese})
780                             (128: U+1B00..1B7F)
781   \p{Block: Bamum}        (NOT \p{Bamum} NOR \p{Is_Bamum}) (96:
782                             U+A6A0..A6FF)
783   \p{Block: Bamum_Sup}    \p{Block=Bamum_Supplement} (576)
784   \p{Block: Bamum_Supplement} (Short: \p{Blk=BamumSup}) (576:
785                             U+16800..16A3F)
786   \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}) (128: [\x00-\x7f])
787   \p{Block: Bassa_Vah}    (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah})
788                             (48: U+16AD0..16AFF)
789   \p{Block: Batak}        (NOT \p{Batak} NOR \p{Is_Batak}) (64:
790                             U+1BC0..1BFF)
791   \p{Block: Bengali}      (NOT \p{Bengali} NOR \p{Is_Bengali}) (128:
792                             U+0980..09FF)
793   \p{Block: Bhaiksuki}    (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki})
794                             (112: U+11C00..11C6F)
795   \p{Block: Block_Elements} (32: U+2580..259F)
796   \p{Block: Bopomofo}     (NOT \p{Bopomofo} NOR \p{Is_Bopomofo})
797                             (48: U+3100..312F)
798   \p{Block: Bopomofo_Ext} \p{Block=Bopomofo_Extended} (32)
799   \p{Block: Bopomofo_Extended} (Short: \p{Blk=BopomofoExt}) (32:
800                             U+31A0..31BF)
801   \p{Block: Box_Drawing}  (128: U+2500..257F)
802   \p{Block: Brahmi}       (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128:
803                             U+11000..1107F)
804   \p{Block: Braille}      \p{Block=Braille_Patterns} (256)
805   \p{Block: Braille_Patterns} (Short: \p{Blk=Braille}) (256:
806                             U+2800..28FF)
807   \p{Block: Buginese}     (NOT \p{Buginese} NOR \p{Is_Buginese})
808                             (32: U+1A00..1A1F)
809   \p{Block: Buhid}        (NOT \p{Buhid} NOR \p{Is_Buhid}) (32:
810                             U+1740..175F)
811   \p{Block: Byzantine_Music} \p{Block=Byzantine_Musical_Symbols}
812                             (256)
813   \p{Block: Byzantine_Musical_Symbols} (Short: \p{Blk=
814                             ByzantineMusic}) (256: U+1D000..1D0FF)
815   \p{Block: Canadian_Syllabics} \p{Block=
816                             Unified_Canadian_Aboriginal_Syllabics}
817                             (640)
818   \p{Block: Carian}       (NOT \p{Carian} NOR \p{Is_Carian}) (64:
819                             U+102A0..102DF)
820   \p{Block: Caucasian_Albanian} (NOT \p{Caucasian_Albanian} NOR
821                             \p{Is_Caucasian_Albanian}) (64:
822                             U+10530..1056F)
823   \p{Block: Chakma}       (NOT \p{Chakma} NOR \p{Is_Chakma}) (80:
824                             U+11100..1114F)
825   \p{Block: Cham}         (NOT \p{Cham} NOR \p{Is_Cham}) (96:
826                             U+AA00..AA5F)
827   \p{Block: Cherokee}     (NOT \p{Cherokee} NOR \p{Is_Cherokee})
828                             (96: U+13A0..13FF)
829   \p{Block: Cherokee_Sup} \p{Block=Cherokee_Supplement} (80)
830   \p{Block: Cherokee_Supplement} (Short: \p{Blk=CherokeeSup}) (80:
831                             U+AB70..ABBF)
832   \p{Block: Chess_Symbols} (112: U+1FA00..1FA6F)
833   \p{Block: Chorasmian}   (NOT \p{Chorasmian} NOR \p{Is_Chorasmian})
834                             (48: U+10FB0..10FDF)
835   \p{Block: CJK}          \p{Block=CJK_Unified_Ideographs} (20_992)
836   \p{Block: CJK_Compat}   \p{Block=CJK_Compatibility} (256)
837   \p{Block: CJK_Compat_Forms} \p{Block=CJK_Compatibility_Forms} (32)
838   \p{Block: CJK_Compat_Ideographs} \p{Block=
839                             CJK_Compatibility_Ideographs} (512)
840   \p{Block: CJK_Compat_Ideographs_Sup} \p{Block=
841                             CJK_Compatibility_Ideographs_Supplement}
842                             (544)
843   \p{Block: CJK_Compatibility} (Short: \p{Blk=CJKCompat}) (256:
844                             U+3300..33FF)
845   \p{Block: CJK_Compatibility_Forms} (Short: \p{Blk=CJKCompatForms})
846                             (32: U+FE30..FE4F)
847   \p{Block: CJK_Compatibility_Ideographs} (Short: \p{Blk=
848                             CJKCompatIdeographs}) (512: U+F900..FAFF)
849   \p{Block: CJK_Compatibility_Ideographs_Supplement} (Short: \p{Blk=
850                             CJKCompatIdeographsSup}) (544:
851                             U+2F800..2FA1F)
852   \p{Block: CJK_Ext_A}    \p{Block=
853                             CJK_Unified_Ideographs_Extension_A}
854                             (6592)
855   \p{Block: CJK_Ext_B}    \p{Block=
856                             CJK_Unified_Ideographs_Extension_B}
857                             (42_720)
858   \p{Block: CJK_Ext_C}    \p{Block=
859                             CJK_Unified_Ideographs_Extension_C}
860                             (4160)
861   \p{Block: CJK_Ext_D}    \p{Block=
862                             CJK_Unified_Ideographs_Extension_D} (224)
863   \p{Block: CJK_Ext_E}    \p{Block=
864                             CJK_Unified_Ideographs_Extension_E}
865                             (5776)
866   \p{Block: CJK_Ext_F}    \p{Block=
867                             CJK_Unified_Ideographs_Extension_F}
868                             (7488)
869   \p{Block: CJK_Ext_G}    \p{Block=
870                             CJK_Unified_Ideographs_Extension_G}
871                             (4944)
872   \p{Block: CJK_Ext_H}    \p{Block=
873                             CJK_Unified_Ideographs_Extension_H}
874                             (4192)
875   \p{Block: CJK_Radicals_Sup} \p{Block=CJK_Radicals_Supplement} (128)
876   \p{Block: CJK_Radicals_Supplement} (Short: \p{Blk=CJKRadicalsSup})
877                             (128: U+2E80..2EFF)
878   \p{Block: CJK_Strokes}  (48: U+31C0..31EF)
879   \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
880   \p{Block: CJK_Symbols_And_Punctuation} (Short: \p{Blk=CJKSymbols})
881                             (64: U+3000..303F)
882   \p{Block: CJK_Unified_Ideographs} (Short: \p{Blk=CJK}) (20_992:
883                             U+4E00..9FFF)
884   \p{Block: CJK_Unified_Ideographs_Extension_A} (Short: \p{Blk=
885                             CJKExtA}) (6592: U+3400..4DBF)
886   \p{Block: CJK_Unified_Ideographs_Extension_B} (Short: \p{Blk=
887                             CJKExtB}) (42_720: U+20000..2A6DF)
888   \p{Block: CJK_Unified_Ideographs_Extension_C} (Short: \p{Blk=
889                             CJKExtC}) (4160: U+2A700..2B73F)
890   \p{Block: CJK_Unified_Ideographs_Extension_D} (Short: \p{Blk=
891                             CJKExtD}) (224: U+2B740..2B81F)
892   \p{Block: CJK_Unified_Ideographs_Extension_E} (Short: \p{Blk=
893                             CJKExtE}) (5776: U+2B820..2CEAF)
894   \p{Block: CJK_Unified_Ideographs_Extension_F} (Short: \p{Blk=
895                             CJKExtF}) (7488: U+2CEB0..2EBEF)
896   \p{Block: CJK_Unified_Ideographs_Extension_G} (Short: \p{Blk=
897                             CJKExtG}) (4944: U+30000..3134F)
898   \p{Block: CJK_Unified_Ideographs_Extension_H} (Short: \p{Blk=
899                             CJKExtH}) (4192: U+31350..323AF)
900   \p{Block: Combining_Diacritical_Marks} (Short: \p{Blk=
901                             Diacriticals}) (112: U+0300..036F)
902   \p{Block: Combining_Diacritical_Marks_Extended} (Short: \p{Blk=
903                             DiacriticalsExt}) (80: U+1AB0..1AFF)
904   \p{Block: Combining_Diacritical_Marks_For_Symbols} (Short: \p{Blk=
905                             DiacriticalsForSymbols}) (48:
906                             U+20D0..20FF)
907   \p{Block: Combining_Diacritical_Marks_Supplement} (Short: \p{Blk=
908                             DiacriticalsSup}) (64: U+1DC0..1DFF)
909   \p{Block: Combining_Half_Marks} (Short: \p{Blk=HalfMarks}) (16:
910                             U+FE20..FE2F)
911   \p{Block: Combining_Marks_For_Symbols} \p{Block=
912                             Combining_Diacritical_Marks_For_Symbols}
913                             (48)
914   \p{Block: Common_Indic_Number_Forms} (Short: \p{Blk=
915                             IndicNumberForms}) (16: U+A830..A83F)
916   \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
917   \p{Block: Control_Pictures} (64: U+2400..243F)
918   \p{Block: Coptic}       (NOT \p{Coptic} NOR \p{Is_Coptic}) (128:
919                             U+2C80..2CFF)
920   \p{Block: Coptic_Epact_Numbers} (32: U+102E0..102FF)
921   \p{Block: Counting_Rod} \p{Block=Counting_Rod_Numerals} (32)
922   \p{Block: Counting_Rod_Numerals} (Short: \p{Blk=CountingRod}) (32:
923                             U+1D360..1D37F)
924   \p{Block: Cuneiform}    (NOT \p{Cuneiform} NOR \p{Is_Cuneiform})
925                             (1024: U+12000..123FF)
926   \p{Block: Cuneiform_Numbers} \p{Block=
927                             Cuneiform_Numbers_And_Punctuation} (128)
928   \p{Block: Cuneiform_Numbers_And_Punctuation} (Short: \p{Blk=
929                             CuneiformNumbers}) (128: U+12400..1247F)
930   \p{Block: Currency_Symbols} (48: U+20A0..20CF)
931   \p{Block: Cypriot_Syllabary} (64: U+10800..1083F)
932   \p{Block: Cypro_Minoan} (NOT \p{Cypro_Minoan} NOR
933                             \p{Is_Cypro_Minoan}) (112:
934                             U+12F90..12FFF)
935   \p{Block: Cyrillic}     (NOT \p{Cyrillic} NOR \p{Is_Cyrillic})
936                             (256: U+0400..04FF)
937   \p{Block: Cyrillic_Ext_A} \p{Block=Cyrillic_Extended_A} (32)
938   \p{Block: Cyrillic_Ext_B} \p{Block=Cyrillic_Extended_B} (96)
939   \p{Block: Cyrillic_Ext_C} \p{Block=Cyrillic_Extended_C} (16)
940   \p{Block: Cyrillic_Ext_D} \p{Block=Cyrillic_Extended_D} (96)
941   \p{Block: Cyrillic_Extended_A} (Short: \p{Blk=CyrillicExtA}) (32:
942                             U+2DE0..2DFF)
943   \p{Block: Cyrillic_Extended_B} (Short: \p{Blk=CyrillicExtB}) (96:
944                             U+A640..A69F)
945   \p{Block: Cyrillic_Extended_C} (Short: \p{Blk=CyrillicExtC}) (16:
946                             U+1C80..1C8F)
947   \p{Block: Cyrillic_Extended_D} (Short: \p{Blk=CyrillicExtD}) (96:
948                             U+1E030..1E08F)
949   \p{Block: Cyrillic_Sup} \p{Block=Cyrillic_Supplement} (48)
950   \p{Block: Cyrillic_Supplement} (Short: \p{Blk=CyrillicSup}) (48:
951                             U+0500..052F)
952   \p{Block: Cyrillic_Supplementary} \p{Block=Cyrillic_Supplement}
953                             (48)
954   \p{Block: Deseret}      (80: U+10400..1044F)
955   \p{Block: Devanagari}   (NOT \p{Devanagari} NOR \p{Is_Devanagari})
956                             (128: U+0900..097F)
957   \p{Block: Devanagari_Ext} \p{Block=Devanagari_Extended} (32)
958   \p{Block: Devanagari_Ext_A} \p{Block=Devanagari_Extended_A} (96)
959   \p{Block: Devanagari_Extended} (Short: \p{Blk=DevanagariExt}) (32:
960                             U+A8E0..A8FF)
961   \p{Block: Devanagari_Extended_A} (Short: \p{Blk=DevanagariExtA})
962                             (96: U+11B00..11B5F)
963   \p{Block: Diacriticals} \p{Block=Combining_Diacritical_Marks} (112)
964   \p{Block: Diacriticals_Ext} \p{Block=
965                             Combining_Diacritical_Marks_Extended}
966                             (80)
967   \p{Block: Diacriticals_For_Symbols} \p{Block=
968                             Combining_Diacritical_Marks_For_Symbols}
969                             (48)
970   \p{Block: Diacriticals_Sup} \p{Block=
971                             Combining_Diacritical_Marks_Supplement}
972                             (64)
973   \p{Block: Dingbats}     (192: U+2700..27BF)
974   \p{Block: Dives_Akuru}  (NOT \p{Dives_Akuru} NOR
975                             \p{Is_Dives_Akuru}) (96: U+11900..1195F)
976   \p{Block: Dogra}        (NOT \p{Dogra} NOR \p{Is_Dogra}) (80:
977                             U+11800..1184F)
978   \p{Block: Domino}       \p{Block=Domino_Tiles} (112)
979   \p{Block: Domino_Tiles} (Short: \p{Blk=Domino}) (112:
980                             U+1F030..1F09F)
981   \p{Block: Duployan}     (NOT \p{Duployan} NOR \p{Is_Duployan})
982                             (160: U+1BC00..1BC9F)
983   \p{Block: Early_Dynastic_Cuneiform} (208: U+12480..1254F)
984   \p{Block: Egyptian_Hieroglyph_Format_Controls} (48: U+13430..1345F)
985   \p{Block: Egyptian_Hieroglyphs} (NOT \p{Egyptian_Hieroglyphs} NOR
986                             \p{Is_Egyptian_Hieroglyphs}) (1072:
987                             U+13000..1342F)
988   \p{Block: Elbasan}      (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48:
989                             U+10500..1052F)
990   \p{Block: Elymaic}      (NOT \p{Elymaic} NOR \p{Is_Elymaic}) (32:
991                             U+10FE0..10FFF)
992   \p{Block: Emoticons}    (80: U+1F600..1F64F)
993   \p{Block: Enclosed_Alphanum} \p{Block=Enclosed_Alphanumerics} (160)
994   \p{Block: Enclosed_Alphanum_Sup} \p{Block=
995                             Enclosed_Alphanumeric_Supplement} (256)
996   \p{Block: Enclosed_Alphanumeric_Supplement} (Short: \p{Blk=
997                             EnclosedAlphanumSup}) (256:
998                             U+1F100..1F1FF)
999   \p{Block: Enclosed_Alphanumerics} (Short: \p{Blk=
1000                             EnclosedAlphanum}) (160: U+2460..24FF)
1001   \p{Block: Enclosed_CJK} \p{Block=Enclosed_CJK_Letters_And_Months}
1002                             (256)
1003   \p{Block: Enclosed_CJK_Letters_And_Months} (Short: \p{Blk=
1004                             EnclosedCJK}) (256: U+3200..32FF)
1005   \p{Block: Enclosed_Ideographic_Sup} \p{Block=
1006                             Enclosed_Ideographic_Supplement} (256)
1007   \p{Block: Enclosed_Ideographic_Supplement} (Short: \p{Blk=
1008                             EnclosedIdeographicSup}) (256:
1009                             U+1F200..1F2FF)
1010   \p{Block: Ethiopic}     (NOT \p{Ethiopic} NOR \p{Is_Ethiopic})
1011                             (384: U+1200..137F)
1012   \p{Block: Ethiopic_Ext} \p{Block=Ethiopic_Extended} (96)
1013   \p{Block: Ethiopic_Ext_A} \p{Block=Ethiopic_Extended_A} (48)
1014   \p{Block: Ethiopic_Ext_B} \p{Block=Ethiopic_Extended_B} (32)
1015   \p{Block: Ethiopic_Extended} (Short: \p{Blk=EthiopicExt}) (96:
1016                             U+2D80..2DDF)
1017   \p{Block: Ethiopic_Extended_A} (Short: \p{Blk=EthiopicExtA}) (48:
1018                             U+AB00..AB2F)
1019   \p{Block: Ethiopic_Extended_B} (Short: \p{Blk=EthiopicExtB}) (32:
1020                             U+1E7E0..1E7FF)
1021   \p{Block: Ethiopic_Sup} \p{Block=Ethiopic_Supplement} (32)
1022   \p{Block: Ethiopic_Supplement} (Short: \p{Blk=EthiopicSup}) (32:
1023                             U+1380..139F)
1024   \p{Block: General_Punctuation} (Short: \p{Blk=Punctuation}; NOT
1025                             \p{Punct} NOR \p{Is_Punctuation}) (112:
1026                             U+2000..206F)
1027   \p{Block: Geometric_Shapes} (96: U+25A0..25FF)
1028   \p{Block: Geometric_Shapes_Ext} \p{Block=
1029                             Geometric_Shapes_Extended} (128)
1030   \p{Block: Geometric_Shapes_Extended} (Short: \p{Blk=
1031                             GeometricShapesExt}) (128:
1032                             U+1F780..1F7FF)
1033   \p{Block: Georgian}     (NOT \p{Georgian} NOR \p{Is_Georgian})
1034                             (96: U+10A0..10FF)
1035   \p{Block: Georgian_Ext} \p{Block=Georgian_Extended} (48)
1036   \p{Block: Georgian_Extended} (Short: \p{Blk=GeorgianExt}) (48:
1037                             U+1C90..1CBF)
1038   \p{Block: Georgian_Sup} \p{Block=Georgian_Supplement} (48)
1039   \p{Block: Georgian_Supplement} (Short: \p{Blk=GeorgianSup}) (48:
1040                             U+2D00..2D2F)
1041   \p{Block: Glagolitic}   (NOT \p{Glagolitic} NOR \p{Is_Glagolitic})
1042                             (96: U+2C00..2C5F)
1043   \p{Block: Glagolitic_Sup} \p{Block=Glagolitic_Supplement} (48)
1044   \p{Block: Glagolitic_Supplement} (Short: \p{Blk=GlagoliticSup})
1045                             (48: U+1E000..1E02F)
1046   \p{Block: Gothic}       (NOT \p{Gothic} NOR \p{Is_Gothic}) (32:
1047                             U+10330..1034F)
1048   \p{Block: Grantha}      (NOT \p{Grantha} NOR \p{Is_Grantha}) (128:
1049                             U+11300..1137F)
1050   \p{Block: Greek}        \p{Block=Greek_And_Coptic} (NOT \p{Greek}
1051                             NOR \p{Is_Greek}) (144)
1052   \p{Block: Greek_And_Coptic} (Short: \p{Blk=Greek}; NOT \p{Greek}
1053                             NOR \p{Is_Greek}) (144: U+0370..03FF)
1054   \p{Block: Greek_Ext}    \p{Block=Greek_Extended} (256)
1055   \p{Block: Greek_Extended} (Short: \p{Blk=GreekExt}) (256:
1056                             U+1F00..1FFF)
1057   \p{Block: Gujarati}     (NOT \p{Gujarati} NOR \p{Is_Gujarati})
1058                             (128: U+0A80..0AFF)
1059   \p{Block: Gunjala_Gondi} (NOT \p{Gunjala_Gondi} NOR
1060                             \p{Is_Gunjala_Gondi}) (80:
1061                             U+11D60..11DAF)
1062   \p{Block: Gurmukhi}     (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi})
1063                             (128: U+0A00..0A7F)
1064   \p{Block: Half_And_Full_Forms} \p{Block=
1065                             Halfwidth_And_Fullwidth_Forms} (240)
1066   \p{Block: Half_Marks}   \p{Block=Combining_Half_Marks} (16)
1067   \p{Block: Halfwidth_And_Fullwidth_Forms} (Short: \p{Blk=
1068                             HalfAndFullForms}) (240: U+FF00..FFEF)
1069   \p{Block: Hangul}       \p{Block=Hangul_Syllables} (NOT \p{Hangul}
1070                             NOR \p{Is_Hangul}) (11_184)
1071   \p{Block: Hangul_Compatibility_Jamo} (Short: \p{Blk=CompatJamo})
1072                             (96: U+3130..318F)
1073   \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}) (256: U+1100..11FF)
1074   \p{Block: Hangul_Jamo_Extended_A} (Short: \p{Blk=JamoExtA}) (32:
1075                             U+A960..A97F)
1076   \p{Block: Hangul_Jamo_Extended_B} (Short: \p{Blk=JamoExtB}) (80:
1077                             U+D7B0..D7FF)
1078   \p{Block: Hangul_Syllables} (Short: \p{Blk=Hangul}; NOT \p{Hangul}
1079                             NOR \p{Is_Hangul}) (11_184: U+AC00..D7AF)
1080   \p{Block: Hanifi_Rohingya} (NOT \p{Hanifi_Rohingya} NOR
1081                             \p{Is_Hanifi_Rohingya}) (64:
1082                             U+10D00..10D3F)
1083   \p{Block: Hanunoo}      (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32:
1084                             U+1720..173F)
1085   \p{Block: Hatran}       (NOT \p{Hatran} NOR \p{Is_Hatran}) (32:
1086                             U+108E0..108FF)
1087   \p{Block: Hebrew}       (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112:
1088                             U+0590..05FF)
1089   \p{Block: High_Private_Use_Surrogates} (Short: \p{Blk=
1090                             HighPUSurrogates}) (128: U+DB80..DBFF)
1091   \p{Block: High_PU_Surrogates} \p{Block=
1092                             High_Private_Use_Surrogates} (128)
1093   \p{Block: High_Surrogates} (896: U+D800..DB7F)
1094   \p{Block: Hiragana}     (NOT \p{Hiragana} NOR \p{Is_Hiragana})
1095                             (96: U+3040..309F)
1096   \p{Block: IDC}          \p{Block=
1097                             Ideographic_Description_Characters} (NOT
1098                             \p{ID_Continue} NOR \p{Is_IDC}) (16)
1099   \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
1100                             NOT \p{ID_Continue} NOR \p{Is_IDC}) (16:
1101                             U+2FF0..2FFF)
1102   \p{Block: Ideographic_Symbols} \p{Block=
1103                             Ideographic_Symbols_And_Punctuation} (32)
1104   \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
1105                             IdeographicSymbols}) (32: U+16FE0..16FFF)
1106   \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
1107                             \p{Is_Imperial_Aramaic}) (32:
1108                             U+10840..1085F)
1109   \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
1110                             (16)
1111   \p{Block: Indic_Siyaq_Numbers} (80: U+1EC70..1ECBF)
1112   \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
1113                             NOR \p{Is_Inscriptional_Pahlavi}) (32:
1114                             U+10B60..10B7F)
1115   \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
1116                             NOR \p{Is_Inscriptional_Parthian}) (32:
1117                             U+10B40..10B5F)
1118   \p{Block: IPA_Ext}      \p{Block=IPA_Extensions} (96)
1119   \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96:
1120                             U+0250..02AF)
1121   \p{Block: Jamo}         \p{Block=Hangul_Jamo} (256)
1122   \p{Block: Jamo_Ext_A}   \p{Block=Hangul_Jamo_Extended_A} (32)
1123   \p{Block: Jamo_Ext_B}   \p{Block=Hangul_Jamo_Extended_B} (80)
1124   \p{Block: Javanese}     (NOT \p{Javanese} NOR \p{Is_Javanese})
1125                             (96: U+A980..A9DF)
1126   \p{Block: Kaithi}       (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80:
1127                             U+11080..110CF)
1128   \p{Block: Kaktovik_Numerals} (32: U+1D2C0..1D2DF)
1129   \p{Block: Kana_Ext_A}   \p{Block=Kana_Extended_A} (48)
1130   \p{Block: Kana_Ext_B}   \p{Block=Kana_Extended_B} (16)
1131   \p{Block: Kana_Extended_A} (Short: \p{Blk=KanaExtA}) (48:
1132                             U+1B100..1B12F)
1133   \p{Block: Kana_Extended_B} (Short: \p{Blk=KanaExtB}) (16:
1134                             U+1AFF0..1AFFF)
1135   \p{Block: Kana_Sup}     \p{Block=Kana_Supplement} (256)
1136   \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256:
1137                             U+1B000..1B0FF)
1138   \p{Block: Kanbun}       (16: U+3190..319F)
1139   \p{Block: Kangxi}       \p{Block=Kangxi_Radicals} (224)
1140   \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224:
1141                             U+2F00..2FDF)
1142   \p{Block: Kannada}      (NOT \p{Kannada} NOR \p{Is_Kannada}) (128:
1143                             U+0C80..0CFF)
1144   \p{Block: Katakana}     (NOT \p{Katakana} NOR \p{Is_Katakana})
1145                             (96: U+30A0..30FF)
1146   \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
1147   \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
1148                             KatakanaExt}) (16: U+31F0..31FF)
1149   \p{Block: Kawi}         (NOT \p{Kawi} NOR \p{Is_Kawi}) (96:
1150                             U+11F00..11F5F)
1151   \p{Block: Kayah_Li}     (48: U+A900..A92F)
1152   \p{Block: Kharoshthi}   (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
1153                             (96: U+10A00..10A5F)
1154   \p{Block: Khitan_Small_Script} (NOT \p{Khitan_Small_Script} NOR
1155                             \p{Is_Khitan_Small_Script}) (512:
1156                             U+18B00..18CFF)
1157   \p{Block: Khmer}        (NOT \p{Khmer} NOR \p{Is_Khmer}) (128:
1158                             U+1780..17FF)
1159   \p{Block: Khmer_Symbols} (32: U+19E0..19FF)
1160   \p{Block: Khojki}       (NOT \p{Khojki} NOR \p{Is_Khojki}) (80:
1161                             U+11200..1124F)
1162   \p{Block: Khudawadi}    (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
1163                             (80: U+112B0..112FF)
1164   \p{Block: Lao}          (NOT \p{Lao} NOR \p{Is_Lao}) (128:
1165                             U+0E80..0EFF)
1166   \p{Block: Latin_1}      \p{Block=Latin_1_Supplement} (128)
1167   \p{Block: Latin_1_Sup}  \p{Block=Latin_1_Supplement} (128)
1168   \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128: [\x80-
1169                             \xff])
1170   \p{Block: Latin_Ext_A}  \p{Block=Latin_Extended_A} (128)
1171   \p{Block: Latin_Ext_Additional} \p{Block=
1172                             Latin_Extended_Additional} (256)
1173   \p{Block: Latin_Ext_B}  \p{Block=Latin_Extended_B} (208)
1174   \p{Block: Latin_Ext_C}  \p{Block=Latin_Extended_C} (32)
1175   \p{Block: Latin_Ext_D}  \p{Block=Latin_Extended_D} (224)
1176   \p{Block: Latin_Ext_E}  \p{Block=Latin_Extended_E} (64)
1177   \p{Block: Latin_Ext_F}  \p{Block=Latin_Extended_F} (64)
1178   \p{Block: Latin_Ext_G}  \p{Block=Latin_Extended_G} (256)
1179   \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128:
1180                             U+0100..017F)
1181   \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
1182                             LatinExtAdditional}) (256: U+1E00..1EFF)
1183   \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208:
1184                             U+0180..024F)
1185   \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32:
1186                             U+2C60..2C7F)
1187   \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224:
1188                             U+A720..A7FF)
1189   \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64:
1190                             U+AB30..AB6F)
1191   \p{Block: Latin_Extended_F} (Short: \p{Blk=LatinExtF}) (64:
1192                             U+10780..107BF)
1193   \p{Block: Latin_Extended_G} (Short: \p{Blk=LatinExtG}) (256:
1194                             U+1DF00..1DFFF)
1195   \p{Block: Lepcha}       (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80:
1196                             U+1C00..1C4F)
1197   \p{Block: Letterlike_Symbols} (80: U+2100..214F)
1198   \p{Block: Limbu}        (NOT \p{Limbu} NOR \p{Is_Limbu}) (80:
1199                             U+1900..194F)
1200   \p{Block: Linear_A}     (NOT \p{Linear_A} NOR \p{Is_Linear_A})
1201                             (384: U+10600..1077F)
1202   \p{Block: Linear_B_Ideograms} (128: U+10080..100FF)
1203   \p{Block: Linear_B_Syllabary} (128: U+10000..1007F)
1204   \p{Block: Lisu}         (NOT \p{Lisu} NOR \p{Is_Lisu}) (48:
1205                             U+A4D0..A4FF)
1206   \p{Block: Lisu_Sup}     \p{Block=Lisu_Supplement} (16)
1207   \p{Block: Lisu_Supplement} (Short: \p{Blk=LisuSup}) (16:
1208                             U+11FB0..11FBF)
1209   \p{Block: Low_Surrogates} (1024: U+DC00..DFFF)
1210   \p{Block: Lycian}       (NOT \p{Lycian} NOR \p{Is_Lycian}) (32:
1211                             U+10280..1029F)
1212   \p{Block: Lydian}       (NOT \p{Lydian} NOR \p{Is_Lydian}) (32:
1213                             U+10920..1093F)
1214   \p{Block: Mahajani}     (NOT \p{Mahajani} NOR \p{Is_Mahajani})
1215                             (48: U+11150..1117F)
1216   \p{Block: Mahjong}      \p{Block=Mahjong_Tiles} (48)
1217   \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48:
1218                             U+1F000..1F02F)
1219   \p{Block: Makasar}      (NOT \p{Makasar} NOR \p{Is_Makasar}) (32:
1220                             U+11EE0..11EFF)
1221   \p{Block: Malayalam}    (NOT \p{Malayalam} NOR \p{Is_Malayalam})
1222                             (128: U+0D00..0D7F)
1223   \p{Block: Mandaic}      (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32:
1224                             U+0840..085F)
1225   \p{Block: Manichaean}   (NOT \p{Manichaean} NOR \p{Is_Manichaean})
1226                             (64: U+10AC0..10AFF)
1227   \p{Block: Marchen}      (NOT \p{Marchen} NOR \p{Is_Marchen}) (80:
1228                             U+11C70..11CBF)
1229   \p{Block: Masaram_Gondi} (NOT \p{Masaram_Gondi} NOR
1230                             \p{Is_Masaram_Gondi}) (96:
1231                             U+11D00..11D5F)
1232   \p{Block: Math_Alphanum} \p{Block=
1233                             Mathematical_Alphanumeric_Symbols} (1024)
1234   \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
1235   \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
1236                             MathAlphanum}) (1024: U+1D400..1D7FF)
1237   \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
1238                             (256: U+2200..22FF)
1239   \p{Block: Mayan_Numerals} (32: U+1D2E0..1D2FF)
1240   \p{Block: Medefaidrin}  (NOT \p{Medefaidrin} NOR
1241                             \p{Is_Medefaidrin}) (96: U+16E40..16E9F)
1242   \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
1243                             \p{Is_Meetei_Mayek}) (64: U+ABC0..ABFF)
1244   \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
1245   \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
1246                             (32: U+AAE0..AAFF)
1247   \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
1248                             \p{Is_Mende_Kikakui}) (224:
1249                             U+1E800..1E8DF)
1250   \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
1251                             \p{Is_Meroitic_Cursive}) (96:
1252                             U+109A0..109FF)
1253   \p{Block: Meroitic_Hieroglyphs} (32: U+10980..1099F)
1254   \p{Block: Miao}         (NOT \p{Miao} NOR \p{Is_Miao}) (160:
1255                             U+16F00..16F9F)
1256   \p{Block: Misc_Arrows}  \p{Block=Miscellaneous_Symbols_And_Arrows}
1257                             (256)
1258   \p{Block: Misc_Math_Symbols_A} \p{Block=
1259                             Miscellaneous_Mathematical_Symbols_A}
1260                             (48)
1261   \p{Block: Misc_Math_Symbols_B} \p{Block=
1262                             Miscellaneous_Mathematical_Symbols_B}
1263                             (128)
1264   \p{Block: Misc_Pictographs} \p{Block=
1265                             Miscellaneous_Symbols_And_Pictographs}
1266                             (768)
1267   \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
1268   \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
1269   \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
1270                             MiscMathSymbolsA}) (48: U+27C0..27EF)
1271   \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
1272                             MiscMathSymbolsB}) (128: U+2980..29FF)
1273   \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols})
1274                             (256: U+2600..26FF)
1275   \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
1276                             MiscArrows}) (256: U+2B00..2BFF)
1277   \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
1278                             MiscPictographs}) (768: U+1F300..1F5FF)
1279   \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
1280                             (256: U+2300..23FF)
1281   \p{Block: Modi}         (NOT \p{Modi} NOR \p{Is_Modi}) (96:
1282                             U+11600..1165F)
1283   \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
1284   \p{Block: Modifier_Tone_Letters} (32: U+A700..A71F)
1285   \p{Block: Mongolian}    (NOT \p{Mongolian} NOR \p{Is_Mongolian})
1286                             (176: U+1800..18AF)
1287   \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
1288   \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32:
1289                             U+11660..1167F)
1290   \p{Block: Mro}          (NOT \p{Mro} NOR \p{Is_Mro}) (48:
1291                             U+16A40..16A6F)
1292   \p{Block: Multani}      (NOT \p{Multani} NOR \p{Is_Multani}) (48:
1293                             U+11280..112AF)
1294   \p{Block: Music}        \p{Block=Musical_Symbols} (256)
1295   \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256:
1296                             U+1D100..1D1FF)
1297   \p{Block: Myanmar}      (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160:
1298                             U+1000..109F)
1299   \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
1300   \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
1301   \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32:
1302                             U+AA60..AA7F)
1303   \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32:
1304                             U+A9E0..A9FF)
1305   \p{Block: Nabataean}    (NOT \p{Nabataean} NOR \p{Is_Nabataean})
1306                             (48: U+10880..108AF)
1307   \p{Block: Nag_Mundari}  (NOT \p{Nag_Mundari} NOR
1308                             \p{Is_Nag_Mundari}) (48: U+1E4D0..1E4FF)
1309   \p{Block: Nandinagari}  (NOT \p{Nandinagari} NOR
1310                             \p{Is_Nandinagari}) (96: U+119A0..119FF)
1311   \p{Block: NB}           \p{Block=No_Block} (820_944 plus all
1312                             above-Unicode code points)
1313   \p{Block: New_Tai_Lue}  (NOT \p{New_Tai_Lue} NOR
1314                             \p{Is_New_Tai_Lue}) (96: U+1980..19DF)
1315   \p{Block: Newa}         (NOT \p{Newa} NOR \p{Is_Newa}) (128:
1316                             U+11400..1147F)
1317   \p{Block: NKo}          (NOT \p{Nko} NOR \p{Is_NKo}) (64:
1318                             U+07C0..07FF)
1319   \p{Block: No_Block}     (Short: \p{Blk=NB}) (820_944 plus all
1320                             above-Unicode code points: U+2FE0..2FEF,
1321                             U+10200..1027F, U+103E0..103FF,
1322                             U+105C0..105FF, U+107C0..107FF,
1323                             U+108B0..108DF ...)
1324   \p{Block: Number_Forms} (64: U+2150..218F)
1325   \p{Block: Nushu}        (NOT \p{Nushu} NOR \p{Is_Nushu}) (400:
1326                             U+1B170..1B2FF)
1327   \p{Block: Nyiakeng_Puachue_Hmong} (NOT \p{Nyiakeng_Puachue_Hmong}
1328                             NOR \p{Is_Nyiakeng_Puachue_Hmong}) (80:
1329                             U+1E100..1E14F)
1330   \p{Block: OCR}          \p{Block=Optical_Character_Recognition}
1331                             (32)
1332   \p{Block: Ogham}        (NOT \p{Ogham} NOR \p{Is_Ogham}) (32:
1333                             U+1680..169F)
1334   \p{Block: Ol_Chiki}     (48: U+1C50..1C7F)
1335   \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
1336                             \p{Is_Old_Hungarian}) (128:
1337                             U+10C80..10CFF)
1338   \p{Block: Old_Italic}   (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
1339                             (48: U+10300..1032F)
1340   \p{Block: Old_North_Arabian} (32: U+10A80..10A9F)
1341   \p{Block: Old_Permic}   (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
1342                             (48: U+10350..1037F)
1343   \p{Block: Old_Persian}  (NOT \p{Old_Persian} NOR
1344                             \p{Is_Old_Persian}) (64: U+103A0..103DF)
1345   \p{Block: Old_Sogdian}  (NOT \p{Old_Sogdian} NOR
1346                             \p{Is_Old_Sogdian}) (48: U+10F00..10F2F)
1347   \p{Block: Old_South_Arabian} (32: U+10A60..10A7F)
1348   \p{Block: Old_Turkic}   (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
1349                             (80: U+10C00..10C4F)
1350   \p{Block: Old_Uyghur}   (NOT \p{Old_Uyghur} NOR \p{Is_Old_Uyghur})
1351                             (64: U+10F70..10FAF)
1352   \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32:
1353                             U+2440..245F)
1354   \p{Block: Oriya}        (NOT \p{Oriya} NOR \p{Is_Oriya}) (128:
1355                             U+0B00..0B7F)
1356   \p{Block: Ornamental_Dingbats} (48: U+1F650..1F67F)
1357   \p{Block: Osage}        (NOT \p{Osage} NOR \p{Is_Osage}) (80:
1358                             U+104B0..104FF)
1359   \p{Block: Osmanya}      (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48:
1360                             U+10480..104AF)
1361   \p{Block: Ottoman_Siyaq_Numbers} (80: U+1ED00..1ED4F)
1362   \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
1363                             \p{Is_Pahawh_Hmong}) (144:
1364                             U+16B00..16B8F)
1365   \p{Block: Palmyrene}    (32: U+10860..1087F)
1366   \p{Block: Pau_Cin_Hau}  (NOT \p{Pau_Cin_Hau} NOR
1367                             \p{Is_Pau_Cin_Hau}) (64: U+11AC0..11AFF)
1368   \p{Block: Phags_Pa}     (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa})
1369                             (64: U+A840..A87F)
1370   \p{Block: Phaistos}     \p{Block=Phaistos_Disc} (48)
1371   \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48:
1372                             U+101D0..101FF)
1373   \p{Block: Phoenician}   (NOT \p{Phoenician} NOR \p{Is_Phoenician})
1374                             (32: U+10900..1091F)
1375   \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
1376   \p{Block: Phonetic_Ext_Sup} \p{Block=
1377                             Phonetic_Extensions_Supplement} (64)
1378   \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128:
1379                             U+1D00..1D7F)
1380   \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
1381                             PhoneticExtSup}) (64: U+1D80..1DBF)
1382   \p{Block: Playing_Cards} (96: U+1F0A0..1F0FF)
1383   \p{Block: Private_Use}  \p{Block=Private_Use_Area} (NOT
1384                             \p{Private_Use} NOR \p{Is_Private_Use})
1385                             (6400)
1386   \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
1387                             \p{Private_Use} NOR \p{Is_Private_Use})
1388                             (6400: U+E000..F8FF)
1389   \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
1390                             \p{Is_Psalter_Pahlavi}) (48:
1391                             U+10B80..10BAF)
1392   \p{Block: PUA}          \p{Block=Private_Use_Area} (NOT
1393                             \p{Private_Use} NOR \p{Is_Private_Use})
1394                             (6400)
1395   \p{Block: Punctuation}  \p{Block=General_Punctuation} (NOT
1396                             \p{Punct} NOR \p{Is_Punctuation}) (112)
1397   \p{Block: Rejang}       (NOT \p{Rejang} NOR \p{Is_Rejang}) (48:
1398                             U+A930..A95F)
1399   \p{Block: Rumi}         \p{Block=Rumi_Numeral_Symbols} (32)
1400   \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32:
1401                             U+10E60..10E7F)
1402   \p{Block: Runic}        (NOT \p{Runic} NOR \p{Is_Runic}) (96:
1403                             U+16A0..16FF)
1404   \p{Block: Samaritan}    (NOT \p{Samaritan} NOR \p{Is_Samaritan})
1405                             (64: U+0800..083F)
1406   \p{Block: Saurashtra}   (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
1407                             (96: U+A880..A8DF)
1408   \p{Block: Sharada}      (NOT \p{Sharada} NOR \p{Is_Sharada}) (96:
1409                             U+11180..111DF)
1410   \p{Block: Shavian}      (48: U+10450..1047F)
1411   \p{Block: Shorthand_Format_Controls} (16: U+1BCA0..1BCAF)
1412   \p{Block: Siddham}      (NOT \p{Siddham} NOR \p{Is_Siddham}) (128:
1413                             U+11580..115FF)
1414   \p{Block: Sinhala}      (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128:
1415                             U+0D80..0DFF)
1416   \p{Block: Sinhala_Archaic_Numbers} (32: U+111E0..111FF)
1417   \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32:
1418                             U+FE50..FE6F)
1419   \p{Block: Small_Forms}  \p{Block=Small_Form_Variants} (32)
1420   \p{Block: Small_Kana_Ext} \p{Block=Small_Kana_Extension} (64)
1421   \p{Block: Small_Kana_Extension} (Short: \p{Blk=SmallKanaExt}) (64:
1422                             U+1B130..1B16F)
1423   \p{Block: Sogdian}      (NOT \p{Sogdian} NOR \p{Is_Sogdian}) (64:
1424                             U+10F30..10F6F)
1425   \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
1426                             \p{Is_Sora_Sompeng}) (48: U+110D0..110FF)
1427   \p{Block: Soyombo}      (NOT \p{Soyombo} NOR \p{Is_Soyombo}) (96:
1428                             U+11A50..11AAF)
1429   \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
1430                             ModifierLetters}) (80: U+02B0..02FF)
1431   \p{Block: Specials}     (16: U+FFF0..FFFF)
1432   \p{Block: Sundanese}    (NOT \p{Sundanese} NOR \p{Is_Sundanese})
1433                             (64: U+1B80..1BBF)
1434   \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
1435   \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16:
1436                             U+1CC0..1CCF)
1437   \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
1438   \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
1439   \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
1440   \p{Block: Sup_Math_Operators} \p{Block=
1441                             Supplemental_Mathematical_Operators}
1442                             (256)
1443   \p{Block: Sup_PUA_A}    \p{Block=Supplementary_Private_Use_Area_A}
1444                             (65_536)
1445   \p{Block: Sup_PUA_B}    \p{Block=Supplementary_Private_Use_Area_B}
1446                             (65_536)
1447   \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
1448   \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
1449                             Supplemental_Symbols_And_Pictographs}
1450                             (256)
1451   \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
1452   \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
1453                             SuperAndSub}) (48: U+2070..209F)
1454   \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16:
1455                             U+27F0..27FF)
1456   \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128:
1457                             U+2900..297F)
1458   \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256:
1459                             U+1F800..1F8FF)
1460   \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
1461                             SupMathOperators}) (256: U+2A00..2AFF)
1462   \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
1463                             SupPunctuation}) (128: U+2E00..2E7F)
1464   \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
1465                             SupSymbolsAndPictographs}) (256:
1466                             U+1F900..1F9FF)
1467   \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
1468                             SupPUAA}) (65_536: U+F0000..FFFFF)
1469   \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
1470                             SupPUAB}) (65_536: U+100000..10FFFF)
1471   \p{Block: Sutton_SignWriting} (688: U+1D800..1DAAF)
1472   \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
1473                             \p{Is_Syloti_Nagri}) (48: U+A800..A82F)
1474   \p{Block: Symbols_And_Pictographs_Ext_A} \p{Block=
1475                             Symbols_And_Pictographs_Extended_A} (144)
1476   \p{Block: Symbols_And_Pictographs_Extended_A} (Short: \p{Blk=
1477                             SymbolsAndPictographsExtA}) (144:
1478                             U+1FA70..1FAFF)
1479   \p{Block: Symbols_For_Legacy_Computing} (256: U+1FB00..1FBFF)
1480   \p{Block: Syriac}       (NOT \p{Syriac} NOR \p{Is_Syriac}) (80:
1481                             U+0700..074F)
1482   \p{Block: Syriac_Sup}   \p{Block=Syriac_Supplement} (16)
1483   \p{Block: Syriac_Supplement} (Short: \p{Blk=SyriacSup}) (16:
1484                             U+0860..086F)
1485   \p{Block: Tagalog}      (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32:
1486                             U+1700..171F)
1487   \p{Block: Tagbanwa}     (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa})
1488                             (32: U+1760..177F)
1489   \p{Block: Tags}         (128: U+E0000..E007F)
1490   \p{Block: Tai_Le}       (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48:
1491                             U+1950..197F)
1492   \p{Block: Tai_Tham}     (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
1493                             (144: U+1A20..1AAF)
1494   \p{Block: Tai_Viet}     (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet})
1495                             (96: U+AA80..AADF)
1496   \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
1497   \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96:
1498                             U+1D300..1D35F)
1499   \p{Block: Takri}        (NOT \p{Takri} NOR \p{Is_Takri}) (80:
1500                             U+11680..116CF)
1501   \p{Block: Tamil}        (NOT \p{Tamil} NOR \p{Is_Tamil}) (128:
1502                             U+0B80..0BFF)
1503   \p{Block: Tamil_Sup}    \p{Block=Tamil_Supplement} (64)
1504   \p{Block: Tamil_Supplement} (Short: \p{Blk=TamilSup}) (64:
1505                             U+11FC0..11FFF)
1506   \p{Block: Tangsa}       (NOT \p{Tangsa} NOR \p{Is_Tangsa}) (96:
1507                             U+16A70..16ACF)
1508   \p{Block: Tangut}       (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144:
1509                             U+17000..187FF)
1510   \p{Block: Tangut_Components} (768: U+18800..18AFF)
1511   \p{Block: Tangut_Sup}   \p{Block=Tangut_Supplement} (128)
1512   \p{Block: Tangut_Supplement} (Short: \p{Blk=TangutSup}) (128:
1513                             U+18D00..18D7F)
1514   \p{Block: Telugu}       (NOT \p{Telugu} NOR \p{Is_Telugu}) (128:
1515                             U+0C00..0C7F)
1516   \p{Block: Thaana}       (NOT \p{Thaana} NOR \p{Is_Thaana}) (64:
1517                             U+0780..07BF)
1518   \p{Block: Thai}         (NOT \p{Thai} NOR \p{Is_Thai}) (128:
1519                             U+0E00..0E7F)
1520   \p{Block: Tibetan}      (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256:
1521                             U+0F00..0FFF)
1522   \p{Block: Tifinagh}     (NOT \p{Tifinagh} NOR \p{Is_Tifinagh})
1523                             (80: U+2D30..2D7F)
1524   \p{Block: Tirhuta}      (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96:
1525                             U+11480..114DF)
1526   \p{Block: Toto}         (NOT \p{Toto} NOR \p{Is_Toto}) (48:
1527                             U+1E290..1E2BF)
1528   \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
1529                             (128)
1530   \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
1531                             TransportAndMap}) (128: U+1F680..1F6FF)
1532   \p{Block: UCAS}         \p{Block=
1533                             Unified_Canadian_Aboriginal_Syllabics}
1534                             (640)
1535   \p{Block: UCAS_Ext}     \p{Block=
1536                             Unified_Canadian_Aboriginal_Syllabics_-
1537                             Extended} (80)
1538   \p{Block: UCAS_Ext_A}   \p{Block=
1539                             Unified_Canadian_Aboriginal_Syllabics_-
1540                             Extended_A} (16)
1541   \p{Block: Ugaritic}     (NOT \p{Ugaritic} NOR \p{Is_Ugaritic})
1542                             (32: U+10380..1039F)
1543   \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
1544                             UCAS}) (640: U+1400..167F)
1545   \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
1546                             \p{Blk=UCASExt}) (80: U+18B0..18FF)
1547   \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended_A}
1548                             (Short: \p{Blk=UCASExtA}) (16:
1549                             U+11AB0..11ABF)
1550   \p{Block: Vai}          (NOT \p{Vai} NOR \p{Is_Vai}) (320:
1551                             U+A500..A63F)
1552   \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
1553                             \p{Variation_Selector} NOR \p{Is_VS})
1554                             (16: U+FE00..FE0F)
1555   \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
1556                             (240: U+E0100..E01EF)
1557   \p{Block: Vedic_Ext}    \p{Block=Vedic_Extensions} (48)
1558   \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48:
1559                             U+1CD0..1CFF)
1560   \p{Block: Vertical_Forms} (16: U+FE10..FE1F)
1561   \p{Block: Vithkuqi}     (NOT \p{Vithkuqi} NOR \p{Is_Vithkuqi})
1562                             (80: U+10570..105BF)
1563   \p{Block: VS}           \p{Block=Variation_Selectors} (NOT
1564                             \p{Variation_Selector} NOR \p{Is_VS})
1565                             (16)
1566   \p{Block: VS_Sup}       \p{Block=Variation_Selectors_Supplement}
1567                             (240)
1568   \p{Block: Wancho}       (NOT \p{Wancho} NOR \p{Is_Wancho}) (64:
1569                             U+1E2C0..1E2FF)
1570   \p{Block: Warang_Citi}  (NOT \p{Warang_Citi} NOR
1571                             \p{Is_Warang_Citi}) (96: U+118A0..118FF)
1572   \p{Block: Yezidi}       (NOT \p{Yezidi} NOR \p{Is_Yezidi}) (64:
1573                             U+10E80..10EBF)
1574   \p{Block: Yi_Radicals}  (64: U+A490..A4CF)
1575   \p{Block: Yi_Syllables} (1168: U+A000..A48F)
1576   \p{Block: Yijing}       \p{Block=Yijing_Hexagram_Symbols} (64)
1577   \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64:
1578                             U+4DC0..4DFF)
1579   \p{Block: Zanabazar_Square} (NOT \p{Zanabazar_Square} NOR
1580                             \p{Is_Zanabazar_Square}) (80:
1581                             U+11A00..11A4F)
1582   \p{Block: Znamenny_Music} \p{Block=Znamenny_Musical_Notation} (208)
1583   \p{Block: Znamenny_Musical_Notation} (Short: \p{Blk=
1584                             ZnamennyMusic}) (208: U+1CF00..1CFCF)
1585 X \p{Block_Elements}      \p{Block=Block_Elements} (32)
1586   \p{Bopo}                \p{Bopomofo} (= \p{Script_Extensions=
1587                             Bopomofo}) (NOT \p{Block=Bopomofo}) (117)
1588   \p{Bopomofo}            \p{Script_Extensions=Bopomofo} (Short:
1589                             \p{Bopo}; NOT \p{Block=Bopomofo}) (117)
1590 X \p{Bopomofo_Ext}        \p{Bopomofo_Extended} (= \p{Block=
1591                             Bopomofo_Extended}) (32)
1592 X \p{Bopomofo_Extended}   \p{Block=Bopomofo_Extended} (Short:
1593                             \p{InBopomofoExt}) (32)
1594 X \p{Box_Drawing}         \p{Block=Box_Drawing} (128)
1595   \p{Bpt: *}              \p{Bidi_Paired_Bracket_Type: *}
1596   \p{Brah}                \p{Brahmi} (= \p{Script_Extensions=
1597                             Brahmi}) (NOT \p{Block=Brahmi}) (115)
1598   \p{Brahmi}              \p{Script_Extensions=Brahmi} (Short:
1599                             \p{Brah}; NOT \p{Block=Brahmi}) (115)
1600   \p{Brai}                \p{Braille} (= \p{Script_Extensions=
1601                             Braille}) (256)
1602   \p{Braille}             \p{Script_Extensions=Braille} (Short:
1603                             \p{Brai}) (256)
1604 X \p{Braille_Patterns}    \p{Block=Braille_Patterns} (Short:
1605                             \p{InBraille}) (256)
1606   \p{Bugi}                \p{Buginese} (= \p{Script_Extensions=
1607                             Buginese}) (NOT \p{Block=Buginese}) (31)
1608   \p{Buginese}            \p{Script_Extensions=Buginese} (Short:
1609                             \p{Bugi}; NOT \p{Block=Buginese}) (31)
1610   \p{Buhd}                \p{Buhid} (= \p{Script_Extensions=Buhid})
1611                             (NOT \p{Block=Buhid}) (22)
1612   \p{Buhid}               \p{Script_Extensions=Buhid} (Short:
1613                             \p{Buhd}; NOT \p{Block=Buhid}) (22)
1614 X \p{Byzantine_Music}     \p{Byzantine_Musical_Symbols} (= \p{Block=
1615                             Byzantine_Musical_Symbols}) (256)
1616 X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
1617                             (Short: \p{InByzantineMusic}) (256)
1618   \p{C} \pC               \p{Other} (= \p{General_Category=Other})
1619                             (965_096 plus all above-Unicode code
1620                             points)
1621   \p{Cakm}                \p{Chakma} (= \p{Script_Extensions=
1622                             Chakma}) (NOT \p{Block=Chakma}) (91)
1623   \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
1624                             (Short: \p{Cans}) (726)
1625 X \p{Canadian_Syllabics}  \p{Unified_Canadian_Aboriginal_Syllabics}
1626                             (= \p{Block=
1627                             Unified_Canadian_Aboriginal_Syllabics})
1628                             (640)
1629 T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
1630                             Not_Reordered} (1_113_190 plus all
1631                             above-Unicode code points)
1632 T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
1633                             Overlay} (32)
1634 T \p{Canonical_Combining_Class: 6} \p{Canonical_Combining_Class=
1635                             Han_Reading} (2)
1636 T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
1637                             Nukta} (27)
1638 T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
1639                             Kana_Voicing} (2)
1640 T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
1641                             Virama} (65)
1642 T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
1643                             CCC10} (1)
1644   \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1:
1645                             U+05B0)
1646 T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
1647                             CCC11} (1)
1648   \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1:
1649                             U+05B1)
1650 T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
1651                             CCC12} (1)
1652   \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1:
1653                             U+05B2)
1654 T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
1655                             CCC13} (1)
1656   \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1:
1657                             U+05B3)
1658 T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
1659                             CCC14} (1)
1660   \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1:
1661                             U+05B4)
1662 T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
1663                             CCC15} (1)
1664   \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1:
1665                             U+05B5)
1666 T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
1667                             CCC16} (1)
1668   \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1:
1669                             U+05B6)
1670 T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
1671                             CCC17} (1)
1672   \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1:
1673                             U+05B7)
1674 T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
1675                             CCC18} (2)
1676   \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2:
1677                             U+05B8, U+05C7)
1678 T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
1679                             CCC19} (2)
1680   \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2:
1681                             U+05B9..05BA)
1682 T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
1683                             CCC20} (1)
1684   \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1:
1685                             U+05BB)
1686 T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
1687                             CCC21} (1)
1688   \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1:
1689                             U+05BC)
1690 T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
1691                             CCC22} (1)
1692   \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1:
1693                             U+05BD)
1694 T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
1695                             CCC23} (1)
1696   \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1:
1697                             U+05BF)
1698 T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
1699                             CCC24} (1)
1700   \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1:
1701                             U+05C1)
1702 T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
1703                             CCC25} (1)
1704   \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1:
1705                             U+05C2)
1706 T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
1707                             CCC26} (1)
1708   \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1:
1709                             U+FB1E)
1710 T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
1711                             CCC27} (2)
1712   \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2:
1713                             U+064B, U+08F0)
1714 T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
1715                             CCC28} (2)
1716   \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2:
1717                             U+064C, U+08F1)
1718 T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
1719                             CCC29} (2)
1720   \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2:
1721                             U+064D, U+08F2)
1722 T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
1723                             CCC30} (2)
1724   \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2:
1725                             U+0618, U+064E)
1726 T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
1727                             CCC31} (2)
1728   \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2:
1729                             U+0619, U+064F)
1730 T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
1731                             CCC32} (2)
1732   \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2:
1733                             U+061A, U+0650)
1734 T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
1735                             CCC33} (1)
1736   \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1:
1737                             U+0651)
1738 T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
1739                             CCC34} (1)
1740   \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1:
1741                             U+0652)
1742 T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
1743                             CCC35} (1)
1744   \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1:
1745                             U+0670)
1746 T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
1747                             CCC36} (1)
1748   \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1:
1749                             U+0711)
1750 T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
1751                             CCC84} (1)
1752   \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1:
1753                             U+0C55)
1754 T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
1755                             CCC91} (1)
1756   \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1:
1757                             U+0C56)
1758 T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
1759                             CCC103} (2)
1760   \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2:
1761                             U+0E38..0E39)
1762 T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
1763                             CCC107} (4)
1764   \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4:
1765                             U+0E48..0E4B)
1766 T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
1767                             CCC118} (2)
1768   \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2:
1769                             U+0EB8..0EB9)
1770 T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
1771                             CCC122} (4)
1772   \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4:
1773                             U+0EC8..0ECB)
1774 T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
1775                             CCC129} (1)
1776   \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1:
1777                             U+0F71)
1778 T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
1779                             CCC130} (6)
1780   \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6:
1781                             U+0F72, U+0F7A..0F7D, U+0F80)
1782 T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
1783                             CCC132} (1)
1784   \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1:
1785                             U+0F74)
1786 T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
1787                             CCC133} (0)
1788   \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
1789 T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
1790                             Attached_Below_Left} (0)
1791 T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
1792                             Attached_Below} (5)
1793 T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
1794                             Attached_Above} (1)
1795 T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
1796                             Attached_Above_Right} (9)
1797 T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
1798                             Below_Left} (2)
1799 T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
1800                             Below} (181)
1801 T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
1802                             Below_Right} (4)
1803 T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
1804                             Left} (2)
1805 T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
1806                             Right} (1)
1807 T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
1808                             Above_Left} (5)
1809 T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
1810                             Above} (510)
1811 T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
1812                             Above_Right} (7)
1813 T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
1814                             Double_Below} (4)
1815 T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
1816                             Double_Above} (5)
1817 T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
1818                             Iota_Subscript} (1)
1819   \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
1820                             Above} (510)
1821   \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (510:
1822                             U+0300..0314, U+033D..0344, U+0346,
1823                             U+034A..034C, U+0350..0352, U+0357 ...)
1824   \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (5:
1825                             U+05AE, U+18A9, U+1DF7..1DF8, U+302B)
1826   \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (7:
1827                             U+0315, U+031A, U+0358, U+1DF6, U+302C,
1828                             U+1E4EC..1E4ED)
1829   \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
1830                             Above_Left} (5)
1831   \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
1832                             Above_Right} (7)
1833   \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
1834                             Attached_Above} (1)
1835   \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
1836                             Attached_Above_Right} (9)
1837   \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
1838                             Attached_Below} (5)
1839   \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
1840                             Attached_Below_Left} (0)
1841   \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
1842                             (1: U+1DCE)
1843   \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
1844                             \p{Ccc=ATAR}) (9: U+031B, U+0F39,
1845                             U+1D165..1D166, U+1D16E..1D172)
1846   \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
1847                             (5: U+0321..0322, U+0327..0328, U+1DD0)
1848   \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
1849                             ATBL}) (0)
1850   \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
1851                             Below} (181)
1852   \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (181:
1853                             U+0316..0319, U+031C..0320,
1854                             U+0323..0326, U+0329..0333,
1855                             U+0339..033C, U+0347..0349 ...)
1856   \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (2:
1857                             U+1DFA, U+302A)
1858   \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4:
1859                             U+059A, U+05AD, U+1939, U+302D)
1860   \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
1861                             Below_Left} (2)
1862   \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
1863                             Below_Right} (4)
1864   \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
1865                             Double_Above} (5)
1866   \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
1867                             Double_Below} (4)
1868   \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA})
1869                             (5: U+035D..035E, U+0360..0361, U+1DCD)
1870   \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB})
1871                             (4: U+035C, U+035F, U+0362, U+1DFC)
1872   \p{Canonical_Combining_Class: Han_Reading} (Short: \p{Ccc=HANR})
1873                             (2: U+16FF0..16FF1)
1874   \p{Canonical_Combining_Class: HANR} \p{Canonical_Combining_Class=
1875                             Han_Reading} (2)
1876   \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
1877                             (1: U+0345)
1878   \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
1879                             Iota_Subscript} (1)
1880   \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV})
1881                             (2: U+3099..309A)
1882   \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
1883                             Kana_Voicing} (2)
1884   \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
1885                             Left} (2)
1886   \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2:
1887                             U+302E..302F)
1888   \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
1889                             Nukta} (27)
1890   \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
1891                             (1_113_190 plus all above-Unicode code
1892                             points: U+0000..02FF, U+034F,
1893                             U+0370..0482, U+0488..0590, U+05BE,
1894                             U+05C0 ...)
1895   \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
1896                             Not_Reordered} (1_113_190 plus all
1897                             above-Unicode code points)
1898   \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (27:
1899                             U+093C, U+09BC, U+0A3C, U+0ABC, U+0B3C,
1900                             U+0C3C ...)
1901   \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
1902                             Overlay} (32)
1903   \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32:
1904                             U+0334..0338, U+1CD4, U+1CE2..1CE8,
1905                             U+20D2..20D3, U+20D8..20DA, U+20E5..20E6
1906                             ...)
1907   \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
1908                             Right} (1)
1909   \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1:
1910                             U+1D16D)
1911   \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (65:
1912                             U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
1913                             U+0BCD ...)
1914   \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
1915                             Virama} (65)
1916   \p{Cans}                \p{Canadian_Aboriginal} (=
1917                             \p{Script_Extensions=
1918                             Canadian_Aboriginal}) (726)
1919   \p{Cari}                \p{Carian} (= \p{Script_Extensions=
1920                             Carian}) (NOT \p{Block=Carian}) (49)
1921   \p{Carian}              \p{Script_Extensions=Carian} (Short:
1922                             \p{Cari}; NOT \p{Block=Carian}) (49)
1923   \p{Case_Ignorable}      \p{Case_Ignorable=Y} (Short: \p{CI}) (2707)
1924   \p{Case_Ignorable: N*}  (Short: \p{CI=N}, \P{CI}) (1_111_405 plus
1925                             all above-Unicode code points: [\x00-
1926                             \x20!\"#\$\%&\(\)*+,\-\/0-9;<=>?\@A-Z
1927                             \[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-\xac\xae
1928                             \xb0-\xb3\xb5-\xb6\xb9-\xff],
1929                             U+0100..02AF, U+0370..0373,
1930                             U+0376..0379, U+037B..0383, U+0386 ...)
1931   \p{Case_Ignorable: Y*}  (Short: \p{CI=Y}, \p{CI}) (2707: [\'.:\^`
1932                             \xa8\xad\xaf\xb4\xb7-\xb8],
1933                             U+02B0..036F, U+0374..0375, U+037A,
1934                             U+0384..0385, U+0387 ...)
1935   \p{Cased}               \p{Cased=Y} (4526)
1936   \p{Cased: N*}           (Single: \P{Cased}) (1_109_586 plus all
1937                             above-Unicode code points: [\x00-\x20!
1938                             \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
1939                             \^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9
1940                             \xbb-\xbf\xd7\xf7], U+01BB,
1941                             U+01C0..01C3, U+0294, U+02B9..02BF,
1942                             U+02C2..02DF ...)
1943   \p{Cased: Y*}           (Single: \p{Cased}) (4526: [A-Za-z\xaa
1944                             \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
1945                             U+0100..01BA, U+01BC..01BF,
1946                             U+01C4..0293, U+0295..02B8, U+02C0..02C1
1947                             ...)
1948   \p{Cased_Letter}        \p{General_Category=Cased_Letter} (Short:
1949                             \p{LC}) (4095)
1950   \p{Category: *}         \p{General_Category: *}
1951   \p{Caucasian_Albanian}  \p{Script_Extensions=Caucasian_Albanian}
1952                             (Short: \p{Aghb}; NOT \p{Block=
1953                             Caucasian_Albanian}) (53)
1954   \p{Cc}                  \p{XPosixCntrl} (= \p{General_Category=
1955                             Control}) (65)
1956   \p{Ccc: *}              \p{Canonical_Combining_Class: *}
1957   \p{CE}                  \p{Composition_Exclusion} (=
1958                             \p{Composition_Exclusion=Y}) (81)
1959   \p{CE: *}               \p{Composition_Exclusion: *}
1960   \p{Cf}                  \p{Format} (= \p{General_Category=Format})
1961                             (170)
1962   \p{Chakma}              \p{Script_Extensions=Chakma} (Short:
1963                             \p{Cakm}; NOT \p{Block=Chakma}) (91)
1964   \p{Cham}                \p{Script_Extensions=Cham} (NOT \p{Block=
1965                             Cham}) (83)
1966   \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
1967                             \p{CWCF}) (1506)
1968   \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
1969                             (1_112_606 plus all above-Unicode code
1970                             points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1971                             \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1972                             \xb4\xb6-\xbf\xd7\xe0-\xff], U+0101,
1973                             U+0103, U+0105, U+0107, U+0109 ...)
1974   \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
1975                             (1506: [A-Z\xb5\xc0-\xd6\xd8-\xdf],
1976                             U+0100, U+0102, U+0104, U+0106, U+0108
1977                             ...)
1978   \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
1979                             \p{CWCM}) (2927)
1980   \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
1981                             (1_111_185 plus all above-Unicode code
1982                             points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1983                             \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xb4
1984                             \xb6-\xbf\xd7\xf7], U+0138, U+018D,
1985                             U+019B, U+01AA..01AB, U+01BA..01BB ...)
1986   \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
1987                             (2927: [A-Za-z\xb5\xc0-\xd6\xd8-\xf6
1988                             \xf8-\xff], U+0100..0137, U+0139..018C,
1989                             U+018E..019A, U+019C..01A9, U+01AC..01B9
1990                             ...)
1991   \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
1992                             \p{CWL}) (1433)
1993   \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
1994                             (1_112_679 plus all above-Unicode code
1995                             points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
1996                             \/0-9:;<=>?\@\[\\\]\^_`a-z\{\|\}~\x7f-
1997                             \xbf\xd7\xdf-\xff], U+0101, U+0103,
1998                             U+0105, U+0107, U+0109 ...)
1999   \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1433:
2000                             [A-Z\xc0-\xd6\xd8-\xde], U+0100, U+0102,
2001                             U+0104, U+0106, U+0108 ...)
2002   \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
2003                             Y} (Short: \p{CWKCF}) (10_491)
2004   \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
2005                             \P{CWKCF}) (1_103_621 plus all above-
2006                             Unicode code points: [\x00-\x20!\"#\$
2007                             \%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-
2008                             z\{\|\}~\x7f-\x9f\xa1-\xa7\xa9\xab-\xac
2009                             \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xd7\xe0-
2010                             \xff], U+0101, U+0103, U+0105, U+0107,
2011                             U+0109 ...)
2012   \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
2013                             \p{CWKCF}) (10_491: [A-Z\xa0\xa8\xaa
2014                             \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
2015                             \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
2016                             U+0106, U+0108 ...)
2017   \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
2018                             \p{CWT}) (1452)
2019   \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
2020                             (1_112_660 plus all above-Unicode code
2021                             points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
2022                             \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
2023                             \xb4\xb6-\xde\xf7], U+0100, U+0102,
2024                             U+0104, U+0106, U+0108 ...)
2025   \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1452:
2026                             [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
2027                             U+0103, U+0105, U+0107, U+0109 ...)
2028   \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
2029                             \p{CWU}) (1525)
2030   \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
2031                             (1_112_587 plus all above-Unicode code
2032                             points: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
2033                             \/0-9:;<=>?\@A-Z\[\\\]\^_`\{\|\}~\x7f-
2034                             \xb4\xb6-\xde\xf7], U+0100, U+0102,
2035                             U+0104, U+0106, U+0108 ...)
2036   \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1525:
2037                             [a-z\xb5\xdf-\xf6\xf8-\xff], U+0101,
2038                             U+0103, U+0105, U+0107, U+0109 ...)
2039   \p{Cher}                \p{Cherokee} (= \p{Script_Extensions=
2040                             Cherokee}) (NOT \p{Block=Cherokee}) (172)
2041   \p{Cherokee}            \p{Script_Extensions=Cherokee} (Short:
2042                             \p{Cher}; NOT \p{Block=Cherokee}) (172)
2043 X \p{Cherokee_Sup}        \p{Cherokee_Supplement} (= \p{Block=
2044                             Cherokee_Supplement}) (80)
2045 X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
2046                             \p{InCherokeeSup}) (80)
2047 X \p{Chess_Symbols}       \p{Block=Chess_Symbols} (112)
2048   \p{Chorasmian}          \p{Script_Extensions=Chorasmian} (Short:
2049                             \p{Chrs}; NOT \p{Block=Chorasmian}) (28)
2050   \p{Chrs}                \p{Chorasmian} (= \p{Script_Extensions=
2051                             Chorasmian}) (NOT \p{Block=Chorasmian})
2052                             (28)
2053   \p{CI}                  \p{Case_Ignorable} (= \p{Case_Ignorable=
2054                             Y}) (2707)
2055   \p{CI: *}               \p{Case_Ignorable: *}
2056 X \p{CJK}                 \p{CJK_Unified_Ideographs} (= \p{Block=
2057                             CJK_Unified_Ideographs}) (20_992)
2058 X \p{CJK_Compat}          \p{CJK_Compatibility} (= \p{Block=
2059                             CJK_Compatibility}) (256)
2060 X \p{CJK_Compat_Forms}    \p{CJK_Compatibility_Forms} (= \p{Block=
2061                             CJK_Compatibility_Forms}) (32)
2062 X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
2063                             \p{Block=CJK_Compatibility_Ideographs})
2064                             (512)
2065 X \p{CJK_Compat_Ideographs_Sup}
2066                             \p{CJK_Compatibility_Ideographs_-
2067                             Supplement} (= \p{Block=
2068                             CJK_Compatibility_Ideographs_-
2069                             Supplement}) (544)
2070 X \p{CJK_Compatibility}   \p{Block=CJK_Compatibility} (Short:
2071                             \p{InCJKCompat}) (256)
2072 X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
2073                             (Short: \p{InCJKCompatForms}) (32)
2074 X \p{CJK_Compatibility_Ideographs} \p{Block=
2075                             CJK_Compatibility_Ideographs} (Short:
2076                             \p{InCJKCompatIdeographs}) (512)
2077 X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
2078                             CJK_Compatibility_Ideographs_Supplement}
2079                             (Short: \p{InCJKCompatIdeographsSup})
2080                             (544)
2081 X \p{CJK_Ext_A}           \p{CJK_Unified_Ideographs_Extension_A} (=
2082                             \p{Block=
2083                             CJK_Unified_Ideographs_Extension_A})
2084                             (6592)
2085 X \p{CJK_Ext_B}           \p{CJK_Unified_Ideographs_Extension_B} (=
2086                             \p{Block=
2087                             CJK_Unified_Ideographs_Extension_B})
2088                             (42_720)
2089 X \p{CJK_Ext_C}           \p{CJK_Unified_Ideographs_Extension_C} (=
2090                             \p{Block=
2091                             CJK_Unified_Ideographs_Extension_C})
2092                             (4160)
2093 X \p{CJK_Ext_D}           \p{CJK_Unified_Ideographs_Extension_D} (=
2094                             \p{Block=
2095                             CJK_Unified_Ideographs_Extension_D})
2096                             (224)
2097 X \p{CJK_Ext_E}           \p{CJK_Unified_Ideographs_Extension_E} (=
2098                             \p{Block=
2099                             CJK_Unified_Ideographs_Extension_E})
2100                             (5776)
2101 X \p{CJK_Ext_F}           \p{CJK_Unified_Ideographs_Extension_F} (=
2102                             \p{Block=
2103                             CJK_Unified_Ideographs_Extension_F})
2104                             (7488)
2105 X \p{CJK_Ext_G}           \p{CJK_Unified_Ideographs_Extension_G} (=
2106                             \p{Block=
2107                             CJK_Unified_Ideographs_Extension_G})
2108                             (4944)
2109 X \p{CJK_Ext_H}           \p{CJK_Unified_Ideographs_Extension_H} (=
2110                             \p{Block=
2111                             CJK_Unified_Ideographs_Extension_H})
2112                             (4192)
2113 X \p{CJK_Radicals_Sup}    \p{CJK_Radicals_Supplement} (= \p{Block=
2114                             CJK_Radicals_Supplement}) (128)
2115 X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
2116                             (Short: \p{InCJKRadicalsSup}) (128)
2117 X \p{CJK_Strokes}         \p{Block=CJK_Strokes} (48)
2118 X \p{CJK_Symbols}         \p{CJK_Symbols_And_Punctuation} (=
2119                             \p{Block=CJK_Symbols_And_Punctuation})
2120                             (64)
2121 X \p{CJK_Symbols_And_Punctuation} \p{Block=
2122                             CJK_Symbols_And_Punctuation} (Short:
2123                             \p{InCJKSymbols}) (64)
2124 X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
2125                             (Short: \p{InCJK}) (20_992)
2126 X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
2127                             CJK_Unified_Ideographs_Extension_A}
2128                             (Short: \p{InCJKExtA}) (6592)
2129 X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
2130                             CJK_Unified_Ideographs_Extension_B}
2131                             (Short: \p{InCJKExtB}) (42_720)
2132 X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
2133                             CJK_Unified_Ideographs_Extension_C}
2134                             (Short: \p{InCJKExtC}) (4160)
2135 X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
2136                             CJK_Unified_Ideographs_Extension_D}
2137                             (Short: \p{InCJKExtD}) (224)
2138 X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
2139                             CJK_Unified_Ideographs_Extension_E}
2140                             (Short: \p{InCJKExtE}) (5776)
2141 X \p{CJK_Unified_Ideographs_Extension_F} \p{Block=
2142                             CJK_Unified_Ideographs_Extension_F}
2143                             (Short: \p{InCJKExtF}) (7488)
2144 X \p{CJK_Unified_Ideographs_Extension_G} \p{Block=
2145                             CJK_Unified_Ideographs_Extension_G}
2146                             (Short: \p{InCJKExtG}) (4944)
2147 X \p{CJK_Unified_Ideographs_Extension_H} \p{Block=
2148                             CJK_Unified_Ideographs_Extension_H}
2149                             (Short: \p{InCJKExtH}) (4192)
2150   \p{Close_Punctuation}   \p{General_Category=Close_Punctuation}
2151                             (Short: \p{Pe}) (77)
2152   \p{Cn}                  \p{Unassigned} (= \p{General_Category=
2153                             Unassigned}) (825_345 plus all above-
2154                             Unicode code points)
2155   \p{Cntrl}               \p{XPosixCntrl} (= \p{General_Category=
2156                             Control}) (65)
2157   \p{Co}                  \p{Private_Use} (= \p{General_Category=
2158                             Private_Use}) (NOT \p{Private_Use_Area})
2159                             (137_468)
2160 X \p{Combining_Diacritical_Marks} \p{Block=
2161                             Combining_Diacritical_Marks} (Short:
2162                             \p{InDiacriticals}) (112)
2163 X \p{Combining_Diacritical_Marks_Extended} \p{Block=
2164                             Combining_Diacritical_Marks_Extended}
2165                             (Short: \p{InDiacriticalsExt}) (80)
2166 X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
2167                             Combining_Diacritical_Marks_For_Symbols}
2168                             (Short: \p{InDiacriticalsForSymbols})
2169                             (48)
2170 X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
2171                             Combining_Diacritical_Marks_Supplement}
2172                             (Short: \p{InDiacriticalsSup}) (64)
2173 X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
2174                             \p{InHalfMarks}) (16)
2175   \p{Combining_Mark}      \p{Mark} (= \p{General_Category=Mark})
2176                             (2450)
2177 X \p{Combining_Marks_For_Symbols}
2178                             \p{Combining_Diacritical_Marks_For_-
2179                             Symbols} (= \p{Block=
2180                             Combining_Diacritical_Marks_For_-
2181                             Symbols}) (48)
2182   \p{Common}              \p{Script_Extensions=Common} (Short:
2183                             \p{Zyyy}) (7873)
2184 X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
2185                             (Short: \p{InIndicNumberForms}) (16)
2186   \p{Comp_Ex}             \p{Full_Composition_Exclusion} (=
2187                             \p{Full_Composition_Exclusion=Y}) (1120)
2188   \p{Comp_Ex: *}          \p{Full_Composition_Exclusion: *}
2189 X \p{Compat_Jamo}         \p{Hangul_Compatibility_Jamo} (= \p{Block=
2190                             Hangul_Compatibility_Jamo}) (96)
2191   \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
2192                             \p{CE}) (81)
2193   \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
2194                             plus all above-Unicode code points:
2195                             U+0000..0957, U+0960..09DB, U+09DE,
2196                             U+09E0..0A32, U+0A34..0A35, U+0A37..0A58
2197                             ...)
2198   \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81:
2199                             U+0958..095F, U+09DC..09DD, U+09DF,
2200                             U+0A33, U+0A36, U+0A59..0A5B ...)
2201   \p{Connector_Punctuation} \p{General_Category=
2202                             Connector_Punctuation} (Short: \p{Pc})
2203                             (10)
2204   \p{Control}             \p{XPosixCntrl} (= \p{General_Category=
2205                             Control}) (65)
2206 X \p{Control_Pictures}    \p{Block=Control_Pictures} (64)
2207   \p{Copt}                \p{Coptic} (= \p{Script_Extensions=
2208                             Coptic}) (NOT \p{Block=Coptic}) (165)
2209   \p{Coptic}              \p{Script_Extensions=Coptic} (Short:
2210                             \p{Copt}; NOT \p{Block=Coptic}) (165)
2211 X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
2212 X \p{Counting_Rod}        \p{Counting_Rod_Numerals} (= \p{Block=
2213                             Counting_Rod_Numerals}) (32)
2214 X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
2215                             \p{InCountingRod}) (32)
2216   \p{Cpmn}                \p{Cypro_Minoan} (= \p{Script_Extensions=
2217                             Cypro_Minoan}) (NOT \p{Block=
2218                             Cypro_Minoan}) (101)
2219   \p{Cprt}                \p{Cypriot} (= \p{Script_Extensions=
2220                             Cypriot}) (112)
2221   \p{Cs}                  \p{Surrogate} (= \p{General_Category=
2222                             Surrogate}) (2048)
2223   \p{Cuneiform}           \p{Script_Extensions=Cuneiform} (Short:
2224                             \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
2225 X \p{Cuneiform_Numbers}   \p{Cuneiform_Numbers_And_Punctuation} (=
2226                             \p{Block=
2227                             Cuneiform_Numbers_And_Punctuation}) (128)
2228 X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
2229                             Cuneiform_Numbers_And_Punctuation}
2230                             (Short: \p{InCuneiformNumbers}) (128)
2231   \p{Currency_Symbol}     \p{General_Category=Currency_Symbol}
2232                             (Short: \p{Sc}) (63)
2233 X \p{Currency_Symbols}    \p{Block=Currency_Symbols} (48)
2234   \p{CWCF}                \p{Changes_When_Casefolded} (=
2235                             \p{Changes_When_Casefolded=Y}) (1506)
2236   \p{CWCF: *}             \p{Changes_When_Casefolded: *}
2237   \p{CWCM}                \p{Changes_When_Casemapped} (=
2238                             \p{Changes_When_Casemapped=Y}) (2927)
2239   \p{CWCM: *}             \p{Changes_When_Casemapped: *}
2240   \p{CWKCF}               \p{Changes_When_NFKC_Casefolded} (=
2241                             \p{Changes_When_NFKC_Casefolded=Y})
2242                             (10_491)
2243   \p{CWKCF: *}            \p{Changes_When_NFKC_Casefolded: *}
2244   \p{CWL}                 \p{Changes_When_Lowercased} (=
2245                             \p{Changes_When_Lowercased=Y}) (1433)
2246   \p{CWL: *}              \p{Changes_When_Lowercased: *}
2247   \p{CWT}                 \p{Changes_When_Titlecased} (=
2248                             \p{Changes_When_Titlecased=Y}) (1452)
2249   \p{CWT: *}              \p{Changes_When_Titlecased: *}
2250   \p{CWU}                 \p{Changes_When_Uppercased} (=
2251                             \p{Changes_When_Uppercased=Y}) (1525)
2252   \p{CWU: *}              \p{Changes_When_Uppercased: *}
2253   \p{Cypriot}             \p{Script_Extensions=Cypriot} (Short:
2254                             \p{Cprt}) (112)
2255 X \p{Cypriot_Syllabary}   \p{Block=Cypriot_Syllabary} (64)
2256   \p{Cypro_Minoan}        \p{Script_Extensions=Cypro_Minoan} (Short:
2257                             \p{Cpmn}; NOT \p{Block=Cypro_Minoan})
2258                             (101)
2259   \p{Cyrillic}            \p{Script_Extensions=Cyrillic} (Short:
2260                             \p{Cyrl}; NOT \p{Block=Cyrillic}) (510)
2261 X \p{Cyrillic_Ext_A}      \p{Cyrillic_Extended_A} (= \p{Block=
2262                             Cyrillic_Extended_A}) (32)
2263 X \p{Cyrillic_Ext_B}      \p{Cyrillic_Extended_B} (= \p{Block=
2264                             Cyrillic_Extended_B}) (96)
2265 X \p{Cyrillic_Ext_C}      \p{Cyrillic_Extended_C} (= \p{Block=
2266                             Cyrillic_Extended_C}) (16)
2267 X \p{Cyrillic_Ext_D}      \p{Cyrillic_Extended_D} (= \p{Block=
2268                             Cyrillic_Extended_D}) (96)
2269 X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
2270                             \p{InCyrillicExtA}) (32)
2271 X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
2272                             \p{InCyrillicExtB}) (96)
2273 X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
2274                             \p{InCyrillicExtC}) (16)
2275 X \p{Cyrillic_Extended_D} \p{Block=Cyrillic_Extended_D} (Short:
2276                             \p{InCyrillicExtD}) (96)
2277 X \p{Cyrillic_Sup}        \p{Cyrillic_Supplement} (= \p{Block=
2278                             Cyrillic_Supplement}) (48)
2279 X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
2280                             \p{InCyrillicSup}) (48)
2281 X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
2282                             Cyrillic_Supplement}) (48)
2283   \p{Cyrl}                \p{Cyrillic} (= \p{Script_Extensions=
2284                             Cyrillic}) (NOT \p{Block=Cyrillic}) (510)
2285   \p{Dash}                \p{Dash=Y} (30)
2286   \p{Dash: N*}            (Single: \P{Dash}) (1_114_082 plus all
2287                             above-Unicode code points: [\x00-\x20!
2288                             \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
2289                             \[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2290                             U+0100..0589, U+058B..05BD,
2291                             U+05BF..13FF, U+1401..1805, U+1807..200F
2292                             ...)
2293   \p{Dash: Y*}            (Single: \p{Dash}) (30: [\-], U+058A,
2294                             U+05BE, U+1400, U+1806, U+2010..2015 ...)
2295   \p{Dash_Punctuation}    \p{General_Category=Dash_Punctuation}
2296                             (Short: \p{Pd}) (26)
2297   \p{Decimal_Number}      \p{XPosixDigit} (= \p{General_Category=
2298                             Decimal_Number}) (680)
2299   \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
2300                             (13_233)
2301   \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_233:
2302                             [\xc0-\xc5\xc7-\xcf\xd1-\xd6\xd9-\xdd
2303                             \xe0-\xe5\xe7-\xef\xf1-\xf6\xf9-\xfd
2304                             \xff], U+0100..010F, U+0112..0125,
2305                             U+0128..0130, U+0134..0137, U+0139..013E
2306                             ...)
2307   \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240:
2308                             U+2460..2473, U+24B6..24EA,
2309                             U+3244..3247, U+3251..327E,
2310                             U+3280..32BF, U+32D0..32FE ...)
2311   \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
2312   \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720: [\xa8
2313                             \xaf\xb4-\xb5\xb8], U+0132..0133,
2314                             U+013F..0140, U+0149, U+017F,
2315                             U+01C4..01CC ...)
2316   \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
2317   \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
2318   \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240: U+FB51,
2319                             U+FB53, U+FB57, U+FB5B, U+FB5F, U+FB63
2320                             ...)
2321   \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1194: U+2102,
2322                             U+210A..2113, U+2115, U+2119..211D,
2323                             U+2124, U+2128 ...)
2324   \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
2325   \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20: [\xbc-
2326                             \xbe], U+2150..215F, U+2189)
2327   \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
2328   \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171: U+FB54,
2329                             U+FB58, U+FB5C, U+FB60, U+FB64, U+FB68
2330                             ...)
2331   \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
2332   \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238: U+FB50,
2333                             U+FB52, U+FB56, U+FB5A, U+FB5E, U+FB62
2334                             ...)
2335   \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
2336   \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82: U+FB55,
2337                             U+FB59, U+FB5D, U+FB61, U+FB65, U+FB69
2338                             ...)
2339   \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
2340   \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122:
2341                             U+FF61..FFBE, U+FFC2..FFC7,
2342                             U+FFCA..FFCF, U+FFD2..FFD7,
2343                             U+FFDA..FFDC, U+FFE8..FFEE)
2344   \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
2345   \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5: [\xa0],
2346                             U+0F0C, U+2007, U+2011, U+202F)
2347   \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
2348                             Non_Canonical} (Perl extension) (3796)
2349   \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
2350                             decompositions (Short: \p{Dt=NonCanon})
2351                             (Perl extension) (3796: [\xa0\xa8\xaa
2352                             \xaf\xb2-\xb5\xb8-\xba\xbc-\xbe],
2353                             U+0132..0133, U+013F..0140, U+0149,
2354                             U+017F, U+01C4..01CC ...)
2355   \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_083 plus
2356                             all above-Unicode code points: [\x00-
2357                             \x9f\xa1-\xa7\xa9\xab-\xae\xb0-\xb1\xb6-
2358                             \xb7\xbb\xbf\xc6\xd0\xd7-\xd8\xde-\xdf
2359                             \xe6\xf0\xf7-\xf8\xfe], U+0110..0111,
2360                             U+0126..0127, U+0131, U+0138,
2361                             U+0141..0142 ...)
2362   \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26:
2363                             U+FE50..FE52, U+FE54..FE66, U+FE68..FE6B)
2364   \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
2365   \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (286)
2366   \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (286: U+3250,
2367                             U+32CC..32CF, U+32FF..3357,
2368                             U+3371..33DF, U+33FF, U+1F130..1F14F ...)
2369   \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (64: U+1D62..1D6A,
2370                             U+2080..208E, U+2090..209C, U+2C7C,
2371                             U+1E051..1E06A)
2372   \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (249)
2373   \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (249: [\xaa\xb2-
2374                             \xb3\xb9-\xba], U+02B0..02B8,
2375                             U+02E0..02E4, U+10FC, U+1D2C..1D2E,
2376                             U+1D30..1D3A ...)
2377   \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
2378   \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35: U+309F,
2379                             U+30FF, U+FE10..FE19, U+FE30..FE44,
2380                             U+FE47..FE48)
2381   \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104: U+3000,
2382                             U+FF01..FF60, U+FFE0..FFE6)
2383   \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
2384                             Y} (Short: \p{DI}) (4174)
2385   \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
2386                             (1_109_938 plus all above-Unicode code
2387                             points: [\x00-\xac\xae-\xff],
2388                             U+0100..034E, U+0350..061B,
2389                             U+061D..115E, U+1161..17B3, U+17B6..180A
2390                             ...)
2391   \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
2392                             (4174: [\xad], U+034F, U+061C,
2393                             U+115F..1160, U+17B4..17B5, U+180B..180F
2394                             ...)
2395   \p{Dep}                 \p{Deprecated} (= \p{Deprecated=Y}) (15)
2396   \p{Dep: *}              \p{Deprecated: *}
2397   \p{Deprecated}          \p{Deprecated=Y} (Short: \p{Dep}) (15)
2398   \p{Deprecated: N*}      (Short: \p{Dep=N}, \P{Dep}) (1_114_097
2399                             plus all above-Unicode code points:
2400                             U+0000..0148, U+014A..0672,
2401                             U+0674..0F76, U+0F78, U+0F7A..17A2,
2402                             U+17A5..2069 ...)
2403   \p{Deprecated: Y*}      (Short: \p{Dep=Y}, \p{Dep}) (15: U+0149,
2404                             U+0673, U+0F77, U+0F79, U+17A3..17A4,
2405                             U+206A..206F ...)
2406   \p{Deseret}             \p{Script_Extensions=Deseret} (Short:
2407                             \p{Dsrt}) (80)
2408   \p{Deva}                \p{Devanagari} (= \p{Script_Extensions=
2409                             Devanagari}) (NOT \p{Block=Devanagari})
2410                             (220)
2411   \p{Devanagari}          \p{Script_Extensions=Devanagari} (Short:
2412                             \p{Deva}; NOT \p{Block=Devanagari}) (220)
2413 X \p{Devanagari_Ext}      \p{Devanagari_Extended} (= \p{Block=
2414                             Devanagari_Extended}) (32)
2415 X \p{Devanagari_Ext_A}    \p{Devanagari_Extended_A} (= \p{Block=
2416                             Devanagari_Extended_A}) (96)
2417 X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
2418                             \p{InDevanagariExt}) (32)
2419 X \p{Devanagari_Extended_A} \p{Block=Devanagari_Extended_A} (Short:
2420                             \p{InDevanagariExtA}) (96)
2421   \p{DI}                  \p{Default_Ignorable_Code_Point} (=
2422                             \p{Default_Ignorable_Code_Point=Y})
2423                             (4174)
2424   \p{DI: *}               \p{Default_Ignorable_Code_Point: *}
2425   \p{Dia}                 \p{Diacritic} (= \p{Diacritic=Y}) (1144)
2426   \p{Dia: *}              \p{Diacritic: *}
2427   \p{Diacritic}           \p{Diacritic=Y} (Short: \p{Dia}) (1144)
2428   \p{Diacritic: N*}       (Short: \p{Dia=N}, \P{Dia}) (1_112_968
2429                             plus all above-Unicode code points:
2430                             [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
2431                             >?\@A-Z\[\\\]_a-z\{\|\}~\x7f-\xa7\xa9-
2432                             \xae\xb0-\xb3\xb5-\xb6\xb9-\xff],
2433                             U+0100..02AF, U+034F, U+0358..035C,
2434                             U+0363..0373, U+0376..0379 ...)
2435   \p{Diacritic: Y*}       (Short: \p{Dia=Y}, \p{Dia}) (1144: [\^`
2436                             \xa8\xaf\xb4\xb7-\xb8], U+02B0..034E,
2437                             U+0350..0357, U+035D..0362,
2438                             U+0374..0375, U+037A ...)
2439 X \p{Diacriticals}        \p{Combining_Diacritical_Marks} (=
2440                             \p{Block=Combining_Diacritical_Marks})
2441                             (112)
2442 X \p{Diacriticals_Ext}    \p{Combining_Diacritical_Marks_Extended}
2443                             (= \p{Block=
2444                             Combining_Diacritical_Marks_Extended})
2445                             (80)
2446 X \p{Diacriticals_For_Symbols}
2447                             \p{Combining_Diacritical_Marks_For_-
2448                             Symbols} (= \p{Block=
2449                             Combining_Diacritical_Marks_For_-
2450                             Symbols}) (48)
2451 X \p{Diacriticals_Sup}    \p{Combining_Diacritical_Marks_Supplement}
2452                             (= \p{Block=
2453                             Combining_Diacritical_Marks_Supplement})
2454                             (64)
2455   \p{Diak}                \p{Dives_Akuru} (= \p{Script_Extensions=
2456                             Dives_Akuru}) (NOT \p{Block=
2457                             Dives_Akuru}) (72)
2458   \p{Digit}               \p{XPosixDigit} (= \p{General_Category=
2459                             Decimal_Number}) (680)
2460 X \p{Dingbats}            \p{Block=Dingbats} (192)
2461   \p{Dives_Akuru}         \p{Script_Extensions=Dives_Akuru} (Short:
2462                             \p{Diak}; NOT \p{Block=Dives_Akuru}) (72)
2463   \p{Dogr}                \p{Dogra} (= \p{Script_Extensions=Dogra})
2464                             (NOT \p{Block=Dogra}) (82)
2465   \p{Dogra}               \p{Script_Extensions=Dogra} (Short:
2466                             \p{Dogr}; NOT \p{Block=Dogra}) (82)
2467 X \p{Domino}              \p{Domino_Tiles} (= \p{Block=
2468                             Domino_Tiles}) (112)
2469 X \p{Domino_Tiles}        \p{Block=Domino_Tiles} (Short:
2470                             \p{InDomino}) (112)
2471   \p{Dsrt}                \p{Deseret} (= \p{Script_Extensions=
2472                             Deseret}) (80)
2473   \p{Dt: *}               \p{Decomposition_Type: *}
2474   \p{Dupl}                \p{Duployan} (= \p{Script_Extensions=
2475                             Duployan}) (NOT \p{Block=Duployan}) (147)
2476   \p{Duployan}            \p{Script_Extensions=Duployan} (Short:
2477                             \p{Dupl}; NOT \p{Block=Duployan}) (147)
2478   \p{Ea: *}               \p{East_Asian_Width: *}
2479 X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
2480                             (208)
2481   \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
2482   \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739: [\xa1
2483                             \xa4\xa7-\xa8\xaa\xad-\xae\xb0-\xb4\xb6-
2484                             \xba\xbc-\xbf\xc6\xd0\xd7-\xd8\xde-\xe1
2485                             \xe6\xe8-\xea\xec-\xed\xf0\xf2-\xf3\xf7-
2486                             \xfa\xfc\xfe], U+0101, U+0111, U+0113,
2487                             U+011B, U+0126..0127 ...)
2488   \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
2489   \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104: U+3000,
2490                             U+FF01..FF60, U+FFE0..FFE6)
2491   \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
2492   \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123: U+20A9,
2493                             U+FF61..FFBE, U+FFC2..FFC7,
2494                             U+FFCA..FFCF, U+FFD2..FFD7, U+FFDA..FFDC
2495                             ...)
2496   \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (792_623 plus
2497                             all above-Unicode code points)
2498   \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
2499   \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111: [\x20-\x7e
2500                             \xa2-\xa3\xa5-\xa6\xac\xaf],
2501                             U+27E6..27ED, U+2985..2986)
2502   \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (792_623 plus all
2503                             above-Unicode code points: [\x00-\x1f
2504                             \x7f-\xa0\xa9\xab\xb5\xbb\xc0-\xc5\xc7-
2505                             \xcf\xd1-\xd6\xd9-\xdd\xe2-\xe5\xe7\xeb
2506                             \xee-\xef\xf1\xf4-\xf6\xfb\xfd\xff],
2507                             U+00FF..0100, U+0102..0110, U+0112,
2508                             U+0114..011A, U+011C..0125 ...)
2509   \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (182_412)
2510   \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (182_412:
2511                             U+1100..115F, U+231A..231B,
2512                             U+2329..232A, U+23E9..23EC, U+23F0,
2513                             U+23F3 ...)
2514   \p{EBase}               \p{Emoji_Modifier_Base} (=
2515                             \p{Emoji_Modifier_Base=Y}) (134)
2516   \p{EBase: *}            \p{Emoji_Modifier_Base: *}
2517   \p{EComp}               \p{Emoji_Component} (= \p{Emoji_Component=
2518                             Y}) (146)
2519   \p{EComp: *}            \p{Emoji_Component: *}
2520   \p{Egyp}                \p{Egyptian_Hieroglyphs} (=
2521                             \p{Script_Extensions=
2522                             Egyptian_Hieroglyphs}) (NOT \p{Block=
2523                             Egyptian_Hieroglyphs}) (1110)
2524 X \p{Egyptian_Hieroglyph_Format_Controls} \p{Block=
2525                             Egyptian_Hieroglyph_Format_Controls} (48)
2526   \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
2527                             Egyptian_Hieroglyphs} (Short: \p{Egyp};
2528                             NOT \p{Block=Egyptian_Hieroglyphs})
2529                             (1110)
2530   \p{Elba}                \p{Elbasan} (= \p{Script_Extensions=
2531                             Elbasan}) (NOT \p{Block=Elbasan}) (40)
2532   \p{Elbasan}             \p{Script_Extensions=Elbasan} (Short:
2533                             \p{Elba}; NOT \p{Block=Elbasan}) (40)
2534   \p{Elym}                \p{Elymaic} (= \p{Script_Extensions=
2535                             Elymaic}) (NOT \p{Block=Elymaic}) (23)
2536   \p{Elymaic}             \p{Script_Extensions=Elymaic} (Short:
2537                             \p{Elym}; NOT \p{Block=Elymaic}) (23)
2538   \p{EMod}                \p{Emoji_Modifier} (= \p{Emoji_Modifier=
2539                             Y}) (5)
2540   \p{EMod: *}             \p{Emoji_Modifier: *}
2541   \p{Emoji}               \p{Emoji=Y} (1424)
2542   \p{Emoji: N*}           (Single: \P{Emoji}) (1_112_688 plus all
2543                             above-Unicode code points: [\x00-\x20!
2544                             \"\$\%&\'\(\)+,\-.\/:;<=>?\@A-Z\[\\\]
2545                             \^_`a-z\{\|\}~\x7f-\xa8\xaa-\xad\xaf-
2546                             \xff], U+0100..203B, U+203D..2048,
2547                             U+204A..2121, U+2123..2138, U+213A..2193
2548                             ...)
2549   \p{Emoji: Y*}           (Single: \p{Emoji}) (1424: [#*0-9\xa9
2550                             \xae], U+203C, U+2049, U+2122, U+2139,
2551                             U+2194..2199 ...)
2552   \p{Emoji_Component}     \p{Emoji_Component=Y} (Short: \p{EComp})
2553                             (146)
2554   \p{Emoji_Component: N*} (Short: \p{EComp=N}, \P{EComp}) (1_113_966
2555                             plus all above-Unicode code points:
2556                             [\x00-\x20!\"\$\%&\'\(\)+,\-.\/:;<=>?
2557                             \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
2558                             U+0100..200C, U+200E..20E2,
2559                             U+20E4..FE0E, U+FE10..1F1E5,
2560                             U+1F200..1F3FA ...)
2561   \p{Emoji_Component: Y*} (Short: \p{EComp=Y}, \p{EComp}) (146:
2562                             [#*0-9], U+200D, U+20E3, U+FE0F,
2563                             U+1F1E6..1F1FF, U+1F3FB..1F3FF ...)
2564   \p{Emoji_Modifier}      \p{Emoji_Modifier=Y} (Short: \p{EMod}) (5)
2565   \p{Emoji_Modifier: N*}  (Short: \p{EMod=N}, \P{EMod}) (1_114_107
2566                             plus all above-Unicode code points:
2567                             U+0000..1F3FA, U+1F400..infinity)
2568   \p{Emoji_Modifier: Y*}  (Short: \p{EMod=Y}, \p{EMod}) (5:
2569                             U+1F3FB..1F3FF)
2570   \p{Emoji_Modifier_Base} \p{Emoji_Modifier_Base=Y} (Short:
2571                             \p{EBase}) (134)
2572   \p{Emoji_Modifier_Base: N*} (Short: \p{EBase=N}, \P{EBase})
2573                             (1_113_978 plus all above-Unicode code
2574                             points: U+0000..261C, U+261E..26F8,
2575                             U+26FA..2709, U+270E..1F384,
2576                             U+1F386..1F3C1, U+1F3C5..1F3C6 ...)
2577   \p{Emoji_Modifier_Base: Y*} (Short: \p{EBase=Y}, \p{EBase}) (134:
2578                             U+261D, U+26F9, U+270A..270D, U+1F385,
2579                             U+1F3C2..1F3C4, U+1F3C7 ...)
2580   \p{Emoji_Presentation}  \p{Emoji_Presentation=Y} (Short:
2581                             \p{EPres}) (1205)
2582   \p{Emoji_Presentation: N*} (Short: \p{EPres=N}, \P{EPres})
2583                             (1_112_907 plus all above-Unicode code
2584                             points: U+0000..2319, U+231C..23E8,
2585                             U+23ED..23EF, U+23F1..23F2,
2586                             U+23F4..25FC, U+25FF..2613 ...)
2587   \p{Emoji_Presentation: Y*} (Short: \p{EPres=Y}, \p{EPres}) (1205:
2588                             U+231A..231B, U+23E9..23EC, U+23F0,
2589                             U+23F3, U+25FD..25FE, U+2614..2615 ...)
2590 X \p{Emoticons}           \p{Block=Emoticons} (80)
2591 X \p{Enclosed_Alphanum}   \p{Enclosed_Alphanumerics} (= \p{Block=
2592                             Enclosed_Alphanumerics}) (160)
2593 X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
2594                             \p{Block=
2595                             Enclosed_Alphanumeric_Supplement}) (256)
2596 X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
2597                             Enclosed_Alphanumeric_Supplement}
2598                             (Short: \p{InEnclosedAlphanumSup}) (256)
2599 X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
2600                             (Short: \p{InEnclosedAlphanum}) (160)
2601 X \p{Enclosed_CJK}        \p{Enclosed_CJK_Letters_And_Months} (=
2602                             \p{Block=
2603                             Enclosed_CJK_Letters_And_Months}) (256)
2604 X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
2605                             Enclosed_CJK_Letters_And_Months} (Short:
2606                             \p{InEnclosedCJK}) (256)
2607 X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
2608                             (= \p{Block=
2609                             Enclosed_Ideographic_Supplement}) (256)
2610 X \p{Enclosed_Ideographic_Supplement} \p{Block=
2611                             Enclosed_Ideographic_Supplement} (Short:
2612                             \p{InEnclosedIdeographicSup}) (256)
2613   \p{Enclosing_Mark}      \p{General_Category=Enclosing_Mark}
2614                             (Short: \p{Me}) (13)
2615   \p{EPres}               \p{Emoji_Presentation} (=
2616                             \p{Emoji_Presentation=Y}) (1205)
2617   \p{EPres: *}            \p{Emoji_Presentation: *}
2618   \p{Ethi}                \p{Ethiopic} (= \p{Script_Extensions=
2619                             Ethiopic}) (NOT \p{Block=Ethiopic}) (523)
2620   \p{Ethiopic}            \p{Script_Extensions=Ethiopic} (Short:
2621                             \p{Ethi}; NOT \p{Block=Ethiopic}) (523)
2622 X \p{Ethiopic_Ext}        \p{Ethiopic_Extended} (= \p{Block=
2623                             Ethiopic_Extended}) (96)
2624 X \p{Ethiopic_Ext_A}      \p{Ethiopic_Extended_A} (= \p{Block=
2625                             Ethiopic_Extended_A}) (48)
2626 X \p{Ethiopic_Ext_B}      \p{Ethiopic_Extended_B} (= \p{Block=
2627                             Ethiopic_Extended_B}) (32)
2628 X \p{Ethiopic_Extended}   \p{Block=Ethiopic_Extended} (Short:
2629                             \p{InEthiopicExt}) (96)
2630 X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
2631                             \p{InEthiopicExtA}) (48)
2632 X \p{Ethiopic_Extended_B} \p{Block=Ethiopic_Extended_B} (Short:
2633                             \p{InEthiopicExtB}) (32)
2634 X \p{Ethiopic_Sup}        \p{Ethiopic_Supplement} (= \p{Block=
2635                             Ethiopic_Supplement}) (32)
2636 X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
2637                             \p{InEthiopicSup}) (32)
2638   \p{Ext}                 \p{Extender} (= \p{Extender=Y}) (50)
2639   \p{Ext: *}              \p{Extender: *}
2640   \p{Extended_Pictographic} \p{Extended_Pictographic=Y} (Short:
2641                             \p{ExtPict}) (3537)
2642   \p{Extended_Pictographic: N*} (Short: \p{ExtPict=N}, \P{ExtPict})
2643                             (1_110_575 plus all above-Unicode code
2644                             points: [\x00-\xa8\xaa-\xad\xaf-\xff],
2645                             U+0100..203B, U+203D..2048,
2646                             U+204A..2121, U+2123..2138, U+213A..2193
2647                             ...)
2648   \p{Extended_Pictographic: Y*} (Short: \p{ExtPict=Y}, \p{ExtPict})
2649                             (3537: [\xa9\xae], U+203C, U+2049,
2650                             U+2122, U+2139, U+2194..2199 ...)
2651   \p{Extender}            \p{Extender=Y} (Short: \p{Ext}) (50)
2652   \p{Extender: N*}        (Short: \p{Ext=N}, \P{Ext}) (1_114_062
2653                             plus all above-Unicode code points:
2654                             [\x00-\xb6\xb8-\xff], U+0100..02CF,
2655                             U+02D2..063F, U+0641..07F9,
2656                             U+07FB..0B54, U+0B56..0E45 ...)
2657   \p{Extender: Y*}        (Short: \p{Ext=Y}, \p{Ext}) (50: [\xb7],
2658                             U+02D0..02D1, U+0640, U+07FA, U+0B55,
2659                             U+0E46 ...)
2660   \p{ExtPict}             \p{Extended_Pictographic} (=
2661                             \p{Extended_Pictographic=Y}) (3537)
2662   \p{ExtPict: *}          \p{Extended_Pictographic: *}
2663   \p{Final_Punctuation}   \p{General_Category=Final_Punctuation}
2664                             (Short: \p{Pf}) (10)
2665   \p{Format}              \p{General_Category=Format} (Short:
2666                             \p{Cf}) (170)
2667   \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
2668                             (Short: \p{CompEx}) (1120)
2669   \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
2670                             \P{CompEx}) (1_112_992 plus all above-
2671                             Unicode code points: U+0000..033F,
2672                             U+0342, U+0345..0373, U+0375..037D,
2673                             U+037F..0386, U+0388..0957 ...)
2674   \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
2675                             \p{CompEx}) (1120: U+0340..0341,
2676                             U+0343..0344, U+0374, U+037E, U+0387,
2677                             U+0958..095F ...)
2678   \p{Gc: *}               \p{General_Category: *}
2679   \p{GCB: *}              \p{Grapheme_Cluster_Break: *}
2680   \p{General_Category: C} \p{General_Category=Other} (965_096 plus
2681                             all above-Unicode code points)
2682   \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
2683                             \p{Gc=LC}, \p{LC}) (4095: [A-Za-z\xb5
2684                             \xc0-\xd6\xd8-\xf6\xf8-\xff],
2685                             U+0100..01BA, U+01BC..01BF,
2686                             U+01C4..0293, U+0295..02AF, U+0370..0373
2687                             ...)
2688   \p{General_Category: Cc} \p{General_Category=Control} (65)
2689   \p{General_Category: Cf} \p{General_Category=Format} (170)
2690   \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
2691                             (77: [\)\]\}], U+0F3B, U+0F3D, U+169C,
2692                             U+2046, U+207E ...)
2693   \p{General_Category: Cn} \p{General_Category=Unassigned} (825_345
2694                             plus all above-Unicode code points)
2695   \p{General_Category: Cntrl} \p{General_Category=Control} (65)
2696   \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
2697   \p{General_Category: Combining_Mark} \p{General_Category=Mark}
2698                             (2450)
2699   \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
2700                             \p{Pc}) (10: [_], U+203F..2040, U+2054,
2701                             U+FE33..FE34, U+FE4D..FE4F, U+FF3F)
2702   \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65:
2703                             [\x00-\x1f\x7f-\x9f])
2704   \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
2705   \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
2706                             (63: [\$\xa2-\xa5], U+058F, U+060B,
2707                             U+07FE..07FF, U+09F2..09F3, U+09FB ...)
2708   \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
2709                             (26: [\-], U+058A, U+05BE, U+1400,
2710                             U+1806, U+2010..2015 ...)
2711   \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
2712                             (680: [0-9], U+0660..0669, U+06F0..06F9,
2713                             U+07C0..07C9, U+0966..096F, U+09E6..09EF
2714                             ...)
2715   \p{General_Category: Digit} \p{General_Category=Decimal_Number}
2716                             (680)
2717   \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
2718                             (13: U+0488..0489, U+1ABE, U+20DD..20E0,
2719                             U+20E2..20E4, U+A670..A672)
2720   \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
2721                             (10: [\xbb], U+2019, U+201D, U+203A,
2722                             U+2E03, U+2E05 ...)
2723   \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (170:
2724                             [\xad], U+0600..0605, U+061C, U+06DD,
2725                             U+070F, U+0890..0891 ...)
2726   \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
2727                             \p{Pi}) (12: [\xab], U+2018,
2728                             U+201B..201C, U+201F, U+2039, U+2E02 ...)
2729   \p{General_Category: L} \p{General_Category=Letter} (136_104)
2730 X \p{General_Category: L&} \p{General_Category=Cased_Letter} (4095)
2731 X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
2732                             the trailing '_' matters in spite of
2733                             loose matching rules. (4095)
2734   \p{General_Category: LC} \p{General_Category=Cased_Letter} (4095)
2735   \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (136_104:
2736                             [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
2737                             \xf8-\xff], U+0100..02C1, U+02C6..02D1,
2738                             U+02E0..02E4, U+02EC, U+02EE ...)
2739   \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
2740                             (236: U+16EE..16F0, U+2160..2182,
2741                             U+2185..2188, U+3007, U+3021..3029,
2742                             U+3038..303A ...)
2743   \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl})
2744                             (1: U+2028)
2745   \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
2746                             (/i= General_Category=Cased_Letter)
2747                             (2233)
2748   \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (397)
2749   \p{General_Category: Lo} \p{General_Category=Other_Letter}
2750                             (131_612)
2751   \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
2752                             /i= General_Category=Cased_Letter)
2753                             (2233: [a-z\xb5\xdf-\xf6\xf8-\xff],
2754                             U+0101, U+0103, U+0105, U+0107, U+0109
2755                             ...)
2756   \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
2757                             (/i= General_Category=Cased_Letter) (31)
2758   \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
2759                             (/i= General_Category=Cased_Letter)
2760                             (1831)
2761   \p{General_Category: M} \p{General_Category=Mark} (2450)
2762   \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2450:
2763                             U+0300..036F, U+0483..0489,
2764                             U+0591..05BD, U+05BF, U+05C1..05C2,
2765                             U+05C4..05C5 ...)
2766   \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948:
2767                             [+<=>\|~\xac\xb1\xd7\xf7], U+03F6,
2768                             U+0606..0608, U+2044, U+2052,
2769                             U+207A..207C ...)
2770   \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (452)
2771   \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
2772   \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
2773                             (1985)
2774   \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
2775                             (397: U+02B0..02C1, U+02C6..02D1,
2776                             U+02E0..02E4, U+02EC, U+02EE, U+0374 ...)
2777   \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
2778                             (125: [\^`\xa8\xaf\xb4\xb8],
2779                             U+02C2..02C5, U+02D2..02DF,
2780                             U+02E5..02EB, U+02ED, U+02EF..02FF ...)
2781   \p{General_Category: N} \p{General_Category=Number} (1831)
2782   \p{General_Category: Nd} \p{General_Category=Decimal_Number} (680)
2783   \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
2784   \p{General_Category: No} \p{General_Category=Other_Number} (915)
2785   \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
2786                             (1985: U+0300..036F, U+0483..0487,
2787                             U+0591..05BD, U+05BF, U+05C1..05C2,
2788                             U+05C4..05C5 ...)
2789   \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1831: [0-9
2790                             \xb2-\xb3\xb9\xbc-\xbe], U+0660..0669,
2791                             U+06F0..06F9, U+07C0..07C9,
2792                             U+0966..096F, U+09E6..09EF ...)
2793   \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
2794                             (79: [\(\[\{], U+0F3A, U+0F3C, U+169B,
2795                             U+201A, U+201E ...)
2796   \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (965_096 plus
2797                             all above-Unicode code points: [\x00-
2798                             \x1f\x7f-\x9f\xad], U+0378..0379,
2799                             U+0380..0383, U+038B, U+038D, U+03A2 ...)
2800   \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
2801                             (131_612: [\xaa\xba], U+01BB,
2802                             U+01C0..01C3, U+0294, U+05D0..05EA,
2803                             U+05EF..05F2 ...)
2804   \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No})
2805                             (915: [\xb2-\xb3\xb9\xbc-\xbe],
2806                             U+09F4..09F9, U+0B72..0B77,
2807                             U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
2808                             ...)
2809   \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
2810                             (628: [!\"#\%&\'*,.\/:;?\@\\\xa1\xa7
2811                             \xb6-\xb7\xbf], U+037E, U+0387,
2812                             U+055A..055F, U+0589, U+05C0 ...)
2813   \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
2814                             (6634: [\xa6\xa9\xae\xb0], U+0482,
2815                             U+058D..058E, U+060E..060F, U+06DE,
2816                             U+06E9 ...)
2817   \p{General_Category: P} \p{General_Category=Punctuation} (842)
2818   \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
2819                             \p{Zp}) (1: U+2029)
2820   \p{General_Category: Pc} \p{General_Category=
2821                             Connector_Punctuation} (10)
2822   \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (26)
2823   \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
2824                             (77)
2825   \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
2826                             (10)
2827   \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
2828                             (12)
2829   \p{General_Category: Po} \p{General_Category=Other_Punctuation}
2830                             (628)
2831   \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
2832                             (137_468: U+E000..F8FF, U+F0000..FFFFD,
2833                             U+100000..10FFFD)
2834   \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (79)
2835   \p{General_Category: Punct} \p{General_Category=Punctuation} (842)
2836   \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (842:
2837                             [!\"#\%&\'\(\)*,\-.\/:;?\@\[\\\]_\{\}
2838                             \xa1\xa7\xab\xb6-\xb7\xbb\xbf], U+037E,
2839                             U+0387, U+055A..055F, U+0589..058A,
2840                             U+05BE ...)
2841   \p{General_Category: S} \p{General_Category=Symbol} (7770)
2842   \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (63)
2843   \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19:
2844                             [\x20\xa0], U+1680, U+2000..200A,
2845                             U+2028..2029, U+202F, U+205F ...)
2846   \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (125)
2847   \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
2848   \p{General_Category: So} \p{General_Category=Other_Symbol} (6634)
2849   \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
2850                             (17: [\x20\xa0], U+1680, U+2000..200A,
2851                             U+202F, U+205F, U+3000)
2852   \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc})
2853                             (452: U+0903, U+093B, U+093E..0940,
2854                             U+0949..094C, U+094E..094F, U+0982..0983
2855                             ...)
2856   \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048:
2857                             U+D800..DFFF)
2858   \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (7770:
2859                             [\$+<=>\^`\|~\xa2-\xa6\xa8-\xa9\xac\xae-
2860                             \xb1\xb4\xb8\xd7\xf7], U+02C2..02C5,
2861                             U+02D2..02DF, U+02E5..02EB, U+02ED,
2862                             U+02EF..02FF ...)
2863   \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
2864                             /i= General_Category=Cased_Letter) (31:
2865                             U+01C5, U+01C8, U+01CB, U+01F2,
2866                             U+1F88..1F8F, U+1F98..1F9F ...)
2867   \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
2868                             (825_345 plus all above-Unicode code
2869                             points: U+0378..0379, U+0380..0383,
2870                             U+038B, U+038D, U+03A2, U+0530 ...)
2871   \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
2872                             /i= General_Category=Cased_Letter)
2873                             (1831: [A-Z\xc0-\xd6\xd8-\xde], U+0100,
2874                             U+0102, U+0104, U+0106, U+0108 ...)
2875   \p{General_Category: Z} \p{General_Category=Separator} (19)
2876   \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
2877   \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
2878                             (1)
2879   \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
2880 X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
2881                             \p{InPunctuation}) (112)
2882 X \p{Geometric_Shapes}    \p{Block=Geometric_Shapes} (96)
2883 X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
2884                             \p{Block=Geometric_Shapes_Extended})
2885                             (128)
2886 X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
2887                             (Short: \p{InGeometricShapesExt}) (128)
2888   \p{Geor}                \p{Georgian} (= \p{Script_Extensions=
2889                             Georgian}) (NOT \p{Block=Georgian}) (174)
2890   \p{Georgian}            \p{Script_Extensions=Georgian} (Short:
2891                             \p{Geor}; NOT \p{Block=Georgian}) (174)
2892 X \p{Georgian_Ext}        \p{Georgian_Extended} (= \p{Block=
2893                             Georgian_Extended}) (48)
2894 X \p{Georgian_Extended}   \p{Block=Georgian_Extended} (Short:
2895                             \p{InGeorgianExt}) (48)
2896 X \p{Georgian_Sup}        \p{Georgian_Supplement} (= \p{Block=
2897                             Georgian_Supplement}) (48)
2898 X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
2899                             \p{InGeorgianSup}) (48)
2900   \p{Glag}                \p{Glagolitic} (= \p{Script_Extensions=
2901                             Glagolitic}) (NOT \p{Block=Glagolitic})
2902                             (138)
2903   \p{Glagolitic}          \p{Script_Extensions=Glagolitic} (Short:
2904                             \p{Glag}; NOT \p{Block=Glagolitic}) (138)
2905 X \p{Glagolitic_Sup}      \p{Glagolitic_Supplement} (= \p{Block=
2906                             Glagolitic_Supplement}) (48)
2907 X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
2908                             \p{InGlagoliticSup}) (48)
2909   \p{Gong}                \p{Gunjala_Gondi} (= \p{Script_Extensions=
2910                             Gunjala_Gondi}) (NOT \p{Block=
2911                             Gunjala_Gondi}) (65)
2912   \p{Gonm}                \p{Masaram_Gondi} (= \p{Script_Extensions=
2913                             Masaram_Gondi}) (NOT \p{Block=
2914                             Masaram_Gondi}) (77)
2915   \p{Goth}                \p{Gothic} (= \p{Script_Extensions=
2916                             Gothic}) (NOT \p{Block=Gothic}) (27)
2917   \p{Gothic}              \p{Script_Extensions=Gothic} (Short:
2918                             \p{Goth}; NOT \p{Block=Gothic}) (27)
2919   \p{Gr_Base}             \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
2920                             (146_986)
2921   \p{Gr_Base: *}          \p{Grapheme_Base: *}
2922   \p{Gr_Ext}              \p{Grapheme_Extend} (= \p{Grapheme_Extend=
2923                             Y}) (2125)
2924   \p{Gr_Ext: *}           \p{Grapheme_Extend: *}
2925   \p{Gran}                \p{Grantha} (= \p{Script_Extensions=
2926                             Grantha}) (NOT \p{Block=Grantha}) (116)
2927   \p{Grantha}             \p{Script_Extensions=Grantha} (Short:
2928                             \p{Gran}; NOT \p{Block=Grantha}) (116)
2929   \p{Graph}               \p{XPosixGraph} (286_635)
2930   \p{Grapheme_Base}       \p{Grapheme_Base=Y} (Short: \p{GrBase})
2931                             (146_986)
2932   \p{Grapheme_Base: N*}   (Short: \p{GrBase=N}, \P{GrBase}) (967_126
2933                             plus all above-Unicode code points:
2934                             [\x00-\x1f\x7f-\x9f\xad], U+0300..036F,
2935                             U+0378..0379, U+0380..0383, U+038B,
2936                             U+038D ...)
2937   \p{Grapheme_Base: Y*}   (Short: \p{GrBase=Y}, \p{GrBase})
2938                             (146_986: [\x20-\x7e\xa0-\xac\xae-\xff],
2939                             U+0100..02FF, U+0370..0377,
2940                             U+037A..037F, U+0384..038A, U+038C ...)
2941   \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
2942                             (3893)
2943   \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (3893: [^
2944                             \n\r\x20-\x7e\xa0-\xac\xae-\xff],
2945                             U+061C, U+180E, U+200B, U+200E..200F,
2946                             U+2028..202E ...)
2947   \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1: [\r])
2948   \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (0)
2949   \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (0)
2950   \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (0)
2951   \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
2952                             (0)
2953   \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
2954                             E_Base_GAZ} (0)
2955   \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
2956                             E_Modifier} (0)
2957   \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
2958                             (2130)
2959   \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (2130:
2960                             U+0300..036F, U+0483..0489,
2961                             U+0591..05BD, U+05BF, U+05C1..05C2,
2962                             U+05C4..05C5 ...)
2963   \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
2964                             Glue_After_Zwj} (0)
2965   \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (0)
2966   \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125:
2967                             U+1100..115F, U+A960..A97C)
2968   \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1: [\n])
2969   \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399: U+AC00,
2970                             U+AC1C, U+AC38, U+AC54, U+AC70, U+AC8C
2971                             ...)
2972   \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773:
2973                             U+AC01..AC1B, U+AC1D..AC37,
2974                             U+AC39..AC53, U+AC55..AC6F,
2975                             U+AC71..AC8B, U+AC8D..ACA7 ...)
2976   \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_096_109
2977                             plus all above-Unicode code points:
2978                             [\x20-\x7e\xa0-\xac\xae-\xff],
2979                             U+0100..02FF, U+0370..0482,
2980                             U+048A..0590, U+05BE, U+05C0 ...)
2981   \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
2982                             (27)
2983   \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (27:
2984                             U+0600..0605, U+06DD, U+070F,
2985                             U+0890..0891, U+08E2, U+0D4E ...)
2986   \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
2987                             (26: U+1F1E6..1F1FF)
2988   \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
2989                             Regional_Indicator} (26)
2990   \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
2991                             SpacingMark} (395)
2992   \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (395:
2993                             U+0903, U+093B, U+093E..0940,
2994                             U+0949..094C, U+094E..094F, U+0982..0983
2995                             ...)
2996   \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137:
2997                             U+11A8..11FF, U+D7CB..D7FB)
2998   \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95:
2999                             U+1160..11A7, U+D7B0..D7C6)
3000   \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
3001                             (1_096_109 plus all above-Unicode code
3002                             points)
3003   \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1: U+200D)
3004   \p{Grapheme_Extend}     \p{Grapheme_Extend=Y} (Short: \p{GrExt})
3005                             (2125)
3006   \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_111_987
3007                             plus all above-Unicode code points:
3008                             U+0000..02FF, U+0370..0482,
3009                             U+048A..0590, U+05BE, U+05C0, U+05C3 ...)
3010   \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (2125:
3011                             U+0300..036F, U+0483..0489,
3012                             U+0591..05BD, U+05BF, U+05C1..05C2,
3013                             U+05C4..05C5 ...)
3014   \p{Greek}               \p{Script_Extensions=Greek} (Short:
3015                             \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
3016 X \p{Greek_And_Coptic}    \p{Block=Greek_And_Coptic} (Short:
3017                             \p{InGreek}) (144)
3018 X \p{Greek_Ext}           \p{Greek_Extended} (= \p{Block=
3019                             Greek_Extended}) (256)
3020 X \p{Greek_Extended}      \p{Block=Greek_Extended} (Short:
3021                             \p{InGreekExt}) (256)
3022   \p{Grek}                \p{Greek} (= \p{Script_Extensions=Greek})
3023                             (NOT \p{Greek_And_Coptic}) (522)
3024   \p{Gujarati}            \p{Script_Extensions=Gujarati} (Short:
3025                             \p{Gujr}; NOT \p{Block=Gujarati}) (105)
3026   \p{Gujr}                \p{Gujarati} (= \p{Script_Extensions=
3027                             Gujarati}) (NOT \p{Block=Gujarati}) (105)
3028   \p{Gunjala_Gondi}       \p{Script_Extensions=Gunjala_Gondi}
3029                             (Short: \p{Gong}; NOT \p{Block=
3030                             Gunjala_Gondi}) (65)
3031   \p{Gurmukhi}            \p{Script_Extensions=Gurmukhi} (Short:
3032                             \p{Guru}; NOT \p{Block=Gurmukhi}) (94)
3033   \p{Guru}                \p{Gurmukhi} (= \p{Script_Extensions=
3034                             Gurmukhi}) (NOT \p{Block=Gurmukhi}) (94)
3035 X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
3036                             \p{Block=Halfwidth_And_Fullwidth_Forms})
3037                             (240)
3038 X \p{Half_Marks}          \p{Combining_Half_Marks} (= \p{Block=
3039                             Combining_Half_Marks}) (16)
3040 X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
3041                             Halfwidth_And_Fullwidth_Forms} (Short:
3042                             \p{InHalfAndFullForms}) (240)
3043   \p{Han}                 \p{Script_Extensions=Han} (98_696)
3044   \p{Hang}                \p{Hangul} (= \p{Script_Extensions=
3045                             Hangul}) (NOT \p{Hangul_Syllables})
3046                             (11_775)
3047   \p{Hangul}              \p{Script_Extensions=Hangul} (Short:
3048                             \p{Hang}; NOT \p{Hangul_Syllables})
3049                             (11_775)
3050 X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
3051                             (Short: \p{InCompatJamo}) (96)
3052 X \p{Hangul_Jamo}         \p{Block=Hangul_Jamo} (Short: \p{InJamo})
3053                             (256)
3054 X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
3055                             (Short: \p{InJamoExtA}) (32)
3056 X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
3057                             (Short: \p{InJamoExtB}) (80)
3058   \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
3059                             (125)
3060   \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125:
3061                             U+1100..115F, U+A960..A97C)
3062   \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
3063                             (399)
3064   \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399:
3065                             U+AC00, U+AC1C, U+AC38, U+AC54, U+AC70,
3066                             U+AC8C ...)
3067   \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
3068                             LVT_Syllable} (10_773)
3069   \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
3070                             (10_773: U+AC01..AC1B, U+AC1D..AC37,
3071                             U+AC39..AC53, U+AC55..AC6F,
3072                             U+AC71..AC8B, U+AC8D..ACA7 ...)
3073   \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
3074                             Not_Applicable} (1_102_583 plus all
3075                             above-Unicode code points)
3076   \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
3077                             (1_102_583 plus all above-Unicode code
3078                             points: U+0000..10FF, U+1200..A95F,
3079                             U+A97D..ABFF, U+D7A4..D7AF,
3080                             U+D7C7..D7CA, U+D7FC..infinity)
3081   \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
3082                             (137)
3083   \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137:
3084                             U+11A8..11FF, U+D7CB..D7FB)
3085   \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
3086                             (95)
3087   \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95:
3088                             U+1160..11A7, U+D7B0..D7C6)
3089 X \p{Hangul_Syllables}    \p{Block=Hangul_Syllables} (Short:
3090                             \p{InHangul}) (11_184)
3091   \p{Hani}                \p{Han} (= \p{Script_Extensions=Han})
3092                             (98_696)
3093   \p{Hanifi_Rohingya}     \p{Script_Extensions=Hanifi_Rohingya}
3094                             (Short: \p{Rohg}; NOT \p{Block=
3095                             Hanifi_Rohingya}) (55)
3096   \p{Hano}                \p{Hanunoo} (= \p{Script_Extensions=
3097                             Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
3098   \p{Hanunoo}             \p{Script_Extensions=Hanunoo} (Short:
3099                             \p{Hano}; NOT \p{Block=Hanunoo}) (23)
3100   \p{Hatr}                \p{Hatran} (= \p{Script_Extensions=
3101                             Hatran}) (NOT \p{Block=Hatran}) (26)
3102   \p{Hatran}              \p{Script_Extensions=Hatran} (Short:
3103                             \p{Hatr}; NOT \p{Block=Hatran}) (26)
3104   \p{Hebr}                \p{Hebrew} (= \p{Script_Extensions=
3105                             Hebrew}) (NOT \p{Block=Hebrew}) (134)
3106   \p{Hebrew}              \p{Script_Extensions=Hebrew} (Short:
3107                             \p{Hebr}; NOT \p{Block=Hebrew}) (134)
3108   \p{Hex}                 \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3109   \p{Hex: *}              \p{Hex_Digit: *}
3110   \p{Hex_Digit}           \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
3111   \p{Hex_Digit: N*}       (Short: \p{Hex=N}, \P{Hex}) (1_114_068
3112                             plus all above-Unicode code points:
3113                             [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
3114                             \@G-Z\[\\\]\^_`g-z\{\|\}~\x7f-\xff],
3115                             U+0100..FF0F, U+FF1A..FF20,
3116                             U+FF27..FF40, U+FF47..infinity)
3117   \p{Hex_Digit: Y*}       (Short: \p{Hex=Y}, \p{Hex}) (44: [0-9A-Fa-
3118                             f], U+FF10..FF19, U+FF21..FF26,
3119                             U+FF41..FF46)
3120 X \p{High_Private_Use_Surrogates} \p{Block=
3121                             High_Private_Use_Surrogates} (Short:
3122                             \p{InHighPUSurrogates}) (128)
3123 X \p{High_PU_Surrogates}  \p{High_Private_Use_Surrogates} (=
3124                             \p{Block=High_Private_Use_Surrogates})
3125                             (128)
3126 X \p{High_Surrogates}     \p{Block=High_Surrogates} (896)
3127   \p{Hira}                \p{Hiragana} (= \p{Script_Extensions=
3128                             Hiragana}) (NOT \p{Block=Hiragana}) (433)
3129   \p{Hiragana}            \p{Script_Extensions=Hiragana} (Short:
3130                             \p{Hira}; NOT \p{Block=Hiragana}) (433)
3131   \p{Hluw}                \p{Anatolian_Hieroglyphs} (=
3132                             \p{Script_Extensions=
3133                             Anatolian_Hieroglyphs}) (NOT \p{Block=
3134                             Anatolian_Hieroglyphs}) (583)
3135   \p{Hmng}                \p{Pahawh_Hmong} (= \p{Script_Extensions=
3136                             Pahawh_Hmong}) (NOT \p{Block=
3137                             Pahawh_Hmong}) (127)
3138   \p{Hmnp}                \p{Nyiakeng_Puachue_Hmong} (=
3139                             \p{Script_Extensions=
3140                             Nyiakeng_Puachue_Hmong}) (NOT \p{Block=
3141                             Nyiakeng_Puachue_Hmong}) (71)
3142   \p{HorizSpace}          \p{XPosixBlank} (18)
3143   \p{Hst: *}              \p{Hangul_Syllable_Type: *}
3144   \p{Hung}                \p{Old_Hungarian} (= \p{Script_Extensions=
3145                             Old_Hungarian}) (NOT \p{Block=
3146                             Old_Hungarian}) (108)
3147 D \p{Hyphen}              \p{Hyphen=Y} (11)
3148 D \p{Hyphen: N*}          Supplanted by Line_Break property values;
3149                             see www.unicode.org/reports/tr14
3150                             (Single: \P{Hyphen}) (1_114_101 plus all
3151                             above-Unicode code points: [\x00-\x20!
3152                             \"#\$\%&\'\(\)*+,.\/0-9:;<=>?\@A-Z
3153                             \[\\\]\^_`a-z\{\|\}~\x7f-\xac\xae-\xff],
3154                             U+0100..0589, U+058B..1805,
3155                             U+1807..200F, U+2012..2E16, U+2E18..30FA
3156                             ...)
3157 D \p{Hyphen: Y*}          Supplanted by Line_Break property values;
3158                             see www.unicode.org/reports/tr14
3159                             (Single: \p{Hyphen}) (11: [\-\xad],
3160                             U+058A, U+1806, U+2010..2011, U+2E17,
3161                             U+30FB ...)
3162   \p{ID_Continue}         \p{ID_Continue=Y} (Short: \p{IDC}; NOT
3163                             \p{Ideographic_Description_Characters})
3164                             (139_482)
3165   \p{ID_Continue: N*}     (Short: \p{IDC=N}, \P{IDC}) (974_630 plus
3166                             all above-Unicode code points: [\x00-
3167                             \x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
3168                             \[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4\xb6
3169                             \xb8-\xb9\xbb-\xbf\xd7\xf7],
3170                             U+02C2..02C5, U+02D2..02DF,
3171                             U+02E5..02EB, U+02ED, U+02EF..02FF ...)
3172   \p{ID_Continue: Y*}     (Short: \p{IDC=Y}, \p{IDC}) (139_482:
3173                             [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
3174                             \xd8-\xf6\xf8-\xff], U+0100..02C1,
3175                             U+02C6..02D1, U+02E0..02E4, U+02EC,
3176                             U+02EE ...)
3177   \p{ID_Start}            \p{ID_Start=Y} (Short: \p{IDS}) (136_345)
3178   \p{ID_Start: N*}        (Short: \p{IDS=N}, \P{IDS}) (977_767 plus
3179                             all above-Unicode code points: [\x00-
3180                             \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@
3181                             \[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4\xb6-
3182                             \xb9\xbb-\xbf\xd7\xf7], U+02C2..02C5,
3183                             U+02D2..02DF, U+02E5..02EB, U+02ED,
3184                             U+02EF..036F ...)
3185   \p{ID_Start: Y*}        (Short: \p{IDS=Y}, \p{IDS}) (136_345: [A-
3186                             Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
3187                             \xff], U+0100..02C1, U+02C6..02D1,
3188                             U+02E0..02E4, U+02EC, U+02EE ...)
3189   \p{IDC}                 \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
3190                             \p{Ideographic_Description_Characters})
3191                             (139_482)
3192   \p{IDC: *}              \p{ID_Continue: *}
3193   \p{Identifier_Status: Allowed} (112_159: [\'\-.0-9:A-Z_a-z\xb7
3194                             \xc0-\xd6\xd8-\xf6\xf8-\xff],
3195                             U+0100..0131, U+0134..013E,
3196                             U+0141..0148, U+014A..017E, U+018F ...)
3197   \p{Identifier_Status: Restricted} (1_001_953 plus all above-
3198                             Unicode code points: [\x00-\x20!\"#\$
3199                             \%&\(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x7f-
3200                             \xb6\xb8-\xbf\xd7\xf7], U+0132..0133,
3201                             U+013F..0140, U+0149, U+017F..018E,
3202                             U+0190..019F ...)
3203   \p{Identifier_Type: Default_Ignorable} (398: [\xad], U+034F,
3204                             U+061C, U+115F..1160, U+17B4..17B5,
3205                             U+180B..180F ...)
3206   \p{Identifier_Type: Deprecated} (15: U+0149, U+0673, U+0F77,
3207                             U+0F79, U+17A3..17A4, U+206A..206F ...)
3208   \p{Identifier_Type: Exclusion} (17_241: U+03E2..03EF,
3209                             U+0800..082D, U+0830..083E,
3210                             U+1680..169C, U+16A0..16EA, U+16EE..16F8
3211                             ...)
3212   \p{Identifier_Type: Inclusion} (17: [\'\-.:\xb7], U+0375, U+058A,
3213                             U+05F3..05F4, U+06FD..06FE, U+0F0B ...)
3214   \p{Identifier_Type: Limited_Use} (5268: U+0700..070D,
3215                             U+070F..074A, U+074D..074F,
3216                             U+07C0..07FA, U+07FD..07FF, U+0840..085B
3217                             ...)
3218   \p{Identifier_Type: Not_Character} (964_920 plus all above-Unicode
3219                             code points: [^\t\n\cK\f\r\x20-\x7e\x85
3220                             \xa0-\xff], U+0378..0379, U+0380..0383,
3221                             U+038B, U+038D, U+03A2 ...)
3222   \p{Identifier_Type: Not_NFKC} (4921: [\xa0\xa8\xaa\xaf\xb2-\xb5
3223                             \xb8-\xba\xbc-\xbe], U+0132..0133,
3224                             U+013F..0140, U+017F, U+01C4..01CC,
3225                             U+01F1..01F3 ...)
3226   \p{Identifier_Type: Not_XID} (8277: [\t\n\cK\f\r\x20!\"#\$\%&
3227                             \(\)*+,\/;<=>?\@\[\\\]\^`\{\|\}~\x85
3228                             \xa1-\xa7\xa9\xab-\xac\xae\xb0-\xb1\xb6
3229                             \xbb\xbf\xd7\xf7], U+02C2..02C5,
3230                             U+02D2..02D7, U+02DE..02DF,
3231                             U+02E5..02EB, U+02ED ...)
3232   \p{Identifier_Type: Obsolete} (1627: U+018D, U+01AA..01AB,
3233                             U+01B9..01BB, U+01BE..01BF,
3234                             U+01F6..01F7, U+021C..021D ...)
3235   \p{Identifier_Type: Recommended} (112_142: [0-9A-Z_a-z\xc0-\xd6
3236                             \xd8-\xf6\xf8-\xff], U+0100..0131,
3237                             U+0134..013E, U+0141..0148,
3238                             U+014A..017E, U+018F ...)
3239   \p{Identifier_Type: Technical} (1660: U+0180, U+018D,
3240                             U+01AA..01AB, U+01BA..01BB, U+01BE,
3241                             U+01C0..01C3 ...)
3242   \p{Identifier_Type: Uncommon_Use} (396: U+0181..018C, U+018E,
3243                             U+0190..019F, U+01A2..01A9,
3244                             U+01AC..01AE, U+01B1..01B8 ...)
3245   \p{Ideo}                \p{Ideographic} (= \p{Ideographic=Y})
3246                             (105_854)
3247   \p{Ideo: *}             \p{Ideographic: *}
3248   \p{Ideographic}         \p{Ideographic=Y} (Short: \p{Ideo})
3249                             (105_854)
3250   \p{Ideographic: N*}     (Short: \p{Ideo=N}, \P{Ideo}) (1_008_258
3251                             plus all above-Unicode code points:
3252                             U+0000..3005, U+3008..3020,
3253                             U+302A..3037, U+303B..33FF,
3254                             U+4DC0..4DFF, U+A000..F8FF ...)
3255   \p{Ideographic: Y*}     (Short: \p{Ideo=Y}, \p{Ideo}) (105_854:
3256                             U+3006..3007, U+3021..3029,
3257                             U+3038..303A, U+3400..4DBF,
3258                             U+4E00..9FFF, U+F900..FA6D ...)
3259 X \p{Ideographic_Description_Characters} \p{Block=
3260                             Ideographic_Description_Characters}
3261                             (Short: \p{InIDC}) (16)
3262 X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
3263                             \p{Block=
3264                             Ideographic_Symbols_And_Punctuation})
3265                             (32)
3266 X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
3267                             Ideographic_Symbols_And_Punctuation}
3268                             (Short: \p{InIdeographicSymbols}) (32)
3269   \p{IDS}                 \p{ID_Start} (= \p{ID_Start=Y}) (136_345)
3270   \p{IDS: *}              \p{ID_Start: *}
3271   \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
3272                             \p{IDSB}) (10)
3273   \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
3274                             (1_114_102 plus all above-Unicode code
3275                             points: U+0000..2FEF, U+2FF2..2FF3,
3276                             U+2FFC..infinity)
3277   \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10:
3278                             U+2FF0..2FF1, U+2FF4..2FFB)
3279   \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
3280                             \p{IDST}) (2)
3281   \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
3282                             (1_114_110 plus all above-Unicode code
3283                             points: U+0000..2FF1, U+2FF4..infinity)
3284   \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2:
3285                             U+2FF2..2FF3)
3286   \p{IDSB}                \p{IDS_Binary_Operator} (=
3287                             \p{IDS_Binary_Operator=Y}) (10)
3288   \p{IDSB: *}             \p{IDS_Binary_Operator: *}
3289   \p{IDST}                \p{IDS_Trinary_Operator} (=
3290                             \p{IDS_Trinary_Operator=Y}) (2)
3291   \p{IDST: *}             \p{IDS_Trinary_Operator: *}
3292   \p{Imperial_Aramaic}    \p{Script_Extensions=Imperial_Aramaic}
3293                             (Short: \p{Armi}; NOT \p{Block=
3294                             Imperial_Aramaic}) (31)
3295   \p{In: *}               \p{Present_In: *} (Perl extension)
3296 X \p{In_*}                \p{Block: *}
3297 X \p{Indic_Number_Forms}  \p{Common_Indic_Number_Forms} (= \p{Block=
3298                             Common_Indic_Number_Forms}) (16)
3299   \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
3300                             (356: U+093C, U+0941..0944, U+094D,
3301                             U+0952, U+0956..0957, U+0962..0963 ...)
3302   \p{Indic_Positional_Category: Bottom_And_Left} (Short: \p{InPC=
3303                             BottomAndLeft}) (1: U+A9BF)
3304   \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
3305                             BottomAndRight}) (4: U+1B3B, U+A9BE,
3306                             U+A9C0, U+11942)
3307   \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (66:
3308                             U+093F, U+094E, U+09BF, U+09C7..09C8,
3309                             U+0A3F, U+0ABF ...)
3310   \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
3311                             LeftAndRight}) (22: U+09CB..09CC,
3312                             U+0B4B, U+0BCA..0BCC, U+0D4A..0D4C,
3313                             U+0DDC, U+0DDE ...)
3314   \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_112_875
3315                             plus all above-Unicode code points:
3316                             U+0000..08FF, U+0904..0939, U+093D,
3317                             U+0950, U+0953..0954, U+0958..0961 ...)
3318   \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
3319                             Overstruck}) (10: U+1CD4, U+1CE2..1CE8,
3320                             U+10A01, U+10A06)
3321   \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (295:
3322                             U+0903, U+093B, U+093E, U+0940,
3323                             U+0949..094C, U+094F ...)
3324   \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (428:
3325                             U+0900..0902, U+093A, U+0945..0948,
3326                             U+0951, U+0955, U+0981 ...)
3327   \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
3328                             TopAndBottom}) (10: U+0C48, U+0F73,
3329                             U+0F76..0F79, U+0F81, U+1B3C,
3330                             U+1112E..1112F)
3331   \p{Indic_Positional_Category: Top_And_Bottom_And_Left} (Short:
3332                             \p{InPC=TopAndBottomAndLeft}) (2:
3333                             U+103C, U+1171E)
3334   \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
3335                             \p{InPC=TopAndBottomAndRight}) (1:
3336                             U+1B3D)
3337   \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
3338                             TopAndLeft}) (6: U+0B48, U+0DDA, U+17BE,
3339                             U+1C29, U+114BB, U+115B9)
3340   \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
3341                             \p{InPC=TopAndLeftAndRight}) (4: U+0B4C,
3342                             U+0DDD, U+17BF, U+115BB)
3343   \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
3344                             TopAndRight}) (13: U+0AC9, U+0B57,
3345                             U+0CC0, U+0CC7..0CC8, U+0CCA..0CCB,
3346                             U+1925..1926 ...)
3347   \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
3348                             VisualOrderLeft}) (19: U+0E40..0E44,
3349                             U+0EC0..0EC4, U+19B5..19B7, U+19BA,
3350                             U+AAB5..AAB6, U+AAB9 ...)
3351 X \p{Indic_Siyaq_Numbers} \p{Block=Indic_Siyaq_Numbers} (80)
3352   \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
3353                             (17: U+093D, U+09BD, U+0ABD, U+0B3D,
3354                             U+0C3D, U+0CBD ...)
3355   \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (94:
3356                             U+0900..0902, U+0981..0982, U+09FC,
3357                             U+0A01..0A02, U+0A70, U+0A81..0A82 ...)
3358   \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
3359                             \p{InSC=BrahmiJoiningNumber}) (20:
3360                             U+11052..11065)
3361   \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
3362                             CantillationMark}) (58: U+0951..0952,
3363                             U+0A51, U+0AFA, U+0AFC, U+1CD0..1CD2,
3364                             U+1CD4..1CE1 ...)
3365   \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
3366                             (2241: U+0915..0939, U+0958..095F,
3367                             U+0978..097F, U+0995..09A8,
3368                             U+09AA..09B0, U+09B2 ...)
3369   \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
3370                             ConsonantDead}) (14: U+09CE, U+0C5D,
3371                             U+0CDD, U+0D54..0D56, U+0D7A..0D7F,
3372                             U+1CF2..1CF3)
3373   \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
3374                             ConsonantFinal}) (70: U+1930..1931,
3375                             U+1933..1939, U+19C1..19C7,
3376                             U+1A58..1A59, U+1B03, U+1B81 ...)
3377   \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
3378                             \p{InSC=ConsonantHeadLetter}) (5:
3379                             U+0F88..0F8C)
3380   \p{Indic_Syllabic_Category: Consonant_Initial_Postfixed} (Short:
3381                             \p{InSC=ConsonantInitialPostfixed}) (1:
3382                             U+1A5A)
3383   \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
3384                             ConsonantKiller}) (2: U+0E4C, U+17CD)
3385   \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
3386                             ConsonantMedial}) (31: U+0A75,
3387                             U+0EBC..0EBD, U+103B..103E,
3388                             U+105E..1060, U+1082, U+1A55..1A56 ...)
3389   \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
3390                             \p{InSC=ConsonantPlaceholder}) (22: [\-
3391                             \xa0\xd7], U+0980, U+0A72..0A73, U+104B,
3392                             U+104E, U+1900 ...)
3393   \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
3394                             \p{InSC=ConsonantPrecedingRepha}) (4:
3395                             U+0D4E, U+11941, U+11D46, U+11F02)
3396   \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
3397                             ConsonantPrefixed}) (10: U+111C2..111C3,
3398                             U+1193F, U+11A3A, U+11A84..11A89)
3399   \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
3400                             ConsonantSubjoined}) (94: U+0F8D..0F97,
3401                             U+0F99..0FBC, U+1929..192B, U+1A57,
3402                             U+1A5B..1A5E, U+1BA1..1BA3 ...)
3403   \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
3404                             \p{InSC=ConsonantSucceedingRepha}) (1:
3405                             U+17CC)
3406   \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
3407                             \p{InSC=ConsonantWithStacker}) (8:
3408                             U+0CF1..0CF2, U+1CF5..1CF6,
3409                             U+11003..11004, U+11460..11461)
3410   \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
3411                             GeminationMark}) (4: U+0A71, U+0AFB,
3412                             U+11237, U+11A98)
3413   \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
3414                             InvisibleStacker}) (13: U+1039, U+17D2,
3415                             U+1A60, U+1BAB, U+AAF6, U+10A3F ...)
3416   \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1:
3417                             U+200D)
3418   \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
3419                             ModifyingLetter}) (1: U+0B83)
3420   \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
3421                             NonJoiner}) (1: U+200C)
3422   \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (32:
3423                             U+093C, U+09BC, U+0A3C, U+0ABC,
3424                             U+0AFD..0AFF, U+0B3C ...)
3425   \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (501:
3426                             [0-9], U+0966..096F, U+09E6..09EF,
3427                             U+0A66..0A6F, U+0AE6..0AEF, U+0B66..0B6F
3428                             ...)
3429   \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
3430                             NumberJoiner}) (1: U+1107F)
3431   \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
3432                             (1_109_473 plus all above-Unicode code
3433                             points: [\x00-\x20!\"#\$\%&\'\(\)*+,.
3434                             \/:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
3435                             \x9f\xa1-\xb1\xb4-\xd6\xd8-\xff],
3436                             U+0100..08FF, U+0950, U+0953..0954,
3437                             U+0964..0965, U+0970..0971 ...)
3438   \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
3439                             PureKiller}) (26: U+0D3B..0D3C, U+0E3A,
3440                             U+0E4E, U+0EBA, U+0F84, U+103A ...)
3441   \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
3442                             RegisterShifter}) (2: U+17C9..17CA)
3443   \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
3444                             SyllableModifier}) (26: [\xb2-\xb3],
3445                             U+09FE, U+0ECE, U+0F35, U+0F37, U+0FC6
3446                             ...)
3447   \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
3448                             ToneLetter}) (7: U+1970..1974, U+AAC0,
3449                             U+AAC2)
3450   \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
3451                             (42: U+0E48..0E4B, U+0EC8..0ECB, U+1037,
3452                             U+1063..1064, U+1069..106D, U+1087..108D
3453                             ...)
3454   \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (27:
3455                             U+094D, U+09CD, U+0A4D, U+0ACD, U+0B4D,
3456                             U+0BCD ...)
3457   \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga})
3458                             (36: U+0903, U+0983, U+0A03, U+0A83,
3459                             U+0B03, U+0C03 ...)
3460   \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30:
3461                             U+1963..196D, U+A85E..A861, U+A866,
3462                             U+A922..A92A, U+11150..11154)
3463   \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
3464                             VowelDependent}) (697: U+093A..093B,
3465                             U+093E..094C, U+094E..094F,
3466                             U+0955..0957, U+0962..0963, U+09BE..09C4
3467                             ...)
3468   \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
3469                             VowelIndependent}) (500: U+0904..0914,
3470                             U+0960..0961, U+0972..0977,
3471                             U+0985..098C, U+098F..0990, U+0993..0994
3472                             ...)
3473   \p{Inherited}           \p{Script_Extensions=Inherited} (Short:
3474                             \p{Zinh}) (586)
3475   \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
3476                             (Short: \p{Pi}) (12)
3477   \p{InPC: *}             \p{Indic_Positional_Category: *}
3478   \p{InSC: *}             \p{Indic_Syllabic_Category: *}
3479   \p{Inscriptional_Pahlavi} \p{Script_Extensions=
3480                             Inscriptional_Pahlavi} (Short: \p{Phli};
3481                             NOT \p{Block=Inscriptional_Pahlavi}) (27)
3482   \p{Inscriptional_Parthian} \p{Script_Extensions=
3483                             Inscriptional_Parthian} (Short:
3484                             \p{Prti}; NOT \p{Block=
3485                             Inscriptional_Parthian}) (30)
3486 X \p{IPA_Ext}             \p{IPA_Extensions} (= \p{Block=
3487                             IPA_Extensions}) (96)
3488 X \p{IPA_Extensions}      \p{Block=IPA_Extensions} (Short:
3489                             \p{InIPAExt}) (96)
3490   \p{Is_*}                \p{*} (Any exceptions are individually
3491                             noted beginning with the word NOT.) If
3492                             an entry has flag(s) at its beginning,
3493                             like "D", the "Is_" form has the same
3494                             flag(s)
3495   \p{Ital}                \p{Old_Italic} (= \p{Script_Extensions=
3496                             Old_Italic}) (NOT \p{Block=Old_Italic})
3497                             (39)
3498 X \p{Jamo}                \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
3499                             (256)
3500 X \p{Jamo_Ext_A}          \p{Hangul_Jamo_Extended_A} (= \p{Block=
3501                             Hangul_Jamo_Extended_A}) (32)
3502 X \p{Jamo_Ext_B}          \p{Hangul_Jamo_Extended_B} (= \p{Block=
3503                             Hangul_Jamo_Extended_B}) (80)
3504   \p{Java}                \p{Javanese} (= \p{Script_Extensions=
3505                             Javanese}) (NOT \p{Block=Javanese}) (91)
3506   \p{Javanese}            \p{Script_Extensions=Javanese} (Short:
3507                             \p{Java}; NOT \p{Block=Javanese}) (91)
3508   \p{Jg: *}               \p{Joining_Group: *}
3509   \p{Join_C}              \p{Join_Control} (= \p{Join_Control=Y}) (2)
3510   \p{Join_C: *}           \p{Join_Control: *}
3511   \p{Join_Control}        \p{Join_Control=Y} (Short: \p{JoinC}) (2)
3512   \p{Join_Control: N*}    (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
3513                             plus all above-Unicode code points:
3514                             U+0000..200B, U+200E..infinity)
3515   \p{Join_Control: Y*}    (Short: \p{JoinC=Y}, \p{JoinC}) (2:
3516                             U+200C..200D)
3517   \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1:
3518                             U+08BB)
3519   \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1:
3520                             U+08BD)
3521   \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (2:
3522                             U+08BC, U+08C4)
3523   \p{Joining_Group: Ain}  (Short: \p{Jg=Ain}) (9: U+0639..063A,
3524                             U+06A0, U+06FC, U+075D..075F, U+08B3,
3525                             U+08C3)
3526   \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1: U+0710)
3527   \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (29: U+0622..0623,
3528                             U+0625, U+0627, U+0671..0673, U+0675,
3529                             U+0773..0774 ...)
3530   \p{Joining_Group: Beh}  (Short: \p{Jg=Beh}) (27: U+0628,
3531                             U+062A..062B, U+066E, U+0679..0680,
3532                             U+0750..0756, U+08A0..08A1 ...)
3533   \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2: U+0712, U+072D)
3534   \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
3535                             BurushaskiYehBarree}) (2: U+077A..077B)
3536   \p{Joining_Group: Dal}  (Short: \p{Jg=Dal}) (15: U+062F..0630,
3537                             U+0688..0690, U+06EE, U+0759..075A,
3538                             U+08AE)
3539   \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4:
3540                             U+0715..0716, U+072A, U+072F)
3541   \p{Joining_Group: E}    (Short: \p{Jg=E}) (1: U+0725)
3542   \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7:
3543                             U+063D..063F, U+06CC, U+06CE,
3544                             U+0775..0776)
3545   \p{Joining_Group: Fe}   (Short: \p{Jg=Fe}) (1: U+074F)
3546   \p{Joining_Group: Feh}  (Short: \p{Jg=Feh}) (10: U+0641,
3547                             U+06A1..06A6, U+0760..0761, U+08A4)
3548   \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1:
3549                             U+0724)
3550   \p{Joining_Group: Gaf}  (Short: \p{Jg=Gaf}) (17: U+063B..063C,
3551                             U+06A9, U+06AB, U+06AF..06B4,
3552                             U+0762..0764, U+088D ...)
3553   \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3: U+0713..0714,
3554                             U+072E)
3555   \p{Joining_Group: Hah}  (Short: \p{Jg=Hah}) (22: U+062C..062E,
3556                             U+0681..0687, U+06BF, U+0757..0758,
3557                             U+076E..076F, U+0772 ...)
3558   \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
3559                             HamzaOnHehGoal}) (1: U+06C3)
3560   \p{Joining_Group: Hanifi_Rohingya_Kinna_Ya} (Short: \p{Jg=
3561                             HanifiRohingyaKinnaYa}) (4: U+10D19,
3562                             U+10D1E, U+10D20, U+10D23)
3563   \p{Joining_Group: Hanifi_Rohingya_Pa} (Short: \p{Jg=
3564                             HanifiRohingyaPa}) (3: U+10D02, U+10D09,
3565                             U+10D1C)
3566   \p{Joining_Group: He}   (Short: \p{Jg=He}) (1: U+0717)
3567   \p{Joining_Group: Heh}  (Short: \p{Jg=Heh}) (1: U+0647)
3568   \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2:
3569                             U+06C1..06C2)
3570   \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1: U+071A)
3571   \p{Joining_Group: Kaf}  (Short: \p{Jg=Kaf}) (6: U+0643,
3572                             U+06AC..06AE, U+077F, U+08B4)
3573   \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1: U+071F)
3574   \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1: U+074E)
3575   \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2:
3576                             U+06BE, U+06FF)
3577   \p{Joining_Group: Lam}  (Short: \p{Jg=Lam}) (8: U+0644,
3578                             U+06B5..06B8, U+076A, U+08A6, U+08C7)
3579   \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1: U+0720)
3580   \p{Joining_Group: Malayalam_Bha} (Short: \p{Jg=MalayalamBha}) (1:
3581                             U+0866)
3582   \p{Joining_Group: Malayalam_Ja} (Short: \p{Jg=MalayalamJa}) (1:
3583                             U+0861)
3584   \p{Joining_Group: Malayalam_Lla} (Short: \p{Jg=MalayalamLla}) (1:
3585                             U+0868)
3586   \p{Joining_Group: Malayalam_Llla} (Short: \p{Jg=MalayalamLlla})
3587                             (1: U+0869)
3588   \p{Joining_Group: Malayalam_Nga} (Short: \p{Jg=MalayalamNga}) (1:
3589                             U+0860)
3590   \p{Joining_Group: Malayalam_Nna} (Short: \p{Jg=MalayalamNna}) (1:
3591                             U+0864)
3592   \p{Joining_Group: Malayalam_Nnna} (Short: \p{Jg=MalayalamNnna})
3593                             (1: U+0865)
3594   \p{Joining_Group: Malayalam_Nya} (Short: \p{Jg=MalayalamNya}) (1:
3595                             U+0862)
3596   \p{Joining_Group: Malayalam_Ra} (Short: \p{Jg=MalayalamRa}) (1:
3597                             U+0867)
3598   \p{Joining_Group: Malayalam_Ssa} (Short: \p{Jg=MalayalamSsa}) (1:
3599                             U+086A)
3600   \p{Joining_Group: Malayalam_Tta} (Short: \p{Jg=MalayalamTta}) (1:
3601                             U+0863)
3602   \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
3603                             ManichaeanAleph}) (1: U+10AC0)
3604   \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
3605                             (2: U+10AD9..10ADA)
3606   \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
3607                             (2: U+10AC1..10AC2)
3608   \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
3609                             ManichaeanDaleth}) (1: U+10AC5)
3610   \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
3611                             ManichaeanDhamedh}) (1: U+10AD4)
3612   \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
3613                             (1: U+10AEC)
3614   \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
3615                             ManichaeanGimel}) (2: U+10AC3..10AC4)
3616   \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
3617                             (1: U+10ACD)
3618   \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
3619                             ManichaeanHundred}) (1: U+10AEF)
3620   \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
3621                             (3: U+10AD0..10AD2)
3622   \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
3623                             ManichaeanLamedh}) (1: U+10AD3)
3624   \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem})
3625                             (1: U+10AD6)
3626   \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun})
3627                             (1: U+10AD7)
3628   \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne})
3629                             (1: U+10AEB)
3630   \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2:
3631                             U+10ADB..10ADC)
3632   \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
3633                             (3: U+10ADE..10AE0)
3634   \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
3635                             (1: U+10AE1)
3636   \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
3637                             ManichaeanSadhe}) (1: U+10ADD)
3638   \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
3639                             ManichaeanSamekh}) (1: U+10AD8)
3640   \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw})
3641                             (1: U+10AE4)
3642   \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen})
3643                             (1: U+10AED)
3644   \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
3645                             (1: U+10ACE)
3646   \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
3647                             ManichaeanThamedh}) (1: U+10AD5)
3648   \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
3649                             ManichaeanTwenty}) (1: U+10AEE)
3650   \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw})
3651                             (1: U+10AC7)
3652   \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
3653                             (1: U+10ACF)
3654   \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
3655                             ManichaeanZayin}) (2: U+10AC9..10ACA)
3656   \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4: U+0645,
3657                             U+0765..0766, U+08A7)
3658   \p{Joining_Group: Mim}  (Short: \p{Jg=Mim}) (1: U+0721)
3659   \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
3660                             (1_113_762 plus all above-Unicode code
3661                             points: U+0000..061F, U+0621, U+0640,
3662                             U+064B..066D, U+0670, U+0674 ...)
3663   \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (9: U+0646,
3664                             U+06B9..06BC, U+0767..0769, U+0889)
3665   \p{Joining_Group: Nun}  (Short: \p{Jg=Nun}) (1: U+0722)
3666   \p{Joining_Group: Nya}  (Short: \p{Jg=Nya}) (1: U+06BD)
3667   \p{Joining_Group: Pe}   (Short: \p{Jg=Pe}) (1: U+0726)
3668   \p{Joining_Group: Qaf}  (Short: \p{Jg=Qaf}) (6: U+0642, U+066F,
3669                             U+06A7..06A8, U+08A5, U+08B5)
3670   \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1: U+0729)
3671   \p{Joining_Group: Reh}  (Short: \p{Jg=Reh}) (19: U+0631..0632,
3672                             U+0691..0699, U+06EF, U+075B,
3673                             U+076B..076C, U+0771 ...)
3674   \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1:
3675                             U+0727)
3676   \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1:
3677                             U+08AC)
3678   \p{Joining_Group: Sad}  (Short: \p{Jg=Sad}) (6: U+0635..0636,
3679                             U+069D..069E, U+06FB, U+08AF)
3680   \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1: U+0728)
3681   \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11: U+0633..0634,
3682                             U+069A..069C, U+06FA, U+075C, U+076D,
3683                             U+0770 ...)
3684   \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1: U+0723)
3685   \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1: U+072B)
3686   \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1:
3687                             U+08B1)
3688   \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1: U+06AA)
3689   \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1: U+0718)
3690   \p{Joining_Group: Tah}  (Short: \p{Jg=Tah}) (6: U+0637..0638,
3691                             U+069F, U+088B..088C, U+08A3)
3692   \p{Joining_Group: Taw}  (Short: \p{Jg=Taw}) (1: U+072C)
3693   \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3:
3694                             U+0629, U+06C0, U+06D5)
3695   \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
3696                             Hamza_On_Heh_Goal} (1)
3697   \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2: U+071B..071C)
3698   \p{Joining_Group: Thin_Yeh} (Short: \p{Jg=ThinYeh}) (1: U+0886)
3699   \p{Joining_Group: Vertical_Tail} (Short: \p{Jg=VerticalTail}) (1:
3700                             U+088E)
3701   \p{Joining_Group: Waw}  (Short: \p{Jg=Waw}) (16: U+0624, U+0648,
3702                             U+0676..0677, U+06C4..06CB, U+06CF,
3703                             U+0778..0779 ...)
3704   \p{Joining_Group: Yeh}  (Short: \p{Jg=Yeh}) (11: U+0620, U+0626,
3705                             U+0649..064A, U+0678, U+06D0..06D1,
3706                             U+0777 ...)
3707   \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2:
3708                             U+06D2..06D3)
3709   \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1:
3710                             U+06CD)
3711   \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1: U+071D)
3712   \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1: U+071E)
3713   \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1: U+0719)
3714   \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1: U+074D)
3715   \p{Joining_Type: C}     \p{Joining_Type=Join_Causing} (7)
3716   \p{Joining_Type: D}     \p{Joining_Type=Dual_Joining} (610)
3717   \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (610: U+0620,
3718                             U+0626, U+0628, U+062A..062E,
3719                             U+0633..063F, U+0641..0647 ...)
3720   \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (7: U+0640,
3721                             U+07FA, U+0883..0885, U+180A, U+200D)
3722   \p{Joining_Type: L}     \p{Joining_Type=Left_Joining} (5)
3723   \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (5: U+A872,
3724                             U+10ACD, U+10AD7, U+10D00, U+10FCB)
3725   \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_188 plus
3726                             all above-Unicode code points: [\x00-
3727                             \xac\xae-\xff], U+0100..02FF,
3728                             U+0370..0482, U+048A..0590, U+05BE,
3729                             U+05C0 ...)
3730   \p{Joining_Type: R}     \p{Joining_Type=Right_Joining} (152)
3731   \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (152:
3732                             U+0622..0625, U+0627, U+0629,
3733                             U+062F..0632, U+0648, U+0671..0673 ...)
3734   \p{Joining_Type: T}     \p{Joining_Type=Transparent} (2150)
3735   \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (2150: [\xad],
3736                             U+0300..036F, U+0483..0489,
3737                             U+0591..05BD, U+05BF, U+05C1..05C2 ...)
3738   \p{Joining_Type: U}     \p{Joining_Type=Non_Joining} (1_111_188
3739                             plus all above-Unicode code points)
3740   \p{Jt: *}               \p{Joining_Type: *}
3741   \p{Kaithi}              \p{Script_Extensions=Kaithi} (Short:
3742                             \p{Kthi}; NOT \p{Block=Kaithi}) (88)
3743 X \p{Kaktovik_Numerals}   \p{Block=Kaktovik_Numerals} (32)
3744   \p{Kali}                \p{Kayah_Li} (= \p{Script_Extensions=
3745                             Kayah_Li}) (48)
3746   \p{Kana}                \p{Katakana} (= \p{Script_Extensions=
3747                             Katakana}) (NOT \p{Block=Katakana}) (373)
3748 X \p{Kana_Ext_A}          \p{Kana_Extended_A} (= \p{Block=
3749                             Kana_Extended_A}) (48)
3750 X \p{Kana_Ext_B}          \p{Kana_Extended_B} (= \p{Block=
3751                             Kana_Extended_B}) (16)
3752 X \p{Kana_Extended_A}     \p{Block=Kana_Extended_A} (Short:
3753                             \p{InKanaExtA}) (48)
3754 X \p{Kana_Extended_B}     \p{Block=Kana_Extended_B} (Short:
3755                             \p{InKanaExtB}) (16)
3756 X \p{Kana_Sup}            \p{Kana_Supplement} (= \p{Block=
3757                             Kana_Supplement}) (256)
3758 X \p{Kana_Supplement}     \p{Block=Kana_Supplement} (Short:
3759                             \p{InKanaSup}) (256)
3760 X \p{Kanbun}              \p{Block=Kanbun} (16)
3761 X \p{Kangxi}              \p{Kangxi_Radicals} (= \p{Block=
3762                             Kangxi_Radicals}) (224)
3763 X \p{Kangxi_Radicals}     \p{Block=Kangxi_Radicals} (Short:
3764                             \p{InKangxi}) (224)
3765   \p{Kannada}             \p{Script_Extensions=Kannada} (Short:
3766                             \p{Knda}; NOT \p{Block=Kannada}) (106)
3767   \p{Katakana}            \p{Script_Extensions=Katakana} (Short:
3768                             \p{Kana}; NOT \p{Block=Katakana}) (373)
3769 X \p{Katakana_Ext}        \p{Katakana_Phonetic_Extensions} (=
3770                             \p{Block=Katakana_Phonetic_Extensions})
3771                             (16)
3772 X \p{Katakana_Phonetic_Extensions} \p{Block=
3773                             Katakana_Phonetic_Extensions} (Short:
3774                             \p{InKatakanaExt}) (16)
3775   \p{Kawi}                \p{Script_Extensions=Kawi} (NOT \p{Block=
3776                             Kawi}) (86)
3777   \p{Kayah_Li}            \p{Script_Extensions=Kayah_Li} (Short:
3778                             \p{Kali}) (48)
3779   \p{Khar}                \p{Kharoshthi} (= \p{Script_Extensions=
3780                             Kharoshthi}) (NOT \p{Block=Kharoshthi})
3781                             (68)
3782   \p{Kharoshthi}          \p{Script_Extensions=Kharoshthi} (Short:
3783                             \p{Khar}; NOT \p{Block=Kharoshthi}) (68)
3784   \p{Khitan_Small_Script} \p{Script_Extensions=Khitan_Small_Script}
3785                             (Short: \p{Kits}; NOT \p{Block=
3786                             Khitan_Small_Script}) (471)
3787   \p{Khmer}               \p{Script_Extensions=Khmer} (Short:
3788                             \p{Khmr}; NOT \p{Block=Khmer}) (146)
3789 X \p{Khmer_Symbols}       \p{Block=Khmer_Symbols} (32)
3790   \p{Khmr}                \p{Khmer} (= \p{Script_Extensions=Khmer})
3791                             (NOT \p{Block=Khmer}) (146)
3792   \p{Khoj}                \p{Khojki} (= \p{Script_Extensions=
3793                             Khojki}) (NOT \p{Block=Khojki}) (85)
3794   \p{Khojki}              \p{Script_Extensions=Khojki} (Short:
3795                             \p{Khoj}; NOT \p{Block=Khojki}) (85)
3796   \p{Khudawadi}           \p{Script_Extensions=Khudawadi} (Short:
3797                             \p{Sind}; NOT \p{Block=Khudawadi}) (81)
3798   \p{Kits}                \p{Khitan_Small_Script} (=
3799                             \p{Script_Extensions=
3800                             Khitan_Small_Script}) (NOT \p{Block=
3801                             Khitan_Small_Script}) (471)
3802   \p{Knda}                \p{Kannada} (= \p{Script_Extensions=
3803                             Kannada}) (NOT \p{Block=Kannada}) (106)
3804   \p{Kthi}                \p{Kaithi} (= \p{Script_Extensions=
3805                             Kaithi}) (NOT \p{Block=Kaithi}) (88)
3806   \p{L} \pL               \p{Letter} (= \p{General_Category=Letter})
3807                             (136_104)
3808 X \p{L&}                  \p{Cased_Letter} (= \p{General_Category=
3809                             Cased_Letter}) (4095)
3810 X \p{L_}                  \p{Cased_Letter} (= \p{General_Category=
3811                             Cased_Letter}) Note the trailing '_'
3812                             matters in spite of loose matching
3813                             rules. (4095)
3814   \p{Lana}                \p{Tai_Tham} (= \p{Script_Extensions=
3815                             Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
3816   \p{Lao}                 \p{Script_Extensions=Lao} (NOT \p{Block=
3817                             Lao}) (83)
3818   \p{Laoo}                \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
3819                             \p{Block=Lao}) (83)
3820   \p{Latin}               \p{Script_Extensions=Latin} (Short:
3821                             \p{Latn}) (1510)
3822 X \p{Latin_1}             \p{Latin_1_Supplement} (= \p{Block=
3823                             Latin_1_Supplement}) (128)
3824 X \p{Latin_1_Sup}         \p{Latin_1_Supplement} (= \p{Block=
3825                             Latin_1_Supplement}) (128)
3826 X \p{Latin_1_Supplement}  \p{Block=Latin_1_Supplement} (Short:
3827                             \p{InLatin1}) (128)
3828 X \p{Latin_Ext_A}         \p{Latin_Extended_A} (= \p{Block=
3829                             Latin_Extended_A}) (128)
3830 X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
3831                             \p{Block=Latin_Extended_Additional})
3832                             (256)
3833 X \p{Latin_Ext_B}         \p{Latin_Extended_B} (= \p{Block=
3834                             Latin_Extended_B}) (208)
3835 X \p{Latin_Ext_C}         \p{Latin_Extended_C} (= \p{Block=
3836                             Latin_Extended_C}) (32)
3837 X \p{Latin_Ext_D}         \p{Latin_Extended_D} (= \p{Block=
3838                             Latin_Extended_D}) (224)
3839 X \p{Latin_Ext_E}         \p{Latin_Extended_E} (= \p{Block=
3840                             Latin_Extended_E}) (64)
3841 X \p{Latin_Ext_F}         \p{Latin_Extended_F} (= \p{Block=
3842                             Latin_Extended_F}) (64)
3843 X \p{Latin_Ext_G}         \p{Latin_Extended_G} (= \p{Block=
3844                             Latin_Extended_G}) (256)
3845 X \p{Latin_Extended_A}    \p{Block=Latin_Extended_A} (Short:
3846                             \p{InLatinExtA}) (128)
3847 X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
3848                             (Short: \p{InLatinExtAdditional}) (256)
3849 X \p{Latin_Extended_B}    \p{Block=Latin_Extended_B} (Short:
3850                             \p{InLatinExtB}) (208)
3851 X \p{Latin_Extended_C}    \p{Block=Latin_Extended_C} (Short:
3852                             \p{InLatinExtC}) (32)
3853 X \p{Latin_Extended_D}    \p{Block=Latin_Extended_D} (Short:
3854                             \p{InLatinExtD}) (224)
3855 X \p{Latin_Extended_E}    \p{Block=Latin_Extended_E} (Short:
3856                             \p{InLatinExtE}) (64)
3857 X \p{Latin_Extended_F}    \p{Block=Latin_Extended_F} (Short:
3858                             \p{InLatinExtF}) (64)
3859 X \p{Latin_Extended_G}    \p{Block=Latin_Extended_G} (Short:
3860                             \p{InLatinExtG}) (256)
3861   \p{Latn}                \p{Latin} (= \p{Script_Extensions=Latin})
3862                             (1510)
3863   \p{Lb: *}               \p{Line_Break: *}
3864   \p{LC}                  \p{Cased_Letter} (= \p{General_Category=
3865                             Cased_Letter}) (4095)
3866   \p{Lepc}                \p{Lepcha} (= \p{Script_Extensions=
3867                             Lepcha}) (NOT \p{Block=Lepcha}) (74)
3868   \p{Lepcha}              \p{Script_Extensions=Lepcha} (Short:
3869                             \p{Lepc}; NOT \p{Block=Lepcha}) (74)
3870   \p{Letter}              \p{General_Category=Letter} (Short: \p{L})
3871                             (136_104)
3872   \p{Letter_Number}       \p{General_Category=Letter_Number} (Short:
3873                             \p{Nl}) (236)
3874 X \p{Letterlike_Symbols}  \p{Block=Letterlike_Symbols} (80)
3875   \p{Limb}                \p{Limbu} (= \p{Script_Extensions=Limbu})
3876                             (NOT \p{Block=Limbu}) (69)
3877   \p{Limbu}               \p{Script_Extensions=Limbu} (Short:
3878                             \p{Limb}; NOT \p{Block=Limbu}) (69)
3879   \p{Lina}                \p{Linear_A} (= \p{Script_Extensions=
3880                             Linear_A}) (NOT \p{Block=Linear_A}) (386)
3881   \p{Linb}                \p{Linear_B} (= \p{Script_Extensions=
3882                             Linear_B}) (268)
3883   \p{Line_Break: AI}      \p{Line_Break=Ambiguous} (707)
3884   \p{Line_Break: AL}      \p{Line_Break=Alphabetic} (22_215)
3885   \p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (22_215: [#&*<=>\@A-
3886                             Z\^_`a-z~\xa6\xa9\xac\xae-\xaf\xb5\xc0-
3887                             \xd6\xd8-\xf6\xf8-\xff], U+0100..02C6,
3888                             U+02CE..02CF, U+02D1..02D7, U+02DC,
3889                             U+02DE ...)
3890   \p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707: [\xa7-\xa8\xaa
3891                             \xb2-\xb3\xb6-\xba\xbc-\xbe\xd7\xf7],
3892                             U+02C7, U+02C9..02CB, U+02CD, U+02D0,
3893                             U+02D8..02DB ...)
3894   \p{Line_Break: B2}      \p{Line_Break=Break_Both} (3)
3895   \p{Line_Break: BA}      \p{Line_Break=Break_After} (249)
3896   \p{Line_Break: BB}      \p{Line_Break=Break_Before} (55)
3897   \p{Line_Break: BK}      \p{Line_Break=Mandatory_Break} (4)
3898   \p{Line_Break: Break_After} (Short: \p{Lb=BA}) (249: [\t\|\xad],
3899                             U+058A, U+05BE, U+0964..0965,
3900                             U+0E5A..0E5B, U+0F0B ...)
3901   \p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (55: [\xb4],
3902                             U+02C8, U+02CC, U+02DF, U+0C77, U+0C84
3903                             ...)
3904   \p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3: U+2014,
3905                             U+2E3A..2E3B)
3906   \p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1: [\/])
3907   \p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1: [\r])
3908   \p{Line_Break: CB}      \p{Line_Break=Contingent_Break} (1)
3909   \p{Line_Break: CJ}      \p{Line_Break=
3910                             Conditional_Japanese_Starter} (60)
3911   \p{Line_Break: CL}      \p{Line_Break=Close_Punctuation} (97)
3912   \p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2: [\)\]])
3913   \p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (97: [\}],
3914                             U+0F3B, U+0F3D, U+169C, U+2046, U+207E
3915                             ...)
3916   \p{Line_Break: CM}      \p{Line_Break=Combining_Mark} (2438)
3917   \p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2438: [^\t\n
3918                             \cK\f\r\x20-\x7e\x85\xa0-\xff],
3919                             U+0300..034E, U+0350..035B,
3920                             U+0363..036F, U+0483..0489, U+0591..05BD
3921                             ...)
3922   \p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (758:
3923                             U+0E01..0E3A, U+0E40..0E4E,
3924                             U+0E81..0E82, U+0E84, U+0E86..0E8A,
3925                             U+0E8C..0EA3 ...)
3926   \p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ})
3927                             (60: U+3041, U+3043, U+3045, U+3047,
3928                             U+3049, U+3063 ...)
3929   \p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1: U+FFFC)
3930   \p{Line_Break: CP}      \p{Line_Break=Close_Parenthesis} (2)
3931   \p{Line_Break: CR}      \p{Line_Break=Carriage_Return} (1)
3932   \p{Line_Break: E_Base}  (Short: \p{Lb=EB}) (134: U+261D, U+26F9,
3933                             U+270A..270D, U+1F385, U+1F3C2..1F3C4,
3934                             U+1F3C7 ...)
3935   \p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5: U+1F3FB..1F3FF)
3936   \p{Line_Break: EB}      \p{Line_Break=E_Base} (134)
3937   \p{Line_Break: EM}      \p{Line_Break=E_Modifier} (5)
3938   \p{Line_Break: EX}      \p{Line_Break=Exclamation} (40)
3939   \p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (40: [!?], U+05C6,
3940                             U+061B, U+061D..061F, U+06D4, U+07F9 ...)
3941   \p{Line_Break: GL}      \p{Line_Break=Glue} (31)
3942   \p{Line_Break: Glue}    (Short: \p{Lb=GL}) (31: [\xa0], U+034F,
3943                             U+035C..0362, U+0F08, U+0F0C, U+0F12 ...)
3944   \p{Line_Break: H2}      (Short: \p{Lb=H2}) (399: U+AC00, U+AC1C,
3945                             U+AC38, U+AC54, U+AC70, U+AC8C ...)
3946   \p{Line_Break: H3}      (Short: \p{Lb=H3}) (10_773: U+AC01..AC1B,
3947                             U+AC1D..AC37, U+AC39..AC53,
3948                             U+AC55..AC6F, U+AC71..AC8B, U+AC8D..ACA7
3949                             ...)
3950   \p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (75:
3951                             U+05D0..05EA, U+05EF..05F2, U+FB1D,
3952                             U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
3953                             ...)
3954   \p{Line_Break: HL}      \p{Line_Break=Hebrew_Letter} (75)
3955   \p{Line_Break: HY}      \p{Line_Break=Hyphen} (1)
3956   \p{Line_Break: Hyphen}  (Short: \p{Lb=HY}) (1: [\-])
3957   \p{Line_Break: ID}      \p{Line_Break=Ideographic} (172_465)
3958   \p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_465:
3959                             U+231A..231B, U+23F0..23F3,
3960                             U+2600..2603, U+2614..2615, U+2618,
3961                             U+261A..261C ...)
3962   \p{Line_Break: IN}      \p{Line_Break=Inseparable} (6)
3963   \p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13: [,.:;],
3964                             U+037E, U+0589, U+060C..060D, U+07F8,
3965                             U+2044 ...)
3966   \p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6: U+2024..2026,
3967                             U+22EF, U+FE19, U+10AF6)
3968   \p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
3969   \p{Line_Break: IS}      \p{Line_Break=Infix_Numeric} (13)
3970   \p{Line_Break: JL}      (Short: \p{Lb=JL}) (125: U+1100..115F,
3971                             U+A960..A97C)
3972   \p{Line_Break: JT}      (Short: \p{Lb=JT}) (137: U+11A8..11FF,
3973                             U+D7CB..D7FB)
3974   \p{Line_Break: JV}      (Short: \p{Lb=JV}) (95: U+1160..11A7,
3975                             U+D7B0..D7C6)
3976   \p{Line_Break: LF}      \p{Line_Break=Line_Feed} (1)
3977   \p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1: [\n])
3978   \p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4: [\cK\f],
3979                             U+2028..2029)
3980   \p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1: [\x85])
3981   \p{Line_Break: NL}      \p{Line_Break=Next_Line} (1)
3982   \p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (33: U+17D6,
3983                             U+203C..203D, U+2047..2049, U+3005,
3984                             U+301C, U+303B..303C ...)
3985   \p{Line_Break: NS}      \p{Line_Break=Nonstarter} (33)
3986   \p{Line_Break: NU}      \p{Line_Break=Numeric} (672)
3987   \p{Line_Break: Numeric} (Short: \p{Lb=NU}) (672: [0-9],
3988                             U+0660..0669, U+066B..066C,
3989                             U+06F0..06F9, U+07C0..07C9, U+0966..096F
3990                             ...)
3991   \p{Line_Break: OP}      \p{Line_Break=Open_Punctuation} (94)
3992   \p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (94: [\(\[\{
3993                             \xa1\xbf], U+0F3A, U+0F3C, U+169B,
3994                             U+201A, U+201E ...)
3995   \p{Line_Break: PO}      \p{Line_Break=Postfix_Numeric} (38)
3996   \p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (38: [\%\xa2
3997                             \xb0], U+0609..060B, U+066A,
3998                             U+09F2..09F3, U+09F9, U+0D79 ...)
3999   \p{Line_Break: PR}      \p{Line_Break=Prefix_Numeric} (67)
4000   \p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (67: [\$+\\\xa3-
4001                             \xa5\xb1], U+058F, U+07FE..07FF, U+09FB,
4002                             U+0AF1, U+0BF9 ...)
4003   \p{Line_Break: QU}      \p{Line_Break=Quotation} (39)
4004   \p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39: [\"\'\xab\xbb],
4005                             U+2018..2019, U+201B..201D, U+201F,
4006                             U+2039..203A, U+275B..2760 ...)
4007   \p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26:
4008                             U+1F1E6..1F1FF)
4009   \p{Line_Break: RI}      \p{Line_Break=Regional_Indicator} (26)
4010   \p{Line_Break: SA}      \p{Line_Break=Complex_Context} (758)
4011 D \p{Line_Break: SG}      \p{Line_Break=Surrogate} (2048)
4012   \p{Line_Break: SP}      \p{Line_Break=Space} (1)
4013   \p{Line_Break: Space}   (Short: \p{Lb=SP}) (1: [\x20])
4014 D \p{Line_Break: Surrogate} Surrogates should never appear in well-
4015                             formed text, and therefore shouldn't be
4016                             the basis for line breaking (Short:
4017                             \p{Lb=SG}) (2048: U+D800..DFFF)
4018   \p{Line_Break: SY}      \p{Line_Break=Break_Symbols} (1)
4019   \p{Line_Break: Unknown} (Short: \p{Lb=XX}) (900_198 plus all
4020                             above-Unicode code points: U+0378..0379,
4021                             U+0380..0383, U+038B, U+038D, U+03A2,
4022                             U+0530 ...)
4023   \p{Line_Break: WJ}      \p{Line_Break=Word_Joiner} (2)
4024   \p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2: U+2060, U+FEFF)
4025   \p{Line_Break: XX}      \p{Line_Break=Unknown} (900_198 plus all
4026                             above-Unicode code points)
4027   \p{Line_Break: ZW}      \p{Line_Break=ZWSpace} (1)
4028   \p{Line_Break: ZWJ}     (Short: \p{Lb=ZWJ}) (1: U+200D)
4029   \p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1: U+200B)
4030   \p{Line_Separator}      \p{General_Category=Line_Separator}
4031                             (Short: \p{Zl}) (1)
4032   \p{Linear_A}            \p{Script_Extensions=Linear_A} (Short:
4033                             \p{Lina}; NOT \p{Block=Linear_A}) (386)
4034   \p{Linear_B}            \p{Script_Extensions=Linear_B} (Short:
4035                             \p{Linb}) (268)
4036 X \p{Linear_B_Ideograms}  \p{Block=Linear_B_Ideograms} (128)
4037 X \p{Linear_B_Syllabary}  \p{Block=Linear_B_Syllabary} (128)
4038   \p{Lisu}                \p{Script_Extensions=Lisu} (NOT \p{Block=
4039                             Lisu}) (49)
4040 X \p{Lisu_Sup}            \p{Lisu_Supplement} (= \p{Block=
4041                             Lisu_Supplement}) (16)
4042 X \p{Lisu_Supplement}     \p{Block=Lisu_Supplement} (Short:
4043                             \p{InLisuSup}) (16)
4044   \p{Ll}                  \p{Lowercase_Letter} (=
4045                             \p{General_Category=Lowercase_Letter})
4046                             (/i= General_Category=Cased_Letter)
4047                             (2233)
4048   \p{Lm}                  \p{Modifier_Letter} (=
4049                             \p{General_Category=Modifier_Letter})
4050                             (397)
4051   \p{Lo}                  \p{Other_Letter} (= \p{General_Category=
4052                             Other_Letter}) (131_612)
4053   \p{LOE}                 \p{Logical_Order_Exception} (=
4054                             \p{Logical_Order_Exception=Y}) (19)
4055   \p{LOE: *}              \p{Logical_Order_Exception: *}
4056   \p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short:
4057                             \p{LOE}) (19)
4058   \p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE})
4059                             (1_114_093 plus all above-Unicode code
4060                             points: U+0000..0E3F, U+0E45..0EBF,
4061                             U+0EC5..19B4, U+19B8..19B9,
4062                             U+19BB..AAB4, U+AAB7..AAB8 ...)
4063   \p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19:
4064                             U+0E40..0E44, U+0EC0..0EC4,
4065                             U+19B5..19B7, U+19BA, U+AAB5..AAB6,
4066                             U+AAB9 ...)
4067 X \p{Low_Surrogates}      \p{Block=Low_Surrogates} (1024)
4068   \p{Lower}               \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
4069                             Cased=Yes) (2544)
4070   \p{Lower: *}            \p{Lowercase: *}
4071   \p{Lowercase}           \p{XPosixLower} (= \p{Lowercase=Y}) (/i=
4072                             Cased=Yes) (2544)
4073   \p{Lowercase: N*}       (Short: \p{Lower=N}, \P{Lower}; /i= Cased=
4074                             No) (1_111_568 plus all above-Unicode
4075                             code points: [\x00-\x20!\"#\$\%&\'
4076                             \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`\{
4077                             \|\}~\x7f-\xa9\xab-\xb4\xb6-\xb9\xbb-
4078                             \xde\xf7], U+0100, U+0102, U+0104,
4079                             U+0106, U+0108 ...)
4080   \p{Lowercase: Y*}       (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=
4081                             Yes) (2544: [a-z\xaa\xb5\xba\xdf-\xf6
4082                             \xf8-\xff], U+0101, U+0103, U+0105,
4083                             U+0107, U+0109 ...)
4084   \p{Lowercase_Letter}    \p{General_Category=Lowercase_Letter}
4085                             (Short: \p{Ll}; /i= General_Category=
4086                             Cased_Letter) (2233)
4087   \p{Lt}                  \p{Titlecase_Letter} (=
4088                             \p{General_Category=Titlecase_Letter})
4089                             (/i= General_Category=Cased_Letter) (31)
4090   \p{Lu}                  \p{Uppercase_Letter} (=
4091                             \p{General_Category=Uppercase_Letter})
4092                             (/i= General_Category=Cased_Letter)
4093                             (1831)
4094   \p{Lyci}                \p{Lycian} (= \p{Script_Extensions=
4095                             Lycian}) (NOT \p{Block=Lycian}) (29)
4096   \p{Lycian}              \p{Script_Extensions=Lycian} (Short:
4097                             \p{Lyci}; NOT \p{Block=Lycian}) (29)
4098   \p{Lydi}                \p{Lydian} (= \p{Script_Extensions=
4099                             Lydian}) (NOT \p{Block=Lydian}) (27)
4100   \p{Lydian}              \p{Script_Extensions=Lydian} (Short:
4101                             \p{Lydi}; NOT \p{Block=Lydian}) (27)
4102   \p{M} \pM               \p{Mark} (= \p{General_Category=Mark})
4103                             (2450)
4104   \p{Mahajani}            \p{Script_Extensions=Mahajani} (Short:
4105                             \p{Mahj}; NOT \p{Block=Mahajani}) (61)
4106   \p{Mahj}                \p{Mahajani} (= \p{Script_Extensions=
4107                             Mahajani}) (NOT \p{Block=Mahajani}) (61)
4108 X \p{Mahjong}             \p{Mahjong_Tiles} (= \p{Block=
4109                             Mahjong_Tiles}) (48)
4110 X \p{Mahjong_Tiles}       \p{Block=Mahjong_Tiles} (Short:
4111                             \p{InMahjong}) (48)
4112   \p{Maka}                \p{Makasar} (= \p{Script_Extensions=
4113                             Makasar}) (NOT \p{Block=Makasar}) (25)
4114   \p{Makasar}             \p{Script_Extensions=Makasar} (Short:
4115                             \p{Maka}; NOT \p{Block=Makasar}) (25)
4116   \p{Malayalam}           \p{Script_Extensions=Malayalam} (Short:
4117                             \p{Mlym}; NOT \p{Block=Malayalam}) (126)
4118   \p{Mand}                \p{Mandaic} (= \p{Script_Extensions=
4119                             Mandaic}) (NOT \p{Block=Mandaic}) (30)
4120   \p{Mandaic}             \p{Script_Extensions=Mandaic} (Short:
4121                             \p{Mand}; NOT \p{Block=Mandaic}) (30)
4122   \p{Mani}                \p{Manichaean} (= \p{Script_Extensions=
4123                             Manichaean}) (NOT \p{Block=Manichaean})
4124                             (52)
4125   \p{Manichaean}          \p{Script_Extensions=Manichaean} (Short:
4126                             \p{Mani}; NOT \p{Block=Manichaean}) (52)
4127   \p{Marc}                \p{Marchen} (= \p{Script_Extensions=
4128                             Marchen}) (NOT \p{Block=Marchen}) (68)
4129   \p{Marchen}             \p{Script_Extensions=Marchen} (Short:
4130                             \p{Marc}; NOT \p{Block=Marchen}) (68)
4131   \p{Mark}                \p{General_Category=Mark} (Short: \p{M})
4132                             (2450)
4133   \p{Masaram_Gondi}       \p{Script_Extensions=Masaram_Gondi}
4134                             (Short: \p{Gonm}; NOT \p{Block=
4135                             Masaram_Gondi}) (77)
4136   \p{Math}                \p{Math=Y} (2310)
4137   \p{Math: N*}            (Single: \P{Math}) (1_111_802 plus all
4138                             above-Unicode code points: [\x00-\x20!
4139                             \"#\$\%&\'\(\)*,\-.\/0-9:;?\@A-Z
4140                             \[\\\]_`a-z\{\}\x7f-\xab\xad-\xb0\xb2-
4141                             \xd6\xd8-\xf6\xf8-\xff], U+0100..03CF,
4142                             U+03D3..03D4, U+03D6..03EF,
4143                             U+03F2..03F3, U+03F7..0605 ...)
4144   \p{Math: Y*}            (Single: \p{Math}) (2310: [+<=>\^\|~\xac
4145                             \xb1\xd7\xf7], U+03D0..03D2, U+03D5,
4146                             U+03F0..03F1, U+03F4..03F6, U+0606..0608
4147                             ...)
4148 X \p{Math_Alphanum}       \p{Mathematical_Alphanumeric_Symbols} (=
4149                             \p{Block=
4150                             Mathematical_Alphanumeric_Symbols})
4151                             (1024)
4152 X \p{Math_Operators}      \p{Mathematical_Operators} (= \p{Block=
4153                             Mathematical_Operators}) (256)
4154   \p{Math_Symbol}         \p{General_Category=Math_Symbol} (Short:
4155                             \p{Sm}) (948)
4156 X \p{Mathematical_Alphanumeric_Symbols} \p{Block=
4157                             Mathematical_Alphanumeric_Symbols}
4158                             (Short: \p{InMathAlphanum}) (1024)
4159 X \p{Mathematical_Operators} \p{Block=Mathematical_Operators}
4160                             (Short: \p{InMathOperators}) (256)
4161 X \p{Mayan_Numerals}      \p{Block=Mayan_Numerals} (32)
4162   \p{Mc}                  \p{Spacing_Mark} (= \p{General_Category=
4163                             Spacing_Mark}) (452)
4164   \p{Me}                  \p{Enclosing_Mark} (= \p{General_Category=
4165                             Enclosing_Mark}) (13)
4166   \p{Medefaidrin}         \p{Script_Extensions=Medefaidrin} (Short:
4167                             \p{Medf}; NOT \p{Block=Medefaidrin}) (91)
4168   \p{Medf}                \p{Medefaidrin} (= \p{Script_Extensions=
4169                             Medefaidrin}) (NOT \p{Block=
4170                             Medefaidrin}) (91)
4171   \p{Meetei_Mayek}        \p{Script_Extensions=Meetei_Mayek} (Short:
4172                             \p{Mtei}; NOT \p{Block=Meetei_Mayek})
4173                             (79)
4174 X \p{Meetei_Mayek_Ext}    \p{Meetei_Mayek_Extensions} (= \p{Block=
4175                             Meetei_Mayek_Extensions}) (32)
4176 X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions}
4177                             (Short: \p{InMeeteiMayekExt}) (32)
4178   \p{Mend}                \p{Mende_Kikakui} (= \p{Script_Extensions=
4179                             Mende_Kikakui}) (NOT \p{Block=
4180                             Mende_Kikakui}) (213)
4181   \p{Mende_Kikakui}       \p{Script_Extensions=Mende_Kikakui}
4182                             (Short: \p{Mend}; NOT \p{Block=
4183                             Mende_Kikakui}) (213)
4184   \p{Merc}                \p{Meroitic_Cursive} (=
4185                             \p{Script_Extensions=Meroitic_Cursive})
4186                             (NOT \p{Block=Meroitic_Cursive}) (90)
4187   \p{Mero}                \p{Meroitic_Hieroglyphs} (=
4188                             \p{Script_Extensions=
4189                             Meroitic_Hieroglyphs}) (32)
4190   \p{Meroitic_Cursive}    \p{Script_Extensions=Meroitic_Cursive}
4191                             (Short: \p{Merc}; NOT \p{Block=
4192                             Meroitic_Cursive}) (90)
4193   \p{Meroitic_Hieroglyphs} \p{Script_Extensions=
4194                             Meroitic_Hieroglyphs} (Short: \p{Mero})
4195                             (32)
4196   \p{Miao}                \p{Script_Extensions=Miao} (NOT \p{Block=
4197                             Miao}) (149)
4198 X \p{Misc_Arrows}         \p{Miscellaneous_Symbols_And_Arrows} (=
4199                             \p{Block=
4200                             Miscellaneous_Symbols_And_Arrows}) (256)
4201 X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A}
4202                             (= \p{Block=
4203                             Miscellaneous_Mathematical_Symbols_A})
4204                             (48)
4205 X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B}
4206                             (= \p{Block=
4207                             Miscellaneous_Mathematical_Symbols_B})
4208                             (128)
4209 X \p{Misc_Pictographs}    \p{Miscellaneous_Symbols_And_Pictographs}
4210                             (= \p{Block=
4211                             Miscellaneous_Symbols_And_Pictographs})
4212                             (768)
4213 X \p{Misc_Symbols}        \p{Miscellaneous_Symbols} (= \p{Block=
4214                             Miscellaneous_Symbols}) (256)
4215 X \p{Misc_Technical}      \p{Miscellaneous_Technical} (= \p{Block=
4216                             Miscellaneous_Technical}) (256)
4217 X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=
4218                             Miscellaneous_Mathematical_Symbols_A}
4219                             (Short: \p{InMiscMathSymbolsA}) (48)
4220 X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=
4221                             Miscellaneous_Mathematical_Symbols_B}
4222                             (Short: \p{InMiscMathSymbolsB}) (128)
4223 X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short:
4224                             \p{InMiscSymbols}) (256)
4225 X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=
4226                             Miscellaneous_Symbols_And_Arrows}
4227                             (Short: \p{InMiscArrows}) (256)
4228 X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=
4229                             Miscellaneous_Symbols_And_Pictographs}
4230                             (Short: \p{InMiscPictographs}) (768)
4231 X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical}
4232                             (Short: \p{InMiscTechnical}) (256)
4233   \p{Mlym}                \p{Malayalam} (= \p{Script_Extensions=
4234                             Malayalam}) (NOT \p{Block=Malayalam})
4235                             (126)
4236   \p{Mn}                  \p{Nonspacing_Mark} (=
4237                             \p{General_Category=Nonspacing_Mark})
4238                             (1985)
4239   \p{Modi}                \p{Script_Extensions=Modi} (NOT \p{Block=
4240                             Modi}) (89)
4241   \p{Modifier_Letter}     \p{General_Category=Modifier_Letter}
4242                             (Short: \p{Lm}) (397)
4243 X \p{Modifier_Letters}    \p{Spacing_Modifier_Letters} (= \p{Block=
4244                             Spacing_Modifier_Letters}) (80)
4245   \p{Modifier_Symbol}     \p{General_Category=Modifier_Symbol}
4246                             (Short: \p{Sk}) (125)
4247 X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
4248   \p{Mong}                \p{Mongolian} (= \p{Script_Extensions=
4249                             Mongolian}) (NOT \p{Block=Mongolian})
4250                             (172)
4251   \p{Mongolian}           \p{Script_Extensions=Mongolian} (Short:
4252                             \p{Mong}; NOT \p{Block=Mongolian}) (172)
4253 X \p{Mongolian_Sup}       \p{Mongolian_Supplement} (= \p{Block=
4254                             Mongolian_Supplement}) (32)
4255 X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short:
4256                             \p{InMongolianSup}) (32)
4257   \p{Mro}                 \p{Script_Extensions=Mro} (NOT \p{Block=
4258                             Mro}) (43)
4259   \p{Mroo}                \p{Mro} (= \p{Script_Extensions=Mro}) (NOT
4260                             \p{Block=Mro}) (43)
4261   \p{Mtei}                \p{Meetei_Mayek} (= \p{Script_Extensions=
4262                             Meetei_Mayek}) (NOT \p{Block=
4263                             Meetei_Mayek}) (79)
4264   \p{Mult}                \p{Multani} (= \p{Script_Extensions=
4265                             Multani}) (NOT \p{Block=Multani}) (48)
4266   \p{Multani}             \p{Script_Extensions=Multani} (Short:
4267                             \p{Mult}; NOT \p{Block=Multani}) (48)
4268 X \p{Music}               \p{Musical_Symbols} (= \p{Block=
4269                             Musical_Symbols}) (256)
4270 X \p{Musical_Symbols}     \p{Block=Musical_Symbols} (Short:
4271                             \p{InMusic}) (256)
4272   \p{Myanmar}             \p{Script_Extensions=Myanmar} (Short:
4273                             \p{Mymr}; NOT \p{Block=Myanmar}) (224)
4274 X \p{Myanmar_Ext_A}       \p{Myanmar_Extended_A} (= \p{Block=
4275                             Myanmar_Extended_A}) (32)
4276 X \p{Myanmar_Ext_B}       \p{Myanmar_Extended_B} (= \p{Block=
4277                             Myanmar_Extended_B}) (32)
4278 X \p{Myanmar_Extended_A}  \p{Block=Myanmar_Extended_A} (Short:
4279                             \p{InMyanmarExtA}) (32)
4280 X \p{Myanmar_Extended_B}  \p{Block=Myanmar_Extended_B} (Short:
4281                             \p{InMyanmarExtB}) (32)
4282   \p{Mymr}                \p{Myanmar} (= \p{Script_Extensions=
4283                             Myanmar}) (NOT \p{Block=Myanmar}) (224)
4284   \p{N} \pN               \p{Number} (= \p{General_Category=Number})
4285                             (1831)
4286   \p{Na=*}                \p{Name=*}
4287   \p{Nabataean}           \p{Script_Extensions=Nabataean} (Short:
4288                             \p{Nbat}; NOT \p{Block=Nabataean}) (40)
4289   \p{Nag_Mundari}         \p{Script_Extensions=Nag_Mundari} (Short:
4290                             \p{Nagm}; NOT \p{Block=Nag_Mundari}) (42)
4291   \p{Nagm}                \p{Nag_Mundari} (= \p{Script_Extensions=
4292                             Nag_Mundari}) (NOT \p{Block=
4293                             Nag_Mundari}) (42)
4294   \p{Name=*}              Combination of Name and Name_Alias
4295                             properties; has special loose matching
4296                             rules, for which see Unicode UAX #44
4297   \p{Nand}                \p{Nandinagari} (= \p{Script_Extensions=
4298                             Nandinagari}) (NOT \p{Block=
4299                             Nandinagari}) (86)
4300   \p{Nandinagari}         \p{Script_Extensions=Nandinagari} (Short:
4301                             \p{Nand}; NOT \p{Block=Nandinagari}) (86)
4302   \p{Narb}                \p{Old_North_Arabian} (=
4303                             \p{Script_Extensions=Old_North_Arabian})
4304                             (32)
4305 X \p{NB}                  \p{No_Block} (= \p{Block=No_Block})
4306                             (820_944 plus all above-Unicode code
4307                             points)
4308   \p{Nbat}                \p{Nabataean} (= \p{Script_Extensions=
4309                             Nabataean}) (NOT \p{Block=Nabataean})
4310                             (40)
4311   \p{NChar}               \p{Noncharacter_Code_Point} (=
4312                             \p{Noncharacter_Code_Point=Y}) (66)
4313   \p{NChar: *}            \p{Noncharacter_Code_Point: *}
4314   \p{Nd}                  \p{XPosixDigit} (= \p{General_Category=
4315                             Decimal_Number}) (680)
4316   \p{New_Tai_Lue}         \p{Script_Extensions=New_Tai_Lue} (Short:
4317                             \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
4318   \p{Newa}                \p{Script_Extensions=Newa} (NOT \p{Block=
4319                             Newa}) (97)
4320   \p{NFC_QC: *}           \p{NFC_Quick_Check: *}
4321   \p{NFC_Quick_Check: M}  \p{NFC_Quick_Check=Maybe} (111)
4322   \p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (111:
4323                             U+0300..0304, U+0306..030C, U+030F,
4324                             U+0311, U+0313..0314, U+031B ...)
4325   \p{NFC_Quick_Check: N}  \p{NFC_Quick_Check=No} (NOT
4326                             \P{NFC_Quick_Check} NOR \P{NFC_QC})
4327                             (1120)
4328   \p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT
4329                             \P{NFC_Quick_Check} NOR \P{NFC_QC})
4330                             (1120: U+0340..0341, U+0343..0344,
4331                             U+0374, U+037E, U+0387, U+0958..095F ...)
4332   \p{NFC_Quick_Check: Y}  \p{NFC_Quick_Check=Yes} (NOT
4333                             \p{NFC_Quick_Check} NOR \p{NFC_QC})
4334                             (1_112_881 plus all above-Unicode code
4335                             points)
4336   \p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT
4337                             \p{NFC_Quick_Check} NOR \p{NFC_QC})
4338                             (1_112_881 plus all above-Unicode code
4339                             points: U+0000..02FF, U+0305,
4340                             U+030D..030E, U+0310, U+0312,
4341                             U+0315..031A ...)
4342   \p{NFD_QC: *}           \p{NFD_Quick_Check: *}
4343   \p{NFD_Quick_Check: N}  \p{NFD_Quick_Check=No} (NOT
4344                             \P{NFD_Quick_Check} NOR \P{NFD_QC})
4345                             (13_233)
4346   \p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT
4347                             \P{NFD_Quick_Check} NOR \P{NFD_QC})
4348                             (13_233: [\xc0-\xc5\xc7-\xcf\xd1-\xd6
4349                             \xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-\xf6
4350                             \xf9-\xfd\xff], U+0100..010F,
4351                             U+0112..0125, U+0128..0130,
4352                             U+0134..0137, U+0139..013E ...)
4353   \p{NFD_Quick_Check: Y}  \p{NFD_Quick_Check=Yes} (NOT
4354                             \p{NFD_Quick_Check} NOR \p{NFD_QC})
4355                             (1_100_879 plus all above-Unicode code
4356                             points)
4357   \p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT
4358                             \p{NFD_Quick_Check} NOR \p{NFD_QC})
4359                             (1_100_879 plus all above-Unicode code
4360                             points: [\x00-\xbf\xc6\xd0\xd7-\xd8\xde-
4361                             \xdf\xe6\xf0\xf7-\xf8\xfe],
4362                             U+0110..0111, U+0126..0127,
4363                             U+0131..0133, U+0138, U+013F..0142 ...)
4364   \p{NFKC_QC: *}          \p{NFKC_Quick_Check: *}
4365   \p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (111)
4366   \p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (111:
4367                             U+0300..0304, U+0306..030C, U+030F,
4368                             U+0311, U+0313..0314, U+031B ...)
4369   \p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT
4370                             \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4371                             (4928)
4372   \p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT
4373                             \P{NFKC_Quick_Check} NOR \P{NFKC_QC})
4374                             (4928: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4375                             \xba\xbc-\xbe], U+0132..0133,
4376                             U+013F..0140, U+0149, U+017F,
4377                             U+01C4..01CC ...)
4378   \p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT
4379                             \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4380                             (1_109_073 plus all above-Unicode code
4381                             points)
4382   \p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT
4383                             \p{NFKC_Quick_Check} NOR \p{NFKC_QC})
4384                             (1_109_073 plus all above-Unicode code
4385                             points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4386                             \xae\xb0-\xb1\xb6-\xb7\xbb\xbf-\xff],
4387                             U+0100..0131, U+0134..013E,
4388                             U+0141..0148, U+014A..017E, U+0180..01C3
4389                             ...)
4390   \p{NFKD_QC: *}          \p{NFKD_Quick_Check: *}
4391   \p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT
4392                             \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4393                             (17_029)
4394   \p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT
4395                             \P{NFKD_Quick_Check} NOR \P{NFKD_QC})
4396                             (17_029: [\xa0\xa8\xaa\xaf\xb2-\xb5\xb8-
4397                             \xba\xbc-\xbe\xc0-\xc5\xc7-\xcf\xd1-
4398                             \xd6\xd9-\xdd\xe0-\xe5\xe7-\xef\xf1-
4399                             \xf6\xf9-\xfd\xff], U+0100..010F,
4400                             U+0112..0125, U+0128..0130,
4401                             U+0132..0137, U+0139..0140 ...)
4402   \p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT
4403                             \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4404                             (1_097_083 plus all above-Unicode code
4405                             points)
4406   \p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT
4407                             \p{NFKD_Quick_Check} NOR \p{NFKD_QC})
4408                             (1_097_083 plus all above-Unicode code
4409                             points: [\x00-\x9f\xa1-\xa7\xa9\xab-
4410                             \xae\xb0-\xb1\xb6-\xb7\xbb\xbf\xc6\xd0
4411                             \xd7-\xd8\xde-\xdf\xe6\xf0\xf7-\xf8
4412                             \xfe], U+0110..0111, U+0126..0127,
4413                             U+0131, U+0138, U+0141..0142 ...)
4414   \p{Nko}                 \p{Script_Extensions=Nko} (NOT \p{Block=
4415                             NKo}) (67)
4416   \p{Nkoo}                \p{Nko} (= \p{Script_Extensions=Nko}) (NOT
4417                             \p{Block=NKo}) (67)
4418   \p{Nl}                  \p{Letter_Number} (= \p{General_Category=
4419                             Letter_Number}) (236)
4420   \p{No}                  \p{Other_Number} (= \p{General_Category=
4421                             Other_Number}) (915)
4422 X \p{No_Block}            \p{Block=No_Block} (Short: \p{InNB})
4423                             (820_944 plus all above-Unicode code
4424                             points)
4425   \p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short:
4426                             \p{NChar}) (66)
4427   \p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar})
4428                             (1_114_046 plus all above-Unicode code
4429                             points: U+0000..FDCF, U+FDF0..FFFD,
4430                             U+10000..1FFFD, U+20000..2FFFD,
4431                             U+30000..3FFFD, U+40000..4FFFD ...)
4432   \p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar})
4433                             (66: U+FDD0..FDEF, U+FFFE..FFFF,
4434                             U+1FFFE..1FFFF, U+2FFFE..2FFFF,
4435                             U+3FFFE..3FFFF, U+4FFFE..4FFFF ...)
4436   \p{Nonspacing_Mark}     \p{General_Category=Nonspacing_Mark}
4437                             (Short: \p{Mn}) (1985)
4438   \p{Nshu}                \p{Nushu} (= \p{Script_Extensions=Nushu})
4439                             (NOT \p{Block=Nushu}) (397)
4440   \p{Nt: *}               \p{Numeric_Type: *}
4441   \p{Number}              \p{General_Category=Number} (Short: \p{N})
4442                             (1831)
4443 X \p{Number_Forms}        \p{Block=Number_Forms} (64)
4444   \p{Numeric_Type: De}    \p{Numeric_Type=Decimal} (680)
4445   \p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (680: [0-9],
4446                             U+0660..0669, U+06F0..06F9,
4447                             U+07C0..07C9, U+0966..096F, U+09E6..09EF
4448                             ...)
4449   \p{Numeric_Type: Di}    \p{Numeric_Type=Digit} (128)
4450   \p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128: [\xb2-\xb3\xb9],
4451                             U+1369..1371, U+19DA, U+2070,
4452                             U+2074..2079, U+2080..2089 ...)
4453   \p{Numeric_Type: None}  (Short: \p{Nt=None}) (1_112_200 plus all
4454                             above-Unicode code points: [\x00-\x20!
4455                             \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4456                             \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4457                             \xbb\xbf-\xff], U+0100..065F,
4458                             U+066A..06EF, U+06FA..07BF,
4459                             U+07CA..0965, U+0970..09E5 ...)
4460   \p{Numeric_Type: Nu}    \p{Numeric_Type=Numeric} (1104)
4461   \p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (1104: [\xbc-\xbe],
4462                             U+09F4..09F9, U+0B72..0B77,
4463                             U+0BF0..0BF2, U+0C78..0C7E, U+0D58..0D5E
4464                             ...)
4465 T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1: U+0F33)
4466 T \p{Numeric_Value: 0}    (Short: \p{Nv=0}) (87: [0], U+0660,
4467                             U+06F0, U+07C0, U+0966, U+09E6 ...)
4468 T \p{Numeric_Value: 1/320} (Short: \p{Nv=1/320}) (2: U+11FC0,
4469                             U+11FD4)
4470 T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (2: U+0D58, U+11FC1)
4471 T \p{Numeric_Value: 1/80} (Short: \p{Nv=1/80}) (1: U+11FC2)
4472 T \p{Numeric_Value: 1/64} (Short: \p{Nv=1/64}) (1: U+11FC3)
4473 T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (2: U+0D59, U+11FC4)
4474 T \p{Numeric_Value: 1/32} (Short: \p{Nv=1/32}) (1: U+11FC5)
4475 T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (2: U+0D5A, U+11FC6)
4476 T \p{Numeric_Value: 3/64} (Short: \p{Nv=3/64}) (1: U+11FC7)
4477 T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (2: U+0D5B, U+11FC8)
4478 T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (6: U+09F4, U+0B75,
4479                             U+0D76, U+A833, U+11FC9..11FCA)
4480 T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1: U+109F6)
4481 T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (3: U+0D5C, U+2152,
4482                             U+11FCB)
4483 T \p{Numeric_Value: 1/9}  (Short: \p{Nv=1/9}) (1: U+2151)
4484 T \p{Numeric_Value: 1/8}  (Short: \p{Nv=1/8}) (7: U+09F5, U+0B76,
4485                             U+0D77, U+215B, U+A834, U+11FCC ...)
4486 T \p{Numeric_Value: 1/7}  (Short: \p{Nv=1/7}) (1: U+2150)
4487 T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (2: U+0D5D, U+11FCD)
4488 T \p{Numeric_Value: 1/6}  (Short: \p{Nv=1/6}) (4: U+2159, U+109F7,
4489                             U+12461, U+1ED3D)
4490 T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (5: U+09F6, U+0B77,
4491                             U+0D78, U+A835, U+11FCE)
4492 T \p{Numeric_Value: 1/5}  (Short: \p{Nv=1/5}) (3: U+0D5E, U+2155,
4493                             U+11FCF)
4494 T \p{Numeric_Value: 1/4}  (Short: \p{Nv=1/4}) (14: [\xbc], U+09F7,
4495                             U+0B72, U+0D73, U+A830, U+10140 ...)
4496 T \p{Numeric_Value: 1/3}  (Short: \p{Nv=1/3}) (6: U+2153, U+109F9,
4497                             U+10E7D, U+1245A, U+1245D, U+12465)
4498 T \p{Numeric_Value: 3/8}  (Short: \p{Nv=3/8}) (1: U+215C)
4499 T \p{Numeric_Value: 2/5}  (Short: \p{Nv=2/5}) (1: U+2156)
4500 T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1: U+109FA)
4501 T \p{Numeric_Value: 1/2}  (Short: \p{Nv=1/2}) (19: [\xbd], U+0B73,
4502                             U+0D74, U+0F2A, U+2CFD, U+A831 ...)
4503 T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1: U+109FC)
4504 T \p{Numeric_Value: 3/5}  (Short: \p{Nv=3/5}) (1: U+2157)
4505 T \p{Numeric_Value: 5/8}  (Short: \p{Nv=5/8}) (1: U+215D)
4506 T \p{Numeric_Value: 2/3}  (Short: \p{Nv=2/3}) (7: U+2154, U+10177,
4507                             U+109FD, U+10E7E, U+1245B, U+1245E ...)
4508 T \p{Numeric_Value: 3/4}  (Short: \p{Nv=3/4}) (9: [\xbe], U+09F8,
4509                             U+0B74, U+0D75, U+A832, U+10178 ...)
4510 T \p{Numeric_Value: 4/5}  (Short: \p{Nv=4/5}) (1: U+2158)
4511 T \p{Numeric_Value: 5/6}  (Short: \p{Nv=5/6}) (3: U+215A, U+109FF,
4512                             U+1245C)
4513 T \p{Numeric_Value: 7/8}  (Short: \p{Nv=7/8}) (1: U+215E)
4514 T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1: U+109BC)
4515 T \p{Numeric_Value: 1}    (Short: \p{Nv=1}) (144: [1\xb9], U+0661,
4516                             U+06F1, U+07C1, U+0967, U+09E7 ...)
4517 T \p{Numeric_Value: 3/2}  (Short: \p{Nv=3/2}) (1: U+0F2B)
4518 T \p{Numeric_Value: 2}    (Short: \p{Nv=2}) (143: [2\xb2], U+0662,
4519                             U+06F2, U+07C2, U+0968, U+09E8 ...)
4520 T \p{Numeric_Value: 5/2}  (Short: \p{Nv=5/2}) (1: U+0F2C)
4521 T \p{Numeric_Value: 3}    (Short: \p{Nv=3}) (144: [3\xb3], U+0663,
4522                             U+06F3, U+07C3, U+0969, U+09E9 ...)
4523 T \p{Numeric_Value: 7/2}  (Short: \p{Nv=7/2}) (1: U+0F2D)
4524 T \p{Numeric_Value: 4}    (Short: \p{Nv=4}) (135: [4], U+0664,
4525                             U+06F4, U+07C4, U+096A, U+09EA ...)
4526 T \p{Numeric_Value: 9/2}  (Short: \p{Nv=9/2}) (1: U+0F2E)
4527 T \p{Numeric_Value: 5}    (Short: \p{Nv=5}) (133: [5], U+0665,
4528                             U+06F5, U+07C5, U+096B, U+09EB ...)
4529 T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1: U+0F2F)
4530 T \p{Numeric_Value: 6}    (Short: \p{Nv=6}) (117: [6], U+0666,
4531                             U+06F6, U+07C6, U+096C, U+09EC ...)
4532 T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1: U+0F30)
4533 T \p{Numeric_Value: 7}    (Short: \p{Nv=7}) (116: [7], U+0667,
4534                             U+06F7, U+07C7, U+096D, U+09ED ...)
4535 T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1: U+0F31)
4536 T \p{Numeric_Value: 8}    (Short: \p{Nv=8}) (112: [8], U+0668,
4537                             U+06F8, U+07C8, U+096E, U+09EE ...)
4538 T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1: U+0F32)
4539 T \p{Numeric_Value: 9}    (Short: \p{Nv=9}) (116: [9], U+0669,
4540                             U+06F9, U+07C9, U+096F, U+09EF ...)
4541 T \p{Numeric_Value: 10}   (Short: \p{Nv=10}) (63: U+0BF0, U+0D70,
4542                             U+1372, U+2169, U+2179, U+2469 ...)
4543 T \p{Numeric_Value: 11}   (Short: \p{Nv=11}) (9: U+216A, U+217A,
4544                             U+246A, U+247E, U+2492, U+24EB ...)
4545 T \p{Numeric_Value: 12}   (Short: \p{Nv=12}) (9: U+216B, U+217B,
4546                             U+246B, U+247F, U+2493, U+24EC ...)
4547 T \p{Numeric_Value: 13}   (Short: \p{Nv=13}) (7: U+246C, U+2480,
4548                             U+2494, U+24ED, U+16E8D, U+1D2CD ...)
4549 T \p{Numeric_Value: 14}   (Short: \p{Nv=14}) (7: U+246D, U+2481,
4550                             U+2495, U+24EE, U+16E8E, U+1D2CE ...)
4551 T \p{Numeric_Value: 15}   (Short: \p{Nv=15}) (7: U+246E, U+2482,
4552                             U+2496, U+24EF, U+16E8F, U+1D2CF ...)
4553 T \p{Numeric_Value: 16}   (Short: \p{Nv=16}) (8: U+09F9, U+246F,
4554                             U+2483, U+2497, U+24F0, U+16E90 ...)
4555 T \p{Numeric_Value: 17}   (Short: \p{Nv=17}) (8: U+16EE, U+2470,
4556                             U+2484, U+2498, U+24F1, U+16E91 ...)
4557 T \p{Numeric_Value: 18}   (Short: \p{Nv=18}) (8: U+16EF, U+2471,
4558                             U+2485, U+2499, U+24F2, U+16E92 ...)
4559 T \p{Numeric_Value: 19}   (Short: \p{Nv=19}) (8: U+16F0, U+2472,
4560                             U+2486, U+249A, U+24F3, U+16E93 ...)
4561 T \p{Numeric_Value: 20}   (Short: \p{Nv=20}) (36: U+1373, U+2473,
4562                             U+2487, U+249B, U+24F4, U+3039 ...)
4563 T \p{Numeric_Value: 21}   (Short: \p{Nv=21}) (1: U+3251)
4564 T \p{Numeric_Value: 22}   (Short: \p{Nv=22}) (1: U+3252)
4565 T \p{Numeric_Value: 23}   (Short: \p{Nv=23}) (1: U+3253)
4566 T \p{Numeric_Value: 24}   (Short: \p{Nv=24}) (1: U+3254)
4567 T \p{Numeric_Value: 25}   (Short: \p{Nv=25}) (1: U+3255)
4568 T \p{Numeric_Value: 26}   (Short: \p{Nv=26}) (1: U+3256)
4569 T \p{Numeric_Value: 27}   (Short: \p{Nv=27}) (1: U+3257)
4570 T \p{Numeric_Value: 28}   (Short: \p{Nv=28}) (1: U+3258)
4571 T \p{Numeric_Value: 29}   (Short: \p{Nv=29}) (1: U+3259)
4572 T \p{Numeric_Value: 30}   (Short: \p{Nv=30}) (19: U+1374, U+303A,
4573                             U+324A, U+325A, U+5345, U+10112 ...)
4574 T \p{Numeric_Value: 31}   (Short: \p{Nv=31}) (1: U+325B)
4575 T \p{Numeric_Value: 32}   (Short: \p{Nv=32}) (1: U+325C)
4576 T \p{Numeric_Value: 33}   (Short: \p{Nv=33}) (1: U+325D)
4577 T \p{Numeric_Value: 34}   (Short: \p{Nv=34}) (1: U+325E)
4578 T \p{Numeric_Value: 35}   (Short: \p{Nv=35}) (1: U+325F)
4579 T \p{Numeric_Value: 36}   (Short: \p{Nv=36}) (1: U+32B1)
4580 T \p{Numeric_Value: 37}   (Short: \p{Nv=37}) (1: U+32B2)
4581 T \p{Numeric_Value: 38}   (Short: \p{Nv=38}) (1: U+32B3)
4582 T \p{Numeric_Value: 39}   (Short: \p{Nv=39}) (1: U+32B4)
4583 T \p{Numeric_Value: 40}   (Short: \p{Nv=40}) (18: U+1375, U+324B,
4584                             U+32B5, U+534C, U+10113, U+102ED ...)
4585 T \p{Numeric_Value: 41}   (Short: \p{Nv=41}) (1: U+32B6)
4586 T \p{Numeric_Value: 42}   (Short: \p{Nv=42}) (1: U+32B7)
4587 T \p{Numeric_Value: 43}   (Short: \p{Nv=43}) (1: U+32B8)
4588 T \p{Numeric_Value: 44}   (Short: \p{Nv=44}) (1: U+32B9)
4589 T \p{Numeric_Value: 45}   (Short: \p{Nv=45}) (1: U+32BA)
4590 T \p{Numeric_Value: 46}   (Short: \p{Nv=46}) (1: U+32BB)
4591 T \p{Numeric_Value: 47}   (Short: \p{Nv=47}) (1: U+32BC)
4592 T \p{Numeric_Value: 48}   (Short: \p{Nv=48}) (1: U+32BD)
4593 T \p{Numeric_Value: 49}   (Short: \p{Nv=49}) (1: U+32BE)
4594 T \p{Numeric_Value: 50}   (Short: \p{Nv=50}) (29: U+1376, U+216C,
4595                             U+217C, U+2186, U+324C, U+32BF ...)
4596 T \p{Numeric_Value: 60}   (Short: \p{Nv=60}) (13: U+1377, U+324D,
4597                             U+10115, U+102EF, U+109CE, U+10E6E ...)
4598 T \p{Numeric_Value: 70}   (Short: \p{Nv=70}) (13: U+1378, U+324E,
4599                             U+10116, U+102F0, U+109CF, U+10E6F ...)
4600 T \p{Numeric_Value: 80}   (Short: \p{Nv=80}) (12: U+1379, U+324F,
4601                             U+10117, U+102F1, U+10E70, U+11062 ...)
4602 T \p{Numeric_Value: 90}   (Short: \p{Nv=90}) (12: U+137A, U+10118,
4603                             U+102F2, U+10341, U+10E71, U+11063 ...)
4604 T \p{Numeric_Value: 100}  (Short: \p{Nv=100}) (35: U+0BF1, U+0D71,
4605                             U+137B, U+216D, U+217D, U+4F70 ...)
4606 T \p{Numeric_Value: 200}  (Short: \p{Nv=200}) (6: U+1011A, U+102F4,
4607                             U+109D3, U+10E73, U+1EC84, U+1ED14)
4608 T \p{Numeric_Value: 300}  (Short: \p{Nv=300}) (7: U+1011B, U+1016B,
4609                             U+102F5, U+109D4, U+10E74, U+1EC85 ...)
4610 T \p{Numeric_Value: 400}  (Short: \p{Nv=400}) (7: U+1011C, U+102F6,
4611                             U+109D5, U+10E75, U+1EC86, U+1ED16 ...)
4612 T \p{Numeric_Value: 500}  (Short: \p{Nv=500}) (16: U+216E, U+217E,
4613                             U+1011D, U+10145, U+1014C, U+10153 ...)
4614 T \p{Numeric_Value: 600}  (Short: \p{Nv=600}) (7: U+1011E, U+102F8,
4615                             U+109D7, U+10E77, U+1EC88, U+1ED18 ...)
4616 T \p{Numeric_Value: 700}  (Short: \p{Nv=700}) (6: U+1011F, U+102F9,
4617                             U+109D8, U+10E78, U+1EC89, U+1ED19)
4618 T \p{Numeric_Value: 800}  (Short: \p{Nv=800}) (6: U+10120, U+102FA,
4619                             U+109D9, U+10E79, U+1EC8A, U+1ED1A)
4620 T \p{Numeric_Value: 900}  (Short: \p{Nv=900}) (7: U+10121, U+102FB,
4621                             U+1034A, U+109DA, U+10E7A, U+1EC8B ...)
4622 T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (22: U+0BF2, U+0D72,
4623                             U+216F, U+217F..2180, U+4EDF, U+5343 ...)
4624 T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (5: U+10123, U+109DC,
4625                             U+1EC8D, U+1ED1D, U+1ED3A)
4626 T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (4: U+10124, U+109DD,
4627                             U+1EC8E, U+1ED1E)
4628 T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (4: U+10125, U+109DE,
4629                             U+1EC8F, U+1ED1F)
4630 T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (8: U+2181, U+10126,
4631                             U+10146, U+1014E, U+10172, U+109DF ...)
4632 T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (4: U+10127, U+109E0,
4633                             U+1EC91, U+1ED21)
4634 T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (4: U+10128, U+109E1,
4635                             U+1EC92, U+1ED22)
4636 T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (4: U+10129, U+109E2,
4637                             U+1EC93, U+1ED23)
4638 T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (4: U+1012A, U+109E3,
4639                             U+1EC94, U+1ED24)
4640 T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (13:
4641                             U+137C, U+2182, U+4E07, U+842C, U+1012B,
4642                             U+10155 ...)
4643 T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (4:
4644                             U+1012C, U+109E5, U+1EC96, U+1ED26)
4645 T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (4:
4646                             U+1012D, U+109E6, U+1EC97, U+1ED27)
4647 T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (4:
4648                             U+1012E, U+109E7, U+1EC98, U+1ED28)
4649 T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (7:
4650                             U+2187, U+1012F, U+10147, U+10156,
4651                             U+109E8, U+1EC99 ...)
4652 T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (4:
4653                             U+10130, U+109E9, U+1EC9A, U+1ED2A)
4654 T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (4:
4655                             U+10131, U+109EA, U+1EC9B, U+1ED2B)
4656 T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (4:
4657                             U+10132, U+109EB, U+1EC9C, U+1ED2C)
4658 T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (4:
4659                             U+10133, U+109EC, U+1EC9D, U+1ED2D)
4660 T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (5:
4661                             U+2188, U+109ED, U+1EC9E, U+1ECA0,
4662                             U+1ECB4)
4663 T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (2:
4664                             U+109EE, U+1EC9F)
4665 T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1:
4666                             U+12432)
4667 T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1:
4668                             U+109EF)
4669 T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1:
4670                             U+109F0)
4671 T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1:
4672                             U+12433)
4673 T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1:
4674                             U+109F1)
4675 T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1:
4676                             U+109F2)
4677 T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1:
4678                             U+109F3)
4679 T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1:
4680                             U+109F4)
4681 T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1:
4682                             U+109F5)
4683 T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1:
4684                             U+16B5E)
4685 T \p{Numeric_Value: 10000000} (= 1.0e+07) (Short: \p{Nv=10000000})
4686                             (1: U+1ECA1)
4687 T \p{Numeric_Value: 20000000} (= 2.0e+07) (Short: \p{Nv=20000000})
4688                             (1: U+1ECA2)
4689 T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000})
4690                             (3: U+4EBF, U+5104, U+16B5F)
4691 T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=
4692                             10000000000}) (1: U+16B60)
4693 T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=
4694                             1000000000000}) (2: U+5146, U+16B61)
4695   \p{Numeric_Value: NaN}  (Short: \p{Nv=NaN}) (1_112_200 plus all
4696                             above-Unicode code points: [\x00-\x20!
4697                             \"#\$\%&\'\(\)*+,\-.\/:;<=>?\@A-Z\[\\\]
4698                             \^_`a-z\{\|\}~\x7f-\xb1\xb4-\xb8\xba-
4699                             \xbb\xbf-\xff], U+0100..065F,
4700                             U+066A..06EF, U+06FA..07BF,
4701                             U+07CA..0965, U+0970..09E5 ...)
4702   \p{Nushu}               \p{Script_Extensions=Nushu} (Short:
4703                             \p{Nshu}; NOT \p{Block=Nushu}) (397)
4704   \p{Nv: *}               \p{Numeric_Value: *}
4705   \p{Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
4706                             Nyiakeng_Puachue_Hmong} (Short:
4707                             \p{Hmnp}; NOT \p{Block=
4708                             Nyiakeng_Puachue_Hmong}) (71)
4709 X \p{OCR}                 \p{Optical_Character_Recognition} (=
4710                             \p{Block=Optical_Character_Recognition})
4711                             (32)
4712   \p{Ogam}                \p{Ogham} (= \p{Script_Extensions=Ogham})
4713                             (NOT \p{Block=Ogham}) (29)
4714   \p{Ogham}               \p{Script_Extensions=Ogham} (Short:
4715                             \p{Ogam}; NOT \p{Block=Ogham}) (29)
4716   \p{Ol_Chiki}            \p{Script_Extensions=Ol_Chiki} (Short:
4717                             \p{Olck}) (48)
4718   \p{Olck}                \p{Ol_Chiki} (= \p{Script_Extensions=
4719                             Ol_Chiki}) (48)
4720   \p{Old_Hungarian}       \p{Script_Extensions=Old_Hungarian}
4721                             (Short: \p{Hung}; NOT \p{Block=
4722                             Old_Hungarian}) (108)
4723   \p{Old_Italic}          \p{Script_Extensions=Old_Italic} (Short:
4724                             \p{Ital}; NOT \p{Block=Old_Italic}) (39)
4725   \p{Old_North_Arabian}   \p{Script_Extensions=Old_North_Arabian}
4726                             (Short: \p{Narb}) (32)
4727   \p{Old_Permic}          \p{Script_Extensions=Old_Permic} (Short:
4728                             \p{Perm}; NOT \p{Block=Old_Permic}) (44)
4729   \p{Old_Persian}         \p{Script_Extensions=Old_Persian} (Short:
4730                             \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
4731   \p{Old_Sogdian}         \p{Script_Extensions=Old_Sogdian} (Short:
4732                             \p{Sogo}; NOT \p{Block=Old_Sogdian}) (40)
4733   \p{Old_South_Arabian}   \p{Script_Extensions=Old_South_Arabian}
4734                             (Short: \p{Sarb}) (32)
4735   \p{Old_Turkic}          \p{Script_Extensions=Old_Turkic} (Short:
4736                             \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
4737   \p{Old_Uyghur}          \p{Script_Extensions=Old_Uyghur} (Short:
4738                             \p{Ougr}; NOT \p{Block=Old_Uyghur}) (28)
4739   \p{Open_Punctuation}    \p{General_Category=Open_Punctuation}
4740                             (Short: \p{Ps}) (79)
4741 X \p{Optical_Character_Recognition} \p{Block=
4742                             Optical_Character_Recognition} (Short:
4743                             \p{InOCR}) (32)
4744   \p{Oriya}               \p{Script_Extensions=Oriya} (Short:
4745                             \p{Orya}; NOT \p{Block=Oriya}) (97)
4746   \p{Orkh}                \p{Old_Turkic} (= \p{Script_Extensions=
4747                             Old_Turkic}) (NOT \p{Block=Old_Turkic})
4748                             (73)
4749 X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
4750   \p{Orya}                \p{Oriya} (= \p{Script_Extensions=Oriya})
4751                             (NOT \p{Block=Oriya}) (97)
4752   \p{Osage}               \p{Script_Extensions=Osage} (Short:
4753                             \p{Osge}; NOT \p{Block=Osage}) (72)
4754   \p{Osge}                \p{Osage} (= \p{Script_Extensions=Osage})
4755                             (NOT \p{Block=Osage}) (72)
4756   \p{Osma}                \p{Osmanya} (= \p{Script_Extensions=
4757                             Osmanya}) (NOT \p{Block=Osmanya}) (40)
4758   \p{Osmanya}             \p{Script_Extensions=Osmanya} (Short:
4759                             \p{Osma}; NOT \p{Block=Osmanya}) (40)
4760   \p{Other}               \p{General_Category=Other} (Short: \p{C})
4761                             (965_096 plus all above-Unicode code
4762                             points)
4763   \p{Other_Letter}        \p{General_Category=Other_Letter} (Short:
4764                             \p{Lo}) (131_612)
4765   \p{Other_Number}        \p{General_Category=Other_Number} (Short:
4766                             \p{No}) (915)
4767   \p{Other_Punctuation}   \p{General_Category=Other_Punctuation}
4768                             (Short: \p{Po}) (628)
4769   \p{Other_Symbol}        \p{General_Category=Other_Symbol} (Short:
4770                             \p{So}) (6634)
4771 X \p{Ottoman_Siyaq_Numbers} \p{Block=Ottoman_Siyaq_Numbers} (80)
4772   \p{Ougr}                \p{Old_Uyghur} (= \p{Script_Extensions=
4773                             Old_Uyghur}) (NOT \p{Block=Old_Uyghur})
4774                             (28)
4775   \p{P} \pP               \p{Punct} (= \p{General_Category=
4776                             Punctuation}) (NOT
4777                             \p{General_Punctuation}) (842)
4778   \p{Pahawh_Hmong}        \p{Script_Extensions=Pahawh_Hmong} (Short:
4779                             \p{Hmng}; NOT \p{Block=Pahawh_Hmong})
4780                             (127)
4781   \p{Palm}                \p{Palmyrene} (= \p{Script_Extensions=
4782                             Palmyrene}) (32)
4783   \p{Palmyrene}           \p{Script_Extensions=Palmyrene} (Short:
4784                             \p{Palm}) (32)
4785   \p{Paragraph_Separator} \p{General_Category=Paragraph_Separator}
4786                             (Short: \p{Zp}) (1)
4787   \p{Pat_Syn}             \p{Pattern_Syntax} (= \p{Pattern_Syntax=
4788                             Y}) (2760)
4789   \p{Pat_Syn: *}          \p{Pattern_Syntax: *}
4790   \p{Pat_WS}              \p{Pattern_White_Space} (=
4791                             \p{Pattern_White_Space=Y}) (11)
4792   \p{Pat_WS: *}           \p{Pattern_White_Space: *}
4793   \p{Pattern_Syntax}      \p{Pattern_Syntax=Y} (Short: \p{PatSyn})
4794                             (2760)
4795   \p{Pattern_Syntax: N*}  (Short: \p{PatSyn=N}, \P{PatSyn})
4796                             (1_111_352 plus all above-Unicode code
4797                             points: [\x00-\x200-9A-Z_a-z\x7f-\xa0
4798                             \xa8\xaa\xad\xaf\xb2-\xb5\xb7-\xba\xbc-
4799                             \xbe\xc0-\xd6\xd8-\xf6\xf8-\xff],
4800                             U+0100..200F, U+2028..202F,
4801                             U+203F..2040, U+2054, U+205F..218F ...)
4802   \p{Pattern_Syntax: Y*}  (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760:
4803                             [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
4804                             \^`\{\|\}~\xa1-\xa7\xa9\xab-\xac\xae
4805                             \xb0-\xb1\xb6\xbb\xbf\xd7\xf7],
4806                             U+2010..2027, U+2030..203E,
4807                             U+2041..2053, U+2055..205E, U+2190..245F
4808                             ...)
4809   \p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short:
4810                             \p{PatWS}) (11)
4811   \p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS})
4812                             (1_114_101 plus all above-Unicode code
4813                             points: [^\t\n\cK\f\r\x20\x85],
4814                             U+0100..200D, U+2010..2027,
4815                             U+202A..infinity)
4816   \p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11:
4817                             [\t\n\cK\f\r\x20\x85], U+200E..200F,
4818                             U+2028..2029)
4819   \p{Pau_Cin_Hau}         \p{Script_Extensions=Pau_Cin_Hau} (Short:
4820                             \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
4821   \p{Pauc}                \p{Pau_Cin_Hau} (= \p{Script_Extensions=
4822                             Pau_Cin_Hau}) (NOT \p{Block=
4823                             Pau_Cin_Hau}) (57)
4824   \p{Pc}                  \p{Connector_Punctuation} (=
4825                             \p{General_Category=
4826                             Connector_Punctuation}) (10)
4827   \p{PCM}                 \p{Prepended_Concatenation_Mark} (=
4828                             \p{Prepended_Concatenation_Mark=Y}) (13)
4829   \p{PCM: *}              \p{Prepended_Concatenation_Mark: *}
4830   \p{Pd}                  \p{Dash_Punctuation} (=
4831                             \p{General_Category=Dash_Punctuation})
4832                             (26)
4833   \p{Pe}                  \p{Close_Punctuation} (=
4834                             \p{General_Category=Close_Punctuation})
4835                             (77)
4836   \p{PerlSpace}           \p{PosixSpace} (6)
4837   \p{PerlWord}            \p{PosixWord} (63)
4838   \p{Perm}                \p{Old_Permic} (= \p{Script_Extensions=
4839                             Old_Permic}) (NOT \p{Block=Old_Permic})
4840                             (44)
4841   \p{Pf}                  \p{Final_Punctuation} (=
4842                             \p{General_Category=Final_Punctuation})
4843                             (10)
4844   \p{Phag}                \p{Phags_Pa} (= \p{Script_Extensions=
4845                             Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
4846   \p{Phags_Pa}            \p{Script_Extensions=Phags_Pa} (Short:
4847                             \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
4848 X \p{Phaistos}            \p{Phaistos_Disc} (= \p{Block=
4849                             Phaistos_Disc}) (48)
4850 X \p{Phaistos_Disc}       \p{Block=Phaistos_Disc} (Short:
4851                             \p{InPhaistos}) (48)
4852   \p{Phli}                \p{Inscriptional_Pahlavi} (=
4853                             \p{Script_Extensions=
4854                             Inscriptional_Pahlavi}) (NOT \p{Block=
4855                             Inscriptional_Pahlavi}) (27)
4856   \p{Phlp}                \p{Psalter_Pahlavi} (=
4857                             \p{Script_Extensions=Psalter_Pahlavi})
4858                             (NOT \p{Block=Psalter_Pahlavi}) (30)
4859   \p{Phnx}                \p{Phoenician} (= \p{Script_Extensions=
4860                             Phoenician}) (NOT \p{Block=Phoenician})
4861                             (29)
4862   \p{Phoenician}          \p{Script_Extensions=Phoenician} (Short:
4863                             \p{Phnx}; NOT \p{Block=Phoenician}) (29)
4864 X \p{Phonetic_Ext}        \p{Phonetic_Extensions} (= \p{Block=
4865                             Phonetic_Extensions}) (128)
4866 X \p{Phonetic_Ext_Sup}    \p{Phonetic_Extensions_Supplement} (=
4867                             \p{Block=
4868                             Phonetic_Extensions_Supplement}) (64)
4869 X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short:
4870                             \p{InPhoneticExt}) (128)
4871 X \p{Phonetic_Extensions_Supplement} \p{Block=
4872                             Phonetic_Extensions_Supplement} (Short:
4873                             \p{InPhoneticExtSup}) (64)
4874   \p{Pi}                  \p{Initial_Punctuation} (=
4875                             \p{General_Category=
4876                             Initial_Punctuation}) (12)
4877 X \p{Playing_Cards}       \p{Block=Playing_Cards} (96)
4878   \p{Plrd}                \p{Miao} (= \p{Script_Extensions=Miao})
4879                             (NOT \p{Block=Miao}) (149)
4880   \p{Po}                  \p{Other_Punctuation} (=
4881                             \p{General_Category=Other_Punctuation})
4882                             (628)
4883   \p{PosixAlnum}          (62: [0-9A-Za-z])
4884   \p{PosixAlpha}          (52: [A-Za-z])
4885   \p{PosixBlank}          (2: [\t\x20])
4886   \p{PosixCntrl}          ASCII control characters (33: ACK, BEL,
4887                             BS, CAN, CR, DC1, DC2, DC3, DC4, DEL,
4888                             DLE, ENQ, EOM, EOT, ESC, ETB, ETX, FF,
4889                             FS, GS, HT, LF, NAK, NUL, RS, SI, SO,
4890                             SOH, STX, SUB, SYN, US, VT)
4891   \p{PosixDigit}          (10: [0-9])
4892   \p{PosixGraph}          (94: [!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
4893                             Z\[\\\]\^_`a-z\{\|\}~])
4894   \p{PosixLower}          (/i= PosixAlpha) (26: [a-z])
4895   \p{PosixPrint}          (95: [\x20-\x7e])
4896   \p{PosixPunct}          (32: [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@
4897                             \[\\\]\^_`\{\|\}~])
4898   \p{PosixSpace}          (Short: \p{PerlSpace}) (6: [\t\n\cK\f\r
4899                             \x20])
4900   \p{PosixUpper}          (/i= PosixAlpha) (26: [A-Z])
4901   \p{PosixWord}           \w, restricted to ASCII (Short:
4902                             \p{PerlWord}) (63: [0-9A-Z_a-z])
4903   \p{PosixXDigit}         \p{ASCII_Hex_Digit=Y} (Short: \p{AHex})
4904                             (22)
4905   \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
4906                             Y} (Short: \p{PCM}) (13)
4907   \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
4908                             (1_114_099 plus all above-Unicode code
4909                             points: U+0000..05FF, U+0606..06DC,
4910                             U+06DE..070E, U+0710..088F,
4911                             U+0892..08E1, U+08E3..110BC ...)
4912   \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
4913                             (13: U+0600..0605, U+06DD, U+070F,
4914                             U+0890..0891, U+08E2, U+110BD ...)
4915 T \p{Present_In: 1.1}     \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
4916                             extension) (33_979)
4917   \p{Present_In: V1_1}    \p{Present_In=1.1} (= \p{Age=V1_1}) (Perl
4918                             extension) (33_979)
4919 T \p{Present_In: 2.0}     Code point's usage introduced in version
4920                             2.0 or earlier (Short: \p{In=2.0}) (Perl
4921                             extension) (178_500: U+0000..01F5,
4922                             U+01FA..0217, U+0250..02A8,
4923                             U+02B0..02DE, U+02E0..02E9, U+0300..0345
4924                             ...)
4925   \p{Present_In: V2_0}    \p{Present_In=2.0} (Perl extension)
4926                             (178_500)
4927 T \p{Present_In: 2.1}     Code point's usage introduced in version
4928                             2.1 or earlier (Short: \p{In=2.1}) (Perl
4929                             extension) (178_502: U+0000..01F5,
4930                             U+01FA..0217, U+0250..02A8,
4931                             U+02B0..02DE, U+02E0..02E9, U+0300..0345
4932                             ...)
4933   \p{Present_In: V2_1}    \p{Present_In=2.1} (Perl extension)
4934                             (178_502)
4935 T \p{Present_In: 3.0}     Code point's usage introduced in version
4936                             3.0 or earlier (Short: \p{In=3.0}) (Perl
4937                             extension) (188_809: U+0000..021F,
4938                             U+0222..0233, U+0250..02AD,
4939                             U+02B0..02EE, U+0300..034E, U+0360..0362
4940                             ...)
4941   \p{Present_In: V3_0}    \p{Present_In=3.0} (Perl extension)
4942                             (188_809)
4943 T \p{Present_In: 3.1}     Code point's usage introduced in version
4944                             3.1 or earlier (Short: \p{In=3.1}) (Perl
4945                             extension) (233_787: U+0000..021F,
4946                             U+0222..0233, U+0250..02AD,
4947                             U+02B0..02EE, U+0300..034E, U+0360..0362
4948                             ...)
4949   \p{Present_In: V3_1}    \p{Present_In=3.1} (Perl extension)
4950                             (233_787)
4951 T \p{Present_In: 3.2}     Code point's usage introduced in version
4952                             3.2 or earlier (Short: \p{In=3.2}) (Perl
4953                             extension) (234_803: U+0000..0220,
4954                             U+0222..0233, U+0250..02AD,
4955                             U+02B0..02EE, U+0300..034F, U+0360..036F
4956                             ...)
4957   \p{Present_In: V3_2}    \p{Present_In=3.2} (Perl extension)
4958                             (234_803)
4959 T \p{Present_In: 4.0}     Code point's usage introduced in version
4960                             4.0 or earlier (Short: \p{In=4.0}) (Perl
4961                             extension) (236_029: U+0000..0236,
4962                             U+0250..0357, U+035D..036F,
4963                             U+0374..0375, U+037A, U+037E ...)
4964   \p{Present_In: V4_0}    \p{Present_In=4.0} (Perl extension)
4965                             (236_029)
4966 T \p{Present_In: 4.1}     Code point's usage introduced in version
4967                             4.1 or earlier (Short: \p{In=4.1}) (Perl
4968                             extension) (237_302: U+0000..0241,
4969                             U+0250..036F, U+0374..0375, U+037A,
4970                             U+037E, U+0384..038A ...)
4971   \p{Present_In: V4_1}    \p{Present_In=4.1} (Perl extension)
4972                             (237_302)
4973 T \p{Present_In: 5.0}     Code point's usage introduced in version
4974                             5.0 or earlier (Short: \p{In=5.0}) (Perl
4975                             extension) (238_671: U+0000..036F,
4976                             U+0374..0375, U+037A..037E,
4977                             U+0384..038A, U+038C, U+038E..03A1 ...)
4978   \p{Present_In: V5_0}    \p{Present_In=5.0} (Perl extension)
4979                             (238_671)
4980 T \p{Present_In: 5.1}     Code point's usage introduced in version
4981                             5.1 or earlier (Short: \p{In=5.1}) (Perl
4982                             extension) (240_295: U+0000..0377,
4983                             U+037A..037E, U+0384..038A, U+038C,
4984                             U+038E..03A1, U+03A3..0523 ...)
4985   \p{Present_In: V5_1}    \p{Present_In=5.1} (Perl extension)
4986                             (240_295)
4987 T \p{Present_In: 5.2}     Code point's usage introduced in version
4988                             5.2 or earlier (Short: \p{In=5.2}) (Perl
4989                             extension) (246_943: U+0000..0377,
4990                             U+037A..037E, U+0384..038A, U+038C,
4991                             U+038E..03A1, U+03A3..0525 ...)
4992   \p{Present_In: V5_2}    \p{Present_In=5.2} (Perl extension)
4993                             (246_943)
4994 T \p{Present_In: 6.0}     Code point's usage introduced in version
4995                             6.0 or earlier (Short: \p{In=6.0}) (Perl
4996                             extension) (249_031: U+0000..0377,
4997                             U+037A..037E, U+0384..038A, U+038C,
4998                             U+038E..03A1, U+03A3..0527 ...)
4999   \p{Present_In: V6_0}    \p{Present_In=6.0} (Perl extension)
5000                             (249_031)
5001 T \p{Present_In: 6.1}     Code point's usage introduced in version
5002                             6.1 or earlier (Short: \p{In=6.1}) (Perl
5003                             extension) (249_763: U+0000..0377,
5004                             U+037A..037E, U+0384..038A, U+038C,
5005                             U+038E..03A1, U+03A3..0527 ...)
5006   \p{Present_In: V6_1}    \p{Present_In=6.1} (Perl extension)
5007                             (249_763)
5008 T \p{Present_In: 6.2}     Code point's usage introduced in version
5009                             6.2 or earlier (Short: \p{In=6.2}) (Perl
5010                             extension) (249_764: U+0000..0377,
5011                             U+037A..037E, U+0384..038A, U+038C,
5012                             U+038E..03A1, U+03A3..0527 ...)
5013   \p{Present_In: V6_2}    \p{Present_In=6.2} (Perl extension)
5014                             (249_764)
5015 T \p{Present_In: 6.3}     Code point's usage introduced in version
5016                             6.3 or earlier (Short: \p{In=6.3}) (Perl
5017                             extension) (249_769: U+0000..0377,
5018                             U+037A..037E, U+0384..038A, U+038C,
5019                             U+038E..03A1, U+03A3..0527 ...)
5020   \p{Present_In: V6_3}    \p{Present_In=6.3} (Perl extension)
5021                             (249_769)
5022 T \p{Present_In: 7.0}     Code point's usage introduced in version
5023                             7.0 or earlier (Short: \p{In=7.0}) (Perl
5024                             extension) (252_603: U+0000..0377,
5025                             U+037A..037F, U+0384..038A, U+038C,
5026                             U+038E..03A1, U+03A3..052F ...)
5027   \p{Present_In: V7_0}    \p{Present_In=7.0} (Perl extension)
5028                             (252_603)
5029 T \p{Present_In: 8.0}     Code point's usage introduced in version
5030                             8.0 or earlier (Short: \p{In=8.0}) (Perl
5031                             extension) (260_319: U+0000..0377,
5032                             U+037A..037F, U+0384..038A, U+038C,
5033                             U+038E..03A1, U+03A3..052F ...)
5034   \p{Present_In: V8_0}    \p{Present_In=8.0} (Perl extension)
5035                             (260_319)
5036 T \p{Present_In: 9.0}     Code point's usage introduced in version
5037                             9.0 or earlier (Short: \p{In=9.0}) (Perl
5038                             extension) (267_819: U+0000..0377,
5039                             U+037A..037F, U+0384..038A, U+038C,
5040                             U+038E..03A1, U+03A3..052F ...)
5041   \p{Present_In: V9_0}    \p{Present_In=9.0} (Perl extension)
5042                             (267_819)
5043 T \p{Present_In: 10.0}    Code point's usage introduced in version
5044                             10.0 or earlier (Short: \p{In=10.0})
5045                             (Perl extension) (276_337: U+0000..0377,
5046                             U+037A..037F, U+0384..038A, U+038C,
5047                             U+038E..03A1, U+03A3..052F ...)
5048   \p{Present_In: V10_0}   \p{Present_In=10.0} (Perl extension)
5049                             (276_337)
5050 T \p{Present_In: 11.0}    Code point's usage introduced in version
5051                             11.0 or earlier (Short: \p{In=11.0})
5052                             (Perl extension) (277_021: U+0000..0377,
5053                             U+037A..037F, U+0384..038A, U+038C,
5054                             U+038E..03A1, U+03A3..052F ...)
5055   \p{Present_In: V11_0}   \p{Present_In=11.0} (Perl extension)
5056                             (277_021)
5057 T \p{Present_In: 12.0}    Code point's usage introduced in version
5058                             12.0 or earlier (Short: \p{In=12.0})
5059                             (Perl extension) (277_575: U+0000..0377,
5060                             U+037A..037F, U+0384..038A, U+038C,
5061                             U+038E..03A1, U+03A3..052F ...)
5062   \p{Present_In: V12_0}   \p{Present_In=12.0} (Perl extension)
5063                             (277_575)
5064 T \p{Present_In: 12.1}    Code point's usage introduced in version
5065                             12.1 or earlier (Short: \p{In=12.1})
5066                             (Perl extension) (277_576: U+0000..0377,
5067                             U+037A..037F, U+0384..038A, U+038C,
5068                             U+038E..03A1, U+03A3..052F ...)
5069   \p{Present_In: V12_1}   \p{Present_In=12.1} (Perl extension)
5070                             (277_576)
5071 T \p{Present_In: 13.0}    Code point's usage introduced in version
5072                             13.0 or earlier (Short: \p{In=13.0})
5073                             (Perl extension) (283_506: U+0000..0377,
5074                             U+037A..037F, U+0384..038A, U+038C,
5075                             U+038E..03A1, U+03A3..052F ...)
5076   \p{Present_In: V13_0}   \p{Present_In=13.0} (Perl extension)
5077                             (283_506)
5078 T \p{Present_In: 14.0}    Code point's usage introduced in version
5079                             14.0 or earlier (Short: \p{In=14.0})
5080                             (Perl extension) (284_344: U+0000..0377,
5081                             U+037A..037F, U+0384..038A, U+038C,
5082                             U+038E..03A1, U+03A3..052F ...)
5083   \p{Present_In: V14_0}   \p{Present_In=14.0} (Perl extension)
5084                             (284_344)
5085 T \p{Present_In: 15.0}    Code point's usage introduced in version
5086                             15.0 or earlier (Short: \p{In=15.0})
5087                             (Perl extension) (288_833: U+0000..0377,
5088                             U+037A..037F, U+0384..038A, U+038C,
5089                             U+038E..03A1, U+03A3..052F ...)
5090   \p{Present_In: V15_0}   \p{Present_In=15.0} (Perl extension)
5091                             (288_833)
5092   \p{Present_In: NA}      \p{Present_In=Unassigned} (= \p{Age=
5093                             Unassigned}) (Perl extension) (825_279
5094                             plus all above-Unicode code points)
5095   \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=NA})
5096                             (Perl extension) (825_279 plus all
5097                             above-Unicode code points)
5098   \p{Print}               \p{XPosixPrint} (286_652)
5099   \p{Private_Use}         \p{General_Category=Private_Use} (Short:
5100                             \p{Co}; NOT \p{Private_Use_Area})
5101                             (137_468)
5102 X \p{Private_Use_Area}    \p{Block=Private_Use_Area} (Short:
5103                             \p{InPUA}) (6400)
5104   \p{Prti}                \p{Inscriptional_Parthian} (=
5105                             \p{Script_Extensions=
5106                             Inscriptional_Parthian}) (NOT \p{Block=
5107                             Inscriptional_Parthian}) (30)
5108   \p{Ps}                  \p{Open_Punctuation} (=
5109                             \p{General_Category=Open_Punctuation})
5110                             (79)
5111   \p{Psalter_Pahlavi}     \p{Script_Extensions=Psalter_Pahlavi}
5112                             (Short: \p{Phlp}; NOT \p{Block=
5113                             Psalter_Pahlavi}) (30)
5114 X \p{PUA}                 \p{Private_Use_Area} (= \p{Block=
5115                             Private_Use_Area}) (6400)
5116   \p{Punct}               \p{General_Category=Punctuation} (Short:
5117                             \p{P}; NOT \p{General_Punctuation}) (842)
5118   \p{Punctuation}         \p{Punct} (= \p{General_Category=
5119                             Punctuation}) (NOT
5120                             \p{General_Punctuation}) (842)
5121   \p{Qaac}                \p{Coptic} (= \p{Script_Extensions=
5122                             Coptic}) (NOT \p{Block=Coptic}) (165)
5123   \p{Qaai}                \p{Inherited} (= \p{Script_Extensions=
5124                             Inherited}) (586)
5125   \p{QMark}               \p{Quotation_Mark} (= \p{Quotation_Mark=
5126                             Y}) (30)
5127   \p{QMark: *}            \p{Quotation_Mark: *}
5128   \p{Quotation_Mark}      \p{Quotation_Mark=Y} (Short: \p{QMark})
5129                             (30)
5130   \p{Quotation_Mark: N*}  (Short: \p{QMark=N}, \P{QMark}) (1_114_082
5131                             plus all above-Unicode code points:
5132                             [\x00-\x20!#\$\%&\(\)*+,\-.\/0-9:;<=>?
5133                             \@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xaa\xac-
5134                             \xba\xbc-\xff], U+0100..2017,
5135                             U+2020..2038, U+203B..2E41,
5136                             U+2E43..300B, U+3010..301C ...)
5137   \p{Quotation_Mark: Y*}  (Short: \p{QMark=Y}, \p{QMark}) (30: [\"
5138                             \'\xab\xbb], U+2018..201F, U+2039..203A,
5139                             U+2E42, U+300C..300F, U+301D..301F ...)
5140   \p{Radical}             \p{Radical=Y} (329)
5141   \p{Radical: N*}         (Single: \P{Radical}) (1_113_783 plus all
5142                             above-Unicode code points: U+0000..2E7F,
5143                             U+2E9A, U+2EF4..2EFF, U+2FD6..infinity)
5144   \p{Radical: Y*}         (Single: \p{Radical}) (329: U+2E80..2E99,
5145                             U+2E9B..2EF3, U+2F00..2FD5)
5146   \p{Regional_Indicator}  \p{Regional_Indicator=Y} (Short: \p{RI})
5147                             (26)
5148   \p{Regional_Indicator: N*} (Short: \p{RI=N}, \P{RI}) (1_114_086
5149                             plus all above-Unicode code points:
5150                             U+0000..1F1E5, U+1F200..infinity)
5151   \p{Regional_Indicator: Y*} (Short: \p{RI=Y}, \p{RI}) (26:
5152                             U+1F1E6..1F1FF)
5153   \p{Rejang}              \p{Script_Extensions=Rejang} (Short:
5154                             \p{Rjng}; NOT \p{Block=Rejang}) (37)
5155   \p{RI}                  \p{Regional_Indicator} (=
5156                             \p{Regional_Indicator=Y}) (26)
5157   \p{RI: *}               \p{Regional_Indicator: *}
5158   \p{Rjng}                \p{Rejang} (= \p{Script_Extensions=
5159                             Rejang}) (NOT \p{Block=Rejang}) (37)
5160   \p{Rohg}                \p{Hanifi_Rohingya} (=
5161                             \p{Script_Extensions=Hanifi_Rohingya})
5162                             (NOT \p{Block=Hanifi_Rohingya}) (55)
5163 X \p{Rumi}                \p{Rumi_Numeral_Symbols} (= \p{Block=
5164                             Rumi_Numeral_Symbols}) (32)
5165 X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
5166                             \p{InRumi}) (32)
5167   \p{Runic}               \p{Script_Extensions=Runic} (Short:
5168                             \p{Runr}; NOT \p{Block=Runic}) (86)
5169   \p{Runr}                \p{Runic} (= \p{Script_Extensions=Runic})
5170                             (NOT \p{Block=Runic}) (86)
5171   \p{S} \pS               \p{Symbol} (= \p{General_Category=Symbol})
5172                             (7770)
5173   \p{Samaritan}           \p{Script_Extensions=Samaritan} (Short:
5174                             \p{Samr}; NOT \p{Block=Samaritan}) (61)
5175   \p{Samr}                \p{Samaritan} (= \p{Script_Extensions=
5176                             Samaritan}) (NOT \p{Block=Samaritan})
5177                             (61)
5178   \p{Sarb}                \p{Old_South_Arabian} (=
5179                             \p{Script_Extensions=Old_South_Arabian})
5180                             (32)
5181   \p{Saur}                \p{Saurashtra} (= \p{Script_Extensions=
5182                             Saurashtra}) (NOT \p{Block=Saurashtra})
5183                             (82)
5184   \p{Saurashtra}          \p{Script_Extensions=Saurashtra} (Short:
5185                             \p{Saur}; NOT \p{Block=Saurashtra}) (82)
5186   \p{SB: *}               \p{Sentence_Break: *}
5187   \p{Sc}                  \p{Currency_Symbol} (=
5188                             \p{General_Category=Currency_Symbol})
5189                             (63)
5190   \p{Sc: *}               \p{Script: *}
5191   \p{Script: Adlam}       (Short: \p{Sc=Adlm}) (88: U+1E900..1E94B,
5192                             U+1E950..1E959, U+1E95E..1E95F)
5193   \p{Script: Adlm}        \p{Script=Adlam} (88)
5194   \p{Script: Aghb}        \p{Script=Caucasian_Albanian} (=
5195                             \p{Script_Extensions=
5196                             Caucasian_Albanian}) (53)
5197   \p{Script: Ahom}        \p{Script_Extensions=Ahom} (Short: \p{Sc=
5198                             Ahom}, \p{Ahom}) (65)
5199   \p{Script: Anatolian_Hieroglyphs} \p{Script_Extensions=
5200                             Anatolian_Hieroglyphs} (Short: \p{Sc=
5201                             Hluw}, \p{Hluw}) (583)
5202   \p{Script: Arab}        \p{Script=Arabic} (1368)
5203   \p{Script: Arabic}      (Short: \p{Sc=Arab}) (1368: U+0600..0604,
5204                             U+0606..060B, U+060D..061A,
5205                             U+061C..061E, U+0620..063F, U+0641..064A
5206                             ...)
5207   \p{Script: Armenian}    \p{Script_Extensions=Armenian} (Short:
5208                             \p{Sc=Armn}, \p{Armn}) (96)
5209   \p{Script: Armi}        \p{Script=Imperial_Aramaic} (=
5210                             \p{Script_Extensions=Imperial_Aramaic})
5211                             (31)
5212   \p{Script: Armn}        \p{Script=Armenian} (=
5213                             \p{Script_Extensions=Armenian}) (96)
5214   \p{Script: Avestan}     \p{Script_Extensions=Avestan} (Short:
5215                             \p{Sc=Avst}, \p{Avst}) (61)
5216   \p{Script: Avst}        \p{Script=Avestan} (=
5217                             \p{Script_Extensions=Avestan}) (61)
5218   \p{Script: Bali}        \p{Script=Balinese} (=
5219                             \p{Script_Extensions=Balinese}) (124)
5220   \p{Script: Balinese}    \p{Script_Extensions=Balinese} (Short:
5221                             \p{Sc=Bali}, \p{Bali}) (124)
5222   \p{Script: Bamu}        \p{Script=Bamum} (= \p{Script_Extensions=
5223                             Bamum}) (657)
5224   \p{Script: Bamum}       \p{Script_Extensions=Bamum} (Short: \p{Sc=
5225                             Bamu}, \p{Bamu}) (657)
5226   \p{Script: Bass}        \p{Script=Bassa_Vah} (=
5227                             \p{Script_Extensions=Bassa_Vah}) (36)
5228   \p{Script: Bassa_Vah}   \p{Script_Extensions=Bassa_Vah} (Short:
5229                             \p{Sc=Bass}, \p{Bass}) (36)
5230   \p{Script: Batak}       \p{Script_Extensions=Batak} (Short: \p{Sc=
5231                             Batk}, \p{Batk}) (56)
5232   \p{Script: Batk}        \p{Script=Batak} (= \p{Script_Extensions=
5233                             Batak}) (56)
5234   \p{Script: Beng}        \p{Script=Bengali} (96)
5235   \p{Script: Bengali}     (Short: \p{Sc=Beng}) (96: U+0980..0983,
5236                             U+0985..098C, U+098F..0990,
5237                             U+0993..09A8, U+09AA..09B0, U+09B2 ...)
5238   \p{Script: Bhaiksuki}   \p{Script_Extensions=Bhaiksuki} (Short:
5239                             \p{Sc=Bhks}, \p{Bhks}) (97)
5240   \p{Script: Bhks}        \p{Script=Bhaiksuki} (=
5241                             \p{Script_Extensions=Bhaiksuki}) (97)
5242   \p{Script: Bopo}        \p{Script=Bopomofo} (77)
5243   \p{Script: Bopomofo}    (Short: \p{Sc=Bopo}) (77: U+02EA..02EB,
5244                             U+3105..312F, U+31A0..31BF)
5245   \p{Script: Brah}        \p{Script=Brahmi} (= \p{Script_Extensions=
5246                             Brahmi}) (115)
5247   \p{Script: Brahmi}      \p{Script_Extensions=Brahmi} (Short:
5248                             \p{Sc=Brah}, \p{Brah}) (115)
5249   \p{Script: Brai}        \p{Script=Braille} (=
5250                             \p{Script_Extensions=Braille}) (256)
5251   \p{Script: Braille}     \p{Script_Extensions=Braille} (Short:
5252                             \p{Sc=Brai}, \p{Brai}) (256)
5253   \p{Script: Bugi}        \p{Script=Buginese} (30)
5254   \p{Script: Buginese}    (Short: \p{Sc=Bugi}) (30: U+1A00..1A1B,
5255                             U+1A1E..1A1F)
5256   \p{Script: Buhd}        \p{Script=Buhid} (20)
5257   \p{Script: Buhid}       (Short: \p{Sc=Buhd}) (20: U+1740..1753)
5258   \p{Script: Cakm}        \p{Script=Chakma} (71)
5259   \p{Script: Canadian_Aboriginal} \p{Script_Extensions=
5260                             Canadian_Aboriginal} (Short: \p{Sc=
5261                             Cans}, \p{Cans}) (726)
5262   \p{Script: Cans}        \p{Script=Canadian_Aboriginal} (=
5263                             \p{Script_Extensions=
5264                             Canadian_Aboriginal}) (726)
5265   \p{Script: Cari}        \p{Script=Carian} (= \p{Script_Extensions=
5266                             Carian}) (49)
5267   \p{Script: Carian}      \p{Script_Extensions=Carian} (Short:
5268                             \p{Sc=Cari}, \p{Cari}) (49)
5269   \p{Script: Caucasian_Albanian} \p{Script_Extensions=
5270                             Caucasian_Albanian} (Short: \p{Sc=Aghb},
5271                             \p{Aghb}) (53)
5272   \p{Script: Chakma}      (Short: \p{Sc=Cakm}) (71: U+11100..11134,
5273                             U+11136..11147)
5274   \p{Script: Cham}        \p{Script_Extensions=Cham} (Short: \p{Sc=
5275                             Cham}, \p{Cham}) (83)
5276   \p{Script: Cher}        \p{Script=Cherokee} (=
5277                             \p{Script_Extensions=Cherokee}) (172)
5278   \p{Script: Cherokee}    \p{Script_Extensions=Cherokee} (Short:
5279                             \p{Sc=Cher}, \p{Cher}) (172)
5280   \p{Script: Chorasmian}  \p{Script_Extensions=Chorasmian} (Short:
5281                             \p{Sc=Chrs}, \p{Chrs}) (28)
5282   \p{Script: Chrs}        \p{Script=Chorasmian} (=
5283                             \p{Script_Extensions=Chorasmian}) (28)
5284   \p{Script: Common}      (Short: \p{Sc=Zyyy}) (8301: [\x00-\x20!
5285                             \"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@\[\\\]
5286                             \^_`\{\|\}~\x7f-\xa9\xab-\xb9\xbb-\xbf
5287                             \xd7\xf7], U+02B9..02DF, U+02E5..02E9,
5288                             U+02EC..02FF, U+0374, U+037E ...)
5289   \p{Script: Copt}        \p{Script=Coptic} (137)
5290   \p{Script: Coptic}      (Short: \p{Sc=Copt}) (137: U+03E2..03EF,
5291                             U+2C80..2CF3, U+2CF9..2CFF)
5292   \p{Script: Cpmn}        \p{Script=Cypro_Minoan} (99)
5293   \p{Script: Cprt}        \p{Script=Cypriot} (55)
5294   \p{Script: Cuneiform}   \p{Script_Extensions=Cuneiform} (Short:
5295                             \p{Sc=Xsux}, \p{Xsux}) (1234)
5296   \p{Script: Cypriot}     (Short: \p{Sc=Cprt}) (55: U+10800..10805,
5297                             U+10808, U+1080A..10835, U+10837..10838,
5298                             U+1083C, U+1083F)
5299   \p{Script: Cypro_Minoan} (Short: \p{Sc=Cpmn}) (99: U+12F90..12FF2)
5300   \p{Script: Cyrillic}    (Short: \p{Sc=Cyrl}) (506: U+0400..0484,
5301                             U+0487..052F, U+1C80..1C88, U+1D2B,
5302                             U+1D78, U+2DE0..2DFF ...)
5303   \p{Script: Cyrl}        \p{Script=Cyrillic} (506)
5304   \p{Script: Deseret}     \p{Script_Extensions=Deseret} (Short:
5305                             \p{Sc=Dsrt}, \p{Dsrt}) (80)
5306   \p{Script: Deva}        \p{Script=Devanagari} (164)
5307   \p{Script: Devanagari}  (Short: \p{Sc=Deva}) (164: U+0900..0950,
5308                             U+0955..0963, U+0966..097F,
5309                             U+A8E0..A8FF, U+11B00..11B09)
5310   \p{Script: Diak}        \p{Script=Dives_Akuru} (=
5311                             \p{Script_Extensions=Dives_Akuru}) (72)
5312   \p{Script: Dives_Akuru} \p{Script_Extensions=Dives_Akuru} (Short:
5313                             \p{Sc=Diak}, \p{Diak}) (72)
5314   \p{Script: Dogr}        \p{Script=Dogra} (60)
5315   \p{Script: Dogra}       (Short: \p{Sc=Dogr}) (60: U+11800..1183B)
5316   \p{Script: Dsrt}        \p{Script=Deseret} (=
5317                             \p{Script_Extensions=Deseret}) (80)
5318   \p{Script: Dupl}        \p{Script=Duployan} (143)
5319   \p{Script: Duployan}    (Short: \p{Sc=Dupl}) (143: U+1BC00..1BC6A,
5320                             U+1BC70..1BC7C, U+1BC80..1BC88,
5321                             U+1BC90..1BC99, U+1BC9C..1BC9F)
5322   \p{Script: Egyp}        \p{Script=Egyptian_Hieroglyphs} (=
5323                             \p{Script_Extensions=
5324                             Egyptian_Hieroglyphs}) (1110)
5325   \p{Script: Egyptian_Hieroglyphs} \p{Script_Extensions=
5326                             Egyptian_Hieroglyphs} (Short: \p{Sc=
5327                             Egyp}, \p{Egyp}) (1110)
5328   \p{Script: Elba}        \p{Script=Elbasan} (=
5329                             \p{Script_Extensions=Elbasan}) (40)
5330   \p{Script: Elbasan}     \p{Script_Extensions=Elbasan} (Short:
5331                             \p{Sc=Elba}, \p{Elba}) (40)
5332   \p{Script: Elym}        \p{Script=Elymaic} (=
5333                             \p{Script_Extensions=Elymaic}) (23)
5334   \p{Script: Elymaic}     \p{Script_Extensions=Elymaic} (Short:
5335                             \p{Sc=Elym}, \p{Elym}) (23)
5336   \p{Script: Ethi}        \p{Script=Ethiopic} (=
5337                             \p{Script_Extensions=Ethiopic}) (523)
5338   \p{Script: Ethiopic}    \p{Script_Extensions=Ethiopic} (Short:
5339                             \p{Sc=Ethi}, \p{Ethi}) (523)
5340   \p{Script: Geor}        \p{Script=Georgian} (173)
5341   \p{Script: Georgian}    (Short: \p{Sc=Geor}) (173: U+10A0..10C5,
5342                             U+10C7, U+10CD, U+10D0..10FA,
5343                             U+10FC..10FF, U+1C90..1CBA ...)
5344   \p{Script: Glag}        \p{Script=Glagolitic} (134)
5345   \p{Script: Glagolitic}  (Short: \p{Sc=Glag}) (134: U+2C00..2C5F,
5346                             U+1E000..1E006, U+1E008..1E018,
5347                             U+1E01B..1E021, U+1E023..1E024,
5348                             U+1E026..1E02A)
5349   \p{Script: Gong}        \p{Script=Gunjala_Gondi} (63)
5350   \p{Script: Gonm}        \p{Script=Masaram_Gondi} (75)
5351   \p{Script: Goth}        \p{Script=Gothic} (= \p{Script_Extensions=
5352                             Gothic}) (27)
5353   \p{Script: Gothic}      \p{Script_Extensions=Gothic} (Short:
5354                             \p{Sc=Goth}, \p{Goth}) (27)
5355   \p{Script: Gran}        \p{Script=Grantha} (85)
5356   \p{Script: Grantha}     (Short: \p{Sc=Gran}) (85: U+11300..11303,
5357                             U+11305..1130C, U+1130F..11310,
5358                             U+11313..11328, U+1132A..11330,
5359                             U+11332..11333 ...)
5360   \p{Script: Greek}       (Short: \p{Sc=Grek}) (518: U+0370..0373,
5361                             U+0375..0377, U+037A..037D, U+037F,
5362                             U+0384, U+0386 ...)
5363   \p{Script: Grek}        \p{Script=Greek} (518)
5364   \p{Script: Gujarati}    (Short: \p{Sc=Gujr}) (91: U+0A81..0A83,
5365                             U+0A85..0A8D, U+0A8F..0A91,
5366                             U+0A93..0AA8, U+0AAA..0AB0, U+0AB2..0AB3
5367                             ...)
5368   \p{Script: Gujr}        \p{Script=Gujarati} (91)
5369   \p{Script: Gunjala_Gondi} (Short: \p{Sc=Gong}) (63:
5370                             U+11D60..11D65, U+11D67..11D68,
5371                             U+11D6A..11D8E, U+11D90..11D91,
5372                             U+11D93..11D98, U+11DA0..11DA9)
5373   \p{Script: Gurmukhi}    (Short: \p{Sc=Guru}) (80: U+0A01..0A03,
5374                             U+0A05..0A0A, U+0A0F..0A10,
5375                             U+0A13..0A28, U+0A2A..0A30, U+0A32..0A33
5376                             ...)
5377   \p{Script: Guru}        \p{Script=Gurmukhi} (80)
5378   \p{Script: Han}         (Short: \p{Sc=Han}) (98_408: U+2E80..2E99,
5379                             U+2E9B..2EF3, U+2F00..2FD5, U+3005,
5380                             U+3007, U+3021..3029 ...)
5381   \p{Script: Hang}        \p{Script=Hangul} (11_739)
5382   \p{Script: Hangul}      (Short: \p{Sc=Hang}) (11_739:
5383                             U+1100..11FF, U+302E..302F,
5384                             U+3131..318E, U+3200..321E,
5385                             U+3260..327E, U+A960..A97C ...)
5386   \p{Script: Hani}        \p{Script=Han} (98_408)
5387   \p{Script: Hanifi_Rohingya} (Short: \p{Sc=Rohg}) (50:
5388                             U+10D00..10D27, U+10D30..10D39)
5389   \p{Script: Hano}        \p{Script=Hanunoo} (21)
5390   \p{Script: Hanunoo}     (Short: \p{Sc=Hano}) (21: U+1720..1734)
5391   \p{Script: Hatr}        \p{Script=Hatran} (= \p{Script_Extensions=
5392                             Hatran}) (26)
5393   \p{Script: Hatran}      \p{Script_Extensions=Hatran} (Short:
5394                             \p{Sc=Hatr}, \p{Hatr}) (26)
5395   \p{Script: Hebr}        \p{Script=Hebrew} (= \p{Script_Extensions=
5396                             Hebrew}) (134)
5397   \p{Script: Hebrew}      \p{Script_Extensions=Hebrew} (Short:
5398                             \p{Sc=Hebr}, \p{Hebr}) (134)
5399   \p{Script: Hira}        \p{Script=Hiragana} (381)
5400   \p{Script: Hiragana}    (Short: \p{Sc=Hira}) (381: U+3041..3096,
5401                             U+309D..309F, U+1B001..1B11F, U+1B132,
5402                             U+1B150..1B152, U+1F200)
5403   \p{Script: Hluw}        \p{Script=Anatolian_Hieroglyphs} (=
5404                             \p{Script_Extensions=
5405                             Anatolian_Hieroglyphs}) (583)
5406   \p{Script: Hmng}        \p{Script=Pahawh_Hmong} (=
5407                             \p{Script_Extensions=Pahawh_Hmong}) (127)
5408   \p{Script: Hmnp}        \p{Script=Nyiakeng_Puachue_Hmong} (=
5409                             \p{Script_Extensions=
5410                             Nyiakeng_Puachue_Hmong}) (71)
5411   \p{Script: Hung}        \p{Script=Old_Hungarian} (=
5412                             \p{Script_Extensions=Old_Hungarian})
5413                             (108)
5414   \p{Script: Imperial_Aramaic} \p{Script_Extensions=
5415                             Imperial_Aramaic} (Short: \p{Sc=Armi},
5416                             \p{Armi}) (31)
5417   \p{Script: Inherited}   (Short: \p{Sc=Zinh}) (657: U+0300..036F,
5418                             U+0485..0486, U+064B..0655, U+0670,
5419                             U+0951..0954, U+1AB0..1ACE ...)
5420   \p{Script: Inscriptional_Pahlavi} \p{Script_Extensions=
5421                             Inscriptional_Pahlavi} (Short: \p{Sc=
5422                             Phli}, \p{Phli}) (27)
5423   \p{Script: Inscriptional_Parthian} \p{Script_Extensions=
5424                             Inscriptional_Parthian} (Short: \p{Sc=
5425                             Prti}, \p{Prti}) (30)
5426   \p{Script: Ital}        \p{Script=Old_Italic} (=
5427                             \p{Script_Extensions=Old_Italic}) (39)
5428   \p{Script: Java}        \p{Script=Javanese} (90)
5429   \p{Script: Javanese}    (Short: \p{Sc=Java}) (90: U+A980..A9CD,
5430                             U+A9D0..A9D9, U+A9DE..A9DF)
5431   \p{Script: Kaithi}      (Short: \p{Sc=Kthi}) (68: U+11080..110C2,
5432                             U+110CD)
5433   \p{Script: Kali}        \p{Script=Kayah_Li} (47)
5434   \p{Script: Kana}        \p{Script=Katakana} (321)
5435   \p{Script: Kannada}     (Short: \p{Sc=Knda}) (91: U+0C80..0C8C,
5436                             U+0C8E..0C90, U+0C92..0CA8,
5437                             U+0CAA..0CB3, U+0CB5..0CB9, U+0CBC..0CC4
5438                             ...)
5439   \p{Script: Katakana}    (Short: \p{Sc=Kana}) (321: U+30A1..30FA,
5440                             U+30FD..30FF, U+31F0..31FF,
5441                             U+32D0..32FE, U+3300..3357, U+FF66..FF6F
5442                             ...)
5443   \p{Script: Kawi}        \p{Script_Extensions=Kawi} (Short: \p{Sc=
5444                             Kawi}, \p{Kawi}) (86)
5445   \p{Script: Kayah_Li}    (Short: \p{Sc=Kali}) (47: U+A900..A92D,
5446                             U+A92F)
5447   \p{Script: Khar}        \p{Script=Kharoshthi} (=
5448                             \p{Script_Extensions=Kharoshthi}) (68)
5449   \p{Script: Kharoshthi}  \p{Script_Extensions=Kharoshthi} (Short:
5450                             \p{Sc=Khar}, \p{Khar}) (68)
5451   \p{Script: Khitan_Small_Script} \p{Script_Extensions=
5452                             Khitan_Small_Script} (Short: \p{Sc=
5453                             Kits}, \p{Kits}) (471)
5454   \p{Script: Khmer}       \p{Script_Extensions=Khmer} (Short: \p{Sc=
5455                             Khmr}, \p{Khmr}) (146)
5456   \p{Script: Khmr}        \p{Script=Khmer} (= \p{Script_Extensions=
5457                             Khmer}) (146)
5458   \p{Script: Khoj}        \p{Script=Khojki} (65)
5459   \p{Script: Khojki}      (Short: \p{Sc=Khoj}) (65: U+11200..11211,
5460                             U+11213..11241)
5461   \p{Script: Khudawadi}   (Short: \p{Sc=Sind}) (69: U+112B0..112EA,
5462                             U+112F0..112F9)
5463   \p{Script: Kits}        \p{Script=Khitan_Small_Script} (=
5464                             \p{Script_Extensions=
5465                             Khitan_Small_Script}) (471)
5466   \p{Script: Knda}        \p{Script=Kannada} (91)
5467   \p{Script: Kthi}        \p{Script=Kaithi} (68)
5468   \p{Script: Lana}        \p{Script=Tai_Tham} (=
5469                             \p{Script_Extensions=Tai_Tham}) (127)
5470   \p{Script: Lao}         \p{Script_Extensions=Lao} (Short: \p{Sc=
5471                             Lao}, \p{Lao}) (83)
5472   \p{Script: Laoo}        \p{Script=Lao} (= \p{Script_Extensions=
5473                             Lao}) (83)
5474   \p{Script: Latin}       (Short: \p{Sc=Latn}) (1481: [A-Za-z\xaa
5475                             \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
5476                             U+0100..02B8, U+02E0..02E4,
5477                             U+1D00..1D25, U+1D2C..1D5C, U+1D62..1D65
5478                             ...)
5479   \p{Script: Latn}        \p{Script=Latin} (1481)
5480   \p{Script: Lepc}        \p{Script=Lepcha} (= \p{Script_Extensions=
5481                             Lepcha}) (74)
5482   \p{Script: Lepcha}      \p{Script_Extensions=Lepcha} (Short:
5483                             \p{Sc=Lepc}, \p{Lepc}) (74)
5484   \p{Script: Limb}        \p{Script=Limbu} (68)
5485   \p{Script: Limbu}       (Short: \p{Sc=Limb}) (68: U+1900..191E,
5486                             U+1920..192B, U+1930..193B, U+1940,
5487                             U+1944..194F)
5488   \p{Script: Lina}        \p{Script=Linear_A} (341)
5489   \p{Script: Linb}        \p{Script=Linear_B} (211)
5490   \p{Script: Linear_A}    (Short: \p{Sc=Lina}) (341: U+10600..10736,
5491                             U+10740..10755, U+10760..10767)
5492   \p{Script: Linear_B}    (Short: \p{Sc=Linb}) (211: U+10000..1000B,
5493                             U+1000D..10026, U+10028..1003A,
5494                             U+1003C..1003D, U+1003F..1004D,
5495                             U+10050..1005D ...)
5496   \p{Script: Lisu}        \p{Script_Extensions=Lisu} (Short: \p{Sc=
5497                             Lisu}, \p{Lisu}) (49)
5498   \p{Script: Lyci}        \p{Script=Lycian} (= \p{Script_Extensions=
5499                             Lycian}) (29)
5500   \p{Script: Lycian}      \p{Script_Extensions=Lycian} (Short:
5501                             \p{Sc=Lyci}, \p{Lyci}) (29)
5502   \p{Script: Lydi}        \p{Script=Lydian} (= \p{Script_Extensions=
5503                             Lydian}) (27)
5504   \p{Script: Lydian}      \p{Script_Extensions=Lydian} (Short:
5505                             \p{Sc=Lydi}, \p{Lydi}) (27)
5506   \p{Script: Mahajani}    (Short: \p{Sc=Mahj}) (39: U+11150..11176)
5507   \p{Script: Mahj}        \p{Script=Mahajani} (39)
5508   \p{Script: Maka}        \p{Script=Makasar} (=
5509                             \p{Script_Extensions=Makasar}) (25)
5510   \p{Script: Makasar}     \p{Script_Extensions=Makasar} (Short:
5511                             \p{Sc=Maka}, \p{Maka}) (25)
5512   \p{Script: Malayalam}   (Short: \p{Sc=Mlym}) (118: U+0D00..0D0C,
5513                             U+0D0E..0D10, U+0D12..0D44,
5514                             U+0D46..0D48, U+0D4A..0D4F, U+0D54..0D63
5515                             ...)
5516   \p{Script: Mand}        \p{Script=Mandaic} (29)
5517   \p{Script: Mandaic}     (Short: \p{Sc=Mand}) (29: U+0840..085B,
5518                             U+085E)
5519   \p{Script: Mani}        \p{Script=Manichaean} (51)
5520   \p{Script: Manichaean}  (Short: \p{Sc=Mani}) (51: U+10AC0..10AE6,
5521                             U+10AEB..10AF6)
5522   \p{Script: Marc}        \p{Script=Marchen} (=
5523                             \p{Script_Extensions=Marchen}) (68)
5524   \p{Script: Marchen}     \p{Script_Extensions=Marchen} (Short:
5525                             \p{Sc=Marc}, \p{Marc}) (68)
5526   \p{Script: Masaram_Gondi} (Short: \p{Sc=Gonm}) (75:
5527                             U+11D00..11D06, U+11D08..11D09,
5528                             U+11D0B..11D36, U+11D3A, U+11D3C..11D3D,
5529                             U+11D3F..11D47 ...)
5530   \p{Script: Medefaidrin} \p{Script_Extensions=Medefaidrin} (Short:
5531                             \p{Sc=Medf}, \p{Medf}) (91)
5532   \p{Script: Medf}        \p{Script=Medefaidrin} (=
5533                             \p{Script_Extensions=Medefaidrin}) (91)
5534   \p{Script: Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek}
5535                             (Short: \p{Sc=Mtei}, \p{Mtei}) (79)
5536   \p{Script: Mend}        \p{Script=Mende_Kikakui} (=
5537                             \p{Script_Extensions=Mende_Kikakui})
5538                             (213)
5539   \p{Script: Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui}
5540                             (Short: \p{Sc=Mend}, \p{Mend}) (213)
5541   \p{Script: Merc}        \p{Script=Meroitic_Cursive} (=
5542                             \p{Script_Extensions=Meroitic_Cursive})
5543                             (90)
5544   \p{Script: Mero}        \p{Script=Meroitic_Hieroglyphs} (=
5545                             \p{Script_Extensions=
5546                             Meroitic_Hieroglyphs}) (32)
5547   \p{Script: Meroitic_Cursive} \p{Script_Extensions=
5548                             Meroitic_Cursive} (Short: \p{Sc=Merc},
5549                             \p{Merc}) (90)
5550   \p{Script: Meroitic_Hieroglyphs} \p{Script_Extensions=
5551                             Meroitic_Hieroglyphs} (Short: \p{Sc=
5552                             Mero}, \p{Mero}) (32)
5553   \p{Script: Miao}        \p{Script_Extensions=Miao} (Short: \p{Sc=
5554                             Miao}, \p{Miao}) (149)
5555   \p{Script: Mlym}        \p{Script=Malayalam} (118)
5556   \p{Script: Modi}        (Short: \p{Sc=Modi}) (79: U+11600..11644,
5557                             U+11650..11659)
5558   \p{Script: Mong}        \p{Script=Mongolian} (168)
5559   \p{Script: Mongolian}   (Short: \p{Sc=Mong}) (168: U+1800..1801,
5560                             U+1804, U+1806..1819, U+1820..1878,
5561                             U+1880..18AA, U+11660..1166C)
5562   \p{Script: Mro}         \p{Script_Extensions=Mro} (Short: \p{Sc=
5563                             Mro}, \p{Mro}) (43)
5564   \p{Script: Mroo}        \p{Script=Mro} (= \p{Script_Extensions=
5565                             Mro}) (43)
5566   \p{Script: Mtei}        \p{Script=Meetei_Mayek} (=
5567                             \p{Script_Extensions=Meetei_Mayek}) (79)
5568   \p{Script: Mult}        \p{Script=Multani} (38)
5569   \p{Script: Multani}     (Short: \p{Sc=Mult}) (38: U+11280..11286,
5570                             U+11288, U+1128A..1128D, U+1128F..1129D,
5571                             U+1129F..112A9)
5572   \p{Script: Myanmar}     (Short: \p{Sc=Mymr}) (223: U+1000..109F,
5573                             U+A9E0..A9FE, U+AA60..AA7F)
5574   \p{Script: Mymr}        \p{Script=Myanmar} (223)
5575   \p{Script: Nabataean}   \p{Script_Extensions=Nabataean} (Short:
5576                             \p{Sc=Nbat}, \p{Nbat}) (40)
5577   \p{Script: Nag_Mundari} \p{Script_Extensions=Nag_Mundari} (Short:
5578                             \p{Sc=Nagm}, \p{Nagm}) (42)
5579   \p{Script: Nagm}        \p{Script=Nag_Mundari} (=
5580                             \p{Script_Extensions=Nag_Mundari}) (42)
5581   \p{Script: Nand}        \p{Script=Nandinagari} (65)
5582   \p{Script: Nandinagari} (Short: \p{Sc=Nand}) (65: U+119A0..119A7,
5583                             U+119AA..119D7, U+119DA..119E4)
5584   \p{Script: Narb}        \p{Script=Old_North_Arabian} (=
5585                             \p{Script_Extensions=Old_North_Arabian})
5586                             (32)
5587   \p{Script: Nbat}        \p{Script=Nabataean} (=
5588                             \p{Script_Extensions=Nabataean}) (40)
5589   \p{Script: New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short:
5590                             \p{Sc=Talu}, \p{Talu}) (83)
5591   \p{Script: Newa}        \p{Script_Extensions=Newa} (Short: \p{Sc=
5592                             Newa}, \p{Newa}) (97)
5593   \p{Script: Nko}         (Short: \p{Sc=Nko}) (62: U+07C0..07FA,
5594                             U+07FD..07FF)
5595   \p{Script: Nkoo}        \p{Script=Nko} (62)
5596   \p{Script: Nshu}        \p{Script=Nushu} (= \p{Script_Extensions=
5597                             Nushu}) (397)
5598   \p{Script: Nushu}       \p{Script_Extensions=Nushu} (Short: \p{Sc=
5599                             Nshu}, \p{Nshu}) (397)
5600   \p{Script: Nyiakeng_Puachue_Hmong} \p{Script_Extensions=
5601                             Nyiakeng_Puachue_Hmong} (Short: \p{Sc=
5602                             Hmnp}, \p{Hmnp}) (71)
5603   \p{Script: Ogam}        \p{Script=Ogham} (= \p{Script_Extensions=
5604                             Ogham}) (29)
5605   \p{Script: Ogham}       \p{Script_Extensions=Ogham} (Short: \p{Sc=
5606                             Ogam}, \p{Ogam}) (29)
5607   \p{Script: Ol_Chiki}    \p{Script_Extensions=Ol_Chiki} (Short:
5608                             \p{Sc=Olck}, \p{Olck}) (48)
5609   \p{Script: Olck}        \p{Script=Ol_Chiki} (=
5610                             \p{Script_Extensions=Ol_Chiki}) (48)
5611   \p{Script: Old_Hungarian} \p{Script_Extensions=Old_Hungarian}
5612                             (Short: \p{Sc=Hung}, \p{Hung}) (108)
5613   \p{Script: Old_Italic}  \p{Script_Extensions=Old_Italic} (Short:
5614                             \p{Sc=Ital}, \p{Ital}) (39)
5615   \p{Script: Old_North_Arabian} \p{Script_Extensions=
5616                             Old_North_Arabian} (Short: \p{Sc=Narb},
5617                             \p{Narb}) (32)
5618   \p{Script: Old_Permic}  (Short: \p{Sc=Perm}) (43: U+10350..1037A)
5619   \p{Script: Old_Persian} \p{Script_Extensions=Old_Persian} (Short:
5620                             \p{Sc=Xpeo}, \p{Xpeo}) (50)
5621   \p{Script: Old_Sogdian} \p{Script_Extensions=Old_Sogdian} (Short:
5622                             \p{Sc=Sogo}, \p{Sogo}) (40)
5623   \p{Script: Old_South_Arabian} \p{Script_Extensions=
5624                             Old_South_Arabian} (Short: \p{Sc=Sarb},
5625                             \p{Sarb}) (32)
5626   \p{Script: Old_Turkic}  \p{Script_Extensions=Old_Turkic} (Short:
5627                             \p{Sc=Orkh}, \p{Orkh}) (73)
5628   \p{Script: Old_Uyghur}  (Short: \p{Sc=Ougr}) (26: U+10F70..10F89)
5629   \p{Script: Oriya}       (Short: \p{Sc=Orya}) (91: U+0B01..0B03,
5630                             U+0B05..0B0C, U+0B0F..0B10,
5631                             U+0B13..0B28, U+0B2A..0B30, U+0B32..0B33
5632                             ...)
5633   \p{Script: Orkh}        \p{Script=Old_Turkic} (=
5634                             \p{Script_Extensions=Old_Turkic}) (73)
5635   \p{Script: Orya}        \p{Script=Oriya} (91)
5636   \p{Script: Osage}       \p{Script_Extensions=Osage} (Short: \p{Sc=
5637                             Osge}, \p{Osge}) (72)
5638   \p{Script: Osge}        \p{Script=Osage} (= \p{Script_Extensions=
5639                             Osage}) (72)
5640   \p{Script: Osma}        \p{Script=Osmanya} (=
5641                             \p{Script_Extensions=Osmanya}) (40)
5642   \p{Script: Osmanya}     \p{Script_Extensions=Osmanya} (Short:
5643                             \p{Sc=Osma}, \p{Osma}) (40)
5644   \p{Script: Ougr}        \p{Script=Old_Uyghur} (26)
5645   \p{Script: Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong}
5646                             (Short: \p{Sc=Hmng}, \p{Hmng}) (127)
5647   \p{Script: Palm}        \p{Script=Palmyrene} (=
5648                             \p{Script_Extensions=Palmyrene}) (32)
5649   \p{Script: Palmyrene}   \p{Script_Extensions=Palmyrene} (Short:
5650                             \p{Sc=Palm}, \p{Palm}) (32)
5651   \p{Script: Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short:
5652                             \p{Sc=Pauc}, \p{Pauc}) (57)
5653   \p{Script: Pauc}        \p{Script=Pau_Cin_Hau} (=
5654                             \p{Script_Extensions=Pau_Cin_Hau}) (57)
5655   \p{Script: Perm}        \p{Script=Old_Permic} (43)
5656   \p{Script: Phag}        \p{Script=Phags_Pa} (56)
5657   \p{Script: Phags_Pa}    (Short: \p{Sc=Phag}) (56: U+A840..A877)
5658   \p{Script: Phli}        \p{Script=Inscriptional_Pahlavi} (=
5659                             \p{Script_Extensions=
5660                             Inscriptional_Pahlavi}) (27)
5661   \p{Script: Phlp}        \p{Script=Psalter_Pahlavi} (29)
5662   \p{Script: Phnx}        \p{Script=Phoenician} (=
5663                             \p{Script_Extensions=Phoenician}) (29)
5664   \p{Script: Phoenician}  \p{Script_Extensions=Phoenician} (Short:
5665                             \p{Sc=Phnx}, \p{Phnx}) (29)
5666   \p{Script: Plrd}        \p{Script=Miao} (= \p{Script_Extensions=
5667                             Miao}) (149)
5668   \p{Script: Prti}        \p{Script=Inscriptional_Parthian} (=
5669                             \p{Script_Extensions=
5670                             Inscriptional_Parthian}) (30)
5671   \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29:
5672                             U+10B80..10B91, U+10B99..10B9C,
5673                             U+10BA9..10BAF)
5674   \p{Script: Qaac}        \p{Script=Coptic} (137)
5675   \p{Script: Qaai}        \p{Script=Inherited} (657)
5676   \p{Script: Rejang}      \p{Script_Extensions=Rejang} (Short:
5677                             \p{Sc=Rjng}, \p{Rjng}) (37)
5678   \p{Script: Rjng}        \p{Script=Rejang} (= \p{Script_Extensions=
5679                             Rejang}) (37)
5680   \p{Script: Rohg}        \p{Script=Hanifi_Rohingya} (50)
5681   \p{Script: Runic}       \p{Script_Extensions=Runic} (Short: \p{Sc=
5682                             Runr}, \p{Runr}) (86)
5683   \p{Script: Runr}        \p{Script=Runic} (= \p{Script_Extensions=
5684                             Runic}) (86)
5685   \p{Script: Samaritan}   \p{Script_Extensions=Samaritan} (Short:
5686                             \p{Sc=Samr}, \p{Samr}) (61)
5687   \p{Script: Samr}        \p{Script=Samaritan} (=
5688                             \p{Script_Extensions=Samaritan}) (61)
5689   \p{Script: Sarb}        \p{Script=Old_South_Arabian} (=
5690                             \p{Script_Extensions=Old_South_Arabian})
5691                             (32)
5692   \p{Script: Saur}        \p{Script=Saurashtra} (=
5693                             \p{Script_Extensions=Saurashtra}) (82)
5694   \p{Script: Saurashtra}  \p{Script_Extensions=Saurashtra} (Short:
5695                             \p{Sc=Saur}, \p{Saur}) (82)
5696   \p{Script: Sgnw}        \p{Script=SignWriting} (=
5697                             \p{Script_Extensions=SignWriting}) (672)
5698   \p{Script: Sharada}     (Short: \p{Sc=Shrd}) (96: U+11180..111DF)
5699   \p{Script: Shavian}     \p{Script_Extensions=Shavian} (Short:
5700                             \p{Sc=Shaw}, \p{Shaw}) (48)
5701   \p{Script: Shaw}        \p{Script=Shavian} (=
5702                             \p{Script_Extensions=Shavian}) (48)
5703   \p{Script: Shrd}        \p{Script=Sharada} (96)
5704   \p{Script: Sidd}        \p{Script=Siddham} (=
5705                             \p{Script_Extensions=Siddham}) (92)
5706   \p{Script: Siddham}     \p{Script_Extensions=Siddham} (Short:
5707                             \p{Sc=Sidd}, \p{Sidd}) (92)
5708   \p{Script: SignWriting} \p{Script_Extensions=SignWriting} (Short:
5709                             \p{Sc=Sgnw}, \p{Sgnw}) (672)
5710   \p{Script: Sind}        \p{Script=Khudawadi} (69)
5711   \p{Script: Sinh}        \p{Script=Sinhala} (111)
5712   \p{Script: Sinhala}     (Short: \p{Sc=Sinh}) (111: U+0D81..0D83,
5713                             U+0D85..0D96, U+0D9A..0DB1,
5714                             U+0DB3..0DBB, U+0DBD, U+0DC0..0DC6 ...)
5715   \p{Script: Sogd}        \p{Script=Sogdian} (42)
5716   \p{Script: Sogdian}     (Short: \p{Sc=Sogd}) (42: U+10F30..10F59)
5717   \p{Script: Sogo}        \p{Script=Old_Sogdian} (=
5718                             \p{Script_Extensions=Old_Sogdian}) (40)
5719   \p{Script: Sora}        \p{Script=Sora_Sompeng} (=
5720                             \p{Script_Extensions=Sora_Sompeng}) (35)
5721   \p{Script: Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng}
5722                             (Short: \p{Sc=Sora}, \p{Sora}) (35)
5723   \p{Script: Soyo}        \p{Script=Soyombo} (=
5724                             \p{Script_Extensions=Soyombo}) (83)
5725   \p{Script: Soyombo}     \p{Script_Extensions=Soyombo} (Short:
5726                             \p{Sc=Soyo}, \p{Soyo}) (83)
5727   \p{Script: Sund}        \p{Script=Sundanese} (=
5728                             \p{Script_Extensions=Sundanese}) (72)
5729   \p{Script: Sundanese}   \p{Script_Extensions=Sundanese} (Short:
5730                             \p{Sc=Sund}, \p{Sund}) (72)
5731   \p{Script: Sylo}        \p{Script=Syloti_Nagri} (45)
5732   \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (45: U+A800..A82C)
5733   \p{Script: Syrc}        \p{Script=Syriac} (88)
5734   \p{Script: Syriac}      (Short: \p{Sc=Syrc}) (88: U+0700..070D,
5735                             U+070F..074A, U+074D..074F, U+0860..086A)
5736   \p{Script: Tagalog}     (Short: \p{Sc=Tglg}) (23: U+1700..1715,
5737                             U+171F)
5738   \p{Script: Tagb}        \p{Script=Tagbanwa} (18)
5739   \p{Script: Tagbanwa}    (Short: \p{Sc=Tagb}) (18: U+1760..176C,
5740                             U+176E..1770, U+1772..1773)
5741   \p{Script: Tai_Le}      (Short: \p{Sc=Tale}) (35: U+1950..196D,
5742                             U+1970..1974)
5743   \p{Script: Tai_Tham}    \p{Script_Extensions=Tai_Tham} (Short:
5744                             \p{Sc=Lana}, \p{Lana}) (127)
5745   \p{Script: Tai_Viet}    \p{Script_Extensions=Tai_Viet} (Short:
5746                             \p{Sc=Tavt}, \p{Tavt}) (72)
5747   \p{Script: Takr}        \p{Script=Takri} (68)
5748   \p{Script: Takri}       (Short: \p{Sc=Takr}) (68: U+11680..116B9,
5749                             U+116C0..116C9)
5750   \p{Script: Tale}        \p{Script=Tai_Le} (35)
5751   \p{Script: Talu}        \p{Script=New_Tai_Lue} (=
5752                             \p{Script_Extensions=New_Tai_Lue}) (83)
5753   \p{Script: Tamil}       (Short: \p{Sc=Taml}) (123: U+0B82..0B83,
5754                             U+0B85..0B8A, U+0B8E..0B90,
5755                             U+0B92..0B95, U+0B99..0B9A, U+0B9C ...)
5756   \p{Script: Taml}        \p{Script=Tamil} (123)
5757   \p{Script: Tang}        \p{Script=Tangut} (= \p{Script_Extensions=
5758                             Tangut}) (6914)
5759   \p{Script: Tangsa}      \p{Script_Extensions=Tangsa} (Short:
5760                             \p{Sc=Tnsa}, \p{Tnsa}) (89)
5761   \p{Script: Tangut}      \p{Script_Extensions=Tangut} (Short:
5762                             \p{Sc=Tang}, \p{Tang}) (6914)
5763   \p{Script: Tavt}        \p{Script=Tai_Viet} (=
5764                             \p{Script_Extensions=Tai_Viet}) (72)
5765   \p{Script: Telu}        \p{Script=Telugu} (100)
5766   \p{Script: Telugu}      (Short: \p{Sc=Telu}) (100: U+0C00..0C0C,
5767                             U+0C0E..0C10, U+0C12..0C28,
5768                             U+0C2A..0C39, U+0C3C..0C44, U+0C46..0C48
5769                             ...)
5770   \p{Script: Tfng}        \p{Script=Tifinagh} (=
5771                             \p{Script_Extensions=Tifinagh}) (59)
5772   \p{Script: Tglg}        \p{Script=Tagalog} (23)
5773   \p{Script: Thaa}        \p{Script=Thaana} (50)
5774   \p{Script: Thaana}      (Short: \p{Sc=Thaa}) (50: U+0780..07B1)
5775   \p{Script: Thai}        \p{Script_Extensions=Thai} (Short: \p{Sc=
5776                             Thai}, \p{Thai}) (86)
5777   \p{Script: Tibetan}     \p{Script_Extensions=Tibetan} (Short:
5778                             \p{Sc=Tibt}, \p{Tibt}) (207)
5779   \p{Script: Tibt}        \p{Script=Tibetan} (=
5780                             \p{Script_Extensions=Tibetan}) (207)
5781   \p{Script: Tifinagh}    \p{Script_Extensions=Tifinagh} (Short:
5782                             \p{Sc=Tfng}, \p{Tfng}) (59)
5783   \p{Script: Tirh}        \p{Script=Tirhuta} (82)
5784   \p{Script: Tirhuta}     (Short: \p{Sc=Tirh}) (82: U+11480..114C7,
5785                             U+114D0..114D9)
5786   \p{Script: Tnsa}        \p{Script=Tangsa} (= \p{Script_Extensions=
5787                             Tangsa}) (89)
5788   \p{Script: Toto}        \p{Script_Extensions=Toto} (Short: \p{Sc=
5789                             Toto}, \p{Toto}) (31)
5790   \p{Script: Ugar}        \p{Script=Ugaritic} (=
5791                             \p{Script_Extensions=Ugaritic}) (31)
5792   \p{Script: Ugaritic}    \p{Script_Extensions=Ugaritic} (Short:
5793                             \p{Sc=Ugar}, \p{Ugar}) (31)
5794   \p{Script: Unknown}     \p{Script_Extensions=Unknown} (Short:
5795                             \p{Sc=Zzzz}, \p{Zzzz}) (964_861 plus all
5796                             above-Unicode code points)
5797   \p{Script: Vai}         \p{Script_Extensions=Vai} (Short: \p{Sc=
5798                             Vai}, \p{Vai}) (300)
5799   \p{Script: Vaii}        \p{Script=Vai} (= \p{Script_Extensions=
5800                             Vai}) (300)
5801   \p{Script: Vith}        \p{Script=Vithkuqi} (=
5802                             \p{Script_Extensions=Vithkuqi}) (70)
5803   \p{Script: Vithkuqi}    \p{Script_Extensions=Vithkuqi} (Short:
5804                             \p{Sc=Vith}, \p{Vith}) (70)
5805   \p{Script: Wancho}      \p{Script_Extensions=Wancho} (Short:
5806                             \p{Sc=Wcho}, \p{Wcho}) (59)
5807   \p{Script: Wara}        \p{Script=Warang_Citi} (=
5808                             \p{Script_Extensions=Warang_Citi}) (84)
5809   \p{Script: Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
5810                             \p{Sc=Wara}, \p{Wara}) (84)
5811   \p{Script: Wcho}        \p{Script=Wancho} (= \p{Script_Extensions=
5812                             Wancho}) (59)
5813   \p{Script: Xpeo}        \p{Script=Old_Persian} (=
5814                             \p{Script_Extensions=Old_Persian}) (50)
5815   \p{Script: Xsux}        \p{Script=Cuneiform} (=
5816                             \p{Script_Extensions=Cuneiform}) (1234)
5817   \p{Script: Yezi}        \p{Script=Yezidi} (47)
5818   \p{Script: Yezidi}      (Short: \p{Sc=Yezi}) (47: U+10E80..10EA9,
5819                             U+10EAB..10EAD, U+10EB0..10EB1)
5820   \p{Script: Yi}          (Short: \p{Sc=Yi}) (1220: U+A000..A48C,
5821                             U+A490..A4C6)
5822   \p{Script: Yiii}        \p{Script=Yi} (1220)
5823   \p{Script: Zanabazar_Square} \p{Script_Extensions=
5824                             Zanabazar_Square} (Short: \p{Sc=Zanb},
5825                             \p{Zanb}) (72)
5826   \p{Script: Zanb}        \p{Script=Zanabazar_Square} (=
5827                             \p{Script_Extensions=Zanabazar_Square})
5828                             (72)
5829   \p{Script: Zinh}        \p{Script=Inherited} (657)
5830   \p{Script: Zyyy}        \p{Script=Common} (8301)
5831   \p{Script: Zzzz}        \p{Script=Unknown} (=
5832                             \p{Script_Extensions=Unknown}) (964_861
5833                             plus all above-Unicode code points)
5834   \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (90:
5835                             U+061F, U+0640, U+1E900..1E94B,
5836                             U+1E950..1E959, U+1E95E..1E95F)
5837   \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (90)
5838   \p{Script_Extensions: Aghb} \p{Script_Extensions=
5839                             Caucasian_Albanian} (53)
5840   \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (65:
5841                             U+11700..1171A, U+1171D..1172B,
5842                             U+11730..11746)
5843   \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
5844                             \p{Hluw}) (583: U+14400..14646)
5845   \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1414)
5846   \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
5847                             (1414: U+0600..0604, U+0606..06DC,
5848                             U+06DE..06FF, U+0750..077F,
5849                             U+0870..088E, U+0890..0891 ...)
5850   \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
5851                             (96: U+0531..0556, U+0559..058A,
5852                             U+058D..058F, U+FB13..FB17)
5853   \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
5854                             (31)
5855   \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (96)
5856   \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst})
5857                             (61: U+10B00..10B35, U+10B39..10B3F)
5858   \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
5859   \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (124)
5860   \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
5861                             (124: U+1B00..1B4C, U+1B50..1B7E)
5862   \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
5863   \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657:
5864                             U+A6A0..A6F7, U+16800..16A38)
5865   \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
5866   \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
5867                             (36: U+16AD0..16AED, U+16AF0..16AF5)
5868   \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56:
5869                             U+1BC0..1BF3, U+1BFC..1BFF)
5870   \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
5871   \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (113)
5872   \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng})
5873                             (113: U+0951..0952, U+0964..0965,
5874                             U+0980..0983, U+0985..098C,
5875                             U+098F..0990, U+0993..09A8 ...)
5876   \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
5877                             (97: U+11C00..11C08, U+11C0A..11C36,
5878                             U+11C38..11C45, U+11C50..11C6C)
5879   \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
5880   \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (117)
5881   \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
5882                             (117: U+02EA..02EB, U+3001..3003,
5883                             U+3008..3011, U+3013..301F,
5884                             U+302A..302D, U+3030 ...)
5885   \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (115)
5886   \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah})
5887                             (115: U+11000..1104D, U+11052..11075,
5888                             U+1107F)
5889   \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
5890   \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
5891                             (256: U+2800..28FF)
5892   \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
5893   \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
5894                             (31: U+1A00..1A1B, U+1A1E..1A1F, U+A9CF)
5895   \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
5896   \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22:
5897                             U+1735..1736, U+1740..1753)
5898   \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (91)
5899   \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
5900                             \p{Cans}) (726: U+1400..167F,
5901                             U+18B0..18F5, U+11AB0..11ABF)
5902   \p{Script_Extensions: Cans} \p{Script_Extensions=
5903                             Canadian_Aboriginal} (726)
5904   \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
5905   \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49:
5906                             U+102A0..102D0)
5907   \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
5908                             \p{Aghb}) (53: U+10530..10563, U+1056F)
5909   \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (91:
5910                             U+09E6..09EF, U+1040..1049,
5911                             U+11100..11134, U+11136..11147)
5912   \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83:
5913                             U+AA00..AA36, U+AA40..AA4D,
5914                             U+AA50..AA59, U+AA5C..AA5F)
5915   \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
5916   \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
5917                             (172: U+13A0..13F5, U+13F8..13FD,
5918                             U+AB70..ABBF)
5919   \p{Script_Extensions: Chorasmian} (Short: \p{Scx=Chrs}, \p{Chrs})
5920                             (28: U+10FB0..10FCB)
5921   \p{Script_Extensions: Chrs} \p{Script_Extensions=Chorasmian} (28)
5922   \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
5923                             (7873: [\x00-\x20!\"#\$\%&\'\(\)*+,\-.
5924                             \/0-9:;<=>?\@\[\\\]\^_`\{\|\}~\x7f-\xa9
5925                             \xab-\xb9\xbb-\xbf\xd7\xf7],
5926                             U+02B9..02DF, U+02E5..02E9,
5927                             U+02EC..02FF, U+0374, U+037E ...)
5928   \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
5929   \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt})
5930                             (165: U+03E2..03EF, U+2C80..2CF3,
5931                             U+2CF9..2CFF, U+102E0..102FB)
5932   \p{Script_Extensions: Cpmn} \p{Script_Extensions=Cypro_Minoan}
5933                             (101)
5934   \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
5935   \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
5936                             (1234: U+12000..12399, U+12400..1246E,
5937                             U+12470..12474, U+12480..12543)
5938   \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
5939                             (112: U+10100..10102, U+10107..10133,
5940                             U+10137..1013F, U+10800..10805, U+10808,
5941                             U+1080A..10835 ...)
5942   \p{Script_Extensions: Cypro_Minoan} (Short: \p{Scx=Cpmn},
5943                             \p{Cpmn}) (101: U+10100..10101,
5944                             U+12F90..12FF2)
5945   \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
5946                             (510: U+0400..052F, U+1C80..1C88,
5947                             U+1D2B, U+1D78, U+1DF8, U+2DE0..2DFF ...)
5948   \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (510)
5949   \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt})
5950                             (80: U+10400..1044F)
5951   \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (220)
5952   \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
5953                             (220: U+0900..0952, U+0955..097F,
5954                             U+1CD0..1CF6, U+1CF8..1CF9, U+20F0,
5955                             U+A830..A839 ...)
5956   \p{Script_Extensions: Diak} \p{Script_Extensions=Dives_Akuru} (72)
5957   \p{Script_Extensions: Dives_Akuru} (Short: \p{Scx=Diak}, \p{Diak})
5958                             (72: U+11900..11906, U+11909,
5959                             U+1190C..11913, U+11915..11916,
5960                             U+11918..11935, U+11937..11938 ...)
5961   \p{Script_Extensions: Dogr} \p{Script_Extensions=Dogra} (82)
5962   \p{Script_Extensions: Dogra} (Short: \p{Scx=Dogr}, \p{Dogr}) (82:
5963                             U+0964..096F, U+A830..A839,
5964                             U+11800..1183B)
5965   \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
5966   \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
5967   \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
5968                             (147: U+1BC00..1BC6A, U+1BC70..1BC7C,
5969                             U+1BC80..1BC88, U+1BC90..1BC99,
5970                             U+1BC9C..1BCA3)
5971   \p{Script_Extensions: Egyp} \p{Script_Extensions=
5972                             Egyptian_Hieroglyphs} (1110)
5973   \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
5974                             \p{Egyp}) (1110: U+13000..13455)
5975   \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
5976   \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba})
5977                             (40: U+10500..10527)
5978   \p{Script_Extensions: Elym} \p{Script_Extensions=Elymaic} (23)
5979   \p{Script_Extensions: Elymaic} (Short: \p{Scx=Elym}, \p{Elym})
5980                             (23: U+10FE0..10FF6)
5981   \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (523)
5982   \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
5983                             (523: U+1200..1248, U+124A..124D,
5984                             U+1250..1256, U+1258, U+125A..125D,
5985                             U+1260..1288 ...)
5986   \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (174)
5987   \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
5988                             (174: U+10A0..10C5, U+10C7, U+10CD,
5989                             U+10D0..10FF, U+1C90..1CBA, U+1CBD..1CBF
5990                             ...)
5991   \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (138)
5992   \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
5993                             (138: U+0484, U+0487, U+2C00..2C5F,
5994                             U+2E43, U+A66F, U+1E000..1E006 ...)
5995   \p{Script_Extensions: Gong} \p{Script_Extensions=Gunjala_Gondi}
5996                             (65)
5997   \p{Script_Extensions: Gonm} \p{Script_Extensions=Masaram_Gondi}
5998                             (77)
5999   \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
6000   \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27:
6001                             U+10330..1034A)
6002   \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (116)
6003   \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
6004                             (116: U+0951..0952, U+0964..0965,
6005                             U+0BE6..0BF3, U+1CD0, U+1CD2..1CD3,
6006                             U+1CF2..1CF4 ...)
6007   \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522:
6008                             U+0342, U+0345, U+0370..0373,
6009                             U+0375..0377, U+037A..037D, U+037F ...)
6010   \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
6011   \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
6012                             (105: U+0951..0952, U+0964..0965,
6013                             U+0A81..0A83, U+0A85..0A8D,
6014                             U+0A8F..0A91, U+0A93..0AA8 ...)
6015   \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (105)
6016   \p{Script_Extensions: Gunjala_Gondi} (Short: \p{Scx=Gong},
6017                             \p{Gong}) (65: U+0964..0965,
6018                             U+11D60..11D65, U+11D67..11D68,
6019                             U+11D6A..11D8E, U+11D90..11D91,
6020                             U+11D93..11D98 ...)
6021   \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
6022                             (94: U+0951..0952, U+0964..0965,
6023                             U+0A01..0A03, U+0A05..0A0A,
6024                             U+0A0F..0A10, U+0A13..0A28 ...)
6025   \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (94)
6026   \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (98_696:
6027                             U+2E80..2E99, U+2E9B..2EF3,
6028                             U+2F00..2FD5, U+3001..3003,
6029                             U+3005..3011, U+3013..301F ...)
6030   \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
6031   \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
6032                             (11_775: U+1100..11FF, U+3001..3003,
6033                             U+3008..3011, U+3013..301F,
6034                             U+302E..3030, U+3037 ...)
6035   \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (98_696)
6036   \p{Script_Extensions: Hanifi_Rohingya} (Short: \p{Scx=Rohg},
6037                             \p{Rohg}) (55: U+060C, U+061B, U+061F,
6038                             U+0640, U+06D4, U+10D00..10D27 ...)
6039   \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
6040   \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano})
6041                             (23: U+1720..1736)
6042   \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
6043   \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26:
6044                             U+108E0..108F2, U+108F4..108F5,
6045                             U+108FB..108FF)
6046   \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (134)
6047   \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr})
6048                             (134: U+0591..05C7, U+05D0..05EA,
6049                             U+05EF..05F4, U+FB1D..FB36,
6050                             U+FB38..FB3C, U+FB3E ...)
6051   \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (433)
6052   \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
6053                             (433: U+3001..3003, U+3008..3011,
6054                             U+3013..301F, U+3030..3035, U+3037,
6055                             U+303C..303D ...)
6056   \p{Script_Extensions: Hluw} \p{Script_Extensions=
6057                             Anatolian_Hieroglyphs} (583)
6058   \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
6059                             (127)
6060   \p{Script_Extensions: Hmnp} \p{Script_Extensions=
6061                             Nyiakeng_Puachue_Hmong} (71)
6062   \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
6063                             (108)
6064   \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
6065                             \p{Armi}) (31: U+10840..10855,
6066                             U+10857..1085F)
6067   \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
6068                             (586: U+0300..0341, U+0343..0344,
6069                             U+0346..0362, U+0953..0954,
6070                             U+1AB0..1ACE, U+1DC2..1DF7 ...)
6071   \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
6072                             \p{Phli}) (27: U+10B60..10B72,
6073                             U+10B78..10B7F)
6074   \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
6075                             Prti}, \p{Prti}) (30: U+10B40..10B55,
6076                             U+10B58..10B5F)
6077   \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (39)
6078   \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
6079   \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
6080                             (91: U+A980..A9CD, U+A9CF..A9D9,
6081                             U+A9DE..A9DF)
6082   \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (88:
6083                             U+0966..096F, U+A830..A839,
6084                             U+11080..110C2, U+110CD)
6085   \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
6086   \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (373)
6087   \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
6088                             (106: U+0951..0952, U+0964..0965,
6089                             U+0C80..0C8C, U+0C8E..0C90,
6090                             U+0C92..0CA8, U+0CAA..0CB3 ...)
6091   \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
6092                             (373: U+3001..3003, U+3008..3011,
6093                             U+3013..301F, U+3030..3035, U+3037,
6094                             U+303C..303D ...)
6095   \p{Script_Extensions: Kawi} (Short: \p{Scx=Kawi}, \p{Kawi}) (86:
6096                             U+11F00..11F10, U+11F12..11F3A,
6097                             U+11F3E..11F59)
6098   \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
6099                             (48: U+A900..A92F)
6100   \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (68)
6101   \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
6102                             (68: U+10A00..10A03, U+10A05..10A06,
6103                             U+10A0C..10A13, U+10A15..10A17,
6104                             U+10A19..10A35, U+10A38..10A3A ...)
6105   \p{Script_Extensions: Khitan_Small_Script} (Short: \p{Scx=Kits},
6106                             \p{Kits}) (471: U+16FE4, U+18B00..18CD5)
6107   \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146:
6108                             U+1780..17DD, U+17E0..17E9,
6109                             U+17F0..17F9, U+19E0..19FF)
6110   \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
6111   \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (85)
6112   \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (85:
6113                             U+0AE6..0AEF, U+A830..A839,
6114                             U+11200..11211, U+11213..11241)
6115   \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
6116                             (81: U+0964..0965, U+A830..A839,
6117                             U+112B0..112EA, U+112F0..112F9)
6118   \p{Script_Extensions: Kits} \p{Script_Extensions=
6119                             Khitan_Small_Script} (471)
6120   \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (106)
6121   \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (88)
6122   \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
6123   \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (83:
6124                             U+0E81..0E82, U+0E84, U+0E86..0E8A,
6125                             U+0E8C..0EA3, U+0EA5, U+0EA7..0EBD ...)
6126   \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (83)
6127   \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn})
6128                             (1510: [A-Za-z\xaa\xba\xc0-\xd6\xd8-
6129                             \xf6\xf8-\xff], U+0100..02B8,
6130                             U+02E0..02E4, U+0363..036F,
6131                             U+0485..0486, U+0951..0952 ...)
6132   \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1510)
6133   \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
6134   \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74:
6135                             U+1C00..1C37, U+1C3B..1C49, U+1C4D..1C4F)
6136   \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
6137   \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69:
6138                             U+0965, U+1900..191E, U+1920..192B,
6139                             U+1930..193B, U+1940, U+1944..194F)
6140   \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
6141   \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
6142   \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
6143                             (386: U+10107..10133, U+10600..10736,
6144                             U+10740..10755, U+10760..10767)
6145   \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
6146                             (268: U+10000..1000B, U+1000D..10026,
6147                             U+10028..1003A, U+1003C..1003D,
6148                             U+1003F..1004D, U+10050..1005D ...)
6149   \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (49:
6150                             U+A4D0..A4FF, U+11FB0)
6151   \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
6152   \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29:
6153                             U+10280..1029C)
6154   \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
6155   \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27:
6156                             U+10920..10939, U+1093F)
6157   \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
6158                             (61: U+0964..096F, U+A830..A839,
6159                             U+11150..11176)
6160   \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
6161   \p{Script_Extensions: Maka} \p{Script_Extensions=Makasar} (25)
6162   \p{Script_Extensions: Makasar} (Short: \p{Scx=Maka}, \p{Maka})
6163                             (25: U+11EE0..11EF8)
6164   \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
6165                             (126: U+0951..0952, U+0964..0965,
6166                             U+0D00..0D0C, U+0D0E..0D10,
6167                             U+0D12..0D44, U+0D46..0D48 ...)
6168   \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
6169   \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand})
6170                             (30: U+0640, U+0840..085B, U+085E)
6171   \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
6172   \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
6173                             (52: U+0640, U+10AC0..10AE6,
6174                             U+10AEB..10AF6)
6175   \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
6176   \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc})
6177                             (68: U+11C70..11C8F, U+11C92..11CA7,
6178                             U+11CA9..11CB6)
6179   \p{Script_Extensions: Masaram_Gondi} (Short: \p{Scx=Gonm},
6180                             \p{Gonm}) (77: U+0964..0965,
6181                             U+11D00..11D06, U+11D08..11D09,
6182                             U+11D0B..11D36, U+11D3A, U+11D3C..11D3D
6183                             ...)
6184   \p{Script_Extensions: Medefaidrin} (Short: \p{Scx=Medf}, \p{Medf})
6185                             (91: U+16E40..16E9A)
6186   \p{Script_Extensions: Medf} \p{Script_Extensions=Medefaidrin} (91)
6187   \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
6188                             \p{Mtei}) (79: U+AAE0..AAF6,
6189                             U+ABC0..ABED, U+ABF0..ABF9)
6190   \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
6191                             (213)
6192   \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
6193                             \p{Mend}) (213: U+1E800..1E8C4,
6194                             U+1E8C7..1E8D6)
6195   \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
6196                             (90)
6197   \p{Script_Extensions: Mero} \p{Script_Extensions=
6198                             Meroitic_Hieroglyphs} (32)
6199   \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
6200                             \p{Merc}) (90: U+109A0..109B7,
6201                             U+109BC..109CF, U+109D2..109FF)
6202   \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
6203                             \p{Mero}) (32: U+10980..1099F)
6204   \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (149:
6205                             U+16F00..16F4A, U+16F4F..16F87,
6206                             U+16F8F..16F9F)
6207   \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (126)
6208   \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89:
6209                             U+A830..A839, U+11600..11644,
6210                             U+11650..11659)
6211   \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (172)
6212   \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
6213                             (172: U+1800..1819, U+1820..1878,
6214                             U+1880..18AA, U+202F, U+11660..1166C)
6215   \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43:
6216                             U+16A40..16A5E, U+16A60..16A69,
6217                             U+16A6E..16A6F)
6218   \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
6219   \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
6220   \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
6221   \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult})
6222                             (48: U+0A66..0A6F, U+11280..11286,
6223                             U+11288, U+1128A..1128D, U+1128F..1129D,
6224                             U+1129F..112A9)
6225   \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
6226                             (224: U+1000..109F, U+A92E,
6227                             U+A9E0..A9FE, U+AA60..AA7F)
6228   \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
6229   \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
6230                             (40: U+10880..1089E, U+108A7..108AF)
6231   \p{Script_Extensions: Nag_Mundari} (Short: \p{Scx=Nagm}, \p{Nagm})
6232                             (42: U+1E4D0..1E4F9)
6233   \p{Script_Extensions: Nagm} \p{Script_Extensions=Nag_Mundari} (42)
6234   \p{Script_Extensions: Nand} \p{Script_Extensions=Nandinagari} (86)
6235   \p{Script_Extensions: Nandinagari} (Short: \p{Scx=Nand}, \p{Nand})
6236                             (86: U+0964..0965, U+0CE6..0CEF, U+1CE9,
6237                             U+1CF2, U+1CFA, U+A830..A835 ...)
6238   \p{Script_Extensions: Narb} \p{Script_Extensions=
6239                             Old_North_Arabian} (32)
6240   \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
6241   \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
6242                             (83: U+1980..19AB, U+19B0..19C9,
6243                             U+19D0..19DA, U+19DE..19DF)
6244   \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (97:
6245                             U+11400..1145B, U+1145D..11461)
6246   \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (67:
6247                             U+060C, U+061B, U+061F, U+07C0..07FA,
6248                             U+07FD..07FF, U+FD3E..FD3F)
6249   \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (67)
6250   \p{Script_Extensions: Nshu} \p{Script_Extensions=Nushu} (397)
6251   \p{Script_Extensions: Nushu} (Short: \p{Scx=Nshu}, \p{Nshu}) (397:
6252                             U+16FE1, U+1B170..1B2FB)
6253   \p{Script_Extensions: Nyiakeng_Puachue_Hmong} (Short: \p{Scx=
6254                             Hmnp}, \p{Hmnp}) (71: U+1E100..1E12C,
6255                             U+1E130..1E13D, U+1E140..1E149,
6256                             U+1E14E..1E14F)
6257   \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
6258   \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29:
6259                             U+1680..169C)
6260   \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
6261                             (48: U+1C50..1C7F)
6262   \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
6263   \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
6264                             \p{Hung}) (108: U+10C80..10CB2,
6265                             U+10CC0..10CF2, U+10CFA..10CFF)
6266   \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
6267                             (39: U+10300..10323, U+1032D..1032F)
6268   \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
6269                             \p{Narb}) (32: U+10A80..10A9F)
6270   \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
6271                             (44: U+0483, U+10350..1037A)
6272   \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
6273                             (50: U+103A0..103C3, U+103C8..103D5)
6274   \p{Script_Extensions: Old_Sogdian} (Short: \p{Scx=Sogo}, \p{Sogo})
6275                             (40: U+10F00..10F27)
6276   \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
6277                             \p{Sarb}) (32: U+10A60..10A7F)
6278   \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
6279                             (73: U+10C00..10C48)
6280   \p{Script_Extensions: Old_Uyghur} (Short: \p{Scx=Ougr}, \p{Ougr})
6281                             (28: U+0640, U+10AF2, U+10F70..10F89)
6282   \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (97:
6283                             U+0951..0952, U+0964..0965,
6284                             U+0B01..0B03, U+0B05..0B0C,
6285                             U+0B0F..0B10, U+0B13..0B28 ...)
6286   \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
6287   \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (97)
6288   \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72:
6289                             U+104B0..104D3, U+104D8..104FB)
6290   \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
6291   \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
6292   \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma})
6293                             (40: U+10480..1049D, U+104A0..104A9)
6294   \p{Script_Extensions: Ougr} \p{Script_Extensions=Old_Uyghur} (28)
6295   \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
6296                             \p{Hmng}) (127: U+16B00..16B45,
6297                             U+16B50..16B59, U+16B5B..16B61,
6298                             U+16B63..16B77, U+16B7D..16B8F)
6299   \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
6300   \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
6301                             (32: U+10860..1087F)
6302   \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
6303                             (57: U+11AC0..11AF8)
6304   \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
6305   \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
6306   \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
6307   \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
6308                             (59: U+1802..1803, U+1805, U+A840..A877)
6309   \p{Script_Extensions: Phli} \p{Script_Extensions=
6310                             Inscriptional_Pahlavi} (27)
6311   \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
6312                             (30)
6313   \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
6314   \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
6315                             (29: U+10900..1091B, U+1091F)
6316   \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (149)
6317   \p{Script_Extensions: Prti} \p{Script_Extensions=
6318                             Inscriptional_Parthian} (30)
6319   \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
6320                             \p{Phlp}) (30: U+0640, U+10B80..10B91,
6321                             U+10B99..10B9C, U+10BA9..10BAF)
6322   \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
6323   \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (586)
6324   \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37:
6325                             U+A930..A953, U+A95F)
6326   \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
6327   \p{Script_Extensions: Rohg} \p{Script_Extensions=Hanifi_Rohingya}
6328                             (55)
6329   \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86:
6330                             U+16A0..16EA, U+16EE..16F8)
6331   \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
6332   \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
6333                             (61: U+0800..082D, U+0830..083E)
6334   \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
6335   \p{Script_Extensions: Sarb} \p{Script_Extensions=
6336                             Old_South_Arabian} (32)
6337   \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
6338   \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
6339                             (82: U+A880..A8C5, U+A8CE..A8D9)
6340   \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
6341   \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
6342                             (102: U+0951, U+1CD7, U+1CD9,
6343                             U+1CDC..1CDD, U+1CE0, U+11180..111DF)
6344   \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw})
6345                             (48: U+10450..1047F)
6346   \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
6347   \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (102)
6348   \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
6349   \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd})
6350                             (92: U+11580..115B5, U+115B8..115DD)
6351   \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
6352                             (672: U+1D800..1DA8B, U+1DA9B..1DA9F,
6353                             U+1DAA1..1DAAF)
6354   \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
6355   \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (113)
6356   \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
6357                             (113: U+0964..0965, U+0D81..0D83,
6358                             U+0D85..0D96, U+0D9A..0DB1,
6359                             U+0DB3..0DBB, U+0DBD ...)
6360   \p{Script_Extensions: Sogd} \p{Script_Extensions=Sogdian} (43)
6361   \p{Script_Extensions: Sogdian} (Short: \p{Scx=Sogd}, \p{Sogd})
6362                             (43: U+0640, U+10F30..10F59)
6363   \p{Script_Extensions: Sogo} \p{Script_Extensions=Old_Sogdian} (40)
6364   \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
6365   \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
6366                             \p{Sora}) (35: U+110D0..110E8,
6367                             U+110F0..110F9)
6368   \p{Script_Extensions: Soyo} \p{Script_Extensions=Soyombo} (83)
6369   \p{Script_Extensions: Soyombo} (Short: \p{Scx=Soyo}, \p{Soyo})
6370                             (83: U+11A50..11AA2)
6371   \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
6372   \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
6373                             (72: U+1B80..1BBF, U+1CC0..1CC7)
6374   \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (57)
6375   \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
6376                             \p{Sylo}) (57: U+0964..0965,
6377                             U+09E6..09EF, U+A800..A82C)
6378   \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (107)
6379   \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc})
6380                             (107: U+060C, U+061B..061C, U+061F,
6381                             U+0640, U+064B..0655, U+0670 ...)
6382   \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg})
6383                             (25: U+1700..1715, U+171F, U+1735..1736)
6384   \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
6385   \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
6386                             (20: U+1735..1736, U+1760..176C,
6387                             U+176E..1770, U+1772..1773)
6388   \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45:
6389                             U+1040..1049, U+1950..196D, U+1970..1974)
6390   \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
6391                             (127: U+1A20..1A5E, U+1A60..1A7C,
6392                             U+1A7F..1A89, U+1A90..1A99, U+1AA0..1AAD)
6393   \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
6394                             (72: U+AA80..AAC2, U+AADB..AADF)
6395   \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (80)
6396   \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (80:
6397                             U+0964..0965, U+A830..A839,
6398                             U+11680..116B9, U+116C0..116C9)
6399   \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
6400   \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
6401   \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (133:
6402                             U+0951..0952, U+0964..0965,
6403                             U+0B82..0B83, U+0B85..0B8A,
6404                             U+0B8E..0B90, U+0B92..0B95 ...)
6405   \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (133)
6406   \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6914)
6407   \p{Script_Extensions: Tangsa} (Short: \p{Scx=Tnsa}, \p{Tnsa}) (89:
6408                             U+16A70..16ABE, U+16AC0..16AC9)
6409   \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
6410                             (6914: U+16FE0, U+17000..187F7,
6411                             U+18800..18AFF, U+18D00..18D08)
6412   \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
6413   \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (106)
6414   \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu})
6415                             (106: U+0951..0952, U+0964..0965,
6416                             U+0C00..0C0C, U+0C0E..0C10,
6417                             U+0C12..0C28, U+0C2A..0C39 ...)
6418   \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
6419   \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (25)
6420   \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (66)
6421   \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (66:
6422                             U+060C, U+061B..061C, U+061F,
6423                             U+0660..0669, U+0780..07B1, U+FDF2 ...)
6424   \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86:
6425                             U+0E01..0E3A, U+0E40..0E5B)
6426   \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
6427                             (207: U+0F00..0F47, U+0F49..0F6C,
6428                             U+0F71..0F97, U+0F99..0FBC,
6429                             U+0FBE..0FCC, U+0FCE..0FD4 ...)
6430   \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
6431   \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
6432                             (59: U+2D30..2D67, U+2D6F..2D70, U+2D7F)
6433   \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (97)
6434   \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh})
6435                             (97: U+0951..0952, U+0964..0965, U+1CF2,
6436                             U+A830..A839, U+11480..114C7,
6437                             U+114D0..114D9)
6438   \p{Script_Extensions: Tnsa} \p{Script_Extensions=Tangsa} (89)
6439   \p{Script_Extensions: Toto} (Short: \p{Scx=Toto}, \p{Toto}) (31:
6440                             U+1E290..1E2AE)
6441   \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
6442   \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
6443                             (31: U+10380..1039D, U+1039F)
6444   \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
6445                             (964_861 plus all above-Unicode code
6446                             points: U+0378..0379, U+0380..0383,
6447                             U+038B, U+038D, U+03A2, U+0530 ...)
6448   \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300:
6449                             U+A500..A62B)
6450   \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
6451   \p{Script_Extensions: Vith} \p{Script_Extensions=Vithkuqi} (70)
6452   \p{Script_Extensions: Vithkuqi} (Short: \p{Scx=Vith}, \p{Vith})
6453                             (70: U+10570..1057A, U+1057C..1058A,
6454                             U+1058C..10592, U+10594..10595,
6455                             U+10597..105A1, U+105A3..105B1 ...)
6456   \p{Script_Extensions: Wancho} (Short: \p{Scx=Wcho}, \p{Wcho}) (59:
6457                             U+1E2C0..1E2F9, U+1E2FF)
6458   \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
6459   \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
6460                             (84: U+118A0..118F2, U+118FF)
6461   \p{Script_Extensions: Wcho} \p{Script_Extensions=Wancho} (59)
6462   \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
6463   \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
6464   \p{Script_Extensions: Yezi} \p{Script_Extensions=Yezidi} (60)
6465   \p{Script_Extensions: Yezidi} (Short: \p{Scx=Yezi}, \p{Yezi}) (60:
6466                             U+060C, U+061B, U+061F, U+0660..0669,
6467                             U+10E80..10EA9, U+10EAB..10EAD ...)
6468   \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246:
6469                             U+3001..3002, U+3008..3011,
6470                             U+3014..301B, U+30FB, U+A000..A48C,
6471                             U+A490..A4C6 ...)
6472   \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
6473   \p{Script_Extensions: Zanabazar_Square} (Short: \p{Scx=Zanb},
6474                             \p{Zanb}) (72: U+11A00..11A47)
6475   \p{Script_Extensions: Zanb} \p{Script_Extensions=Zanabazar_Square}
6476                             (72)
6477   \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (586)
6478   \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (7873)
6479   \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (964_861
6480                             plus all above-Unicode code points)
6481   \p{Scx: *}              \p{Script_Extensions: *}
6482   \p{SD}                  \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (50)
6483   \p{SD: *}               \p{Soft_Dotted: *}
6484   \p{Sentence_Break: AT}  \p{Sentence_Break=ATerm} (4)
6485   \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4: [.], U+2024,
6486                             U+FE52, U+FF0E)
6487   \p{Sentence_Break: CL}  \p{Sentence_Break=Close} (195)
6488   \p{Sentence_Break: Close} (Short: \p{SB=CL}) (195: [\"\'\(\)\[\]
6489                             \{\}\xab\xbb], U+0F3A..0F3D,
6490                             U+169B..169C, U+2018..201F,
6491                             U+2039..203A, U+2045..2046 ...)
6492   \p{Sentence_Break: CR}  (Short: \p{SB=CR}) (1: [\r])
6493   \p{Sentence_Break: EX}  \p{Sentence_Break=Extend} (2550)
6494   \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2550: U+0300..036F,
6495                             U+0483..0489, U+0591..05BD, U+05BF,
6496                             U+05C1..05C2, U+05C4..05C5 ...)
6497   \p{Sentence_Break: FO}  \p{Sentence_Break=Format} (72)
6498   \p{Sentence_Break: Format} (Short: \p{SB=FO}) (72: [\xad],
6499                             U+0600..0605, U+061C, U+06DD, U+070F,
6500                             U+0890..0891 ...)
6501   \p{Sentence_Break: LE}  \p{Sentence_Break=OLetter} (132_036)
6502   \p{Sentence_Break: LF}  (Short: \p{SB=LF}) (1: [\n])
6503   \p{Sentence_Break: LO}  \p{Sentence_Break=Lower} (2497)
6504   \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2497: [a-z\xaa\xb5
6505                             \xba\xdf-\xf6\xf8-\xff], U+0101, U+0103,
6506                             U+0105, U+0107, U+0109 ...)
6507   \p{Sentence_Break: NU}  \p{Sentence_Break=Numeric} (682)
6508   \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (682: [0-9],
6509                             U+0660..0669, U+066B..066C,
6510                             U+06F0..06F9, U+07C0..07C9, U+0966..096F
6511                             ...)
6512   \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (132_036: U+01BB,
6513                             U+01C0..01C3, U+0294, U+02B9..02BF,
6514                             U+02C6..02D1, U+02EC ...)
6515   \p{Sentence_Break: Other} (Short: \p{SB=XX}) (973_938 plus all
6516                             above-Unicode code points: [^\t\n\cK\f
6517                             \r\x20!\"\'\(\),\-.0-9:?A-Z\[\]a-z\{\}
6518                             \x85\xa0\xaa-\xab\xad\xb5\xba-\xbb\xc0-
6519                             \xd6\xd8-\xf6\xf8-\xff], U+02C2..02C5,
6520                             U+02D2..02DF, U+02E5..02EB, U+02ED,
6521                             U+02EF..02FF ...)
6522   \p{Sentence_Break: SC}  \p{Sentence_Break=SContinue} (26)
6523   \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26: [,\-:],
6524                             U+055D, U+060C..060D, U+07F8, U+1802,
6525                             U+1808 ...)
6526   \p{Sentence_Break: SE}  \p{Sentence_Break=Sep} (3)
6527   \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3: [\x85],
6528                             U+2028..2029)
6529   \p{Sentence_Break: Sp}  (Short: \p{SB=Sp}) (20: [\t\cK\f\x20\xa0],
6530                             U+1680, U+2000..200A, U+202F, U+205F,
6531                             U+3000)
6532   \p{Sentence_Break: ST}  \p{Sentence_Break=STerm} (151)
6533   \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (151: [!?], U+0589,
6534                             U+061D..061F, U+06D4, U+0700..0702,
6535                             U+07F9 ...)
6536   \p{Sentence_Break: UP}  \p{Sentence_Break=Upper} (1936)
6537   \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1936: [A-Z\xc0-\xd6
6538                             \xd8-\xde], U+0100, U+0102, U+0104,
6539                             U+0106, U+0108 ...)
6540   \p{Sentence_Break: XX}  \p{Sentence_Break=Other} (973_938 plus all
6541                             above-Unicode code points)
6542   \p{Sentence_Terminal}   \p{Sentence_Terminal=Y} (Short: \p{STerm})
6543                             (154)
6544   \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
6545                             (1_113_958 plus all above-Unicode code
6546                             points: [\x00-\x20\"#\$\%&\'\(\)*+,\-
6547                             \/0-9:;<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-
6548                             \xff], U+0100..0588, U+058A..061C,
6549                             U+0620..06D3, U+06D5..06FF, U+0703..07F8
6550                             ...)
6551   \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (154:
6552                             [!.?], U+0589, U+061D..061F, U+06D4,
6553                             U+0700..0702, U+07F9 ...)
6554   \p{Separator}           \p{General_Category=Separator} (Short:
6555                             \p{Z}) (19)
6556   \p{Sgnw}                \p{SignWriting} (= \p{Script_Extensions=
6557                             SignWriting}) (672)
6558   \p{Sharada}             \p{Script_Extensions=Sharada} (Short:
6559                             \p{Shrd}; NOT \p{Block=Sharada}) (102)
6560   \p{Shavian}             \p{Script_Extensions=Shavian} (Short:
6561                             \p{Shaw}) (48)
6562   \p{Shaw}                \p{Shavian} (= \p{Script_Extensions=
6563                             Shavian}) (48)
6564 X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
6565                             (16)
6566   \p{Shrd}                \p{Sharada} (= \p{Script_Extensions=
6567                             Sharada}) (NOT \p{Block=Sharada}) (102)
6568   \p{Sidd}                \p{Siddham} (= \p{Script_Extensions=
6569                             Siddham}) (NOT \p{Block=Siddham}) (92)
6570   \p{Siddham}             \p{Script_Extensions=Siddham} (Short:
6571                             \p{Sidd}; NOT \p{Block=Siddham}) (92)
6572   \p{SignWriting}         \p{Script_Extensions=SignWriting} (Short:
6573                             \p{Sgnw}) (672)
6574   \p{Sind}                \p{Khudawadi} (= \p{Script_Extensions=
6575                             Khudawadi}) (NOT \p{Block=Khudawadi})
6576                             (81)
6577   \p{Sinh}                \p{Sinhala} (= \p{Script_Extensions=
6578                             Sinhala}) (NOT \p{Block=Sinhala}) (113)
6579   \p{Sinhala}             \p{Script_Extensions=Sinhala} (Short:
6580                             \p{Sinh}; NOT \p{Block=Sinhala}) (113)
6581 X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
6582   \p{Sk}                  \p{Modifier_Symbol} (=
6583                             \p{General_Category=Modifier_Symbol})
6584                             (125)
6585   \p{Sm}                  \p{Math_Symbol} (= \p{General_Category=
6586                             Math_Symbol}) (948)
6587 X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
6588                             \p{InSmallForms}) (32)
6589 X \p{Small_Forms}         \p{Small_Form_Variants} (= \p{Block=
6590                             Small_Form_Variants}) (32)
6591 X \p{Small_Kana_Ext}      \p{Small_Kana_Extension} (= \p{Block=
6592                             Small_Kana_Extension}) (64)
6593 X \p{Small_Kana_Extension} \p{Block=Small_Kana_Extension} (Short:
6594                             \p{InSmallKanaExt}) (64)
6595   \p{So}                  \p{Other_Symbol} (= \p{General_Category=
6596                             Other_Symbol}) (6634)
6597   \p{Soft_Dotted}         \p{Soft_Dotted=Y} (Short: \p{SD}) (50)
6598   \p{Soft_Dotted: N*}     (Short: \p{SD=N}, \P{SD}) (1_114_062 plus
6599                             all above-Unicode code points: [\x00-
6600                             \x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=>?\@A-
6601                             Z\[\\\]\^_`a-hk-z\{\|\}~\x7f-\xff],
6602                             U+0100..012E, U+0130..0248,
6603                             U+024A..0267, U+0269..029C, U+029E..02B1
6604                             ...)
6605   \p{Soft_Dotted: Y*}     (Short: \p{SD=Y}, \p{SD}) (50: [i-j],
6606                             U+012F, U+0249, U+0268, U+029D, U+02B2
6607                             ...)
6608   \p{Sogd}                \p{Sogdian} (= \p{Script_Extensions=
6609                             Sogdian}) (NOT \p{Block=Sogdian}) (43)
6610   \p{Sogdian}             \p{Script_Extensions=Sogdian} (Short:
6611                             \p{Sogd}; NOT \p{Block=Sogdian}) (43)
6612   \p{Sogo}                \p{Old_Sogdian} (= \p{Script_Extensions=
6613                             Old_Sogdian}) (NOT \p{Block=
6614                             Old_Sogdian}) (40)
6615   \p{Sora}                \p{Sora_Sompeng} (= \p{Script_Extensions=
6616                             Sora_Sompeng}) (NOT \p{Block=
6617                             Sora_Sompeng}) (35)
6618   \p{Sora_Sompeng}        \p{Script_Extensions=Sora_Sompeng} (Short:
6619                             \p{Sora}; NOT \p{Block=Sora_Sompeng})
6620                             (35)
6621   \p{Soyo}                \p{Soyombo} (= \p{Script_Extensions=
6622                             Soyombo}) (NOT \p{Block=Soyombo}) (83)
6623   \p{Soyombo}             \p{Script_Extensions=Soyombo} (Short:
6624                             \p{Soyo}; NOT \p{Block=Soyombo}) (83)
6625   \p{Space}               \p{White_Space} (= \p{White_Space=Y}) (25)
6626   \p{Space: *}            \p{White_Space: *}
6627   \p{Space_Separator}     \p{General_Category=Space_Separator}
6628                             (Short: \p{Zs}) (17)
6629   \p{SpacePerl}           \p{XPosixSpace} (25)
6630   \p{Spacing_Mark}        \p{General_Category=Spacing_Mark} (Short:
6631                             \p{Mc}) (452)
6632 X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
6633                             (Short: \p{InModifierLetters}) (80)
6634 X \p{Specials}            \p{Block=Specials} (16)
6635   \p{STerm}               \p{Sentence_Terminal} (=
6636                             \p{Sentence_Terminal=Y}) (154)
6637   \p{STerm: *}            \p{Sentence_Terminal: *}
6638   \p{Sund}                \p{Sundanese} (= \p{Script_Extensions=
6639                             Sundanese}) (NOT \p{Block=Sundanese})
6640                             (72)
6641   \p{Sundanese}           \p{Script_Extensions=Sundanese} (Short:
6642                             \p{Sund}; NOT \p{Block=Sundanese}) (72)
6643 X \p{Sundanese_Sup}       \p{Sundanese_Supplement} (= \p{Block=
6644                             Sundanese_Supplement}) (16)
6645 X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
6646                             \p{InSundaneseSup}) (16)
6647 X \p{Sup_Arrows_A}        \p{Supplemental_Arrows_A} (= \p{Block=
6648                             Supplemental_Arrows_A}) (16)
6649 X \p{Sup_Arrows_B}        \p{Supplemental_Arrows_B} (= \p{Block=
6650                             Supplemental_Arrows_B}) (128)
6651 X \p{Sup_Arrows_C}        \p{Supplemental_Arrows_C} (= \p{Block=
6652                             Supplemental_Arrows_C}) (256)
6653 X \p{Sup_Math_Operators}  \p{Supplemental_Mathematical_Operators} (=
6654                             \p{Block=
6655                             Supplemental_Mathematical_Operators})
6656                             (256)
6657 X \p{Sup_PUA_A}           \p{Supplementary_Private_Use_Area_A} (=
6658                             \p{Block=
6659                             Supplementary_Private_Use_Area_A})
6660                             (65_536)
6661 X \p{Sup_PUA_B}           \p{Supplementary_Private_Use_Area_B} (=
6662                             \p{Block=
6663                             Supplementary_Private_Use_Area_B})
6664                             (65_536)
6665 X \p{Sup_Punctuation}     \p{Supplemental_Punctuation} (= \p{Block=
6666                             Supplemental_Punctuation}) (128)
6667 X \p{Sup_Symbols_And_Pictographs}
6668                             \p{Supplemental_Symbols_And_Pictographs}
6669                             (= \p{Block=
6670                             Supplemental_Symbols_And_Pictographs})
6671                             (256)
6672 X \p{Super_And_Sub}       \p{Superscripts_And_Subscripts} (=
6673                             \p{Block=Superscripts_And_Subscripts})
6674                             (48)
6675 X \p{Superscripts_And_Subscripts} \p{Block=
6676                             Superscripts_And_Subscripts} (Short:
6677                             \p{InSuperAndSub}) (48)
6678 X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
6679                             \p{InSupArrowsA}) (16)
6680 X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
6681                             \p{InSupArrowsB}) (128)
6682 X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
6683                             \p{InSupArrowsC}) (256)
6684 X \p{Supplemental_Mathematical_Operators} \p{Block=
6685                             Supplemental_Mathematical_Operators}
6686                             (Short: \p{InSupMathOperators}) (256)
6687 X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
6688                             (Short: \p{InSupPunctuation}) (128)
6689 X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
6690                             Supplemental_Symbols_And_Pictographs}
6691                             (Short: \p{InSupSymbolsAndPictographs})
6692                             (256)
6693 X \p{Supplementary_Private_Use_Area_A} \p{Block=
6694                             Supplementary_Private_Use_Area_A}
6695                             (Short: \p{InSupPUAA}) (65_536)
6696 X \p{Supplementary_Private_Use_Area_B} \p{Block=
6697                             Supplementary_Private_Use_Area_B}
6698                             (Short: \p{InSupPUAB}) (65_536)
6699   \p{Surrogate}           \p{General_Category=Surrogate} (Short:
6700                             \p{Cs}) (2048)
6701 X \p{Sutton_SignWriting}  \p{Block=Sutton_SignWriting} (688)
6702   \p{Sylo}                \p{Syloti_Nagri} (= \p{Script_Extensions=
6703                             Syloti_Nagri}) (NOT \p{Block=
6704                             Syloti_Nagri}) (57)
6705   \p{Syloti_Nagri}        \p{Script_Extensions=Syloti_Nagri} (Short:
6706                             \p{Sylo}; NOT \p{Block=Syloti_Nagri})
6707                             (57)
6708   \p{Symbol}              \p{General_Category=Symbol} (Short: \p{S})
6709                             (7770)
6710 X \p{Symbols_And_Pictographs_Ext_A}
6711                             \p{Symbols_And_Pictographs_Extended_A}
6712                             (= \p{Block=
6713                             Symbols_And_Pictographs_Extended_A})
6714                             (144)
6715 X \p{Symbols_And_Pictographs_Extended_A} \p{Block=
6716                             Symbols_And_Pictographs_Extended_A} (144)
6717 X \p{Symbols_For_Legacy_Computing} \p{Block=
6718                             Symbols_For_Legacy_Computing} (256)
6719   \p{Syrc}                \p{Syriac} (= \p{Script_Extensions=
6720                             Syriac}) (NOT \p{Block=Syriac}) (107)
6721   \p{Syriac}              \p{Script_Extensions=Syriac} (Short:
6722                             \p{Syrc}; NOT \p{Block=Syriac}) (107)
6723 X \p{Syriac_Sup}          \p{Syriac_Supplement} (= \p{Block=
6724                             Syriac_Supplement}) (16)
6725 X \p{Syriac_Supplement}   \p{Block=Syriac_Supplement} (Short:
6726                             \p{InSyriacSup}) (16)
6727   \p{Tagalog}             \p{Script_Extensions=Tagalog} (Short:
6728                             \p{Tglg}; NOT \p{Block=Tagalog}) (25)
6729   \p{Tagb}                \p{Tagbanwa} (= \p{Script_Extensions=
6730                             Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
6731   \p{Tagbanwa}            \p{Script_Extensions=Tagbanwa} (Short:
6732                             \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
6733 X \p{Tags}                \p{Block=Tags} (128)
6734   \p{Tai_Le}              \p{Script_Extensions=Tai_Le} (Short:
6735                             \p{Tale}; NOT \p{Block=Tai_Le}) (45)
6736   \p{Tai_Tham}            \p{Script_Extensions=Tai_Tham} (Short:
6737                             \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
6738   \p{Tai_Viet}            \p{Script_Extensions=Tai_Viet} (Short:
6739                             \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
6740 X \p{Tai_Xuan_Jing}       \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
6741                             Tai_Xuan_Jing_Symbols}) (96)
6742 X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
6743                             \p{InTaiXuanJing}) (96)
6744   \p{Takr}                \p{Takri} (= \p{Script_Extensions=Takri})
6745                             (NOT \p{Block=Takri}) (80)
6746   \p{Takri}               \p{Script_Extensions=Takri} (Short:
6747                             \p{Takr}; NOT \p{Block=Takri}) (80)
6748   \p{Tale}                \p{Tai_Le} (= \p{Script_Extensions=
6749                             Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
6750   \p{Talu}                \p{New_Tai_Lue} (= \p{Script_Extensions=
6751                             New_Tai_Lue}) (NOT \p{Block=
6752                             New_Tai_Lue}) (83)
6753   \p{Tamil}               \p{Script_Extensions=Tamil} (Short:
6754                             \p{Taml}; NOT \p{Block=Tamil}) (133)
6755 X \p{Tamil_Sup}           \p{Tamil_Supplement} (= \p{Block=
6756                             Tamil_Supplement}) (64)
6757 X \p{Tamil_Supplement}    \p{Block=Tamil_Supplement} (Short:
6758                             \p{InTamilSup}) (64)
6759   \p{Taml}                \p{Tamil} (= \p{Script_Extensions=Tamil})
6760                             (NOT \p{Block=Tamil}) (133)
6761   \p{Tang}                \p{Tangut} (= \p{Script_Extensions=
6762                             Tangut}) (NOT \p{Block=Tangut}) (6914)
6763   \p{Tangsa}              \p{Script_Extensions=Tangsa} (Short:
6764                             \p{Tnsa}; NOT \p{Block=Tangsa}) (89)
6765   \p{Tangut}              \p{Script_Extensions=Tangut} (Short:
6766                             \p{Tang}; NOT \p{Block=Tangut}) (6914)
6767 X \p{Tangut_Components}   \p{Block=Tangut_Components} (768)
6768 X \p{Tangut_Sup}          \p{Tangut_Supplement} (= \p{Block=
6769                             Tangut_Supplement}) (128)
6770 X \p{Tangut_Supplement}   \p{Block=Tangut_Supplement} (Short:
6771                             \p{InTangutSup}) (128)
6772   \p{Tavt}                \p{Tai_Viet} (= \p{Script_Extensions=
6773                             Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
6774   \p{Telu}                \p{Telugu} (= \p{Script_Extensions=
6775                             Telugu}) (NOT \p{Block=Telugu}) (106)
6776   \p{Telugu}              \p{Script_Extensions=Telugu} (Short:
6777                             \p{Telu}; NOT \p{Block=Telugu}) (106)
6778   \p{Term}                \p{Terminal_Punctuation} (=
6779                             \p{Terminal_Punctuation=Y}) (278)
6780   \p{Term: *}             \p{Terminal_Punctuation: *}
6781   \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
6782                             \p{Term}) (278)
6783   \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
6784                             (1_113_834 plus all above-Unicode code
6785                             points: [\x00-\x20\"#\$\%&\'\(\)*+\-\/0-
6786                             9<=>\@A-Z\[\\\]\^_`a-z\{\|\}~\x7f-\xff],
6787                             U+0100..037D, U+037F..0386,
6788                             U+0388..0588, U+058A..05C2, U+05C4..060B
6789                             ...)
6790   \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (278:
6791                             [!,.:;?], U+037E, U+0387, U+0589,
6792                             U+05C3, U+060C ...)
6793   \p{Tfng}                \p{Tifinagh} (= \p{Script_Extensions=
6794                             Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
6795   \p{Tglg}                \p{Tagalog} (= \p{Script_Extensions=
6796                             Tagalog}) (NOT \p{Block=Tagalog}) (25)
6797   \p{Thaa}                \p{Thaana} (= \p{Script_Extensions=
6798                             Thaana}) (NOT \p{Block=Thaana}) (66)
6799   \p{Thaana}              \p{Script_Extensions=Thaana} (Short:
6800                             \p{Thaa}; NOT \p{Block=Thaana}) (66)
6801   \p{Thai}                \p{Script_Extensions=Thai} (NOT \p{Block=
6802                             Thai}) (86)
6803   \p{Tibetan}             \p{Script_Extensions=Tibetan} (Short:
6804                             \p{Tibt}; NOT \p{Block=Tibetan}) (207)
6805   \p{Tibt}                \p{Tibetan} (= \p{Script_Extensions=
6806                             Tibetan}) (NOT \p{Block=Tibetan}) (207)
6807   \p{Tifinagh}            \p{Script_Extensions=Tifinagh} (Short:
6808                             \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
6809   \p{Tirh}                \p{Tirhuta} (= \p{Script_Extensions=
6810                             Tirhuta}) (NOT \p{Block=Tirhuta}) (97)
6811   \p{Tirhuta}             \p{Script_Extensions=Tirhuta} (Short:
6812                             \p{Tirh}; NOT \p{Block=Tirhuta}) (97)
6813   \p{Title}               \p{Titlecase} (/i= Cased=Yes) (31)
6814   \p{Titlecase}           (= \p{Gc=Lt}) (Short: \p{Title}; /i=
6815                             Cased=Yes) (31: U+01C5, U+01C8, U+01CB,
6816                             U+01F2, U+1F88..1F8F, U+1F98..1F9F ...)
6817   \p{Titlecase_Letter}    \p{General_Category=Titlecase_Letter}
6818                             (Short: \p{Lt}; /i= General_Category=
6819                             Cased_Letter) (31)
6820   \p{Tnsa}                \p{Tangsa} (= \p{Script_Extensions=
6821                             Tangsa}) (NOT \p{Block=Tangsa}) (89)
6822   \p{Toto}                \p{Script_Extensions=Toto} (NOT \p{Block=
6823                             Toto}) (31)
6824 X \p{Transport_And_Map}   \p{Transport_And_Map_Symbols} (= \p{Block=
6825                             Transport_And_Map_Symbols}) (128)
6826 X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
6827                             (Short: \p{InTransportAndMap}) (128)
6828 X \p{UCAS}                \p{Unified_Canadian_Aboriginal_Syllabics}
6829                             (= \p{Block=
6830                             Unified_Canadian_Aboriginal_Syllabics})
6831                             (640)
6832 X \p{UCAS_Ext}            \p{Unified_Canadian_Aboriginal_Syllabics_-
6833                             Extended} (= \p{Block=
6834                             Unified_Canadian_Aboriginal_Syllabics_-
6835                             Extended}) (80)
6836 X \p{UCAS_Ext_A}          \p{Unified_Canadian_Aboriginal_Syllabics_-
6837                             Extended_A} (= \p{Block=
6838                             Unified_Canadian_Aboriginal_Syllabics_-
6839                             Extended_A}) (16)
6840   \p{Ugar}                \p{Ugaritic} (= \p{Script_Extensions=
6841                             Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
6842   \p{Ugaritic}            \p{Script_Extensions=Ugaritic} (Short:
6843                             \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
6844   \p{UIdeo}               \p{Unified_Ideograph} (=
6845                             \p{Unified_Ideograph=Y}) (97_058)
6846   \p{UIdeo: *}            \p{Unified_Ideograph: *}
6847   \p{Unassigned}          \p{General_Category=Unassigned} (Short:
6848                             \p{Cn}) (825_345 plus all above-Unicode
6849                             code points)
6850   \p{Unicode}             \p{Any} (1_114_112)
6851 X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
6852                             Unified_Canadian_Aboriginal_Syllabics}
6853                             (Short: \p{InUCAS}) (640)
6854 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended} \p{Block=
6855                             Unified_Canadian_Aboriginal_Syllabics_-
6856                             Extended} (Short: \p{InUCASExt}) (80)
6857 X \p{Unified_Canadian_Aboriginal_Syllabics_Extended_A} \p{Block=
6858                             Unified_Canadian_Aboriginal_Syllabics_-
6859                             Extended_A} (Short: \p{InUCASExtA}) (16)
6860   \p{Unified_Ideograph}   \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
6861                             (97_058)
6862   \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
6863                             (1_017_054 plus all above-Unicode code
6864                             points: U+0000..33FF, U+4DC0..4DFF,
6865                             U+A000..FA0D, U+FA10, U+FA12,
6866                             U+FA15..FA1E ...)
6867   \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (97_058:
6868                             U+3400..4DBF, U+4E00..9FFF,
6869                             U+FA0E..FA0F, U+FA11, U+FA13..FA14,
6870                             U+FA1F ...)
6871   \p{Unknown}             \p{Script_Extensions=Unknown} (Short:
6872                             \p{Zzzz}) (964_861 plus all above-
6873                             Unicode code points)
6874   \p{Upper}               \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6875                             Cased=Yes) (1951)
6876   \p{Upper: *}            \p{Uppercase: *}
6877   \p{Uppercase}           \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
6878                             Cased=Yes) (1951)
6879   \p{Uppercase: N*}       (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
6880                             No) (1_112_161 plus all above-Unicode
6881                             code points: [\x00-\x20!\"#\$\%&\'
6882                             \(\)*+,\-.\/0-9:;<=>?\@\[\\\]\^_`a-z\{
6883                             \|\}~\x7f-\xbf\xd7\xdf-\xff], U+0101,
6884                             U+0103, U+0105, U+0107, U+0109 ...)
6885   \p{Uppercase: Y*}       (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
6886                             Yes) (1951: [A-Z\xc0-\xd6\xd8-\xde],
6887                             U+0100, U+0102, U+0104, U+0106, U+0108
6888                             ...)
6889   \p{Uppercase_Letter}    \p{General_Category=Uppercase_Letter}
6890                             (Short: \p{Lu}; /i= General_Category=
6891                             Cased_Letter) (1831)
6892   \p{Vai}                 \p{Script_Extensions=Vai} (NOT \p{Block=
6893                             Vai}) (300)
6894   \p{Vaii}                \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
6895                             \p{Block=Vai}) (300)
6896   \p{Variation_Selector}  \p{Variation_Selector=Y} (Short: \p{VS};
6897                             NOT \p{Variation_Selectors}) (260)
6898   \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_852
6899                             plus all above-Unicode code points:
6900                             U+0000..180A, U+180E, U+1810..FDFF,
6901                             U+FE10..E00FF, U+E01F0..infinity)
6902   \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (260:
6903                             U+180B..180D, U+180F, U+FE00..FE0F,
6904                             U+E0100..E01EF)
6905 X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
6906                             \p{InVS}) (16)
6907 X \p{Variation_Selectors_Supplement} \p{Block=
6908                             Variation_Selectors_Supplement} (Short:
6909                             \p{InVSSup}) (240)
6910 X \p{Vedic_Ext}           \p{Vedic_Extensions} (= \p{Block=
6911                             Vedic_Extensions}) (48)
6912 X \p{Vedic_Extensions}    \p{Block=Vedic_Extensions} (Short:
6913                             \p{InVedicExt}) (48)
6914 X \p{Vertical_Forms}      \p{Block=Vertical_Forms} (16)
6915   \p{Vertical_Orientation: R} \p{Vertical_Orientation=Rotated}
6916                             (786_609 plus all above-Unicode code
6917                             points)
6918   \p{Vertical_Orientation: Rotated} (Short: \p{Vo=R}) (786_609 plus
6919                             all above-Unicode code points: [\x00-
6920                             \xa6\xa8\xaa-\xad\xaf-\xb0\xb2-\xbb\xbf-
6921                             \xd6\xd8-\xf6\xf8-\xff], U+0100..02E9,
6922                             U+02EC..10FF, U+1200..1400,
6923                             U+1680..18AF, U+1900..2015 ...)
6924   \p{Vertical_Orientation: Tr} \p{Vertical_Orientation=
6925                             Transformed_Rotated} (47)
6926   \p{Vertical_Orientation: Transformed_Rotated} (Short: \p{Vo=Tr})
6927                             (47: U+2329..232A, U+3008..3011,
6928                             U+3014..301F, U+3030, U+30A0, U+30FC ...)
6929   \p{Vertical_Orientation: Transformed_Upright} (Short: \p{Vo=Tu})
6930                             (148: U+3001..3002, U+3041, U+3043,
6931                             U+3045, U+3047, U+3049 ...)
6932   \p{Vertical_Orientation: Tu} \p{Vertical_Orientation=
6933                             Transformed_Upright} (148)
6934   \p{Vertical_Orientation: U} \p{Vertical_Orientation=Upright}
6935                             (327_308)
6936   \p{Vertical_Orientation: Upright} (Short: \p{Vo=U}) (327_308:
6937                             [\xa7\xa9\xae\xb1\xbc-\xbe\xd7\xf7],
6938                             U+02EA..02EB, U+1100..11FF,
6939                             U+1401..167F, U+18B0..18FF, U+2016 ...)
6940   \p{VertSpace}           \v (7: [\n\cK\f\r\x85], U+2028..2029)
6941   \p{Vith}                \p{Vithkuqi} (= \p{Script_Extensions=
6942                             Vithkuqi}) (NOT \p{Block=Vithkuqi}) (70)
6943   \p{Vithkuqi}            \p{Script_Extensions=Vithkuqi} (Short:
6944                             \p{Vith}; NOT \p{Block=Vithkuqi}) (70)
6945   \p{Vo: *}               \p{Vertical_Orientation: *}
6946   \p{VS}                  \p{Variation_Selector} (=
6947                             \p{Variation_Selector=Y}) (NOT
6948                             \p{Variation_Selectors}) (260)
6949   \p{VS: *}               \p{Variation_Selector: *}
6950 X \p{VS_Sup}              \p{Variation_Selectors_Supplement} (=
6951                             \p{Block=
6952                             Variation_Selectors_Supplement}) (240)
6953   \p{Wancho}              \p{Script_Extensions=Wancho} (Short:
6954                             \p{Wcho}; NOT \p{Block=Wancho}) (59)
6955   \p{Wara}                \p{Warang_Citi} (= \p{Script_Extensions=
6956                             Warang_Citi}) (NOT \p{Block=
6957                             Warang_Citi}) (84)
6958   \p{Warang_Citi}         \p{Script_Extensions=Warang_Citi} (Short:
6959                             \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
6960   \p{WB: *}               \p{Word_Break: *}
6961   \p{Wcho}                \p{Wancho} (= \p{Script_Extensions=
6962                             Wancho}) (NOT \p{Block=Wancho}) (59)
6963   \p{White_Space}         \p{White_Space=Y} (Short: \p{Space}) (25)
6964   \p{White_Space: N*}     (Short: \p{Space=N}, \P{Space}) (1_114_087
6965                             plus all above-Unicode code points: [^
6966                             \t\n\cK\f\r\x20\x85\xa0], U+0100..167F,
6967                             U+1681..1FFF, U+200B..2027,
6968                             U+202A..202E, U+2030..205E ...)
6969   \p{White_Space: Y*}     (Short: \p{Space=Y}, \p{Space}) (25: [\t
6970                             \n\cK\f\r\x20\x85\xa0], U+1680,
6971                             U+2000..200A, U+2028..2029, U+202F,
6972                             U+205F ...)
6973   \p{Word}                \p{XPosixWord} (139_612)
6974   \p{Word_Break: ALetter} (Short: \p{WB=LE}) (29_489: [A-Za-z\xaa
6975                             \xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
6976                             U+0100..02D7, U+02DE..02FF,
6977                             U+0370..0374, U+0376..0377, U+037A..037D
6978                             ...)
6979   \p{Word_Break: CR}      (Short: \p{WB=CR}) (1: [\r])
6980   \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1: [\"])
6981   \p{Word_Break: DQ}      \p{Word_Break=Double_Quote} (1)
6982   \p{Word_Break: E_Base}  (Short: \p{WB=EB}) (0)
6983   \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (0)
6984   \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (0)
6985   \p{Word_Break: EB}      \p{Word_Break=E_Base} (0)
6986   \p{Word_Break: EBG}     \p{Word_Break=E_Base_GAZ} (0)
6987   \p{Word_Break: EM}      \p{Word_Break=E_Modifier} (0)
6988   \p{Word_Break: EX}      \p{Word_Break=ExtendNumLet} (11)
6989   \p{Word_Break: Extend}  (Short: \p{WB=Extend}) (2554:
6990                             U+0300..036F, U+0483..0489,
6991                             U+0591..05BD, U+05BF, U+05C1..05C2,
6992                             U+05C4..05C5 ...)
6993   \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11: [_], U+202F,
6994                             U+203F..2040, U+2054, U+FE33..FE34,
6995                             U+FE4D..FE4F ...)
6996   \p{Word_Break: FO}      \p{Word_Break=Format} (71)
6997   \p{Word_Break: Format}  (Short: \p{WB=FO}) (71: [\xad],
6998                             U+0600..0605, U+061C, U+06DD, U+070F,
6999                             U+0890..0891 ...)
7000   \p{Word_Break: GAZ}     \p{Word_Break=Glue_After_Zwj} (0)
7001   \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (0)
7002   \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (75:
7003                             U+05D0..05EA, U+05EF..05F2, U+FB1D,
7004                             U+FB1F..FB28, U+FB2A..FB36, U+FB38..FB3C
7005                             ...)
7006   \p{Word_Break: HL}      \p{Word_Break=Hebrew_Letter} (75)
7007   \p{Word_Break: KA}      \p{Word_Break=Katakana} (331)
7008   \p{Word_Break: Katakana} (Short: \p{WB=KA}) (331: U+3031..3035,
7009                             U+309B..309C, U+30A0..30FA,
7010                             U+30FC..30FF, U+31F0..31FF, U+32D0..32FE
7011                             ...)
7012   \p{Word_Break: LE}      \p{Word_Break=ALetter} (29_489)
7013   \p{Word_Break: LF}      (Short: \p{WB=LF}) (1: [\n])
7014   \p{Word_Break: MB}      \p{Word_Break=MidNumLet} (7)
7015   \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9: [:\xb7], U+0387,
7016                             U+055F, U+05F4, U+2027, U+FE13 ...)
7017   \p{Word_Break: MidNum}  (Short: \p{WB=MN}) (15: [,;], U+037E,
7018                             U+0589, U+060C..060D, U+066C, U+07F8 ...)
7019   \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7: [.],
7020                             U+2018..2019, U+2024, U+FE52, U+FF07,
7021                             U+FF0E)
7022   \p{Word_Break: ML}      \p{Word_Break=MidLetter} (9)
7023   \p{Word_Break: MN}      \p{Word_Break=MidNum} (15)
7024   \p{Word_Break: Newline} (Short: \p{WB=NL}) (5: [\cK\f\x85],
7025                             U+2028..2029)
7026   \p{Word_Break: NL}      \p{Word_Break=Newline} (5)
7027   \p{Word_Break: NU}      \p{Word_Break=Numeric} (681)
7028   \p{Word_Break: Numeric} (Short: \p{WB=NU}) (681: [0-9],
7029                             U+0660..0669, U+066B, U+06F0..06F9,
7030                             U+07C0..07C9, U+0966..096F ...)
7031   \p{Word_Break: Other}   (Short: \p{WB=XX}) (1_080_819 plus all
7032                             above-Unicode code points: [^\n\cK\f\r
7033                             \x20\"\',.0-9:;A-Z_a-z\x85\xaa\xad\xb5
7034                             \xb7\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7035                             U+02D8..02DD, U+0375, U+0378..0379,
7036                             U+0380..0385, U+038B ...)
7037   \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26:
7038                             U+1F1E6..1F1FF)
7039   \p{Word_Break: RI}      \p{Word_Break=Regional_Indicator} (26)
7040   \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1: [\'])
7041   \p{Word_Break: SQ}      \p{Word_Break=Single_Quote} (1)
7042   \p{Word_Break: WSegSpace} (Short: \p{WB=WSegSpace}) (14: [\x20],
7043                             U+1680, U+2000..2006, U+2008..200A,
7044                             U+205F, U+3000)
7045   \p{Word_Break: XX}      \p{Word_Break=Other} (1_080_819 plus all
7046                             above-Unicode code points)
7047   \p{Word_Break: ZWJ}     (Short: \p{WB=ZWJ}) (1: U+200D)
7048   \p{WSpace}              \p{White_Space} (= \p{White_Space=Y}) (25)
7049   \p{WSpace: *}           \p{White_Space: *}
7050   \p{XDigit}              \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
7051   \p{XID_Continue}        \p{XID_Continue=Y} (Short: \p{XIDC})
7052                             (139_463)
7053   \p{XID_Continue: N*}    (Short: \p{XIDC=N}, \P{XIDC}) (974_649
7054                             plus all above-Unicode code points:
7055                             [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/:;<=>?
7056                             \@\[\\\]\^`\{\|\}~\x7f-\xa9\xab-\xb4
7057                             \xb6\xb8-\xb9\xbb-\xbf\xd7\xf7],
7058                             U+02C2..02C5, U+02D2..02DF,
7059                             U+02E5..02EB, U+02ED, U+02EF..02FF ...)
7060   \p{XID_Continue: Y*}    (Short: \p{XIDC=Y}, \p{XIDC}) (139_463:
7061                             [0-9A-Z_a-z\xaa\xb5\xb7\xba\xc0-\xd6
7062                             \xd8-\xf6\xf8-\xff], U+0100..02C1,
7063                             U+02C6..02D1, U+02E0..02E4, U+02EC,
7064                             U+02EE ...)
7065   \p{XID_Start}           \p{XID_Start=Y} (Short: \p{XIDS}) (136_322)
7066   \p{XID_Start: N*}       (Short: \p{XIDS=N}, \P{XIDS}) (977_790
7067                             plus all above-Unicode code points:
7068                             [\x00-\x20!\"#\$\%&\'\(\)*+,\-.\/0-9:;<=
7069                             >?\@\[\\\]\^_`\{\|\}~\x7f-\xa9\xab-\xb4
7070                             \xb6-\xb9\xbb-\xbf\xd7\xf7],
7071                             U+02C2..02C5, U+02D2..02DF,
7072                             U+02E5..02EB, U+02ED, U+02EF..036F ...)
7073   \p{XID_Start: Y*}       (Short: \p{XIDS=Y}, \p{XIDS}) (136_322:
7074                             [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6
7075                             \xf8-\xff], U+0100..02C1, U+02C6..02D1,
7076                             U+02E0..02E4, U+02EC, U+02EE ...)
7077   \p{XIDC}                \p{XID_Continue} (= \p{XID_Continue=Y})
7078                             (139_463)
7079   \p{XIDC: *}             \p{XID_Continue: *}
7080   \p{XIDS}                \p{XID_Start} (= \p{XID_Start=Y}) (136_322)
7081   \p{XIDS: *}             \p{XID_Start: *}
7082   \p{Xpeo}                \p{Old_Persian} (= \p{Script_Extensions=
7083                             Old_Persian}) (NOT \p{Block=
7084                             Old_Persian}) (50)
7085   \p{XPerlSpace}          \p{XPosixSpace} (25)
7086   \p{XPosixAlnum}         Alphabetic and (decimal) Numeric (Short:
7087                             \p{Alnum}) (138_445: [0-9A-Za-z\xaa\xb5
7088                             \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7089                             U+0100..02C1, U+02C6..02D1,
7090                             U+02E0..02E4, U+02EC, U+02EE ...)
7091   \p{XPosixAlpha}         \p{Alphabetic=Y} (Short: \p{Alpha})
7092                             (137_765)
7093   \p{XPosixBlank}         \h, Horizontal white space (Short:
7094                             \p{Blank}) (18: [\t\x20\xa0], U+1680,
7095                             U+2000..200A, U+202F, U+205F, U+3000)
7096   \p{XPosixCntrl}         \p{General_Category=Control} Control
7097                             characters (Short: \p{Cc}) (65)
7098   \p{XPosixDigit}         \p{General_Category=Decimal_Number} [0-9]
7099                             + all other decimal digits (Short:
7100                             \p{Nd}) (680)
7101   \p{XPosixGraph}         Characters that are graphical (Short:
7102                             \p{Graph}) (286_635: [!\"#\$\%&\'
7103                             \(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]\^_`a-z
7104                             \{\|\}~\xa1-\xff], U+0100..0377,
7105                             U+037A..037F, U+0384..038A, U+038C,
7106                             U+038E..03A1 ...)
7107   \p{XPosixLower}         \p{Lowercase=Y} (Short: \p{Lower}; /i=
7108                             Cased=Yes) (2544)
7109   \p{XPosixPrint}         Characters that are graphical plus space
7110                             characters (but no controls) (Short:
7111                             \p{Print}) (286_652: [\x20-\x7e\xa0-
7112                             \xff], U+0100..0377, U+037A..037F,
7113                             U+0384..038A, U+038C, U+038E..03A1 ...)
7114   \p{XPosixPunct}         \p{Punct} + ASCII-range \p{Symbol} (851:
7115                             [!\"#\$\%&\'\(\)*+,\-.\/:;<=>?\@\[\\\]
7116                             \^_`\{\|\}~\xa1\xa7\xab\xb6-\xb7\xbb
7117                             \xbf], U+037E, U+0387, U+055A..055F,
7118                             U+0589..058A, U+05BE ...)
7119   \p{XPosixSpace}         \s including beyond ASCII and vertical tab
7120                             (Short: \p{SpacePerl}) (25: [\t\n\cK\f
7121                             \r\x20\x85\xa0], U+1680, U+2000..200A,
7122                             U+2028..2029, U+202F, U+205F ...)
7123   \p{XPosixUpper}         \p{Uppercase=Y} (Short: \p{Upper}; /i=
7124                             Cased=Yes) (1951)
7125   \p{XPosixWord}          \w, including beyond ASCII; = \p{Alnum} +
7126                             \pM + \p{Pc} + \p{Join_Control} (Short:
7127                             \p{Word}) (139_612: [0-9A-Z_a-z\xaa\xb5
7128                             \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7129                             U+0100..02C1, U+02C6..02D1,
7130                             U+02E0..02E4, U+02EC, U+02EE ...)
7131   \p{XPosixXDigit}        \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
7132   \p{Xsux}                \p{Cuneiform} (= \p{Script_Extensions=
7133                             Cuneiform}) (NOT \p{Block=Cuneiform})
7134                             (1234)
7135   \p{Yezi}                \p{Yezidi} (= \p{Script_Extensions=
7136                             Yezidi}) (NOT \p{Block=Yezidi}) (60)
7137   \p{Yezidi}              \p{Script_Extensions=Yezidi} (Short:
7138                             \p{Yezi}; NOT \p{Block=Yezidi}) (60)
7139   \p{Yi}                  \p{Script_Extensions=Yi} (1246)
7140 X \p{Yi_Radicals}         \p{Block=Yi_Radicals} (64)
7141 X \p{Yi_Syllables}        \p{Block=Yi_Syllables} (1168)
7142   \p{Yiii}                \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
7143 X \p{Yijing}              \p{Yijing_Hexagram_Symbols} (= \p{Block=
7144                             Yijing_Hexagram_Symbols}) (64)
7145 X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols}
7146                             (Short: \p{InYijing}) (64)
7147   \p{Z} \pZ               \p{Separator} (= \p{General_Category=
7148                             Separator}) (19)
7149   \p{Zanabazar_Square}    \p{Script_Extensions=Zanabazar_Square}
7150                             (Short: \p{Zanb}; NOT \p{Block=
7151                             Zanabazar_Square}) (72)
7152   \p{Zanb}                \p{Zanabazar_Square} (=
7153                             \p{Script_Extensions=Zanabazar_Square})
7154                             (NOT \p{Block=Zanabazar_Square}) (72)
7155   \p{Zinh}                \p{Inherited} (= \p{Script_Extensions=
7156                             Inherited}) (586)
7157   \p{Zl}                  \p{Line_Separator} (= \p{General_Category=
7158                             Line_Separator}) (1)
7159 X \p{Znamenny_Music}      \p{Znamenny_Musical_Notation} (= \p{Block=
7160                             Znamenny_Musical_Notation}) (208)
7161 X \p{Znamenny_Musical_Notation} \p{Block=Znamenny_Musical_Notation}
7162                             (Short: \p{InZnamennyMusic}) (208)
7163   \p{Zp}                  \p{Paragraph_Separator} (=
7164                             \p{General_Category=
7165                             Paragraph_Separator}) (1)
7166   \p{Zs}                  \p{Space_Separator} (=
7167                             \p{General_Category=Space_Separator})
7168                             (17)
7169   \p{Zyyy}                \p{Common} (= \p{Script_Extensions=
7170                             Common}) (7873)
7171   \p{Zzzz}                \p{Unknown} (= \p{Script_Extensions=
7172                             Unknown}) (964_861 plus all above-
7173                             Unicode code points)
7174
7175
7176
7177=head2 Legal C<\p{}> and C<\P{}> constructs that match no characters
7178
7179Unicode has some property-value pairs that currently don't match anything.
7180This happens generally either because they are obsolete, or they exist for
7181symmetry with other forms, but no language has yet been encoded that uses
7182them.  In this version of Unicode, the following match zero code points:
7183
7184=over 4
7185
7186=item \p{Canonical_Combining_Class=Attached_Below_Left}
7187
7188=item \p{Canonical_Combining_Class=CCC133}
7189
7190=item \p{Grapheme_Cluster_Break=E_Base}
7191
7192=item \p{Grapheme_Cluster_Break=E_Base_GAZ}
7193
7194=item \p{Grapheme_Cluster_Break=E_Modifier}
7195
7196=item \p{Grapheme_Cluster_Break=Glue_After_Zwj}
7197
7198=item \p{Word_Break=E_Base}
7199
7200=item \p{Word_Break=E_Base_GAZ}
7201
7202=item \p{Word_Break=E_Modifier}
7203
7204=item \p{Word_Break=Glue_After_Zwj}
7205
7206=back
7207
7208
7209
7210=head1 Properties accessible through Unicode::UCD
7211
7212The value of any Unicode (not including Perl extensions) character
7213property mentioned above for any single code point is available through
7214L<Unicode::UCD/charprop()>.  L<Unicode::UCD/charprops_all()> returns the
7215values of all the Unicode properties for a given code point.
7216
7217Besides these, all the Unicode character properties mentioned above
7218(except for those marked as for internal use by Perl) are also
7219accessible by L<Unicode::UCD/prop_invlist()>.
7220
7221Due to their nature, not all Unicode character properties are suitable for
7222regular expression matches, nor C<prop_invlist()>.  The remaining
7223non-provisional, non-internal ones are accessible via
7224L<Unicode::UCD/prop_invmap()> (except for those that this Perl installation
7225hasn't included; see L<below for which those are|/Unicode character properties
7226that are NOT accepted by Perl>).
7227
7228For compatibility with other parts of Perl, all the single forms given in the
7229table in the L<section above|/Properties accessible through \p{} and \P{}>
7230are recognized.  BUT, there are some ambiguities between some Perl extensions
7231and the Unicode properties, all of which are silently resolved in favor of the
7232official Unicode property.  To avoid surprises, you should only use
7233C<prop_invmap()> for forms listed in the table below, which omits the
7234non-recommended ones.  The affected forms are the Perl single form equivalents
7235of Unicode properties, such as C<\p{sc}> being a single-form equivalent of
7236C<\p{gc=sc}>, which is treated by C<prop_invmap()> as the C<Script> property,
7237whose short name is C<sc>.  The table indicates the current ambiguities in the
7238INFO column, beginning with the word C<"NOT">.
7239
7240The standard Unicode properties listed below are documented in
7241L<http://www.unicode.org/reports/tr44/>; Perl_Decimal_Digit is documented in
7242L<Unicode::UCD/prop_invmap()>.  The other Perl extensions are in
7243L<perlunicode/Other Properties>;
7244
7245The first column in the table is a name for the property; the second column is
7246an alternative name, if any, plus possibly some annotations.  The alternative
7247name is the property's full name, unless that would simply repeat the first
7248column, in which case the second column indicates the property's short name
7249(if different).  The annotations are given only in the entry for the full
7250name.  The annotations for binary properties include a list of the first few
7251ranges that the property matches.  To avoid any ambiguity, the SPACE character
7252is represented as C<\x20>.
7253
7254If a property is obsolete, etc, the entry will be flagged with the same
7255characters used in the table in the L<section above|/Properties accessible
7256through \p{} and \P{}>, like B<D> or B<S>.
7257
7258   NAME                      INFO
7259
7260   Age
7261   AHex                    ASCII_Hex_Digit
7262   All                     (Perl extension).  All code points,
7263                           including those above Unicode.  Same as
7264                           qr/./s.  U+0000..infinity
7265   Alnum                   XPosixAlnum.  (Perl extension)
7266   Alpha                   Alphabetic
7267   Alphabetic              (Short: Alpha).  [A-Za-z\xaa\xb5\xba\xc0-
7268                           \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7269                           U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7270                           ...
7271   Any                     (Perl extension).  All Unicode code
7272                           points.  U+0000..10FFFF
7273   ASCII                   Block=Basic_Latin.  (Perl extension).
7274                           [\x00-\x7f]
7275   ASCII_Hex_Digit         (Short: AHex).  [0-9A-Fa-f]
7276   Assigned                (Perl extension).  All assigned code
7277                           points.  U+0000..0377, U+037A..037F,
7278                           U+0384..038A, U+038C, U+038E..03A1,
7279                           U+03A3..052F ...
7280   Bc                      Bidi_Class
7281   Bidi_C                  Bidi_Control
7282   Bidi_Class              (Short: bc)
7283   Bidi_Control            (Short: Bidi_C).  U+061C, U+200E..200F,
7284                           U+202A..202E, U+2066..2069
7285   Bidi_M                  Bidi_Mirrored
7286   Bidi_Mirrored           (Short: Bidi_M).  [\(\)<>\[\]\{\}\xab
7287                           \xbb], U+0F3A..0F3D, U+169B..169C,
7288                           U+2039..203A, U+2045..2046, U+207D..207E
7289                           ...
7290   Bidi_Mirroring_Glyph    (Short: bmg)
7291   Bidi_Paired_Bracket     (Short: bpb)
7292   Bidi_Paired_Bracket_Type (Short: bpt)
7293   Blank                   XPosixBlank.  (Perl extension)
7294   Blk                     Block
7295   Block                   (Short: blk)
7296   Bmg                     Bidi_Mirroring_Glyph
7297   Bpb                     Bidi_Paired_Bracket
7298   Bpt                     Bidi_Paired_Bracket_Type
7299   Canonical_Combining_Class (Short: ccc)
7300   Case_Folding            (Short: cf)
7301   Case_Ignorable          (Short: CI).  [\'.:\^`\xa8\xad\xaf\xb4
7302                           \xb7-\xb8], U+02B0..036F, U+0374..0375,
7303                           U+037A, U+0384..0385, U+0387 ...
7304   Cased                   [A-Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7305                           \xff], U+0100..01BA, U+01BC..01BF,
7306                           U+01C4..0293, U+0295..02B8, U+02C0..02C1
7307                           ...
7308   Category                General_Category
7309   Ccc                     Canonical_Combining_Class
7310   CE                      Composition_Exclusion
7311   Cf                      Case_Folding; NOT 'cf' meaning
7312                           'General_Category=Format'
7313   Changes_When_Casefolded (Short: CWCF).  [A-Z\xb5\xc0-\xd6\xd8-
7314                           \xdf], U+0100, U+0102, U+0104, U+0106,
7315                           U+0108 ...
7316   Changes_When_Casemapped (Short: CWCM).  [A-Za-z\xb5\xc0-\xd6\xd8-
7317                           \xf6\xf8-\xff], U+0100..0137,
7318                           U+0139..018C, U+018E..019A, U+019C..01A9,
7319                           U+01AC..01B9 ...
7320   Changes_When_Lowercased (Short: CWL).  [A-Z\xc0-\xd6\xd8-\xde],
7321                           U+0100, U+0102, U+0104, U+0106, U+0108 ...
7322   Changes_When_NFKC_Casefolded (Short: CWKCF).  [A-Z\xa0\xa8\xaa
7323                           \xad\xaf\xb2-\xb5\xb8-\xba\xbc-\xbe\xc0-
7324                           \xd6\xd8-\xdf], U+0100, U+0102, U+0104,
7325                           U+0106, U+0108 ...
7326   Changes_When_Titlecased (Short: CWT).  [a-z\xb5\xdf-\xf6\xf8-
7327                           \xff], U+0101, U+0103, U+0105, U+0107,
7328                           U+0109 ...
7329   Changes_When_Uppercased (Short: CWU).  [a-z\xb5\xdf-\xf6\xf8-
7330                           \xff], U+0101, U+0103, U+0105, U+0107,
7331                           U+0109 ...
7332   CI                      Case_Ignorable
7333   Cntrl                   XPosixCntrl (=General_Category=Control).
7334                           (Perl extension)
7335   Comp_Ex                 Full_Composition_Exclusion
7336   Composition_Exclusion   (Short: CE).  U+0958..095F, U+09DC..09DD,
7337                           U+09DF, U+0A33, U+0A36, U+0A59..0A5B ...
7338   CWCF                    Changes_When_Casefolded
7339   CWCM                    Changes_When_Casemapped
7340   CWKCF                   Changes_When_NFKC_Casefolded
7341   CWL                     Changes_When_Lowercased
7342   CWT                     Changes_When_Titlecased
7343   CWU                     Changes_When_Uppercased
7344   Dash                    [\-], U+058A, U+05BE, U+1400, U+1806,
7345                           U+2010..2015 ...
7346   Decomposition_Mapping   (Short: dm)
7347   Decomposition_Type      (Short: dt)
7348   Default_Ignorable_Code_Point (Short: DI).  [\xad], U+034F, U+061C,
7349                           U+115F..1160, U+17B4..17B5, U+180B..180F
7350                           ...
7351   Dep                     Deprecated
7352   Deprecated              (Short: Dep).  U+0149, U+0673, U+0F77,
7353                           U+0F79, U+17A3..17A4, U+206A..206F ...
7354   DI                      Default_Ignorable_Code_Point
7355   Dia                     Diacritic
7356   Diacritic               (Short: Dia).  [\^`\xa8\xaf\xb4\xb7-\xb8],
7357                           U+02B0..034E, U+0350..0357, U+035D..0362,
7358                           U+0374..0375, U+037A ...
7359   Digit                   XPosixDigit (=General_Category=
7360                           Decimal_Number).  (Perl extension)
7361   Dm                      Decomposition_Mapping
7362   Dt                      Decomposition_Type
7363   Ea                      East_Asian_Width
7364   East_Asian_Width        (Short: ea)
7365   EBase                   Emoji_Modifier_Base
7366   EComp                   Emoji_Component
7367   EMod                    Emoji_Modifier
7368   Emoji                   [#*0-9\xa9\xae], U+203C, U+2049, U+2122,
7369                           U+2139, U+2194..2199 ...
7370   Emoji_Component         (Short: EComp).  [#*0-9], U+200D, U+20E3,
7371                           U+FE0F, U+1F1E6..1F1FF, U+1F3FB..1F3FF ...
7372   Emoji_Modifier          (Short: EMod).  U+1F3FB..1F3FF
7373   Emoji_Modifier_Base     (Short: EBase).  U+261D, U+26F9,
7374                           U+270A..270D, U+1F385, U+1F3C2..1F3C4,
7375                           U+1F3C7 ...
7376   Emoji_Presentation      (Short: EPres).  U+231A..231B,
7377                           U+23E9..23EC, U+23F0, U+23F3,
7378                           U+25FD..25FE, U+2614..2615 ...
7379   EPres                   Emoji_Presentation
7380   EqUIdeo                 Equivalent_Unified_Ideograph
7381   Equivalent_Unified_Ideograph (Short: EqUIdeo)
7382   Ext                     Extender
7383   Extended_Pictographic   (Short: ExtPict).  [\xa9\xae], U+203C,
7384                           U+2049, U+2122, U+2139, U+2194..2199 ...
7385   Extender                (Short: Ext).  [\xb7], U+02D0..02D1,
7386                           U+0640, U+07FA, U+0B55, U+0E46 ...
7387   ExtPict                 Extended_Pictographic
7388   Full_Composition_Exclusion (Short: Comp_Ex).  U+0340..0341,
7389                           U+0343..0344, U+0374, U+037E, U+0387,
7390                           U+0958..095F ...
7391   Gc                      General_Category
7392   GCB                     Grapheme_Cluster_Break
7393   General_Category        (Short: gc)
7394   Gr_Base                 Grapheme_Base
7395   Gr_Ext                  Grapheme_Extend
7396   Graph                   XPosixGraph.  (Perl extension)
7397   Grapheme_Base           (Short: Gr_Base).  [\x20-\x7e\xa0-\xac
7398                           \xae-\xff], U+0100..02FF, U+0370..0377,
7399                           U+037A..037F, U+0384..038A, U+038C ...
7400   Grapheme_Cluster_Break  (Short: GCB)
7401   Grapheme_Extend         (Short: Gr_Ext).  U+0300..036F,
7402                           U+0483..0489, U+0591..05BD, U+05BF,
7403                           U+05C1..05C2, U+05C4..05C5 ...
7404   Hangul_Syllable_Type    (Short: hst)
7405   Hex                     Hex_Digit
7406   Hex_Digit               (Short: Hex).  [0-9A-Fa-f], U+FF10..FF19,
7407                           U+FF21..FF26, U+FF41..FF46
7408   HorizSpace              XPosixBlank.  (Perl extension)
7409   Hst                     Hangul_Syllable_Type
7410 D Hyphen                  [\-\xad], U+058A, U+1806, U+2010..2011,
7411                           U+2E17, U+30FB ...  Supplanted by
7412                           Line_Break property values; see
7413                           www.unicode.org/reports/tr14
7414   ID_Continue             (Short: IDC).  [0-9A-Z_a-z\xaa\xb5\xb7
7415                           \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7416                           U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7417                           U+02EC, U+02EE ...
7418   ID_Start                (Short: IDS).  [A-Za-z\xaa\xb5\xba\xc0-
7419                           \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7420                           U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7421                           ...
7422   IDC                     ID_Continue
7423   Identifier_Status
7424   Identifier_Type
7425   Ideo                    Ideographic
7426   Ideographic             (Short: Ideo).  U+3006..3007,
7427                           U+3021..3029, U+3038..303A, U+3400..4DBF,
7428                           U+4E00..9FFF, U+F900..FA6D ...
7429   IDS                     ID_Start
7430   IDS_Binary_Operator     (Short: IDSB).  U+2FF0..2FF1, U+2FF4..2FFB
7431   IDS_Trinary_Operator    (Short: IDST).  U+2FF2..2FF3
7432   IDSB                    IDS_Binary_Operator
7433   IDST                    IDS_Trinary_Operator
7434   In                      Present_In.  (Perl extension)
7435   Indic_Positional_Category (Short: InPC)
7436   Indic_Syllabic_Category (Short: InSC)
7437   InPC                    Indic_Positional_Category
7438   InSC                    Indic_Syllabic_Category
7439   Isc                     ISO_Comment; NOT 'isc' meaning
7440                           'General_Category=Other'
7441   ISO_Comment             (Short: isc)
7442   Jg                      Joining_Group
7443   Join_C                  Join_Control
7444   Join_Control            (Short: Join_C).  U+200C..200D
7445   Joining_Group           (Short: jg)
7446   Joining_Type            (Short: jt)
7447   Jt                      Joining_Type
7448   Lb                      Line_Break
7449   Lc                      Lowercase_Mapping; NOT 'lc' meaning
7450                           'General_Category=Cased_Letter'
7451   Line_Break              (Short: lb)
7452   LOE                     Logical_Order_Exception
7453   Logical_Order_Exception (Short: LOE).  U+0E40..0E44, U+0EC0..0EC4,
7454                           U+19B5..19B7, U+19BA, U+AAB5..AAB6, U+AAB9
7455                           ...
7456   Lower                   Lowercase
7457   Lowercase               (Short: Lower).  [a-z\xaa\xb5\xba\xdf-
7458                           \xf6\xf8-\xff], U+0101, U+0103, U+0105,
7459                           U+0107, U+0109 ...
7460   Lowercase_Mapping       (Short: lc)
7461   Math                    [+<=>\^\|~\xac\xb1\xd7\xf7], U+03D0..03D2,
7462                           U+03D5, U+03F0..03F1, U+03F4..03F6,
7463                           U+0606..0608 ...
7464   Na                      Name
7465   Na1                     Unicode_1_Name
7466   Name                    (Short: na)
7467   Name_Alias
7468   NChar                   Noncharacter_Code_Point
7469   NFC_QC                  NFC_Quick_Check
7470   NFC_Quick_Check         (Short: NFC_QC)
7471   NFD_QC                  NFD_Quick_Check
7472   NFD_Quick_Check         (Short: NFD_QC)
7473   NFKC_Casefold           (Short: NFKC_CF)
7474   NFKC_CF                 NFKC_Casefold
7475   NFKC_QC                 NFKC_Quick_Check
7476   NFKC_Quick_Check        (Short: NFKC_QC)
7477   NFKD_QC                 NFKD_Quick_Check
7478   NFKD_Quick_Check        (Short: NFKD_QC)
7479   Noncharacter_Code_Point (Short: NChar).  U+FDD0..FDEF,
7480                           U+FFFE..FFFF, U+1FFFE..1FFFF,
7481                           U+2FFFE..2FFFF, U+3FFFE..3FFFF,
7482                           U+4FFFE..4FFFF ...
7483   Nt                      Numeric_Type
7484   Numeric_Type            (Short: nt)
7485   Numeric_Value           (Short: nv)
7486   Nv                      Numeric_Value
7487   Pat_Syn                 Pattern_Syntax
7488   Pat_WS                  Pattern_White_Space
7489   Pattern_Syntax          (Short: Pat_Syn).  [!\"#\$\%&\'\(\)*+,\-.
7490                           \/:;<=>?\@\[\\\]\^`\{\|\}~\xa1-\xa7\xa9
7491                           \xab-\xac\xae\xb0-\xb1\xb6\xbb\xbf\xd7
7492                           \xf7], U+2010..2027, U+2030..203E,
7493                           U+2041..2053, U+2055..205E, U+2190..245F
7494                           ...
7495   Pattern_White_Space     (Short: Pat_WS).  [\t\n\cK\f\r\x20\x85],
7496                           U+200E..200F, U+2028..2029
7497   PCM                     Prepended_Concatenation_Mark
7498   Perl_Decimal_Digit      (Perl extension)
7499   PerlSpace               PosixSpace.  (Perl extension)
7500   PerlWord                PosixWord.  (Perl extension)
7501   PosixAlnum              (Perl extension).  [0-9A-Za-z]
7502   PosixAlpha              (Perl extension).  [A-Za-z]
7503   PosixBlank              (Perl extension).  [\t\x20]
7504   PosixCntrl              (Perl extension).  ASCII control
7505                           characters.  ACK, BEL, BS, CAN, CR, DC1,
7506                           DC2, DC3, DC4, DEL, DLE, ENQ, EOM, EOT,
7507                           ESC, ETB, ETX, FF, FS, GS, HT, LF, NAK,
7508                           NUL, RS, SI, SO, SOH, STX, SUB, SYN, US, VT
7509   PosixDigit              (Perl extension).  [0-9]
7510   PosixGraph              (Perl extension).  [!\"#\$\%&\'\(\)*+,\-.
7511                           \/0-9:;<=>?\@A-Z\[\\\]\^_`a-z\{\|\}~]
7512   PosixLower              (Perl extension).  [a-z]
7513   PosixPrint              (Perl extension).  [\x20-\x7e]
7514   PosixPunct              (Perl extension).  [!\"#\$\%&\'\(\)*+,\-.
7515                           \/:;<=>?\@\[\\\]\^_`\{\|\}~]
7516   PosixSpace              (Perl extension).  [\t\n\cK\f\r\x20]
7517   PosixUpper              (Perl extension).  [A-Z]
7518   PosixWord               (Perl extension).  \w, restricted to
7519                           ASCII.  [0-9A-Z_a-z]
7520   PosixXDigit             ASCII_Hex_Digit.  (Perl extension).
7521                           [0-9A-Fa-f]
7522   Prepended_Concatenation_Mark (Short: PCM).  U+0600..0605, U+06DD,
7523                           U+070F, U+0890..0891, U+08E2, U+110BD ...
7524   Present_In              (Short: In).  (Perl extension)
7525   Print                   XPosixPrint.  (Perl extension)
7526   Punct                   General_Category=Punctuation.  (Perl
7527                           extension).  [!\"#\%&\'\(\)*,\-.\/:;?\@
7528                           \[\\\]_\{\}\xa1\xa7\xab\xb6-\xb7\xbb\xbf],
7529                           U+037E, U+0387, U+055A..055F,
7530                           U+0589..058A, U+05BE ...
7531   QMark                   Quotation_Mark
7532   Quotation_Mark          (Short: QMark).  [\"\'\xab\xbb],
7533                           U+2018..201F, U+2039..203A, U+2E42,
7534                           U+300C..300F, U+301D..301F ...
7535   Radical                 U+2E80..2E99, U+2E9B..2EF3, U+2F00..2FD5
7536   Regional_Indicator      (Short: RI).  U+1F1E6..1F1FF
7537   RI                      Regional_Indicator
7538   SB                      Sentence_Break
7539   Sc                      Script; NOT 'sc' meaning
7540                           'General_Category=Currency_Symbol'
7541   Scf                     Simple_Case_Folding
7542   Script                  (Short: sc)
7543   Script_Extensions       (Short: scx)
7544   Scx                     Script_Extensions
7545   SD                      Soft_Dotted
7546   Sentence_Break          (Short: SB)
7547   Sentence_Terminal       (Short: STerm).  [!.?], U+0589,
7548                           U+061D..061F, U+06D4, U+0700..0702, U+07F9
7549                           ...
7550   Sfc                     Simple_Case_Folding
7551   Simple_Case_Folding     (Short: scf)
7552   Simple_Lowercase_Mapping (Short: slc)
7553   Simple_Titlecase_Mapping (Short: stc)
7554   Simple_Uppercase_Mapping (Short: suc)
7555   Slc                     Simple_Lowercase_Mapping
7556   Soft_Dotted             (Short: SD).  [i-j], U+012F, U+0249,
7557                           U+0268, U+029D, U+02B2 ...
7558   Space                   White_Space
7559   SpacePerl               XPosixSpace.  (Perl extension)
7560   Stc                     Simple_Titlecase_Mapping
7561   STerm                   Sentence_Terminal
7562   Suc                     Simple_Uppercase_Mapping
7563   Tc                      Titlecase_Mapping
7564   Term                    Terminal_Punctuation
7565   Terminal_Punctuation    (Short: Term).  [!,.:;?], U+037E, U+0387,
7566                           U+0589, U+05C3, U+060C ...
7567   Title                   Titlecase.  (Perl extension)
7568   Titlecase               (Short: Title).  (Perl extension).  (=
7569                           \p{Gc=Lt}).  U+01C5, U+01C8, U+01CB,
7570                           U+01F2, U+1F88..1F8F, U+1F98..1F9F ...
7571   Titlecase_Mapping       (Short: tc)
7572   Uc                      Uppercase_Mapping
7573   UIdeo                   Unified_Ideograph
7574   Unicode                 Any.  (Perl extension)
7575   Unicode_1_Name          (Short: na1)
7576   Unified_Ideograph       (Short: UIdeo).  U+3400..4DBF,
7577                           U+4E00..9FFF, U+FA0E..FA0F, U+FA11,
7578                           U+FA13..FA14, U+FA1F ...
7579   Upper                   Uppercase
7580   Uppercase               (Short: Upper).  [A-Z\xc0-\xd6\xd8-\xde],
7581                           U+0100, U+0102, U+0104, U+0106, U+0108 ...
7582   Uppercase_Mapping       (Short: uc)
7583   Variation_Selector      (Short: VS).  U+180B..180D, U+180F,
7584                           U+FE00..FE0F, U+E0100..E01EF
7585   Vertical_Orientation    (Short: vo)
7586   VertSpace               (Perl extension).  \v.  [\n\cK\f\r\x85],
7587                           U+2028..2029
7588   Vo                      Vertical_Orientation
7589   VS                      Variation_Selector
7590   WB                      Word_Break
7591   White_Space             (Short: WSpace).  [\t\n\cK\f\r\x20\x85
7592                           \xa0], U+1680, U+2000..200A, U+2028..2029,
7593                           U+202F, U+205F ...
7594   Word                    XPosixWord.  (Perl extension)
7595   Word_Break              (Short: WB)
7596   WSpace                  White_Space
7597   XDigit                  XPosixXDigit (=Hex_Digit).  (Perl
7598                           extension)
7599   XID_Continue            (Short: XIDC).  [0-9A-Z_a-z\xaa\xb5\xb7
7600                           \xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7601                           U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7602                           U+02EC, U+02EE ...
7603   XID_Start               (Short: XIDS).  [A-Za-z\xaa\xb5\xba\xc0-
7604                           \xd6\xd8-\xf6\xf8-\xff], U+0100..02C1,
7605                           U+02C6..02D1, U+02E0..02E4, U+02EC, U+02EE
7606                           ...
7607   XIDC                    XID_Continue
7608   XIDS                    XID_Start
7609   XPerlSpace              XPosixSpace.  (Perl extension)
7610   XPosixAlnum             (Short: Alnum).  (Perl extension).
7611                           Alphabetic and (decimal) Numeric.  [0-9A-
7612                           Za-z\xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-
7613                           \xff], U+0100..02C1, U+02C6..02D1,
7614                           U+02E0..02E4, U+02EC, U+02EE ...
7615   XPosixAlpha             Alphabetic.  (Perl extension).  [A-Za-z
7616                           \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7617                           U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7618                           U+02EC, U+02EE ...
7619   XPosixBlank             (Short: Blank).  (Perl extension).  \h,
7620                           Horizontal white space.  [\t\x20\xa0],
7621                           U+1680, U+2000..200A, U+202F, U+205F,
7622                           U+3000
7623   XPosixCntrl             General_Category=Control  (Short: Cntrl).
7624                           (Perl extension).  Control characters.
7625                           [\x00-\x1f\x7f-\x9f]
7626   XPosixDigit             General_Category=Decimal_Number  (Short:
7627                           Digit).  (Perl extension).  [0-9] + all
7628                           other decimal digits.  [0-9],
7629                           U+0660..0669, U+06F0..06F9, U+07C0..07C9,
7630                           U+0966..096F, U+09E6..09EF ...
7631   XPosixGraph             (Short: Graph).  (Perl extension).
7632                           Characters that are graphical.  [!\"#\$
7633                           \%&\'\(\)*+,\-.\/0-9:;<=>?\@A-Z\[\\\]
7634                           \^_`a-z\{\|\}~\xa1-\xff], U+0100..0377,
7635                           U+037A..037F, U+0384..038A, U+038C,
7636                           U+038E..03A1 ...
7637   XPosixLower             Lowercase.  (Perl extension).  [a-z\xaa
7638                           \xb5\xba\xdf-\xf6\xf8-\xff], U+0101,
7639                           U+0103, U+0105, U+0107, U+0109 ...
7640   XPosixPrint             (Short: Print).  (Perl extension).
7641                           Characters that are graphical plus space
7642                           characters (but no controls).  [\x20-\x7e
7643                           \xa0-\xff], U+0100..0377, U+037A..037F,
7644                           U+0384..038A, U+038C, U+038E..03A1 ...
7645   XPosixPunct             (Perl extension).  \p{Punct} + ASCII-range
7646                           \p{Symbol}.  [!\"#\$\%&\'\(\)*+,\-.\/:;<=
7647                           >?\@\[\\\]\^_`\{\|\}~\xa1\xa7\xab\xb6-
7648                           \xb7\xbb\xbf], U+037E, U+0387,
7649                           U+055A..055F, U+0589..058A, U+05BE ...
7650   XPosixSpace             (Perl extension).  \s including beyond
7651                           ASCII and vertical tab.  [\t\n\cK\f\r\x20
7652                           \x85\xa0], U+1680, U+2000..200A,
7653                           U+2028..2029, U+202F, U+205F ...
7654   XPosixUpper             Uppercase.  (Perl extension).  [A-Z\xc0-
7655                           \xd6\xd8-\xde], U+0100, U+0102, U+0104,
7656                           U+0106, U+0108 ...
7657   XPosixWord              (Short: Word).  (Perl extension).  \w,
7658                           including beyond ASCII; = \p{Alnum} + \pM
7659                           + \p{Pc} + \p{Join_Control}.  [0-9A-Z_a-z
7660                           \xaa\xb5\xba\xc0-\xd6\xd8-\xf6\xf8-\xff],
7661                           U+0100..02C1, U+02C6..02D1, U+02E0..02E4,
7662                           U+02EC, U+02EE ...
7663   XPosixXDigit            Hex_Digit  (Short: XDigit).  (Perl
7664                           extension).  [0-9A-Fa-f], U+FF10..FF19,
7665                           U+FF21..FF26, U+FF41..FF46
7666
7667
7668=head1 Properties accessible through other means
7669
7670Certain properties are accessible also via core function calls.  These are:
7671
7672 Lowercase_Mapping          lc() and lcfirst()
7673 Titlecase_Mapping          ucfirst()
7674 Uppercase_Mapping          uc()
7675
7676Also, Case_Folding is accessible through the C</i> modifier in regular
7677expressions, the C<\F> transliteration escape, and the C<L<fc|perlfunc/fc>>
7678operator.
7679
7680Besides being able to say C<\p{Name=...}>, the Name and Name_Aliases
7681properties are accessible through the C<\N{}> interpolation in double-quoted
7682strings and regular expressions; and functions C<charnames::viacode()>,
7683C<charnames::vianame()>, and C<charnames::string_vianame()> (which require a
7684C<use charnames ();> to be specified.
7685
7686Finally, most properties related to decomposition are accessible via
7687L<Unicode::Normalize>.
7688
7689=head1 Unicode character properties that are NOT accepted by Perl
7690
7691Perl will generate an error for a few character properties in Unicode when
7692used in a regular expression.  The non-Unihan ones are listed below, with the
7693reasons they are not accepted, perhaps with work-arounds.  The short names for
7694the properties are listed enclosed in (parentheses).
7695As described after the list, an installation can change the defaults and choose
7696to accept any of these.  The list is machine generated based on the
7697choices made for the installation that generated this document.
7698
7699
7700=over 4
7701
7702
7703
7704=item I<Expands_On_NFC> (XO_NFC)
7705
7706=item I<Expands_On_NFD> (XO_NFD)
7707
7708=item I<Expands_On_NFKC> (XO_NFKC)
7709
7710=item I<Expands_On_NFKD> (XO_NFKD)
7711
7712Deprecated by Unicode.  These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used.  For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character.
7713
7714
7715
7716=item I<Grapheme_Link> (Gr_Link)
7717
7718Duplicates ccc=vr (Canonical_Combining_Class=Virama)
7719
7720
7721
7722=item I<Jamo_Short_Name> (JSN)
7723
7724=item I<Other_Alphabetic> (OAlpha)
7725
7726=item I<Other_Default_Ignorable_Code_Point> (ODI)
7727
7728=item I<Other_Grapheme_Extend> (OGr_Ext)
7729
7730=item I<Other_ID_Continue> (OIDC)
7731
7732=item I<Other_ID_Start> (OIDS)
7733
7734=item I<Other_Lowercase> (OLower)
7735
7736=item I<Other_Math> (OMath)
7737
7738=item I<Other_Uppercase> (OUpper)
7739
7740Used by Unicode internally for generating other properties and not intended to be used stand-alone
7741
7742
7743
7744=item I<Script=Katakana_Or_Hiragana> (sc=Hrkt)
7745
7746Obsolete.  All code points previously matched by this have been moved to "Script=Common".  Consider instead using "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or both)
7747
7748
7749
7750=item I<Script_Extensions=Katakana_Or_Hiragana> (scx=Hrkt)
7751
7752All code points that would be matched by this are matched by either "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
7753
7754=back
7755
7756
7757An installation can choose to allow any of these to be matched by downloading
7758the Unicode database from L<http://www.unicode.org/Public/> to
7759C<$Config{privlib}>/F<unicore/> in the Perl source tree, changing the
7760controlling lists contained in the program
7761C<$Config{privlib}>/F<unicore/mktables> and then re-compiling and installing.
7762(C<%Config> is available from the Config module).
7763
7764Also, perl can be recompiled to operate on an earlier version of the Unicode
7765standard.  Further information is at
7766C<$Config{privlib}>/F<unicore/README.perl>.
7767
7768=head1 Other information in the Unicode data base
7769
7770The Unicode data base is delivered in two different formats.  The XML version
7771is valid for more modern Unicode releases.  The other version is a collection
7772of files.  The two are intended to give equivalent information.  Perl uses the
7773older form; this allows you to recompile Perl to use early Unicode releases.
7774
7775The only non-character property that Perl currently supports is Named
7776Sequences, in which a sequence of code points
7777is given a name and generally treated as a single entity.  (Perl supports
7778these via the C<\N{...}> double-quotish construct,
7779L<charnames/charnames::string_vianame(name)>, and L<Unicode::UCD/namedseq()>.
7780
7781Below is a list of the files in the Unicode data base that Perl doesn't
7782currently use, along with very brief descriptions of their purposes.
7783Some of the names of the files have been shortened from those that Unicode
7784uses, in order to allow them to be distinguishable from similarly named files
7785on file systems for which only the first 8 characters of a name are
7786significant.
7787
7788=over 4
7789
7790
7791
7792
7793=item F<auxiliary/GraphemeBreakTest.html>
7794
7795=item F<auxiliary/LineBreakTest.html>
7796
7797=item F<auxiliary/SentenceBreakTest.html>
7798
7799=item F<auxiliary/WordBreakTest.html>
7800
7801Documentation of validation Tests
7802
7803
7804
7805=item F<BidiCharacterTest.txt>
7806
7807=item F<BidiTest.txt>
7808
7809Validation Tests
7810
7811
7812
7813=item F<CJKRadicals.txt>
7814
7815Maps the kRSUnicode property values to corresponding code points
7816
7817
7818
7819=item F<confusables.txt>
7820
7821=item F<confusablesSummary.txt>
7822
7823=item F<intentional.txt>
7824
7825Currently unused by Perl
7826
7827
7828
7829=item F<emoji/ReadMe.txt>
7830
7831=item F<ReadMe.txt>
7832
7833Documentation
7834
7835
7836
7837=item F<EmojiSources.txt>
7838
7839Maps certain Unicode code points to their legacy Japanese cell-phone values
7840
7841
7842
7843=item F<extracted/DName.txt>
7844
7845This file adds no new information not already present in other files
7846
7847
7848
7849=item F<Index.txt>
7850
7851Alphabetical index of Unicode characters
7852
7853
7854
7855=item F<NamedSqProv.txt>
7856
7857Named sequences proposed for inclusion in a later version of the Unicode Standard; if you need them now, you can append this file to F<NamedSequences.txt> and recompile perl
7858
7859
7860
7861=item F<NamesList.html>
7862
7863Describes the format and contents of F<NamesList.txt>
7864
7865
7866
7867=item F<NamesList.txt>
7868
7869Annotated list of characters
7870
7871
7872
7873=item F<NormalizationCorrections.txt>
7874
7875Documentation of corrections already incorporated into the Unicode data base
7876
7877
7878
7879=item F<NushuSources.txt>
7880
7881Specifies source material for Nushu characters
7882
7883
7884
7885=item F<StandardizedVariants.html>
7886
7887Obsoleted as of Unicode 9.0, but previously provided a visual display of the standard variant sequences derived from F<StandardizedVariants.txt>.
7888
7889
7890
7891=item F<StandardizedVariants.txt>
7892
7893Certain glyph variations for character display are standardized.  This lists the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a separate Unicode data base L<http://www.unicode.org/ivd>
7894
7895
7896
7897=item F<TangutSources.txt>
7898
7899Specifies source mappings for Tangut ideographs and components. This data file also includes informative radical-stroke values that are used internally by Unicode
7900
7901
7902
7903=item F<USourceData.txt>
7904
7905Documentation of status and cross reference of proposals for encoding by Unicode of Unihan characters
7906
7907
7908
7909=item F<USourceGlyphs.pdf>
7910
7911Pictures of the characters in F<USourceData.txt>
7912
7913
7914=back
7915
7916=head1 SEE ALSO
7917
7918L<http://www.unicode.org/reports/tr44/>
7919
7920L<perlrecharclass>
7921
7922L<perlunicode>
7923
7924