perluniprops

Perl 5 version 26.1 documentation

NAME

perluniprops - Index of Unicode Version 9.0.0 character properties in Perl

DESCRIPTION

This document provides information about the portion of the Unicode database that deals with character properties, that is, the portion that is defined on single code points. (The section "Other information in the Unicode data base" below briefly mentions other data that Unicode provides.)

Perl can provide access to all non-provisional Unicode character properties, though not all are enabled by default. The omitted ones are the Unihan properties (accessible via the CPAN module Unicode::Unihan) and certain deprecated or Unicode-internal properties. (An installation may choose to recompile Perl's tables to change this. See "Unicode character properties that are NOT accepted by Perl".)

For most purposes, access to Unicode properties from the Perl core is through regular expression matches, as described in the next section. For some special purposes, and to access the properties that are not suitable for regular expression matching, all the Unicode character properties that Perl handles are accessible via the standard Unicode::UCD module, as described in the section "Properties accessible through Unicode::UCD".
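As a rough illustration of the Unicode::UCD route, the sketch below looks up a few properties of a single code point with charinfo() and charscript(). The sample code point U+03B1 and the variable names are arbitrary choices made for this example.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charinfo charscript);

# U+03B1 GREEK SMALL LETTER ALPHA is used purely as a sample code point.
my $cp = 0x03B1;

my $info   = charinfo($cp);      # hash ref with fields such as name and category
my $script = charscript($cp);    # value of the Script property

printf "U+%04X name=%s category=%s script=%s\n",
    $cp, $info->{name}, $info->{category}, $script;
```

Unicode::UCD ships with Perl, so nothing beyond the core distribution should be needed to run this.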

Perl also provides some additional extensions and short-cut synonyms for Unicode properties.

This document merely lists all available properties and does not attempt to explain what each property really means. There is a brief description of each Perl extension; see "Other Properties" in perlunicode for more information on these. There is some detail about Blocks, Scripts, General_Category, and Bidi_Class in perlunicode, but to find out about the intricacies of the official Unicode properties, refer to the Unicode standard. A good starting place is http://www.unicode.org/reports/tr44/.

Note that you can define your own properties; see "User-Defined Character Properties" in perlunicode.

Properties accessible through \p{} and \P{}

The Perl regular expression \p{} and \P{} constructs give access to most of the Unicode character properties. The table below shows all these constructs, both single and compound forms.

Compound forms consist of two components, separated by an equals sign or a colon. The first component is the property name, and the second component is the particular value of the property to match against. For example, \p{Script: Greek} and \p{Script=Greek} both mean to match characters whose Script property value is Greek.
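A minimal sketch of the two separators, assuming U+03B1 (GREEK SMALL LETTER ALPHA) as a sample Greek character; the variable names are illustrative only.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA, a sample Greek character

# The colon and the equals sign select the same property value.
my $with_colon  = ($alpha =~ /\p{Script: Greek}/) ? 1 : 0;
my $with_equals = ($alpha =~ /\p{Script=Greek}/)  ? 1 : 0;

# A Latin letter has a different Script value, so it should not match.
my $latin = ("a" =~ /\p{Script=Greek}/) ? 1 : 0;

print "colon=$with_colon equals=$with_equals latin=$latin\n";
```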

Single forms, like \p{Greek}, are mostly Perl-defined shortcuts for their equivalent compound forms. The table shows these equivalences. (In our example, \p{Greek} is just a shortcut for \p{Script_Extensions=Greek}, following the same pattern as the script entries in the table below.) There are also a few Perl-defined single forms that are not shortcuts for a compound form. One such is \p{Word}. These are also listed in the table.

In parsing these constructs, Perl always ignores upper/lower case differences everywhere within the {braces}. Thus \p{Greek} means the same thing as \p{greek}. But note that changing the case of the "p" or "P" before the left brace completely changes the meaning of the construct, from "match" (for \p{}) to "doesn't match" (for \P{}). Casing in this document is for improved legibility.
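The following sketch checks both points: case inside the braces is ignored, while the case of the leading "p" flips the sense of the match. The sample character and variable names are assumptions made for the example.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# Three spellings that differ only in case inside the braces.
my @same = map { $alpha =~ $_ ? 1 : 0 }
    (qr/\p{Greek}/, qr/\p{greek}/, qr/\p{GREEK}/);

# Upper-case "P" negates the property, so a Greek letter should not match.
my $complement = ($alpha =~ /\P{Greek}/) ? 1 : 0;

print "same=@same complement=$complement\n";
```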

Also, white space, hyphens, and underscores are normally ignored everywhere between the {braces}, and hence can be freely added or removed even if the /x modifier hasn't been specified on the regular expression. But in the table below a 'T' at the beginning of an entry means that tighter (stricter) rules are used for that entry.
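A small sketch of the loose matching rules for entries that are not flagged 'T'. It spells one compound form four ways; General_Category=Lowercase_Letter and the sample character are choices made for this example.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA, a lowercase letter

# The same property written with and without underscores, with extra
# white space, and with hyphens; none of these use the /x modifier.
my @spellings = (
    qr/\p{General_Category=Lowercase_Letter}/,
    qr/\p{GeneralCategory=LowercaseLetter}/,
    qr/\p{General Category = Lowercase Letter}/,
    qr/\p{general-category: lowercase-letter}/,
);

my @matched = map { $alpha =~ $_ ? 1 : 0 } @spellings;
print "matched=@matched\n";
```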

Some properties are considered obsolete by Unicode but are still available. The table flags the varieties of obsolescence as stabilized (S), obsolete (O), or deprecated (D), as listed in the legend summary below.

The table below has two columns. The left column contains the \p{} constructs to look up, possibly preceded by the flags mentioned above; and the right column contains information about them, like a description, or synonyms. The table shows both the single and compound forms for each property that has them. If the left column is a short name for a property, the right column will give its longer, more descriptive name; and if the left column is the longest name, the right column will show any equivalent shortest name, in both single and compound forms if applicable.

If braces are not needed to specify a property (e.g., \pL ), the left column contains both forms, with and without braces.
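For instance, a one-letter property name such as L can be written with or without braces. The sketch below compares the two spellings; the test strings are arbitrary.

```perl
use strict;
use warnings;

# \pL and \p{L} should behave identically for a letter.
my $braced   = ("x" =~ /\p{L}/) ? 1 : 0;
my $unbraced = ("x" =~ /\pL/)   ? 1 : 0;

# The brace-less negated form \PL should match a non-letter such as a digit.
my $negated  = ("7" =~ /\PL/)   ? 1 : 0;

print "braced=$braced unbraced=$unbraced negated=$negated\n";
```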

The right column will also caution you if a property means something different than what might normally be expected.

All single forms are Perl extensions; a few compound forms are as well, and are noted as such.

Numbers in (parentheses) indicate the total number of Unicode code points matched by the property. For emphasis, those properties that match no code points at all are listed as well in a separate section following the table.
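One way to cross-check such a count is to test every Unicode code point against the property. The sketch below does this for \p{ASCII_Hex_Digit}, which the table lists with (22). Skipping the surrogate range is a precaution taken in this example, not something the table requires.

```perl
use strict;
use warnings;

my $count = 0;
for my $cp (0 .. 0x10FFFF) {
    # Surrogates are skipped as a precaution; none of them is a hex digit.
    next if $cp >= 0xD800 && $cp <= 0xDFFF;
    $count++ if chr($cp) =~ /\p{ASCII_Hex_Digit}/;
}

print "ASCII_Hex_Digit matches $count code points\n";
```

The loop visits over a million code points, so it is meant as a one-off check rather than something to run in production code.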

Most properties match the same code points regardless of whether "/i" case-insensitive matching is specified or not. But a few properties are affected. These are shown with the notation (/i= other_property) in the second column. Under case-insensitive matching they match the same code points as the property other_property.
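A short sketch of the effect, assuming \p{Lu} (General_Category=Uppercase_Letter) is one of the affected properties. That entry lies outside the part of the table shown here; in the full table it carries the note (/i= General_Category=Cased_Letter).

```perl
use strict;
use warnings;

# Without /i, a lowercase letter is not an uppercase letter.
my $plain = ("a" =~ /\p{Lu}/)  ? 1 : 0;

# With /i, \p{Lu} behaves like the cased-letter property, so "a" matches.
my $fold  = ("a" =~ /\p{Lu}/i) ? 1 : 0;

print "plain=$plain fold=$fold\n";
```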

There is no description given for most non-Perl-defined properties (see http://www.unicode.org/reports/tr44/ for that).

For compactness, '*' is used as a wildcard instead of showing all possible combinations. For example, entries like:

    \p{Gc: *}  \p{General_Category: *}

mean that 'Gc' is a synonym for 'General_Category', and anything that is valid for the latter is also valid for the former. Similarly,

    \p{Is_*}  \p{*}

means that if and only if, for example, \p{Foo} exists, then \p{Is_Foo} and \p{IsFoo} are also valid and all mean the same thing. And similarly, \p{Foo=Bar} means the same as \p{Is_Foo=Bar} and \p{IsFoo=Bar}. "*" here is restricted to something not beginning with an underscore.
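The wildcard entries can be spot-checked as follows. The sketch tries the Gc and General_Category spellings with the value L, and then \p{Greek} with its Is_ variants; the sample character is an arbitrary choice.

```perl
use strict;
use warnings;

my $alpha = "\x{3B1}";    # GREEK SMALL LETTER ALPHA

# 'Gc' is a synonym for 'General_Category'.
my @gc = map { $alpha =~ $_ ? 1 : 0 }
    (qr/\p{Gc: L}/, qr/\p{General_Category: L}/);

# \p{Greek} exists, so \p{Is_Greek} and \p{IsGreek} should be valid too.
my @is = map { $alpha =~ $_ ? 1 : 0 }
    (qr/\p{Greek}/, qr/\p{Is_Greek}/, qr/\p{IsGreek}/);

print "gc=@gc is=@is\n";
```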

Also, in binary properties, 'Yes', 'T', and 'True' are all synonyms for 'Y'. And 'No', 'F', and 'False' are all synonyms for 'N'. The table shows 'Y*' and 'N*' to indicate this, and doesn't have separate entries for the other possibilities. Note that not all properties which have values 'Yes' and 'No' are binary, and they have all their values spelled out without using this wild card, and a NOT clause in their description that highlights their not being binary. These also require the compound form to match them, whereas true binary properties have both single and compound forms available.
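A minimal sketch of the value synonyms for a true binary property, using Alphabetic from the table below. The test strings "a" and "1" are assumptions chosen as an alphabetic and a non-alphabetic character.

```perl
use strict;
use warnings;

# All spellings of "yes", plus the single form.
my @yes_forms = (
    qr/\p{Alphabetic=Y}/, qr/\p{Alphabetic=Yes}/,
    qr/\p{Alphabetic=T}/, qr/\p{Alphabetic=True}/,
    qr/\p{Alpha}/,
);

# All spellings of "no", plus the negated single form.
my @no_forms = (
    qr/\p{Alphabetic=N}/, qr/\p{Alphabetic=No}/,
    qr/\p{Alphabetic=F}/, qr/\p{Alphabetic=False}/,
    qr/\P{Alpha}/,
);

# grep in scalar context counts how many patterns matched.
my $yes_hits = grep { "a" =~ $_ } @yes_forms;
my $no_hits  = grep { "1" =~ $_ } @no_forms;

print "yes forms matching 'a': $yes_hits; no forms matching '1': $no_hits\n";
```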

Note that all non-essential underscores are removed in the display of the short names below.

Legend summary:

  • * is a wild-card
  • (\d+) in the info column gives the number of Unicode code points matched by this property.
  • D means this is deprecated.
  • O means this is obsolete.
  • S means this is stabilized.
  • T means tighter (stricter) name matching applies.
  • X means use of this form is discouraged, and may not be stable.

    NAME  INFO
    \p{Adlam}  \p{Script_Extensions=Adlam} (Short: \p{Adlm}; NOT \p{Block=Adlam}) (88)
    \p{Adlm}  \p{Adlam} (= \p{Script_Extensions=Adlam}) (NOT \p{Block=Adlam}) (88)
  X \p{Aegean_Numbers}  \p{Block=Aegean_Numbers} (64)
  T \p{Age: 1.1}  \p{Age=V1_1} (33_979)
  T \p{Age: 2.0}  \p{Age=V2_0} (144_521)
  T \p{Age: 2.1}  \p{Age=V2_1} (2)
  T \p{Age: 3.0}  \p{Age=V3_0} (10_307)
  T \p{Age: 3.1}  \p{Age=V3_1} (44_978)
  T \p{Age: 3.2}  \p{Age=V3_2} (1016)
  T \p{Age: 4.0}  \p{Age=V4_0} (1226)
  T \p{Age: 4.1}  \p{Age=V4_1} (1273)
  T \p{Age: 5.0}  \p{Age=V5_0} (1369)
  T \p{Age: 5.1}  \p{Age=V5_1} (1624)
  T \p{Age: 5.2}  \p{Age=V5_2} (6648)
  T \p{Age: 6.0}  \p{Age=V6_0} (2088)
  T \p{Age: 6.1}  \p{Age=V6_1} (732)
  T \p{Age: 6.2}  \p{Age=V6_2} (1)
  T \p{Age: 6.3}  \p{Age=V6_3} (5)
  T \p{Age: 7.0}  \p{Age=V7_0} (2834)
  T \p{Age: 8.0}  \p{Age=V8_0} (7716)
  T \p{Age: 9.0}  \p{Age=V9_0} (7500)
    \p{Age: NA}  \p{Age=Unassigned} (846_293 plus all above-Unicode code points)
    \p{Age: Unassigned}  Code point's usage has not been assigned in any Unicode release thus far. (Short: \p{Age=NA}) (846_293 plus all above-Unicode code points)
    \p{Age: V1_1}  Code point's usage introduced in version 1.1 (33_979)
    \p{Age: V2_0}  Code point's usage was introduced in version 2.0; See also Property 'Present_In' (144_521)
    \p{Age: V2_1}  Code point's usage was introduced in version 2.1; See also Property 'Present_In' (2)
    \p{Age: V3_0}  Code point's usage was introduced in version 3.0; See also Property 'Present_In' (10_307)
    \p{Age: V3_1}  Code point's usage was introduced in version 3.1; See also Property 'Present_In' (44_978)
    \p{Age: V3_2}  Code point's usage was introduced in version 3.2; See also Property 'Present_In' (1016)
    \p{Age: V4_0}  Code point's usage was introduced in version 4.0; See also Property 'Present_In' (1226)
    \p{Age: V4_1}  Code point's usage was introduced in version 4.1; See also Property 'Present_In' (1273)
    \p{Age: V5_0}  Code point's usage was introduced in version 5.0; See also Property 'Present_In' (1369)
    \p{Age: V5_1}  Code point's usage was introduced in version 5.1; See also Property 'Present_In' (1624)
    \p{Age: V5_2}  Code point's usage was introduced in version 5.2; See also Property 'Present_In' (6648)
    \p{Age: V6_0}  Code point's usage was introduced in version 6.0; See also Property 'Present_In' (2088)
    \p{Age: V6_1}  Code point's usage was introduced in version 6.1; See also Property 'Present_In' (732)
    \p{Age: V6_2}  Code point's usage was introduced in version 6.2; See also Property 'Present_In' (1)
    \p{Age: V6_3}  Code point's usage was introduced in version 6.3; See also Property 'Present_In' (5)
    \p{Age: V7_0}  Code point's usage was introduced in version 7.0; See also Property 'Present_In' (2834)
    \p{Age: V8_0}  Code point's usage was introduced in version 8.0; See also Property 'Present_In' (7716)
    \p{Age: V9_0}  Code point's usage was introduced in version 9.0; See also Property 'Present_In' (7500)
    \p{Aghb}  \p{Caucasian_Albanian} (= \p{Script_Extensions=Caucasian_Albanian}) (NOT \p{Block=Caucasian_Albanian}) (53)
    \p{AHex}  \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y}) (22)
    \p{AHex: *}  \p{ASCII_Hex_Digit: *}
    \p{Ahom}  \p{Script_Extensions=Ahom} (NOT \p{Block=Ahom}) (57)
  X \p{Alchemical}  \p{Alchemical_Symbols} (= \p{Block=Alchemical_Symbols}) (128)
  X \p{Alchemical_Symbols}  \p{Block=Alchemical_Symbols} (Short: \p{InAlchemical}) (128)
    \p{All}  All code points, including those above Unicode. Same as qr/./s (1_114_112 plus all above-Unicode code points)
    \p{Alnum}  \p{XPosixAlnum} (118_820)
    \p{Alpha}  \p{XPosixAlpha} (= \p{Alphabetic=Y}) (118_240)
    \p{Alpha: *}  \p{Alphabetic: *}
    \p{Alphabetic}  \p{XPosixAlpha} (= \p{Alphabetic=Y}) (118_240)
    \p{Alphabetic: N*}  (Short: \p{Alpha=N}, \P{Alpha}) (995_872 plus all above-Unicode code points)
    \p{Alphabetic: Y*}  (Short: \p{Alpha=Y}, \p{Alpha}) (118_240)
  X \p{Alphabetic_PF}  \p{Alphabetic_Presentation_Forms} (= \p{Block=Alphabetic_Presentation_Forms}) (80)
  X \p{Alphabetic_Presentation_Forms}  \p{Block=Alphabetic_Presentation_Forms} (Short: \p{InAlphabeticPF}) (80)
    \p{Anatolian_Hieroglyphs}  \p{Script_Extensions=Anatolian_Hieroglyphs} (Short: \p{Hluw}; NOT \p{Block=Anatolian_Hieroglyphs}) (583)
  X \p{Ancient_Greek_Music}  \p{Ancient_Greek_Musical_Notation} (= \p{Block=Ancient_Greek_Musical_Notation}) (80)
  X \p{Ancient_Greek_Musical_Notation}  \p{Block=Ancient_Greek_Musical_Notation} (Short: \p{InAncientGreekMusic}) (80)
  X \p{Ancient_Greek_Numbers}  \p{Block=Ancient_Greek_Numbers} (80)
  X \p{Ancient_Symbols}  \p{Block=Ancient_Symbols} (64)
    \p{Any}  All Unicode code points: [\x{0000}-\x{10FFFF}] (1_114_112)
    \p{Arab}  \p{Arabic} (= \p{Script_Extensions=Arabic}) (NOT \p{Block=Arabic}) (1323)
    \p{Arabic}  \p{Script_Extensions=Arabic} (Short: \p{Arab}; NOT \p{Block=Arabic}) (1323)
  X \p{Arabic_Ext_A}  \p{Arabic_Extended_A} (= \p{Block=Arabic_Extended_A}) (96)
  X \p{Arabic_Extended_A}  \p{Block=Arabic_Extended_A} (Short: \p{InArabicExtA}) (96)
  X \p{Arabic_Math}  \p{Arabic_Mathematical_Alphabetic_Symbols} (= \p{Block=Arabic_Mathematical_Alphabetic_Symbols}) (256)
  X \p{Arabic_Mathematical_Alphabetic_Symbols}  \p{Block=Arabic_Mathematical_Alphabetic_Symbols} (Short: \p{InArabicMath}) (256)
  X \p{Arabic_PF_A}  \p{Arabic_Presentation_Forms_A} (= \p{Block=Arabic_Presentation_Forms_A}) (688)
  X \p{Arabic_PF_B}  \p{Arabic_Presentation_Forms_B} (= \p{Block=Arabic_Presentation_Forms_B}) (144)
  X \p{Arabic_Presentation_Forms_A}  \p{Block=Arabic_Presentation_Forms_A} (Short: \p{InArabicPFA}) (688)
  X \p{Arabic_Presentation_Forms_B}  \p{Block=Arabic_Presentation_Forms_B} (Short: \p{InArabicPFB}) (144)
  X \p{Arabic_Sup}  \p{Arabic_Supplement} (= \p{Block=Arabic_Supplement}) (48)
  X \p{Arabic_Supplement}  \p{Block=Arabic_Supplement} (Short: \p{InArabicSup}) (48)
    \p{Armenian}  \p{Script_Extensions=Armenian} (Short: \p{Armn}; NOT \p{Block=Armenian}) (94)
    \p{Armi}  \p{Imperial_Aramaic} (= \p{Script_Extensions=Imperial_Aramaic}) (NOT \p{Block=Imperial_Aramaic}) (31)
    \p{Armn}  \p{Armenian} (= \p{Script_Extensions=Armenian}) (NOT \p{Block=Armenian}) (94)
  X \p{Arrows}  \p{Block=Arrows} (112)
    \p{ASCII}  \p{Block=Basic_Latin} [[:ASCII:]] (128)
    \p{ASCII_Hex_Digit}  \p{PosixXDigit} (= \p{ASCII_Hex_Digit=Y}) (22)
    \p{ASCII_Hex_Digit: N*}  (Short: \p{AHex=N}, \P{AHex}) (1_114_090 plus all above-Unicode code points)
    \p{ASCII_Hex_Digit: Y*}  (Short: \p{AHex=Y}, \p{AHex}) (22)
    \p{Assigned}  All assigned code points (267_753)
    \p{Avestan}  \p{Script_Extensions=Avestan} (Short: \p{Avst}; NOT \p{Block=Avestan}) (61)
    \p{Avst}  \p{Avestan} (= \p{Script_Extensions=Avestan}) (NOT \p{Block=Avestan}) (61)
    \p{Bali}  \p{Balinese} (= \p{Script_Extensions=Balinese}) (NOT \p{Block=Balinese}) (121)
    \p{Balinese}  \p{Script_Extensions=Balinese} (Short: \p{Bali}; NOT \p{Block=Balinese}) (121)
    \p{Bamu}  \p{Bamum} (= \p{Script_Extensions=Bamum}) (NOT \p{Block=Bamum}) (657)
    \p{Bamum}  \p{Script_Extensions=Bamum} (Short: \p{Bamu}; NOT \p{Block=Bamum}) (657)
  X \p{Bamum_Sup}  \p{Bamum_Supplement} (= \p{Block=Bamum_Supplement}) (576)
  X \p{Bamum_Supplement}  \p{Block=Bamum_Supplement} (Short: \p{InBamumSup}) (576)
  X \p{Basic_Latin}  \p{ASCII} (= \p{Block=Basic_Latin}) (128)
    \p{Bass}  \p{Bassa_Vah} (= \p{Script_Extensions=Bassa_Vah}) (NOT \p{Block=Bassa_Vah}) (36)
    \p{Bassa_Vah}  \p{Script_Extensions=Bassa_Vah} (Short: \p{Bass}; NOT \p{Block=Bassa_Vah}) (36)
    \p{Batak}  \p{Script_Extensions=Batak} (Short: \p{Batk}; NOT \p{Block=Batak}) (56)
    \p{Batk}  \p{Batak} (= \p{Script_Extensions=Batak}) (NOT \p{Block=Batak}) (56)
    \p{Bc: *}  \p{Bidi_Class: *}
    \p{Beng}  \p{Bengali} (= \p{Script_Extensions=Bengali}) (NOT \p{Block=Bengali}) (98)
    \p{Bengali}  \p{Script_Extensions=Bengali} (Short: \p{Beng}; NOT \p{Block=Bengali}) (98)
    \p{Bhaiksuki}  \p{Script_Extensions=Bhaiksuki} (Short: \p{Bhks}; NOT \p{Block=Bhaiksuki}) (97)
    \p{Bhks}  \p{Bhaiksuki} (= \p{Script_Extensions=Bhaiksuki}) (NOT \p{Block=Bhaiksuki}) (97)
    \p{Bidi_C}  \p{Bidi_Control} (= \p{Bidi_Control=Y}) (12)
    \p{Bidi_C: *}  \p{Bidi_Control: *}
    \p{Bidi_Class: AL}  \p{Bidi_Class=Arabic_Letter} (1420)
    \p{Bidi_Class: AN}  \p{Bidi_Class=Arabic_Number} (51)
    \p{Bidi_Class: Arabic_Letter}  (Short: \p{Bc=AL}) (1420)
    \p{Bidi_Class: Arabic_Number}  (Short: \p{Bc=AN}) (51)
    \p{Bidi_Class: B}  \p{Bidi_Class=Paragraph_Separator} (7)
    \p{Bidi_Class: BN}  \p{Bidi_Class=Boundary_Neutral} (4016)
    \p{Bidi_Class: Boundary_Neutral}  (Short: \p{Bc=BN}) (4016)
    \p{Bidi_Class: Common_Separator}  (Short: \p{Bc=CS}) (15)
    \p{Bidi_Class: CS}  \p{Bidi_Class=Common_Separator} (15)
    \p{Bidi_Class: EN}  \p{Bidi_Class=European_Number} (158)
    \p{Bidi_Class: ES}  \p{Bidi_Class=European_Separator} (12)
    \p{Bidi_Class: ET}  \p{Bidi_Class=European_Terminator} (87)
    \p{Bidi_Class: European_Number}  (Short: \p{Bc=EN}) (158)
    \p{Bidi_Class: European_Separator}  (Short: \p{Bc=ES}) (12)
    \p{Bidi_Class: European_Terminator}  (Short: \p{Bc=ET}) (87)
    \p{Bidi_Class: First_Strong_Isolate}  (Short: \p{Bc=FSI}) (1)
    \p{Bidi_Class: FSI}  \p{Bidi_Class=First_Strong_Isolate} (1)
    \p{Bidi_Class: L}  \p{Bidi_Class=Left_To_Right} (1_097_280 plus all above-Unicode code points)
    \p{Bidi_Class: Left_To_Right}  (Short: \p{Bc=L}) (1_097_280 plus all above-Unicode code points)
    \p{Bidi_Class: Left_To_Right_Embedding}  (Short: \p{Bc=LRE}) (1)
    \p{Bidi_Class: Left_To_Right_Isolate}  (Short: \p{Bc=LRI}) (1)
    \p{Bidi_Class: Left_To_Right_Override}  (Short: \p{Bc=LRO}) (1)
    \p{Bidi_Class: LRE}  \p{Bidi_Class=Left_To_Right_Embedding} (1)
    \p{Bidi_Class: LRI}  \p{Bidi_Class=Left_To_Right_Isolate} (1)
    \p{Bidi_Class: LRO}  \p{Bidi_Class=Left_To_Right_Override} (1)
    \p{Bidi_Class: Nonspacing_Mark}  (Short: \p{Bc=NSM}) (1700)
    \p{Bidi_Class: NSM}  \p{Bidi_Class=Nonspacing_Mark} (1700)
    \p{Bidi_Class: ON}  \p{Bidi_Class=Other_Neutral} (5267)
    \p{Bidi_Class: Other_Neutral}  (Short: \p{Bc=ON}) (5267)
    \p{Bidi_Class: Paragraph_Separator}  (Short: \p{Bc=B}) (7)
    \p{Bidi_Class: PDF}  \p{Bidi_Class=Pop_Directional_Format} (1)
    \p{Bidi_Class: PDI}  \p{Bidi_Class=Pop_Directional_Isolate} (1)
    \p{Bidi_Class: Pop_Directional_Format}  (Short: \p{Bc=PDF}) (1)
    \p{Bidi_Class: Pop_Directional_Isolate}  (Short: \p{Bc=PDI}) (1)
    \p{Bidi_Class: R}  \p{Bidi_Class=Right_To_Left} (4070)
    \p{Bidi_Class: Right_To_Left}  (Short: \p{Bc=R}) (4070)
    \p{Bidi_Class: Right_To_Left_Embedding}  (Short: \p{Bc=RLE}) (1)
    \p{Bidi_Class: Right_To_Left_Isolate}  (Short: \p{Bc=RLI}) (1)
    \p{Bidi_Class: Right_To_Left_Override}  (Short: \p{Bc=RLO}) (1)
    \p{Bidi_Class: RLE}  \p{Bidi_Class=Right_To_Left_Embedding} (1)
    \p{Bidi_Class: RLI}  \p{Bidi_Class=Right_To_Left_Isolate} (1)
    \p{Bidi_Class: RLO}  \p{Bidi_Class=Right_To_Left_Override} (1)
    \p{Bidi_Class: S}  \p{Bidi_Class=Segment_Separator} (3)
    \p{Bidi_Class: Segment_Separator}  (Short: \p{Bc=S}) (3)
    \p{Bidi_Class: White_Space}  (Short: \p{Bc=WS}) (17)
    \p{Bidi_Class: WS}  \p{Bidi_Class=White_Space} (17)
    \p{Bidi_Control}  \p{Bidi_Control=Y} (Short: \p{BidiC}) (12)
    \p{Bidi_Control: N*}  (Short: \p{BidiC=N}, \P{BidiC}) (1_114_100 plus all above-Unicode code points)
    \p{Bidi_Control: Y*}  (Short: \p{BidiC=Y}, \p{BidiC}) (12)
    \p{Bidi_M}  \p{Bidi_Mirrored} (= \p{Bidi_Mirrored=Y}) (545)
    \p{Bidi_M: *}  \p{Bidi_Mirrored: *}
    \p{Bidi_Mirrored}  \p{Bidi_Mirrored=Y} (Short: \p{BidiM}) (545)
    \p{Bidi_Mirrored: N*}  (Short: \p{BidiM=N}, \P{BidiM}) (1_113_567 plus all above-Unicode code points)
    \p{Bidi_Mirrored: Y*}  (Short: \p{BidiM=Y}, \p{BidiM}) (545)
    \p{Bidi_Paired_Bracket_Type: C}  \p{Bidi_Paired_Bracket_Type=Close} (60)
    \p{Bidi_Paired_Bracket_Type: Close}  (Short: \p{Bpt=C}) (60)
    \p{Bidi_Paired_Bracket_Type: N}  \p{Bidi_Paired_Bracket_Type=None} (1_113_992 plus all above-Unicode code points)
    \p{Bidi_Paired_Bracket_Type: None}  (Short: \p{Bpt=N}) (1_113_992 plus all above-Unicode code points)
    \p{Bidi_Paired_Bracket_Type: O}  \p{Bidi_Paired_Bracket_Type=Open} (60)
    \p{Bidi_Paired_Bracket_Type: Open}  (Short: \p{Bpt=O}) (60)
    \p{Blank}  \p{XPosixBlank} (18)
    \p{Blk: *}  \p{Block: *}
    \p{Block: Adlam}  (NOT \p{Adlam} NOR \p{Is_Adlam}) (96)
    \p{Block: Aegean_Numbers}  (64)
    \p{Block: Ahom}  (NOT \p{Ahom} NOR \p{Is_Ahom}) (64)
    \p{Block: Alchemical}  \p{Block=Alchemical_Symbols} (128)
    \p{Block: Alchemical_Symbols}  (Short: \p{Blk=Alchemical}) (128)
    \p{Block: Alphabetic_PF}  \p{Block=Alphabetic_Presentation_Forms} (80)
    \p{Block: Alphabetic_Presentation_Forms}  (Short: \p{Blk=AlphabeticPF}) (80)
    \p{Block: Anatolian_Hieroglyphs}  (NOT \p{Anatolian_Hieroglyphs} NOR \p{Is_Anatolian_Hieroglyphs}) (640)
    \p{Block: Ancient_Greek_Music}  \p{Block=Ancient_Greek_Musical_Notation} (80)
    \p{Block: Ancient_Greek_Musical_Notation}  (Short: \p{Blk=AncientGreekMusic}) (80)
    \p{Block: Ancient_Greek_Numbers}  (80)
    \p{Block: Ancient_Symbols}  (64)
    \p{Block: Arabic}  (NOT \p{Arabic} NOR \p{Is_Arabic}) (256)
    \p{Block: Arabic_Ext_A}  \p{Block=Arabic_Extended_A} (96)
    \p{Block: Arabic_Extended_A}  (Short: \p{Blk=ArabicExtA}) (96)
    \p{Block: Arabic_Math}  \p{Block=Arabic_Mathematical_Alphabetic_Symbols} (256)
    \p{Block: Arabic_Mathematical_Alphabetic_Symbols}  (Short: \p{Blk=ArabicMath}) (256)
    \p{Block: Arabic_PF_A}  \p{Block=Arabic_Presentation_Forms_A} (688)
    \p{Block: Arabic_PF_B}  \p{Block=Arabic_Presentation_Forms_B} (144)
    \p{Block: Arabic_Presentation_Forms_A}  (Short: \p{Blk=ArabicPFA}) (688)
    \p{Block: Arabic_Presentation_Forms_B}  (Short: \p{Blk=ArabicPFB}) (144)
    \p{Block: Arabic_Sup}  \p{Block=Arabic_Supplement} (48)
    \p{Block: Arabic_Supplement}  (Short: \p{Blk=ArabicSup}) (48)
    \p{Block: Armenian}  (NOT \p{Armenian} NOR \p{Is_Armenian}) (96)
    \p{Block: Arrows}  (112)
    \p{Block: ASCII}  \p{Block=Basic_Latin} (128)
    \p{Block: Avestan}  (NOT \p{Avestan} NOR \p{Is_Avestan}) (64)
    \p{Block: Balinese}  (NOT \p{Balinese} NOR \p{Is_Balinese}) (128)
    \p{Block: Bamum}  (NOT \p{Bamum} NOR \p{Is_Bamum}) (96)
    \p{Block: Bamum_Sup}  \p{Block=Bamum_Supplement} (576)
    \p{Block: Bamum_Supplement}  (Short: \p{Blk=BamumSup}) (576)
    \p{Block: Basic_Latin}  (Short: \p{Blk=ASCII}) (128)
    \p{Block: Bassa_Vah}  (NOT \p{Bassa_Vah} NOR \p{Is_Bassa_Vah}) (48)
    \p{Block: Batak}  (NOT \p{Batak} NOR \p{Is_Batak}) (64)
    \p{Block: Bengali}  (NOT \p{Bengali} NOR \p{Is_Bengali}) (128)
    \p{Block: Bhaiksuki}  (NOT \p{Bhaiksuki} NOR \p{Is_Bhaiksuki}) (112)
    \p{Block: Block_Elements}  (32)
    \p{Block: Bopomofo}  (NOT \p{Bopomofo} NOR \p{Is_Bopomofo}) (48)
    \p{Block: Bopomofo_Ext}  \p{Block=Bopomofo_Extended} (32)
    \p{Block: Bopomofo_Extended}  (Short: \p{Blk=BopomofoExt}) (32)
    \p{Block: Box_Drawing}  (128)
    \p{Block: Brahmi}  (NOT \p{Brahmi} NOR \p{Is_Brahmi}) (128)
    \p{Block: Braille}  \p{Block=Braille_Patterns} (256)
    \p{Block: Braille_Patterns}  (Short: \p{Blk=Braille}) (256)
    \p{Block: Buginese}  (NOT \p{Buginese} NOR \p{Is_Buginese}) (32)
    \p{Block: Buhid}  (NOT \p{Buhid} NOR \p{Is_Buhid}) (32)
    \p{Block: Byzantine_Music}  \p{Block=Byzantine_Musical_Symbols} (256)
    \p{Block: Byzantine_Musical_Symbols}  (Short: \p{Blk=ByzantineMusic}) (256)
    \p{Block: Canadian_Syllabics}  \p{Block=Unified_Canadian_Aboriginal_Syllabics} (640)
    \p{Block: Carian}  (NOT \p{Carian} NOR \p{Is_Carian}) (64)
    \p{Block: Caucasian_Albanian}  (NOT \p{Caucasian_Albanian} NOR \p{Is_Caucasian_Albanian}) (64)
    \p{Block: Chakma}  (NOT \p{Chakma} NOR \p{Is_Chakma}) (80)
    \p{Block: Cham}  (NOT \p{Cham} NOR \p{Is_Cham}) (96)
    \p{Block: Cherokee}  (NOT \p{Cherokee} NOR \p{Is_Cherokee}) (96)
    \p{Block: Cherokee_Sup}  \p{Block=Cherokee_Supplement} (80)
    \p{Block: Cherokee_Supplement}  (Short: \p{Blk=CherokeeSup}) (80)
    \p{Block: CJK}  \p{Block=CJK_Unified_Ideographs} (20_992)
    \p{Block: CJK_Compat}  \p{Block=CJK_Compatibility} (256)
    \p{Block: CJK_Compat_Forms}  \p{Block=CJK_Compatibility_Forms} (32)
    \p{Block: CJK_Compat_Ideographs}  \p{Block=CJK_Compatibility_Ideographs} (512)
    \p{Block: CJK_Compat_Ideographs_Sup}  \p{Block=CJK_Compatibility_Ideographs_Supplement} (544)
    \p{Block: CJK_Compatibility}  (Short: \p{Blk=CJKCompat}) (256)
    \p{Block: CJK_Compatibility_Forms}  (Short: \p{Blk=CJKCompatForms}) (32)
    \p{Block: CJK_Compatibility_Ideographs}  (Short: \p{Blk=CJKCompatIdeographs}) (512)
    \p{Block: CJK_Compatibility_Ideographs_Supplement}  (Short: \p{Blk=CJKCompatIdeographsSup}) (544)
    \p{Block: CJK_Ext_A}  \p{Block=CJK_Unified_Ideographs_Extension_A} (6592)
    \p{Block: CJK_Ext_B}  \p{Block=CJK_Unified_Ideographs_Extension_B} (42_720)
    \p{Block: CJK_Ext_C}  \p{Block=CJK_Unified_Ideographs_Extension_C} (4160)
    \p{Block: CJK_Ext_D}  \p{Block=CJK_Unified_Ideographs_Extension_D} (224)
    \p{Block: CJK_Ext_E}  \p{Block=CJK_Unified_Ideographs_Extension_E} (5776)
    \p{Block: CJK_Radicals_Sup}  \p{Block=CJK_Radicals_Supplement} (128)
    \p{Block: CJK_Radicals_Supplement}  (Short: \p{Blk=CJKRadicalsSup}) (128)
    \p{Block: CJK_Strokes}  (48)
    \p{Block: CJK_Symbols}  \p{Block=CJK_Symbols_And_Punctuation} (64)
    \p{Block: CJK_Symbols_And_Punctuation}  (Short: \p{Blk=CJKSymbols}) (64)
    \p{Block: CJK_Unified_Ideographs}  (Short: \p{Blk=CJK}) (20_992)
    \p{Block: CJK_Unified_Ideographs_Extension_A}  (Short: \p{Blk=CJKExtA}) (6592)
    \p{Block: CJK_Unified_Ideographs_Extension_B}  (Short: \p{Blk=CJKExtB}) (42_720)
    \p{Block: CJK_Unified_Ideographs_Extension_C}  (Short: \p{Blk=CJKExtC}) (4160)
    \p{Block: CJK_Unified_Ideographs_Extension_D}  (Short: \p{Blk=CJKExtD}) (224)
    \p{Block: CJK_Unified_Ideographs_Extension_E}  (Short: \p{Blk=CJKExtE}) (5776)
    \p{Block: Combining_Diacritical_Marks}  (Short: \p{Blk=Diacriticals}) (112)
    \p{Block: Combining_Diacritical_Marks_Extended}  (Short: \p{Blk=DiacriticalsExt}) (80)
    \p{Block: Combining_Diacritical_Marks_For_Symbols}  (Short: \p{Blk=DiacriticalsForSymbols}) (48)
    \p{Block: Combining_Diacritical_Marks_Supplement}  (Short: \p{Blk=DiacriticalsSup}) (64)
    \p{Block: Combining_Half_Marks}  (Short: \p{Blk=HalfMarks}) (16)
    \p{Block: Combining_Marks_For_Symbols}  \p{Block=Combining_Diacritical_Marks_For_Symbols} (48)
    \p{Block: Common_Indic_Number_Forms}  (Short: \p{Blk=IndicNumberForms}) (16)
    \p{Block: Compat_Jamo}  \p{Block=Hangul_Compatibility_Jamo} (96)
    \p{Block: Control_Pictures}  (64)
    \p{Block: Coptic}  (NOT \p{Coptic} NOR \p{Is_Coptic}) (128)
    \p{Block: Coptic_Epact_Numbers}  (32)
    \p{Block: Counting_Rod}  \p{Block=Counting_Rod_Numerals} (32)
    \p{Block: Counting_Rod_Numerals}  (Short: \p{Blk=CountingRod}) (32)
    \p{Block: Cuneiform}  (NOT \p{Cuneiform} NOR \p{Is_Cuneiform}) (1024)
    \p{Block: Cuneiform_Numbers}  \p{Block=Cuneiform_Numbers_And_Punctuation} (128)
    \p{Block: Cuneiform_Numbers_And_Punctuation}  (Short: \p{Blk=CuneiformNumbers}) (128)
    \p{Block: Currency_Symbols}  (48)
    \p{Block: Cypriot_Syllabary}  (64)
    \p{Block: Cyrillic}  (NOT \p{Cyrillic} NOR \p{Is_Cyrillic}) (256)
    \p{Block: Cyrillic_Ext_A}  \p{Block=Cyrillic_Extended_A} (32)
    \p{Block: Cyrillic_Ext_B}  \p{Block=Cyrillic_Extended_B} (96)
    \p{Block: Cyrillic_Ext_C}  \p{Block=Cyrillic_Extended_C} (16)
    \p{Block: Cyrillic_Extended_A}  (Short: \p{Blk=CyrillicExtA}) (32)
    \p{Block: Cyrillic_Extended_B}  (Short: \p{Blk=CyrillicExtB}) (96)
    \p{Block: Cyrillic_Extended_C}  (Short: \p{Blk=CyrillicExtC}) (16)
    \p{Block: Cyrillic_Sup}  \p{Block=Cyrillic_Supplement} (48)
    \p{Block: Cyrillic_Supplement}  (Short: \p{Blk=CyrillicSup}) (48)
    \p{Block: Cyrillic_Supplementary}  \p{Block=Cyrillic_Supplement} (48)
    \p{Block: Deseret}  (80)
    \p{Block: Devanagari}  (NOT \p{Devanagari} NOR \p{Is_Devanagari}) (128)
    \p{Block: Devanagari_Ext}  \p{Block=Devanagari_Extended} (32)
    \p{Block: Devanagari_Extended}  (Short: \p{Blk=DevanagariExt}) (32)
    \p{Block: Diacriticals}  \p{Block=Combining_Diacritical_Marks} (112)
    \p{Block: Diacriticals_Ext}  \p{Block=Combining_Diacritical_Marks_Extended} (80)
    \p{Block: Diacriticals_For_Symbols}  \p{Block=Combining_Diacritical_Marks_For_Symbols} (48)
    \p{Block: Diacriticals_Sup}  \p{Block=Combining_Diacritical_Marks_Supplement} (64)
    \p{Block: Dingbats}  (192)
    \p{Block: Domino}  \p{Block=Domino_Tiles} (112)
    \p{Block: Domino_Tiles}  (Short: \p{Blk=Domino}) (112)
    \p{Block: Duployan}  (NOT \p{Duployan} NOR \p{Is_Duployan}) (160)
    \p{Block: Early_Dynastic_Cuneiform}  (208)
    \p{Block: Egyptian_Hieroglyphs}  (NOT \p{Egyptian_Hieroglyphs} NOR \p{Is_Egyptian_Hieroglyphs}) (1072)
    \p{Block: Elbasan}  (NOT \p{Elbasan} NOR \p{Is_Elbasan}) (48)
    \p{Block: Emoticons}  (80)
    \p{Block: Enclosed_Alphanum}  \p{Block=Enclosed_Alphanumerics} (160)
    \p{Block: Enclosed_Alphanum_Sup}  \p{Block=Enclosed_Alphanumeric_Supplement} (256)
    \p{Block: Enclosed_Alphanumeric_Supplement}  (Short: \p{Blk=EnclosedAlphanumSup}) (256)
    \p{Block: Enclosed_Alphanumerics}  (Short: \p{Blk=EnclosedAlphanum}) (160)
    \p{Block: Enclosed_CJK}  \p{Block=Enclosed_CJK_Letters_And_Months} (256)
    \p{Block: Enclosed_CJK_Letters_And_Months}  (Short: \p{Blk=EnclosedCJK}) (256)
    \p{Block: Enclosed_Ideographic_Sup}  \p{Block=Enclosed_Ideographic_Supplement} (256)
    \p{Block: Enclosed_Ideographic_Supplement}  (Short: \p{Blk=EnclosedIdeographicSup}) (256)
    \p{Block: Ethiopic}  (NOT \p{Ethiopic} NOR \p{Is_Ethiopic}) (384)
    \p{Block: Ethiopic_Ext}  \p{Block=Ethiopic_Extended} (96)
    \p{Block: Ethiopic_Ext_A}  \p{Block=Ethiopic_Extended_A} (48)
    \p{Block: Ethiopic_Extended}  (Short: \p{Blk=EthiopicExt}) (96)
    \p{Block: Ethiopic_Extended_A}  (Short: \p{Blk=EthiopicExtA}) (48)
    \p{Block: Ethiopic_Sup}  \p{Block=Ethiopic_Supplement} (32)
    \p{Block: Ethiopic_Supplement}  (Short: \p{Blk=EthiopicSup}) (32)
    \p{Block: General_Punctuation}  (Short: \p{Blk=Punctuation}; NOT \p{Punct} NOR \p{Is_Punctuation}) (112)
    \p{Block: Geometric_Shapes}  (96)
    \p{Block: Geometric_Shapes_Ext}  \p{Block=Geometric_Shapes_Extended} (128)
    \p{Block: Geometric_Shapes_Extended}  (Short: \p{Blk=GeometricShapesExt}) (128)
    \p{Block: Georgian}  (NOT \p{Georgian} NOR \p{Is_Georgian}) (96)
    \p{Block: Georgian_Sup}  \p{Block=Georgian_Supplement} (48)
    \p{Block: Georgian_Supplement}  (Short: \p{Blk=GeorgianSup}) (48)
    \p{Block: Glagolitic}  (NOT \p{Glagolitic} NOR \p{Is_Glagolitic}) (96)
    \p{Block: Glagolitic_Sup}  \p{Block=Glagolitic_Supplement} (48)
    \p{Block: Glagolitic_Supplement}  (Short: \p{Blk=GlagoliticSup}) (48)
    \p{Block: Gothic}  (NOT \p{Gothic} NOR \p{Is_Gothic}) (32)
    \p{Block: Grantha}  (NOT \p{Grantha} NOR \p{Is_Grantha}) (128)
    \p{Block: Greek}  \p{Block=Greek_And_Coptic} (NOT \p{Greek} NOR \p{Is_Greek}) (144)
    \p{Block: Greek_And_Coptic}  (Short: \p{Blk=Greek}; NOT \p{Greek} NOR \p{Is_Greek}) (144)
    \p{Block: Greek_Ext}  \p{Block=Greek_Extended} (256)
    \p{Block: Greek_Extended}  (Short: \p{Blk=GreekExt}) (256)
    \p{Block: Gujarati}  (NOT \p{Gujarati} NOR \p{Is_Gujarati}) (128)
    \p{Block: Gurmukhi}  (NOT \p{Gurmukhi} NOR \p{Is_Gurmukhi}) (128)
    \p{Block: Half_And_Full_Forms}  \p{Block=Halfwidth_And_Fullwidth_Forms} (240)
    \p{Block: Half_Marks}  \p{Block=Combining_Half_Marks} (16)
    \p{Block: Halfwidth_And_Fullwidth_Forms}  (Short: \p{Blk=HalfAndFullForms}) (240)
    \p{Block: Hangul}  \p{Block=Hangul_Syllables} (NOT \p{Hangul} NOR \p{Is_Hangul}) (11_184)
    \p{Block: Hangul_Compatibility_Jamo}  (Short: \p{Blk=CompatJamo}) (96)
    \p{Block: Hangul_Jamo}  (Short: \p{Blk=Jamo}) (256)
    \p{Block: Hangul_Jamo_Extended_A}  (Short: \p{Blk=JamoExtA}) (32)
    \p{Block: Hangul_Jamo_Extended_B}  (Short: \p{Blk=JamoExtB}) (80)
    \p{Block: Hangul_Syllables}  (Short: \p{Blk=Hangul}; NOT \p{Hangul} NOR \p{Is_Hangul}) (11_184)
    \p{Block: Hanunoo}  (NOT \p{Hanunoo} NOR \p{Is_Hanunoo}) (32)
    \p{Block: Hatran}  (NOT \p{Hatran} NOR \p{Is_Hatran}) (32)
    \p{Block: Hebrew}  (NOT \p{Hebrew} NOR \p{Is_Hebrew}) (112)
    \p{Block: High_Private_Use_Surrogates}  (Short: \p{Blk=HighPUSurrogates}) (128)
    \p{Block: High_PU_Surrogates}  \p{Block=High_Private_Use_Surrogates} (128)
    \p{Block: High_Surrogates}  (896)
    \p{Block: Hiragana}  (NOT \p{Hiragana} NOR \p{Is_Hiragana}) (96)
    \p{Block: IDC}  \p{Block=Ideographic_Description_Characters} (NOT \p{ID_Continue} NOR \p{Is_IDC}) (16)
  549. \p{Block: Ideographic_Description_Characters} (Short: \p{Blk=IDC};
  550. NOT \p{ID_Continue} NOR \p{Is_IDC}) (16)
  551. \p{Block: Ideographic_Symbols} \p{Block=
  552. Ideographic_Symbols_And_Punctuation} (32)
  553. \p{Block: Ideographic_Symbols_And_Punctuation} (Short: \p{Blk=
  554. IdeographicSymbols}) (32)
  555. \p{Block: Imperial_Aramaic} (NOT \p{Imperial_Aramaic} NOR
  556. \p{Is_Imperial_Aramaic}) (32)
  557. \p{Block: Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  558. (16)
  559. \p{Block: Inscriptional_Pahlavi} (NOT \p{Inscriptional_Pahlavi}
  560. NOR \p{Is_Inscriptional_Pahlavi}) (32)
  561. \p{Block: Inscriptional_Parthian} (NOT \p{Inscriptional_Parthian}
  562. NOR \p{Is_Inscriptional_Parthian}) (32)
  563. \p{Block: IPA_Ext} \p{Block=IPA_Extensions} (96)
  564. \p{Block: IPA_Extensions} (Short: \p{Blk=IPAExt}) (96)
  565. \p{Block: Jamo} \p{Block=Hangul_Jamo} (256)
  566. \p{Block: Jamo_Ext_A} \p{Block=Hangul_Jamo_Extended_A} (32)
  567. \p{Block: Jamo_Ext_B} \p{Block=Hangul_Jamo_Extended_B} (80)
  568. \p{Block: Javanese} (NOT \p{Javanese} NOR \p{Is_Javanese}) (96)
  569. \p{Block: Kaithi} (NOT \p{Kaithi} NOR \p{Is_Kaithi}) (80)
  570. \p{Block: Kana_Sup} \p{Block=Kana_Supplement} (256)
  571. \p{Block: Kana_Supplement} (Short: \p{Blk=KanaSup}) (256)
  572. \p{Block: Kanbun} (16)
  573. \p{Block: Kangxi} \p{Block=Kangxi_Radicals} (224)
  574. \p{Block: Kangxi_Radicals} (Short: \p{Blk=Kangxi}) (224)
  575. \p{Block: Kannada} (NOT \p{Kannada} NOR \p{Is_Kannada}) (128)
  576. \p{Block: Katakana} (NOT \p{Katakana} NOR \p{Is_Katakana}) (96)
  577. \p{Block: Katakana_Ext} \p{Block=Katakana_Phonetic_Extensions} (16)
  578. \p{Block: Katakana_Phonetic_Extensions} (Short: \p{Blk=
  579. KatakanaExt}) (16)
  580. \p{Block: Kayah_Li} (48)
  581. \p{Block: Kharoshthi} (NOT \p{Kharoshthi} NOR \p{Is_Kharoshthi})
  582. (96)
  583. \p{Block: Khmer} (NOT \p{Khmer} NOR \p{Is_Khmer}) (128)
  584. \p{Block: Khmer_Symbols} (32)
  585. \p{Block: Khojki} (NOT \p{Khojki} NOR \p{Is_Khojki}) (80)
  586. \p{Block: Khudawadi} (NOT \p{Khudawadi} NOR \p{Is_Khudawadi})
  587. (80)
  588. \p{Block: Lao} (NOT \p{Lao} NOR \p{Is_Lao}) (128)
  589. \p{Block: Latin_1} \p{Block=Latin_1_Supplement} (128)
  590. \p{Block: Latin_1_Sup} \p{Block=Latin_1_Supplement} (128)
  591. \p{Block: Latin_1_Supplement} (Short: \p{Blk=Latin1}) (128)
  592. \p{Block: Latin_Ext_A} \p{Block=Latin_Extended_A} (128)
  593. \p{Block: Latin_Ext_Additional} \p{Block=
  594. Latin_Extended_Additional} (256)
  595. \p{Block: Latin_Ext_B} \p{Block=Latin_Extended_B} (208)
  596. \p{Block: Latin_Ext_C} \p{Block=Latin_Extended_C} (32)
  597. \p{Block: Latin_Ext_D} \p{Block=Latin_Extended_D} (224)
  598. \p{Block: Latin_Ext_E} \p{Block=Latin_Extended_E} (64)
  599. \p{Block: Latin_Extended_A} (Short: \p{Blk=LatinExtA}) (128)
  600. \p{Block: Latin_Extended_Additional} (Short: \p{Blk=
  601. LatinExtAdditional}) (256)
  602. \p{Block: Latin_Extended_B} (Short: \p{Blk=LatinExtB}) (208)
  603. \p{Block: Latin_Extended_C} (Short: \p{Blk=LatinExtC}) (32)
  604. \p{Block: Latin_Extended_D} (Short: \p{Blk=LatinExtD}) (224)
  605. \p{Block: Latin_Extended_E} (Short: \p{Blk=LatinExtE}) (64)
  606. \p{Block: Lepcha} (NOT \p{Lepcha} NOR \p{Is_Lepcha}) (80)
  607. \p{Block: Letterlike_Symbols} (80)
  608. \p{Block: Limbu} (NOT \p{Limbu} NOR \p{Is_Limbu}) (80)
  609. \p{Block: Linear_A} (NOT \p{Linear_A} NOR \p{Is_Linear_A})
  610. (384)
  611. \p{Block: Linear_B_Ideograms} (128)
  612. \p{Block: Linear_B_Syllabary} (128)
  613. \p{Block: Lisu} (48)
  614. \p{Block: Low_Surrogates} (1024)
  615. \p{Block: Lycian} (NOT \p{Lycian} NOR \p{Is_Lycian}) (32)
  616. \p{Block: Lydian} (NOT \p{Lydian} NOR \p{Is_Lydian}) (32)
  617. \p{Block: Mahajani} (NOT \p{Mahajani} NOR \p{Is_Mahajani}) (48)
  618. \p{Block: Mahjong} \p{Block=Mahjong_Tiles} (48)
  619. \p{Block: Mahjong_Tiles} (Short: \p{Blk=Mahjong}) (48)
  620. \p{Block: Malayalam} (NOT \p{Malayalam} NOR \p{Is_Malayalam})
  621. (128)
  622. \p{Block: Mandaic} (NOT \p{Mandaic} NOR \p{Is_Mandaic}) (32)
  623. \p{Block: Manichaean} (NOT \p{Manichaean} NOR \p{Is_Manichaean})
  624. (64)
  625. \p{Block: Marchen} (NOT \p{Marchen} NOR \p{Is_Marchen}) (80)
  626. \p{Block: Math_Alphanum} \p{Block=
  627. Mathematical_Alphanumeric_Symbols} (1024)
  628. \p{Block: Math_Operators} \p{Block=Mathematical_Operators} (256)
  629. \p{Block: Mathematical_Alphanumeric_Symbols} (Short: \p{Blk=
  630. MathAlphanum}) (1024)
  631. \p{Block: Mathematical_Operators} (Short: \p{Blk=MathOperators})
  632. (256)
  633. \p{Block: Meetei_Mayek} (NOT \p{Meetei_Mayek} NOR
  634. \p{Is_Meetei_Mayek}) (64)
  635. \p{Block: Meetei_Mayek_Ext} \p{Block=Meetei_Mayek_Extensions} (32)
  636. \p{Block: Meetei_Mayek_Extensions} (Short: \p{Blk=MeeteiMayekExt})
  637. (32)
  638. \p{Block: Mende_Kikakui} (NOT \p{Mende_Kikakui} NOR
  639. \p{Is_Mende_Kikakui}) (224)
  640. \p{Block: Meroitic_Cursive} (NOT \p{Meroitic_Cursive} NOR
  641. \p{Is_Meroitic_Cursive}) (96)
  642. \p{Block: Meroitic_Hieroglyphs} (32)
  643. \p{Block: Miao} (NOT \p{Miao} NOR \p{Is_Miao}) (160)
  644. \p{Block: Misc_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows}
  645. (256)
  646. \p{Block: Misc_Math_Symbols_A} \p{Block=
  647. Miscellaneous_Mathematical_Symbols_A}
  648. (48)
  649. \p{Block: Misc_Math_Symbols_B} \p{Block=
  650. Miscellaneous_Mathematical_Symbols_B}
  651. (128)
  652. \p{Block: Misc_Pictographs} \p{Block=
  653. Miscellaneous_Symbols_And_Pictographs}
  654. (768)
  655. \p{Block: Misc_Symbols} \p{Block=Miscellaneous_Symbols} (256)
  656. \p{Block: Misc_Technical} \p{Block=Miscellaneous_Technical} (256)
  657. \p{Block: Miscellaneous_Mathematical_Symbols_A} (Short: \p{Blk=
  658. MiscMathSymbolsA}) (48)
  659. \p{Block: Miscellaneous_Mathematical_Symbols_B} (Short: \p{Blk=
  660. MiscMathSymbolsB}) (128)
  661. \p{Block: Miscellaneous_Symbols} (Short: \p{Blk=MiscSymbols}) (256)
  662. \p{Block: Miscellaneous_Symbols_And_Arrows} (Short: \p{Blk=
  663. MiscArrows}) (256)
  664. \p{Block: Miscellaneous_Symbols_And_Pictographs} (Short: \p{Blk=
  665. MiscPictographs}) (768)
  666. \p{Block: Miscellaneous_Technical} (Short: \p{Blk=MiscTechnical})
  667. (256)
  668. \p{Block: Modi} (NOT \p{Modi} NOR \p{Is_Modi}) (96)
  669. \p{Block: Modifier_Letters} \p{Block=Spacing_Modifier_Letters} (80)
  670. \p{Block: Modifier_Tone_Letters} (32)
  671. \p{Block: Mongolian} (NOT \p{Mongolian} NOR \p{Is_Mongolian})
  672. (176)
  673. \p{Block: Mongolian_Sup} \p{Block=Mongolian_Supplement} (32)
  674. \p{Block: Mongolian_Supplement} (Short: \p{Blk=MongolianSup}) (32)
  675. \p{Block: Mro} (NOT \p{Mro} NOR \p{Is_Mro}) (48)
  676. \p{Block: Multani} (NOT \p{Multani} NOR \p{Is_Multani}) (48)
  677. \p{Block: Music} \p{Block=Musical_Symbols} (256)
  678. \p{Block: Musical_Symbols} (Short: \p{Blk=Music}) (256)
  679. \p{Block: Myanmar} (NOT \p{Myanmar} NOR \p{Is_Myanmar}) (160)
  680. \p{Block: Myanmar_Ext_A} \p{Block=Myanmar_Extended_A} (32)
  681. \p{Block: Myanmar_Ext_B} \p{Block=Myanmar_Extended_B} (32)
  682. \p{Block: Myanmar_Extended_A} (Short: \p{Blk=MyanmarExtA}) (32)
  683. \p{Block: Myanmar_Extended_B} (Short: \p{Blk=MyanmarExtB}) (32)
  684. \p{Block: Nabataean} (NOT \p{Nabataean} NOR \p{Is_Nabataean})
  685. (48)
  686. \p{Block: NB} \p{Block=No_Block} (842_320 plus all
  687. above-Unicode code points)
  688. \p{Block: New_Tai_Lue} (NOT \p{New_Tai_Lue} NOR
  689. \p{Is_New_Tai_Lue}) (96)
  690. \p{Block: Newa} (NOT \p{Newa} NOR \p{Is_Newa}) (128)
  691. \p{Block: NKo} (NOT \p{Nko} NOR \p{Is_NKo}) (64)
  692. \p{Block: No_Block} (Short: \p{Blk=NB}) (842_320 plus all
  693. above-Unicode code points)
  694. \p{Block: Number_Forms} (64)
  695. \p{Block: OCR} \p{Block=Optical_Character_Recognition}
  696. (32)
  697. \p{Block: Ogham} (NOT \p{Ogham} NOR \p{Is_Ogham}) (32)
  698. \p{Block: Ol_Chiki} (48)
  699. \p{Block: Old_Hungarian} (NOT \p{Old_Hungarian} NOR
  700. \p{Is_Old_Hungarian}) (128)
  701. \p{Block: Old_Italic} (NOT \p{Old_Italic} NOR \p{Is_Old_Italic})
  702. (48)
  703. \p{Block: Old_North_Arabian} (32)
  704. \p{Block: Old_Permic} (NOT \p{Old_Permic} NOR \p{Is_Old_Permic})
  705. (48)
  706. \p{Block: Old_Persian} (NOT \p{Old_Persian} NOR
  707. \p{Is_Old_Persian}) (64)
  708. \p{Block: Old_South_Arabian} (32)
  709. \p{Block: Old_Turkic} (NOT \p{Old_Turkic} NOR \p{Is_Old_Turkic})
  710. (80)
  711. \p{Block: Optical_Character_Recognition} (Short: \p{Blk=OCR}) (32)
  712. \p{Block: Oriya} (NOT \p{Oriya} NOR \p{Is_Oriya}) (128)
  713. \p{Block: Ornamental_Dingbats} (48)
  714. \p{Block: Osage} (NOT \p{Osage} NOR \p{Is_Osage}) (80)
  715. \p{Block: Osmanya} (NOT \p{Osmanya} NOR \p{Is_Osmanya}) (48)
  716. \p{Block: Pahawh_Hmong} (NOT \p{Pahawh_Hmong} NOR
  717. \p{Is_Pahawh_Hmong}) (144)
  718. \p{Block: Palmyrene} (32)
  719. \p{Block: Pau_Cin_Hau} (NOT \p{Pau_Cin_Hau} NOR
  720. \p{Is_Pau_Cin_Hau}) (64)
  721. \p{Block: Phags_Pa} (NOT \p{Phags_Pa} NOR \p{Is_Phags_Pa}) (64)
  722. \p{Block: Phaistos} \p{Block=Phaistos_Disc} (48)
  723. \p{Block: Phaistos_Disc} (Short: \p{Blk=Phaistos}) (48)
  724. \p{Block: Phoenician} (NOT \p{Phoenician} NOR \p{Is_Phoenician})
  725. (32)
  726. \p{Block: Phonetic_Ext} \p{Block=Phonetic_Extensions} (128)
  727. \p{Block: Phonetic_Ext_Sup} \p{Block=
  728. Phonetic_Extensions_Supplement} (64)
  729. \p{Block: Phonetic_Extensions} (Short: \p{Blk=PhoneticExt}) (128)
  730. \p{Block: Phonetic_Extensions_Supplement} (Short: \p{Blk=
  731. PhoneticExtSup}) (64)
  732. \p{Block: Playing_Cards} (96)
  733. \p{Block: Private_Use} \p{Block=Private_Use_Area} (NOT
  734. \p{Private_Use} NOR \p{Is_Private_Use})
  735. (6400)
  736. \p{Block: Private_Use_Area} (Short: \p{Blk=PUA}; NOT
  737. \p{Private_Use} NOR \p{Is_Private_Use})
  738. (6400)
  739. \p{Block: Psalter_Pahlavi} (NOT \p{Psalter_Pahlavi} NOR
  740. \p{Is_Psalter_Pahlavi}) (48)
  741. \p{Block: PUA} \p{Block=Private_Use_Area} (NOT
  742. \p{Private_Use} NOR \p{Is_Private_Use})
  743. (6400)
  744. \p{Block: Punctuation} \p{Block=General_Punctuation} (NOT
  745. \p{Punct} NOR \p{Is_Punctuation}) (112)
  746. \p{Block: Rejang} (NOT \p{Rejang} NOR \p{Is_Rejang}) (48)
  747. \p{Block: Rumi} \p{Block=Rumi_Numeral_Symbols} (32)
  748. \p{Block: Rumi_Numeral_Symbols} (Short: \p{Blk=Rumi}) (32)
  749. \p{Block: Runic} (NOT \p{Runic} NOR \p{Is_Runic}) (96)
  750. \p{Block: Samaritan} (NOT \p{Samaritan} NOR \p{Is_Samaritan})
  751. (64)
  752. \p{Block: Saurashtra} (NOT \p{Saurashtra} NOR \p{Is_Saurashtra})
  753. (96)
  754. \p{Block: Sharada} (NOT \p{Sharada} NOR \p{Is_Sharada}) (96)
  755. \p{Block: Shavian} (48)
  756. \p{Block: Shorthand_Format_Controls} (16)
  757. \p{Block: Siddham} (NOT \p{Siddham} NOR \p{Is_Siddham}) (128)
  758. \p{Block: Sinhala} (NOT \p{Sinhala} NOR \p{Is_Sinhala}) (128)
  759. \p{Block: Sinhala_Archaic_Numbers} (32)
  760. \p{Block: Small_Form_Variants} (Short: \p{Blk=SmallForms}) (32)
  761. \p{Block: Small_Forms} \p{Block=Small_Form_Variants} (32)
  762. \p{Block: Sora_Sompeng} (NOT \p{Sora_Sompeng} NOR
  763. \p{Is_Sora_Sompeng}) (48)
  764. \p{Block: Spacing_Modifier_Letters} (Short: \p{Blk=
  765. ModifierLetters}) (80)
  766. \p{Block: Specials} (16)
  767. \p{Block: Sundanese} (NOT \p{Sundanese} NOR \p{Is_Sundanese})
  768. (64)
  769. \p{Block: Sundanese_Sup} \p{Block=Sundanese_Supplement} (16)
  770. \p{Block: Sundanese_Supplement} (Short: \p{Blk=SundaneseSup}) (16)
  771. \p{Block: Sup_Arrows_A} \p{Block=Supplemental_Arrows_A} (16)
  772. \p{Block: Sup_Arrows_B} \p{Block=Supplemental_Arrows_B} (128)
  773. \p{Block: Sup_Arrows_C} \p{Block=Supplemental_Arrows_C} (256)
  774. \p{Block: Sup_Math_Operators} \p{Block=
  775. Supplemental_Mathematical_Operators}
  776. (256)
  777. \p{Block: Sup_PUA_A} \p{Block=Supplementary_Private_Use_Area_A}
  778. (65_536)
  779. \p{Block: Sup_PUA_B} \p{Block=Supplementary_Private_Use_Area_B}
  780. (65_536)
  781. \p{Block: Sup_Punctuation} \p{Block=Supplemental_Punctuation} (128)
  782. \p{Block: Sup_Symbols_And_Pictographs} \p{Block=
  783. Supplemental_Symbols_And_Pictographs}
  784. (256)
  785. \p{Block: Super_And_Sub} \p{Block=Superscripts_And_Subscripts} (48)
  786. \p{Block: Superscripts_And_Subscripts} (Short: \p{Blk=
  787. SuperAndSub}) (48)
  788. \p{Block: Supplemental_Arrows_A} (Short: \p{Blk=SupArrowsA}) (16)
  789. \p{Block: Supplemental_Arrows_B} (Short: \p{Blk=SupArrowsB}) (128)
  790. \p{Block: Supplemental_Arrows_C} (Short: \p{Blk=SupArrowsC}) (256)
  791. \p{Block: Supplemental_Mathematical_Operators} (Short: \p{Blk=
  792. SupMathOperators}) (256)
  793. \p{Block: Supplemental_Punctuation} (Short: \p{Blk=
  794. SupPunctuation}) (128)
  795. \p{Block: Supplemental_Symbols_And_Pictographs} (Short: \p{Blk=
  796. SupSymbolsAndPictographs}) (256)
  797. \p{Block: Supplementary_Private_Use_Area_A} (Short: \p{Blk=
  798. SupPUAA}) (65_536)
  799. \p{Block: Supplementary_Private_Use_Area_B} (Short: \p{Blk=
  800. SupPUAB}) (65_536)
  801. \p{Block: Sutton_SignWriting} (688)
  802. \p{Block: Syloti_Nagri} (NOT \p{Syloti_Nagri} NOR
  803. \p{Is_Syloti_Nagri}) (48)
  804. \p{Block: Syriac} (NOT \p{Syriac} NOR \p{Is_Syriac}) (80)
  805. \p{Block: Tagalog} (NOT \p{Tagalog} NOR \p{Is_Tagalog}) (32)
  806. \p{Block: Tagbanwa} (NOT \p{Tagbanwa} NOR \p{Is_Tagbanwa}) (32)
  807. \p{Block: Tags} (128)
  808. \p{Block: Tai_Le} (NOT \p{Tai_Le} NOR \p{Is_Tai_Le}) (48)
  809. \p{Block: Tai_Tham} (NOT \p{Tai_Tham} NOR \p{Is_Tai_Tham})
  810. (144)
  811. \p{Block: Tai_Viet} (NOT \p{Tai_Viet} NOR \p{Is_Tai_Viet}) (96)
  812. \p{Block: Tai_Xuan_Jing} \p{Block=Tai_Xuan_Jing_Symbols} (96)
  813. \p{Block: Tai_Xuan_Jing_Symbols} (Short: \p{Blk=TaiXuanJing}) (96)
  814. \p{Block: Takri} (NOT \p{Takri} NOR \p{Is_Takri}) (80)
  815. \p{Block: Tamil} (NOT \p{Tamil} NOR \p{Is_Tamil}) (128)
  816. \p{Block: Tangut} (NOT \p{Tangut} NOR \p{Is_Tangut}) (6144)
  817. \p{Block: Tangut_Components} (768)
  818. \p{Block: Telugu} (NOT \p{Telugu} NOR \p{Is_Telugu}) (128)
  819. \p{Block: Thaana} (NOT \p{Thaana} NOR \p{Is_Thaana}) (64)
  820. \p{Block: Thai} (NOT \p{Thai} NOR \p{Is_Thai}) (128)
  821. \p{Block: Tibetan} (NOT \p{Tibetan} NOR \p{Is_Tibetan}) (256)
  822. \p{Block: Tifinagh} (NOT \p{Tifinagh} NOR \p{Is_Tifinagh}) (80)
  823. \p{Block: Tirhuta} (NOT \p{Tirhuta} NOR \p{Is_Tirhuta}) (96)
  824. \p{Block: Transport_And_Map} \p{Block=Transport_And_Map_Symbols}
  825. (128)
  826. \p{Block: Transport_And_Map_Symbols} (Short: \p{Blk=
  827. TransportAndMap}) (128)
  828. \p{Block: UCAS} \p{Block=
  829. Unified_Canadian_Aboriginal_Syllabics}
  830. (640)
  831. \p{Block: UCAS_Ext} \p{Block=
  832. Unified_Canadian_Aboriginal_Syllabics_Extended}
  833. (80)
  834. \p{Block: Ugaritic} (NOT \p{Ugaritic} NOR \p{Is_Ugaritic}) (32)
  835. \p{Block: Unified_Canadian_Aboriginal_Syllabics} (Short: \p{Blk=
  836. UCAS}) (640)
  837. \p{Block: Unified_Canadian_Aboriginal_Syllabics_Extended} (Short:
  838. \p{Blk=UCASExt}) (80)
  839. \p{Block: Vai} (NOT \p{Vai} NOR \p{Is_Vai}) (320)
  840. \p{Block: Variation_Selectors} (Short: \p{Blk=VS}; NOT
  841. \p{Variation_Selector} NOR \p{Is_VS})
  842. (16)
  843. \p{Block: Variation_Selectors_Supplement} (Short: \p{Blk=VSSup})
  844. (240)
  845. \p{Block: Vedic_Ext} \p{Block=Vedic_Extensions} (48)
  846. \p{Block: Vedic_Extensions} (Short: \p{Blk=VedicExt}) (48)
  847. \p{Block: Vertical_Forms} (16)
  848. \p{Block: VS} \p{Block=Variation_Selectors} (NOT
  849. \p{Variation_Selector} NOR \p{Is_VS})
  850. (16)
  851. \p{Block: VS_Sup} \p{Block=Variation_Selectors_Supplement}
  852. (240)
  853. \p{Block: Warang_Citi} (NOT \p{Warang_Citi} NOR
  854. \p{Is_Warang_Citi}) (96)
  855. \p{Block: Yi_Radicals} (64)
  856. \p{Block: Yi_Syllables} (1168)
  857. \p{Block: Yijing} \p{Block=Yijing_Hexagram_Symbols} (64)
  858. \p{Block: Yijing_Hexagram_Symbols} (Short: \p{Blk=Yijing}) (64)
  859. X \p{Block_Elements} \p{Block=Block_Elements} (32)
  860. \p{Bopo} \p{Bopomofo} (= \p{Script_Extensions=
  861. Bopomofo}) (NOT \p{Block=Bopomofo}) (110)
  862. \p{Bopomofo} \p{Script_Extensions=Bopomofo} (Short:
  863. \p{Bopo}; NOT \p{Block=Bopomofo}) (110)
  864. X \p{Bopomofo_Ext} \p{Bopomofo_Extended} (= \p{Block=
  865. Bopomofo_Extended}) (32)
  866. X \p{Bopomofo_Extended} \p{Block=Bopomofo_Extended} (Short:
  867. \p{InBopomofoExt}) (32)
  868. X \p{Box_Drawing} \p{Block=Box_Drawing} (128)
  869. \p{Bpt: *} \p{Bidi_Paired_Bracket_Type: *}
  870. \p{Brah} \p{Brahmi} (= \p{Script_Extensions=
  871. Brahmi}) (NOT \p{Block=Brahmi}) (109)
  872. \p{Brahmi} \p{Script_Extensions=Brahmi} (Short:
  873. \p{Brah}; NOT \p{Block=Brahmi}) (109)
  874. \p{Brai} \p{Braille} (= \p{Script_Extensions=
  875. Braille}) (256)
  876. \p{Braille} \p{Script_Extensions=Braille} (Short:
  877. \p{Brai}) (256)
  878. X \p{Braille_Patterns} \p{Block=Braille_Patterns} (Short:
  879. \p{InBraille}) (256)
  880. \p{Bugi} \p{Buginese} (= \p{Script_Extensions=
  881. Buginese}) (NOT \p{Block=Buginese}) (31)
  882. \p{Buginese} \p{Script_Extensions=Buginese} (Short:
  883. \p{Bugi}; NOT \p{Block=Buginese}) (31)
  884. \p{Buhd} \p{Buhid} (= \p{Script_Extensions=Buhid})
  885. (NOT \p{Block=Buhid}) (22)
  886. \p{Buhid} \p{Script_Extensions=Buhid} (Short:
  887. \p{Buhd}; NOT \p{Block=Buhid}) (22)
  888. X \p{Byzantine_Music} \p{Byzantine_Musical_Symbols} (= \p{Block=
  889. Byzantine_Musical_Symbols}) (256)
  890. X \p{Byzantine_Musical_Symbols} \p{Block=Byzantine_Musical_Symbols}
  891. (Short: \p{InByzantineMusic}) (256)
  892. \p{C} \pC \p{Other} (= \p{General_Category=Other})
  893. (986_091 plus all above-Unicode code
  894. points)
  895. \p{Cakm} \p{Chakma} (= \p{Script_Extensions=
  896. Chakma}) (NOT \p{Block=Chakma}) (87)
  897. \p{Canadian_Aboriginal} \p{Script_Extensions=Canadian_Aboriginal}
  898. (Short: \p{Cans}) (710)
  899. X \p{Canadian_Syllabics} \p{Unified_Canadian_Aboriginal_Syllabics}
  900. (= \p{Block=
  901. Unified_Canadian_Aboriginal_Syllabics})
  902. (640)
  903. T \p{Canonical_Combining_Class: 0} \p{Canonical_Combining_Class=
  904. Not_Reordered} (1_113_298 plus all
  905. above-Unicode code points)
  906. T \p{Canonical_Combining_Class: 1} \p{Canonical_Combining_Class=
  907. Overlay} (32)
  908. T \p{Canonical_Combining_Class: 7} \p{Canonical_Combining_Class=
  909. Nukta} (22)
  910. T \p{Canonical_Combining_Class: 8} \p{Canonical_Combining_Class=
  911. Kana_Voicing} (2)
  912. T \p{Canonical_Combining_Class: 9} \p{Canonical_Combining_Class=
  913. Virama} (47)
  914. T \p{Canonical_Combining_Class: 10} \p{Canonical_Combining_Class=
  915. CCC10} (1)
  916. T \p{Canonical_Combining_Class: 11} \p{Canonical_Combining_Class=
  917. CCC11} (1)
  918. T \p{Canonical_Combining_Class: 12} \p{Canonical_Combining_Class=
  919. CCC12} (1)
  920. T \p{Canonical_Combining_Class: 13} \p{Canonical_Combining_Class=
  921. CCC13} (1)
  922. T \p{Canonical_Combining_Class: 14} \p{Canonical_Combining_Class=
  923. CCC14} (1)
  924. T \p{Canonical_Combining_Class: 15} \p{Canonical_Combining_Class=
  925. CCC15} (1)
  926. T \p{Canonical_Combining_Class: 16} \p{Canonical_Combining_Class=
  927. CCC16} (1)
  928. T \p{Canonical_Combining_Class: 17} \p{Canonical_Combining_Class=
  929. CCC17} (1)
  930. T \p{Canonical_Combining_Class: 18} \p{Canonical_Combining_Class=
  931. CCC18} (2)
  932. T \p{Canonical_Combining_Class: 19} \p{Canonical_Combining_Class=
  933. CCC19} (2)
  934. T \p{Canonical_Combining_Class: 20} \p{Canonical_Combining_Class=
  935. CCC20} (1)
  936. T \p{Canonical_Combining_Class: 21} \p{Canonical_Combining_Class=
  937. CCC21} (1)
  938. T \p{Canonical_Combining_Class: 22} \p{Canonical_Combining_Class=
  939. CCC22} (1)
  940. T \p{Canonical_Combining_Class: 23} \p{Canonical_Combining_Class=
  941. CCC23} (1)
  942. T \p{Canonical_Combining_Class: 24} \p{Canonical_Combining_Class=
  943. CCC24} (1)
  944. T \p{Canonical_Combining_Class: 25} \p{Canonical_Combining_Class=
  945. CCC25} (1)
  946. T \p{Canonical_Combining_Class: 26} \p{Canonical_Combining_Class=
  947. CCC26} (1)
  948. T \p{Canonical_Combining_Class: 27} \p{Canonical_Combining_Class=
  949. CCC27} (2)
  950. T \p{Canonical_Combining_Class: 28} \p{Canonical_Combining_Class=
  951. CCC28} (2)
  952. T \p{Canonical_Combining_Class: 29} \p{Canonical_Combining_Class=
  953. CCC29} (2)
  954. T \p{Canonical_Combining_Class: 30} \p{Canonical_Combining_Class=
  955. CCC30} (2)
  956. T \p{Canonical_Combining_Class: 31} \p{Canonical_Combining_Class=
  957. CCC31} (2)
  958. T \p{Canonical_Combining_Class: 32} \p{Canonical_Combining_Class=
  959. CCC32} (2)
  960. T \p{Canonical_Combining_Class: 33} \p{Canonical_Combining_Class=
  961. CCC33} (1)
  962. T \p{Canonical_Combining_Class: 34} \p{Canonical_Combining_Class=
  963. CCC34} (1)
  964. T \p{Canonical_Combining_Class: 35} \p{Canonical_Combining_Class=
  965. CCC35} (1)
  966. T \p{Canonical_Combining_Class: 36} \p{Canonical_Combining_Class=
  967. CCC36} (1)
  968. T \p{Canonical_Combining_Class: 84} \p{Canonical_Combining_Class=
  969. CCC84} (1)
  970. T \p{Canonical_Combining_Class: 91} \p{Canonical_Combining_Class=
  971. CCC91} (1)
  972. T \p{Canonical_Combining_Class: 103} \p{Canonical_Combining_Class=
  973. CCC103} (2)
  974. T \p{Canonical_Combining_Class: 107} \p{Canonical_Combining_Class=
  975. CCC107} (4)
  976. T \p{Canonical_Combining_Class: 118} \p{Canonical_Combining_Class=
  977. CCC118} (2)
  978. T \p{Canonical_Combining_Class: 122} \p{Canonical_Combining_Class=
  979. CCC122} (4)
  980. T \p{Canonical_Combining_Class: 129} \p{Canonical_Combining_Class=
  981. CCC129} (1)
  982. T \p{Canonical_Combining_Class: 130} \p{Canonical_Combining_Class=
  983. CCC130} (6)
  984. T \p{Canonical_Combining_Class: 132} \p{Canonical_Combining_Class=
  985. CCC132} (1)
  986. T \p{Canonical_Combining_Class: 133} \p{Canonical_Combining_Class=
  987. CCC133} (0)
  988. T \p{Canonical_Combining_Class: 200} \p{Canonical_Combining_Class=
  989. Attached_Below_Left} (0)
  990. T \p{Canonical_Combining_Class: 202} \p{Canonical_Combining_Class=
  991. Attached_Below} (5)
  992. T \p{Canonical_Combining_Class: 214} \p{Canonical_Combining_Class=
  993. Attached_Above} (1)
  994. T \p{Canonical_Combining_Class: 216} \p{Canonical_Combining_Class=
  995. Attached_Above_Right} (9)
  996. T \p{Canonical_Combining_Class: 218} \p{Canonical_Combining_Class=
  997. Below_Left} (1)
  998. T \p{Canonical_Combining_Class: 220} \p{Canonical_Combining_Class=
  999. Below} (153)
  1000. T \p{Canonical_Combining_Class: 222} \p{Canonical_Combining_Class=
  1001. Below_Right} (4)
  1002. T \p{Canonical_Combining_Class: 224} \p{Canonical_Combining_Class=
  1003. Left} (2)
  1004. T \p{Canonical_Combining_Class: 226} \p{Canonical_Combining_Class=
  1005. Right} (1)
  1006. T \p{Canonical_Combining_Class: 228} \p{Canonical_Combining_Class=
  1007. Above_Left} (3)
  1008. T \p{Canonical_Combining_Class: 230} \p{Canonical_Combining_Class=
  1009. Above} (461)
  1010. T \p{Canonical_Combining_Class: 232} \p{Canonical_Combining_Class=
  1011. Above_Right} (4)
  1012. T \p{Canonical_Combining_Class: 233} \p{Canonical_Combining_Class=
  1013. Double_Below} (4)
  1014. T \p{Canonical_Combining_Class: 234} \p{Canonical_Combining_Class=
  1015. Double_Above} (5)
  1016. T \p{Canonical_Combining_Class: 240} \p{Canonical_Combining_Class=
  1017. Iota_Subscript} (1)
  1018. \p{Canonical_Combining_Class: A} \p{Canonical_Combining_Class=
  1019. Above} (461)
  1020. \p{Canonical_Combining_Class: Above} (Short: \p{Ccc=A}) (461)
  1021. \p{Canonical_Combining_Class: Above_Left} (Short: \p{Ccc=AL}) (3)
  1022. \p{Canonical_Combining_Class: Above_Right} (Short: \p{Ccc=AR}) (4)
  1023. \p{Canonical_Combining_Class: AL} \p{Canonical_Combining_Class=
  1024. Above_Left} (3)
  1025. \p{Canonical_Combining_Class: AR} \p{Canonical_Combining_Class=
  1026. Above_Right} (4)
  1027. \p{Canonical_Combining_Class: ATA} \p{Canonical_Combining_Class=
  1028. Attached_Above} (1)
  1029. \p{Canonical_Combining_Class: ATAR} \p{Canonical_Combining_Class=
  1030. Attached_Above_Right} (9)
  1031. \p{Canonical_Combining_Class: ATB} \p{Canonical_Combining_Class=
  1032. Attached_Below} (5)
  1033. \p{Canonical_Combining_Class: ATBL} \p{Canonical_Combining_Class=
  1034. Attached_Below_Left} (0)
  1035. \p{Canonical_Combining_Class: Attached_Above} (Short: \p{Ccc=ATA})
  1036. (1)
  1037. \p{Canonical_Combining_Class: Attached_Above_Right} (Short:
  1038. \p{Ccc=ATAR}) (9)
  1039. \p{Canonical_Combining_Class: Attached_Below} (Short: \p{Ccc=ATB})
  1040. (5)
  1041. \p{Canonical_Combining_Class: Attached_Below_Left} (Short: \p{Ccc=
  1042. ATBL}) (0)
  1043. \p{Canonical_Combining_Class: B} \p{Canonical_Combining_Class=
  1044. Below} (153)
  1045. \p{Canonical_Combining_Class: Below} (Short: \p{Ccc=B}) (153)
  1046. \p{Canonical_Combining_Class: Below_Left} (Short: \p{Ccc=BL}) (1)
  1047. \p{Canonical_Combining_Class: Below_Right} (Short: \p{Ccc=BR}) (4)
  1048. \p{Canonical_Combining_Class: BL} \p{Canonical_Combining_Class=
  1049. Below_Left} (1)
  1050. \p{Canonical_Combining_Class: BR} \p{Canonical_Combining_Class=
  1051. Below_Right} (4)
  1052. \p{Canonical_Combining_Class: CCC10} (Short: \p{Ccc=CCC10}) (1)
  1053. \p{Canonical_Combining_Class: CCC103} (Short: \p{Ccc=CCC103}) (2)
  1054. \p{Canonical_Combining_Class: CCC107} (Short: \p{Ccc=CCC107}) (4)
  1055. \p{Canonical_Combining_Class: CCC11} (Short: \p{Ccc=CCC11}) (1)
  1056. \p{Canonical_Combining_Class: CCC118} (Short: \p{Ccc=CCC118}) (2)
  1057. \p{Canonical_Combining_Class: CCC12} (Short: \p{Ccc=CCC12}) (1)
  1058. \p{Canonical_Combining_Class: CCC122} (Short: \p{Ccc=CCC122}) (4)
  1059. \p{Canonical_Combining_Class: CCC129} (Short: \p{Ccc=CCC129}) (1)
  1060. \p{Canonical_Combining_Class: CCC13} (Short: \p{Ccc=CCC13}) (1)
  1061. \p{Canonical_Combining_Class: CCC130} (Short: \p{Ccc=CCC130}) (6)
  1062. \p{Canonical_Combining_Class: CCC132} (Short: \p{Ccc=CCC132}) (1)
  1063. \p{Canonical_Combining_Class: CCC133} (Short: \p{Ccc=CCC133}) (0)
  1064. \p{Canonical_Combining_Class: CCC14} (Short: \p{Ccc=CCC14}) (1)
  1065. \p{Canonical_Combining_Class: CCC15} (Short: \p{Ccc=CCC15}) (1)
  1066. \p{Canonical_Combining_Class: CCC16} (Short: \p{Ccc=CCC16}) (1)
  1067. \p{Canonical_Combining_Class: CCC17} (Short: \p{Ccc=CCC17}) (1)
  1068. \p{Canonical_Combining_Class: CCC18} (Short: \p{Ccc=CCC18}) (2)
  1069. \p{Canonical_Combining_Class: CCC19} (Short: \p{Ccc=CCC19}) (2)
  1070. \p{Canonical_Combining_Class: CCC20} (Short: \p{Ccc=CCC20}) (1)
  1071. \p{Canonical_Combining_Class: CCC21} (Short: \p{Ccc=CCC21}) (1)
  1072. \p{Canonical_Combining_Class: CCC22} (Short: \p{Ccc=CCC22}) (1)
  1073. \p{Canonical_Combining_Class: CCC23} (Short: \p{Ccc=CCC23}) (1)
  1074. \p{Canonical_Combining_Class: CCC24} (Short: \p{Ccc=CCC24}) (1)
  1075. \p{Canonical_Combining_Class: CCC25} (Short: \p{Ccc=CCC25}) (1)
  1076. \p{Canonical_Combining_Class: CCC26} (Short: \p{Ccc=CCC26}) (1)
  1077. \p{Canonical_Combining_Class: CCC27} (Short: \p{Ccc=CCC27}) (2)
  1078. \p{Canonical_Combining_Class: CCC28} (Short: \p{Ccc=CCC28}) (2)
  1079. \p{Canonical_Combining_Class: CCC29} (Short: \p{Ccc=CCC29}) (2)
  1080. \p{Canonical_Combining_Class: CCC30} (Short: \p{Ccc=CCC30}) (2)
  1081. \p{Canonical_Combining_Class: CCC31} (Short: \p{Ccc=CCC31}) (2)
  1082. \p{Canonical_Combining_Class: CCC32} (Short: \p{Ccc=CCC32}) (2)
  1083. \p{Canonical_Combining_Class: CCC33} (Short: \p{Ccc=CCC33}) (1)
  1084. \p{Canonical_Combining_Class: CCC34} (Short: \p{Ccc=CCC34}) (1)
  1085. \p{Canonical_Combining_Class: CCC35} (Short: \p{Ccc=CCC35}) (1)
  1086. \p{Canonical_Combining_Class: CCC36} (Short: \p{Ccc=CCC36}) (1)
  1087. \p{Canonical_Combining_Class: CCC84} (Short: \p{Ccc=CCC84}) (1)
  1088. \p{Canonical_Combining_Class: CCC91} (Short: \p{Ccc=CCC91}) (1)
  1089. \p{Canonical_Combining_Class: DA} \p{Canonical_Combining_Class=
  1090. Double_Above} (5)
  1091. \p{Canonical_Combining_Class: DB} \p{Canonical_Combining_Class=
  1092. Double_Below} (4)
  1093. \p{Canonical_Combining_Class: Double_Above} (Short: \p{Ccc=DA}) (5)
  1094. \p{Canonical_Combining_Class: Double_Below} (Short: \p{Ccc=DB}) (4)
  1095. \p{Canonical_Combining_Class: Iota_Subscript} (Short: \p{Ccc=IS})
  1096. (1)
  1097. \p{Canonical_Combining_Class: IS} \p{Canonical_Combining_Class=
  1098. Iota_Subscript} (1)
  1099. \p{Canonical_Combining_Class: Kana_Voicing} (Short: \p{Ccc=KV}) (2)
  1100. \p{Canonical_Combining_Class: KV} \p{Canonical_Combining_Class=
  1101. Kana_Voicing} (2)
  1102. \p{Canonical_Combining_Class: L} \p{Canonical_Combining_Class=
  1103. Left} (2)
  1104. \p{Canonical_Combining_Class: Left} (Short: \p{Ccc=L}) (2)
  1105. \p{Canonical_Combining_Class: NK} \p{Canonical_Combining_Class=
  1106. Nukta} (22)
  1107. \p{Canonical_Combining_Class: Not_Reordered} (Short: \p{Ccc=NR})
  1108. (1_113_298 plus all above-Unicode code
  1109. points)
  1110. \p{Canonical_Combining_Class: NR} \p{Canonical_Combining_Class=
  1111. Not_Reordered} (1_113_298 plus all
  1112. above-Unicode code points)
  1113. \p{Canonical_Combining_Class: Nukta} (Short: \p{Ccc=NK}) (22)
  1114. \p{Canonical_Combining_Class: OV} \p{Canonical_Combining_Class=
  1115. Overlay} (32)
  1116. \p{Canonical_Combining_Class: Overlay} (Short: \p{Ccc=OV}) (32)
  1117. \p{Canonical_Combining_Class: R} \p{Canonical_Combining_Class=
  1118. Right} (1)
  1119. \p{Canonical_Combining_Class: Right} (Short: \p{Ccc=R}) (1)
  1120. \p{Canonical_Combining_Class: Virama} (Short: \p{Ccc=VR}) (47)
  1121. \p{Canonical_Combining_Class: VR} \p{Canonical_Combining_Class=
  1122. Virama} (47)
  1123. \p{Cans} \p{Canadian_Aboriginal} (=
  1124. \p{Script_Extensions=
  1125. Canadian_Aboriginal}) (710)
  1126. \p{Cari} \p{Carian} (= \p{Script_Extensions=
  1127. Carian}) (NOT \p{Block=Carian}) (49)
  1128. \p{Carian} \p{Script_Extensions=Carian} (Short:
  1129. \p{Cari}; NOT \p{Block=Carian}) (49)
  1130. \p{Case_Ignorable} \p{Case_Ignorable=Y} (Short: \p{CI}) (2240)
  1131. \p{Case_Ignorable: N*} (Short: \p{CI=N}, \P{CI}) (1_111_872 plus
  1132. all above-Unicode code points)
  1133. \p{Case_Ignorable: Y*} (Short: \p{CI=Y}, \p{CI}) (2240)
  1134. \p{Cased} \p{Cased=Y} (4105)
  1135. \p{Cased: N*} (Single: \P{Cased}) (1_110_007 plus all
  1136. above-Unicode code points)
  1137. \p{Cased: Y*} (Single: \p{Cased}) (4105)
  1138. \p{Cased_Letter} \p{General_Category=Cased_Letter} (Short:
  1139. \p{LC}) (3796)
  1140. \p{Category: *} \p{General_Category: *}
  1141. \p{Caucasian_Albanian} \p{Script_Extensions=Caucasian_Albanian}
  1142. (Short: \p{Aghb}; NOT \p{Block=
  1143. Caucasian_Albanian}) (53)
  1144. \p{Cc} \p{XPosixCntrl} (= \p{General_Category=
  1145. Control}) (65)
  1146. \p{Ccc: *} \p{Canonical_Combining_Class: *}
  1147. \p{CE} \p{Composition_Exclusion} (=
  1148. \p{Composition_Exclusion=Y}) (81)
  1149. \p{CE: *} \p{Composition_Exclusion: *}
  1150. \p{Cf} \p{Format} (= \p{General_Category=Format})
  1151. (151)
  1152. \p{Chakma} \p{Script_Extensions=Chakma} (Short:
  1153. \p{Cakm}; NOT \p{Block=Chakma}) (87)
  1154. \p{Cham} \p{Script_Extensions=Cham} (NOT \p{Block=
  1155. Cham}) (83)
  1156. \p{Changes_When_Casefolded} \p{Changes_When_Casefolded=Y} (Short:
  1157. \p{CWCF}) (1377)
  1158. \p{Changes_When_Casefolded: N*} (Short: \p{CWCF=N}, \P{CWCF})
  1159. (1_112_735 plus all above-Unicode code
  1160. points)
  1161. \p{Changes_When_Casefolded: Y*} (Short: \p{CWCF=Y}, \p{CWCF})
  1162. (1377)
  1163. \p{Changes_When_Casemapped} \p{Changes_When_Casemapped=Y} (Short:
  1164. \p{CWCM}) (2669)
  1165. \p{Changes_When_Casemapped: N*} (Short: \p{CWCM=N}, \P{CWCM})
  1166. (1_111_443 plus all above-Unicode code
  1167. points)
  1168. \p{Changes_When_Casemapped: Y*} (Short: \p{CWCM=Y}, \p{CWCM})
  1169. (2669)
  1170. \p{Changes_When_Lowercased} \p{Changes_When_Lowercased=Y} (Short:
  1171. \p{CWL}) (1304)
  1172. \p{Changes_When_Lowercased: N*} (Short: \p{CWL=N}, \P{CWL})
  1173. (1_112_808 plus all above-Unicode code
  1174. points)
  1175. \p{Changes_When_Lowercased: Y*} (Short: \p{CWL=Y}, \p{CWL}) (1304)
  1176. \p{Changes_When_NFKC_Casefolded} \p{Changes_When_NFKC_Casefolded=
  1177. Y} (Short: \p{CWKCF}) (10_227)
  1178. \p{Changes_When_NFKC_Casefolded: N*} (Short: \p{CWKCF=N},
  1179. \P{CWKCF}) (1_103_885 plus all above-
  1180. Unicode code points)
  1181. \p{Changes_When_NFKC_Casefolded: Y*} (Short: \p{CWKCF=Y},
  1182. \p{CWKCF}) (10_227)
  1183. \p{Changes_When_Titlecased} \p{Changes_When_Titlecased=Y} (Short:
  1184. \p{CWT}) (1369)
  1185. \p{Changes_When_Titlecased: N*} (Short: \p{CWT=N}, \P{CWT})
  1186. (1_112_743 plus all above-Unicode code
  1187. points)
  1188. \p{Changes_When_Titlecased: Y*} (Short: \p{CWT=Y}, \p{CWT}) (1369)
  1189. \p{Changes_When_Uppercased} \p{Changes_When_Uppercased=Y} (Short:
  1190. \p{CWU}) (1396)
  1191. \p{Changes_When_Uppercased: N*} (Short: \p{CWU=N}, \P{CWU})
  1192. (1_112_716 plus all above-Unicode code
  1193. points)
  1194. \p{Changes_When_Uppercased: Y*} (Short: \p{CWU=Y}, \p{CWU}) (1396)
  1195. \p{Cher} \p{Cherokee} (= \p{Script_Extensions=
  1196. Cherokee}) (NOT \p{Block=Cherokee}) (172)
  1197. \p{Cherokee} \p{Script_Extensions=Cherokee} (Short:
  1198. \p{Cher}; NOT \p{Block=Cherokee}) (172)
  1199. X \p{Cherokee_Sup} \p{Cherokee_Supplement} (= \p{Block=
  1200. Cherokee_Supplement}) (80)
  1201. X \p{Cherokee_Supplement} \p{Block=Cherokee_Supplement} (Short:
  1202. \p{InCherokeeSup}) (80)
  1203. \p{CI} \p{Case_Ignorable} (= \p{Case_Ignorable=
  1204. Y}) (2240)
  1205. \p{CI: *} \p{Case_Ignorable: *}
  1206. X \p{CJK} \p{CJK_Unified_Ideographs} (= \p{Block=
  1207. CJK_Unified_Ideographs}) (20_992)
  1208. X \p{CJK_Compat} \p{CJK_Compatibility} (= \p{Block=
  1209. CJK_Compatibility}) (256)
  1210. X \p{CJK_Compat_Forms} \p{CJK_Compatibility_Forms} (= \p{Block=
  1211. CJK_Compatibility_Forms}) (32)
  1212. X \p{CJK_Compat_Ideographs} \p{CJK_Compatibility_Ideographs} (=
  1213. \p{Block=CJK_Compatibility_Ideographs})
  1214. (512)
  1215. X \p{CJK_Compat_Ideographs_Sup}
  1216. \p{CJK_Compatibility_Ideographs_Supplement}
  1217. (= \p{Block=
  1218. CJK_Compatibility_Ideographs_Supplement})
  1219. (544)
  1220. X \p{CJK_Compatibility} \p{Block=CJK_Compatibility} (Short:
  1221. \p{InCJKCompat}) (256)
  1222. X \p{CJK_Compatibility_Forms} \p{Block=CJK_Compatibility_Forms}
  1223. (Short: \p{InCJKCompatForms}) (32)
  1224. X \p{CJK_Compatibility_Ideographs} \p{Block=
  1225. CJK_Compatibility_Ideographs} (Short:
  1226. \p{InCJKCompatIdeographs}) (512)
  1227. X \p{CJK_Compatibility_Ideographs_Supplement} \p{Block=
  1228. CJK_Compatibility_Ideographs_Supplement}
  1229. (Short: \p{InCJKCompatIdeographsSup})
  1230. (544)
  1231. X \p{CJK_Ext_A} \p{CJK_Unified_Ideographs_Extension_A} (=
  1232. \p{Block=
  1233. CJK_Unified_Ideographs_Extension_A})
  1234. (6592)
  1235. X \p{CJK_Ext_B} \p{CJK_Unified_Ideographs_Extension_B} (=
  1236. \p{Block=
  1237. CJK_Unified_Ideographs_Extension_B})
  1238. (42_720)
  1239. X \p{CJK_Ext_C} \p{CJK_Unified_Ideographs_Extension_C} (=
  1240. \p{Block=
  1241. CJK_Unified_Ideographs_Extension_C})
  1242. (4160)
  1243. X \p{CJK_Ext_D} \p{CJK_Unified_Ideographs_Extension_D} (=
  1244. \p{Block=
  1245. CJK_Unified_Ideographs_Extension_D})
  1246. (224)
  1247. X \p{CJK_Ext_E} \p{CJK_Unified_Ideographs_Extension_E} (=
  1248. \p{Block=
  1249. CJK_Unified_Ideographs_Extension_E})
  1250. (5776)
  1251. X \p{CJK_Radicals_Sup} \p{CJK_Radicals_Supplement} (= \p{Block=
  1252. CJK_Radicals_Supplement}) (128)
  1253. X \p{CJK_Radicals_Supplement} \p{Block=CJK_Radicals_Supplement}
  1254. (Short: \p{InCJKRadicalsSup}) (128)
  1255. X \p{CJK_Strokes} \p{Block=CJK_Strokes} (48)
  1256. X \p{CJK_Symbols} \p{CJK_Symbols_And_Punctuation} (=
  1257. \p{Block=CJK_Symbols_And_Punctuation})
  1258. (64)
  1259. X \p{CJK_Symbols_And_Punctuation} \p{Block=
  1260. CJK_Symbols_And_Punctuation} (Short:
  1261. \p{InCJKSymbols}) (64)
  1262. X \p{CJK_Unified_Ideographs} \p{Block=CJK_Unified_Ideographs}
  1263. (Short: \p{InCJK}) (20_992)
  1264. X \p{CJK_Unified_Ideographs_Extension_A} \p{Block=
  1265. CJK_Unified_Ideographs_Extension_A}
  1266. (Short: \p{InCJKExtA}) (6592)
  1267. X \p{CJK_Unified_Ideographs_Extension_B} \p{Block=
  1268. CJK_Unified_Ideographs_Extension_B}
  1269. (Short: \p{InCJKExtB}) (42_720)
  1270. X \p{CJK_Unified_Ideographs_Extension_C} \p{Block=
  1271. CJK_Unified_Ideographs_Extension_C}
  1272. (Short: \p{InCJKExtC}) (4160)
  1273. X \p{CJK_Unified_Ideographs_Extension_D} \p{Block=
  1274. CJK_Unified_Ideographs_Extension_D}
  1275. (Short: \p{InCJKExtD}) (224)
  1276. X \p{CJK_Unified_Ideographs_Extension_E} \p{Block=
  1277. CJK_Unified_Ideographs_Extension_E}
  1278. (Short: \p{InCJKExtE}) (5776)
  1279. \p{Close_Punctuation} \p{General_Category=Close_Punctuation}
  1280. (Short: \p{Pe}) (73)
  1281. \p{Cn} \p{Unassigned} (= \p{General_Category=
  1282. Unassigned}) (846_359 plus all above-
  1283. Unicode code points)
  1284. \p{Cntrl} \p{XPosixCntrl} (= \p{General_Category=
  1285. Control}) (65)
  1286. \p{Co} \p{Private_Use} (= \p{General_Category=
  1287. Private_Use}) (NOT \p{Private_Use_Area})
  1288. (137_468)
  1289. X \p{Combining_Diacritical_Marks} \p{Block=
  1290. Combining_Diacritical_Marks} (Short:
  1291. \p{InDiacriticals}) (112)
  1292. X \p{Combining_Diacritical_Marks_Extended} \p{Block=
  1293. Combining_Diacritical_Marks_Extended}
  1294. (Short: \p{InDiacriticalsExt}) (80)
  1295. X \p{Combining_Diacritical_Marks_For_Symbols} \p{Block=
  1296. Combining_Diacritical_Marks_For_Symbols}
  1297. (Short: \p{InDiacriticalsForSymbols})
  1298. (48)
  1299. X \p{Combining_Diacritical_Marks_Supplement} \p{Block=
  1300. Combining_Diacritical_Marks_Supplement}
  1301. (Short: \p{InDiacriticalsSup}) (64)
  1302. X \p{Combining_Half_Marks} \p{Block=Combining_Half_Marks} (Short:
  1303. \p{InHalfMarks}) (16)
  1304. \p{Combining_Mark} \p{Mark} (= \p{General_Category=Mark})
  1305. (2097)
  1306. X \p{Combining_Marks_For_Symbols}
  1307. \p{Combining_Diacritical_Marks_For_Symbols}
  1308. (= \p{Block=
  1309. Combining_Diacritical_Marks_For_Symbols})
  1310. (48)
  1311. \p{Common} \p{Script_Extensions=Common} (Short:
  1312. \p{Zyyy}) (6864)
  1313. X \p{Common_Indic_Number_Forms} \p{Block=Common_Indic_Number_Forms}
  1314. (Short: \p{InIndicNumberForms}) (16)
  1315. \p{Comp_Ex} \p{Full_Composition_Exclusion} (=
  1316. \p{Full_Composition_Exclusion=Y}) (1120)
  1317. \p{Comp_Ex: *} \p{Full_Composition_Exclusion: *}
  1318. X \p{Compat_Jamo} \p{Hangul_Compatibility_Jamo} (= \p{Block=
  1319. Hangul_Compatibility_Jamo}) (96)
  1320. \p{Composition_Exclusion} \p{Composition_Exclusion=Y} (Short:
  1321. \p{CE}) (81)
  1322. \p{Composition_Exclusion: N*} (Short: \p{CE=N}, \P{CE}) (1_114_031
  1323. plus all above-Unicode code points)
  1324. \p{Composition_Exclusion: Y*} (Short: \p{CE=Y}, \p{CE}) (81)
  1325. \p{Connector_Punctuation} \p{General_Category=
  1326. Connector_Punctuation} (Short: \p{Pc})
  1327. (10)
  1328. \p{Control} \p{XPosixCntrl} (= \p{General_Category=
  1329. Control}) (65)
  1330. X \p{Control_Pictures} \p{Block=Control_Pictures} (64)
  1331. \p{Copt} \p{Coptic} (= \p{Script_Extensions=
  1332. Coptic}) (NOT \p{Block=Coptic}) (165)
  1333. \p{Coptic} \p{Script_Extensions=Coptic} (Short:
  1334. \p{Copt}; NOT \p{Block=Coptic}) (165)
  1335. X \p{Coptic_Epact_Numbers} \p{Block=Coptic_Epact_Numbers} (32)
  1336. X \p{Counting_Rod} \p{Counting_Rod_Numerals} (= \p{Block=
  1337. Counting_Rod_Numerals}) (32)
  1338. X \p{Counting_Rod_Numerals} \p{Block=Counting_Rod_Numerals} (Short:
  1339. \p{InCountingRod}) (32)
  1340. \p{Cprt} \p{Cypriot} (= \p{Script_Extensions=
  1341. Cypriot}) (112)
  1342. \p{Cs} \p{Surrogate} (= \p{General_Category=
  1343. Surrogate}) (2048)
  1344. \p{Cuneiform} \p{Script_Extensions=Cuneiform} (Short:
  1345. \p{Xsux}; NOT \p{Block=Cuneiform}) (1234)
  1346. X \p{Cuneiform_Numbers} \p{Cuneiform_Numbers_And_Punctuation} (=
  1347. \p{Block=
  1348. Cuneiform_Numbers_And_Punctuation}) (128)
  1349. X \p{Cuneiform_Numbers_And_Punctuation} \p{Block=
  1350. Cuneiform_Numbers_And_Punctuation}
  1351. (Short: \p{InCuneiformNumbers}) (128)
  1352. \p{Currency_Symbol} \p{General_Category=Currency_Symbol}
  1353. (Short: \p{Sc}) (53)
  1354. X \p{Currency_Symbols} \p{Block=Currency_Symbols} (48)
  1355. \p{CWCF} \p{Changes_When_Casefolded} (=
  1356. \p{Changes_When_Casefolded=Y}) (1377)
  1357. \p{CWCF: *} \p{Changes_When_Casefolded: *}
  1358. \p{CWCM} \p{Changes_When_Casemapped} (=
  1359. \p{Changes_When_Casemapped=Y}) (2669)
  1360. \p{CWCM: *} \p{Changes_When_Casemapped: *}
  1361. \p{CWKCF} \p{Changes_When_NFKC_Casefolded} (=
  1362. \p{Changes_When_NFKC_Casefolded=Y})
  1363. (10_227)
  1364. \p{CWKCF: *} \p{Changes_When_NFKC_Casefolded: *}
  1365. \p{CWL} \p{Changes_When_Lowercased} (=
  1366. \p{Changes_When_Lowercased=Y}) (1304)
  1367. \p{CWL: *} \p{Changes_When_Lowercased: *}
  1368. \p{CWT} \p{Changes_When_Titlecased} (=
  1369. \p{Changes_When_Titlecased=Y}) (1369)
  1370. \p{CWT: *} \p{Changes_When_Titlecased: *}
  1371. \p{CWU} \p{Changes_When_Uppercased} (=
  1372. \p{Changes_When_Uppercased=Y}) (1396)
  1373. \p{CWU: *} \p{Changes_When_Uppercased: *}
  1374. \p{Cypriot} \p{Script_Extensions=Cypriot} (Short:
  1375. \p{Cprt}) (112)
  1376. X \p{Cypriot_Syllabary} \p{Block=Cypriot_Syllabary} (64)
  1377. \p{Cyrillic} \p{Script_Extensions=Cyrillic} (Short:
  1378. \p{Cyrl}; NOT \p{Block=Cyrillic}) (446)
  1379. X \p{Cyrillic_Ext_A} \p{Cyrillic_Extended_A} (= \p{Block=
  1380. Cyrillic_Extended_A}) (32)
  1381. X \p{Cyrillic_Ext_B} \p{Cyrillic_Extended_B} (= \p{Block=
  1382. Cyrillic_Extended_B}) (96)
  1383. X \p{Cyrillic_Ext_C} \p{Cyrillic_Extended_C} (= \p{Block=
  1384. Cyrillic_Extended_C}) (16)
  1385. X \p{Cyrillic_Extended_A} \p{Block=Cyrillic_Extended_A} (Short:
  1386. \p{InCyrillicExtA}) (32)
  1387. X \p{Cyrillic_Extended_B} \p{Block=Cyrillic_Extended_B} (Short:
  1388. \p{InCyrillicExtB}) (96)
  1389. X \p{Cyrillic_Extended_C} \p{Block=Cyrillic_Extended_C} (Short:
  1390. \p{InCyrillicExtC}) (16)
  1391. X \p{Cyrillic_Sup} \p{Cyrillic_Supplement} (= \p{Block=
  1392. Cyrillic_Supplement}) (48)
  1393. X \p{Cyrillic_Supplement} \p{Block=Cyrillic_Supplement} (Short:
  1394. \p{InCyrillicSup}) (48)
  1395. X \p{Cyrillic_Supplementary} \p{Cyrillic_Supplement} (= \p{Block=
  1396. Cyrillic_Supplement}) (48)
  1397. \p{Cyrl} \p{Cyrillic} (= \p{Script_Extensions=
  1398. Cyrillic}) (NOT \p{Block=Cyrillic}) (446)
  1399. \p{Dash} \p{Dash=Y} (28)
  1400. \p{Dash: N*} (Single: \P{Dash}) (1_114_084 plus all
  1401. above-Unicode code points)
  1402. \p{Dash: Y*} (Single: \p{Dash}) (28)
  1403. \p{Dash_Punctuation} \p{General_Category=Dash_Punctuation}
  1404. (Short: \p{Pd}) (24)
  1405. \p{Decimal_Number} \p{XPosixDigit} (= \p{General_Category=
  1406. Decimal_Number}) (580)
  1407. \p{Decomposition_Type: Can} \p{Decomposition_Type=Canonical}
  1408. (13_232)
  1409. \p{Decomposition_Type: Canonical} (Short: \p{Dt=Can}) (13_232)
  1410. \p{Decomposition_Type: Circle} (Short: \p{Dt=Enc}) (240)
  1411. \p{Decomposition_Type: Com} \p{Decomposition_Type=Compat} (720)
  1412. \p{Decomposition_Type: Compat} (Short: \p{Dt=Com}) (720)
  1413. \p{Decomposition_Type: Enc} \p{Decomposition_Type=Circle} (240)
  1414. \p{Decomposition_Type: Fin} \p{Decomposition_Type=Final} (240)
  1415. \p{Decomposition_Type: Final} (Short: \p{Dt=Fin}) (240)
  1416. \p{Decomposition_Type: Font} (Short: \p{Dt=Font}) (1184)
  1417. \p{Decomposition_Type: Fra} \p{Decomposition_Type=Fraction} (20)
  1418. \p{Decomposition_Type: Fraction} (Short: \p{Dt=Fra}) (20)
  1419. \p{Decomposition_Type: Init} \p{Decomposition_Type=Initial} (171)
  1420. \p{Decomposition_Type: Initial} (Short: \p{Dt=Init}) (171)
  1421. \p{Decomposition_Type: Iso} \p{Decomposition_Type=Isolated} (238)
  1422. \p{Decomposition_Type: Isolated} (Short: \p{Dt=Iso}) (238)
  1423. \p{Decomposition_Type: Med} \p{Decomposition_Type=Medial} (82)
  1424. \p{Decomposition_Type: Medial} (Short: \p{Dt=Med}) (82)
  1425. \p{Decomposition_Type: Nar} \p{Decomposition_Type=Narrow} (122)
  1426. \p{Decomposition_Type: Narrow} (Short: \p{Dt=Nar}) (122)
  1427. \p{Decomposition_Type: Nb} \p{Decomposition_Type=Nobreak} (5)
  1428. \p{Decomposition_Type: Nobreak} (Short: \p{Dt=Nb}) (5)
  1429. \p{Decomposition_Type: Non_Canon} \p{Decomposition_Type=
  1430. Non_Canonical} (Perl extension) (3662)
  1431. \p{Decomposition_Type: Non_Canonical} Union of all non-canonical
  1432. decompositions (Short: \p{Dt=NonCanon})
  1433. (Perl extension) (3662)
  1434. \p{Decomposition_Type: None} (Short: \p{Dt=None}) (1_097_218 plus
  1435. all above-Unicode code points)
  1436. \p{Decomposition_Type: Small} (Short: \p{Dt=Sml}) (26)
  1437. \p{Decomposition_Type: Sml} \p{Decomposition_Type=Small} (26)
  1438. \p{Decomposition_Type: Sqr} \p{Decomposition_Type=Square} (285)
  1439. \p{Decomposition_Type: Square} (Short: \p{Dt=Sqr}) (285)
  1440. \p{Decomposition_Type: Sub} (Short: \p{Dt=Sub}) (38)
  1441. \p{Decomposition_Type: Sup} \p{Decomposition_Type=Super} (152)
  1442. \p{Decomposition_Type: Super} (Short: \p{Dt=Sup}) (152)
  1443. \p{Decomposition_Type: Vert} \p{Decomposition_Type=Vertical} (35)
  1444. \p{Decomposition_Type: Vertical} (Short: \p{Dt=Vert}) (35)
  1445. \p{Decomposition_Type: Wide} (Short: \p{Dt=Wide}) (104)
  1446. \p{Default_Ignorable_Code_Point} \p{Default_Ignorable_Code_Point=
  1447. Y} (Short: \p{DI}) (4173)
  1448. \p{Default_Ignorable_Code_Point: N*} (Short: \p{DI=N}, \P{DI})
  1449. (1_109_939 plus all above-Unicode code
  1450. points)
  1451. \p{Default_Ignorable_Code_Point: Y*} (Short: \p{DI=Y}, \p{DI})
  1452. (4173)
  1453. \p{Dep} \p{Deprecated} (= \p{Deprecated=Y}) (15)
  1454. \p{Dep: *} \p{Deprecated: *}
  1455. \p{Deprecated} \p{Deprecated=Y} (Short: \p{Dep}) (15)
  1456. \p{Deprecated: N*} (Short: \p{Dep=N}, \P{Dep}) (1_114_097
  1457. plus all above-Unicode code points)
  1458. \p{Deprecated: Y*} (Short: \p{Dep=Y}, \p{Dep}) (15)
  1459. \p{Deseret} \p{Script_Extensions=Deseret} (Short:
  1460. \p{Dsrt}) (80)
  1461. \p{Deva} \p{Devanagari} (= \p{Script_Extensions=
  1462. Devanagari}) (NOT \p{Block=Devanagari})
  1463. (210)
  1464. \p{Devanagari} \p{Script_Extensions=Devanagari} (Short:
  1465. \p{Deva}; NOT \p{Block=Devanagari}) (210)
  1466. X \p{Devanagari_Ext} \p{Devanagari_Extended} (= \p{Block=
  1467. Devanagari_Extended}) (32)
  1468. X \p{Devanagari_Extended} \p{Block=Devanagari_Extended} (Short:
  1469. \p{InDevanagariExt}) (32)
  1470. \p{DI} \p{Default_Ignorable_Code_Point} (=
  1471. \p{Default_Ignorable_Code_Point=Y})
  1472. (4173)
  1473. \p{DI: *} \p{Default_Ignorable_Code_Point: *}
  1474. \p{Dia} \p{Diacritic} (= \p{Diacritic=Y}) (782)
  1475. \p{Dia: *} \p{Diacritic: *}
  1476. \p{Diacritic} \p{Diacritic=Y} (Short: \p{Dia}) (782)
  1477. \p{Diacritic: N*} (Short: \p{Dia=N}, \P{Dia}) (1_113_330
  1478. plus all above-Unicode code points)
  1479. \p{Diacritic: Y*} (Short: \p{Dia=Y}, \p{Dia}) (782)
  1480. X \p{Diacriticals} \p{Combining_Diacritical_Marks} (=
  1481. \p{Block=Combining_Diacritical_Marks})
  1482. (112)
  1483. X \p{Diacriticals_Ext} \p{Combining_Diacritical_Marks_Extended}
  1484. (= \p{Block=
  1485. Combining_Diacritical_Marks_Extended})
  1486. (80)
  1487. X \p{Diacriticals_For_Symbols}
  1488. \p{Combining_Diacritical_Marks_For_Symbols}
  1489. (= \p{Block=
  1490. Combining_Diacritical_Marks_For_Symbols})
  1491. (48)
  1492. X \p{Diacriticals_Sup} \p{Combining_Diacritical_Marks_Supplement}
  1493. (= \p{Block=
  1494. Combining_Diacritical_Marks_Supplement})
  1495. (64)
  1496. \p{Digit} \p{XPosixDigit} (= \p{General_Category=
  1497. Decimal_Number}) (580)
  1498. X \p{Dingbats} \p{Block=Dingbats} (192)
  1499. X \p{Domino} \p{Domino_Tiles} (= \p{Block=
  1500. Domino_Tiles}) (112)
  1501. X \p{Domino_Tiles} \p{Block=Domino_Tiles} (Short:
  1502. \p{InDomino}) (112)
  1503. \p{Dsrt} \p{Deseret} (= \p{Script_Extensions=
  1504. Deseret}) (80)
  1505. \p{Dt: *} \p{Decomposition_Type: *}
  1506. \p{Dupl} \p{Duployan} (= \p{Script_Extensions=
  1507. Duployan}) (NOT \p{Block=Duployan}) (147)
  1508. \p{Duployan} \p{Script_Extensions=Duployan} (Short:
  1509. \p{Dupl}; NOT \p{Block=Duployan}) (147)
  1510. \p{Ea: *} \p{East_Asian_Width: *}
  1511. X \p{Early_Dynastic_Cuneiform} \p{Block=Early_Dynastic_Cuneiform}
  1512. (208)
  1513. \p{East_Asian_Width: A} \p{East_Asian_Width=Ambiguous} (138_739)
  1514. \p{East_Asian_Width: Ambiguous} (Short: \p{Ea=A}) (138_739)
  1515. \p{East_Asian_Width: F} \p{East_Asian_Width=Fullwidth} (104)
  1516. \p{East_Asian_Width: Fullwidth} (Short: \p{Ea=F}) (104)
  1517. \p{East_Asian_Width: H} \p{East_Asian_Width=Halfwidth} (123)
  1518. \p{East_Asian_Width: Halfwidth} (Short: \p{Ea=H}) (123)
  1519. \p{East_Asian_Width: N} \p{East_Asian_Width=Neutral} (794_146 plus
  1520. all above-Unicode code points)
  1521. \p{East_Asian_Width: Na} \p{East_Asian_Width=Narrow} (111)
  1522. \p{East_Asian_Width: Narrow} (Short: \p{Ea=Na}) (111)
  1523. \p{East_Asian_Width: Neutral} (Short: \p{Ea=N}) (794_146 plus all
  1524. above-Unicode code points)
  1525. \p{East_Asian_Width: W} \p{East_Asian_Width=Wide} (180_889)
  1526. \p{East_Asian_Width: Wide} (Short: \p{Ea=W}) (180_889)
  1527. \p{Egyp} \p{Egyptian_Hieroglyphs} (=
  1528. \p{Script_Extensions=
  1529. Egyptian_Hieroglyphs}) (NOT \p{Block=
  1530. Egyptian_Hieroglyphs}) (1071)
  1531. \p{Egyptian_Hieroglyphs} \p{Script_Extensions=
  1532. Egyptian_Hieroglyphs} (Short: \p{Egyp};
  1533. NOT \p{Block=Egyptian_Hieroglyphs})
  1534. (1071)
  1535. \p{Elba} \p{Elbasan} (= \p{Script_Extensions=
  1536. Elbasan}) (NOT \p{Block=Elbasan}) (40)
  1537. \p{Elbasan} \p{Script_Extensions=Elbasan} (Short:
  1538. \p{Elba}; NOT \p{Block=Elbasan}) (40)
  1539. X \p{Emoticons} \p{Block=Emoticons} (80)
  1540. X \p{Enclosed_Alphanum} \p{Enclosed_Alphanumerics} (= \p{Block=
  1541. Enclosed_Alphanumerics}) (160)
  1542. X \p{Enclosed_Alphanum_Sup} \p{Enclosed_Alphanumeric_Supplement} (=
  1543. \p{Block=
  1544. Enclosed_Alphanumeric_Supplement}) (256)
  1545. X \p{Enclosed_Alphanumeric_Supplement} \p{Block=
  1546. Enclosed_Alphanumeric_Supplement}
  1547. (Short: \p{InEnclosedAlphanumSup}) (256)
  1548. X \p{Enclosed_Alphanumerics} \p{Block=Enclosed_Alphanumerics}
  1549. (Short: \p{InEnclosedAlphanum}) (160)
  1550. X \p{Enclosed_CJK} \p{Enclosed_CJK_Letters_And_Months} (=
  1551. \p{Block=
  1552. Enclosed_CJK_Letters_And_Months}) (256)
  1553. X \p{Enclosed_CJK_Letters_And_Months} \p{Block=
  1554. Enclosed_CJK_Letters_And_Months} (Short:
  1555. \p{InEnclosedCJK}) (256)
  1556. X \p{Enclosed_Ideographic_Sup} \p{Enclosed_Ideographic_Supplement}
  1557. (= \p{Block=
  1558. Enclosed_Ideographic_Supplement}) (256)
  1559. X \p{Enclosed_Ideographic_Supplement} \p{Block=
  1560. Enclosed_Ideographic_Supplement} (Short:
  1561. \p{InEnclosedIdeographicSup}) (256)
  1562. \p{Enclosing_Mark} \p{General_Category=Enclosing_Mark}
  1563. (Short: \p{Me}) (13)
  1564. \p{Ethi} \p{Ethiopic} (= \p{Script_Extensions=
  1565. Ethiopic}) (NOT \p{Block=Ethiopic}) (495)
  1566. \p{Ethiopic} \p{Script_Extensions=Ethiopic} (Short:
  1567. \p{Ethi}; NOT \p{Block=Ethiopic}) (495)
  1568. X \p{Ethiopic_Ext} \p{Ethiopic_Extended} (= \p{Block=
  1569. Ethiopic_Extended}) (96)
  1570. X \p{Ethiopic_Ext_A} \p{Ethiopic_Extended_A} (= \p{Block=
  1571. Ethiopic_Extended_A}) (48)
  1572. X \p{Ethiopic_Extended} \p{Block=Ethiopic_Extended} (Short:
  1573. \p{InEthiopicExt}) (96)
  1574. X \p{Ethiopic_Extended_A} \p{Block=Ethiopic_Extended_A} (Short:
  1575. \p{InEthiopicExtA}) (48)
  1576. X \p{Ethiopic_Sup} \p{Ethiopic_Supplement} (= \p{Block=
  1577. Ethiopic_Supplement}) (32)
  1578. X \p{Ethiopic_Supplement} \p{Block=Ethiopic_Supplement} (Short:
  1579. \p{InEthiopicSup}) (32)
  1580. \p{Ext} \p{Extender} (= \p{Extender=Y}) (42)
  1581. \p{Ext: *} \p{Extender: *}
  1582. \p{Extender} \p{Extender=Y} (Short: \p{Ext}) (42)
  1583. \p{Extender: N*} (Short: \p{Ext=N}, \P{Ext}) (1_114_070
  1584. plus all above-Unicode code points)
  1585. \p{Extender: Y*} (Short: \p{Ext=Y}, \p{Ext}) (42)
  1586. \p{Final_Punctuation} \p{General_Category=Final_Punctuation}
  1587. (Short: \p{Pf}) (10)
  1588. \p{Format} \p{General_Category=Format} (Short:
  1589. \p{Cf}) (151)
  1590. \p{Full_Composition_Exclusion} \p{Full_Composition_Exclusion=Y}
  1591. (Short: \p{CompEx}) (1120)
  1592. \p{Full_Composition_Exclusion: N*} (Short: \p{CompEx=N},
  1593. \P{CompEx}) (1_112_992 plus all above-
  1594. Unicode code points)
  1595. \p{Full_Composition_Exclusion: Y*} (Short: \p{CompEx=Y},
  1596. \p{CompEx}) (1120)
  1597. \p{Gc: *} \p{General_Category: *}
  1598. \p{GCB: *} \p{Grapheme_Cluster_Break: *}
  1599. \p{General_Category: C} \p{General_Category=Other} (986_091 plus
  1600. all above-Unicode code points)
  1601. \p{General_Category: Cased_Letter} [\p{Ll}\p{Lu}\p{Lt}] (Short:
  1602. \p{Gc=LC}, \p{LC}) (3796)
  1603. \p{General_Category: Cc} \p{General_Category=Control} (65)
  1604. \p{General_Category: Cf} \p{General_Category=Format} (151)
  1605. \p{General_Category: Close_Punctuation} (Short: \p{Gc=Pe}, \p{Pe})
  1606. (73)
  1607. \p{General_Category: Cn} \p{General_Category=Unassigned} (846_359
  1608. plus all above-Unicode code points)
  1609. \p{General_Category: Cntrl} \p{General_Category=Control} (65)
  1610. \p{General_Category: Co} \p{General_Category=Private_Use} (137_468)
  1611. \p{General_Category: Combining_Mark} \p{General_Category=Mark}
  1612. (2097)
  1613. \p{General_Category: Connector_Punctuation} (Short: \p{Gc=Pc},
  1614. \p{Pc}) (10)
  1615. \p{General_Category: Control} (Short: \p{Gc=Cc}, \p{Cc}) (65)
  1616. \p{General_Category: Cs} \p{General_Category=Surrogate} (2048)
  1617. \p{General_Category: Currency_Symbol} (Short: \p{Gc=Sc}, \p{Sc})
  1618. (53)
  1619. \p{General_Category: Dash_Punctuation} (Short: \p{Gc=Pd}, \p{Pd})
  1620. (24)
  1621. \p{General_Category: Decimal_Number} (Short: \p{Gc=Nd}, \p{Nd})
  1622. (580)
  1623. \p{General_Category: Digit} \p{General_Category=Decimal_Number}
  1624. (580)
  1625. \p{General_Category: Enclosing_Mark} (Short: \p{Gc=Me}, \p{Me})
  1626. (13)
  1627. \p{General_Category: Final_Punctuation} (Short: \p{Gc=Pf}, \p{Pf})
  1628. (10)
  1629. \p{General_Category: Format} (Short: \p{Gc=Cf}, \p{Cf}) (151)
  1630. \p{General_Category: Initial_Punctuation} (Short: \p{Gc=Pi},
  1631. \p{Pi}) (12)
  1632. \p{General_Category: L} \p{General_Category=Letter} (116_766)
  1633. X \p{General_Category: L&} \p{General_Category=Cased_Letter} (3796)
  1634. X \p{General_Category: L_} \p{General_Category=Cased_Letter} Note
  1635. the trailing '_' matters in spite of
  1636. loose matching rules. (3796)
  1637. \p{General_Category: LC} \p{General_Category=Cased_Letter} (3796)
  1638. \p{General_Category: Letter} (Short: \p{Gc=L}, \p{L}) (116_766)
  1639. \p{General_Category: Letter_Number} (Short: \p{Gc=Nl}, \p{Nl})
  1640. (236)
  1641. \p{General_Category: Line_Separator} (Short: \p{Gc=Zl}, \p{Zl}) (1)
  1642. \p{General_Category: Ll} \p{General_Category=Lowercase_Letter}
  1643. (/i= General_Category=Cased_Letter)
  1644. (2063)
  1645. \p{General_Category: Lm} \p{General_Category=Modifier_Letter} (249)
  1646. \p{General_Category: Lo} \p{General_Category=Other_Letter}
  1647. (112_721)
  1648. \p{General_Category: Lowercase_Letter} (Short: \p{Gc=Ll}, \p{Ll};
  1649. /i= General_Category=Cased_Letter) (2063)
  1650. \p{General_Category: Lt} \p{General_Category=Titlecase_Letter}
  1651. (/i= General_Category=Cased_Letter) (31)
  1652. \p{General_Category: Lu} \p{General_Category=Uppercase_Letter}
  1653. (/i= General_Category=Cased_Letter)
  1654. (1702)
  1655. \p{General_Category: M} \p{General_Category=Mark} (2097)
  1656. \p{General_Category: Mark} (Short: \p{Gc=M}, \p{M}) (2097)
  1657. \p{General_Category: Math_Symbol} (Short: \p{Gc=Sm}, \p{Sm}) (948)
  1658. \p{General_Category: Mc} \p{General_Category=Spacing_Mark} (394)
  1659. \p{General_Category: Me} \p{General_Category=Enclosing_Mark} (13)
  1660. \p{General_Category: Mn} \p{General_Category=Nonspacing_Mark}
  1661. (1690)
  1662. \p{General_Category: Modifier_Letter} (Short: \p{Gc=Lm}, \p{Lm})
  1663. (249)
  1664. \p{General_Category: Modifier_Symbol} (Short: \p{Gc=Sk}, \p{Sk})
  1665. (121)
  1666. \p{General_Category: N} \p{General_Category=Number} (1492)
  1667. \p{General_Category: Nd} \p{General_Category=Decimal_Number} (580)
  1668. \p{General_Category: Nl} \p{General_Category=Letter_Number} (236)
  1669. \p{General_Category: No} \p{General_Category=Other_Number} (676)
  1670. \p{General_Category: Nonspacing_Mark} (Short: \p{Gc=Mn}, \p{Mn})
  1671. (1690)
  1672. \p{General_Category: Number} (Short: \p{Gc=N}, \p{N}) (1492)
  1673. \p{General_Category: Open_Punctuation} (Short: \p{Gc=Ps}, \p{Ps})
  1674. (75)
  1675. \p{General_Category: Other} (Short: \p{Gc=C}, \p{C}) (986_091 plus
  1676. all above-Unicode code points)
  1677. \p{General_Category: Other_Letter} (Short: \p{Gc=Lo}, \p{Lo})
  1678. (112_721)
  1679. \p{General_Category: Other_Number} (Short: \p{Gc=No}, \p{No}) (676)
  1680. \p{General_Category: Other_Punctuation} (Short: \p{Gc=Po}, \p{Po})
  1681. (544)
  1682. \p{General_Category: Other_Symbol} (Short: \p{Gc=So}, \p{So})
  1683. (5777)
  1684. \p{General_Category: P} \p{General_Category=Punctuation} (748)
  1685. \p{General_Category: Paragraph_Separator} (Short: \p{Gc=Zp},
  1686. \p{Zp}) (1)
  1687. \p{General_Category: Pc} \p{General_Category=
  1688. Connector_Punctuation} (10)
  1689. \p{General_Category: Pd} \p{General_Category=Dash_Punctuation} (24)
  1690. \p{General_Category: Pe} \p{General_Category=Close_Punctuation}
  1691. (73)
  1692. \p{General_Category: Pf} \p{General_Category=Final_Punctuation}
  1693. (10)
  1694. \p{General_Category: Pi} \p{General_Category=Initial_Punctuation}
  1695. (12)
  1696. \p{General_Category: Po} \p{General_Category=Other_Punctuation}
  1697. (544)
  1698. \p{General_Category: Private_Use} (Short: \p{Gc=Co}, \p{Co})
  1699. (137_468)
  1700. \p{General_Category: Ps} \p{General_Category=Open_Punctuation} (75)
  1701. \p{General_Category: Punct} \p{General_Category=Punctuation} (748)
  1702. \p{General_Category: Punctuation} (Short: \p{Gc=P}, \p{P}) (748)
  1703. \p{General_Category: S} \p{General_Category=Symbol} (6899)
  1704. \p{General_Category: Sc} \p{General_Category=Currency_Symbol} (53)
  1705. \p{General_Category: Separator} (Short: \p{Gc=Z}, \p{Z}) (19)
  1706. \p{General_Category: Sk} \p{General_Category=Modifier_Symbol} (121)
  1707. \p{General_Category: Sm} \p{General_Category=Math_Symbol} (948)
  1708. \p{General_Category: So} \p{General_Category=Other_Symbol} (5777)
  1709. \p{General_Category: Space_Separator} (Short: \p{Gc=Zs}, \p{Zs})
  1710. (17)
  1711. \p{General_Category: Spacing_Mark} (Short: \p{Gc=Mc}, \p{Mc}) (394)
  1712. \p{General_Category: Surrogate} (Short: \p{Gc=Cs}, \p{Cs}) (2048)
  1713. \p{General_Category: Symbol} (Short: \p{Gc=S}, \p{S}) (6899)
  1714. \p{General_Category: Titlecase_Letter} (Short: \p{Gc=Lt}, \p{Lt};
  1715. /i= General_Category=Cased_Letter) (31)
  1716. \p{General_Category: Unassigned} (Short: \p{Gc=Cn}, \p{Cn})
  1717. (846_359 plus all above-Unicode code
  1718. points)
  1719. \p{General_Category: Uppercase_Letter} (Short: \p{Gc=Lu}, \p{Lu};
  1720. /i= General_Category=Cased_Letter) (1702)
  1721. \p{General_Category: Z} \p{General_Category=Separator} (19)
  1722. \p{General_Category: Zl} \p{General_Category=Line_Separator} (1)
  1723. \p{General_Category: Zp} \p{General_Category=Paragraph_Separator}
  1724. (1)
  1725. \p{General_Category: Zs} \p{General_Category=Space_Separator} (17)
  1726. X \p{General_Punctuation} \p{Block=General_Punctuation} (Short:
  1727. \p{InPunctuation}) (112)
  1728. X \p{Geometric_Shapes} \p{Block=Geometric_Shapes} (96)
  1729. X \p{Geometric_Shapes_Ext} \p{Geometric_Shapes_Extended} (=
  1730. \p{Block=Geometric_Shapes_Extended})
  1731. (128)
  1732. X \p{Geometric_Shapes_Extended} \p{Block=Geometric_Shapes_Extended}
  1733. (Short: \p{InGeometricShapesExt}) (128)
  1734. \p{Geor} \p{Georgian} (= \p{Script_Extensions=
  1735. Georgian}) (NOT \p{Block=Georgian}) (129)
  1736. \p{Georgian} \p{Script_Extensions=Georgian} (Short:
  1737. \p{Geor}; NOT \p{Block=Georgian}) (129)
  1738. X \p{Georgian_Sup} \p{Georgian_Supplement} (= \p{Block=
  1739. Georgian_Supplement}) (48)
  1740. X \p{Georgian_Supplement} \p{Block=Georgian_Supplement} (Short:
  1741. \p{InGeorgianSup}) (48)
  1742. \p{Glag} \p{Glagolitic} (= \p{Script_Extensions=
  1743. Glagolitic}) (NOT \p{Block=Glagolitic})
  1744. (136)
  1745. \p{Glagolitic} \p{Script_Extensions=Glagolitic} (Short:
  1746. \p{Glag}; NOT \p{Block=Glagolitic}) (136)
  1747. X \p{Glagolitic_Sup} \p{Glagolitic_Supplement} (= \p{Block=
  1748. Glagolitic_Supplement}) (48)
  1749. X \p{Glagolitic_Supplement} \p{Block=Glagolitic_Supplement} (Short:
  1750. \p{InGlagoliticSup}) (48)
  1751. \p{Goth} \p{Gothic} (= \p{Script_Extensions=
  1752. Gothic}) (NOT \p{Block=Gothic}) (27)
  1753. \p{Gothic} \p{Script_Extensions=Gothic} (Short:
  1754. \p{Goth}; NOT \p{Block=Gothic}) (27)
  1755. \p{Gr_Base} \p{Grapheme_Base} (= \p{Grapheme_Base=Y})
  1756. (126_288)
  1757. \p{Gr_Base: *} \p{Grapheme_Base: *}
  1758. \p{Gr_Ext} \p{Grapheme_Extend} (= \p{Grapheme_Extend=
  1759. Y}) (1828)
  1760. \p{Gr_Ext: *} \p{Grapheme_Extend: *}
  1761. \p{Gran} \p{Grantha} (= \p{Script_Extensions=
  1762. Grantha}) (NOT \p{Block=Grantha}) (113)
  1763. \p{Grantha} \p{Script_Extensions=Grantha} (Short:
  1764. \p{Gran}; NOT \p{Block=Grantha}) (113)
  1765. \p{Graph} \p{XPosixGraph} (265_621)
  1766. \p{Grapheme_Base} \p{Grapheme_Base=Y} (Short: \p{GrBase})
  1767. (126_288)
  1768. \p{Grapheme_Base: N*} (Short: \p{GrBase=N}, \P{GrBase}) (987_824
  1769. plus all above-Unicode code points)
  1770. \p{Grapheme_Base: Y*} (Short: \p{GrBase=Y}, \p{GrBase}) (126_288)
  1771. \p{Grapheme_Cluster_Break: CN} \p{Grapheme_Cluster_Break=Control}
  1772. (5925)
  1773. \p{Grapheme_Cluster_Break: Control} (Short: \p{GCB=CN}) (5925)
  1774. \p{Grapheme_Cluster_Break: CR} (Short: \p{GCB=CR}) (1)
  1775. \p{Grapheme_Cluster_Break: E_Base} (Short: \p{GCB=EB}) (79)
  1776. \p{Grapheme_Cluster_Break: E_Base_GAZ} (Short: \p{GCB=EBG}) (4)
  1777. \p{Grapheme_Cluster_Break: E_Modifier} (Short: \p{GCB=EM}) (5)
  1778. \p{Grapheme_Cluster_Break: EB} \p{Grapheme_Cluster_Break=E_Base}
  1779. (79)
  1780. \p{Grapheme_Cluster_Break: EBG} \p{Grapheme_Cluster_Break=
  1781. E_Base_GAZ} (4)
  1782. \p{Grapheme_Cluster_Break: EM} \p{Grapheme_Cluster_Break=
  1783. E_Modifier} (5)
  1784. \p{Grapheme_Cluster_Break: EX} \p{Grapheme_Cluster_Break=Extend}
  1785. (1828)
  1786. \p{Grapheme_Cluster_Break: Extend} (Short: \p{GCB=EX}) (1828)
  1787. \p{Grapheme_Cluster_Break: GAZ} \p{Grapheme_Cluster_Break=
  1788. Glue_After_Zwj} (3)
  1789. \p{Grapheme_Cluster_Break: Glue_After_Zwj} (Short: \p{GCB=GAZ}) (3)
  1790. \p{Grapheme_Cluster_Break: L} (Short: \p{GCB=L}) (125)
  1791. \p{Grapheme_Cluster_Break: LF} (Short: \p{GCB=LF}) (1)
  1792. \p{Grapheme_Cluster_Break: LV} (Short: \p{GCB=LV}) (399)
  1793. \p{Grapheme_Cluster_Break: LVT} (Short: \p{GCB=LVT}) (10_773)
  1794. \p{Grapheme_Cluster_Break: Other} (Short: \p{GCB=XX}) (1_094_356
  1795. plus all above-Unicode code points)
  1796. \p{Grapheme_Cluster_Break: PP} \p{Grapheme_Cluster_Break=Prepend}
  1797. (13)
  1798. \p{Grapheme_Cluster_Break: Prepend} (Short: \p{GCB=PP}) (13)
  1799. \p{Grapheme_Cluster_Break: Regional_Indicator} (Short: \p{GCB=RI})
  1800. (26)
  1801. \p{Grapheme_Cluster_Break: RI} \p{Grapheme_Cluster_Break=
  1802. Regional_Indicator} (26)
  1803. \p{Grapheme_Cluster_Break: SM} \p{Grapheme_Cluster_Break=
  1804. SpacingMark} (341)
  1805. \p{Grapheme_Cluster_Break: SpacingMark} (Short: \p{GCB=SM}) (341)
  1806. \p{Grapheme_Cluster_Break: T} (Short: \p{GCB=T}) (137)
  1807. \p{Grapheme_Cluster_Break: V} (Short: \p{GCB=V}) (95)
  1808. \p{Grapheme_Cluster_Break: XX} \p{Grapheme_Cluster_Break=Other}
  1809. (1_094_356 plus all above-Unicode code
  1810. points)
  1811. \p{Grapheme_Cluster_Break: ZWJ} (Short: \p{GCB=ZWJ}) (1)
  1812. \p{Grapheme_Extend} \p{Grapheme_Extend=Y} (Short: \p{GrExt})
  1813. (1828)
  1814. \p{Grapheme_Extend: N*} (Short: \p{GrExt=N}, \P{GrExt}) (1_112_284
  1815. plus all above-Unicode code points)
  1816. \p{Grapheme_Extend: Y*} (Short: \p{GrExt=Y}, \p{GrExt}) (1828)
  1817. \p{Greek} \p{Script_Extensions=Greek} (Short:
  1818. \p{Grek}; NOT \p{Greek_And_Coptic}) (522)
  1819. X \p{Greek_And_Coptic} \p{Block=Greek_And_Coptic} (Short:
  1820. \p{InGreek}) (144)
  1821. X \p{Greek_Ext} \p{Greek_Extended} (= \p{Block=
  1822. Greek_Extended}) (256)
  1823. X \p{Greek_Extended} \p{Block=Greek_Extended} (Short:
  1824. \p{InGreekExt}) (256)
  1825. \p{Grek} \p{Greek} (= \p{Script_Extensions=Greek})
  1826. (NOT \p{Greek_And_Coptic}) (522)
  1827. \p{Gujarati} \p{Script_Extensions=Gujarati} (Short:
  1828. \p{Gujr}; NOT \p{Block=Gujarati}) (99)
  1829. \p{Gujr} \p{Gujarati} (= \p{Script_Extensions=
  1830. Gujarati}) (NOT \p{Block=Gujarati}) (99)
  1831. \p{Gurmukhi} \p{Script_Extensions=Gurmukhi} (Short:
  1832. \p{Guru}; NOT \p{Block=Gurmukhi}) (93)
  1833. \p{Guru} \p{Gurmukhi} (= \p{Script_Extensions=
  1834. Gurmukhi}) (NOT \p{Block=Gurmukhi}) (93)
  1835. X \p{Half_And_Full_Forms} \p{Halfwidth_And_Fullwidth_Forms} (=
  1836. \p{Block=Halfwidth_And_Fullwidth_Forms})
  1837. (240)
  1838. X \p{Half_Marks} \p{Combining_Half_Marks} (= \p{Block=
  1839. Combining_Half_Marks}) (16)
  1840. X \p{Halfwidth_And_Fullwidth_Forms} \p{Block=
  1841. Halfwidth_And_Fullwidth_Forms} (Short:
  1842. \p{InHalfAndFullForms}) (240)
  1843. \p{Han} \p{Script_Extensions=Han} (82_013)
  1844. \p{Hang} \p{Hangul} (= \p{Script_Extensions=
  1845. Hangul}) (NOT \p{Hangul_Syllables})
  1846. (11_775)
  1847. \p{Hangul} \p{Script_Extensions=Hangul} (Short:
  1848. \p{Hang}; NOT \p{Hangul_Syllables})
  1849. (11_775)
  1850. X \p{Hangul_Compatibility_Jamo} \p{Block=Hangul_Compatibility_Jamo}
  1851. (Short: \p{InCompatJamo}) (96)
  1852. X \p{Hangul_Jamo} \p{Block=Hangul_Jamo} (Short: \p{InJamo})
  1853. (256)
  1854. X \p{Hangul_Jamo_Extended_A} \p{Block=Hangul_Jamo_Extended_A}
  1855. (Short: \p{InJamoExtA}) (32)
  1856. X \p{Hangul_Jamo_Extended_B} \p{Block=Hangul_Jamo_Extended_B}
  1857. (Short: \p{InJamoExtB}) (80)
  1858. \p{Hangul_Syllable_Type: L} \p{Hangul_Syllable_Type=Leading_Jamo}
  1859. (125)
  1860. \p{Hangul_Syllable_Type: Leading_Jamo} (Short: \p{Hst=L}) (125)
  1861. \p{Hangul_Syllable_Type: LV} \p{Hangul_Syllable_Type=LV_Syllable}
  1862. (399)
  1863. \p{Hangul_Syllable_Type: LV_Syllable} (Short: \p{Hst=LV}) (399)
  1864. \p{Hangul_Syllable_Type: LVT} \p{Hangul_Syllable_Type=
  1865. LVT_Syllable} (10_773)
  1866. \p{Hangul_Syllable_Type: LVT_Syllable} (Short: \p{Hst=LVT})
  1867. (10_773)
  1868. \p{Hangul_Syllable_Type: NA} \p{Hangul_Syllable_Type=
  1869. Not_Applicable} (1_102_583 plus all
  1870. above-Unicode code points)
  1871. \p{Hangul_Syllable_Type: Not_Applicable} (Short: \p{Hst=NA})
  1872. (1_102_583 plus all above-Unicode code
  1873. points)
  1874. \p{Hangul_Syllable_Type: T} \p{Hangul_Syllable_Type=Trailing_Jamo}
  1875. (137)
  1876. \p{Hangul_Syllable_Type: Trailing_Jamo} (Short: \p{Hst=T}) (137)
  1877. \p{Hangul_Syllable_Type: V} \p{Hangul_Syllable_Type=Vowel_Jamo}
  1878. (95)
  1879. \p{Hangul_Syllable_Type: Vowel_Jamo} (Short: \p{Hst=V}) (95)
  1880. X \p{Hangul_Syllables} \p{Block=Hangul_Syllables} (Short:
  1881. \p{InHangul}) (11_184)
  1882. \p{Hani} \p{Han} (= \p{Script_Extensions=Han})
  1883. (82_013)
  1884. \p{Hano} \p{Hanunoo} (= \p{Script_Extensions=
  1885. Hanunoo}) (NOT \p{Block=Hanunoo}) (23)
  1886. \p{Hanunoo} \p{Script_Extensions=Hanunoo} (Short:
  1887. \p{Hano}; NOT \p{Block=Hanunoo}) (23)
  1888. \p{Hatr} \p{Hatran} (= \p{Script_Extensions=
  1889. Hatran}) (NOT \p{Block=Hatran}) (26)
  1890. \p{Hatran} \p{Script_Extensions=Hatran} (Short:
  1891. \p{Hatr}; NOT \p{Block=Hatran}) (26)
  1892. \p{Hebr} \p{Hebrew} (= \p{Script_Extensions=
  1893. Hebrew}) (NOT \p{Block=Hebrew}) (133)
  1894. \p{Hebrew} \p{Script_Extensions=Hebrew} (Short:
  1895. \p{Hebr}; NOT \p{Block=Hebrew}) (133)
  1896. \p{Hex} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  1897. \p{Hex: *} \p{Hex_Digit: *}
  1898. \p{Hex_Digit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  1899. \p{Hex_Digit: N*} (Short: \p{Hex=N}, \P{Hex}) (1_114_068
  1900. plus all above-Unicode code points)
  1901. \p{Hex_Digit: Y*} (Short: \p{Hex=Y}, \p{Hex}) (44)
  1902. X \p{High_Private_Use_Surrogates} \p{Block=
  1903. High_Private_Use_Surrogates} (Short:
  1904. \p{InHighPUSurrogates}) (128)
  1905. X \p{High_PU_Surrogates} \p{High_Private_Use_Surrogates} (=
  1906. \p{Block=High_Private_Use_Surrogates})
  1907. (128)
  1908. X \p{High_Surrogates} \p{Block=High_Surrogates} (896)
  1909. \p{Hira} \p{Hiragana} (= \p{Script_Extensions=
  1910. Hiragana}) (NOT \p{Block=Hiragana}) (143)
  1911. \p{Hiragana} \p{Script_Extensions=Hiragana} (Short:
  1912. \p{Hira}; NOT \p{Block=Hiragana}) (143)
  1913. \p{Hluw} \p{Anatolian_Hieroglyphs} (=
  1914. \p{Script_Extensions=
  1915. Anatolian_Hieroglyphs}) (NOT \p{Block=
  1916. Anatolian_Hieroglyphs}) (583)
  1917. \p{Hmng} \p{Pahawh_Hmong} (= \p{Script_Extensions=
  1918. Pahawh_Hmong}) (NOT \p{Block=
  1919. Pahawh_Hmong}) (127)
  1920. \p{HorizSpace} \p{XPosixBlank} (18)
  1921. \p{Hst: *} \p{Hangul_Syllable_Type: *}
  1922. \p{Hung} \p{Old_Hungarian} (= \p{Script_Extensions=
  1923. Old_Hungarian}) (NOT \p{Block=
  1924. Old_Hungarian}) (108)
  1925. D \p{Hyphen} \p{Hyphen=Y} (11)
  1926. D \p{Hyphen: N*} Supplanted by Line_Break property values;
  1927. see www.unicode.org/reports/tr14
  1928. (Single: \P{Hyphen}) (1_114_101 plus all
  1929. above-Unicode code points)
  1930. D \p{Hyphen: Y*} Supplanted by Line_Break property values;
  1931. see www.unicode.org/reports/tr14
  1932. (Single: \p{Hyphen}) (11)
  1933. \p{ID_Continue} \p{ID_Continue=Y} (Short: \p{IDC}; NOT
  1934. \p{Ideographic_Description_Characters})
  1935. (119_691)
  1936. \p{ID_Continue: N*} (Short: \p{IDC=N}, \P{IDC}) (994_421 plus
  1937. all above-Unicode code points)
  1938. \p{ID_Continue: Y*} (Short: \p{IDC=Y}, \p{IDC}) (119_691)
  1939. \p{ID_Start} \p{ID_Start=Y} (Short: \p{IDS}) (117_007)
  1940. \p{ID_Start: N*} (Short: \p{IDS=N}, \P{IDS}) (997_105 plus
  1941. all above-Unicode code points)
  1942. \p{ID_Start: Y*} (Short: \p{IDS=Y}, \p{IDS}) (117_007)
  1943. \p{IDC} \p{ID_Continue} (= \p{ID_Continue=Y}) (NOT
  1944. \p{Ideographic_Description_Characters})
  1945. (119_691)
  1946. \p{IDC: *} \p{ID_Continue: *}
  1947. \p{Ideo} \p{Ideographic} (= \p{Ideographic=Y})
  1948. (88_284)
  1949. \p{Ideo: *} \p{Ideographic: *}
  1950. \p{Ideographic} \p{Ideographic=Y} (Short: \p{Ideo})
  1951. (88_284)
  1952. \p{Ideographic: N*} (Short: \p{Ideo=N}, \P{Ideo}) (1_025_828
  1953. plus all above-Unicode code points)
  1954. \p{Ideographic: Y*} (Short: \p{Ideo=Y}, \p{Ideo}) (88_284)
  1955. X \p{Ideographic_Description_Characters} \p{Block=
  1956. Ideographic_Description_Characters}
  1957. (Short: \p{InIDC}) (16)
  1958. X \p{Ideographic_Symbols} \p{Ideographic_Symbols_And_Punctuation} (=
  1959. \p{Block=
  1960. Ideographic_Symbols_And_Punctuation})
  1961. (32)
  1962. X \p{Ideographic_Symbols_And_Punctuation} \p{Block=
  1963. Ideographic_Symbols_And_Punctuation}
  1964. (Short: \p{InIdeographicSymbols}) (32)
  1965. \p{IDS} \p{ID_Start} (= \p{ID_Start=Y}) (117_007)
  1966. \p{IDS: *} \p{ID_Start: *}
  1967. \p{IDS_Binary_Operator} \p{IDS_Binary_Operator=Y} (Short:
  1968. \p{IDSB}) (10)
  1969. \p{IDS_Binary_Operator: N*} (Short: \p{IDSB=N}, \P{IDSB})
  1970. (1_114_102 plus all above-Unicode code
  1971. points)
  1972. \p{IDS_Binary_Operator: Y*} (Short: \p{IDSB=Y}, \p{IDSB}) (10)
  1973. \p{IDS_Trinary_Operator} \p{IDS_Trinary_Operator=Y} (Short:
  1974. \p{IDST}) (2)
  1975. \p{IDS_Trinary_Operator: N*} (Short: \p{IDST=N}, \P{IDST})
  1976. (1_114_110 plus all above-Unicode code
  1977. points)
  1978. \p{IDS_Trinary_Operator: Y*} (Short: \p{IDST=Y}, \p{IDST}) (2)
  1979. \p{IDSB} \p{IDS_Binary_Operator} (=
  1980. \p{IDS_Binary_Operator=Y}) (10)
  1981. \p{IDSB: *} \p{IDS_Binary_Operator: *}
  1982. \p{IDST} \p{IDS_Trinary_Operator} (=
  1983. \p{IDS_Trinary_Operator=Y}) (2)
  1984. \p{IDST: *} \p{IDS_Trinary_Operator: *}
  1985. \p{Imperial_Aramaic} \p{Script_Extensions=Imperial_Aramaic}
  1986. (Short: \p{Armi}; NOT \p{Block=
  1987. Imperial_Aramaic}) (31)
  1988. \p{In: *} \p{Present_In: *} (Perl extension)
  1989. X \p{In_*} \p{Block: *}
  1990. X \p{Indic_Number_Forms} \p{Common_Indic_Number_Forms} (= \p{Block=
  1991. Common_Indic_Number_Forms}) (16)
  1992. \p{Indic_Positional_Category: Bottom} (Short: \p{InPC=Bottom})
  1993. (300)
  1994. \p{Indic_Positional_Category: Bottom_And_Right} (Short: \p{InPC=
  1995. BottomAndRight}) (2)
  1996. \p{Indic_Positional_Category: Left} (Short: \p{InPC=Left}) (57)
  1997. \p{Indic_Positional_Category: Left_And_Right} (Short: \p{InPC=
  1998. LeftAndRight}) (21)
  1999. \p{Indic_Positional_Category: NA} (Short: \p{InPC=NA}) (1_113_069
  2000. plus all above-Unicode code points)
  2001. \p{Indic_Positional_Category: Overstruck} (Short: \p{InPC=
  2002. Overstruck}) (10)
  2003. \p{Indic_Positional_Category: Right} (Short: \p{InPC=Right}) (258)
  2004. \p{Indic_Positional_Category: Top} (Short: \p{InPC=Top}) (342)
  2005. \p{Indic_Positional_Category: Top_And_Bottom} (Short: \p{InPC=
  2006. TopAndBottom}) (10)
  2007. \p{Indic_Positional_Category: Top_And_Bottom_And_Right} (Short:
  2008. \p{InPC=TopAndBottomAndRight}) (1)
  2009. \p{Indic_Positional_Category: Top_And_Left} (Short: \p{InPC=
  2010. TopAndLeft}) (6)
  2011. \p{Indic_Positional_Category: Top_And_Left_And_Right} (Short:
  2012. \p{InPC=TopAndLeftAndRight}) (4)
  2013. \p{Indic_Positional_Category: Top_And_Right} (Short: \p{InPC=
  2014. TopAndRight}) (13)
  2015. \p{Indic_Positional_Category: Visual_Order_Left} (Short: \p{InPC=
  2016. VisualOrderLeft}) (19)
  2017. \p{Indic_Syllabic_Category: Avagraha} (Short: \p{InSC=Avagraha})
  2018. (15)
  2019. \p{Indic_Syllabic_Category: Bindu} (Short: \p{InSC=Bindu}) (67)
  2020. \p{Indic_Syllabic_Category: Brahmi_Joining_Number} (Short:
  2021. \p{InSC=BrahmiJoiningNumber}) (20)
  2022. \p{Indic_Syllabic_Category: Cantillation_Mark} (Short: \p{InSC=
  2023. CantillationMark}) (53)
  2024. \p{Indic_Syllabic_Category: Consonant} (Short: \p{InSC=Consonant})
  2025. (1907)
  2026. \p{Indic_Syllabic_Category: Consonant_Dead} (Short: \p{InSC=
  2027. ConsonantDead}) (10)
  2028. \p{Indic_Syllabic_Category: Consonant_Final} (Short: \p{InSC=
  2029. ConsonantFinal}) (62)
  2030. \p{Indic_Syllabic_Category: Consonant_Head_Letter} (Short:
  2031. \p{InSC=ConsonantHeadLetter}) (5)
  2032. \p{Indic_Syllabic_Category: Consonant_Killer} (Short: \p{InSC=
  2033. ConsonantKiller}) (2)
  2034. \p{Indic_Syllabic_Category: Consonant_Medial} (Short: \p{InSC=
  2035. ConsonantMedial}) (22)
  2036. \p{Indic_Syllabic_Category: Consonant_Placeholder} (Short:
  2037. \p{InSC=ConsonantPlaceholder}) (16)
  2038. \p{Indic_Syllabic_Category: Consonant_Preceding_Repha} (Short:
  2039. \p{InSC=ConsonantPrecedingRepha}) (1)
  2040. \p{Indic_Syllabic_Category: Consonant_Prefixed} (Short: \p{InSC=
  2041. ConsonantPrefixed}) (2)
  2042. \p{Indic_Syllabic_Category: Consonant_Subjoined} (Short: \p{InSC=
  2043. ConsonantSubjoined}) (90)
  2044. \p{Indic_Syllabic_Category: Consonant_Succeeding_Repha} (Short:
  2045. \p{InSC=ConsonantSucceedingRepha}) (4)
  2046. \p{Indic_Syllabic_Category: Consonant_With_Stacker} (Short:
  2047. \p{InSC=ConsonantWithStacker}) (4)
  2048. \p{Indic_Syllabic_Category: Gemination_Mark} (Short: \p{InSC=
  2049. GeminationMark}) (2)
  2050. \p{Indic_Syllabic_Category: Invisible_Stacker} (Short: \p{InSC=
  2051. InvisibleStacker}) (7)
  2052. \p{Indic_Syllabic_Category: Joiner} (Short: \p{InSC=Joiner}) (1)
  2053. \p{Indic_Syllabic_Category: Modifying_Letter} (Short: \p{InSC=
  2054. ModifyingLetter}) (1)
  2055. \p{Indic_Syllabic_Category: Non_Joiner} (Short: \p{InSC=
  2056. NonJoiner}) (1)
  2057. \p{Indic_Syllabic_Category: Nukta} (Short: \p{InSC=Nukta}) (24)
  2058. \p{Indic_Syllabic_Category: Number} (Short: \p{InSC=Number}) (459)
  2059. \p{Indic_Syllabic_Category: Number_Joiner} (Short: \p{InSC=
  2060. NumberJoiner}) (1)
  2061. \p{Indic_Syllabic_Category: Other} (Short: \p{InSC=Other})
  2062. (1_110_129 plus all above-Unicode code
  2063. points)
  2064. \p{Indic_Syllabic_Category: Pure_Killer} (Short: \p{InSC=
  2065. PureKiller}) (16)
  2066. \p{Indic_Syllabic_Category: Register_Shifter} (Short: \p{InSC=
  2067. RegisterShifter}) (2)
  2068. \p{Indic_Syllabic_Category: Syllable_Modifier} (Short: \p{InSC=
  2069. SyllableModifier}) (22)
  2070. \p{Indic_Syllabic_Category: Tone_Letter} (Short: \p{InSC=
  2071. ToneLetter}) (7)
  2072. \p{Indic_Syllabic_Category: Tone_Mark} (Short: \p{InSC=ToneMark})
  2073. (42)
  2074. \p{Indic_Syllabic_Category: Virama} (Short: \p{InSC=Virama}) (24)
  2075. \p{Indic_Syllabic_Category: Visarga} (Short: \p{InSC=Visarga}) (31)
  2076. \p{Indic_Syllabic_Category: Vowel} (Short: \p{InSC=Vowel}) (30)
  2077. \p{Indic_Syllabic_Category: Vowel_Dependent} (Short: \p{InSC=
  2078. VowelDependent}) (602)
  2079. \p{Indic_Syllabic_Category: Vowel_Independent} (Short: \p{InSC=
  2080. VowelIndependent}) (431)
  2081. \p{Inherited} \p{Script_Extensions=Inherited} (Short:
  2082. \p{Zinh}) (496)
  2083. \p{Initial_Punctuation} \p{General_Category=Initial_Punctuation}
  2084. (Short: \p{Pi}) (12)
  2085. \p{InPC: *} \p{Indic_Positional_Category: *}
  2086. \p{InSC: *} \p{Indic_Syllabic_Category: *}
  2087. \p{Inscriptional_Pahlavi} \p{Script_Extensions=
  2088. Inscriptional_Pahlavi} (Short: \p{Phli};
  2089. NOT \p{Block=Inscriptional_Pahlavi}) (27)
  2090. \p{Inscriptional_Parthian} \p{Script_Extensions=
  2091. Inscriptional_Parthian} (Short:
  2092. \p{Prti}; NOT \p{Block=
  2093. Inscriptional_Parthian}) (30)
  2094. X \p{IPA_Ext} \p{IPA_Extensions} (= \p{Block=
  2095. IPA_Extensions}) (96)
  2096. X \p{IPA_Extensions} \p{Block=IPA_Extensions} (Short:
  2097. \p{InIPAExt}) (96)
  2098. \p{Is_*} \p{*} (Any exceptions are individually
  2099. noted beginning with the word NOT.) If
  2100. an entry has flag(s) at its beginning,
  2101. like "D", the "Is_" form has the same
  2102. flag(s).
  2103. \p{Ital} \p{Old_Italic} (= \p{Script_Extensions=
  2104. Old_Italic}) (NOT \p{Block=Old_Italic})
  2105. (36)
  2106. X \p{Jamo} \p{Hangul_Jamo} (= \p{Block=Hangul_Jamo})
  2107. (256)
  2108. X \p{Jamo_Ext_A} \p{Hangul_Jamo_Extended_A} (= \p{Block=
  2109. Hangul_Jamo_Extended_A}) (32)
  2110. X \p{Jamo_Ext_B} \p{Hangul_Jamo_Extended_B} (= \p{Block=
  2111. Hangul_Jamo_Extended_B}) (80)
  2112. \p{Java} \p{Javanese} (= \p{Script_Extensions=
  2113. Javanese}) (NOT \p{Block=Javanese}) (91)
  2114. \p{Javanese} \p{Script_Extensions=Javanese} (Short:
  2115. \p{Java}; NOT \p{Block=Javanese}) (91)
  2116. \p{Jg: *} \p{Joining_Group: *}
  2117. \p{Join_C} \p{Join_Control} (= \p{Join_Control=Y}) (2)
  2118. \p{Join_C: *} \p{Join_Control: *}
  2119. \p{Join_Control} \p{Join_Control=Y} (Short: \p{JoinC}) (2)
  2120. \p{Join_Control: N*} (Short: \p{JoinC=N}, \P{JoinC}) (1_114_110
  2121. plus all above-Unicode code points)
  2122. \p{Join_Control: Y*} (Short: \p{JoinC=Y}, \p{JoinC}) (2)
  2123. \p{Joining_Group: African_Feh} (Short: \p{Jg=AfricanFeh}) (1)
  2124. \p{Joining_Group: African_Noon} (Short: \p{Jg=AfricanNoon}) (1)
  2125. \p{Joining_Group: African_Qaf} (Short: \p{Jg=AfricanQaf}) (1)
  2126. \p{Joining_Group: Ain} (Short: \p{Jg=Ain}) (8)
  2127. \p{Joining_Group: Alaph} (Short: \p{Jg=Alaph}) (1)
  2128. \p{Joining_Group: Alef} (Short: \p{Jg=Alef}) (10)
  2129. \p{Joining_Group: Beh} (Short: \p{Jg=Beh}) (24)
  2130. \p{Joining_Group: Beth} (Short: \p{Jg=Beth}) (2)
  2131. \p{Joining_Group: Burushaski_Yeh_Barree} (Short: \p{Jg=
  2132. BurushaskiYehBarree}) (2)
  2133. \p{Joining_Group: Dal} (Short: \p{Jg=Dal}) (15)
  2134. \p{Joining_Group: Dalath_Rish} (Short: \p{Jg=DalathRish}) (4)
  2135. \p{Joining_Group: E} (Short: \p{Jg=E}) (1)
  2136. \p{Joining_Group: Farsi_Yeh} (Short: \p{Jg=FarsiYeh}) (7)
  2137. \p{Joining_Group: Fe} (Short: \p{Jg=Fe}) (1)
  2138. \p{Joining_Group: Feh} (Short: \p{Jg=Feh}) (10)
  2139. \p{Joining_Group: Final_Semkath} (Short: \p{Jg=FinalSemkath}) (1)
  2140. \p{Joining_Group: Gaf} (Short: \p{Jg=Gaf}) (14)
  2141. \p{Joining_Group: Gamal} (Short: \p{Jg=Gamal}) (3)
  2142. \p{Joining_Group: Hah} (Short: \p{Jg=Hah}) (18)
  2143. \p{Joining_Group: Hamza_On_Heh_Goal} (Short: \p{Jg=
  2144. HamzaOnHehGoal}) (1)
  2145. \p{Joining_Group: He} (Short: \p{Jg=He}) (1)
  2146. \p{Joining_Group: Heh} (Short: \p{Jg=Heh}) (1)
  2147. \p{Joining_Group: Heh_Goal} (Short: \p{Jg=HehGoal}) (2)
  2148. \p{Joining_Group: Heth} (Short: \p{Jg=Heth}) (1)
  2149. \p{Joining_Group: Kaf} (Short: \p{Jg=Kaf}) (6)
  2150. \p{Joining_Group: Kaph} (Short: \p{Jg=Kaph}) (1)
  2151. \p{Joining_Group: Khaph} (Short: \p{Jg=Khaph}) (1)
  2152. \p{Joining_Group: Knotted_Heh} (Short: \p{Jg=KnottedHeh}) (2)
  2153. \p{Joining_Group: Lam} (Short: \p{Jg=Lam}) (7)
  2154. \p{Joining_Group: Lamadh} (Short: \p{Jg=Lamadh}) (1)
  2155. \p{Joining_Group: Manichaean_Aleph} (Short: \p{Jg=
  2156. ManichaeanAleph}) (1)
  2157. \p{Joining_Group: Manichaean_Ayin} (Short: \p{Jg=ManichaeanAyin})
  2158. (2)
  2159. \p{Joining_Group: Manichaean_Beth} (Short: \p{Jg=ManichaeanBeth})
  2160. (2)
  2161. \p{Joining_Group: Manichaean_Daleth} (Short: \p{Jg=
  2162. ManichaeanDaleth}) (1)
  2163. \p{Joining_Group: Manichaean_Dhamedh} (Short: \p{Jg=
  2164. ManichaeanDhamedh}) (1)
  2165. \p{Joining_Group: Manichaean_Five} (Short: \p{Jg=ManichaeanFive})
  2166. (1)
  2167. \p{Joining_Group: Manichaean_Gimel} (Short: \p{Jg=
  2168. ManichaeanGimel}) (2)
  2169. \p{Joining_Group: Manichaean_Heth} (Short: \p{Jg=ManichaeanHeth})
  2170. (1)
  2171. \p{Joining_Group: Manichaean_Hundred} (Short: \p{Jg=
  2172. ManichaeanHundred}) (1)
  2173. \p{Joining_Group: Manichaean_Kaph} (Short: \p{Jg=ManichaeanKaph})
  2174. (3)
  2175. \p{Joining_Group: Manichaean_Lamedh} (Short: \p{Jg=
  2176. ManichaeanLamedh}) (1)
  2177. \p{Joining_Group: Manichaean_Mem} (Short: \p{Jg=ManichaeanMem}) (1)
  2178. \p{Joining_Group: Manichaean_Nun} (Short: \p{Jg=ManichaeanNun}) (1)
  2179. \p{Joining_Group: Manichaean_One} (Short: \p{Jg=ManichaeanOne}) (1)
  2180. \p{Joining_Group: Manichaean_Pe} (Short: \p{Jg=ManichaeanPe}) (2)
  2181. \p{Joining_Group: Manichaean_Qoph} (Short: \p{Jg=ManichaeanQoph})
  2182. (3)
  2183. \p{Joining_Group: Manichaean_Resh} (Short: \p{Jg=ManichaeanResh})
  2184. (1)
  2185. \p{Joining_Group: Manichaean_Sadhe} (Short: \p{Jg=
  2186. ManichaeanSadhe}) (1)
  2187. \p{Joining_Group: Manichaean_Samekh} (Short: \p{Jg=
  2188. ManichaeanSamekh}) (1)
  2189. \p{Joining_Group: Manichaean_Taw} (Short: \p{Jg=ManichaeanTaw}) (1)
  2190. \p{Joining_Group: Manichaean_Ten} (Short: \p{Jg=ManichaeanTen}) (1)
  2191. \p{Joining_Group: Manichaean_Teth} (Short: \p{Jg=ManichaeanTeth})
  2192. (1)
  2193. \p{Joining_Group: Manichaean_Thamedh} (Short: \p{Jg=
  2194. ManichaeanThamedh}) (1)
  2195. \p{Joining_Group: Manichaean_Twenty} (Short: \p{Jg=
  2196. ManichaeanTwenty}) (1)
  2197. \p{Joining_Group: Manichaean_Waw} (Short: \p{Jg=ManichaeanWaw}) (1)
  2198. \p{Joining_Group: Manichaean_Yodh} (Short: \p{Jg=ManichaeanYodh})
  2199. (1)
  2200. \p{Joining_Group: Manichaean_Zayin} (Short: \p{Jg=
  2201. ManichaeanZayin}) (2)
  2202. \p{Joining_Group: Meem} (Short: \p{Jg=Meem}) (4)
  2203. \p{Joining_Group: Mim} (Short: \p{Jg=Mim}) (1)
  2204. \p{Joining_Group: No_Joining_Group} (Short: \p{Jg=NoJoiningGroup})
  2205. (1_113_818 plus all above-Unicode code
  2206. points)
  2207. \p{Joining_Group: Noon} (Short: \p{Jg=Noon}) (8)
  2208. \p{Joining_Group: Nun} (Short: \p{Jg=Nun}) (1)
  2209. \p{Joining_Group: Nya} (Short: \p{Jg=Nya}) (1)
  2210. \p{Joining_Group: Pe} (Short: \p{Jg=Pe}) (1)
  2211. \p{Joining_Group: Qaf} (Short: \p{Jg=Qaf}) (5)
  2212. \p{Joining_Group: Qaph} (Short: \p{Jg=Qaph}) (1)
  2213. \p{Joining_Group: Reh} (Short: \p{Jg=Reh}) (19)
  2214. \p{Joining_Group: Reversed_Pe} (Short: \p{Jg=ReversedPe}) (1)
  2215. \p{Joining_Group: Rohingya_Yeh} (Short: \p{Jg=RohingyaYeh}) (1)
  2216. \p{Joining_Group: Sad} (Short: \p{Jg=Sad}) (6)
  2217. \p{Joining_Group: Sadhe} (Short: \p{Jg=Sadhe}) (1)
  2218. \p{Joining_Group: Seen} (Short: \p{Jg=Seen}) (11)
  2219. \p{Joining_Group: Semkath} (Short: \p{Jg=Semkath}) (1)
  2220. \p{Joining_Group: Shin} (Short: \p{Jg=Shin}) (1)
  2221. \p{Joining_Group: Straight_Waw} (Short: \p{Jg=StraightWaw}) (1)
  2222. \p{Joining_Group: Swash_Kaf} (Short: \p{Jg=SwashKaf}) (1)
  2223. \p{Joining_Group: Syriac_Waw} (Short: \p{Jg=SyriacWaw}) (1)
  2224. \p{Joining_Group: Tah} (Short: \p{Jg=Tah}) (4)
  2225. \p{Joining_Group: Taw} (Short: \p{Jg=Taw}) (1)
  2226. \p{Joining_Group: Teh_Marbuta} (Short: \p{Jg=TehMarbuta}) (3)
  2227. \p{Joining_Group: Teh_Marbuta_Goal} \p{Joining_Group=
  2228. Hamza_On_Heh_Goal} (1)
  2229. \p{Joining_Group: Teth} (Short: \p{Jg=Teth}) (2)
  2230. \p{Joining_Group: Waw} (Short: \p{Jg=Waw}) (16)
  2231. \p{Joining_Group: Yeh} (Short: \p{Jg=Yeh}) (11)
  2232. \p{Joining_Group: Yeh_Barree} (Short: \p{Jg=YehBarree}) (2)
  2233. \p{Joining_Group: Yeh_With_Tail} (Short: \p{Jg=YehWithTail}) (1)
  2234. \p{Joining_Group: Yudh} (Short: \p{Jg=Yudh}) (1)
  2235. \p{Joining_Group: Yudh_He} (Short: \p{Jg=YudhHe}) (1)
  2236. \p{Joining_Group: Zain} (Short: \p{Jg=Zain}) (1)
  2237. \p{Joining_Group: Zhain} (Short: \p{Jg=Zhain}) (1)
  2238. \p{Joining_Type: C} \p{Joining_Type=Join_Causing} (4)
  2239. \p{Joining_Type: D} \p{Joining_Type=Dual_Joining} (501)
  2240. \p{Joining_Type: Dual_Joining} (Short: \p{Jt=D}) (501)
  2241. \p{Joining_Type: Join_Causing} (Short: \p{Jt=C}) (4)
  2242. \p{Joining_Type: L} \p{Joining_Type=Left_Joining} (3)
  2243. \p{Joining_Type: Left_Joining} (Short: \p{Jt=L}) (3)
  2244. \p{Joining_Type: Non_Joining} (Short: \p{Jt=U}) (1_111_653 plus
  2245. all above-Unicode code points)
  2246. \p{Joining_Type: R} \p{Joining_Type=Right_Joining} (112)
  2247. \p{Joining_Type: Right_Joining} (Short: \p{Jt=R}) (112)
  2248. \p{Joining_Type: T} \p{Joining_Type=Transparent} (1839)
  2249. \p{Joining_Type: Transparent} (Short: \p{Jt=T}) (1839)
  2250. \p{Joining_Type: U} \p{Joining_Type=Non_Joining} (1_111_653
  2251. plus all above-Unicode code points)
  2252. \p{Jt: *} \p{Joining_Type: *}
  2253. \p{Kaithi} \p{Script_Extensions=Kaithi} (Short:
  2254. \p{Kthi}; NOT \p{Block=Kaithi}) (86)
  2255. \p{Kali} \p{Kayah_Li} (= \p{Script_Extensions=
  2256. Kayah_Li}) (48)
  2257. \p{Kana} \p{Katakana} (= \p{Script_Extensions=
  2258. Katakana}) (NOT \p{Block=Katakana}) (352)
  2259. X \p{Kana_Sup} \p{Kana_Supplement} (= \p{Block=
  2260. Kana_Supplement}) (256)
  2261. X \p{Kana_Supplement} \p{Block=Kana_Supplement} (Short:
  2262. \p{InKanaSup}) (256)
  2263. X \p{Kanbun} \p{Block=Kanbun} (16)
  2264. X \p{Kangxi} \p{Kangxi_Radicals} (= \p{Block=
  2265. Kangxi_Radicals}) (224)
  2266. X \p{Kangxi_Radicals} \p{Block=Kangxi_Radicals} (Short:
  2267. \p{InKangxi}) (224)
  2268. \p{Kannada} \p{Script_Extensions=Kannada} (Short:
  2269. \p{Knda}; NOT \p{Block=Kannada}) (100)
  2270. \p{Katakana} \p{Script_Extensions=Katakana} (Short:
  2271. \p{Kana}; NOT \p{Block=Katakana}) (352)
  2272. X \p{Katakana_Ext} \p{Katakana_Phonetic_Extensions} (=
  2273. \p{Block=Katakana_Phonetic_Extensions})
  2274. (16)
  2275. X \p{Katakana_Phonetic_Extensions} \p{Block=
  2276. Katakana_Phonetic_Extensions} (Short:
  2277. \p{InKatakanaExt}) (16)
  2278. \p{Kayah_Li} \p{Script_Extensions=Kayah_Li} (Short:
  2279. \p{Kali}) (48)
  2280. \p{Khar} \p{Kharoshthi} (= \p{Script_Extensions=
  2281. Kharoshthi}) (NOT \p{Block=Kharoshthi})
  2282. (65)
  2283. \p{Kharoshthi} \p{Script_Extensions=Kharoshthi} (Short:
  2284. \p{Khar}; NOT \p{Block=Kharoshthi}) (65)
  2285. \p{Khmer} \p{Script_Extensions=Khmer} (Short:
  2286. \p{Khmr}; NOT \p{Block=Khmer}) (146)
  2287. X \p{Khmer_Symbols} \p{Block=Khmer_Symbols} (32)
  2288. \p{Khmr} \p{Khmer} (= \p{Script_Extensions=Khmer})
  2289. (NOT \p{Block=Khmer}) (146)
  2290. \p{Khoj} \p{Khojki} (= \p{Script_Extensions=
  2291. Khojki}) (NOT \p{Block=Khojki}) (72)
  2292. \p{Khojki} \p{Script_Extensions=Khojki} (Short:
  2293. \p{Khoj}; NOT \p{Block=Khojki}) (72)
  2294. \p{Khudawadi} \p{Script_Extensions=Khudawadi} (Short:
  2295. \p{Sind}; NOT \p{Block=Khudawadi}) (81)
  2296. \p{Knda} \p{Kannada} (= \p{Script_Extensions=
  2297. Kannada}) (NOT \p{Block=Kannada}) (100)
  2298. \p{Kthi} \p{Kaithi} (= \p{Script_Extensions=
  2299. Kaithi}) (NOT \p{Block=Kaithi}) (86)
  2300. \p{L} \pL \p{Letter} (= \p{General_Category=Letter})
  2301. (116_766)
  2302. X \p{L&} \p{Cased_Letter} (= \p{General_Category=
  2303. Cased_Letter}) (3796)
  2304. X \p{L_} \p{Cased_Letter} (= \p{General_Category=
  2305. Cased_Letter}) Note the trailing '_'
  2306. matters in spite of loose matching
  2307. rules. (3796)
  2308. \p{Lana} \p{Tai_Tham} (= \p{Script_Extensions=
  2309. Tai_Tham}) (NOT \p{Block=Tai_Tham}) (127)
  2310. \p{Lao} \p{Script_Extensions=Lao} (NOT \p{Block=
  2311. Lao}) (67)
  2312. \p{Laoo} \p{Lao} (= \p{Script_Extensions=Lao}) (NOT
  2313. \p{Block=Lao}) (67)
  2314. \p{Latin} \p{Script_Extensions=Latin} (Short:
  2315. \p{Latn}) (1370)
  2316. X \p{Latin_1} \p{Latin_1_Supplement} (= \p{Block=
  2317. Latin_1_Supplement}) (128)
  2318. X \p{Latin_1_Sup} \p{Latin_1_Supplement} (= \p{Block=
  2319. Latin_1_Supplement}) (128)
  2320. X \p{Latin_1_Supplement} \p{Block=Latin_1_Supplement} (Short:
  2321. \p{InLatin1}) (128)
  2322. X \p{Latin_Ext_A} \p{Latin_Extended_A} (= \p{Block=
  2323. Latin_Extended_A}) (128)
  2324. X \p{Latin_Ext_Additional} \p{Latin_Extended_Additional} (=
  2325. \p{Block=Latin_Extended_Additional})
  2326. (256)
  2327. X \p{Latin_Ext_B} \p{Latin_Extended_B} (= \p{Block=
  2328. Latin_Extended_B}) (208)
  2329. X \p{Latin_Ext_C} \p{Latin_Extended_C} (= \p{Block=
  2330. Latin_Extended_C}) (32)
  2331. X \p{Latin_Ext_D} \p{Latin_Extended_D} (= \p{Block=
  2332. Latin_Extended_D}) (224)
  2333. X \p{Latin_Ext_E} \p{Latin_Extended_E} (= \p{Block=
  2334. Latin_Extended_E}) (64)
  2335. X \p{Latin_Extended_A} \p{Block=Latin_Extended_A} (Short:
  2336. \p{InLatinExtA}) (128)
  2337. X \p{Latin_Extended_Additional} \p{Block=Latin_Extended_Additional}
  2338. (Short: \p{InLatinExtAdditional}) (256)
  2339. X \p{Latin_Extended_B} \p{Block=Latin_Extended_B} (Short:
  2340. \p{InLatinExtB}) (208)
  2341. X \p{Latin_Extended_C} \p{Block=Latin_Extended_C} (Short:
  2342. \p{InLatinExtC}) (32)
  2343. X \p{Latin_Extended_D} \p{Block=Latin_Extended_D} (Short:
  2344. \p{InLatinExtD}) (224)
  2345. X \p{Latin_Extended_E} \p{Block=Latin_Extended_E} (Short:
  2346. \p{InLatinExtE}) (64)
  2347. \p{Latn} \p{Latin} (= \p{Script_Extensions=Latin})
  2348. (1370)
  2349. \p{Lb: *} \p{Line_Break: *}
  2350. \p{LC} \p{Cased_Letter} (= \p{General_Category=
  2351. Cased_Letter}) (3796)
  2352. \p{Lepc} \p{Lepcha} (= \p{Script_Extensions=
  2353. Lepcha}) (NOT \p{Block=Lepcha}) (74)
  2354. \p{Lepcha} \p{Script_Extensions=Lepcha} (Short:
  2355. \p{Lepc}; NOT \p{Block=Lepcha}) (74)
  2356. \p{Letter} \p{General_Category=Letter} (Short: \p{L})
  2357. (116_766)
  2358. \p{Letter_Number} \p{General_Category=Letter_Number} (Short:
  2359. \p{Nl}) (236)
  2360. X \p{Letterlike_Symbols} \p{Block=Letterlike_Symbols} (80)
  2361. \p{Limb} \p{Limbu} (= \p{Script_Extensions=Limbu})
  2362. (NOT \p{Block=Limbu}) (69)
  2363. \p{Limbu} \p{Script_Extensions=Limbu} (Short:
  2364. \p{Limb}; NOT \p{Block=Limbu}) (69)
\p{Lina} \p{Linear_A} (= \p{Script_Extensions=Linear_A}) (NOT \p{Block=Linear_A}) (386)
\p{Linb} \p{Linear_B} (= \p{Script_Extensions=Linear_B}) (268)
\p{Line_Break: AI} \p{Line_Break=Ambiguous} (707)
\p{Line_Break: AL} \p{Line_Break=Alphabetic} (19_523)
\p{Line_Break: Alphabetic} (Short: \p{Lb=AL}) (19_523)
\p{Line_Break: Ambiguous} (Short: \p{Lb=AI}) (707)
\p{Line_Break: B2} \p{Line_Break=Break_Both} (3)
\p{Line_Break: BA} \p{Line_Break=Break_After} (218)
\p{Line_Break: BB} \p{Line_Break=Break_Before} (37)
\p{Line_Break: BK} \p{Line_Break=Mandatory_Break} (4)
\p{Line_Break: Break_After} (Short: \p{Lb=BA}) (218)
\p{Line_Break: Break_Before} (Short: \p{Lb=BB}) (37)
\p{Line_Break: Break_Both} (Short: \p{Lb=B2}) (3)
\p{Line_Break: Break_Symbols} (Short: \p{Lb=SY}) (1)
\p{Line_Break: Carriage_Return} (Short: \p{Lb=CR}) (1)
\p{Line_Break: CB} \p{Line_Break=Contingent_Break} (1)
\p{Line_Break: CJ} \p{Line_Break=Conditional_Japanese_Starter} (51)
\p{Line_Break: CL} \p{Line_Break=Close_Punctuation} (90)
\p{Line_Break: Close_Parenthesis} (Short: \p{Lb=CP}) (2)
\p{Line_Break: Close_Punctuation} (Short: \p{Lb=CL}) (90)
\p{Line_Break: CM} \p{Line_Break=Combining_Mark} (2090)
\p{Line_Break: Combining_Mark} (Short: \p{Lb=CM}) (2090)
\p{Line_Break: Complex_Context} (Short: \p{Lb=SA}) (734)
\p{Line_Break: Conditional_Japanese_Starter} (Short: \p{Lb=CJ}) (51)
\p{Line_Break: Contingent_Break} (Short: \p{Lb=CB}) (1)
\p{Line_Break: CP} \p{Line_Break=Close_Parenthesis} (2)
\p{Line_Break: CR} \p{Line_Break=Carriage_Return} (1)
\p{Line_Break: E_Base} (Short: \p{Lb=EB}) (83)
\p{Line_Break: E_Modifier} (Short: \p{Lb=EM}) (5)
\p{Line_Break: EB} \p{Line_Break=E_Base} (83)
\p{Line_Break: EM} \p{Line_Break=E_Modifier} (5)
\p{Line_Break: EX} \p{Line_Break=Exclamation} (37)
\p{Line_Break: Exclamation} (Short: \p{Lb=EX}) (37)
\p{Line_Break: GL} \p{Line_Break=Glue} (18)
\p{Line_Break: Glue} (Short: \p{Lb=GL}) (18)
\p{Line_Break: H2} (Short: \p{Lb=H2}) (399)
\p{Line_Break: H3} (Short: \p{Lb=H3}) (10_773)
\p{Line_Break: Hebrew_Letter} (Short: \p{Lb=HL}) (74)
\p{Line_Break: HL} \p{Line_Break=Hebrew_Letter} (74)
\p{Line_Break: HY} \p{Line_Break=Hyphen} (1)
\p{Line_Break: Hyphen} (Short: \p{Lb=HY}) (1)
\p{Line_Break: ID} \p{Line_Break=Ideographic} (172_133)
\p{Line_Break: Ideographic} (Short: \p{Lb=ID}) (172_133)
\p{Line_Break: IN} \p{Line_Break=Inseparable} (6)
\p{Line_Break: Infix_Numeric} (Short: \p{Lb=IS}) (13)
\p{Line_Break: Inseparable} (Short: \p{Lb=IN}) (6)
\p{Line_Break: Inseperable} \p{Line_Break=Inseparable} (6)
\p{Line_Break: IS} \p{Line_Break=Infix_Numeric} (13)
\p{Line_Break: JL} (Short: \p{Lb=JL}) (125)
\p{Line_Break: JT} (Short: \p{Lb=JT}) (137)
\p{Line_Break: JV} (Short: \p{Lb=JV}) (95)
\p{Line_Break: LF} \p{Line_Break=Line_Feed} (1)
\p{Line_Break: Line_Feed} (Short: \p{Lb=LF}) (1)
\p{Line_Break: Mandatory_Break} (Short: \p{Lb=BK}) (4)
\p{Line_Break: Next_Line} (Short: \p{Lb=NL}) (1)
\p{Line_Break: NL} \p{Line_Break=Next_Line} (1)
\p{Line_Break: Nonstarter} (Short: \p{Lb=NS}) (30)
\p{Line_Break: NS} \p{Line_Break=Nonstarter} (30)
\p{Line_Break: NU} \p{Line_Break=Numeric} (572)
\p{Line_Break: Numeric} (Short: \p{Lb=NU}) (572)
\p{Line_Break: OP} \p{Line_Break=Open_Punctuation} (87)
\p{Line_Break: Open_Punctuation} (Short: \p{Lb=OP}) (87)
\p{Line_Break: PO} \p{Line_Break=Postfix_Numeric} (30)
\p{Line_Break: Postfix_Numeric} (Short: \p{Lb=PO}) (30)
\p{Line_Break: PR} \p{Line_Break=Prefix_Numeric} (65)
\p{Line_Break: Prefix_Numeric} (Short: \p{Lb=PR}) (65)
\p{Line_Break: QU} \p{Line_Break=Quotation} (39)
\p{Line_Break: Quotation} (Short: \p{Lb=QU}) (39)
\p{Line_Break: Regional_Indicator} (Short: \p{Lb=RI}) (26)
\p{Line_Break: RI} \p{Line_Break=Regional_Indicator} (26)
\p{Line_Break: SA} \p{Line_Break=Complex_Context} (734)
D \p{Line_Break: SG} \p{Line_Break=Surrogate} (2048)
\p{Line_Break: SP} \p{Line_Break=Space} (1)
\p{Line_Break: Space} (Short: \p{Lb=SP}) (1)
D \p{Line_Break: Surrogate} Deprecated by Unicode because surrogates should never appear in well-formed text, and therefore shouldn't be the basis for line breaking (Short: \p{Lb=SG}) (2048)
\p{Line_Break: SY} \p{Line_Break=Break_Symbols} (1)
\p{Line_Break: Unknown} (Short: \p{Lb=XX}) (903_847 plus all above-Unicode code points)
\p{Line_Break: WJ} \p{Line_Break=Word_Joiner} (2)
\p{Line_Break: Word_Joiner} (Short: \p{Lb=WJ}) (2)
\p{Line_Break: XX} \p{Line_Break=Unknown} (903_847 plus all above-Unicode code points)
\p{Line_Break: ZW} \p{Line_Break=ZWSpace} (1)
\p{Line_Break: ZWJ} (Short: \p{Lb=ZWJ}) (1)
\p{Line_Break: ZWSpace} (Short: \p{Lb=ZW}) (1)
\p{Line_Separator} \p{General_Category=Line_Separator} (Short: \p{Zl}) (1)
\p{Linear_A} \p{Script_Extensions=Linear_A} (Short: \p{Lina}; NOT \p{Block=Linear_A}) (386)
\p{Linear_B} \p{Script_Extensions=Linear_B} (Short: \p{Linb}) (268)
X \p{Linear_B_Ideograms} \p{Block=Linear_B_Ideograms} (128)
X \p{Linear_B_Syllabary} \p{Block=Linear_B_Syllabary} (128)
\p{Lisu} \p{Script_Extensions=Lisu} (48)
\p{Ll} \p{Lowercase_Letter} (= \p{General_Category=Lowercase_Letter}) (/i= General_Category=Cased_Letter) (2063)
\p{Lm} \p{Modifier_Letter} (= \p{General_Category=Modifier_Letter}) (249)
\p{Lo} \p{Other_Letter} (= \p{General_Category=Other_Letter}) (112_721)
\p{LOE} \p{Logical_Order_Exception} (= \p{Logical_Order_Exception=Y}) (19)
\p{LOE: *} \p{Logical_Order_Exception: *}
\p{Logical_Order_Exception} \p{Logical_Order_Exception=Y} (Short: \p{LOE}) (19)
\p{Logical_Order_Exception: N*} (Short: \p{LOE=N}, \P{LOE}) (1_114_093 plus all above-Unicode code points)
\p{Logical_Order_Exception: Y*} (Short: \p{LOE=Y}, \p{LOE}) (19)
X \p{Low_Surrogates} \p{Block=Low_Surrogates} (1024)
\p{Lower} \p{XPosixLower} (= \p{Lowercase=Y}) (/i= Cased=Yes) (2252)
\p{Lower: *} \p{Lowercase: *}
\p{Lowercase} \p{XPosixLower} (= \p{Lowercase=Y}) (/i= Cased=Yes) (2252)
\p{Lowercase: N*} (Short: \p{Lower=N}, \P{Lower}; /i= Cased=No) (1_111_860 plus all above-Unicode code points)
\p{Lowercase: Y*} (Short: \p{Lower=Y}, \p{Lower}; /i= Cased=Yes) (2252)
\p{Lowercase_Letter} \p{General_Category=Lowercase_Letter} (Short: \p{Ll}; /i= General_Category=Cased_Letter) (2063)
\p{Lt} \p{Titlecase_Letter} (= \p{General_Category=Titlecase_Letter}) (/i= General_Category=Cased_Letter) (31)
\p{Lu} \p{Uppercase_Letter} (= \p{General_Category=Uppercase_Letter}) (/i= General_Category=Cased_Letter) (1702)
\p{Lyci} \p{Lycian} (= \p{Script_Extensions=Lycian}) (NOT \p{Block=Lycian}) (29)
\p{Lycian} \p{Script_Extensions=Lycian} (Short: \p{Lyci}; NOT \p{Block=Lycian}) (29)
\p{Lydi} \p{Lydian} (= \p{Script_Extensions=Lydian}) (NOT \p{Block=Lydian}) (27)
\p{Lydian} \p{Script_Extensions=Lydian} (Short: \p{Lydi}; NOT \p{Block=Lydian}) (27)
\p{M} \pM \p{Mark} (= \p{General_Category=Mark}) (2097)
\p{Mahajani} \p{Script_Extensions=Mahajani} (Short: \p{Mahj}; NOT \p{Block=Mahajani}) (61)
\p{Mahj} \p{Mahajani} (= \p{Script_Extensions=Mahajani}) (NOT \p{Block=Mahajani}) (61)
X \p{Mahjong} \p{Mahjong_Tiles} (= \p{Block=Mahjong_Tiles}) (48)
X \p{Mahjong_Tiles} \p{Block=Mahjong_Tiles} (Short: \p{InMahjong}) (48)
\p{Malayalam} \p{Script_Extensions=Malayalam} (Short: \p{Mlym}; NOT \p{Block=Malayalam}) (119)
\p{Mand} \p{Mandaic} (= \p{Script_Extensions=Mandaic}) (NOT \p{Block=Mandaic}) (30)
\p{Mandaic} \p{Script_Extensions=Mandaic} (Short: \p{Mand}; NOT \p{Block=Mandaic}) (30)
\p{Mani} \p{Manichaean} (= \p{Script_Extensions=Manichaean}) (NOT \p{Block=Manichaean}) (52)
\p{Manichaean} \p{Script_Extensions=Manichaean} (Short: \p{Mani}; NOT \p{Block=Manichaean}) (52)
\p{Marc} \p{Marchen} (= \p{Script_Extensions=Marchen}) (NOT \p{Block=Marchen}) (68)
\p{Marchen} \p{Script_Extensions=Marchen} (Short: \p{Marc}; NOT \p{Block=Marchen}) (68)
\p{Mark} \p{General_Category=Mark} (Short: \p{M}) (2097)
\p{Math} \p{Math=Y} (2310)
\p{Math: N*} (Single: \P{Math}) (1_111_802 plus all above-Unicode code points)
\p{Math: Y*} (Single: \p{Math}) (2310)
X \p{Math_Alphanum} \p{Mathematical_Alphanumeric_Symbols} (= \p{Block=Mathematical_Alphanumeric_Symbols}) (1024)
X \p{Math_Operators} \p{Mathematical_Operators} (= \p{Block=Mathematical_Operators}) (256)
\p{Math_Symbol} \p{General_Category=Math_Symbol} (Short: \p{Sm}) (948)
X \p{Mathematical_Alphanumeric_Symbols} \p{Block=Mathematical_Alphanumeric_Symbols} (Short: \p{InMathAlphanum}) (1024)
X \p{Mathematical_Operators} \p{Block=Mathematical_Operators} (Short: \p{InMathOperators}) (256)
\p{Mc} \p{Spacing_Mark} (= \p{General_Category=Spacing_Mark}) (394)
\p{Me} \p{Enclosing_Mark} (= \p{General_Category=Enclosing_Mark}) (13)
\p{Meetei_Mayek} \p{Script_Extensions=Meetei_Mayek} (Short: \p{Mtei}; NOT \p{Block=Meetei_Mayek}) (79)
X \p{Meetei_Mayek_Ext} \p{Meetei_Mayek_Extensions} (= \p{Block=Meetei_Mayek_Extensions}) (32)
X \p{Meetei_Mayek_Extensions} \p{Block=Meetei_Mayek_Extensions} (Short: \p{InMeeteiMayekExt}) (32)
\p{Mend} \p{Mende_Kikakui} (= \p{Script_Extensions=Mende_Kikakui}) (NOT \p{Block=Mende_Kikakui}) (213)
\p{Mende_Kikakui} \p{Script_Extensions=Mende_Kikakui} (Short: \p{Mend}; NOT \p{Block=Mende_Kikakui}) (213)
\p{Merc} \p{Meroitic_Cursive} (= \p{Script_Extensions=Meroitic_Cursive}) (NOT \p{Block=Meroitic_Cursive}) (90)
\p{Mero} \p{Meroitic_Hieroglyphs} (= \p{Script_Extensions=Meroitic_Hieroglyphs}) (32)
\p{Meroitic_Cursive} \p{Script_Extensions=Meroitic_Cursive} (Short: \p{Merc}; NOT \p{Block=Meroitic_Cursive}) (90)
\p{Meroitic_Hieroglyphs} \p{Script_Extensions=Meroitic_Hieroglyphs} (Short: \p{Mero}) (32)
\p{Miao} \p{Script_Extensions=Miao} (NOT \p{Block=Miao}) (133)
X \p{Misc_Arrows} \p{Miscellaneous_Symbols_And_Arrows} (= \p{Block=Miscellaneous_Symbols_And_Arrows}) (256)
X \p{Misc_Math_Symbols_A} \p{Miscellaneous_Mathematical_Symbols_A} (= \p{Block=Miscellaneous_Mathematical_Symbols_A}) (48)
X \p{Misc_Math_Symbols_B} \p{Miscellaneous_Mathematical_Symbols_B} (= \p{Block=Miscellaneous_Mathematical_Symbols_B}) (128)
X \p{Misc_Pictographs} \p{Miscellaneous_Symbols_And_Pictographs} (= \p{Block=Miscellaneous_Symbols_And_Pictographs}) (768)
X \p{Misc_Symbols} \p{Miscellaneous_Symbols} (= \p{Block=Miscellaneous_Symbols}) (256)
X \p{Misc_Technical} \p{Miscellaneous_Technical} (= \p{Block=Miscellaneous_Technical}) (256)
X \p{Miscellaneous_Mathematical_Symbols_A} \p{Block=Miscellaneous_Mathematical_Symbols_A} (Short: \p{InMiscMathSymbolsA}) (48)
X \p{Miscellaneous_Mathematical_Symbols_B} \p{Block=Miscellaneous_Mathematical_Symbols_B} (Short: \p{InMiscMathSymbolsB}) (128)
X \p{Miscellaneous_Symbols} \p{Block=Miscellaneous_Symbols} (Short: \p{InMiscSymbols}) (256)
X \p{Miscellaneous_Symbols_And_Arrows} \p{Block=Miscellaneous_Symbols_And_Arrows} (Short: \p{InMiscArrows}) (256)
X \p{Miscellaneous_Symbols_And_Pictographs} \p{Block=Miscellaneous_Symbols_And_Pictographs} (Short: \p{InMiscPictographs}) (768)
X \p{Miscellaneous_Technical} \p{Block=Miscellaneous_Technical} (Short: \p{InMiscTechnical}) (256)
\p{Mlym} \p{Malayalam} (= \p{Script_Extensions=Malayalam}) (NOT \p{Block=Malayalam}) (119)
\p{Mn} \p{Nonspacing_Mark} (= \p{General_Category=Nonspacing_Mark}) (1690)
\p{Modi} \p{Script_Extensions=Modi} (NOT \p{Block=Modi}) (89)
\p{Modifier_Letter} \p{General_Category=Modifier_Letter} (Short: \p{Lm}) (249)
X \p{Modifier_Letters} \p{Spacing_Modifier_Letters} (= \p{Block=Spacing_Modifier_Letters}) (80)
\p{Modifier_Symbol} \p{General_Category=Modifier_Symbol} (Short: \p{Sk}) (121)
X \p{Modifier_Tone_Letters} \p{Block=Modifier_Tone_Letters} (32)
\p{Mong} \p{Mongolian} (= \p{Script_Extensions=Mongolian}) (NOT \p{Block=Mongolian}) (169)
\p{Mongolian} \p{Script_Extensions=Mongolian} (Short: \p{Mong}; NOT \p{Block=Mongolian}) (169)
X \p{Mongolian_Sup} \p{Mongolian_Supplement} (= \p{Block=Mongolian_Supplement}) (32)
X \p{Mongolian_Supplement} \p{Block=Mongolian_Supplement} (Short: \p{InMongolianSup}) (32)
\p{Mro} \p{Script_Extensions=Mro} (NOT \p{Block=Mro}) (43)
\p{Mroo} \p{Mro} (= \p{Script_Extensions=Mro}) (NOT \p{Block=Mro}) (43)
\p{Mtei} \p{Meetei_Mayek} (= \p{Script_Extensions=Meetei_Mayek}) (NOT \p{Block=Meetei_Mayek}) (79)
\p{Mult} \p{Multani} (= \p{Script_Extensions=Multani}) (NOT \p{Block=Multani}) (48)
\p{Multani} \p{Script_Extensions=Multani} (Short: \p{Mult}; NOT \p{Block=Multani}) (48)
X \p{Music} \p{Musical_Symbols} (= \p{Block=Musical_Symbols}) (256)
X \p{Musical_Symbols} \p{Block=Musical_Symbols} (Short: \p{InMusic}) (256)
\p{Myanmar} \p{Script_Extensions=Myanmar} (Short: \p{Mymr}; NOT \p{Block=Myanmar}) (224)
X \p{Myanmar_Ext_A} \p{Myanmar_Extended_A} (= \p{Block=Myanmar_Extended_A}) (32)
X \p{Myanmar_Ext_B} \p{Myanmar_Extended_B} (= \p{Block=Myanmar_Extended_B}) (32)
X \p{Myanmar_Extended_A} \p{Block=Myanmar_Extended_A} (Short: \p{InMyanmarExtA}) (32)
X \p{Myanmar_Extended_B} \p{Block=Myanmar_Extended_B} (Short: \p{InMyanmarExtB}) (32)
\p{Mymr} \p{Myanmar} (= \p{Script_Extensions=Myanmar}) (NOT \p{Block=Myanmar}) (224)
\p{N} \pN \p{Number} (= \p{General_Category=Number}) (1492)
\p{Nabataean} \p{Script_Extensions=Nabataean} (Short: \p{Nbat}; NOT \p{Block=Nabataean}) (40)
\p{Narb} \p{Old_North_Arabian} (= \p{Script_Extensions=Old_North_Arabian}) (32)
X \p{NB} \p{No_Block} (= \p{Block=No_Block}) (842_320 plus all above-Unicode code points)
\p{Nbat} \p{Nabataean} (= \p{Script_Extensions=Nabataean}) (NOT \p{Block=Nabataean}) (40)
\p{NChar} \p{Noncharacter_Code_Point} (= \p{Noncharacter_Code_Point=Y}) (66)
\p{NChar: *} \p{Noncharacter_Code_Point: *}
\p{Nd} \p{XPosixDigit} (= \p{General_Category=Decimal_Number}) (580)
\p{New_Tai_Lue} \p{Script_Extensions=New_Tai_Lue} (Short: \p{Talu}; NOT \p{Block=New_Tai_Lue}) (83)
\p{Newa} \p{Script_Extensions=Newa} (NOT \p{Block=Newa}) (92)
\p{NFC_QC: *} \p{NFC_Quick_Check: *}
\p{NFC_Quick_Check: M} \p{NFC_Quick_Check=Maybe} (110)
\p{NFC_Quick_Check: Maybe} (Short: \p{NFCQC=M}) (110)
\p{NFC_Quick_Check: N} \p{NFC_Quick_Check=No} (NOT \P{NFC_Quick_Check} NOR \P{NFC_QC}) (1120)
\p{NFC_Quick_Check: No} (Short: \p{NFCQC=N}; NOT \P{NFC_Quick_Check} NOR \P{NFC_QC}) (1120)
\p{NFC_Quick_Check: Y} \p{NFC_Quick_Check=Yes} (NOT \p{NFC_Quick_Check} NOR \p{NFC_QC}) (1_112_882 plus all above-Unicode code points)
\p{NFC_Quick_Check: Yes} (Short: \p{NFCQC=Y}; NOT \p{NFC_Quick_Check} NOR \p{NFC_QC}) (1_112_882 plus all above-Unicode code points)
\p{NFD_QC: *} \p{NFD_Quick_Check: *}
\p{NFD_Quick_Check: N} \p{NFD_Quick_Check=No} (NOT \P{NFD_Quick_Check} NOR \P{NFD_QC}) (13_232)
\p{NFD_Quick_Check: No} (Short: \p{NFDQC=N}; NOT \P{NFD_Quick_Check} NOR \P{NFD_QC}) (13_232)
\p{NFD_Quick_Check: Y} \p{NFD_Quick_Check=Yes} (NOT \p{NFD_Quick_Check} NOR \p{NFD_QC}) (1_100_880 plus all above-Unicode code points)
\p{NFD_Quick_Check: Yes} (Short: \p{NFDQC=Y}; NOT \p{NFD_Quick_Check} NOR \p{NFD_QC}) (1_100_880 plus all above-Unicode code points)
\p{NFKC_QC: *} \p{NFKC_Quick_Check: *}
\p{NFKC_Quick_Check: M} \p{NFKC_Quick_Check=Maybe} (110)
\p{NFKC_Quick_Check: Maybe} (Short: \p{NFKCQC=M}) (110)
\p{NFKC_Quick_Check: N} \p{NFKC_Quick_Check=No} (NOT \P{NFKC_Quick_Check} NOR \P{NFKC_QC}) (4794)
\p{NFKC_Quick_Check: No} (Short: \p{NFKCQC=N}; NOT \P{NFKC_Quick_Check} NOR \P{NFKC_QC}) (4794)
\p{NFKC_Quick_Check: Y} \p{NFKC_Quick_Check=Yes} (NOT \p{NFKC_Quick_Check} NOR \p{NFKC_QC}) (1_109_208 plus all above-Unicode code points)
\p{NFKC_Quick_Check: Yes} (Short: \p{NFKCQC=Y}; NOT \p{NFKC_Quick_Check} NOR \p{NFKC_QC}) (1_109_208 plus all above-Unicode code points)
\p{NFKD_QC: *} \p{NFKD_Quick_Check: *}
\p{NFKD_Quick_Check: N} \p{NFKD_Quick_Check=No} (NOT \P{NFKD_Quick_Check} NOR \P{NFKD_QC}) (16_894)
\p{NFKD_Quick_Check: No} (Short: \p{NFKDQC=N}; NOT \P{NFKD_Quick_Check} NOR \P{NFKD_QC}) (16_894)
\p{NFKD_Quick_Check: Y} \p{NFKD_Quick_Check=Yes} (NOT \p{NFKD_Quick_Check} NOR \p{NFKD_QC}) (1_097_218 plus all above-Unicode code points)
\p{NFKD_Quick_Check: Yes} (Short: \p{NFKDQC=Y}; NOT \p{NFKD_Quick_Check} NOR \p{NFKD_QC}) (1_097_218 plus all above-Unicode code points)
\p{Nko} \p{Script_Extensions=Nko} (NOT \p{NKo}) (59)
\p{Nkoo} \p{Nko} (= \p{Script_Extensions=Nko}) (NOT \p{NKo}) (59)
\p{Nl} \p{Letter_Number} (= \p{General_Category=Letter_Number}) (236)
\p{No} \p{Other_Number} (= \p{General_Category=Other_Number}) (676)
X \p{No_Block} \p{Block=No_Block} (Short: \p{InNB}) (842_320 plus all above-Unicode code points)
\p{Noncharacter_Code_Point} \p{Noncharacter_Code_Point=Y} (Short: \p{NChar}) (66)
\p{Noncharacter_Code_Point: N*} (Short: \p{NChar=N}, \P{NChar}) (1_114_046 plus all above-Unicode code points)
\p{Noncharacter_Code_Point: Y*} (Short: \p{NChar=Y}, \p{NChar}) (66)
\p{Nonspacing_Mark} \p{General_Category=Nonspacing_Mark} (Short: \p{Mn}) (1690)
\p{Nt: *} \p{Numeric_Type: *}
\p{Number} \p{General_Category=Number} (Short: \p{N}) (1492)
X \p{Number_Forms} \p{Block=Number_Forms} (64)
\p{Numeric_Type: De} \p{Numeric_Type=Decimal} (580)
\p{Numeric_Type: Decimal} (Short: \p{Nt=De}) (580)
\p{Numeric_Type: Di} \p{Numeric_Type=Digit} (128)
\p{Numeric_Type: Digit} (Short: \p{Nt=Di}) (128)
\p{Numeric_Type: None} (Short: \p{Nt=None}) (1_112_539 plus all above-Unicode code points)
\p{Numeric_Type: Nu} \p{Numeric_Type=Numeric} (865)
\p{Numeric_Type: Numeric} (Short: \p{Nt=Nu}) (865)
T \p{Numeric_Value: -1/2} (Short: \p{Nv=-1/2}) (1)
T \p{Numeric_Value: 0} (Short: \p{Nv=0}) (74)
T \p{Numeric_Value: 1/160} (Short: \p{Nv=1/160}) (1)
T \p{Numeric_Value: 1/40} (Short: \p{Nv=1/40}) (1)
T \p{Numeric_Value: 3/80} (Short: \p{Nv=3/80}) (1)
T \p{Numeric_Value: 1/20} (Short: \p{Nv=1/20}) (1)
T \p{Numeric_Value: 1/16} (Short: \p{Nv=1/16}) (4)
T \p{Numeric_Value: 1/12} (Short: \p{Nv=1/12}) (1)
T \p{Numeric_Value: 1/10} (Short: \p{Nv=1/10}) (2)
T \p{Numeric_Value: 1/9} (Short: \p{Nv=1/9}) (1)
T \p{Numeric_Value: 1/8} (Short: \p{Nv=1/8}) (6)
T \p{Numeric_Value: 1/7} (Short: \p{Nv=1/7}) (1)
T \p{Numeric_Value: 3/20} (Short: \p{Nv=3/20}) (1)
T \p{Numeric_Value: 1/6} (Short: \p{Nv=1/6}) (3)
T \p{Numeric_Value: 3/16} (Short: \p{Nv=3/16}) (4)
T \p{Numeric_Value: 1/5} (Short: \p{Nv=1/5}) (2)
T \p{Numeric_Value: 1/4} (Short: \p{Nv=1/4}) (12)
T \p{Numeric_Value: 1/3} (Short: \p{Nv=1/3}) (6)
T \p{Numeric_Value: 3/8} (Short: \p{Nv=3/8}) (1)
T \p{Numeric_Value: 2/5} (Short: \p{Nv=2/5}) (1)
T \p{Numeric_Value: 5/12} (Short: \p{Nv=5/12}) (1)
T \p{Numeric_Value: 1/2} (Short: \p{Nv=1/2}) (13)
T \p{Numeric_Value: 7/12} (Short: \p{Nv=7/12}) (1)
T \p{Numeric_Value: 3/5} (Short: \p{Nv=3/5}) (1)
T \p{Numeric_Value: 5/8} (Short: \p{Nv=5/8}) (1)
T \p{Numeric_Value: 2/3} (Short: \p{Nv=2/3}) (7)
T \p{Numeric_Value: 3/4} (Short: \p{Nv=3/4}) (7)
T \p{Numeric_Value: 4/5} (Short: \p{Nv=4/5}) (1)
T \p{Numeric_Value: 5/6} (Short: \p{Nv=5/6}) (3)
T \p{Numeric_Value: 7/8} (Short: \p{Nv=7/8}) (1)
T \p{Numeric_Value: 11/12} (Short: \p{Nv=11/12}) (1)
T \p{Numeric_Value: 1} (Short: \p{Nv=1}) (121)
T \p{Numeric_Value: 3/2} (Short: \p{Nv=3/2}) (1)
T \p{Numeric_Value: 2} (Short: \p{Nv=2}) (121)
T \p{Numeric_Value: 5/2} (Short: \p{Nv=5/2}) (1)
T \p{Numeric_Value: 3} (Short: \p{Nv=3}) (123)
T \p{Numeric_Value: 7/2} (Short: \p{Nv=7/2}) (1)
T \p{Numeric_Value: 4} (Short: \p{Nv=4}) (115)
T \p{Numeric_Value: 9/2} (Short: \p{Nv=9/2}) (1)
T \p{Numeric_Value: 5} (Short: \p{Nv=5}) (113)
T \p{Numeric_Value: 11/2} (Short: \p{Nv=11/2}) (1)
T \p{Numeric_Value: 6} (Short: \p{Nv=6}) (100)
T \p{Numeric_Value: 13/2} (Short: \p{Nv=13/2}) (1)
T \p{Numeric_Value: 7} (Short: \p{Nv=7}) (99)
T \p{Numeric_Value: 15/2} (Short: \p{Nv=15/2}) (1)
T \p{Numeric_Value: 8} (Short: \p{Nv=8}) (95)
T \p{Numeric_Value: 17/2} (Short: \p{Nv=17/2}) (1)
T \p{Numeric_Value: 9} (Short: \p{Nv=9}) (99)
T \p{Numeric_Value: 10} (Short: \p{Nv=10}) (54)
T \p{Numeric_Value: 11} (Short: \p{Nv=11}) (6)
T \p{Numeric_Value: 12} (Short: \p{Nv=12}) (6)
T \p{Numeric_Value: 13} (Short: \p{Nv=13}) (4)
T \p{Numeric_Value: 14} (Short: \p{Nv=14}) (4)
T \p{Numeric_Value: 15} (Short: \p{Nv=15}) (4)
T \p{Numeric_Value: 16} (Short: \p{Nv=16}) (5)
T \p{Numeric_Value: 17} (Short: \p{Nv=17}) (5)
T \p{Numeric_Value: 18} (Short: \p{Nv=18}) (5)
T \p{Numeric_Value: 19} (Short: \p{Nv=19}) (5)
T \p{Numeric_Value: 20} (Short: \p{Nv=20}) (31)
T \p{Numeric_Value: 21} (Short: \p{Nv=21}) (1)
T \p{Numeric_Value: 22} (Short: \p{Nv=22}) (1)
T \p{Numeric_Value: 23} (Short: \p{Nv=23}) (1)
T \p{Numeric_Value: 24} (Short: \p{Nv=24}) (1)
T \p{Numeric_Value: 25} (Short: \p{Nv=25}) (1)
T \p{Numeric_Value: 26} (Short: \p{Nv=26}) (1)
T \p{Numeric_Value: 27} (Short: \p{Nv=27}) (1)
T \p{Numeric_Value: 28} (Short: \p{Nv=28}) (1)
T \p{Numeric_Value: 29} (Short: \p{Nv=29}) (1)
T \p{Numeric_Value: 30} (Short: \p{Nv=30}) (16)
T \p{Numeric_Value: 31} (Short: \p{Nv=31}) (1)
T \p{Numeric_Value: 32} (Short: \p{Nv=32}) (1)
T \p{Numeric_Value: 33} (Short: \p{Nv=33}) (1)
T \p{Numeric_Value: 34} (Short: \p{Nv=34}) (1)
T \p{Numeric_Value: 35} (Short: \p{Nv=35}) (1)
T \p{Numeric_Value: 36} (Short: \p{Nv=36}) (1)
T \p{Numeric_Value: 37} (Short: \p{Nv=37}) (1)
T \p{Numeric_Value: 38} (Short: \p{Nv=38}) (1)
T \p{Numeric_Value: 39} (Short: \p{Nv=39}) (1)
T \p{Numeric_Value: 40} (Short: \p{Nv=40}) (16)
T \p{Numeric_Value: 41} (Short: \p{Nv=41}) (1)
T \p{Numeric_Value: 42} (Short: \p{Nv=42}) (1)
T \p{Numeric_Value: 43} (Short: \p{Nv=43}) (1)
T \p{Numeric_Value: 44} (Short: \p{Nv=44}) (1)
T \p{Numeric_Value: 45} (Short: \p{Nv=45}) (1)
T \p{Numeric_Value: 46} (Short: \p{Nv=46}) (1)
T \p{Numeric_Value: 47} (Short: \p{Nv=47}) (1)
T \p{Numeric_Value: 48} (Short: \p{Nv=48}) (1)
T \p{Numeric_Value: 49} (Short: \p{Nv=49}) (1)
T \p{Numeric_Value: 50} (Short: \p{Nv=50}) (27)
T \p{Numeric_Value: 60} (Short: \p{Nv=60}) (11)
T \p{Numeric_Value: 70} (Short: \p{Nv=70}) (11)
T \p{Numeric_Value: 80} (Short: \p{Nv=80}) (10)
T \p{Numeric_Value: 90} (Short: \p{Nv=90}) (10)
T \p{Numeric_Value: 100} (Short: \p{Nv=100}) (30)
T \p{Numeric_Value: 200} (Short: \p{Nv=200}) (4)
T \p{Numeric_Value: 300} (Short: \p{Nv=300}) (5)
T \p{Numeric_Value: 400} (Short: \p{Nv=400}) (4)
T \p{Numeric_Value: 500} (Short: \p{Nv=500}) (14)
T \p{Numeric_Value: 600} (Short: \p{Nv=600}) (4)
T \p{Numeric_Value: 700} (Short: \p{Nv=700}) (4)
T \p{Numeric_Value: 800} (Short: \p{Nv=800}) (4)
T \p{Numeric_Value: 900} (Short: \p{Nv=900}) (5)
T \p{Numeric_Value: 1000} (Short: \p{Nv=1000}) (20)
T \p{Numeric_Value: 2000} (Short: \p{Nv=2000}) (2)
T \p{Numeric_Value: 3000} (Short: \p{Nv=3000}) (2)
T \p{Numeric_Value: 4000} (Short: \p{Nv=4000}) (2)
T \p{Numeric_Value: 5000} (Short: \p{Nv=5000}) (6)
T \p{Numeric_Value: 6000} (Short: \p{Nv=6000}) (2)
T \p{Numeric_Value: 7000} (Short: \p{Nv=7000}) (2)
T \p{Numeric_Value: 8000} (Short: \p{Nv=8000}) (2)
T \p{Numeric_Value: 9000} (Short: \p{Nv=9000}) (2)
T \p{Numeric_Value: 10000} (= 1.0e+04) (Short: \p{Nv=10000}) (9)
T \p{Numeric_Value: 20000} (= 2.0e+04) (Short: \p{Nv=20000}) (2)
T \p{Numeric_Value: 30000} (= 3.0e+04) (Short: \p{Nv=30000}) (2)
T \p{Numeric_Value: 40000} (= 4.0e+04) (Short: \p{Nv=40000}) (2)
T \p{Numeric_Value: 50000} (= 5.0e+04) (Short: \p{Nv=50000}) (5)
T \p{Numeric_Value: 60000} (= 6.0e+04) (Short: \p{Nv=60000}) (2)
T \p{Numeric_Value: 70000} (= 7.0e+04) (Short: \p{Nv=70000}) (2)
T \p{Numeric_Value: 80000} (= 8.0e+04) (Short: \p{Nv=80000}) (2)
T \p{Numeric_Value: 90000} (= 9.0e+04) (Short: \p{Nv=90000}) (2)
T \p{Numeric_Value: 100000} (= 1.0e+05) (Short: \p{Nv=100000}) (2)
T \p{Numeric_Value: 200000} (= 2.0e+05) (Short: \p{Nv=200000}) (1)
T \p{Numeric_Value: 216000} (= 2.2e+05) (Short: \p{Nv=216000}) (1)
T \p{Numeric_Value: 300000} (= 3.0e+05) (Short: \p{Nv=300000}) (1)
T \p{Numeric_Value: 400000} (= 4.0e+05) (Short: \p{Nv=400000}) (1)
T \p{Numeric_Value: 432000} (= 4.3e+05) (Short: \p{Nv=432000}) (1)
T \p{Numeric_Value: 500000} (= 5.0e+05) (Short: \p{Nv=500000}) (1)
T \p{Numeric_Value: 600000} (= 6.0e+05) (Short: \p{Nv=600000}) (1)
T \p{Numeric_Value: 700000} (= 7.0e+05) (Short: \p{Nv=700000}) (1)
T \p{Numeric_Value: 800000} (= 8.0e+05) (Short: \p{Nv=800000}) (1)
T \p{Numeric_Value: 900000} (= 9.0e+05) (Short: \p{Nv=900000}) (1)
T \p{Numeric_Value: 1000000} (= 1.0e+06) (Short: \p{Nv=1000000}) (1)
T \p{Numeric_Value: 100000000} (= 1.0e+08) (Short: \p{Nv=100000000}) (3)
T \p{Numeric_Value: 10000000000} (= 1.0e+10) (Short: \p{Nv=10000000000}) (1)
T \p{Numeric_Value: 1000000000000} (= 1.0e+12) (Short: \p{Nv=1000000000000}) (2)
\p{Numeric_Value: NaN} (Short: \p{Nv=NaN}) (1_112_539 plus all above-Unicode code points)
\p{Nv: *} \p{Numeric_Value: *}
X \p{OCR} \p{Optical_Character_Recognition} (= \p{Block=Optical_Character_Recognition}) (32)
\p{Ogam} \p{Ogham} (= \p{Script_Extensions=Ogham}) (NOT \p{Block=Ogham}) (29)
\p{Ogham} \p{Script_Extensions=Ogham} (Short: \p{Ogam}; NOT \p{Block=Ogham}) (29)
\p{Ol_Chiki} \p{Script_Extensions=Ol_Chiki} (Short: \p{Olck}) (48)
\p{Olck} \p{Ol_Chiki} (= \p{Script_Extensions=Ol_Chiki}) (48)
\p{Old_Hungarian} \p{Script_Extensions=Old_Hungarian} (Short: \p{Hung}; NOT \p{Block=Old_Hungarian}) (108)
\p{Old_Italic} \p{Script_Extensions=Old_Italic} (Short: \p{Ital}; NOT \p{Block=Old_Italic}) (36)
\p{Old_North_Arabian} \p{Script_Extensions=Old_North_Arabian} (Short: \p{Narb}) (32)
\p{Old_Permic} \p{Script_Extensions=Old_Permic} (Short: \p{Perm}; NOT \p{Block=Old_Permic}) (44)
\p{Old_Persian} \p{Script_Extensions=Old_Persian} (Short: \p{Xpeo}; NOT \p{Block=Old_Persian}) (50)
\p{Old_South_Arabian} \p{Script_Extensions=Old_South_Arabian} (Short: \p{Sarb}) (32)
\p{Old_Turkic} \p{Script_Extensions=Old_Turkic} (Short: \p{Orkh}; NOT \p{Block=Old_Turkic}) (73)
\p{Open_Punctuation} \p{General_Category=Open_Punctuation} (Short: \p{Ps}) (75)
X \p{Optical_Character_Recognition} \p{Block=Optical_Character_Recognition} (Short: \p{InOCR}) (32)
\p{Oriya} \p{Script_Extensions=Oriya} (Short: \p{Orya}; NOT \p{Block=Oriya}) (94)
\p{Orkh} \p{Old_Turkic} (= \p{Script_Extensions=Old_Turkic}) (NOT \p{Block=Old_Turkic}) (73)
X \p{Ornamental_Dingbats} \p{Block=Ornamental_Dingbats} (48)
\p{Orya} \p{Oriya} (= \p{Script_Extensions=Oriya}) (NOT \p{Block=Oriya}) (94)
\p{Osage} \p{Script_Extensions=Osage} (Short: \p{Osge}; NOT \p{Block=Osage}) (72)
\p{Osge} \p{Osage} (= \p{Script_Extensions=Osage}) (NOT \p{Block=Osage}) (72)
\p{Osma} \p{Osmanya} (= \p{Script_Extensions=Osmanya}) (NOT \p{Block=Osmanya}) (40)
\p{Osmanya} \p{Script_Extensions=Osmanya} (Short: \p{Osma}; NOT \p{Block=Osmanya}) (40)
\p{Other} \p{General_Category=Other} (Short: \p{C}) (986_091 plus all above-Unicode code points)
\p{Other_Letter} \p{General_Category=Other_Letter} (Short: \p{Lo}) (112_721)
\p{Other_Number} \p{General_Category=Other_Number} (Short: \p{No}) (676)
\p{Other_Punctuation} \p{General_Category=Other_Punctuation} (Short: \p{Po}) (544)
\p{Other_Symbol} \p{General_Category=Other_Symbol} (Short: \p{So}) (5777)
\p{P} \pP \p{Punct} (= \p{General_Category=Punctuation}) (NOT \p{General_Punctuation}) (748)
\p{Pahawh_Hmong} \p{Script_Extensions=Pahawh_Hmong} (Short: \p{Hmng}; NOT \p{Block=Pahawh_Hmong}) (127)
\p{Palm} \p{Palmyrene} (= \p{Script_Extensions=Palmyrene}) (32)
\p{Palmyrene} \p{Script_Extensions=Palmyrene} (Short: \p{Palm}) (32)
\p{Paragraph_Separator} \p{General_Category=Paragraph_Separator} (Short: \p{Zp}) (1)
\p{Pat_Syn} \p{Pattern_Syntax} (= \p{Pattern_Syntax=Y}) (2760)
\p{Pat_Syn: *} \p{Pattern_Syntax: *}
\p{Pat_WS} \p{Pattern_White_Space} (= \p{Pattern_White_Space=Y}) (11)
\p{Pat_WS: *} \p{Pattern_White_Space: *}
\p{Pattern_Syntax} \p{Pattern_Syntax=Y} (Short: \p{PatSyn}) (2760)
\p{Pattern_Syntax: N*} (Short: \p{PatSyn=N}, \P{PatSyn}) (1_111_352 plus all above-Unicode code points)
\p{Pattern_Syntax: Y*} (Short: \p{PatSyn=Y}, \p{PatSyn}) (2760)
\p{Pattern_White_Space} \p{Pattern_White_Space=Y} (Short: \p{PatWS}) (11)
\p{Pattern_White_Space: N*} (Short: \p{PatWS=N}, \P{PatWS}) (1_114_101 plus all above-Unicode code points)
\p{Pattern_White_Space: Y*} (Short: \p{PatWS=Y}, \p{PatWS}) (11)
\p{Pau_Cin_Hau} \p{Script_Extensions=Pau_Cin_Hau} (Short: \p{Pauc}; NOT \p{Block=Pau_Cin_Hau}) (57)
\p{Pauc} \p{Pau_Cin_Hau} (= \p{Script_Extensions=Pau_Cin_Hau}) (NOT \p{Block=Pau_Cin_Hau}) (57)
\p{Pc} \p{Connector_Punctuation} (= \p{General_Category=Connector_Punctuation}) (10)
\p{PCM} \p{Prepended_Concatenation_Mark} (= \p{Prepended_Concatenation_Mark=Y}) (10)
\p{PCM: *} \p{Prepended_Concatenation_Mark: *}
\p{Pd} \p{Dash_Punctuation} (= \p{General_Category=Dash_Punctuation}) (24)
\p{Pe} \p{Close_Punctuation} (= \p{General_Category=Close_Punctuation}) (73)
\p{PerlSpace} \p{PosixSpace} (6)
\p{PerlWord} \p{PosixWord} (63)
\p{Perm} \p{Old_Permic} (= \p{Script_Extensions=Old_Permic}) (NOT \p{Block=Old_Permic}) (44)
\p{Pf} \p{Final_Punctuation} (= \p{General_Category=Final_Punctuation}) (10)
\p{Phag} \p{Phags_Pa} (= \p{Script_Extensions=Phags_Pa}) (NOT \p{Block=Phags_Pa}) (59)
\p{Phags_Pa} \p{Script_Extensions=Phags_Pa} (Short: \p{Phag}; NOT \p{Block=Phags_Pa}) (59)
X \p{Phaistos} \p{Phaistos_Disc} (= \p{Block=Phaistos_Disc}) (48)
X \p{Phaistos_Disc} \p{Block=Phaistos_Disc} (Short: \p{InPhaistos}) (48)
\p{Phli} \p{Inscriptional_Pahlavi} (= \p{Script_Extensions=Inscriptional_Pahlavi}) (NOT \p{Block=Inscriptional_Pahlavi}) (27)
\p{Phlp} \p{Psalter_Pahlavi} (= \p{Script_Extensions=Psalter_Pahlavi}) (NOT \p{Block=Psalter_Pahlavi}) (30)
\p{Phnx} \p{Phoenician} (= \p{Script_Extensions=Phoenician}) (NOT \p{Block=Phoenician}) (29)
\p{Phoenician} \p{Script_Extensions=Phoenician} (Short: \p{Phnx}; NOT \p{Block=Phoenician}) (29)
X \p{Phonetic_Ext} \p{Phonetic_Extensions} (= \p{Block=Phonetic_Extensions}) (128)
X \p{Phonetic_Ext_Sup} \p{Phonetic_Extensions_Supplement} (= \p{Block=Phonetic_Extensions_Supplement}) (64)
X \p{Phonetic_Extensions} \p{Block=Phonetic_Extensions} (Short: \p{InPhoneticExt}) (128)
X \p{Phonetic_Extensions_Supplement} \p{Block=Phonetic_Extensions_Supplement} (Short: \p{InPhoneticExtSup}) (64)
\p{Pi} \p{Initial_Punctuation} (= \p{General_Category=Initial_Punctuation}) (12)
X \p{Playing_Cards} \p{Block=Playing_Cards} (96)
\p{Plrd} \p{Miao} (= \p{Script_Extensions=Miao}) (NOT \p{Block=Miao}) (133)
\p{Po} \p{Other_Punctuation} (= \p{General_Category=Other_Punctuation}) (544)
\p{PosixAlnum} [A-Za-z0-9] (62)
\p{PosixAlpha} [A-Za-z] (52)
\p{PosixBlank} \t and ' ' (2)
\p{PosixCntrl} ASCII control characters: NUL, SOH, STX, ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL (33)
\p{PosixDigit} [0-9] (10)
  3094. \p{PosixGraph} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-z]
  3095. (94)
  3096. \p{PosixLower} [a-z] (/i= PosixAlpha) (26)
  3097. \p{PosixPrint} [- 0-9A-Za-z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
  3098. (95)
  3099. \p{PosixPunct} [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~] (32)
  3100. \p{PosixSpace} \t, \n, \cK, \f, \r, and ' '. (\cK is
  3101. vertical tab) (Short: \p{PerlSpace}) (6)
  3102. \p{PosixUpper} [A-Z] (/i= PosixAlpha) (26)
  3103. \p{PosixWord} \w restricted to ASCII, i.e. [A-Za-z0-9_]
  3104. (Short: \p{PerlWord}) (63)
  3105. \p{PosixXDigit} \p{ASCII_Hex_Digit=Y} [0-9A-Fa-f] (Short:
  3106. \p{AHex}) (22)
  3107. \p{Prepended_Concatenation_Mark} \p{Prepended_Concatenation_Mark=
  3108. Y} (Short: \p{PCM}) (10)
  3109. \p{Prepended_Concatenation_Mark: N*} (Short: \p{PCM=N}, \P{PCM})
  3110. (1_114_102 plus all above-Unicode code
  3111. points)
  3112. \p{Prepended_Concatenation_Mark: Y*} (Short: \p{PCM=Y}, \p{PCM})
  3113. (10)
  3114. T \p{Present_In: 1.1} \p{Age=V1_1} (Short: \p{In=1.1}) (Perl
  3115. extension) (33_979)
  3116. T \p{Present_In: 2.0} Code point's usage introduced in version
  3117. 2.0 or earlier (Short: \p{In=2.0}) (Perl
  3118. extension) (178_500)
  3119. T \p{Present_In: 2.1} Code point's usage introduced in version
  3120. 2.1 or earlier (Short: \p{In=2.1}) (Perl
  3121. extension) (178_502)
  3122. T \p{Present_In: 3.0} Code point's usage introduced in version
  3123. 3.0 or earlier (Short: \p{In=3.0}) (Perl
  3124. extension) (188_809)
  3125. T \p{Present_In: 3.1} Code point's usage introduced in version
  3126. 3.1 or earlier (Short: \p{In=3.1}) (Perl
  3127. extension) (233_787)
  3128. T \p{Present_In: 3.2} Code point's usage introduced in version
  3129. 3.2 or earlier (Short: \p{In=3.2}) (Perl
  3130. extension) (234_803)
  3131. T \p{Present_In: 4.0} Code point's usage introduced in version
  3132. 4.0 or earlier (Short: \p{In=4.0}) (Perl
  3133. extension) (236_029)
  3134. T \p{Present_In: 4.1} Code point's usage introduced in version
  3135. 4.1 or earlier (Short: \p{In=4.1}) (Perl
  3136. extension) (237_302)
  3137. T \p{Present_In: 5.0} Code point's usage introduced in version
  3138. 5.0 or earlier (Short: \p{In=5.0}) (Perl
  3139. extension) (238_671)
  3140. T \p{Present_In: 5.1} Code point's usage introduced in version
  3141. 5.1 or earlier (Short: \p{In=5.1}) (Perl
  3142. extension) (240_295)
  3143. T \p{Present_In: 5.2} Code point's usage introduced in version
  3144. 5.2 or earlier (Short: \p{In=5.2}) (Perl
  3145. extension) (246_943)
  3146. T \p{Present_In: 6.0} Code point's usage introduced in version
  3147. 6.0 or earlier (Short: \p{In=6.0}) (Perl
  3148. extension) (249_031)
  3149. T \p{Present_In: 6.1} Code point's usage introduced in version
  3150. 6.1 or earlier (Short: \p{In=6.1}) (Perl
  3151. extension) (249_763)
  3152. T \p{Present_In: 6.2} Code point's usage introduced in version
  3153. 6.2 or earlier (Short: \p{In=6.2}) (Perl
  3154. extension) (249_764)
  3155. T \p{Present_In: 6.3} Code point's usage introduced in version
  3156. 6.3 or earlier (Short: \p{In=6.3}) (Perl
  3157. extension) (249_769)
  3158. T \p{Present_In: 7.0} Code point's usage introduced in version
  3159. 7.0 or earlier (Short: \p{In=7.0}) (Perl
  3160. extension) (252_603)
  3161. T \p{Present_In: 8.0} Code point's usage introduced in version
  3162. 8.0 or earlier (Short: \p{In=8.0}) (Perl
  3163. extension) (260_319)
  3164. T \p{Present_In: 9.0} Code point's usage introduced in version
  3165. 9.0 or earlier (Short: \p{In=9.0}) (Perl
  3166. extension) (267_819)
  3167. \p{Present_In: Unassigned} \p{Age=Unassigned} (Short: \p{In=
  3168. Unassigned}) (Perl extension) (846_293
  3169. plus all above-Unicode code points)
  3170. \p{Print} \p{XPosixPrint} (265_638)
  3171. \p{Private_Use} \p{General_Category=Private_Use} (Short:
  3172. \p{Co}; NOT \p{Private_Use_Area})
  3173. (137_468)
  3174. X \p{Private_Use_Area} \p{Block=Private_Use_Area} (Short:
  3175. \p{InPUA}) (6400)
  3176. \p{Prti} \p{Inscriptional_Parthian} (=
  3177. \p{Script_Extensions=
  3178. Inscriptional_Parthian}) (NOT \p{Block=
  3179. Inscriptional_Parthian}) (30)
  3180. \p{Ps} \p{Open_Punctuation} (=
  3181. \p{General_Category=Open_Punctuation})
  3182. (75)
  3183. \p{Psalter_Pahlavi} \p{Script_Extensions=Psalter_Pahlavi}
  3184. (Short: \p{Phlp}; NOT \p{Block=
  3185. Psalter_Pahlavi}) (30)
  3186. X \p{PUA} \p{Private_Use_Area} (= \p{Block=
  3187. Private_Use_Area}) (6400)
  3188. \p{Punct} \p{General_Category=Punctuation} (Short:
  3189. \p{P}; NOT \p{General_Punctuation}) (748)
  3190. \p{Punctuation} \p{Punct} (= \p{General_Category=
  3191. Punctuation}) (NOT
  3192. \p{General_Punctuation}) (748)
  3193. \p{Qaac} \p{Coptic} (= \p{Script_Extensions=
  3194. Coptic}) (NOT \p{Block=Coptic}) (165)
  3195. \p{Qaai} \p{Inherited} (= \p{Script_Extensions=
  3196. Inherited}) (496)
  3197. \p{QMark} \p{Quotation_Mark} (= \p{Quotation_Mark=
  3198. Y}) (30)
  3199. \p{QMark: *} \p{Quotation_Mark: *}
  3200. \p{Quotation_Mark} \p{Quotation_Mark=Y} (Short: \p{QMark})
  3201. (30)
  3202. \p{Quotation_Mark: N*} (Short: \p{QMark=N}, \P{QMark}) (1_114_082
  3203. plus all above-Unicode code points)
  3204. \p{Quotation_Mark: Y*} (Short: \p{QMark=Y}, \p{QMark}) (30)
  3205. \p{Radical} \p{Radical=Y} (329)
  3206. \p{Radical: N*} (Single: \P{Radical}) (1_113_783 plus all
  3207. above-Unicode code points)
  3208. \p{Radical: Y*} (Single: \p{Radical}) (329)
  3209. \p{Rejang} \p{Script_Extensions=Rejang} (Short:
  3210. \p{Rjng}; NOT \p{Block=Rejang}) (37)
  3211. \p{Rjng} \p{Rejang} (= \p{Script_Extensions=
  3212. Rejang}) (NOT \p{Block=Rejang}) (37)
  3213. X \p{Rumi} \p{Rumi_Numeral_Symbols} (= \p{Block=
  3214. Rumi_Numeral_Symbols}) (32)
  3215. X \p{Rumi_Numeral_Symbols} \p{Block=Rumi_Numeral_Symbols} (Short:
  3216. \p{InRumi}) (32)
  3217. \p{Runic} \p{Script_Extensions=Runic} (Short:
  3218. \p{Runr}; NOT \p{Block=Runic}) (86)
  3219. \p{Runr} \p{Runic} (= \p{Script_Extensions=Runic})
  3220. (NOT \p{Block=Runic}) (86)
  3221. \p{S} \pS \p{Symbol} (= \p{General_Category=Symbol})
  3222. (6899)
  3223. \p{Samaritan} \p{Script_Extensions=Samaritan} (Short:
  3224. \p{Samr}; NOT \p{Block=Samaritan}) (61)
  3225. \p{Samr} \p{Samaritan} (= \p{Script_Extensions=
  3226. Samaritan}) (NOT \p{Block=Samaritan})
  3227. (61)
  3228. \p{Sarb} \p{Old_South_Arabian} (=
  3229. \p{Script_Extensions=Old_South_Arabian})
  3230. (32)
  3231. \p{Saur} \p{Saurashtra} (= \p{Script_Extensions=
  3232. Saurashtra}) (NOT \p{Block=Saurashtra})
  3233. (82)
  3234. \p{Saurashtra} \p{Script_Extensions=Saurashtra} (Short:
  3235. \p{Saur}; NOT \p{Block=Saurashtra}) (82)
  3236. \p{SB: *} \p{Sentence_Break: *}
  3237. \p{Sc} \p{Currency_Symbol} (=
  3238. \p{General_Category=Currency_Symbol})
  3239. (53)
  3240. \p{Sc: *} \p{Script: *}
  3241. \p{Script: Adlam} (Short: \p{Sc=Adlm}) (87)
  3242. \p{Script: Adlm} \p{Script=Adlam} (87)
  3243. \p{Script: Aghb} \p{Script=Caucasian_Albanian} (53)
  3244. \p{Script: Ahom} (Short: \p{Sc=Ahom}) (57)
  3245. \p{Script: Anatolian_Hieroglyphs} (Short: \p{Sc=Hluw}) (583)
  3246. \p{Script: Arab} \p{Script=Arabic} (1279)
  3247. \p{Script: Arabic} (Short: \p{Sc=Arab}) (1279)
  3248. \p{Script: Armenian} (Short: \p{Sc=Armn}) (93)
  3249. \p{Script: Armi} \p{Script=Imperial_Aramaic} (31)
  3250. \p{Script: Armn} \p{Script=Armenian} (93)
  3251. \p{Script: Avestan} (Short: \p{Sc=Avst}) (61)
  3252. \p{Script: Avst} \p{Script=Avestan} (61)
  3253. \p{Script: Bali} \p{Script=Balinese} (121)
  3254. \p{Script: Balinese} (Short: \p{Sc=Bali}) (121)
  3255. \p{Script: Bamu} \p{Script=Bamum} (657)
  3256. \p{Script: Bamum} (Short: \p{Sc=Bamu}) (657)
  3257. \p{Script: Bass} \p{Script=Bassa_Vah} (36)
  3258. \p{Script: Bassa_Vah} (Short: \p{Sc=Bass}) (36)
  3259. \p{Script: Batak} (Short: \p{Sc=Batk}) (56)
  3260. \p{Script: Batk} \p{Script=Batak} (56)
  3261. \p{Script: Beng} \p{Script=Bengali} (93)
  3262. \p{Script: Bengali} (Short: \p{Sc=Beng}) (93)
  3263. \p{Script: Bhaiksuki} (Short: \p{Sc=Bhks}) (97)
  3264. \p{Script: Bhks} \p{Script=Bhaiksuki} (97)
  3265. \p{Script: Bopo} \p{Script=Bopomofo} (70)
  3266. \p{Script: Bopomofo} (Short: \p{Sc=Bopo}) (70)
  3267. \p{Script: Brah} \p{Script=Brahmi} (109)
  3268. \p{Script: Brahmi} (Short: \p{Sc=Brah}) (109)
  3269. \p{Script: Brai} \p{Script=Braille} (256)
  3270. \p{Script: Braille} (Short: \p{Sc=Brai}) (256)
  3271. \p{Script: Bugi} \p{Script=Buginese} (30)
  3272. \p{Script: Buginese} (Short: \p{Sc=Bugi}) (30)
  3273. \p{Script: Buhd} \p{Script=Buhid} (20)
  3274. \p{Script: Buhid} (Short: \p{Sc=Buhd}) (20)
  3275. \p{Script: Cakm} \p{Script=Chakma} (67)
  3276. \p{Script: Canadian_Aboriginal} (Short: \p{Sc=Cans}) (710)
  3277. \p{Script: Cans} \p{Script=Canadian_Aboriginal} (710)
  3278. \p{Script: Cari} \p{Script=Carian} (49)
  3279. \p{Script: Carian} (Short: \p{Sc=Cari}) (49)
  3280. \p{Script: Caucasian_Albanian} (Short: \p{Sc=Aghb}) (53)
  3281. \p{Script: Chakma} (Short: \p{Sc=Cakm}) (67)
  3282. \p{Script: Cham} (Short: \p{Sc=Cham}) (83)
  3283. \p{Script: Cher} \p{Script=Cherokee} (172)
  3284. \p{Script: Cherokee} (Short: \p{Sc=Cher}) (172)
  3285. \p{Script: Common} (Short: \p{Sc=Zyyy}) (7279)
  3286. \p{Script: Copt} \p{Script=Coptic} (137)
  3287. \p{Script: Coptic} (Short: \p{Sc=Copt}) (137)
  3288. \p{Script: Cprt} \p{Script=Cypriot} (55)
  3289. \p{Script: Cuneiform} (Short: \p{Sc=Xsux}) (1234)
  3290. \p{Script: Cypriot} (Short: \p{Sc=Cprt}) (55)
  3291. \p{Script: Cyrillic} (Short: \p{Sc=Cyrl}) (443)
  3292. \p{Script: Cyrl} \p{Script=Cyrillic} (443)
  3293. \p{Script: Deseret} (Short: \p{Sc=Dsrt}) (80)
  3294. \p{Script: Deva} \p{Script=Devanagari} (154)
  3295. \p{Script: Devanagari} (Short: \p{Sc=Deva}) (154)
  3296. \p{Script: Dsrt} \p{Script=Deseret} (80)
  3297. \p{Script: Dupl} \p{Script=Duployan} (143)
  3298. \p{Script: Duployan} (Short: \p{Sc=Dupl}) (143)
  3299. \p{Script: Egyp} \p{Script=Egyptian_Hieroglyphs} (1071)
  3300. \p{Script: Egyptian_Hieroglyphs} (Short: \p{Sc=Egyp}) (1071)
  3301. \p{Script: Elba} \p{Script=Elbasan} (40)
  3302. \p{Script: Elbasan} (Short: \p{Sc=Elba}) (40)
  3303. \p{Script: Ethi} \p{Script=Ethiopic} (495)
  3304. \p{Script: Ethiopic} (Short: \p{Sc=Ethi}) (495)
  3305. \p{Script: Geor} \p{Script=Georgian} (127)
  3306. \p{Script: Georgian} (Short: \p{Sc=Geor}) (127)
  3307. \p{Script: Glag} \p{Script=Glagolitic} (132)
  3308. \p{Script: Glagolitic} (Short: \p{Sc=Glag}) (132)
  3309. \p{Script: Goth} \p{Script=Gothic} (27)
  3310. \p{Script: Gothic} (Short: \p{Sc=Goth}) (27)
  3311. \p{Script: Gran} \p{Script=Grantha} (85)
  3312. \p{Script: Grantha} (Short: \p{Sc=Gran}) (85)
  3313. \p{Script: Greek} (Short: \p{Sc=Grek}) (518)
  3314. \p{Script: Grek} \p{Script=Greek} (518)
  3315. \p{Script: Gujarati} (Short: \p{Sc=Gujr}) (85)
  3316. \p{Script: Gujr} \p{Script=Gujarati} (85)
  3317. \p{Script: Gurmukhi} (Short: \p{Sc=Guru}) (79)
  3318. \p{Script: Guru} \p{Script=Gurmukhi} (79)
  3319. \p{Script: Han} (Short: \p{Sc=Han}) (81_734)
  3320. \p{Script: Hang} \p{Script=Hangul} (11_739)
  3321. \p{Script: Hangul} (Short: \p{Sc=Hang}) (11_739)
  3322. \p{Script: Hani} \p{Script=Han} (81_734)
  3323. \p{Script: Hano} \p{Script=Hanunoo} (21)
  3324. \p{Script: Hanunoo} (Short: \p{Sc=Hano}) (21)
  3325. \p{Script: Hatr} \p{Script=Hatran} (26)
  3326. \p{Script: Hatran} (Short: \p{Sc=Hatr}) (26)
  3327. \p{Script: Hebr} \p{Script=Hebrew} (133)
  3328. \p{Script: Hebrew} (Short: \p{Sc=Hebr}) (133)
  3329. \p{Script: Hira} \p{Script=Hiragana} (91)
  3330. \p{Script: Hiragana} (Short: \p{Sc=Hira}) (91)
  3331. \p{Script: Hluw} \p{Script=Anatolian_Hieroglyphs} (583)
  3332. \p{Script: Hmng} \p{Script=Pahawh_Hmong} (127)
  3333. \p{Script: Hung} \p{Script=Old_Hungarian} (108)
  3334. \p{Script: Imperial_Aramaic} (Short: \p{Sc=Armi}) (31)
  3335. \p{Script: Inherited} (Short: \p{Sc=Zinh}) (564)
  3336. \p{Script: Inscriptional_Pahlavi} (Short: \p{Sc=Phli}) (27)
  3337. \p{Script: Inscriptional_Parthian} (Short: \p{Sc=Prti}) (30)
  3338. \p{Script: Ital} \p{Script=Old_Italic} (36)
  3339. \p{Script: Java} \p{Script=Javanese} (90)
  3340. \p{Script: Javanese} (Short: \p{Sc=Java}) (90)
  3341. \p{Script: Kaithi} (Short: \p{Sc=Kthi}) (66)
  3342. \p{Script: Kali} \p{Script=Kayah_Li} (47)
  3343. \p{Script: Kana} \p{Script=Katakana} (300)
  3344. \p{Script: Kannada} (Short: \p{Sc=Knda}) (88)
  3345. \p{Script: Katakana} (Short: \p{Sc=Kana}) (300)
  3346. \p{Script: Kayah_Li} (Short: \p{Sc=Kali}) (47)
  3347. \p{Script: Khar} \p{Script=Kharoshthi} (65)
  3348. \p{Script: Kharoshthi} (Short: \p{Sc=Khar}) (65)
  3349. \p{Script: Khmer} (Short: \p{Sc=Khmr}) (146)
  3350. \p{Script: Khmr} \p{Script=Khmer} (146)
  3351. \p{Script: Khoj} \p{Script=Khojki} (62)
  3352. \p{Script: Khojki} (Short: \p{Sc=Khoj}) (62)
  3353. \p{Script: Khudawadi} (Short: \p{Sc=Sind}) (69)
  3354. \p{Script: Knda} \p{Script=Kannada} (88)
  3355. \p{Script: Kthi} \p{Script=Kaithi} (66)
  3356. \p{Script: Lana} \p{Script=Tai_Tham} (127)
  3357. \p{Script: Lao} (Short: \p{Sc=Lao}) (67)
  3358. \p{Script: Laoo} \p{Script=Lao} (67)
  3359. \p{Script: Latin} (Short: \p{Sc=Latn}) (1350)
  3360. \p{Script: Latn} \p{Script=Latin} (1350)
  3361. \p{Script: Lepc} \p{Script=Lepcha} (74)
  3362. \p{Script: Lepcha} (Short: \p{Sc=Lepc}) (74)
  3363. \p{Script: Limb} \p{Script=Limbu} (68)
  3364. \p{Script: Limbu} (Short: \p{Sc=Limb}) (68)
  3365. \p{Script: Lina} \p{Script=Linear_A} (341)
  3366. \p{Script: Linb} \p{Script=Linear_B} (211)
  3367. \p{Script: Linear_A} (Short: \p{Sc=Lina}) (341)
  3368. \p{Script: Linear_B} (Short: \p{Sc=Linb}) (211)
  3369. \p{Script: Lisu} (Short: \p{Sc=Lisu}) (48)
  3370. \p{Script: Lyci} \p{Script=Lycian} (29)
  3371. \p{Script: Lycian} (Short: \p{Sc=Lyci}) (29)
  3372. \p{Script: Lydi} \p{Script=Lydian} (27)
  3373. \p{Script: Lydian} (Short: \p{Sc=Lydi}) (27)
  3374. \p{Script: Mahajani} (Short: \p{Sc=Mahj}) (39)
  3375. \p{Script: Mahj} \p{Script=Mahajani} (39)
  3376. \p{Script: Malayalam} (Short: \p{Sc=Mlym}) (114)
  3377. \p{Script: Mand} \p{Script=Mandaic} (29)
  3378. \p{Script: Mandaic} (Short: \p{Sc=Mand}) (29)
  3379. \p{Script: Mani} \p{Script=Manichaean} (51)
  3380. \p{Script: Manichaean} (Short: \p{Sc=Mani}) (51)
  3381. \p{Script: Marc} \p{Script=Marchen} (68)
  3382. \p{Script: Marchen} (Short: \p{Sc=Marc}) (68)
  3383. \p{Script: Meetei_Mayek} (Short: \p{Sc=Mtei}) (79)
  3384. \p{Script: Mend} \p{Script=Mende_Kikakui} (213)
  3385. \p{Script: Mende_Kikakui} (Short: \p{Sc=Mend}) (213)
  3386. \p{Script: Merc} \p{Script=Meroitic_Cursive} (90)
  3387. \p{Script: Mero} \p{Script=Meroitic_Hieroglyphs} (32)
  3388. \p{Script: Meroitic_Cursive} (Short: \p{Sc=Merc}) (90)
  3389. \p{Script: Meroitic_Hieroglyphs} (Short: \p{Sc=Mero}) (32)
  3390. \p{Script: Miao} (Short: \p{Sc=Miao}) (133)
  3391. \p{Script: Mlym} \p{Script=Malayalam} (114)
  3392. \p{Script: Modi} (Short: \p{Sc=Modi}) (79)
  3393. \p{Script: Mong} \p{Script=Mongolian} (166)
  3394. \p{Script: Mongolian} (Short: \p{Sc=Mong}) (166)
  3395. \p{Script: Mro} (Short: \p{Sc=Mro}) (43)
  3396. \p{Script: Mroo} \p{Script=Mro} (43)
  3397. \p{Script: Mtei} \p{Script=Meetei_Mayek} (79)
  3398. \p{Script: Mult} \p{Script=Multani} (38)
  3399. \p{Script: Multani} (Short: \p{Sc=Mult}) (38)
  3400. \p{Script: Myanmar} (Short: \p{Sc=Mymr}) (223)
  3401. \p{Script: Mymr} \p{Script=Myanmar} (223)
  3402. \p{Script: Nabataean} (Short: \p{Sc=Nbat}) (40)
  3403. \p{Script: Narb} \p{Script=Old_North_Arabian} (32)
  3404. \p{Script: Nbat} \p{Script=Nabataean} (40)
  3405. \p{Script: New_Tai_Lue} (Short: \p{Sc=Talu}) (83)
  3406. \p{Script: Newa} (Short: \p{Sc=Newa}) (92)
  3407. \p{Script: Nko} (Short: \p{Sc=Nko}) (59)
  3408. \p{Script: Nkoo} \p{Script=Nko} (59)
  3409. \p{Script: Ogam} \p{Script=Ogham} (29)
  3410. \p{Script: Ogham} (Short: \p{Sc=Ogam}) (29)
  3411. \p{Script: Ol_Chiki} (Short: \p{Sc=Olck}) (48)
  3412. \p{Script: Olck} \p{Script=Ol_Chiki} (48)
  3413. \p{Script: Old_Hungarian} (Short: \p{Sc=Hung}) (108)
  3414. \p{Script: Old_Italic} (Short: \p{Sc=Ital}) (36)
  3415. \p{Script: Old_North_Arabian} (Short: \p{Sc=Narb}) (32)
  3416. \p{Script: Old_Permic} (Short: \p{Sc=Perm}) (43)
  3417. \p{Script: Old_Persian} (Short: \p{Sc=Xpeo}) (50)
  3418. \p{Script: Old_South_Arabian} (Short: \p{Sc=Sarb}) (32)
  3419. \p{Script: Old_Turkic} (Short: \p{Sc=Orkh}) (73)
  3420. \p{Script: Oriya} (Short: \p{Sc=Orya}) (90)
  3421. \p{Script: Orkh} \p{Script=Old_Turkic} (73)
  3422. \p{Script: Orya} \p{Script=Oriya} (90)
  3423. \p{Script: Osage} (Short: \p{Sc=Osge}) (72)
  3424. \p{Script: Osge} \p{Script=Osage} (72)
  3425. \p{Script: Osma} \p{Script=Osmanya} (40)
  3426. \p{Script: Osmanya} (Short: \p{Sc=Osma}) (40)
  3427. \p{Script: Pahawh_Hmong} (Short: \p{Sc=Hmng}) (127)
  3428. \p{Script: Palm} \p{Script=Palmyrene} (32)
  3429. \p{Script: Palmyrene} (Short: \p{Sc=Palm}) (32)
  3430. \p{Script: Pau_Cin_Hau} (Short: \p{Sc=Pauc}) (57)
  3431. \p{Script: Pauc} \p{Script=Pau_Cin_Hau} (57)
  3432. \p{Script: Perm} \p{Script=Old_Permic} (43)
  3433. \p{Script: Phag} \p{Script=Phags_Pa} (56)
  3434. \p{Script: Phags_Pa} (Short: \p{Sc=Phag}) (56)
  3435. \p{Script: Phli} \p{Script=Inscriptional_Pahlavi} (27)
  3436. \p{Script: Phlp} \p{Script=Psalter_Pahlavi} (29)
  3437. \p{Script: Phnx} \p{Script=Phoenician} (29)
  3438. \p{Script: Phoenician} (Short: \p{Sc=Phnx}) (29)
  3439. \p{Script: Plrd} \p{Script=Miao} (133)
  3440. \p{Script: Prti} \p{Script=Inscriptional_Parthian} (30)
  3441. \p{Script: Psalter_Pahlavi} (Short: \p{Sc=Phlp}) (29)
  3442. \p{Script: Qaac} \p{Script=Coptic} (137)
  3443. \p{Script: Qaai} \p{Script=Inherited} (564)
  3444. \p{Script: Rejang} (Short: \p{Sc=Rjng}) (37)
  3445. \p{Script: Rjng} \p{Script=Rejang} (37)
  3446. \p{Script: Runic} (Short: \p{Sc=Runr}) (86)
  3447. \p{Script: Runr} \p{Script=Runic} (86)
  3448. \p{Script: Samaritan} (Short: \p{Sc=Samr}) (61)
  3449. \p{Script: Samr} \p{Script=Samaritan} (61)
  3450. \p{Script: Sarb} \p{Script=Old_South_Arabian} (32)
  3451. \p{Script: Saur} \p{Script=Saurashtra} (82)
  3452. \p{Script: Saurashtra} (Short: \p{Sc=Saur}) (82)
  3453. \p{Script: Sgnw} \p{Script=SignWriting} (672)
  3454. \p{Script: Sharada} (Short: \p{Sc=Shrd}) (94)
  3455. \p{Script: Shavian} (Short: \p{Sc=Shaw}) (48)
  3456. \p{Script: Shaw} \p{Script=Shavian} (48)
  3457. \p{Script: Shrd} \p{Script=Sharada} (94)
  3458. \p{Script: Sidd} \p{Script=Siddham} (92)
  3459. \p{Script: Siddham} (Short: \p{Sc=Sidd}) (92)
  3460. \p{Script: SignWriting} (Short: \p{Sc=Sgnw}) (672)
  3461. \p{Script: Sind} \p{Script=Khudawadi} (69)
  3462. \p{Script: Sinh} \p{Script=Sinhala} (110)
  3463. \p{Script: Sinhala} (Short: \p{Sc=Sinh}) (110)
  3464. \p{Script: Sora} \p{Script=Sora_Sompeng} (35)
  3465. \p{Script: Sora_Sompeng} (Short: \p{Sc=Sora}) (35)
  3466. \p{Script: Sund} \p{Script=Sundanese} (72)
  3467. \p{Script: Sundanese} (Short: \p{Sc=Sund}) (72)
  3468. \p{Script: Sylo} \p{Script=Syloti_Nagri} (44)
  3469. \p{Script: Syloti_Nagri} (Short: \p{Sc=Sylo}) (44)
  3470. \p{Script: Syrc} \p{Script=Syriac} (77)
  3471. \p{Script: Syriac} (Short: \p{Sc=Syrc}) (77)
  3472. \p{Script: Tagalog} (Short: \p{Sc=Tglg}) (20)
  3473. \p{Script: Tagb} \p{Script=Tagbanwa} (18)
  3474. \p{Script: Tagbanwa} (Short: \p{Sc=Tagb}) (18)
  3475. \p{Script: Tai_Le} (Short: \p{Sc=Tale}) (35)
  3476. \p{Script: Tai_Tham} (Short: \p{Sc=Lana}) (127)
  3477. \p{Script: Tai_Viet} (Short: \p{Sc=Tavt}) (72)
  3478. \p{Script: Takr} \p{Script=Takri} (66)
  3479. \p{Script: Takri} (Short: \p{Sc=Takr}) (66)
  3480. \p{Script: Tale} \p{Script=Tai_Le} (35)
  3481. \p{Script: Talu} \p{Script=New_Tai_Lue} (83)
  3482. \p{Script: Tamil} (Short: \p{Sc=Taml}) (72)
  3483. \p{Script: Taml} \p{Script=Tamil} (72)
  3484. \p{Script: Tang} \p{Script=Tangut} (6881)
  3485. \p{Script: Tangut} (Short: \p{Sc=Tang}) (6881)
  3486. \p{Script: Tavt} \p{Script=Tai_Viet} (72)
  3487. \p{Script: Telu} \p{Script=Telugu} (96)
  3488. \p{Script: Telugu} (Short: \p{Sc=Telu}) (96)
  3489. \p{Script: Tfng} \p{Script=Tifinagh} (59)
  3490. \p{Script: Tglg} \p{Script=Tagalog} (20)
  3491. \p{Script: Thaa} \p{Script=Thaana} (50)
  3492. \p{Script: Thaana} (Short: \p{Sc=Thaa}) (50)
  3493. \p{Script: Thai} (Short: \p{Sc=Thai}) (86)
  3494. \p{Script: Tibetan} (Short: \p{Sc=Tibt}) (207)
  3495. \p{Script: Tibt} \p{Script=Tibetan} (207)
  3496. \p{Script: Tifinagh} (Short: \p{Sc=Tfng}) (59)
  3497. \p{Script: Tirh} \p{Script=Tirhuta} (82)
  3498. \p{Script: Tirhuta} (Short: \p{Sc=Tirh}) (82)
  3499. \p{Script: Ugar} \p{Script=Ugaritic} (31)
  3500. \p{Script: Ugaritic} (Short: \p{Sc=Ugar}) (31)
  3501. \p{Script: Unknown} (Short: \p{Sc=Zzzz}) (985_875 plus all
  3502. above-Unicode code points)
  3503. \p{Script: Vai} (Short: \p{Sc=Vai}) (300)
  3504. \p{Script: Vaii} \p{Script=Vai} (300)
  3505. \p{Script: Wara} \p{Script=Warang_Citi} (84)
  3506. \p{Script: Warang_Citi} (Short: \p{Sc=Wara}) (84)
  3507. \p{Script: Xpeo} \p{Script=Old_Persian} (50)
  3508. \p{Script: Xsux} \p{Script=Cuneiform} (1234)
  3509. \p{Script: Yi} (Short: \p{Sc=Yi}) (1220)
  3510. \p{Script: Yiii} \p{Script=Yi} (1220)
  3511. \p{Script: Zinh} \p{Script=Inherited} (564)
  3512. \p{Script: Zyyy} \p{Script=Common} (7279)
  3513. \p{Script: Zzzz} \p{Script=Unknown} (985_875 plus all
  3514. above-Unicode code points)
  3515. \p{Script_Extensions: Adlam} (Short: \p{Scx=Adlm}, \p{Adlm}) (88)
  3516. \p{Script_Extensions: Adlm} \p{Script_Extensions=Adlam} (88)
  3517. \p{Script_Extensions: Aghb} \p{Script_Extensions=
  3518. Caucasian_Albanian} (53)
  3519. \p{Script_Extensions: Ahom} (Short: \p{Scx=Ahom}, \p{Ahom}) (57)
  3520. \p{Script_Extensions: Anatolian_Hieroglyphs} (Short: \p{Scx=Hluw},
  3521. \p{Hluw}) (583)
  3522. \p{Script_Extensions: Arab} \p{Script_Extensions=Arabic} (1323)
  3523. \p{Script_Extensions: Arabic} (Short: \p{Scx=Arab}, \p{Arab})
  3524. (1323)
  3525. \p{Script_Extensions: Armenian} (Short: \p{Scx=Armn}, \p{Armn})
  3526. (94)
  3527. \p{Script_Extensions: Armi} \p{Script_Extensions=Imperial_Aramaic}
  3528. (31)
  3529. \p{Script_Extensions: Armn} \p{Script_Extensions=Armenian} (94)
  3530. \p{Script_Extensions: Avestan} (Short: \p{Scx=Avst}, \p{Avst}) (61)
  3531. \p{Script_Extensions: Avst} \p{Script_Extensions=Avestan} (61)
  3532. \p{Script_Extensions: Bali} \p{Script_Extensions=Balinese} (121)
  3533. \p{Script_Extensions: Balinese} (Short: \p{Scx=Bali}, \p{Bali})
  3534. (121)
  3535. \p{Script_Extensions: Bamu} \p{Script_Extensions=Bamum} (657)
  3536. \p{Script_Extensions: Bamum} (Short: \p{Scx=Bamu}, \p{Bamu}) (657)
  3537. \p{Script_Extensions: Bass} \p{Script_Extensions=Bassa_Vah} (36)
  3538. \p{Script_Extensions: Bassa_Vah} (Short: \p{Scx=Bass}, \p{Bass})
  3539. (36)
  3540. \p{Script_Extensions: Batak} (Short: \p{Scx=Batk}, \p{Batk}) (56)
  3541. \p{Script_Extensions: Batk} \p{Script_Extensions=Batak} (56)
  3542. \p{Script_Extensions: Beng} \p{Script_Extensions=Bengali} (98)
  3543. \p{Script_Extensions: Bengali} (Short: \p{Scx=Beng}, \p{Beng}) (98)
  3544. \p{Script_Extensions: Bhaiksuki} (Short: \p{Scx=Bhks}, \p{Bhks})
  3545. (97)
  3546. \p{Script_Extensions: Bhks} \p{Script_Extensions=Bhaiksuki} (97)
  3547. \p{Script_Extensions: Bopo} \p{Script_Extensions=Bopomofo} (110)
  3548. \p{Script_Extensions: Bopomofo} (Short: \p{Scx=Bopo}, \p{Bopo})
  3549. (110)
  3550. \p{Script_Extensions: Brah} \p{Script_Extensions=Brahmi} (109)
  3551. \p{Script_Extensions: Brahmi} (Short: \p{Scx=Brah}, \p{Brah}) (109)
  3552. \p{Script_Extensions: Brai} \p{Script_Extensions=Braille} (256)
  3553. \p{Script_Extensions: Braille} (Short: \p{Scx=Brai}, \p{Brai})
  3554. (256)
  3555. \p{Script_Extensions: Bugi} \p{Script_Extensions=Buginese} (31)
  3556. \p{Script_Extensions: Buginese} (Short: \p{Scx=Bugi}, \p{Bugi})
  3557. (31)
  3558. \p{Script_Extensions: Buhd} \p{Script_Extensions=Buhid} (22)
  3559. \p{Script_Extensions: Buhid} (Short: \p{Scx=Buhd}, \p{Buhd}) (22)
  3560. \p{Script_Extensions: Cakm} \p{Script_Extensions=Chakma} (87)
  3561. \p{Script_Extensions: Canadian_Aboriginal} (Short: \p{Scx=Cans},
  3562. \p{Cans}) (710)
  3563. \p{Script_Extensions: Cans} \p{Script_Extensions=
  3564. Canadian_Aboriginal} (710)
  3565. \p{Script_Extensions: Cari} \p{Script_Extensions=Carian} (49)
  3566. \p{Script_Extensions: Carian} (Short: \p{Scx=Cari}, \p{Cari}) (49)
  3567. \p{Script_Extensions: Caucasian_Albanian} (Short: \p{Scx=Aghb},
  3568. \p{Aghb}) (53)
  3569. \p{Script_Extensions: Chakma} (Short: \p{Scx=Cakm}, \p{Cakm}) (87)
  3570. \p{Script_Extensions: Cham} (Short: \p{Scx=Cham}, \p{Cham}) (83)
  3571. \p{Script_Extensions: Cher} \p{Script_Extensions=Cherokee} (172)
  3572. \p{Script_Extensions: Cherokee} (Short: \p{Scx=Cher}, \p{Cher})
  3573. (172)
  3574. \p{Script_Extensions: Common} (Short: \p{Scx=Zyyy}, \p{Zyyy})
  3575. (6864)
  3576. \p{Script_Extensions: Copt} \p{Script_Extensions=Coptic} (165)
  3577. \p{Script_Extensions: Coptic} (Short: \p{Scx=Copt}, \p{Copt}) (165)
  3578. \p{Script_Extensions: Cprt} \p{Script_Extensions=Cypriot} (112)
  3579. \p{Script_Extensions: Cuneiform} (Short: \p{Scx=Xsux}, \p{Xsux})
  3580. (1234)
  3581. \p{Script_Extensions: Cypriot} (Short: \p{Scx=Cprt}, \p{Cprt})
  3582. (112)
  3583. \p{Script_Extensions: Cyrillic} (Short: \p{Scx=Cyrl}, \p{Cyrl})
  3584. (446)
  3585. \p{Script_Extensions: Cyrl} \p{Script_Extensions=Cyrillic} (446)
  3586. \p{Script_Extensions: Deseret} (Short: \p{Scx=Dsrt}, \p{Dsrt}) (80)
  3587. \p{Script_Extensions: Deva} \p{Script_Extensions=Devanagari} (210)
  3588. \p{Script_Extensions: Devanagari} (Short: \p{Scx=Deva}, \p{Deva})
  3589. (210)
  3590. \p{Script_Extensions: Dsrt} \p{Script_Extensions=Deseret} (80)
  3591. \p{Script_Extensions: Dupl} \p{Script_Extensions=Duployan} (147)
  3592. \p{Script_Extensions: Duployan} (Short: \p{Scx=Dupl}, \p{Dupl})
  3593. (147)
  3594. \p{Script_Extensions: Egyp} \p{Script_Extensions=
  3595. Egyptian_Hieroglyphs} (1071)
  3596. \p{Script_Extensions: Egyptian_Hieroglyphs} (Short: \p{Scx=Egyp},
  3597. \p{Egyp}) (1071)
  3598. \p{Script_Extensions: Elba} \p{Script_Extensions=Elbasan} (40)
  3599. \p{Script_Extensions: Elbasan} (Short: \p{Scx=Elba}, \p{Elba}) (40)
  3600. \p{Script_Extensions: Ethi} \p{Script_Extensions=Ethiopic} (495)
  3601. \p{Script_Extensions: Ethiopic} (Short: \p{Scx=Ethi}, \p{Ethi})
  3602. (495)
  3603. \p{Script_Extensions: Geor} \p{Script_Extensions=Georgian} (129)
  3604. \p{Script_Extensions: Georgian} (Short: \p{Scx=Geor}, \p{Geor})
  3605. (129)
  3606. \p{Script_Extensions: Glag} \p{Script_Extensions=Glagolitic} (136)
  3607. \p{Script_Extensions: Glagolitic} (Short: \p{Scx=Glag}, \p{Glag})
  3608. (136)
  3609. \p{Script_Extensions: Goth} \p{Script_Extensions=Gothic} (27)
  3610. \p{Script_Extensions: Gothic} (Short: \p{Scx=Goth}, \p{Goth}) (27)
  3611. \p{Script_Extensions: Gran} \p{Script_Extensions=Grantha} (113)
  3612. \p{Script_Extensions: Grantha} (Short: \p{Scx=Gran}, \p{Gran})
  3613. (113)
  3614. \p{Script_Extensions: Greek} (Short: \p{Scx=Grek}, \p{Grek}) (522)
  3615. \p{Script_Extensions: Grek} \p{Script_Extensions=Greek} (522)
  3616. \p{Script_Extensions: Gujarati} (Short: \p{Scx=Gujr}, \p{Gujr})
  3617. (99)
  3618. \p{Script_Extensions: Gujr} \p{Script_Extensions=Gujarati} (99)
  3619. \p{Script_Extensions: Gurmukhi} (Short: \p{Scx=Guru}, \p{Guru})
  3620. (93)
  3621. \p{Script_Extensions: Guru} \p{Script_Extensions=Gurmukhi} (93)
  3622. \p{Script_Extensions: Han} (Short: \p{Scx=Han}, \p{Han}) (82_013)
  3623. \p{Script_Extensions: Hang} \p{Script_Extensions=Hangul} (11_775)
  3624. \p{Script_Extensions: Hangul} (Short: \p{Scx=Hang}, \p{Hang})
  3625. (11_775)
  3626. \p{Script_Extensions: Hani} \p{Script_Extensions=Han} (82_013)
  3627. \p{Script_Extensions: Hano} \p{Script_Extensions=Hanunoo} (23)
  3628. \p{Script_Extensions: Hanunoo} (Short: \p{Scx=Hano}, \p{Hano}) (23)
  3629. \p{Script_Extensions: Hatr} \p{Script_Extensions=Hatran} (26)
  3630. \p{Script_Extensions: Hatran} (Short: \p{Scx=Hatr}, \p{Hatr}) (26)
  3631. \p{Script_Extensions: Hebr} \p{Script_Extensions=Hebrew} (133)
  3632. \p{Script_Extensions: Hebrew} (Short: \p{Scx=Hebr}, \p{Hebr}) (133)
  3633. \p{Script_Extensions: Hira} \p{Script_Extensions=Hiragana} (143)
  3634. \p{Script_Extensions: Hiragana} (Short: \p{Scx=Hira}, \p{Hira})
  3635. (143)
  3636. \p{Script_Extensions: Hluw} \p{Script_Extensions=
  3637. Anatolian_Hieroglyphs} (583)
  3638. \p{Script_Extensions: Hmng} \p{Script_Extensions=Pahawh_Hmong}
  3639. (127)
  3640. \p{Script_Extensions: Hung} \p{Script_Extensions=Old_Hungarian}
  3641. (108)
  3642. \p{Script_Extensions: Imperial_Aramaic} (Short: \p{Scx=Armi},
  3643. \p{Armi}) (31)
  3644. \p{Script_Extensions: Inherited} (Short: \p{Scx=Zinh}, \p{Zinh})
  3645. (496)
  3646. \p{Script_Extensions: Inscriptional_Pahlavi} (Short: \p{Scx=Phli},
  3647. \p{Phli}) (27)
  3648. \p{Script_Extensions: Inscriptional_Parthian} (Short: \p{Scx=
  3649. Prti}, \p{Prti}) (30)
  3650. \p{Script_Extensions: Ital} \p{Script_Extensions=Old_Italic} (36)
  3651. \p{Script_Extensions: Java} \p{Script_Extensions=Javanese} (91)
  3652. \p{Script_Extensions: Javanese} (Short: \p{Scx=Java}, \p{Java})
  3653. (91)
  3654. \p{Script_Extensions: Kaithi} (Short: \p{Scx=Kthi}, \p{Kthi}) (86)
  3655. \p{Script_Extensions: Kali} \p{Script_Extensions=Kayah_Li} (48)
  3656. \p{Script_Extensions: Kana} \p{Script_Extensions=Katakana} (352)
  3657. \p{Script_Extensions: Kannada} (Short: \p{Scx=Knda}, \p{Knda})
  3658. (100)
  3659. \p{Script_Extensions: Katakana} (Short: \p{Scx=Kana}, \p{Kana})
  3660. (352)
  3661. \p{Script_Extensions: Kayah_Li} (Short: \p{Scx=Kali}, \p{Kali})
  3662. (48)
  3663. \p{Script_Extensions: Khar} \p{Script_Extensions=Kharoshthi} (65)
  3664. \p{Script_Extensions: Kharoshthi} (Short: \p{Scx=Khar}, \p{Khar})
  3665. (65)
  3666. \p{Script_Extensions: Khmer} (Short: \p{Scx=Khmr}, \p{Khmr}) (146)
  3667. \p{Script_Extensions: Khmr} \p{Script_Extensions=Khmer} (146)
  3668. \p{Script_Extensions: Khoj} \p{Script_Extensions=Khojki} (72)
  3669. \p{Script_Extensions: Khojki} (Short: \p{Scx=Khoj}, \p{Khoj}) (72)
  3670. \p{Script_Extensions: Khudawadi} (Short: \p{Scx=Sind}, \p{Sind})
  3671. (81)
  3672. \p{Script_Extensions: Knda} \p{Script_Extensions=Kannada} (100)
  3673. \p{Script_Extensions: Kthi} \p{Script_Extensions=Kaithi} (86)
  3674. \p{Script_Extensions: Lana} \p{Script_Extensions=Tai_Tham} (127)
  3675. \p{Script_Extensions: Lao} (Short: \p{Scx=Lao}, \p{Lao}) (67)
  3676. \p{Script_Extensions: Laoo} \p{Script_Extensions=Lao} (67)
  3677. \p{Script_Extensions: Latin} (Short: \p{Scx=Latn}, \p{Latn}) (1370)
  3678. \p{Script_Extensions: Latn} \p{Script_Extensions=Latin} (1370)
  3679. \p{Script_Extensions: Lepc} \p{Script_Extensions=Lepcha} (74)
  3680. \p{Script_Extensions: Lepcha} (Short: \p{Scx=Lepc}, \p{Lepc}) (74)
  3681. \p{Script_Extensions: Limb} \p{Script_Extensions=Limbu} (69)
  3682. \p{Script_Extensions: Limbu} (Short: \p{Scx=Limb}, \p{Limb}) (69)
  3683. \p{Script_Extensions: Lina} \p{Script_Extensions=Linear_A} (386)
  3684. \p{Script_Extensions: Linb} \p{Script_Extensions=Linear_B} (268)
  3685. \p{Script_Extensions: Linear_A} (Short: \p{Scx=Lina}, \p{Lina})
  3686. (386)
  3687. \p{Script_Extensions: Linear_B} (Short: \p{Scx=Linb}, \p{Linb})
  3688. (268)
  3689. \p{Script_Extensions: Lisu} (Short: \p{Scx=Lisu}, \p{Lisu}) (48)
  3690. \p{Script_Extensions: Lyci} \p{Script_Extensions=Lycian} (29)
  3691. \p{Script_Extensions: Lycian} (Short: \p{Scx=Lyci}, \p{Lyci}) (29)
  3692. \p{Script_Extensions: Lydi} \p{Script_Extensions=Lydian} (27)
  3693. \p{Script_Extensions: Lydian} (Short: \p{Scx=Lydi}, \p{Lydi}) (27)
  3694. \p{Script_Extensions: Mahajani} (Short: \p{Scx=Mahj}, \p{Mahj})
  3695. (61)
  3696. \p{Script_Extensions: Mahj} \p{Script_Extensions=Mahajani} (61)
  3697. \p{Script_Extensions: Malayalam} (Short: \p{Scx=Mlym}, \p{Mlym})
  3698. (119)
  3699. \p{Script_Extensions: Mand} \p{Script_Extensions=Mandaic} (30)
  3700. \p{Script_Extensions: Mandaic} (Short: \p{Scx=Mand}, \p{Mand}) (30)
  3701. \p{Script_Extensions: Mani} \p{Script_Extensions=Manichaean} (52)
  3702. \p{Script_Extensions: Manichaean} (Short: \p{Scx=Mani}, \p{Mani})
  3703. (52)
  3704. \p{Script_Extensions: Marc} \p{Script_Extensions=Marchen} (68)
  3705. \p{Script_Extensions: Marchen} (Short: \p{Scx=Marc}, \p{Marc}) (68)
  3706. \p{Script_Extensions: Meetei_Mayek} (Short: \p{Scx=Mtei},
  3707. \p{Mtei}) (79)
  3708. \p{Script_Extensions: Mend} \p{Script_Extensions=Mende_Kikakui}
  3709. (213)
  3710. \p{Script_Extensions: Mende_Kikakui} (Short: \p{Scx=Mend},
  3711. \p{Mend}) (213)
  3712. \p{Script_Extensions: Merc} \p{Script_Extensions=Meroitic_Cursive}
  3713. (90)
  3714. \p{Script_Extensions: Mero} \p{Script_Extensions=
  3715. Meroitic_Hieroglyphs} (32)
  3716. \p{Script_Extensions: Meroitic_Cursive} (Short: \p{Scx=Merc},
  3717. \p{Merc}) (90)
  3718. \p{Script_Extensions: Meroitic_Hieroglyphs} (Short: \p{Scx=Mero},
  3719. \p{Mero}) (32)
  3720. \p{Script_Extensions: Miao} (Short: \p{Scx=Miao}, \p{Miao}) (133)
  3721. \p{Script_Extensions: Mlym} \p{Script_Extensions=Malayalam} (119)
  3722. \p{Script_Extensions: Modi} (Short: \p{Scx=Modi}, \p{Modi}) (89)
  3723. \p{Script_Extensions: Mong} \p{Script_Extensions=Mongolian} (169)
  3724. \p{Script_Extensions: Mongolian} (Short: \p{Scx=Mong}, \p{Mong})
  3725. (169)
  3726. \p{Script_Extensions: Mro} (Short: \p{Scx=Mro}, \p{Mro}) (43)
  3727. \p{Script_Extensions: Mroo} \p{Script_Extensions=Mro} (43)
  3728. \p{Script_Extensions: Mtei} \p{Script_Extensions=Meetei_Mayek} (79)
  3729. \p{Script_Extensions: Mult} \p{Script_Extensions=Multani} (48)
  3730. \p{Script_Extensions: Multani} (Short: \p{Scx=Mult}, \p{Mult}) (48)
  3731. \p{Script_Extensions: Myanmar} (Short: \p{Scx=Mymr}, \p{Mymr})
  3732. (224)
  3733. \p{Script_Extensions: Mymr} \p{Script_Extensions=Myanmar} (224)
  3734. \p{Script_Extensions: Nabataean} (Short: \p{Scx=Nbat}, \p{Nbat})
  3735. (40)
  3736. \p{Script_Extensions: Narb} \p{Script_Extensions=
  3737. Old_North_Arabian} (32)
  3738. \p{Script_Extensions: Nbat} \p{Script_Extensions=Nabataean} (40)
  3739. \p{Script_Extensions: New_Tai_Lue} (Short: \p{Scx=Talu}, \p{Talu})
  3740. (83)
  3741. \p{Script_Extensions: Newa} (Short: \p{Scx=Newa}, \p{Newa}) (92)
  3742. \p{Script_Extensions: Nko} (Short: \p{Scx=Nko}, \p{Nko}) (59)
  3743. \p{Script_Extensions: Nkoo} \p{Script_Extensions=Nko} (59)
  3744. \p{Script_Extensions: Ogam} \p{Script_Extensions=Ogham} (29)
  3745. \p{Script_Extensions: Ogham} (Short: \p{Scx=Ogam}, \p{Ogam}) (29)
  3746. \p{Script_Extensions: Ol_Chiki} (Short: \p{Scx=Olck}, \p{Olck})
  3747. (48)
  3748. \p{Script_Extensions: Olck} \p{Script_Extensions=Ol_Chiki} (48)
  3749. \p{Script_Extensions: Old_Hungarian} (Short: \p{Scx=Hung},
  3750. \p{Hung}) (108)
  3751. \p{Script_Extensions: Old_Italic} (Short: \p{Scx=Ital}, \p{Ital})
  3752. (36)
  3753. \p{Script_Extensions: Old_North_Arabian} (Short: \p{Scx=Narb},
  3754. \p{Narb}) (32)
  3755. \p{Script_Extensions: Old_Permic} (Short: \p{Scx=Perm}, \p{Perm})
  3756. (44)
  3757. \p{Script_Extensions: Old_Persian} (Short: \p{Scx=Xpeo}, \p{Xpeo})
  3758. (50)
  3759. \p{Script_Extensions: Old_South_Arabian} (Short: \p{Scx=Sarb},
  3760. \p{Sarb}) (32)
  3761. \p{Script_Extensions: Old_Turkic} (Short: \p{Scx=Orkh}, \p{Orkh})
  3762. (73)
  3763. \p{Script_Extensions: Oriya} (Short: \p{Scx=Orya}, \p{Orya}) (94)
  3764. \p{Script_Extensions: Orkh} \p{Script_Extensions=Old_Turkic} (73)
  3765. \p{Script_Extensions: Orya} \p{Script_Extensions=Oriya} (94)
  3766. \p{Script_Extensions: Osage} (Short: \p{Scx=Osge}, \p{Osge}) (72)
  3767. \p{Script_Extensions: Osge} \p{Script_Extensions=Osage} (72)
  3768. \p{Script_Extensions: Osma} \p{Script_Extensions=Osmanya} (40)
  3769. \p{Script_Extensions: Osmanya} (Short: \p{Scx=Osma}, \p{Osma}) (40)
  3770. \p{Script_Extensions: Pahawh_Hmong} (Short: \p{Scx=Hmng},
  3771. \p{Hmng}) (127)
  3772. \p{Script_Extensions: Palm} \p{Script_Extensions=Palmyrene} (32)
  3773. \p{Script_Extensions: Palmyrene} (Short: \p{Scx=Palm}, \p{Palm})
  3774. (32)
  3775. \p{Script_Extensions: Pau_Cin_Hau} (Short: \p{Scx=Pauc}, \p{Pauc})
  3776. (57)
  3777. \p{Script_Extensions: Pauc} \p{Script_Extensions=Pau_Cin_Hau} (57)
  3778. \p{Script_Extensions: Perm} \p{Script_Extensions=Old_Permic} (44)
  3779. \p{Script_Extensions: Phag} \p{Script_Extensions=Phags_Pa} (59)
  3780. \p{Script_Extensions: Phags_Pa} (Short: \p{Scx=Phag}, \p{Phag})
  3781. (59)
  3782. \p{Script_Extensions: Phli} \p{Script_Extensions=
  3783. Inscriptional_Pahlavi} (27)
  3784. \p{Script_Extensions: Phlp} \p{Script_Extensions=Psalter_Pahlavi}
  3785. (30)
  3786. \p{Script_Extensions: Phnx} \p{Script_Extensions=Phoenician} (29)
  3787. \p{Script_Extensions: Phoenician} (Short: \p{Scx=Phnx}, \p{Phnx})
  3788. (29)
  3789. \p{Script_Extensions: Plrd} \p{Script_Extensions=Miao} (133)
  3790. \p{Script_Extensions: Prti} \p{Script_Extensions=
  3791. Inscriptional_Parthian} (30)
  3792. \p{Script_Extensions: Psalter_Pahlavi} (Short: \p{Scx=Phlp},
  3793. \p{Phlp}) (30)
  3794. \p{Script_Extensions: Qaac} \p{Script_Extensions=Coptic} (165)
  3795. \p{Script_Extensions: Qaai} \p{Script_Extensions=Inherited} (496)
  3796. \p{Script_Extensions: Rejang} (Short: \p{Scx=Rjng}, \p{Rjng}) (37)
  3797. \p{Script_Extensions: Rjng} \p{Script_Extensions=Rejang} (37)
  3798. \p{Script_Extensions: Runic} (Short: \p{Scx=Runr}, \p{Runr}) (86)
  3799. \p{Script_Extensions: Runr} \p{Script_Extensions=Runic} (86)
  3800. \p{Script_Extensions: Samaritan} (Short: \p{Scx=Samr}, \p{Samr})
  3801. (61)
  3802. \p{Script_Extensions: Samr} \p{Script_Extensions=Samaritan} (61)
  3803. \p{Script_Extensions: Sarb} \p{Script_Extensions=
  3804. Old_South_Arabian} (32)
  3805. \p{Script_Extensions: Saur} \p{Script_Extensions=Saurashtra} (82)
  3806. \p{Script_Extensions: Saurashtra} (Short: \p{Scx=Saur}, \p{Saur})
  3807. (82)
  3808. \p{Script_Extensions: Sgnw} \p{Script_Extensions=SignWriting} (672)
  3809. \p{Script_Extensions: Sharada} (Short: \p{Scx=Shrd}, \p{Shrd})
  3810. (100)
  3811. \p{Script_Extensions: Shavian} (Short: \p{Scx=Shaw}, \p{Shaw}) (48)
  3812. \p{Script_Extensions: Shaw} \p{Script_Extensions=Shavian} (48)
  3813. \p{Script_Extensions: Shrd} \p{Script_Extensions=Sharada} (100)
  3814. \p{Script_Extensions: Sidd} \p{Script_Extensions=Siddham} (92)
  3815. \p{Script_Extensions: Siddham} (Short: \p{Scx=Sidd}, \p{Sidd}) (92)
  3816. \p{Script_Extensions: SignWriting} (Short: \p{Scx=Sgnw}, \p{Sgnw})
  3817. (672)
  3818. \p{Script_Extensions: Sind} \p{Script_Extensions=Khudawadi} (81)
  3819. \p{Script_Extensions: Sinh} \p{Script_Extensions=Sinhala} (112)
  3820. \p{Script_Extensions: Sinhala} (Short: \p{Scx=Sinh}, \p{Sinh})
  3821. (112)
  3822. \p{Script_Extensions: Sora} \p{Script_Extensions=Sora_Sompeng} (35)
  3823. \p{Script_Extensions: Sora_Sompeng} (Short: \p{Scx=Sora},
  3824. \p{Sora}) (35)
  3825. \p{Script_Extensions: Sund} \p{Script_Extensions=Sundanese} (72)
  3826. \p{Script_Extensions: Sundanese} (Short: \p{Scx=Sund}, \p{Sund})
  3827. (72)
  3828. \p{Script_Extensions: Sylo} \p{Script_Extensions=Syloti_Nagri} (56)
  3829. \p{Script_Extensions: Syloti_Nagri} (Short: \p{Scx=Sylo},
  3830. \p{Sylo}) (56)
  3831. \p{Script_Extensions: Syrc} \p{Script_Extensions=Syriac} (93)
  3832. \p{Script_Extensions: Syriac} (Short: \p{Scx=Syrc}, \p{Syrc}) (93)
  3833. \p{Script_Extensions: Tagalog} (Short: \p{Scx=Tglg}, \p{Tglg}) (22)
  3834. \p{Script_Extensions: Tagb} \p{Script_Extensions=Tagbanwa} (20)
  3835. \p{Script_Extensions: Tagbanwa} (Short: \p{Scx=Tagb}, \p{Tagb})
  3836. (20)
  3837. \p{Script_Extensions: Tai_Le} (Short: \p{Scx=Tale}, \p{Tale}) (45)
  3838. \p{Script_Extensions: Tai_Tham} (Short: \p{Scx=Lana}, \p{Lana})
  3839. (127)
  3840. \p{Script_Extensions: Tai_Viet} (Short: \p{Scx=Tavt}, \p{Tavt})
  3841. (72)
  3842. \p{Script_Extensions: Takr} \p{Script_Extensions=Takri} (78)
  3843. \p{Script_Extensions: Takri} (Short: \p{Scx=Takr}, \p{Takr}) (78)
  3844. \p{Script_Extensions: Tale} \p{Script_Extensions=Tai_Le} (45)
  3845. \p{Script_Extensions: Talu} \p{Script_Extensions=New_Tai_Lue} (83)
  3846. \p{Script_Extensions: Tamil} (Short: \p{Scx=Taml}, \p{Taml}) (80)
  3847. \p{Script_Extensions: Taml} \p{Script_Extensions=Tamil} (80)
  3848. \p{Script_Extensions: Tang} \p{Script_Extensions=Tangut} (6881)
  3849. \p{Script_Extensions: Tangut} (Short: \p{Scx=Tang}, \p{Tang})
  3850. (6881)
  3851. \p{Script_Extensions: Tavt} \p{Script_Extensions=Tai_Viet} (72)
  3852. \p{Script_Extensions: Telu} \p{Script_Extensions=Telugu} (101)
  3853. \p{Script_Extensions: Telugu} (Short: \p{Scx=Telu}, \p{Telu}) (101)
  3854. \p{Script_Extensions: Tfng} \p{Script_Extensions=Tifinagh} (59)
  3855. \p{Script_Extensions: Tglg} \p{Script_Extensions=Tagalog} (22)
  3856. \p{Script_Extensions: Thaa} \p{Script_Extensions=Thaana} (65)
  3857. \p{Script_Extensions: Thaana} (Short: \p{Scx=Thaa}, \p{Thaa}) (65)
  3858. \p{Script_Extensions: Thai} (Short: \p{Scx=Thai}, \p{Thai}) (86)
  3859. \p{Script_Extensions: Tibetan} (Short: \p{Scx=Tibt}, \p{Tibt})
  3860. (207)
  3861. \p{Script_Extensions: Tibt} \p{Script_Extensions=Tibetan} (207)
  3862. \p{Script_Extensions: Tifinagh} (Short: \p{Scx=Tfng}, \p{Tfng})
  3863. (59)
  3864. \p{Script_Extensions: Tirh} \p{Script_Extensions=Tirhuta} (94)
  3865. \p{Script_Extensions: Tirhuta} (Short: \p{Scx=Tirh}, \p{Tirh}) (94)
  3866. \p{Script_Extensions: Ugar} \p{Script_Extensions=Ugaritic} (31)
  3867. \p{Script_Extensions: Ugaritic} (Short: \p{Scx=Ugar}, \p{Ugar})
  3868. (31)
  3869. \p{Script_Extensions: Unknown} (Short: \p{Scx=Zzzz}, \p{Zzzz})
  3870. (985_875 plus all above-Unicode code
  3871. points)
  3872. \p{Script_Extensions: Vai} (Short: \p{Scx=Vai}, \p{Vai}) (300)
  3873. \p{Script_Extensions: Vaii} \p{Script_Extensions=Vai} (300)
  3874. \p{Script_Extensions: Wara} \p{Script_Extensions=Warang_Citi} (84)
  3875. \p{Script_Extensions: Warang_Citi} (Short: \p{Scx=Wara}, \p{Wara})
  3876. (84)
  3877. \p{Script_Extensions: Xpeo} \p{Script_Extensions=Old_Persian} (50)
  3878. \p{Script_Extensions: Xsux} \p{Script_Extensions=Cuneiform} (1234)
  3879. \p{Script_Extensions: Yi} (Short: \p{Scx=Yi}, \p{Yi}) (1246)
  3880. \p{Script_Extensions: Yiii} \p{Script_Extensions=Yi} (1246)
  3881. \p{Script_Extensions: Zinh} \p{Script_Extensions=Inherited} (496)
  3882. \p{Script_Extensions: Zyyy} \p{Script_Extensions=Common} (6864)
  3883. \p{Script_Extensions: Zzzz} \p{Script_Extensions=Unknown} (985_875
  3884. plus all above-Unicode code points)
  3885. \p{Scx: *} \p{Script_Extensions: *}
  3886. \p{SD} \p{Soft_Dotted} (= \p{Soft_Dotted=Y}) (46)
  3887. \p{SD: *} \p{Soft_Dotted: *}
  3888. \p{Sentence_Break: AT} \p{Sentence_Break=ATerm} (4)
  3889. \p{Sentence_Break: ATerm} (Short: \p{SB=AT}) (4)
  3890. \p{Sentence_Break: CL} \p{Sentence_Break=Close} (187)
  3891. \p{Sentence_Break: Close} (Short: \p{SB=CL}) (187)
  3892. \p{Sentence_Break: CR} (Short: \p{SB=CR}) (1)
  3893. \p{Sentence_Break: EX} \p{Sentence_Break=Extend} (2197)
  3894. \p{Sentence_Break: Extend} (Short: \p{SB=EX}) (2197)
  3895. \p{Sentence_Break: FO} \p{Sentence_Break=Format} (53)
  3896. \p{Sentence_Break: Format} (Short: \p{SB=FO}) (53)
  3897. \p{Sentence_Break: LE} \p{Sentence_Break=OLetter} (113_027)
  3898. \p{Sentence_Break: LF} (Short: \p{SB=LF}) (1)
  3899. \p{Sentence_Break: LO} \p{Sentence_Break=Lower} (2251)
  3900. \p{Sentence_Break: Lower} (Short: \p{SB=LO}) (2251)
  3901. \p{Sentence_Break: NU} \p{Sentence_Break=Numeric} (572)
  3902. \p{Sentence_Break: Numeric} (Short: \p{SB=NU}) (572)
  3903. \p{Sentence_Break: OLetter} (Short: \p{SB=LE}) (113_027)
  3904. \p{Sentence_Break: Other} (Short: \p{SB=XX}) (993_796 plus all
  3905. above-Unicode code points)
  3906. \p{Sentence_Break: SC} \p{Sentence_Break=SContinue} (26)
  3907. \p{Sentence_Break: SContinue} (Short: \p{SB=SC}) (26)
  3908. \p{Sentence_Break: SE} \p{Sentence_Break=Sep} (3)
  3909. \p{Sentence_Break: Sep} (Short: \p{SB=SE}) (3)
  3910. \p{Sentence_Break: Sp} (Short: \p{SB=Sp}) (20)
  3911. \p{Sentence_Break: ST} \p{Sentence_Break=STerm} (121)
  3912. \p{Sentence_Break: STerm} (Short: \p{SB=ST}) (121)
  3913. \p{Sentence_Break: UP} \p{Sentence_Break=Upper} (1853)
  3914. \p{Sentence_Break: Upper} (Short: \p{SB=UP}) (1853)
  3915. \p{Sentence_Break: XX} \p{Sentence_Break=Other} (993_796 plus all
  3916. above-Unicode code points)
  3917. \p{Sentence_Terminal} \p{Sentence_Terminal=Y} (Short: \p{STerm})
  3918. (124)
  3919. \p{Sentence_Terminal: N*} (Short: \p{STerm=N}, \P{STerm})
  3920. (1_113_988 plus all above-Unicode code
  3921. points)
  3922. \p{Sentence_Terminal: Y*} (Short: \p{STerm=Y}, \p{STerm}) (124)
  3923. \p{Separator} \p{General_Category=Separator} (Short:
  3924. \p{Z}) (19)
  3925. \p{Sgnw} \p{SignWriting} (= \p{Script_Extensions=
  3926. SignWriting}) (672)
  3927. \p{Sharada} \p{Script_Extensions=Sharada} (Short:
  3928. \p{Shrd}; NOT \p{Block=Sharada}) (100)
  3929. \p{Shavian} \p{Script_Extensions=Shavian} (Short:
  3930. \p{Shaw}) (48)
  3931. \p{Shaw} \p{Shavian} (= \p{Script_Extensions=
  3932. Shavian}) (48)
  3933. X \p{Shorthand_Format_Controls} \p{Block=Shorthand_Format_Controls}
  3934. (16)
  3935. \p{Shrd} \p{Sharada} (= \p{Script_Extensions=
  3936. Sharada}) (NOT \p{Block=Sharada}) (100)
  3937. \p{Sidd} \p{Siddham} (= \p{Script_Extensions=
  3938. Siddham}) (NOT \p{Block=Siddham}) (92)
  3939. \p{Siddham} \p{Script_Extensions=Siddham} (Short:
  3940. \p{Sidd}; NOT \p{Block=Siddham}) (92)
  3941. \p{SignWriting} \p{Script_Extensions=SignWriting} (Short:
  3942. \p{Sgnw}) (672)
  3943. \p{Sind} \p{Khudawadi} (= \p{Script_Extensions=
  3944. Khudawadi}) (NOT \p{Block=Khudawadi})
  3945. (81)
  3946. \p{Sinh} \p{Sinhala} (= \p{Script_Extensions=
  3947. Sinhala}) (NOT \p{Block=Sinhala}) (112)
  3948. \p{Sinhala} \p{Script_Extensions=Sinhala} (Short:
  3949. \p{Sinh}; NOT \p{Block=Sinhala}) (112)
  3950. X \p{Sinhala_Archaic_Numbers} \p{Block=Sinhala_Archaic_Numbers} (32)
  3951. \p{Sk} \p{Modifier_Symbol} (=
  3952. \p{General_Category=Modifier_Symbol})
  3953. (121)
  3954. \p{Sm} \p{Math_Symbol} (= \p{General_Category=
  3955. Math_Symbol}) (948)
  3956. X \p{Small_Form_Variants} \p{Block=Small_Form_Variants} (Short:
  3957. \p{InSmallForms}) (32)
  3958. X \p{Small_Forms} \p{Small_Form_Variants} (= \p{Block=
  3959. Small_Form_Variants}) (32)
  3960. \p{So} \p{Other_Symbol} (= \p{General_Category=
  3961. Other_Symbol}) (5777)
  3962. \p{Soft_Dotted} \p{Soft_Dotted=Y} (Short: \p{SD}) (46)
  3963. \p{Soft_Dotted: N*} (Short: \p{SD=N}, \P{SD}) (1_114_066 plus
  3964. all above-Unicode code points)
  3965. \p{Soft_Dotted: Y*} (Short: \p{SD=Y}, \p{SD}) (46)
  3966. \p{Sora} \p{Sora_Sompeng} (= \p{Script_Extensions=
  3967. Sora_Sompeng}) (NOT \p{Block=
  3968. Sora_Sompeng}) (35)
  3969. \p{Sora_Sompeng} \p{Script_Extensions=Sora_Sompeng} (Short:
  3970. \p{Sora}; NOT \p{Block=Sora_Sompeng})
  3971. (35)
  3972. \p{Space} \p{White_Space} (= \p{White_Space=Y}) (25)
  3973. \p{Space: *} \p{White_Space: *}
  3974. \p{Space_Separator} \p{General_Category=Space_Separator}
  3975. (Short: \p{Zs}) (17)
  3976. \p{SpacePerl} \p{XPosixSpace} (25)
  3977. \p{Spacing_Mark} \p{General_Category=Spacing_Mark} (Short:
  3978. \p{Mc}) (394)
  3979. X \p{Spacing_Modifier_Letters} \p{Block=Spacing_Modifier_Letters}
  3980. (Short: \p{InModifierLetters}) (80)
  3981. X \p{Specials} \p{Block=Specials} (16)
  3982. \p{STerm} \p{Sentence_Terminal} (=
  3983. \p{Sentence_Terminal=Y}) (124)
  3984. \p{STerm: *} \p{Sentence_Terminal: *}
  3985. \p{Sund} \p{Sundanese} (= \p{Script_Extensions=
  3986. Sundanese}) (NOT \p{Block=Sundanese})
  3987. (72)
  3988. \p{Sundanese} \p{Script_Extensions=Sundanese} (Short:
  3989. \p{Sund}; NOT \p{Block=Sundanese}) (72)
  3990. X \p{Sundanese_Sup} \p{Sundanese_Supplement} (= \p{Block=
  3991. Sundanese_Supplement}) (16)
  3992. X \p{Sundanese_Supplement} \p{Block=Sundanese_Supplement} (Short:
  3993. \p{InSundaneseSup}) (16)
  3994. X \p{Sup_Arrows_A} \p{Supplemental_Arrows_A} (= \p{Block=
  3995. Supplemental_Arrows_A}) (16)
  3996. X \p{Sup_Arrows_B} \p{Supplemental_Arrows_B} (= \p{Block=
  3997. Supplemental_Arrows_B}) (128)
  3998. X \p{Sup_Arrows_C} \p{Supplemental_Arrows_C} (= \p{Block=
  3999. Supplemental_Arrows_C}) (256)
  4000. X \p{Sup_Math_Operators} \p{Supplemental_Mathematical_Operators} (=
  4001. \p{Block=
  4002. Supplemental_Mathematical_Operators})
  4003. (256)
  4004. X \p{Sup_PUA_A} \p{Supplementary_Private_Use_Area_A} (=
  4005. \p{Block=
  4006. Supplementary_Private_Use_Area_A})
  4007. (65_536)
  4008. X \p{Sup_PUA_B} \p{Supplementary_Private_Use_Area_B} (=
  4009. \p{Block=
  4010. Supplementary_Private_Use_Area_B})
  4011. (65_536)
  4012. X \p{Sup_Punctuation} \p{Supplemental_Punctuation} (= \p{Block=
  4013. Supplemental_Punctuation}) (128)
  4014. X \p{Sup_Symbols_And_Pictographs}
  4015. \p{Supplemental_Symbols_And_Pictographs}
  4016. (= \p{Block=
  4017. Supplemental_Symbols_And_Pictographs})
  4018. (256)
  4019. X \p{Super_And_Sub} \p{Superscripts_And_Subscripts} (=
  4020. \p{Block=Superscripts_And_Subscripts})
  4021. (48)
  4022. X \p{Superscripts_And_Subscripts} \p{Block=
  4023. Superscripts_And_Subscripts} (Short:
  4024. \p{InSuperAndSub}) (48)
  4025. X \p{Supplemental_Arrows_A} \p{Block=Supplemental_Arrows_A} (Short:
  4026. \p{InSupArrowsA}) (16)
  4027. X \p{Supplemental_Arrows_B} \p{Block=Supplemental_Arrows_B} (Short:
  4028. \p{InSupArrowsB}) (128)
  4029. X \p{Supplemental_Arrows_C} \p{Block=Supplemental_Arrows_C} (Short:
  4030. \p{InSupArrowsC}) (256)
  4031. X \p{Supplemental_Mathematical_Operators} \p{Block=
  4032. Supplemental_Mathematical_Operators}
  4033. (Short: \p{InSupMathOperators}) (256)
  4034. X \p{Supplemental_Punctuation} \p{Block=Supplemental_Punctuation}
  4035. (Short: \p{InSupPunctuation}) (128)
  4036. X \p{Supplemental_Symbols_And_Pictographs} \p{Block=
  4037. Supplemental_Symbols_And_Pictographs}
  4038. (Short: \p{InSupSymbolsAndPictographs})
  4039. (256)
  4040. X \p{Supplementary_Private_Use_Area_A} \p{Block=
  4041. Supplementary_Private_Use_Area_A}
  4042. (Short: \p{InSupPUAA}) (65_536)
  4043. X \p{Supplementary_Private_Use_Area_B} \p{Block=
  4044. Supplementary_Private_Use_Area_B}
  4045. (Short: \p{InSupPUAB}) (65_536)
  4046. \p{Surrogate} \p{General_Category=Surrogate} (Short:
  4047. \p{Cs}) (2048)
  4048. X \p{Sutton_SignWriting} \p{Block=Sutton_SignWriting} (688)
  4049. \p{Sylo} \p{Syloti_Nagri} (= \p{Script_Extensions=
  4050. Syloti_Nagri}) (NOT \p{Block=
  4051. Syloti_Nagri}) (56)
  4052. \p{Syloti_Nagri} \p{Script_Extensions=Syloti_Nagri} (Short:
  4053. \p{Sylo}; NOT \p{Block=Syloti_Nagri})
  4054. (56)
  4055. \p{Symbol} \p{General_Category=Symbol} (Short: \p{S})
  4056. (6899)
  4057. \p{Syrc} \p{Syriac} (= \p{Script_Extensions=
  4058. Syriac}) (NOT \p{Block=Syriac}) (93)
  4059. \p{Syriac} \p{Script_Extensions=Syriac} (Short:
  4060. \p{Syrc}; NOT \p{Block=Syriac}) (93)
  4061. \p{Tagalog} \p{Script_Extensions=Tagalog} (Short:
  4062. \p{Tglg}; NOT \p{Block=Tagalog}) (22)
  4063. \p{Tagb} \p{Tagbanwa} (= \p{Script_Extensions=
  4064. Tagbanwa}) (NOT \p{Block=Tagbanwa}) (20)
  4065. \p{Tagbanwa} \p{Script_Extensions=Tagbanwa} (Short:
  4066. \p{Tagb}; NOT \p{Block=Tagbanwa}) (20)
  4067. X \p{Tags} \p{Block=Tags} (128)
  4068. \p{Tai_Le} \p{Script_Extensions=Tai_Le} (Short:
  4069. \p{Tale}; NOT \p{Block=Tai_Le}) (45)
  4070. \p{Tai_Tham} \p{Script_Extensions=Tai_Tham} (Short:
  4071. \p{Lana}; NOT \p{Block=Tai_Tham}) (127)
  4072. \p{Tai_Viet} \p{Script_Extensions=Tai_Viet} (Short:
  4073. \p{Tavt}; NOT \p{Block=Tai_Viet}) (72)
  4074. X \p{Tai_Xuan_Jing} \p{Tai_Xuan_Jing_Symbols} (= \p{Block=
  4075. Tai_Xuan_Jing_Symbols}) (96)
  4076. X \p{Tai_Xuan_Jing_Symbols} \p{Block=Tai_Xuan_Jing_Symbols} (Short:
  4077. \p{InTaiXuanJing}) (96)
  4078. \p{Takr} \p{Takri} (= \p{Script_Extensions=Takri})
  4079. (NOT \p{Block=Takri}) (78)
  4080. \p{Takri} \p{Script_Extensions=Takri} (Short:
  4081. \p{Takr}; NOT \p{Block=Takri}) (78)
  4082. \p{Tale} \p{Tai_Le} (= \p{Script_Extensions=
  4083. Tai_Le}) (NOT \p{Block=Tai_Le}) (45)
  4084. \p{Talu} \p{New_Tai_Lue} (= \p{Script_Extensions=
  4085. New_Tai_Lue}) (NOT \p{Block=
  4086. New_Tai_Lue}) (83)
  4087. \p{Tamil} \p{Script_Extensions=Tamil} (Short:
  4088. \p{Taml}; NOT \p{Block=Tamil}) (80)
  4089. \p{Taml} \p{Tamil} (= \p{Script_Extensions=Tamil})
  4090. (NOT \p{Block=Tamil}) (80)
  4091. \p{Tang} \p{Tangut} (= \p{Script_Extensions=
  4092. Tangut}) (NOT \p{Block=Tangut}) (6881)
  4093. \p{Tangut} \p{Script_Extensions=Tangut} (Short:
  4094. \p{Tang}; NOT \p{Block=Tangut}) (6881)
  4095. X \p{Tangut_Components} \p{Block=Tangut_Components} (768)
  4096. \p{Tavt} \p{Tai_Viet} (= \p{Script_Extensions=
  4097. Tai_Viet}) (NOT \p{Block=Tai_Viet}) (72)
  4098. \p{Telu} \p{Telugu} (= \p{Script_Extensions=
  4099. Telugu}) (NOT \p{Block=Telugu}) (101)
  4100. \p{Telugu} \p{Script_Extensions=Telugu} (Short:
  4101. \p{Telu}; NOT \p{Block=Telugu}) (101)
  4102. \p{Term} \p{Terminal_Punctuation} (=
  4103. \p{Terminal_Punctuation=Y}) (246)
  4104. \p{Term: *} \p{Terminal_Punctuation: *}
  4105. \p{Terminal_Punctuation} \p{Terminal_Punctuation=Y} (Short:
  4106. \p{Term}) (246)
  4107. \p{Terminal_Punctuation: N*} (Short: \p{Term=N}, \P{Term})
  4108. (1_113_866 plus all above-Unicode code
  4109. points)
  4110. \p{Terminal_Punctuation: Y*} (Short: \p{Term=Y}, \p{Term}) (246)
  4111. \p{Tfng} \p{Tifinagh} (= \p{Script_Extensions=
  4112. Tifinagh}) (NOT \p{Block=Tifinagh}) (59)
  4113. \p{Tglg} \p{Tagalog} (= \p{Script_Extensions=
  4114. Tagalog}) (NOT \p{Block=Tagalog}) (22)
  4115. \p{Thaa} \p{Thaana} (= \p{Script_Extensions=
  4116. Thaana}) (NOT \p{Block=Thaana}) (65)
  4117. \p{Thaana} \p{Script_Extensions=Thaana} (Short:
  4118. \p{Thaa}; NOT \p{Block=Thaana}) (65)
  4119. \p{Thai} \p{Script_Extensions=Thai} (NOT \p{Block=
  4120. Thai}) (86)
  4121. \p{Tibetan} \p{Script_Extensions=Tibetan} (Short:
  4122. \p{Tibt}; NOT \p{Block=Tibetan}) (207)
  4123. \p{Tibt} \p{Tibetan} (= \p{Script_Extensions=
  4124. Tibetan}) (NOT \p{Block=Tibetan}) (207)
  4125. \p{Tifinagh} \p{Script_Extensions=Tifinagh} (Short:
  4126. \p{Tfng}; NOT \p{Block=Tifinagh}) (59)
  4127. \p{Tirh} \p{Tirhuta} (= \p{Script_Extensions=
  4128. Tirhuta}) (NOT \p{Block=Tirhuta}) (94)
  4129. \p{Tirhuta} \p{Script_Extensions=Tirhuta} (Short:
  4130. \p{Tirh}; NOT \p{Block=Tirhuta}) (94)
  4131. \p{Title} \p{Titlecase} (/i= Cased=Yes) (31)
  4132. \p{Titlecase} (= \p{Gc=Lt}) (Short: \p{Title}; /i=
  4133. Cased=Yes) (31)
  4134. \p{Titlecase_Letter} \p{General_Category=Titlecase_Letter}
  4135. (Short: \p{Lt}; /i= General_Category=
  4136. Cased_Letter) (31)
  4137. X \p{Transport_And_Map} \p{Transport_And_Map_Symbols} (= \p{Block=
  4138. Transport_And_Map_Symbols}) (128)
  4139. X \p{Transport_And_Map_Symbols} \p{Block=Transport_And_Map_Symbols}
  4140. (Short: \p{InTransportAndMap}) (128)
  4141. X \p{UCAS} \p{Unified_Canadian_Aboriginal_Syllabics}
  4142. (= \p{Block=
  4143. Unified_Canadian_Aboriginal_Syllabics})
  4144. (640)
  4145. X \p{UCAS_Ext}
  4146. \p{Unified_Canadian_Aboriginal_Syllabics_Extended}
  4147. (= \p{Block=Unified_Canadian_Aboriginal_Syllabics_Extended})
  4148. (80)
  4149. \p{Ugar} \p{Ugaritic} (= \p{Script_Extensions=
  4150. Ugaritic}) (NOT \p{Block=Ugaritic}) (31)
  4151. \p{Ugaritic} \p{Script_Extensions=Ugaritic} (Short:
  4152. \p{Ugar}; NOT \p{Block=Ugaritic}) (31)
  4153. \p{UIdeo} \p{Unified_Ideograph} (=
  4154. \p{Unified_Ideograph=Y}) (80_388)
  4155. \p{UIdeo: *} \p{Unified_Ideograph: *}
  4156. \p{Unassigned} \p{General_Category=Unassigned} (Short:
  4157. \p{Cn}) (846_359 plus all above-Unicode
  4158. code points)
  4159. \p{Unicode} \p{Any} (1_114_112)
  4160. X \p{Unified_Canadian_Aboriginal_Syllabics} \p{Block=
  4161. Unified_Canadian_Aboriginal_Syllabics}
  4162. (Short: \p{InUCAS}) (640)
  4163. X \p{Unified_Canadian_Aboriginal_Syllabics_Extended}
  4164. \p{Block=Unified_Canadian_Aboriginal_Syllabics_Extended}
  4165. (Short: \p{InUCASExt}) (80)
  4166. \p{Unified_Ideograph} \p{Unified_Ideograph=Y} (Short: \p{UIdeo})
  4167. (80_388)
  4168. \p{Unified_Ideograph: N*} (Short: \p{UIdeo=N}, \P{UIdeo})
  4169. (1_033_724 plus all above-Unicode code
  4170. points)
  4171. \p{Unified_Ideograph: Y*} (Short: \p{UIdeo=Y}, \p{UIdeo}) (80_388)
  4172. \p{Unknown} \p{Script_Extensions=Unknown} (Short:
  4173. \p{Zzzz}) (985_875 plus all
  4174. above-Unicode code points)
  4175. \p{Upper} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
  4176. Cased=Yes) (1822)
  4177. \p{Upper: *} \p{Uppercase: *}
  4178. \p{Uppercase} \p{XPosixUpper} (= \p{Uppercase=Y}) (/i=
  4179. Cased=Yes) (1822)
  4180. \p{Uppercase: N*} (Short: \p{Upper=N}, \P{Upper}; /i= Cased=
  4181. No) (1_112_290 plus all above-Unicode
  4182. code points)
  4183. \p{Uppercase: Y*} (Short: \p{Upper=Y}, \p{Upper}; /i= Cased=
  4184. Yes) (1822)
  4185. \p{Uppercase_Letter} \p{General_Category=Uppercase_Letter}
  4186. (Short: \p{Lu}; /i= General_Category=
  4187. Cased_Letter) (1702)
  4188. \p{Vai} \p{Script_Extensions=Vai} (NOT \p{Block=
  4189. Vai}) (300)
  4190. \p{Vaii} \p{Vai} (= \p{Script_Extensions=Vai}) (NOT
  4191. \p{Block=Vai}) (300)
  4192. \p{Variation_Selector} \p{Variation_Selector=Y} (Short: \p{VS};
  4193. NOT \p{Variation_Selectors}) (259)
  4194. \p{Variation_Selector: N*} (Short: \p{VS=N}, \P{VS}) (1_113_853
  4195. plus all above-Unicode code points)
  4196. \p{Variation_Selector: Y*} (Short: \p{VS=Y}, \p{VS}) (259)
  4197. X \p{Variation_Selectors} \p{Block=Variation_Selectors} (Short:
  4198. \p{InVS}) (16)
  4199. X \p{Variation_Selectors_Supplement} \p{Block=
  4200. Variation_Selectors_Supplement} (Short:
  4201. \p{InVSSup}) (240)
  4202. X \p{Vedic_Ext} \p{Vedic_Extensions} (= \p{Block=
  4203. Vedic_Extensions}) (48)
  4204. X \p{Vedic_Extensions} \p{Block=Vedic_Extensions} (Short:
  4205. \p{InVedicExt}) (48)
  4206. X \p{Vertical_Forms} \p{Block=Vertical_Forms} (16)
  4207. \p{VertSpace} \v (7)
  4208. \p{VS} \p{Variation_Selector} (=
  4209. \p{Variation_Selector=Y}) (NOT
  4210. \p{Variation_Selectors}) (259)
  4211. \p{VS: *} \p{Variation_Selector: *}
  4212. X \p{VS_Sup} \p{Variation_Selectors_Supplement} (=
  4213. \p{Block=
  4214. Variation_Selectors_Supplement}) (240)
  4215. \p{Wara} \p{Warang_Citi} (= \p{Script_Extensions=
  4216. Warang_Citi}) (NOT \p{Block=
  4217. Warang_Citi}) (84)
  4218. \p{Warang_Citi} \p{Script_Extensions=Warang_Citi} (Short:
  4219. \p{Wara}; NOT \p{Block=Warang_Citi}) (84)
  4220. \p{WB: *} \p{Word_Break: *}
  4221. \p{White_Space} \p{White_Space=Y} (Short: \p{Space}) (25)
  4222. \p{White_Space: N*} (Short: \p{Space=N}, \P{Space}) (1_114_087
  4223. plus all above-Unicode code points)
  4224. \p{White_Space: Y*} (Short: \p{Space=Y}, \p{Space}) (25)
  4225. \p{Word} \p{XPosixWord} (119_821)
  4226. \p{Word_Break: ALetter} (Short: \p{WB=LE}) (27_992)
  4227. \p{Word_Break: CR} (Short: \p{WB=CR}) (1)
  4228. \p{Word_Break: Double_Quote} (Short: \p{WB=DQ}) (1)
  4229. \p{Word_Break: DQ} \p{Word_Break=Double_Quote} (1)
  4230. \p{Word_Break: E_Base} (Short: \p{WB=EB}) (79)
  4231. \p{Word_Break: E_Base_GAZ} (Short: \p{WB=EBG}) (4)
  4232. \p{Word_Break: E_Modifier} (Short: \p{WB=EM}) (5)
  4233. \p{Word_Break: EB} \p{Word_Break=E_Base} (79)
  4234. \p{Word_Break: EBG} \p{Word_Break=E_Base_GAZ} (4)
  4235. \p{Word_Break: EM} \p{Word_Break=E_Modifier} (5)
  4236. \p{Word_Break: EX} \p{Word_Break=ExtendNumLet} (11)
  4237. \p{Word_Break: Extend} (Short: \p{WB=Extend}) (2196)
  4238. \p{Word_Break: ExtendNumLet} (Short: \p{WB=EX}) (11)
  4239. \p{Word_Break: FO} \p{Word_Break=Format} (52)
  4240. \p{Word_Break: Format} (Short: \p{WB=FO}) (52)
  \p{Word_Break: GAZ} \p{Word_Break=Glue_After_Zwj} (3)
  \p{Word_Break: Glue_After_Zwj} (Short: \p{WB=GAZ}) (3)
  \p{Word_Break: Hebrew_Letter} (Short: \p{WB=HL}) (74)
  \p{Word_Break: HL} \p{Word_Break=Hebrew_Letter} (74)
  \p{Word_Break: KA} \p{Word_Break=Katakana} (310)
  \p{Word_Break: Katakana} (Short: \p{WB=KA}) (310)
  \p{Word_Break: LE} \p{Word_Break=ALetter} (27_992)
  \p{Word_Break: LF} (Short: \p{WB=LF}) (1)
  \p{Word_Break: MB} \p{Word_Break=MidNumLet} (7)
  \p{Word_Break: MidLetter} (Short: \p{WB=ML}) (9)
  \p{Word_Break: MidNum} (Short: \p{WB=MN}) (15)
  \p{Word_Break: MidNumLet} (Short: \p{WB=MB}) (7)
  \p{Word_Break: ML} \p{Word_Break=MidLetter} (9)
  \p{Word_Break: MN} \p{Word_Break=MidNum} (15)
  \p{Word_Break: Newline} (Short: \p{WB=NL}) (5)
  \p{Word_Break: NL} \p{Word_Break=Newline} (5)
  \p{Word_Break: NU} \p{Word_Break=Numeric} (571)
  \p{Word_Break: Numeric} (Short: \p{WB=NU}) (571)
  \p{Word_Break: Other} (Short: \p{WB=XX}) (1_082_748 plus all above-Unicode code points)
  \p{Word_Break: Regional_Indicator} (Short: \p{WB=RI}) (26)
  \p{Word_Break: RI} \p{Word_Break=Regional_Indicator} (26)
  \p{Word_Break: Single_Quote} (Short: \p{WB=SQ}) (1)
  \p{Word_Break: SQ} \p{Word_Break=Single_Quote} (1)
  \p{Word_Break: XX} \p{Word_Break=Other} (1_082_748 plus all above-Unicode code points)
  \p{Word_Break: ZWJ} (Short: \p{WB=ZWJ}) (1)
  \p{WSpace} \p{White_Space} (= \p{White_Space=Y}) (25)
  \p{WSpace: *} \p{White_Space: *}
  \p{XDigit} \p{XPosixXDigit} (= \p{Hex_Digit=Y}) (44)
  \p{XID_Continue} \p{XID_Continue=Y} (Short: \p{XIDC}) (119_672)
  \p{XID_Continue: N*} (Short: \p{XIDC=N}, \P{XIDC}) (994_440 plus all above-Unicode code points)
  \p{XID_Continue: Y*} (Short: \p{XIDC=Y}, \p{XIDC}) (119_672)
  \p{XID_Start} \p{XID_Start=Y} (Short: \p{XIDS}) (116_984)
  \p{XID_Start: N*} (Short: \p{XIDS=N}, \P{XIDS}) (997_128 plus all above-Unicode code points)
  \p{XID_Start: Y*} (Short: \p{XIDS=Y}, \p{XIDS}) (116_984)
  \p{XIDC} \p{XID_Continue} (= \p{XID_Continue=Y}) (119_672)
  \p{XIDC: *} \p{XID_Continue: *}
  \p{XIDS} \p{XID_Start} (= \p{XID_Start=Y}) (116_984)
  \p{XIDS: *} \p{XID_Start: *}
  \p{Xpeo} \p{Old_Persian} (= \p{Script_Extensions=Old_Persian}) (NOT \p{Block=Old_Persian}) (50)
  \p{XPerlSpace} \p{XPosixSpace} (25)
  \p{XPosixAlnum} Alphabetic and (decimal) Numeric (Short: \p{Alnum}) (118_820)
  \p{XPosixAlpha} \p{Alphabetic=Y} (Short: \p{Alpha}) (118_240)
  \p{XPosixBlank} \h, Horizontal white space (Short: \p{Blank}) (18)
  \p{XPosixCntrl} \p{General_Category=Control} Control characters (Short: \p{Cc}) (65)
  \p{XPosixDigit} \p{General_Category=Decimal_Number} [0-9] + all other decimal digits (Short: \p{Nd}) (580)
  \p{XPosixGraph} Characters that are graphical (Short: \p{Graph}) (265_621)
  \p{XPosixLower} \p{Lowercase=Y} (Short: \p{Lower}; /i= Cased=Yes) (2252)
  \p{XPosixPrint} Characters that are graphical plus space characters (but no controls) (Short: \p{Print}) (265_638)
  \p{XPosixPunct} \p{Punct} + ASCII-range \p{Symbol} (757)
  \p{XPosixSpace} \s including beyond ASCII and vertical tab (Short: \p{SpacePerl}) (25)
  \p{XPosixUpper} \p{Uppercase=Y} (Short: \p{Upper}; /i= Cased=Yes) (1822)
  \p{XPosixWord} \w, including beyond ASCII; = \p{Alnum} + \pM + \p{Pc} (Short: \p{Word}) (119_821)
  \p{XPosixXDigit} \p{Hex_Digit=Y} (Short: \p{Hex}) (44)
  \p{Xsux} \p{Cuneiform} (= \p{Script_Extensions=Cuneiform}) (NOT \p{Block=Cuneiform}) (1234)
  \p{Yi} \p{Script_Extensions=Yi} (1246)
  X \p{Yi_Radicals} \p{Block=Yi_Radicals} (64)
  X \p{Yi_Syllables} \p{Block=Yi_Syllables} (1168)
  \p{Yiii} \p{Yi} (= \p{Script_Extensions=Yi}) (1246)
  X \p{Yijing} \p{Yijing_Hexagram_Symbols} (= \p{Block=Yijing_Hexagram_Symbols}) (64)
  X \p{Yijing_Hexagram_Symbols} \p{Block=Yijing_Hexagram_Symbols} (Short: \p{InYijing}) (64)
  \p{Z} \pZ \p{Separator} (= \p{General_Category=Separator}) (19)
  \p{Zinh} \p{Inherited} (= \p{Script_Extensions=Inherited}) (496)
  \p{Zl} \p{Line_Separator} (= \p{General_Category=Line_Separator}) (1)
  \p{Zp} \p{Paragraph_Separator} (= \p{General_Category=Paragraph_Separator}) (1)
  \p{Zs} \p{Space_Separator} (= \p{General_Category=Space_Separator}) (17)
  \p{Zyyy} \p{Common} (= \p{Script_Extensions=Common}) (6864)
  \p{Zzzz} \p{Unknown} (= \p{Script_Extensions=Unknown}) (985_875 plus all above-Unicode code points)
  TX\p{_CanonDCIJ} (For internal use by Perl, not necessarily stable) (= \p{Soft_Dotted=Y}) (46)
  TX\p{_Case_Ignorable} (For internal use by Perl, not necessarily stable) (= \p{Case_Ignorable=Y}) (2240)
  TX\p{_CombAbove} (For internal use by Perl, not necessarily stable) (= \p{Canonical_Combining_Class=Above}) (461)
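
As a rough illustration of how entries in the table above are used, the following sketch tries a few of them in regular expression matches. The sample characters and variable names are chosen here for illustration and do not come from the table itself.

```perl
use strict;
use warnings;

# HEBREW LETTER ALEF (U+05D0): the long and short forms are equivalent.
my $alef       = "\x{5D0}";
my $alef_long  = $alef =~ /\p{Word_Break: Hebrew_Letter}/ ? 1 : 0;
my $alef_short = $alef =~ /\p{WB=HL}/ ? 1 : 0;

# ARABIC-INDIC DIGIT ZERO (U+0660) is a decimal digit outside ASCII, so it
# is in XPosixDigit but not in the ASCII-only PosixDigit.
my $arabic_zero = "\x{660}";
my $in_xposix   = $arabic_zero =~ /\p{XPosixDigit}/ ? 1 : 0;
my $in_posix    = $arabic_zero =~ /\p{PosixDigit}/  ? 1 : 0;

# NO-BREAK SPACE (U+00A0) is a Space_Separator; \P{...} negates a property.
my $nbsp_is_zs  = "\x{A0}" =~ /\p{Zs}/ ? 1 : 0;
my $a_is_not_zs = "A" =~ /\P{Zs}/ ? 1 : 0;

print "alef: long=$alef_long short=$alef_short\n";
print "U+0660: XPosixDigit=$in_xposix PosixDigit=$in_posix\n";
print "U+00A0 in Zs=$nbsp_is_zs, 'A' outside Zs=$a_is_not_zs\n";
```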

Legal \p{} and \P{} constructs that match no characters

Unicode has some property-value pairs that currently match nothing. Generally this is either because they are obsolete, or because they exist for symmetry with other forms but no language that uses them has been encoded yet. In this version of Unicode, the following match zero code points:

  • \p{Canonical_Combining_Class=Attached_Below_Left}
  • \p{Canonical_Combining_Class=CCC133}
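
A small sketch of what "legal but empty" means in practice: the construct compiles without error, but finds nothing. The sample string is a hypothetical one built here from a few Latin ranges and the combining diacritical marks; it is not an exhaustive check of every code point.

```perl
use strict;
use warnings;

# Both forms compile without error, although the first matches nothing.
my $empty = qr/\p{Canonical_Combining_Class=Attached_Below_Left}/;
my $full  = qr/\P{Canonical_Combining_Class=Attached_Below_Left}/;

# Hypothetical sample: Latin text ranges plus combining diacritical marks.
my $sample = join '', map { chr } 0x20 .. 0x7E, 0xA0 .. 0x24F, 0x300 .. 0x36F;

# Count the matches of each pattern over the sample.
my $empty_hits = () = $sample =~ /$empty/g;
my $full_hits  = () = $sample =~ /$full/g;

printf "\\p{...} matched %d of %d characters\n", $empty_hits, length $sample;
printf "\\P{...} matched %d of %d characters\n", $full_hits,  length $sample;
```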

Properties accessible through Unicode::UCD

For any single code point, the value of any Unicode character property mentioned above (not including Perl extensions) is available through charprop() in Unicode::UCD. charprops_all() in Unicode::UCD returns the values of all the Unicode properties for a given code point.
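
A minimal sketch of both calls, assuming a Perl recent enough to ship charprop() and charprops_all() in Unicode::UCD (they are believed to have been added in Perl 5.22). The code point and the two property names are arbitrary picks for illustration.

```perl
use strict;
use warnings;
use Unicode::UCD qw(charprop charprops_all);

my $cp = 0x41;    # LATIN CAPITAL LETTER A

# Value of one property for one code point.
my $script = charprop($cp, "Script");
my $age    = charprop($cp, "Age");

# All properties at once: a reference to a hash keyed by property name.
my $all = charprops_all($cp);

print "Script: $script\n";
print "Age:    $age\n";
printf "charprops_all() returned %d properties\n", scalar keys %$all;
```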

Besides these, all the Unicode character properties mentioned above (except for those marked as for internal use by Perl) are also accessible by prop_invlist() in Unicode::UCD.
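
The inversion list returned by prop_invlist() alternates between the first code point that is in the property and the first one after it that is not. The sketch below uses that to count matching code points; count_code_points() and $MAX_CP are helper names made up for this example.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invlist);

my $MAX_CP = 0x10FFFF;    # highest Unicode code point

# Count the Unicode code points matched by a binary property or a
# "property=value" pair, by walking its inversion list pairwise.
sub count_code_points {
    my ($property) = @_;
    my @invlist = prop_invlist($property);
    my $count   = 0;
    for (my $i = 0; $i < @invlist; $i += 2) {
        my $first = $invlist[$i];
        # An odd-length list means the final range runs to the end.
        my $last = $i + 1 < @invlist ? $invlist[$i + 1] - 1 : $MAX_CP;
        $count += $last - $first + 1;
    }
    return $count;
}

my $ahex_count = count_code_points("ASCII_Hex_Digit");
print "ASCII_Hex_Digit matches $ahex_count code points\n";
```

The same helper can be pointed at a "property=value" form such as "Word_Break=Hebrew_Letter"; the result then depends on the Unicode version the running perl was built with.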

Due to their nature, not all Unicode character properties are suitable for regular expression matches or for prop_invlist(). The remaining non-provisional, non-internal ones are accessible via prop_invmap() in Unicode::UCD (except for those that this Perl installation hasn't included; see below for which those are).

For compatibility with other parts of Perl, all the single forms given in the table in the section above are recognized. However, there are some ambiguities between Perl extensions and Unicode properties, all of which are silently resolved in favor of the official Unicode property. To avoid surprises, you should only use prop_invmap() for forms listed in the table below, which omits the non-recommended ones. The affected forms are the Perl single-form equivalents of Unicode properties. For example, \p{sc} is a single-form equivalent of \p{gc=sc}, but prop_invmap() treats sc as the Script property, whose short name is sc. The table indicates the current ambiguities in the INFO column, beginning with the word "NOT".
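
The ambiguity can be seen with a short sketch that asks prop_invmap() for "sc" and looks up two characters with search_invlist(), also from Unicode::UCD. The helper script_of() is a name invented for this example. If "sc" meant General_Category=Currency_Symbol, a dollar sign would be the interesting match; instead the returned values are script names.

```perl
use strict;
use warnings;
use Unicode::UCD qw(prop_invmap search_invlist);

# "sc" is resolved as the Script property, not as Gc=Currency_Symbol.
my ($ranges, $values, $format, $default) = prop_invmap("sc");

# Look up the map value for one code point (hypothetical helper).
sub script_of {
    my ($cp) = @_;
    my $i = search_invlist($ranges, $cp);
    return defined $i ? $values->[$i] : $default;
}

my $script_of_A      = script_of(ord "A");
my $script_of_dollar = script_of(ord '$');

print "format: $format\n";
print "A => $script_of_A\n";
print "\$ => $script_of_dollar\n";
```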

The standard Unicode properties listed below are documented in http://www.unicode.org/reports/tr44/; Perl_Decimal_Digit is documented in prop_invmap() in Unicode::UCD. The other Perl extensions are documented in Other Properties in perlunicode.

The first column in the table is a name for the property; the second column is an alternative name, if any, plus possibly some annotations. The alternative name is the property's full name, unless that would simply repeat the first column, in which case the second column indicates the property's short name (if different). The annotations are given only in the entry for the full name. If a property is obsolete, etc., the entry will be flagged with the same characters used in the table in the section above, like D or S.

  NAME INFO
  Age
  AHex ASCII_Hex_Digit
  All (Perl extension). All code points, including those above Unicode. Same as qr/./s
  Alnum XPosixAlnum. (Perl extension)
  Alpha Alphabetic
  Alphabetic (Short: Alpha)
  Any (Perl extension). All Unicode code points: [\x{0000}-\x{10FFFF}]
  ASCII Block=ASCII. (Perl extension). [[:ASCII:]]
  ASCII_Hex_Digit (Short: AHex)
  Assigned (Perl extension). All assigned code points
  Bc Bidi_Class
  Bidi_C Bidi_Control
  Bidi_Class (Short: bc)
  Bidi_Control (Short: Bidi_C)
  Bidi_M Bidi_Mirrored
  Bidi_Mirrored (Short: Bidi_M)
  Bidi_Mirroring_Glyph (Short: bmg)
  Bidi_Paired_Bracket (Short: bpb)
  Bidi_Paired_Bracket_Type (Short: bpt)
  Blank XPosixBlank. (Perl extension)
  Blk Block
  Block (Short: blk)
  Bmg Bidi_Mirroring_Glyph
  Bpb Bidi_Paired_Bracket
  Bpt Bidi_Paired_Bracket_Type
  Canonical_Combining_Class (Short: ccc)
  Case_Folding (Short: cf)
  Case_Ignorable (Short: CI)
  Cased
  Category General_Category
  Ccc Canonical_Combining_Class
  CE Composition_Exclusion
  Cf Case_Folding; NOT 'cf' meaning 'General_Category=Format'
  Changes_When_Casefolded (Short: CWCF)
  Changes_When_Casemapped (Short: CWCM)
  Changes_When_Lowercased (Short: CWL)
  Changes_When_NFKC_Casefolded (Short: CWKCF)
  Changes_When_Titlecased (Short: CWT)
  Changes_When_Uppercased (Short: CWU)
  CI Case_Ignorable
  Cntrl General_Category=XPosixCntrl. (Perl extension)
  Comp_Ex Full_Composition_Exclusion
  Composition_Exclusion (Short: CE)
  CWCF Changes_When_Casefolded
  CWCM Changes_When_Casemapped
  CWKCF Changes_When_NFKC_Casefolded
  CWL Changes_When_Lowercased
  CWT Changes_When_Titlecased
  CWU Changes_When_Uppercased
  Dash
  Decomposition_Mapping (Short: dm)
  Decomposition_Type (Short: dt)
  Default_Ignorable_Code_Point (Short: DI)
  Dep Deprecated
  Deprecated (Short: Dep)
  DI Default_Ignorable_Code_Point
  Dia Diacritic
  Diacritic (Short: Dia)
  Digit General_Category=XPosixDigit. (Perl extension)
  Dm Decomposition_Mapping
  Dt Decomposition_Type
  Ea East_Asian_Width
  East_Asian_Width (Short: ea)
  Ext Extender
  Extender (Short: Ext)
  Full_Composition_Exclusion (Short: Comp_Ex)
  Gc General_Category
  GCB Grapheme_Cluster_Break
  General_Category (Short: gc)
  Gr_Base Grapheme_Base
  Gr_Ext Grapheme_Extend
  Graph XPosixGraph. (Perl extension)
  Grapheme_Base (Short: Gr_Base)
  Grapheme_Cluster_Break (Short: GCB)
  Grapheme_Extend (Short: Gr_Ext)
  Hangul_Syllable_Type (Short: hst)
  Hex Hex_Digit
  Hex_Digit (Short: Hex)
  HorizSpace XPosixBlank. (Perl extension)
  Hst Hangul_Syllable_Type
  D Hyphen Supplanted by Line_Break property values; see www.unicode.org/reports/tr14
  ID_Continue (Short: IDC)
  ID_Start (Short: IDS)
  IDC ID_Continue
  Ideo Ideographic
  Ideographic (Short: Ideo)
  IDS ID_Start
  IDS_Binary_Operator (Short: IDSB)
  IDS_Trinary_Operator (Short: IDST)
  IDSB IDS_Binary_Operator
  IDST IDS_Trinary_Operator
  In Present_In. (Perl extension)
  Indic_Positional_Category (Short: InPC)
  Indic_Syllabic_Category (Short: InSC)
  InPC Indic_Positional_Category
  InSC Indic_Syllabic_Category
  Isc ISO_Comment; NOT 'isc' meaning 'General_Category=Other'
  ISO_Comment (Short: isc)
  Jg Joining_Group
  Join_C Join_Control
  Join_Control (Short: Join_C)
  Joining_Group (Short: jg)
  Joining_Type (Short: jt)
  Jt Joining_Type
  Lb Line_Break
  Lc Lowercase_Mapping; NOT 'lc' meaning 'General_Category=Cased_Letter'
  Line_Break (Short: lb)
  LOE Logical_Order_Exception
  Logical_Order_Exception (Short: LOE)
  Lower Lowercase
  Lowercase (Short: Lower)
  Lowercase_Mapping (Short: lc)
  Math
  Na Name
  Na1 Unicode_1_Name
  Name (Short: na)
  Name_Alias
  NChar Noncharacter_Code_Point
  NFC_QC NFC_Quick_Check
  NFC_Quick_Check (Short: NFC_QC)
  NFD_QC NFD_Quick_Check
  NFD_Quick_Check (Short: NFD_QC)
  NFKC_Casefold (Short: NFKC_CF)
  NFKC_CF NFKC_Casefold
  NFKC_QC NFKC_Quick_Check
  NFKC_Quick_Check (Short: NFKC_QC)
  NFKD_QC NFKD_Quick_Check
  NFKD_Quick_Check (Short: NFKD_QC)
  Noncharacter_Code_Point (Short: NChar)
  Nt Numeric_Type
  Numeric_Type (Short: nt)
  Numeric_Value (Short: nv)
  Nv Numeric_Value
  Pat_Syn Pattern_Syntax
  Pat_WS Pattern_White_Space
  Pattern_Syntax (Short: Pat_Syn)
  Pattern_White_Space (Short: Pat_WS)
  PCM Prepended_Concatenation_Mark
  Perl_Decimal_Digit (Perl extension)
  PerlSpace PosixSpace. (Perl extension)
  PerlWord PosixWord. (Perl extension)
  PosixAlnum (Perl extension). [A-Za-z0-9]
  PosixAlpha (Perl extension). [A-Za-z]
  PosixBlank (Perl extension). \t and ' '
  PosixCntrl (Perl extension). ASCII control characters: NUL, SOH, STX, ETX, EOT, ENQ, ACK, BEL, BS, HT, LF, VT, FF, CR, SO, SI, DLE, DC1, DC2, DC3, DC4, NAK, SYN, ETB, CAN, EOM, SUB, ESC, FS, GS, RS, US, and DEL
  PosixDigit (Perl extension). [0-9]
  PosixGraph (Perl extension). [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~0-9A-Za-z]
  PosixLower (Perl extension). [a-z]
  PosixPrint (Perl extension). [- 0-9A-Za-z!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
  PosixPunct (Perl extension). [-!"#$%&'()*+,./:;<=>?@[\\]^_`{|}~]
  PosixSpace (Perl extension). \t, \n, \cK, \f, \r, and ' '. (\cK is vertical tab)
  PosixUpper (Perl extension). [A-Z]
  PosixWord (Perl extension). \w, restricted to ASCII = [A-Za-z0-9_]
  PosixXDigit (Perl extension). [0-9A-Fa-f]
  Prepended_Concatenation_Mark (Short: PCM)
  Present_In (Short: In). (Perl extension)
  Print XPosixPrint. (Perl extension)
  Punct General_Category=Punct. (Perl extension)
  QMark Quotation_Mark
  Quotation_Mark (Short: QMark)
  Radical
  SB Sentence_Break
  Sc Script; NOT 'sc' meaning 'General_Category=Currency_Symbol'
  Scf Simple_Case_Folding
  Script (Short: sc)
  Script_Extensions (Short: scx)
  Scx Script_Extensions
  SD Soft_Dotted
  Sentence_Break (Short: SB)
  Sentence_Terminal (Short: STerm)
  Sfc Simple_Case_Folding
  Simple_Case_Folding (Short: scf)
  Simple_Lowercase_Mapping (Short: slc)
  Simple_Titlecase_Mapping (Short: stc)
  Simple_Uppercase_Mapping (Short: suc)
  Slc Simple_Lowercase_Mapping
  Soft_Dotted (Short: SD)
  Space White_Space
  SpacePerl XPosixSpace. (Perl extension)
  Stc Simple_Titlecase_Mapping
  STerm Sentence_Terminal
  Suc Simple_Uppercase_Mapping
  Tc Titlecase_Mapping
  Term Terminal_Punctuation
  Terminal_Punctuation (Short: Term)
  Title Titlecase. (Perl extension)
  Titlecase (Short: Title). (Perl extension). (= \p{Gc=Lt})
  Titlecase_Mapping (Short: tc)
  Uc Uppercase_Mapping
  UIdeo Unified_Ideograph
  Unicode Any. (Perl extension)
  Unicode_1_Name (Short: na1)
  Unified_Ideograph (Short: UIdeo)
  Upper Uppercase
  Uppercase (Short: Upper)
  Uppercase_Mapping (Short: uc)
  Variation_Selector (Short: VS)
  VertSpace (Perl extension). \v
  VS Variation_Selector
  WB Word_Break
  White_Space (Short: WSpace)
  Word XPosixWord. (Perl extension)
  Word_Break (Short: WB)
  WSpace White_Space
  XDigit XPosixXDigit. (Perl extension)
  XID_Continue (Short: XIDC)
  XID_Start (Short: XIDS)
  XIDC XID_Continue
  XIDS XID_Start
  XPerlSpace XPosixSpace. (Perl extension)
  XPosixAlnum (Short: Alnum). (Perl extension). Alphabetic and (decimal) Numeric
  XPosixAlpha (Perl extension)
  XPosixBlank (Short: Blank). (Perl extension). \h, Horizontal white space
  XPosixCntrl General_Category=XPosixCntrl (Short: Cntrl). (Perl extension). Control characters
  XPosixDigit General_Category=XPosixDigit (Short: Digit). (Perl extension). [0-9] + all other decimal digits
  XPosixGraph (Short: Graph). (Perl extension). Characters that are graphical
  XPosixLower (Perl extension)
  XPosixPrint (Short: Print). (Perl extension). Characters that are graphical plus space characters (but no controls)
  XPosixPunct (Perl extension). \p{Punct} + ASCII-range \p{Symbol}
  XPosixSpace (Perl extension). \s including beyond ASCII and vertical tab
  XPosixUpper (Perl extension)
  XPosixWord (Short: Word). (Perl extension). \w, including beyond ASCII; = \p{Alnum} + \pM + \p{Pc}
  XPosixXDigit (Short: XDigit). (Perl extension)

Properties accessible through other means

Certain properties are accessible also via core function calls. These are:

  Lowercase_Mapping lc() and lcfirst()
  Titlecase_Mapping ucfirst()
  Uppercase_Mapping uc()

Also, Case_Folding is accessible through the /i modifier in regular expressions, the \F case-modification escape in double-quotish strings, and the fc operator.
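
The following sketch exercises these core functions on characters where the mappings differ visibly. It assumes Perl 5.16 or later for fc() and \F, and enables the unicode_strings feature so that the sharp s, which fits in a single byte, is handled with Unicode rules. The sample characters are picked for illustration.

```perl
use strict;
use warnings;
use feature qw(fc unicode_strings);    # fc() needs Perl 5.16 or later

my $small_dz = "\x{1C6}";    # LATIN SMALL LETTER DZ WITH CARON

my $upper  = uc $small_dz;         # Uppercase_Mapping
my $title  = ucfirst $small_dz;    # Titlecase_Mapping (differs from upper)
my $lower  = lc "\x{1C4}";         # Lowercase_Mapping
my $folded = fc "\x{DF}";          # Case_Folding of LATIN SMALL LETTER SHARP S
my $via_F  = "\F\x{DF}";           # the same folding via the \F escape

# Case_Folding is also what makes /i matching work.
my $ci_match = "\x{1C4}" =~ /\A\x{1C6}\z/i ? 1 : 0;

printf "uc: U+%04X, ucfirst: U+%04X, lc: U+%04X\n",
    ord $upper, ord $title, ord $lower;
print "fc: $folded, \\F: $via_F, /i match: $ci_match\n";
```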

And, the Name and Name_Alias properties are accessible through the \N{} interpolation in double-quoted strings and regular expressions, and through the functions charnames::viacode(), charnames::vianame(), and charnames::string_vianame() (which require a use charnames (); to be specified).
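
A short sketch of the name lookups in both directions. The character used, WHITE SMILING FACE (U+263A), is an arbitrary choice for illustration.

```perl
use strict;
use warnings;
use charnames ();    # makes the charnames:: functions available

my $name   = charnames::viacode(0x263A);                        # code point to name
my $code   = charnames::vianame("WHITE SMILING FACE");          # name to code point
my $string = charnames::string_vianame("WHITE SMILING FACE");   # name to string
my $interp = "\N{WHITE SMILING FACE}";                           # name in a string literal

print "viacode:        $name\n";
printf "vianame:        U+%04X\n", $code;
printf "string_vianame: U+%04X\n", ord $string;
printf "\\N{}:           U+%04X\n", ord $interp;
```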

Finally, most properties related to decomposition are accessible via Unicode::Normalize.
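
For instance, canonical decomposition and recomposition can be sketched with NFD() and NFC() from Unicode::Normalize. The sample character is chosen here for illustration.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD NFC);

my $composed   = "\x{E9}";            # LATIN SMALL LETTER E WITH ACUTE
my $decomposed = NFD($composed);      # base letter followed by combining acute
my $recomposed = NFC($decomposed);    # back to the single code point

printf "NFD: %s\n", join " ", map { sprintf "U+%04X", ord } split //, $decomposed;
printf "NFC: %s\n", join " ", map { sprintf "U+%04X", ord } split //, $recomposed;
```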

Unicode character properties that are NOT accepted by Perl

Perl will generate an error for a few character properties in Unicode when used in a regular expression. The non-Unihan ones are listed below, with the reasons they are not accepted, perhaps with work-arounds. The short names for the properties are listed enclosed in (parentheses). As described after the list, an installation can change the defaults and choose to accept any of these. The list is machine generated based on the choices made for the installation that generated this document.

  • Expands_On_NFC (XO_NFC)
  • Expands_On_NFD (XO_NFD)
  • Expands_On_NFKC (XO_NFKC)
  • Expands_On_NFKD (XO_NFKD)

    Deprecated by Unicode. These are characters that expand to more than one character in the specified normalization form, but whether they actually take up more bytes or not depends on the encoding being used. For example, a UTF-8 encoded character may expand to a different number of bytes than a UTF-32 encoded character.

  • Grapheme_Link (Gr_Link)

    Deprecated by Unicode: Duplicates ccc=vr (Canonical_Combining_Class=Virama)

  • Jamo_Short_Name (JSN)
  • Other_Alphabetic (OAlpha)
  • Other_Default_Ignorable_Code_Point (ODI)
  • Other_Grapheme_Extend (OGr_Ext)
  • Other_ID_Continue (OIDC)
  • Other_ID_Start (OIDS)
  • Other_Lowercase (OLower)
  • Other_Math (OMath)
  • Other_Uppercase (OUpper)

    Used by Unicode internally for generating other properties and not intended to be used stand-alone

  • Script=Katakana_Or_Hiragana (sc=Hrkt)

    Obsolete. All code points previously matched by this have been moved to "Script=Common". Consider instead using "Script_Extensions=Katakana" or "Script_Extensions=Hiragana" (or both)

  • Script_Extensions=Katakana_Or_Hiragana (scx=Hrkt)

    All code points that would be matched by this are matched by either "Script_Extensions=Katakana" or "Script_Extensions=Hiragana"
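
Whether a given installation accepts one of these properties can be probed at run time, since the failure is an ordinary exception. The sketch below uses a made-up helper, property_accepted(), and also shows the suggested work-around for Katakana_Or_Hiragana. The result for Other_Alphabetic depends on how the running perl was built.

```perl
use strict;
use warnings;

# Return 1 if this perl accepts \p{$property}, 0 if it raises an error.
# (property_accepted is a hypothetical helper for this example.)
sub property_accepted {
    my ($property) = @_;
    my $pattern = '\p{' . $property . '}';
    return eval { my $re = qr/$pattern/; "a" =~ $re; 1 } ? 1 : 0;
}

for my $property (qw(Alphabetic Other_Alphabetic)) {
    printf "%-18s %s\n", $property,
        property_accepted($property) ? "accepted" : "rejected";
}

# Work-around for the rejected Script_Extensions=Katakana_Or_Hiragana.
my $kana = qr/[\p{Script_Extensions=Katakana}\p{Script_Extensions=Hiragana}]/;
my $hiragana_a_ok = "\x{3042}" =~ $kana ? 1 : 0;    # HIRAGANA LETTER A
my $katakana_a_ok = "\x{30A2}" =~ $kana ? 1 : 0;    # KATAKANA LETTER A
print "kana class matches: $hiragana_a_ok $katakana_a_ok\n";
```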

An installation can choose to allow any of these to be matched by downloading the Unicode database from http://www.unicode.org/Public/ to $Config{privlib}/unicore/ in the Perl source tree, changing the controlling lists contained in the program $Config{privlib}/unicore/mktables, and then recompiling and installing. (%Config is available from the Config module.)

Also, perl can be recompiled to operate on an earlier version of the Unicode standard. Further information is at $Config{privlib}/unicore/README.perl.
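
To see where these files live for the running perl, and which Unicode version it was built with, something like the following sketch can be used. It only reports locations; it does not modify or rebuild anything.

```perl
use strict;
use warnings;
use Config;
use File::Spec;
use Unicode::UCD ();

# Directory holding perl's Unicode tables, per %Config.
my $unicore = File::Spec->catdir($Config{privlib}, "unicore");
my $readme  = File::Spec->catfile($unicore, "README.perl");

# Version of the Unicode Character Database this perl uses.
my $unicode_version = Unicode::UCD::UnicodeVersion();

print "unicore directory: $unicore\n";
print "README.perl:       $readme\n";
print "Unicode version:   $unicode_version\n";
```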

Other information in the Unicode data base

The Unicode data base is delivered in two different formats. The XML version is valid for more modern Unicode releases. The other version is a collection of files. The two are intended to give equivalent information. Perl uses the older form; this allows you to recompile Perl to use early Unicode releases.

The only non-character property that Perl currently supports is Named Sequences, in which a sequence of code points is given a name and generally treated as a single entity. (Perl supports these via the \N{...} double-quotish construct, charnames::string_vianame(name) in charnames, and namedseq() in Unicode::UCD.)
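
A sketch of the three access paths, using KATAKANA LETTER AINU P as the sample named sequence (an illustrative pick from Unicode's list of named sequences).

```perl
use strict;
use warnings;
use charnames ();
use Unicode::UCD qw(namedseq);

my $name = "KATAKANA LETTER AINU P";

my $via_ucd       = namedseq($name);                     # scalar context: the string
my @code_points   = namedseq($name);                     # list context: the ordinals
my $via_charnames = charnames::string_vianame($name);
my $via_literal   = "\N{KATAKANA LETTER AINU P}";

printf "%s = %s\n", $name,
    join " ", map { sprintf "U+%04X", $_ } @code_points;
printf "all three agree: %s\n",
    ($via_ucd eq $via_charnames && $via_ucd eq $via_literal) ? "yes" : "no";
```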

Below is a list of the files in the Unicode data base that Perl doesn't currently use, along with very brief descriptions of their purposes. Some of the names of the files have been shortened from those that Unicode uses, in order to allow them to be distinguishable from similarly named files on file systems for which only the first 8 characters of a name are significant.

  • auxiliary/GraphemeBreakTest.html
  • auxiliary/LineBreakTest.html
  • auxiliary/SentenceBreakTest.html
  • auxiliary/WordBreakTest.html

    Documentation of validation Tests

  • BidiCharacterTest.txt
  • BidiTest.txt
  • NormTest.txt

    Validation Tests

  • CJKRadicals.txt

    Maps the kRSUnicode property values to corresponding code points

  • EmojiSources.txt

    Maps certain Unicode code points to their legacy Japanese cell-phone values

  • Index.txt

    Alphabetical index of Unicode characters

  • NamedSqProv.txt

    Named sequences proposed for inclusion in a later version of the Unicode Standard; if you need them now, you can append this file to NamedSequences.txt and recompile perl

  • NamesList.html

    Describes the format and contents of NamesList.txt

  • NamesList.txt

    Annotated list of characters

  • NormalizationCorrections.txt

    Documentation of corrections already incorporated into the Unicode data base

  • ReadMe.txt

    Documentation

  • StandardizedVariants.html

    Obsoleted as of Unicode 9.0, but previously provided a visual display of the standard variant sequences derived from StandardizedVariants.txt.

  • StandardizedVariants.txt

    Certain glyph variations for character display are standardized. This lists the non-Unihan ones; the Unihan ones are also not used by Perl, and are in a separate Unicode data base, http://www.unicode.org/ivd

  • TangutSources.txt

    Specifies source mappings for Tangut ideographs and components. This data file also includes informative radical-stroke values that are used internally by Unicode

  • USourceData.txt

    Documentation of status and cross reference of proposals for encoding by Unicode of Unihan characters

  • USourceGlyphs.pdf

    Pictures of the characters in USourceData.txt

SEE ALSO

http://www.unicode.org/reports/tr44/

perlrecharclass

perlunicode