Module Uucp
Unicode character properties.
Uucp provides efficient access to a selection of character properties of the Unicode character database.
Consult this page for a minimal Unicode introduction and OCaml Unicode tips. Individual modules have sample code related to the properties.
References.
- The Unicode FAQ.
- The Unicode Consortium. The Unicode Standard. (latest version)
- Mark Davis, Ken Whistler. UAX #44 Unicode Character Database. (latest version)
Properties
Consult information about the property distribution in modules and omissions.
module Age : sig ... endAge property.
module Alpha : sig ... endAlphabetic property.
module Block : sig ... endBlock property and block ranges.
module Break : sig ... endBreak properties.
module Case : sig ... endCase properties, mappings and foldings.
module Cjk : sig ... endCJK properties.
module Emoji : sig ... endEmoji properties.
module Func : sig ... endFunction and graphics properties.
module Gc : sig ... endGeneral category property.
module Gen : sig ... endGeneral properties.
module Hangul : sig ... endHangul properties.
module Id : sig ... endIdentifier properties.
module Name : sig ... endName and name alias properties.
module Num : sig ... endNumeric properties.
module Script : sig ... endScript and script extensions properties.
module White : sig ... endWhite space property.
Property module distribution and omissions
Properties are approximatively distributed in modules by scope of use like in this property index table. However some subset of properties live in their own modules.
Obsolete and deprecated properties are omitted. So are those related to normalization, shaping and bidirectionality. Here is the full list of omitted properties, if you think one of these property should be added get in touch with a rationale.
- Case. Simple_Lowercase_Mapping, Simple_Uppercase_Mapping, Simple_Titlecase_Mapping, Simple_Case_folding, Changes_When_Lowercased, Changes_When_Uppercased, Changes_When_Titlecased, Changes_When_Casefolded, Changes_When_Casemapped, Changes_When_NFKC_Casefolded.
- Normalization. All properties under that section name in this table.
- Shaping and rendering. Joining_Group, Joining_Type, Vertical_Orientation, Indic_Syllabic_Category, Indic_Positional_Category, Prepended_Concatenation_Mark
- Bidirectional. All properties under that section name in this table.
- CJK. Unicode_Radical_Stroke, Equivalent_Unified_Ideograph and all the properties of the Unicode HAN Database.
- Miscellaneous. STerm.
- Contributory properties. All properties under that section in this table.