Module Stdlib.Uchar

Unicode characters.

type t

The type for Unicode characters.

A value of this type represents a Unicode scalar value, that is, an integer in the range 0x0000...0xD7FF or 0xE000...0x10FFFF.

val min : Stdlib.Uchar.t @@ portable

min is U+0000.

val max : Stdlib.Uchar.t @@ portable

max is U+10FFFF.

val bom : Stdlib.Uchar.t @@ portable

bom is U+FEFF, the byte order mark (BOM) character.

  • since 4.06

val rep : Stdlib.Uchar.t @@ portable

rep is U+FFFD, the replacement character.

  • since 4.06
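
As a quick illustration of the four predefined constants above, the following sketch prints each one as a code point. The list name `constants` is introduced here for the example only.

```ocaml
(* The four predefined scalar values, paired with their names. *)
let constants =
  [ "min", Uchar.min; "max", Uchar.max; "bom", Uchar.bom; "rep", Uchar.rep ]

let () =
  (* Print each constant in the usual U+XXXX notation. *)
  List.iter
    (fun (name, u) -> Printf.printf "%s = U+%04X\n" name (Uchar.to_int u))
    constants
```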

val succ : Stdlib.Uchar.t -> Stdlib.Uchar.t @@ portable

succ u is the scalar value after u in the set of Unicode scalar values.

  • raises Invalid_argument if u is max.

val pred : Stdlib.Uchar.t -> Stdlib.Uchar.t @@ portable

pred u is the scalar value before u in the set of Unicode scalar values.

  • raises Invalid_argument if u is min.
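
Because succ and pred move within the set of scalar values, they step over the surrogate range 0xD800...0xDFFF rather than entering it. The sketch below checks this. It also shows `succ_opt`, a hypothetical helper (not part of the module) that avoids the Invalid_argument exception succ raises on max.

```ocaml
(* The two scalar values on either side of the surrogate gap. *)
let last_before_gap = Uchar.of_int 0xD7FF
let first_after_gap = Uchar.of_int 0xE000

let () =
  (* succ and pred skip 0xD800...0xDFFF, which are not scalar values. *)
  assert (Uchar.equal (Uchar.succ last_before_gap) first_after_gap);
  assert (Uchar.equal (Uchar.pred first_after_gap) last_before_gap)

(* Hypothetical helper: like succ, but returns None at the upper bound
   instead of raising Invalid_argument. *)
let succ_opt u =
  if Uchar.equal u Uchar.max then None else Some (Uchar.succ u)
```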

val is_valid : int -> bool @@ portable

is_valid n is true if and only if n is a Unicode scalar value (i.e. in the ranges 0x0000...0xD7FF or 0xE000...0x10FFFF).
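
To make the two ranges concrete, this sketch applies is_valid to integers at and around each boundary. The names `samples` and `validity` are chosen for the example.

```ocaml
(* Integers at and around the boundaries of the scalar value ranges. *)
let samples =
  [ 0x0041; 0xD7FF; 0xD800; 0xDFFF; 0xE000; 0x10FFFF; 0x110000; -1 ]

(* Surrogates (0xD800...0xDFFF), negative integers and anything above
   0x10FFFF are rejected. *)
let validity = List.map Uchar.is_valid samples
(* validity = [true; true; false; false; true; true; false; false] *)
```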

val of_int : int -> Stdlib.Uchar.t @@ portable

of_int i is i as a Unicode character.

  • raises Invalid_argument if i does not satisfy is_valid.

val to_int : Stdlib.Uchar.t -> int @@ portable

to_int u is u as an integer.
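
Since of_int raises Invalid_argument on integers that are not scalar values, untrusted input is usually checked with is_valid first. A minimal sketch, where `uchar_of_int_opt` is a hypothetical helper and not part of the module:

```ocaml
(* Hypothetical helper: convert only when the integer is a scalar value. *)
let uchar_of_int_opt n =
  if Uchar.is_valid n then Some (Uchar.of_int n) else None

let () =
  (* to_int is the inverse of of_int on valid input. *)
  assert (Option.map Uchar.to_int (uchar_of_int_opt 0x1F600) = Some 0x1F600);
  (* A surrogate is rejected instead of raising. *)
  assert (uchar_of_int_opt 0xD800 = None)
```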

val is_char : Stdlib.Uchar.t -> bool @@ portable

is_char u is true if and only if u is a Latin-1 OCaml character.

val of_char : char -> Stdlib.Uchar.t @@ portable

of_char c is c as a Unicode character.

val to_char : Stdlib.Uchar.t -> char @@ portable

to_char u is u as an OCaml Latin-1 character.

  • raises Invalid_argument if u does not satisfy is_char.
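
The three functions above can be combined into a total conversion. In this sketch, `char_of_uchar_opt` is a hypothetical helper that narrows to char only when is_char holds, which avoids the Invalid_argument exception to_char raises on larger values.

```ocaml
(* Hypothetical helper: narrow to char only inside the Latin-1 range. *)
let char_of_uchar_opt u =
  if Uchar.is_char u then Some (Uchar.to_char u) else None

let () =
  (* of_char followed by to_char is the identity on chars. *)
  assert (char_of_uchar_opt (Uchar.of_char 'A') = Some 'A');
  (* U+00FF is the last Latin-1 character; U+0100 is out of range. *)
  assert (char_of_uchar_opt (Uchar.of_int 0xFF) = Some '\xFF');
  assert (char_of_uchar_opt (Uchar.of_int 0x100) = None)
```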

val equal : Stdlib.Uchar.t -> Stdlib.Uchar.t -> bool @@ portable

equal u u' is u = u'.

val compare : Stdlib.Uchar.t -> Stdlib.Uchar.t -> int @@ portable

compare u u' is Stdlib.compare u u'.

val hash : Stdlib.Uchar.t -> int @@ portable

hash u associates a non-negative integer with u.
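
Having t and compare lets the module be passed directly to the standard Map.Make functor. The following sketch builds a small map keyed by Unicode characters. The names `UMap` and `names` are introduced for the example only.

```ocaml
(* Uchar provides type t and compare, so it can serve as the key module. *)
module UMap = Map.Make (Uchar)

let names =
  UMap.empty
  |> UMap.add Uchar.rep "replacement character"
  |> UMap.add Uchar.bom "byte order mark"

let () =
  (* Bindings are visited in the order given by Uchar.compare. *)
  UMap.iter
    (fun u name -> Printf.printf "U+%04X %s\n" (Uchar.to_int u) name)
    names
```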

UTF codec tools

type utf_decode

The type for UTF decode results. Values of this type represent the result of a Unicode Transformation Format decoding attempt.

val utf_decode_is_valid : Stdlib.Uchar.utf_decode -> bool @@ portable

utf_decode_is_valid d is true if and only if d holds a valid decode.

val utf_decode_uchar : Stdlib.Uchar.utf_decode -> Stdlib.Uchar.t @@ portable

utf_decode_uchar d is the Unicode character decoded by d if utf_decode_is_valid d is true and Uchar.rep otherwise.

val utf_decode_length : Stdlib.Uchar.utf_decode -> int @@ portable

utf_decode_length d is the number of elements from the source that were consumed by the decode d. This is always strictly positive and less than or equal to 4. The kind of source elements depends on the actual decoder; for the decoders of the standard library this function always returns a length in bytes.
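
The three accessors above are typically used together when walking a string. The sketch below assumes a standard library that provides String.get_utf_8_uchar (OCaml 4.14 or later). `describe` and `utf_8_length` are hypothetical helpers written for this example.

```ocaml
(* Inspect the decode found at the start of a string (assumed non-empty):
   validity, decoded scalar value, and number of bytes consumed. *)
let describe s =
  let d = String.get_utf_8_uchar s 0 in
  ( Uchar.utf_decode_is_valid d,
    Uchar.to_int (Uchar.utf_decode_uchar d),
    Uchar.utf_decode_length d )

(* Count decodes in a UTF-8 string. utf_decode_length is always at least 1,
   so the index always advances, even over malformed input. *)
let utf_8_length s =
  let rec loop i count =
    if i >= String.length s then count
    else
      let d = String.get_utf_8_uchar s i in
      loop (i + Uchar.utf_decode_length d) (count + 1)
  in
  loop 0 0
```

With these definitions, `describe "\xC3\xA9"` is `(true, 0xE9, 2)`, while `describe "\xFF"` is `(false, 0xFFFD, 1)` because an invalid decode yields Uchar.rep.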

val utf_decode : int -> Stdlib.Uchar.t -> Stdlib.Uchar.utf_decode @@ portable

utf_decode n u is a valid UTF decode for u that consumed n elements from the source for decoding. n must be positive and less than or equal to 4 (this is not checked by the module).

val utf_decode_invalid : int -> Stdlib.Uchar.utf_decode @@ portable

utf_decode_invalid n is an invalid UTF decode that consumed n elements from the source before reporting an error. n must be positive and less than or equal to 4 (this is not checked by the module). The resulting decode has rep as the decoded Unicode character.
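
These two constructors are meant for code that implements its own decoder. As a rough sketch, `decode_latin1` below is a hypothetical decoder for a source of single Latin-1 bytes, and `bad` stands for an error reported by some hypothetical decoder after two elements.

```ocaml
(* Hypothetical decoder: every Latin-1 byte maps to one scalar value
   and consumes exactly one source element. *)
let decode_latin1 s i = Uchar.utf_decode 1 (Uchar.of_char s.[i])

(* An invalid decode that consumed two source elements. *)
let bad = Uchar.utf_decode_invalid 2

let () =
  let d = decode_latin1 "\xE9" 0 in
  assert (Uchar.utf_decode_is_valid d);
  assert (Uchar.to_int (Uchar.utf_decode_uchar d) = 0xE9);
  (* An invalid decode still carries a length and decodes to Uchar.rep. *)
  assert (not (Uchar.utf_decode_is_valid bad));
  assert (Uchar.equal (Uchar.utf_decode_uchar bad) Uchar.rep)
```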

val utf_8_byte_length : Stdlib.Uchar.t -> int @@ portable

utf_8_byte_length u is the number of bytes needed to encode u in UTF-8.

val utf_16_byte_length : Stdlib.Uchar.t -> int @@ portable

utf_16_byte_length u is the number of bytes needed to encode u in UTF-16.
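
To show how the two lengths vary across the code space, this sketch tabulates them for a few scalar values and cross-checks the UTF-8 figure against what Buffer.add_utf_8_uchar writes. The names `sizes` and `encoded_utf_8_size` are introduced for the example.

```ocaml
(* Byte lengths of a few scalar values in both encodings. *)
let sizes =
  List.map
    (fun n ->
      let u = Uchar.of_int n in
      (n, Uchar.utf_8_byte_length u, Uchar.utf_16_byte_length u))
    [ 0x41; 0xE9; 0x20AC; 0x1F600 ]

(* Cross-check: encode into a buffer and measure the result. *)
let encoded_utf_8_size u =
  let b = Buffer.create 4 in
  Buffer.add_utf_8_uchar b u;
  Buffer.length b

let () =
  List.iter
    (fun (n, utf8, utf16) ->
      Printf.printf "U+%04X: %d byte(s) in UTF-8, %d in UTF-16\n" n utf8 utf16)
    sizes
```

Scalar values above U+FFFF take 4 bytes in both encodings, since UTF-16 represents them as a surrogate pair.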