Library
Module
Module type
Parameter
Class
Class type
Low level interface.
This interface may be useful for table based implementers of segmenters. For each kind of break, property values are assigned integer values starting from 0
. An array allows to recover the high-level representation of the corresponding property value.
Warning. Do not mutate these array.
val line : Uchar.t -> int
line u
is an integer that can be used with line_of_int
.
line_max
is the maximal value returned by line
.
val line_of_int : line array
line_of_int.(i)
is the line break property value corresponding to i
.
val grapheme_cluster : Uchar.t -> int
grapheme_cluster u
is an integer that can be used with grapheme_cluster_of_int
.
grapheme_cluster_max
is the maximal value returned by grapheme_cluster
.
val grapheme_cluster_of_int : grapheme_cluster array
grapheme_cluster_of_int.(i)
is the grapheme cluster break property value corresponding to i
.
val word : Uchar.t -> int
word u
is an integer that can be used with word_of_int
.
word_max
is the maximal value returned by word
.
val word_of_int : word array
word_of_int.(i)
is the word break property value corresponding to i
.
val sentence : Uchar.t -> int
sentence u
is an integer that can be used with sentence_of_int
.
sentence_max
is the maximal value returned by sentence
.
val sentence_of_int : sentence array
sentence_of_int.(i)
is the sentence break property value corresponding to i
.
val indic_conjunct_break : Uchar.t -> int
indic_conjunct_break u
is an integer that can be used with indic_conjunct_break_of_int
.
indic_conjunct_break_max
is the maximal value returned by indic_conjunct_break_of_int
.
val indic_conjunct_break_of_int : indic_conjunct_break array
indic_conjunct_break.(i)
is the Indic conjunct break property value corresponding to i
.