- Unicode 15.1.0 support.
- Require OCaml 4.14.0.
- Use module aliases for the property modules. Only pay for the modules you use (#2).
- Use the standard library UTF decoders in the sample code and in
ucharinfo
(#23). - The
Num.numeric_value
had to be changed to accomodate for the data. It now returns either NaN or a list of numbers. This is due to the interpretation of U+5146 and U+79ED which is locale dependent and thus can represent multiple values. In all other cases you should get singelton lists so far. - Rename
Uucd.Cjk.ids_bin_op
toUucd.Cjk.ids_binary_operator
. - Rename
Uucd.Cjk.ids_tri_op
toUccd.Cjk.ids_trinary_operator
. - Add
Uucd.Cjk.ids_unary_operator
, support for the newIDS_Unary_Operator
property. - Add
Uucd.Id.is_id_compat_math_{start,continue}
, support for the newID_Compat_Math_{Start,Continue}
properties. - Add
Uucd.Case.Nfkc_simple_fold.fold
, support for the newNFKC_Simple_Casefold
property. - Add
Uucd.Break.indic_conjunct_break
, support for the newIndic_Conjunct_Break
property.
- Unicode 15.0.0 support.
- Tool
ucharinfo
: allow to query more than one character at a time. - Tool
ucharinfo
: tool allow to search characters by name.
- Unicode 14.0.0 support.
- Tweak
Uucp.Break.tty_width_hint
(09d2186). Thanks to David Kaloper Meršinjak.
- Unicode 13.0.0 support. Adds the
Emoji
module with the new emoji properties. - Reduce data size by improving structure sharing. Thanks to David Kaloper Meršinjak for the help.
- Handle
Pervasives
deprecation. - Require OCaml >= 4.03.0
- Unicode 12.0.0 support.
- Unicode 11.0.0 support.
- Add support for the Join_Control property (
Uucp.Func.is_join_control
) and the Hangul_Syllable_Type property (Uucp.Hangul.syllable_type
).
- Fix wrong build dependencies of ucharinfo. Thanks to Andreas Hauptmann for the report.
- Unicode 10.0.0 support.
- Add ucharinfo tool to query Unicode character information on the cli.
- OCaml 4.05 compatibility (removal of
Uchar.dump
)
- Unicode 9.0.0 support.
- OCaml standard library
Uchar.t
support.- Removes and substitutes
type Uucp.uchar = int
by the (abstract)Uchar.t
type.Uchar.{of,to}_int
allows to recover the previous representation. - Removes the
Uucp.Uchar
module, corresponding functionality can be found inUchar
.
- Removes and substitutes
- Safe string support.
- Build depend on topkg.
- Relicense from BSD3 to ISC.
- Add support for the East Asian width property (
Uucp.Break.east_asian_width
). - Add the non-normative, heuristic function
Uucp.Break.tty_width_hint
. Thanks to David Kaloper for the contribution.
- Updated for Unicode 8.0.0 Incompatible release, new variants cases are introduced, see commit adbb5efc036 for details.
- Add access to the
Line_break
,Grapheme_cluster_break
,Word_break
andSentence_break
Unicode properties. See theUucp.Break
module. - Improvements and fixes to the minimal Unicode Introduction.
First release. Part of the work was sponsored by OCaml Labs.