unicode-categories is a simple crate that adds extension methods to the `char` type for querying whether a character belongs to a particular Unicode general category.
```
use unicode_categories::UnicodeCategories;

// Each method tests membership in one Unicode general category.
assert!('a'.is_letter_lowercase());
assert!('A'.is_letter_uppercase());
assert!('\n'.is_other_control());
```
This crate can be obtained from crates.io and used as a dependency like any other crate.
The script in `scripts/unicode.py` reads a file named `UnicodeData.txt`, which is assumed to be in the current working directory. This file can be obtained from ftp://ftp.unicode.org . The script outputs a Rust file to standard out, which can be piped to the desired location.
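
To give a feel for the data the script works from, here is a minimal sketch in Rust of reading `UnicodeData.txt`-style records. It is not the logic of `scripts/unicode.py`, and the names `parse_record` and `group_by_category` are hypothetical helpers that are not part of this crate. It relies only on the file's documented layout: fields are separated by `;`, field 0 is the code point in hexadecimal, and field 2 is the two-letter general category. The sketch does not expand the `First>`/`Last>` range pairs that the real file uses for large blocks.

```rust
use std::collections::BTreeMap;

/// Parse one `UnicodeData.txt` record into (code point, general category).
/// Field 0 is the code point in hex; field 2 is the category, e.g. "Lu" or "Cc".
/// Returns `None` for lines that do not have that shape.
fn parse_record(line: &str) -> Option<(u32, &str)> {
    let mut fields = line.split(';');
    let code_point = u32::from_str_radix(fields.next()?.trim(), 16).ok()?;
    let _name = fields.next()?; // field 1: character name, unused here
    let category = fields.next()?.trim();
    Some((code_point, category))
}

/// Group code points by general category (hypothetical helper for illustration).
/// Malformed lines are skipped; range pairs are NOT expanded.
fn group_by_category(data: &str) -> BTreeMap<String, Vec<u32>> {
    let mut table: BTreeMap<String, Vec<u32>> = BTreeMap::new();
    for (code_point, category) in data.lines().filter_map(|line| parse_record(line)) {
        table.entry(category.to_string()).or_default().push(code_point);
    }
    table
}
```

Calling `group_by_category` on the contents of `UnicodeData.txt` yields one entry per category code, which is roughly the shape of information a generator needs before emitting lookup tables.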