icu_properties
is one of the [ICU4X
] components.
This component provides definitions of [Unicode Properties] and APIs for retrieving property data in an appropriate data structure.
APIs that return a [UnicodeSet
] exist for binary properties and certain enumerated
properties. See the [sets
] module for more details.
APIs that return a [CodePointTrie
] exist for certain enumerated properties. See the
[maps
] module for more details.
UnicodeSet
s```rust use icu::properties::{maps, sets, GeneralCategory};
let provider = icutestdata::getprovider();
// A binary property as a UnicodeSet
let payload = sets::getemoji(&provider) .expect("The data should be valid"); let datastruct = payload.get(); let emoji = &datastruct.invlist;
assert!(emoji.contains('🎃')); // U+1F383 JACK-O-LANTERN assert!(!emoji.contains('木')); // U+6728
// An individual enumerated property value as a UnicodeSet
let payload = maps::getgeneralcategory(&provider) .expect("The data should be valid"); let datastruct = payload.get(); let gc = &datastruct.codepointtrie; let linesep = gc.getsetforvalue(GeneralCategory::LineSeparator);
assert!(linesep.containsu32(0x2028)); assert!(!linesep.containsu32(0x2029)); ```
CodePointTrie
s```rust use icu::properties::{maps, Script};
let provider = icutestdata::getprovider();
let payload = maps::getscript(&provider) .expect("The data should be valid"); let datastruct = payload.get(); let script = &datastruct.codepoint_trie;
asserteq!(script.get('🎃' as u32), Script::Common); // U+1F383 JACK-O-LANTERN asserteq!(script.get('木' as u32), Script::Han); // U+6728 ```
For more information on development, authorship, contributing etc. please visit ICU4X home page
.