icu_properties crates.io

icu_properties is one of the [ICU4X] components.

This component provides definitions of [Unicode Properties] and APIs for retrieving property data in an appropriate data structure.

APIs that return a [UnicodeSet] exist for binary properties and certain enumerated properties. See the [sets] module for more details.

APIs that return a [CodePointTrie] exist for certain enumerated properties. See the [maps] module for more details.

Examples

Property data as UnicodeSets

```rust use icu::properties::{maps, sets, GeneralCategory};

let provider = icutestdata::getprovider();

// A binary property as a UnicodeSet

let payload = sets::getemoji(&provider) .expect("The data should be valid"); let datastruct = payload.get(); let emoji = &datastruct.invlist;

assert!(emoji.contains('🎃')); // U+1F383 JACK-O-LANTERN assert!(!emoji.contains('木')); // U+6728

// An individual enumerated property value as a UnicodeSet

let payload = maps::getgeneralcategory(&provider) .expect("The data should be valid"); let datastruct = payload.get(); let gc = &datastruct.codepointtrie; let linesep = gc.getsetforvalue(GeneralCategory::LineSeparator);

assert!(linesep.containsu32(0x2028)); assert!(!linesep.containsu32(0x2029)); ```

Property data as CodePointTries

```rust use icu::properties::{maps, Script};

let provider = icutestdata::getprovider();

let payload = maps::getscript(&provider) .expect("The data should be valid"); let datastruct = payload.get(); let script = &datastruct.codepoint_trie;

asserteq!(script.get('🎃' as u32), Script::Common); // U+1F383 JACK-O-LANTERN asserteq!(script.get('木' as u32), Script::Han); // U+6728 ```

More Information

For more information on development, authorship, contributing etc. please visit ICU4X home page.