icu_locid crates.io

Parsing, manipulating, and serializing Unicode Language and Locale Identifiers.

This module is published as its own crate (icu_locid) and as part of the icu crate. See the latter for more details on the ICU4X project.

The module provides algorithms for parsing a string into a well-formed language or locale identifier as defined by [UTS #35: Unicode LDML 3. Unicode Language and Locale Identifiers].

[Locale] is the most common structure to use for storing information about a language, script, region, variants and extensions. In almost all cases, this struct should be used as the base unit for all locale management operations.

[LanguageIdentifier] is a strict subset of [Locale] which can be useful in a narrow range of cases where [Unicode Extensions] are not relevant.

If in doubt, use [Locale].

Examples

```rust use icu::locid::subtags::{Language, Region}; use icu::locid::Locale;

let mut loc: Locale = "en-US".parse().expect("Parsing failed.");

let lang: Language = "en".parse().expect("Parsing failed."); let region: Region = "US".parse().expect("Parsing failed.");

asserteq!(loc.id.language, lang); asserteq!(loc.id.script, None); asserteq!(loc.id.region, Some(region)); asserteq!(loc.id.variants.len(), 0);

let region: Region = "GB".parse().expect("Parsing failed."); loc.id.region = Some(region);

asserteq!(loc.tostring(), "en-GB"); ```

Macros

```rust use icu::locid::{ langid, subtagslanguage as language, subtagsregion as region, };

let lid = langid!("EN_US");

asserteq!(lid.language, language!("en")); asserteq!(lid.region, Some(region!("US"))); ```

For more details, see [Locale] and [LanguageIdentifier].

More Information

For more information on development, authorship, contributing etc. please visit ICU4X home page.