encoding_rs

Build Status crates.io Apache 2 / MIT dual-licensed

encoding_rs aspires to become an implementation of the Encoding Standard that

  1. Is written in Rust.
  2. Is suitable for use in Gecko as a replacement of uconv. (I.e. supports decoding to UTF-16 and encoding from UTF-16.)
  3. Is suitable for use in Rust code (both in Gecko and independently of Gecko). (I.e. supports decoding to UTF-8 and encoding from UTF-8 and provides an API compatible with at least the most common ways of using rust-encoding.)

You should not expect the above-stated aspiration to be fulfilled yet.

Licensing

Please see the file named COPYRIGHT.

API Documentation

Generated API documentation is available online.

Design

For design considerations, please see the associated technical proposal to rewrite uconv in Rust.

Performance goals

For decoding to UTF-16, the goal is to perform at least as well as Gecko's old uconv. For decoding to UTF-8, the goal is to perform at least as well as rust-encoding.

Encoding to UTF-8 should be fast. (UTF-8 to UTF-8 encode should be equivalent to memcpy and UTF-16 to UTF-8 should be fast.)

Speed is a non-goal when encoding to legacy encodings. Encoding to legacy encodings should not be optimized for speed at the expense of code size as long as form submission and URL parsing in Gecko don't become noticeably too slow in real-world use.

Relationship with rust-encoding

This code is being prototyped as a new project as opposed to patches to rust-encoding both to avoid breaking rust-encoding with in-progress exploration and to be able to see where the API design would go from scratch given the goals.

It is expected that encoding_rs will use code from rust-encoding.

Evaluation of whether it makes sense to propose portions of encodingrs to be adopted into rust-encoding will be best deferred until encodingrs is further along as a prototype.

Roadmap