Rut is a small UTF-8 parsing library for applications that need to parse individual char
s.\
It provides a byte-wise parsing mechanism, and functions for processing byte slices.
It is completely #[no_std]
and should produce very small binaries.[citation needed]
Rut aims to be fully conformant to the specifications and restrictions of the [Unicode standard].\ Due to the nature of byte-wise parsing, some [extra caution] might be required when using Rut.
The [parse_one
] and [parse
] functions take care of this.
A few tests validating the expected behavior are already in place, but it is not comprehensive by any means yet. More tests will be added.
I have thrown a fuzzer at it for several minutes, and it passes this stress test for UTF-8 decoders.
```rust use rut::Utf8Parser;
// UTF-8 encoding of '€' let bytes = [0xE2, 0x82, 0xAC];
let mut p = Utf8Parser::new();
asserteq!(p.parsebyte(bytes[0]), Ok(None)); asserteq!(p.parsebyte(bytes[1]), Ok(None)); asserteq!(p.parsebyte(bytes[2]), Ok(Some('€'))); ```