Very fast! And flexible, This library used to serialize and deserialize data in binary format.
Inspaired by bincode, But much more flexible.
By default, the library uses little endian.
If you want to use big endian, you can set BE
features flag. And for native endian use NE
. For example:
toml
[dependencies]
bin-layout = { version = "6", features = ["BE"] }
```rust use bin_layout::*;
struct Car<'a> { name: &'a str, year: u16, is_new: bool, }
struct Company<'a> { name: String, cars: Vec
let old = Company { name: "Tesla".into(), cars: vec![ Car { name: "Model S", year: 2018, isnew: true }, Car { name: "Model X", year: 2019, isnew: false }, ], };
let bytes = old.encode(); let new = Company::decode(&bytes); ```
There is two main reasons for this library to exists.
There is no performance penalty for using this library.
Zero-copy deserialization: Its mean that no data is copied. Instead, the data is referenced.
```rust use bin_layout::*;
struct Msg<'a> { id: u8, data: &'a str, } let bytes = [42, 13, 72, 101, 108, 108, 111, 44, 32, 87, 111, 114, 108, 100, 33]; // ^^ ^^ ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ // Id Len Data
let msg = Msg::decode(&bytes).unwrap(); asserteq!(msg.id, 42); asserteq!(msg.data, "Hello, World!"); // Here, data is referenced. ```
Compile time allocation:
What if the data is fixed size? Then we don't need to allocate any memory at runtime.
For example, The following structs, don't have any dynamic data. So we can have a fixed size buffer at compile time.
```rust use binlayout::*; use stackarray::ArrayBuf;
struct Date { year: u16, month: u8, day: u8, }
struct Record { id: u32, date: Date, value: [u8; 512], }
let record = Record { id: 1, date: Date { year: 2018, month: 1, day: 1 }, value: [0; 512] };
let mut arr: ArrayBuf What happens if we have a dynamic data (like vector, string, etc...) ? Then we have to allocate memory at runtime. But how much memory we need to store the whole data ? When a vector is full, It creates a new vector with larger size, Then move all data to the new vector. Which is expensive. Well, Encoder has a method called For example: ```rust
use bin_layout::*; struct Student {
roll: u32,
name: String, // Here we have a dynamic data.
} let bytes = Student { roll: 42, name: "Jui".into() }.encode();
```size_hint
, which calculates the total size of the data at runtime. Which is cheap to compute. encode
method use size_hint
function internaly.[derive(Encoder, Decoder)]
It work by mantaining a Cursor. Which is a pointer to the current position in the buffer. And the cursor is updated when reading or writing data to the buffer.
It's very easy to implement a custom serializer/deserializer for your own data type.
For example:
```rust use bin_layout::*;
struct Bar(u16); struct Foo { x: u8, y: Bar }
impl Encoder for Foo {
fn encoder(&self, c: &mut impl Array
All primitive types implement this trait.
Vec
, String
, &[T]
, &str
etc.. are encoded with their length value first, Following by each entry.
This encoding ensures that smaller integer values need fewer bytes to encode. Support types are L2
and L3
, both are encoded in little endian.
By default, L2
(u15) is used to encode length (integer) for record. But you override it by setting L3
(u22) in features flag.
Encoding algorithm is very straightforward, reserving one or two most significant bits of the first byte to encode rest of the length.
| MSB | Length | Usable Bits | Range | | :---: | :----: | :---------: | :------- | | 0 | 1 | 7 | 0..127 | | 1 | 2 | 15 | 0..32767 |
| MSB | Length | Usable Bits | Range | | :---: | :----: | :---------: | :--------- | | 0 | 1 | 7 | 0..127 | | 10 | 2 | 14 | 0..16383 | | 11 | 3 | 22 | 0..4194303 |
For example, Binary representation of 0x_C0DE
is 0x_11_00000011_011110
L3(0x_C0DE)
is encoded in 3 bytes:
yml
1st byte: 11_011110 # MSB is 11, so read next 2 bytes
2nd byte: 11
3rd byte: 11
Another example, L3(107)
is encoded in just 1 byte:
yml
1st byte: 0_1101011 # MSB is 0, So we don't have to read extra bytes.
Record can be used to represent fixed-size integer to represent the length of a record.
It accepts fixed-length unsigned interger type of N
(u8
, u32
, usize
, etc..) and a generic type of T
(Vec<T>
, String
etc..)