Rust-tokenizer is a drop-in replacement for the tokenization methods from the Transformers library
Rust-tokenizer requires a rust nightly build in order to use the Python API. Building from source involes the following steps:
python setup.py install
in the repository. This will compile the Rust library and install the python API/tests
folder, including benchmark and integration testsThe library is fully unit tested at the Rust level