finalfusion-utils
is a Rust crate offering various
functionalities to process and query embeddings.
finalfusion-utils
supports conversion between different
formats, quantization of embedding matrices, similarity and
analogy queries as well as evaluation on analogy datasets.
Installing finalfusion-utils
requires a Rust toolchain
with minimum version 1.32
which can be installed via
rustup.
With a valid Rust toolchain, the crate is most easily
installed through cargo
:
~~~shell $ cargo install finalfusion-utils ~~~
finalfusion-utils
can also be built from source,
after cloning this repository execute the following
command in the directory to find the exectuable under
target/release/finalfusion
:
~~~shell $ cargo build --release ~~~
finalfusion-utils
is built as a single binary, the
different functionality is invoked through subcommands:
~~~shell
$ finalfusion convert -f fasttext -t finalfusion \ embeddings.bin embeddings.fifu
$ finalfusion convert -f word2vec -t finalfusion \ embeddings.w2v embeddings.fifu
$ finalfusion convert --help ~~~
~~~shell
$ finalfusion quantize -f finalfusion -q pq -a 1 \ embeddings.pq ~~~
~~~ shell
$ finalfusion similar -f finalfusion -k 15 \ embeddings.fifu
$ finalfusion analogy -f finalfusion -k 5 \ Berlin Deutschland Amsterdam embeddings.fifu ~~~
~~~shell
$ finalfusion compute-accuracy embeddings.fifu \ analogies.txt ~~~
~~~shell
$ finalfusion metadata embeddings.fifu \
metadata.txt ~~~
~~~shell
$ finalfusion completions zsh ~~~