DawnSearch is an open source distributed web search engine that searches by meaning. It can index the Common Crawl data. It uses semantic search (searching on meaning), using all-MiniLM-L6-v2. It uses USearch for vector search. DawnSearch is written in Rust. DawnSearch is licensed AGPLv3.0+.
A public instance is available at dawnsearch.org.
These instructions assume you're running a recent Ubuntu.
Install Rust (if you don't have it already):
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
Install the required packages.
sudo apt update
sudo apt install build-essential libssl-dev pkg-config
Next, install libtorch.
Feel free to open an issue if you encounter problems building DawnSearch!
Please open issues, or create pull requests. Note that DawnSearch is licensed AGPLv3.0+ or later, which is slightly unusual for a Rust project.