DawnSearch

DawnSearch is an open source distributed web search engine that searches by meaning (semantic search). It can index the Common Crawl data. Text is embedded with the all-MiniLM-L6-v2 model, and USearch is used for vector search. DawnSearch is written in Rust and licensed under AGPLv3.0 or later.
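
To make "searching by meaning" concrete, here is a minimal sketch of the general idea, not DawnSearch's actual code. It assumes documents and queries have already been turned into embedding vectors (all-MiniLM-L6-v2 produces 384-dimensional vectors; the tiny 3-dimensional vectors in the usage note are made up for illustration). It ranks documents by cosine similarity with a brute-force scan, where a real deployment would use an approximate index such as USearch. The function names `cosine_similarity` and `top_k` are hypothetical.

```rust
/// Cosine similarity between two equal-length vectors.
/// Returns 0.0 if either vector has zero norm.
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    assert_eq!(a.len(), b.len(), "vectors must have the same dimension");
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        return 0.0;
    }
    dot / (norm_a * norm_b)
}

/// Brute-force search (an illustrative stand-in for a vector index):
/// score every document and return the `k` best as
/// (document index, similarity), highest similarity first.
fn top_k(query: &[f32], docs: &[Vec<f32>], k: usize) -> Vec<(usize, f32)> {
    let mut scored: Vec<(usize, f32)> = docs
        .iter()
        .enumerate()
        .map(|(i, d)| (i, cosine_similarity(query, d)))
        .collect();
    // Sort descending by similarity; total_cmp gives a total order on f32.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(k);
    scored
}
```

For example, with documents [1.0, 0.0, 0.0], [0.0, 1.0, 0.0] and [0.9, 0.1, 0.0], calling top_k with the query [1.0, 0.0, 0.0] and k = 2 returns document 0 first and document 2 second. The brute-force scan costs time linear in the number of documents, which is why large collections rely on an approximate nearest-neighbour index instead.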

A public instance is available at dawnsearch.org.

Quick start

These instructions assume you're running a recent version of Ubuntu.

Install Rust (if you don't have it already):

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh

Install the required packages:

sudo apt update
sudo apt install build-essential libssl-dev pkg-config

Next, install libtorch.

Feel free to open an issue if you encounter problems building DawnSearch!

Contributing

Please open issues or create pull requests. Note that DawnSearch is licensed under AGPLv3.0 or later, which is slightly unusual for a Rust project.

See also