This Rust crate contains a function to compute trigram-based similarity of strings.
A trigram is a 3-character long string, and it turns out that the set of all the trigrams in a given string is a good way to represent it when computing its similarity to other strings.
Add this to your Cargo.toml
:
toml
[dependencies]
trigram = "0.2.2"
and call it like this:
```rust use trigram::similarity;
fn main() { println!("{}", similarity(&"rustacean", &"crustacean")); } ```
The similarity
function in this crate is a reverse-engineered approximation
of the similarity
function in the Postgresql pg_trgm extension:
https://www.postgresql.org/docs/9.1/pgtrgm.html. It gives exactly the same
answers in many cases, but may disagree in others (none known). If you find a
case where the answers don't match, please file an issue about it!
A good introduction to the Postgres version of this is given on Stack Overflow: https://stackoverflow.com/a/43161051/484529.