This crate is a utility for identifying names of programming languages (and related files) from paths and file extensions.
This is not a crate for detecting natural languages.
Add this to your Cargo.toml
:
toml
[dependencies]
detect-lang = "0.1"
Languages can be identified from paths using from_path
or directly from extensions using from_extension
.
```rust use detectlang::frompath; asserteq!(frompath("foo.rs").unwrap().name(), "Rust"); asserteq!(frompath("foo.md").unwrap().name(), "Markdown");
use detectlang::fromextension; asserteq!(fromextension("rs").unwrap().name(), "Rust"); asserteq!(fromextension("md").unwrap().name(), "Markdown");
// The case is ignored asserteq!(frompath("foo.jSoN").unwrap().name(), "JSON"); asserteq!(fromextension("jSoN").unwrap().name(), "JSON"); ```
In short, the language id
is a lowercase version of name
.
However, it also replaces symbols making it usable as a [URL slug].
For instance foo.hpp
is identified as language name C++
and
language ID cpp
.
```rust use detectlang::frompath; asserteq!(frompath("foo.rs").unwrap().id(), "rust"); asserteq!(frompath("foo.cpp").unwrap().id(), "cpp"); asserteq!(frompath("foo.hpp").unwrap().id(), "cpp");
use detectlang::fromextension; asserteq!(fromextension("rs").unwrap().id(), "rust"); asserteq!(fromextension("cpp").unwrap().id(), "cpp"); asserteq!(fromextension("hpp").unwrap().id(), "cpp");
// The case is ignored asserteq!(frompath("foo.jSoN").unwrap().id(), "json"); asserteq!(fromextension("jSoN").unwrap().id(), "json"); ```
```rust use std::path::Path; use detectlang::{frompath, Language};
let path = Path::new("foo.rs"); match frompath(path) { Some(Language(, "rust")) => println!("This is Rust"), Some(Language(..)) => println!("Well it's not Rust"), None => println!("Ehh, what?"), } ```