This is a Rust crate (i.e. library) for working with a local copy of the
NCBI Taxonomy database.
The database can be downloaded (either `taxdump.zip` or `taxdump.tar.gz`) from the
NCBI Taxonomy FTP site and reformatted into a SQLite database
using the `taxonomy_util` utility's `to_sqlite` subcommand.
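
To give a feel for what the raw download contains, here is a minimal sketch of parsing one line of the `nodes.dmp` file from the taxdump archive. It assumes the usual dump layout, where fields are separated by tab, pipe, tab and each line ends with tab, pipe. The `Node` struct and `parse_node_line` function are hypothetical names for illustration only and are not this crate's API.

```rust
/// Hypothetical record holding the first three fields of a nodes.dmp line.
#[derive(Debug, PartialEq)]
struct Node {
    tax_id: u32,
    parent_tax_id: u32,
    rank: String,
}

/// Parse the tax ID, parent tax ID and rank from one nodes.dmp line.
/// Returns None if the line has too few fields or the IDs are not numbers.
fn parse_node_line(line: &str) -> Option<Node> {
    // Drop any line ending, then the trailing "\t|" terminator.
    let line = line.trim_end_matches(|c: char| c == '\n' || c == '\r');
    let line = line.strip_suffix("\t|").unwrap_or(line);
    // Assumed field separator: tab, pipe, tab.
    let mut fields = line.split("\t|\t");
    let tax_id: u32 = fields.next()?.trim().parse().ok()?;
    let parent_tax_id: u32 = fields.next()?.trim().parse().ok()?;
    let rank = fields.next()?.trim().to_string();
    Some(Node { tax_id, parent_tax_id, rank })
}

fn main() {
    // Shortened sample line; real lines carry more fields after the rank.
    let sample = "9606\t|\t9605\t|\tspecies\t|\tHS\t|";
    match parse_node_line(sample) {
        Some(node) => println!("{:?}", node),
        None => eprintln!("could not parse line"),
    }
}
```

In practice the `to_sqlite` subcommand does this conversion for you; the sketch only illustrates the shape of the input data.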
Documentation is available at crates.io.
taxonomy_filter_refseq (new in 0.1.1)

A tool to filter an NCBI RefSeq FASTA file so that only the ancestors of a given taxon are retained.
```bash
$ taxonomy_filter_refseq --help
taxonomy_filter_refseq 1.0.0
Peter van Heusden pvh@sanbi.axc.za
Filter NCBI RefSeq FASTA files by taxonomic lineage

USAGE:
    taxonomy_filter_refseq [FLAGS] [OPTIONS]

FLAGS:
        --no_curated      Don't accept curated RNAs and proteins (NM_, NR_ and NP_ accessions)
        --no_predicted    Don't accept computationally predicted RNAs and proteins (XM_, XR_ and XP_ accessions)
    -h, --help            Prints help information
    -V, --version         Prints version information

OPTIONS:
    -d, --db

ARGS:
```
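
The two flags in the help text distinguish records by accession prefix. The following is a minimal sketch of that idea, not the tool's actual code: `AccessionKind`, `classify_accession` and `keep_record` are hypothetical names, and keeping records with any other prefix (for example genomic `NC_` accessions) is an assumption made here.

```rust
/// Broad RefSeq accession categories, following the prefixes in the help text.
#[derive(Debug, PartialEq, Eq, Clone, Copy)]
enum AccessionKind {
    Curated,   // NM_, NR_, NP_
    Predicted, // XM_, XR_, XP_
    Other,     // any other prefix
}

/// Classify an accession string by its prefix.
fn classify_accession(accession: &str) -> AccessionKind {
    const CURATED: [&str; 3] = ["NM_", "NR_", "NP_"];
    const PREDICTED: [&str; 3] = ["XM_", "XR_", "XP_"];
    if CURATED.iter().any(|p| accession.starts_with(*p)) {
        AccessionKind::Curated
    } else if PREDICTED.iter().any(|p| accession.starts_with(*p)) {
        AccessionKind::Predicted
    } else {
        AccessionKind::Other
    }
}

/// Hypothetical helper: decide whether a record survives the two flags.
/// Assumption: records that are neither curated nor predicted are kept.
fn keep_record(accession: &str, no_curated: bool, no_predicted: bool) -> bool {
    match classify_accession(accession) {
        AccessionKind::Curated => !no_curated,
        AccessionKind::Predicted => !no_predicted,
        AccessionKind::Other => true,
    }
}

fn main() {
    for acc in ["NM_000546.6", "XP_011522345.1", "NC_000017.11"] {
        println!("{}: {:?}", acc, classify_accession(acc));
    }
}
```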
taxonomy_filter_fastq (new in version 0.2.0)
```bash
$ taxonomy_filter_fastq --help
taxonomy_filter_fastq 1.0.0
Peter van Heusden pvh@sanbi.axc.za
Filter FASTQ files whose reads have been classified by Centrifuge or Kraken2, only retaining reads in taxa descending from given ancestor

USAGE:
    taxonomy_filter_fastq [FLAGS] [OPTIONS]

FLAGS:
    -d, --output_dir    Directory to deposited filtered output files in
    -C, --centrifuge    Filter using report from Centrifuge
    -h, --help          Prints help information
    -K, --kraken2       Filter using report from Kraken2
    -V, --version       Prints version information

OPTIONS:
    -A, --ancestortaxid

ARGS:
```
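
Retaining reads "in taxa descending from given ancestor" comes down to a descendant test on the taxonomy tree. The sketch below shows one way such a test could work, assuming the tree is held as a child-to-parent map in which the root is its own parent. `descends_from` and `toy_tree` are hypothetical names; the crate's real implementation and API may differ.

```rust
use std::collections::HashMap;

/// Returns true if `taxid` is `ancestor` itself or lies below it in the tree.
/// `parents` maps each taxonomy ID to its parent ID; the root maps to itself.
/// Assumes the map describes a tree (no cycles other than the root's self-link).
fn descends_from(parents: &HashMap<u32, u32>, taxid: u32, ancestor: u32) -> bool {
    let mut current = taxid;
    loop {
        if current == ancestor {
            return true;
        }
        match parents.get(&current) {
            Some(&parent) if parent != current => current = parent,
            _ => return false, // reached the root or an unknown ID
        }
    }
}

/// Toy tree for illustration, with most intermediate ranks omitted:
/// 1 (root) <- 131567 (cellular organisms) <- 2 (Bacteria) <- 1224
///                                         <- 2759 (Eukaryota) <- 9606
fn toy_tree() -> HashMap<u32, u32> {
    HashMap::from([(1, 1), (131567, 1), (2, 131567), (1224, 2), (2759, 131567), (9606, 2759)])
}

fn main() {
    let parents = toy_tree();
    // A read classified as 9606 would be kept when filtering on ancestor 2759,
    // and dropped when filtering on ancestor 2.
    println!("9606 under 2759: {}", descends_from(&parents, 9606, 2759));
    println!("9606 under 2: {}", descends_from(&parents, 9606, 2));
}
```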
taxonomy_util (new in 1.0.0)

Utilities to convert NCBI taxonomy database files into a SQLite database (the input format used by the other tools).
```bash
taxonomy_util 1.0.0
Peter van Heusden pvh@sanbi.axc.za
Utilities for working with the NCBI taxonomy database

USAGE:
    taxonomy_util [OPTIONS] [SUBCOMMAND]

FLAGS:
    -h, --help       Prints help information
    -V, --version    Prints version information

OPTIONS:
    -d, --db

SUBCOMMANDS:
    common_ancestor_distance    find the tree distance to te common ancestor between two taxa
    get_id                      find taxonomy ID for name
    get_lineage                 get lineage for name [unimplemented]
    get_name                    find name for taxonomy ID
    help                        Prints this message or the help of the given subcommand(s)
    to_sqlite                   save taxonomy database loaded from files to SQLite database file
```
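
As an illustration of what a common ancestor distance can mean, the sketch below finds the lowest common ancestor of two taxa in a child-to-parent map and reports how many steps each taxon is from it. This is an assumption about the general idea only: the help text does not say how the subcommand defines or reports the distance, and `lineage`, `common_ancestor_distance` and `toy_tree` are hypothetical names, not the crate's API.

```rust
use std::collections::HashMap;

/// Path from `taxid` up to the root, inclusive. The root is its own parent.
fn lineage(parents: &HashMap<u32, u32>, taxid: u32) -> Vec<u32> {
    let mut path = vec![taxid];
    let mut current = taxid;
    while let Some(&parent) = parents.get(&current) {
        if parent == current {
            break; // reached the root
        }
        path.push(parent);
        current = parent;
    }
    path
}

/// Returns (common ancestor, steps from `a`, steps from `b`),
/// or None if the two taxa share no ancestor in the map.
fn common_ancestor_distance(
    parents: &HashMap<u32, u32>,
    a: u32,
    b: u32,
) -> Option<(u32, usize, usize)> {
    let path_a = lineage(parents, a);
    let path_b = lineage(parents, b);
    // Position in path_b equals the number of steps up from `b`.
    let steps_from_b: HashMap<u32, usize> =
        path_b.iter().enumerate().map(|(i, &t)| (t, i)).collect();
    // The first node on a's path that also lies on b's path is the lowest common ancestor.
    path_a
        .iter()
        .enumerate()
        .find_map(|(steps_a, t)| steps_from_b.get(t).map(|&steps_b| (*t, steps_a, steps_b)))
}

/// Toy tree for illustration, with most intermediate ranks omitted.
fn toy_tree() -> HashMap<u32, u32> {
    HashMap::from([(1, 1), (131567, 1), (2, 131567), (1224, 2), (2759, 131567), (9606, 2759)])
}

fn main() {
    let parents = toy_tree();
    println!("{:?}", common_ancestor_distance(&parents, 9606, 1224));
}
```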