A bioinformatics plugin for nushell. This plugin parses most common bioinformatics formats into structured data so you can use them with nushell more effectively.
First, go and get nushell (it's great) and make sure you have the Rust toolchain installed. Then come back!
```nu
git clone https://github.com/Euphrasiologist/nupluginbio
cd nupluginbio
cargo build --release
register ./target/release/nupluginbio
open ./tests/test.fasta | get id
open --raw ./tests/test.fasta | from fasta -d | first
```
The backend is a wrapper around noodles, an excellent, all-Rust bioinformatics I/O library.
Aim to support the following:

- [x] BAM 1.6
- [x] BCF 2.2
- [x] bcf.gz
- [x] VCF 4.3
- [x] vcf.gz
- [x] BED (3 only right now)
- [x] CRAM 3.0
- [x] FASTA
- [x] fa.gz
- [x] FASTQ
- [x] fq.gz
- [x] GFF3
- [ ] GTF 2.2
- [x] SAM 1.6
- [x] GFA 1.0
- [x] gfa.gz
Note that performance will not be optimal with the current state of nu_plugin, as we cannot access the engine state of nushell and therefore need to load entire data structures into memory. Testing still needs to be done on large files.
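
To make the memory caveat concrete, here is a minimal sketch of eager parsing, using only the Rust standard library. It is not the plugin's code and does not use the noodles API. `FastaRecord` and `parse_fasta` are hypothetical names made up for illustration. Every record is collected into a `Vec` before anything is returned, so memory use grows with the size of the input.

```rust
/// One FASTA record as a plain struct.
/// Hypothetical shape for illustration, not the plugin's actual type.
#[derive(Debug, Clone, PartialEq)]
struct FastaRecord {
    id: String,
    description: Option<String>,
    sequence: String,
}

/// Eagerly parse FASTA text: every record is held in memory at once.
fn parse_fasta(input: &str) -> Vec<FastaRecord> {
    let mut records: Vec<FastaRecord> = Vec::new();
    for line in input.lines() {
        let line = line.trim_end();
        if let Some(header) = line.strip_prefix('>') {
            // Header line: the id runs up to the first whitespace,
            // anything after it is the optional description.
            let mut parts = header.splitn(2, char::is_whitespace);
            let id = parts.next().unwrap_or("").to_string();
            let description = parts
                .next()
                .map(|d| d.trim().to_string())
                .filter(|d| !d.is_empty());
            records.push(FastaRecord {
                id,
                description,
                sequence: String::new(),
            });
        } else if let Some(last) = records.last_mut() {
            // Sequence line: append to the most recent record.
            last.sequence.push_str(line.trim());
        }
        // Lines before the first header are ignored in this sketch.
    }
    records
}
```

Calling `parse_fasta` on the contents of a small file returns one struct per sequence, which is roughly the shape of a table row. A streaming design would yield records one at a time instead of building the whole `Vec` first.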
If there's a bioinformatics format you want added, let me know, or open a PR.