fibertools-rs
fibertools-rs
a CLI tool for creating and interacting with fiberseq bam files.
crates.io
Installation from crates.io
requires the rust package manager cargo
. You can find how to install cargo
here.
Furthermore, a recent version of gcc
and cmake
is required. I have tested and recommend gcc v10.2.0
and cmake v3.21.1
, though other versions may work.
cargo install fibertools-rs
bioconda
mamba install -c conda-forge -c bioconda fibertools-rs
github
(active development)
cargo install --git https://github.com/mrvollger/fibertools-rs
bash
ft --help
Help page for fibertools
fibertools-rs
ft predict-m6a
Help page for predict-m6a. Predict m6A positions using HiFi kinetics data and encode the results in the MM and ML bam tags.
We recommend filtering out ML scores less than 250 for the XGBoost model and ML scores less than 200 for the CNN model.
To allow for m6A predictions with the CNN model you must follow these modified installation instructions.
* Get libtorch
v1.12.0 from the PyTorch website download section and extract the content of the zip file.
* On my linux system with a cuda gpu this is what I downloaded:
* wget https://download.pytorch.org/libtorch/cu116/libtorch-cxx11-abi-shared-with-deps-1.12.0%2Bcu116.zip
* Add the following to your .bashrc
or equivalent, where /path/to/libtorch
is the path to the directory that was created when unzipping the file:
bash
export LIBTORCH=/path/to/libtorch # e.g. export LIBTORCH=/Users/mrvollger/lib/libtorch
export LD_LIBRARY_PATH=${LIBTORCH}/lib:$LD_LIBRARY_PATH
export DYLD_LIBRARY_PATH=${LIBTORCH}/lib:$LD_LIBRARY_PATH
And install fibertools-rs
from cargo
with the cnn
feature enabled:
bash
cargo install --git https://github.com/mrvollger/fibertools-rs --features cnn
To add nucleosome calls to the BAM files you can use the python package fibertools. See that repository for installation and instructions.
ft extract
Help page for extract. Extracts fiberseq data from a bam file into plain text.
ft extract --all
The extract all option is a special option that tries to extract all the fiberseq data into a tabular format. The following is an image of the output. Note that the column names will be preserved across different software versions (unless otherwise noted); however, the order may change and new columns may be added. Therefore, when loading the data (with pandas
e.g.) be sure to use the column names as opposed to indexes for manipulation.
ft center
Help page for center. Center fiberseq reads (bam) around reference position(s).
You can find the docs for the latest release here:
https://docs.rs/fibertools-rs/latest/fibertools_rs/
or download from source and run:
cargo doc --open --features cnn
and the docs will open in your browser.
ft extract
and group writes to try and improve the speedrustybam
stats to ft all
as an option