A rust framework to make using alevin-fry even simpler.
The simpleaf
program is intended to simply the running of alevin-fry
in common usage scenarios. By limiting some of the different options that can be set, it provides a streamlined way to build the splici reference and index in a single command, as well as to process an experiment from raw FASTQ files to a count matrix in a single command.
To work properly, simpleaf
has a few requirements. Specifically, you should have pyroe
(>=0.6.2), salmon
(>=1.5.1), and alevin-fry
(>=0.6.0) installed. These can either simply be in your PATH
variable, or you can explicily provide the path to them using the set-paths
command of simpleaf
, which will then cache them in a JSON
file in your ALEVIN_FRY_HOME
directory. Additionally, simpleaf
requires the following environment variable to be present when it is executed :
ALEVIN_FRY_HOME
REQUIRED — This directory will be used for persistent configuration and small file (<1G) storage between runs. If you provide a directory and it doesn't exist, it will be created. It is easiest to just set this in your enviornment globally so that the same home can be used over many runs without you having to provide the variable explicitly each time. A good choice for this variable might be something like ~/.alevin_fry_home
.
The simpleaf
script has three sub-commands:
set-paths
— The set-paths
command will set the paths to the relevant executables and store them in a configuration file in the ALEVIN_FRY_HOME
directory. If you don't provide an explicit path for a program, simpleaf
will look in your PATH
for a compatible version. This command takes the following optional arguments:```{bash} USAGE: simpleaf set-paths [OPTIONS]
OPTIONS:
-a, --alevin-fry
index
— The index
command will take a reference genome FASTA and GTF as input, build a splici reference using the build_splici_ref.R
script, and then build a sparse salmon
index on the resulting reference. Note: The index
command requires the Rscript
executable to be in the path, as well as all of theR packages that are required by build_splici_ref.R
. The relevant options (which you can obtain by running ./simpleaf index -h
) are:
```{bash}
USAGE:
simpleaf index [OPTIONS] --fasta
OPTIONS:
-d, --dedup deduplicate identical sequences inside the R script when building the splici reference
-f, --fasta
quant
— The quant
command takes as input the index, reads, and relevant information about the experiment (e.g. chemistry), and runs all of the steps of the alevin-fry
pipeline, from mapping with salmon
through quant
with alevin-fry
. The relevant options (which you can obtain by running simpleaf quant -h
) are:
```{bash}
USAGE:
simpleaf quant [OPTIONS] --index
OPTIONS:
-1, --reads1