basebits

A library for memory-efficient encoding of short DNA sequences.

Synopsis

When to use this library? If you compare the same sequences against each other more than 4 times, the one-time cost of encoding them pays for itself.

Operations

Constant-time Hamming distance calculations
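
To illustrate the idea, here is a minimal Python sketch of a bit-packed encoding with an XOR-and-popcount Hamming distance. It is not this library's API: the names `CODES`, `encode`, and `hamming` and the specific 3-bit code table are assumptions chosen for the example. The table is picked so that any two distinct bases differ in exactly 2 bits, which means each mismatch contributes exactly 2 set bits to the XOR of two encoded words. Ambiguous bases such as N are not handled here.

```python
# Hypothetical 3-bit codes (an assumption, not necessarily the library's table).
# Any two distinct codes differ in exactly 2 bits.
CODES = {"A": 0b000, "C": 0b011, "G": 0b101, "T": 0b110}
BITS_PER_BASE = 3
MAX_LEN = 64 // BITS_PER_BASE  # 21 bases fit in one 64-bit word


def encode(seq: str) -> int:
    """Pack a short DNA string into one integer, 3 bits per base."""
    if len(seq) > MAX_LEN:
        raise ValueError(f"sequence longer than {MAX_LEN} bases")
    word = 0
    for base in seq.upper():
        # Raises KeyError for anything other than A, C, G, T.
        word = (word << BITS_PER_BASE) | CODES[base]
    return word


def hamming(a: int, b: int) -> int:
    """Hamming distance between two encoded sequences of equal length."""
    # XOR leaves 2 set bits per mismatching base, so halve the bit count.
    return bin(a ^ b).count("1") // 2


# Encode once, then compare as many times as needed.
x = encode("ACGT")
y = encode("ACGA")
print(hamming(x, y))
```

Because a sequence of up to 21 bases fits in a single 64-bit word under this scheme, the comparison reduces to one XOR and one population count on typical hardware, regardless of where the mismatches fall.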

Reference

See the 'Constant Time Hamming Distance' section of https://www.biorxiv.org/content/10.1101/648683v1.full

Future directions

FFT stuff?