Russel88/CRISPRCasTyper

Speed up CRISPR stats calculation

Closed this issue · 1 comments

Long CRISPRs take time to process, because each pair of both spacers and repeats are aligned to calculate the average identity of repeats and spacers. It should be sufficient to sample a subset of the repeats/spacers to estimate the identity. Then add a CLI argument to toggle exact versus approximate identity estimation.

Now implemented in development version