rnajena/viralclust
Small pipeline to cluster viral genomes based on their k-mer content. WiP
NextflowGPL-3.0
Issues
- 0
RAM vs Diskspace trade off
#21 opened by klamkiew - 1
HDBSCAN error for large dataset
#20 opened by sandraTriebel - 0
invalid compressed data in ncbi update process
#19 opened by klamkiew - 0
better restrictions for yml files
#18 opened by klamkiew - 1
NCBI Accession IDs enhancement
#4 opened by klamkiew - 0
sort_sequences: different default behavior
#15 opened by klamkiew - 0
- 0
- 0
unclustered sequences in final set in mmseqs2
#10 opened by klamkiew - 0
- 2
Inconsistent use of the label revComp/revcomp
#13 opened by matthuska - 0
- 0
- 0
- 0
NCBI collection year format
#8 opened by klamkiew - 1
NCBI annotation via Regex
#7 opened by klamkiew - 0
- 0
multiprocessing for sort-sequences
#5 opened by klamkiew - 1
empty sequences after umap&hdbscan for k=5
#1 opened by klamkiew - 0
duplicated sequence in final set
#2 opened by klamkiew - 0