jianshu93
Probabilistic data structures in Bioinformatics and Computational Biology. Collaboration with @jean-pierreBoth
University of California, San DiegoSan Diego
jianshu93's Stars
simdjson/simdjson
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
simd-lite/simd-json
Rust port of simdjson
SunPengChuan/wgdi
WGDI: A user-friendly toolkit for evolutionary analyses of whole-genome duplications and ancestral karyotypes
biobakery/Maaslin2
MaAsLin2: Microbiome Multivariate Association with Linear Models
IndexThePlanet/Logan
Logan Unitigs and Contigs
chasewnelson/SNPGenie
Program for estimating πN/πS, dN/dS, and other diversity measures from next-generation sequencing data
unageek/graphest
A faithful graphing calculator
bcgsc/ntEdit
✏️ Genome assembly polishing & SNV detection
5cript/interval-tree
A C++ header only interval tree implementation.
moshi4/COGclassifier
A tool for classifying prokaryote protein sequences into COG(Cluster of Orthologous Genes) functional category
mourisl/centrifuger
Classifier for metagenomic sequences using FM-index with run-block compressed BWT.
yunwilliamyu/hyperminhash
LogLog space version of MinHash by combining ideas from HyperLogLog and b-bit MinHash
CMU-SAFARI/RawHash
RawHash can accurately and efficiently map raw nanopore signals to reference genomes of varying sizes (e.g., from viral to a human genomes) in real-time without basecalling. Described by Firtina et al. (published at https://academic.oup.com/bioinformatics/article/39/Supplement_1/i297/7210440).
RabbitBio/RabbitTClust
RabbitTClust: enabling fast clustering analysis of millions bacteria genomes with MinHash sketches
bluenote-1577/fairy
alignment-free coverage calculation for metagenomic binning >100 times faster
martinjrobins/diffsol
ODE solver library in Rust
kcleal/pywfa
Python wrapper for wavefront alignment using WFA2-lib
prophyle/prophasm
ProphAsm – a rapid computation of simplitigs directly from k-mer sets
ocxtal/dozeu
SIMD-parallel BLAST X-drop DP on sequence graphs
ChrisCreevey/catsequences
A tool for concatenating multiple fasta alignments for supermatrix phylogenetic analyses
pesho-ivanov/minSH
A minimal Python re-implementation of the A* with seed heuristic for exact global alignment (edit distance) in near-linear time
hcho3/skiplist_cuda
A parallel (CUDA) implementation of skiplist
chasewnelson/OLGenie
Program for estimating dN/dS in overlapping genes (OLGs)
vicLeva/bqf
Implementation of a Backpack Quotient Filter
ucrparlay/Edit-Distance
ESA'23: Efficient Parallel Output-Sensitive Edit Distance
natir/knot
KNOT: Knowledge Network Overlap exTraction is a tool for the investigation of fragmented long read assemblies.
ARU-life-sciences/extract_nhmmer_tblout
Get the nhmmer identified sequences back from a FASTA file.
ctb/pp-sketchlib
Library of sketching functions used by PopPUNK
lukaslueg/pyhyperminhash
Very fast, constant memory-footprint cardinality approximation, including intersection and union operation.
LukeMathWalker/googletest-rust