bioinformatics
There are 9903 repositories under bioinformatics topic.
Developer-Y/cs-video-courses
List of Computer Science courses with video lectures.
plotly/dash
Data Apps & Dashboards for Python. No JavaScript Required.
biopython/biopython
Official git repository for Biopython (originally converted from CVS)
google/deepvariant
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
seandavi/awesome-single-cell
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
danielecook/Awesome-Bioinformatics
A curated list of awesome Bioinformatics libraries and software.
nextflow-io/nextflow
A DSL for data-driven computational pipelines
sokrypton/ColabFold
Making Protein folding accessible to all!
OpenGene/fastp
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
scverse/scanpy
Single-cell analysis in Python. Scales to >1M cells.
lh3/minimap2
A versatile pairwise aligner for genomic and spliced nucleotide sequences
broadinstitute/gatk
Official code repository for GATK versions 4 and up
allenai/scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
bioconda/bioconda-recipes
Conda recipes for the bioconda channel.
lh3/bwa
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
soedinglab/MMseqs2
MMseqs2: ultra fast and sensitive search and clustering suite
galaxyproject/galaxy
Data intensive science for everyone.
lh3/seqtk
Toolkit for processing sequences in FASTA/Q formats
shenwei356/seqkit
A cross-platform and ultrafast toolkit for FASTA/Q file manipulation
MultiQC/MultiQC
Aggregate results from bioinformatics analyses across many samples into a single report.
crazyhottommy/getting-started-with-genomics-tools-and-resources
Unix, R and python tools for genomics and data science
lightaime/deep_gcns_torch
Pytorch Repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000(ICML'2021): https://www.deepgcns.org
scipipe/scipipe
Robust, flexible and resource-efficient pipelines using Go and the commandline
a-r-j/graphein
Protein Graph Library
mims-harvard/TDC
Therapeutics Commons (TDC-2): Multimodal Foundation for Therapeutic Science
shenwei356/csvtk
A cross-platform, efficient and practical CSV/TSV toolkit in Golang
plotly/react-plotly.js
A plotly.js React component from Plotly 📈
bigdatagenomics/adam
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
broadinstitute/cromwell
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
kexinhuang12345/DeepPurpose
A Deep Learning Toolkit for DTI, Drug Property, PPI, DDI, Protein Function Prediction (Bioinformatics)
hail-is/hail
Cloud-native genomic dataframes and batch computing
kblin/ncbi-genome-download
Scripts to download genomes from the NCBI FTP servers
scikit-bio/scikit-bio
scikit-bio: a community-driven Python library for bioinformatics, providing versatile data structures, algorithms and educational resources.
shenwei356/rush
A cross-platform command-line tool for executing jobs in parallel
MareesAT/GWA_tutorial
A comprehensive tutorial about GWAS and PRS
steineggerlab/foldseek
Foldseek enables fast and sensitive comparisons of large structure sets.