rexxxx1234's Stars
sjgosai/boda2
Computational Optimization of DNA Activity (CODA)
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
hongtaoh/cv_emulate
Academic CVs that you can emulate
lucidrains/alphafold3-pytorch
Implementation of Alphafold 3 in Pytorch
NVIDIA/RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
terry-r123/RNABenchmark
BEACON: Benchmark for Comprehensive RNA Tasks and Language Models
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
PiotrNawrot/nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
lucidrains/MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
evolutionaryscale/esm
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
apeterswu/Awesome-Bio-Foundation-Models
A collection of awesome bio-foundation models, including protein, RNA, DNA, gene, single-cell, and so on.
LucaOne/LucaOne
The LucaOne’s code, including model code and pre-training code.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
wukevin/proteinclip
Contrastive learning harmonizing protein language models and natural language models
kyegomez/AlphaFold3
Implementation of Alpha Fold 3 from the paper: "Accurate structure prediction of biomolecular interactions with AlphaFold3" in PyTorch
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
MAGICS-LAB/DNABERT_S
DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models
RyanWangZf/BioBridge
ICLR'24 | BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
bowang-lab/IntegrAO
Integrate Any Omics: Towards genome-wide data integration for patient stratification
Genentech/polygraph
Polygraph evaluates and compares groups of nucleic acid sequences based on their sequence and functional content for effective design of regulatory elements.
a96123155/UTR-LM
seanrjohnson/protein_gibbs_sampler
Gibbs sampling for generating protein sequences
seanrjohnson/protein_scoring
Generating and scoring novel enzyme sequences with a variety of models and metrics
Crystalcareai/BTX
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
PKU-YuanGroup/ProLLaMA
A Protein Large Language Model for Multi-Task Protein Language Processing
taokz/BiomedGPT
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks