lucidrains

Working with Attention. It's all we need

San Francisco

Pinned Repositories

alphafold3-pytorch
Implementation of AlphaFold 3 from Google DeepMind in PyTorch
Language: Python 1.5k 46 56203
DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch
Language: Python 5.6k 95 277646
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
Language: Python 11.3k 122 2121.1k
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
Language: Python 10k 39 3061.2k
imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
Language: Python 8.4k 117 301792
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language: Python 7.9k 136 52682
transfusion-pytorch
PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI
Language: Python 1.2k 32 3855
vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
Language: Python 3.6k 35 162290
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Language: Python 24k 160 2733.4k
x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Language: Python 5.6k 57 261479

lucidrains's Repositories

lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Language: Python 24k 160 2733.4k
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language: Python 7.9k 136 52682
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Language: Python 5.6k 57 261479
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
Language: Python 3.6k 35 162290
lucidrains/alphafold3-pytorch
Implementation of AlphaFold 3 from Google DeepMind in PyTorch
Language: Python 1.5k 46 56203
lucidrains/native-sparse-attention-pytorch
Implementation of the sparse attention pattern proposed by the DeepSeek team in their "Native Sparse Attention" paper
Language: Python 759 6 2248
lucidrains/BS-RoFormer
Implementation of Band Split RoFormer, SOTA attention network for music source separation out of ByteDance AI Labs
Language: Python 624 13 4024
lucidrains/clinical-calculator-tooluse
Explorations into training LLMs to use clinical calculators from patient history, using open-source models. Will start with Wells' Criteria
Language: Python 316 31 431
lucidrains/se3-transformer-pytorch
Implementation of SE3-Transformers for Equivariant Self-Attention, in PyTorch. This specific repository is geared towards integration with eventual AlphaFold2 replication.
Language: Python 305 11 1826
lucidrains/gradnorm-pytorch
A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in PyTorch
Language: Python 108 2 104
lucidrains/evolutionary-policy-optimization
PyTorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University
Language: Python 98 4 13
lucidrains/x-transformers-rl
Implementation of a transformer for reinforcement learning using `x-transformers`
Language: Python 696
lucidrains/h-net-dynamic-chunking
Implementation of the dynamic chunking mechanism in H-Net by Hwang et al. of Carnegie Mellon
Language: Python 64 4 11
lucidrains/HRM
Exploration into the proposed architecture from Sapient Intelligence of Singapore 🇸🇬
Language: Python 63 2 24
lucidrains/TRI-LBM
Implementation of the Large Behavioral Model architecture for dexterous manipulation from Toyota Research Institute
Language: Python 61 4 03
lucidrains/HS-TasNet
Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet"
Language: Python 60 3 13
lucidrains/simplicial-attention
Implementation of 2-simplicial attention, proposed by Clift et al. (2019), and of the recent attempt to make it practical in "Fast and Simplex", Roy et al. (2025)
Language: Python 46 0 12
lucidrains/contrastive-rl
Contrastive Reinforcement Learning
Language: Python 44 3 02
lucidrains/neat
Explorations into NEAT and some of its derivative research
Language: Nim 30 0 11
lucidrains/amplify-pytorch
Implementation of Amplify, Actionless Motion Priors for Robot Learning from Videos
Language: Python 291
lucidrains/locoformer
LocoFormer - Generalist Locomotion via Long-Context Adaptation
Language: Python 29
lucidrains/lbm-training-framework
Training framework for Large Behavioral Models
Language: Python 241
lucidrains/lookahead-keys-attention
Causal Attention with Lookahead Keys
Language: Python 23 0 01
lucidrains/SRT-H
Implementation of the model architecture for SRT-H
Language: Python 202
lucidrains/villa-X
Implementation of ViLLA-X, Enhancing Latent Action Modeling in Vision-Language-Action Models
Language: Python 18 2 0
lucidrains/lucidrains.github.io
Language: HTML 12 1 01
lucidrains/jvp_flash_attention
Flash Attention Triton kernel with support for second-order derivatives
Language: Python 10 0 0
lucidrains/nim-mmcif
Parser for mmCIF files in Nim
Language: Python 6
lucidrains/Berkeley-Humanoid-Lite
Codebase for Berkeley Humanoid Lite
Language: Python 5 1 0
lucidrains/packages
List of packages for Nimble
Language: Nim