Pinned Repositories
alphafold3-pytorch
Implementation of AlphaFold 3 from Google DeepMind in PyTorch
DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's text-to-image transformer, in PyTorch
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
imagen-pytorch
Implementation of Imagen, Google's text-to-image neural network, in PyTorch
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
transfusion-pytorch
PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI
vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
lucidrains's Repositories
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
lucidrains/alphafold3-pytorch
Implementation of AlphaFold 3 from Google DeepMind in PyTorch
lucidrains/native-sparse-attention-pytorch
Implementation of the sparse attention pattern proposed by the DeepSeek team in their "Native Sparse Attention" paper
lucidrains/BS-RoFormer
Implementation of Band Split RoFormer, a SOTA attention network for music source separation out of ByteDance AI Labs
lucidrains/clinical-calculator-tooluse
Explorations into training LLMs to use clinical calculators from patient history, using open sourced models. Will start with Wells' Criteria
lucidrains/se3-transformer-pytorch
Implementation of SE3-Transformers for Equivariant Self-Attention, in PyTorch. This specific repository is geared towards integration with an eventual AlphaFold2 replication.
lucidrains/gradnorm-pytorch
A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in PyTorch
lucidrains/evolutionary-policy-optimization
PyTorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University
lucidrains/x-transformers-rl
Implementation of a transformer for reinforcement learning using `x-transformers`
lucidrains/h-net-dynamic-chunking
Implementation of the dynamic chunking mechanism in H-Net by Hwang et al. of Carnegie Mellon
lucidrains/HRM
Exploration into the proposed architecture from Sapient Intelligence of Singapore 🇸🇬
lucidrains/TRI-LBM
Implementation of the Large Behavioral Model architecture for dexterous manipulation from Toyota Research Institute
lucidrains/HS-TasNet
Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet"
lucidrains/simplicial-attention
Implementation of 2-simplicial attention, proposed by Clift et al. (2019), and of the recent attempt to make it practical in "Fast and Simplex", Roy et al. (2025)
lucidrains/contrastive-rl
Contrastive Reinforcement Learning
lucidrains/neat
Explorations into NEAT and some of its derivative research
lucidrains/amplify-pytorch
Implementation of Amplify, Actionless Motion Priors for Robot Learning from Videos
lucidrains/locoformer
LocoFormer - Generalist Locomotion via Long-Context Adaptation
lucidrains/lbm-training-framework
Training framework for Large Behavioral Models
lucidrains/lookahead-keys-attention
Causal Attention with Lookahead Keys
lucidrains/SRT-H
Implementation of the model architecture for SRT-H
lucidrains/villa-X
Implementation of ViLLA-X, Enhancing Latent Action Modeling in Vision-Language-Action Models
lucidrains/lucidrains.github.io
lucidrains/jvp_flash_attention
Flash Attention Triton kernel with support for second-order derivatives
lucidrains/nim-mmcif
Parser for mmCIF files in Nim
lucidrains/Berkeley-Humanoid-Lite
Codebase for Berkeley Humanoid Lite
lucidrains/packages
List of packages for Nimble