AlexMaOLS's Stars
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Stability-AI/generative-models
Generative Models by Stability AI
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
state-spaces/mamba
Mamba SSM architecture
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
Farama-Foundation/HighwayEnv
A minimalist environment for decision-making in autonomous driving
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
google-research/deduplicate-text-datasets
thu-ml/RoboticsDiffusionTransformer
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
GFNOrg/gflownet
Generative Flow Networks
autonomousvision/tuplan_garage
[CoRL'23] Parting with Misconceptions about Learning-based Vehicle Motion Planning
yizt/numpy_neural_network
仅使用numpy从头开始实现神经网络,包括反向传播公式推导过程; numpy构建全连接层、卷积层、池化层、Flatten层;以及图像分类案例及精调网络案例等,持续更新中... ...
GFNOrg/torchgfn
A modular, easy to extend GFlowNet library
kuleshov-group/caduceus
Bi-Directional Equivariant Long-Range DNA Sequence Modeling
bhyang/diffusion-es
nv-tlabs/trace
Official implementation of TRACE, the TRAjectory Diffusion Model for Controllable PEdestrians, from the CVPR 2023 paper: "Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion".
BAAI-DCAI/DataOptim
A collection of visual instruction tuning datasets.
BAAI-DCAI/Training-Data-Synthesis
[ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching
nv-tlabs/pacer
Official implementation of PACER, Pedestrian Animation ControllER, of CVPR 2023 paper: "Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion".
scxue/DM-NonUniform
Official code for Accelerating Diffusion Sampling with Optimized Time Steps (CVPR 2024)
ling-pan/GAFN
scxue/SA-Solver
Official code for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models (NeurIPS 2023)