haibao-yu's Stars
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
danijar/dreamerv3
Mastering Diverse Domains through World Models
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
opendilab/LMDrive
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
shalfun/DrivingDiffusion
Layout-Guided multi-view driving scene video generation with latent diffusion model
wzzheng/OccWorld
[ECCV 2024] 3D World Model for Autonomous Driving
JeffWang987/DriveDreamer
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
commaai/commavq
commaVQ is a dataset of compressed driving video
OpenDriveLab/ViDAR
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
DiT-3D/DiT-3D
🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"
wenyuqing/panacea
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
HazyResearch/spacetime
Code for SpaceTime 🌌⏱️. Proposed in Effectively Modeling Time Series with Simple Discrete State Spaces, ICLR 2023.
NVlabs/T-Stitch
Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"
AIR-THU/DAIR-RCooper
[CVPR2024] Official implementation of "RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception"
Jiankai-Sun/PlanCP
[NeurIPS 2023] Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model
GenerativeAD/Awesome-GenerativeModel4AD-Survey