f1yfisher's Stars
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Johanan528/DepthLab
Official implementation of "DepthLab: From Partial to Complete"
KovenYu/WonderWorld
Code release for https://kovenyu.com/WonderWorld/
ziyc/drivestudio
A 3DGS framework for omni urban scene reconstruction and simulation.
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
esdolo/FreeVS
FreeVS: Generative View Synthesis on Free Driving Trajectory
f1yfisher/DriveDreamer2
wzzheng/Stag
Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
GigaAI-research/ReconDreamer
JeffWang987/DriveDreamer
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
GigaAI-research/DriveDreamer4D
alsyundawy/Microsoft-Office-For-MacOS
Installer & Activited Microsoft Office For MacOS
waymo-research/waymo-open-dataset
Waymo Open Dataset
visitworld123/FedFed
[NeurIPS 2023] "FedFed: Feature Distillation against Data Heterogeneity in Federated Learning"
thu-ml/cond-image-leakage
Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024)
maitrix-org/Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
jjihwan/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
DriveDreamer2/DriveDreamer2.github.io
pengsida/learning_research
本人的科研经验
waymo-research/waymax
A JAX-based simulator for autonomous driving research.
PointsCoder/GPT-Driver
Learning to Drive with GPT
princeton-vl/DROID-SLAM
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.