f1yfisher

f1yfisher's Stars

NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers, and a video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose-built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts, and generate videos.
Language: Python | Stars: 2.4k | Forks: 118
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language: Python | Stars: 15.2k | Forks: 1.2k
Johanan528/DepthLab
Official implementation of "DepthLab: From Partial to Complete"
Language:Python37820
KovenYu/WonderWorld
Code release for https://kovenyu.com/WonderWorld/
Language:Python39416
ziyc/drivestudio
A 3DGS framework for omni urban scene reconstruction and simulation.
Language:Python66462
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Language: Python | Stars: 1.1k | Forks: 40
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
Language: Python | Stars: 22k | Forks: 1.8k
esdolo/FreeVS
FreeVS: Generative View Synthesis on Free Driving Trajectory
741
f1yfisher/DriveDreamer2
Language:Python1278
wzzheng/Stag
Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
Language:Python653
GigaAI-research/ReconDreamer
Language:Python906
JeffWang987/DriveDreamer
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Language:Python39619
GigaAI-research/DriveDreamer4D
Language:Python12111
alsyundawy/Microsoft-Office-For-MacOS
Installer & Activated Microsoft Office for macOS
Stars: 3k | Forks: 359
waymo-research/waymo-open-dataset
Waymo Open Dataset
Language: Python | Stars: 2.8k | Forks: 624
visitworld123/FedFed
[NeurIPS 2023] "FedFed: Feature Distillation against Data Heterogeneity in Federated Learning"
Language:Python1068
thu-ml/cond-image-leakage
Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024)
30029
maitrix-org/Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
Language:Python49235
jjihwan/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)
Language:Python42029
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language: Python | Stars: 1.8k | Forks: 183
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language: Python | Stars: 39k | Forks: 4.3k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language: Python | Stars: 23k | Forks: 2.3k
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Language: Python | Stars: 11.9k | Forks: 1k
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language: Python | Stars: 2.5k | Forks: 270
DriveDreamer2/DriveDreamer2.github.io
Language:JavaScript21
pengsida/learning_research
My research experience
Stars: 6.2k | Forks: 368
waymo-research/waymax
A JAX-based simulator for autonomous driving research.
Language:Python870101
PointsCoder/GPT-Driver
Learning to Drive with GPT
Language:Python24912
princeton-vl/DROID-SLAM
Language: Python | Stars: 1.9k | Forks: 305
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Stars: 3.7k | Forks: 212