chenshuo20

Undergraduate Student

Tsinghua University, Weiyang CollegeBeijing, China

chenshuo20's Stars

HengyiWang/spann3r
3D Reconstruction with Spatial Memory
Language:Python1383
cocktailpeanut/fluxgym
Dead simple FLUX LoRA training UI with LOW VRAM support
Language:Python54432
Drexubery/ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Language:Python54014
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Language:Python1.3k139
liuff19/ReconX
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
40312
btsmart/splatt3r
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
Language:Python41215
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
Language:Python1.6k171
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Language:Python77436
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
Language:Python1.6k108
Bujiazi/MotionClone
Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Language:Python36127
Florian-Barthel/splatviz
Full python interactive 3D Gaussian Splatting viewer for real-time editing and analyzing.
Language:Python1k69
XLabs-AI/x-flux
Language:Python1.3k89
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Language:Python1.6k76
xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters
Language:Python46840
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Language:Python34421
DL3DV-10K/Dataset
News: the 10k dataset is ready for download.
Language:HTML2724
apple/ml-mdm
Train high-quality text-to-image diffusion models in a data & compute efficient manner
Language:Python39821
SAIS-FUXI/VidGen
Language:Python513
THUDM/CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python7.2k659
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python13.4k947
IDEA-Research/TAPTR
[ECCV 2024] Official implementation of the paper "TAPTR: Tracking Any Point with Transformers as Detection"
Language:Python18512
NVlabs/InstantSplat
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
Language:Python68832
colmap/glomap
GLOMAP - Global Structured-from-Motion Revisited
Language:C++1.3k76
nianticlabs/acezero
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
Language:Python57534
lilygoli/SpotLessSplats
Code for SpotLessSplats: Ignoring Distractors in 3D Gaussian Splatting built on gsplat codebase.
Language:Cuda946
louaaron/Score-Entropy-Discrete-Diffusion
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
Language:Python35033
mihirp1998/VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
Language:Python19315
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python9.3k1.2k
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language:Python2.4k192
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language:Python1.8k107

chenshuo20

chenshuo20's Stars

HengyiWang/spann3r

cocktailpeanut/fluxgym

Drexubery/ViewCrafter

Picsart-AI-Research/StreamingT2V

liuff19/ReconX

btsmart/splatt3r

Vchitect/Latte

showlab/Show-o

NUS-HPC-AI-Lab/VideoSys

Bujiazi/MotionClone

Florian-Barthel/splatviz

XLabs-AI/x-flux

PixArt-alpha/PixArt-sigma

xdit-project/xDiT

Vchitect/VEnhancer

DL3DV-10K/Dataset

apple/ml-mdm

SAIS-FUXI/VidGen

THUDM/CogVideo

black-forest-labs/flux

IDEA-Research/TAPTR

NVlabs/InstantSplat

colmap/glomap

nianticlabs/acezero

lilygoli/SpotLessSplats

louaaron/Score-Entropy-Discrete-Diffusion

mihirp1998/VADER

huggingface/trl

Doubiiu/DynamiCrafter

facebookresearch/chameleon