XiaoyuShi97

Ph.D@MMLab

Hong Kong

XiaoyuShi97's Stars

RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python35.7k 210 1.3k4.1k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python22.2k 187 5042.2k
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language:Python7k 49 216539
ChaoningZhang/MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Language:Jupyter Notebook4.8k 43 125503
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
3.6k 31 88150
praydog/UEVR
Universal Unreal Engine VR Mod (4.8 - 5.4)
Language:C++3.2k 49 258162
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Language:Python1.4k 26 7368
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
Language:Python1.3k 50 3571
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Language:Python1.2k 14 9065
Junyi42/monst3r
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
Language:Python803 38 4039
henry123-boy/SpaTracker
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Language:Python732 59 4225
pixeli99/SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
Language:Python606 13 5660
OpenGVLab/DCNv4
[CVPR 2024] Deformable Convolution v4
Language:Python514 8 8727
maitrix-org/Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
Language:Python477 17 834
hehao13/CameraCtrl
Language:Python435 12 1619
segmind/segmoe
Language:Python406 7 2424
tgxs002/HPSv2
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Language:Jupyter Notebook398 10 4112
bytedance/particle-sfm
ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild. ECCV 2022.
Language:C++278 15 2123
mbzuai-oryx/VideoGPT-plus
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Language:Python216 5 2614
lixiaoyu2000/Poly-MOT
Official Repo For IROS 2023 Accepted Paper "Poly-MOT"
Language:Python165 5 3429
LeonHLJ/FouriScale
Official implementation of FouriScale (ECCV2024)
Language:Python137 11 75
CaraJ7/CoMat
[Neurips 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Language:Python134 16 136
G-U-N/Motion-I2V
[SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
Language:Python105 9 78
rongyaofang/PUMA
Empowering Unified MLLM with Multi-granular Visual Generation
105 5 11
wwsource/SceneTracker
SceneTracker: Long-term Scene Flow Estimation Network
Language:Python104 9 111
jianghd1996/Camera-control
This project explores the opportunities of deep learning for camera control in virtual cinematography.
Language:Python82 3 57
Mawiszus/World-GAN
Official repository for "World-GAN: a Generative Model for Minecraft Worlds" by Maren Awiszus, Frederik Schubert and Bodo Rosenhahn.
Language:Python73 8 57
OmicsML/CellPLM
Official repo for CellPLM: Pre-training of Cell Language Model Beyond Single Cells.
Language:Jupyter Notebook67 3 206
wwsource/SplatFlow
[IJCV 2024] SplatFlow: Learning Multi-frame Optical Flow via Splatting
Language:Python43 8 22
wwsource/SplatFlow3D
13 7 10

XiaoyuShi97

XiaoyuShi97's Stars

RVC-Boss/GPT-SoVITS

hpcaitech/Open-Sora

LiheYoung/Depth-Anything

ChaoningZhang/MobileSAM

deepseek-ai/DeepSeek-V2

praydog/UEVR

dvlab-research/ControlNeXt

TencentARC/MotionCtrl

THUDM/ImageReward

Junyi42/monst3r

henry123-boy/SpaTracker

pixeli99/SVD_Xtend

OpenGVLab/DCNv4

maitrix-org/Pandora

hehao13/CameraCtrl

segmind/segmoe

tgxs002/HPSv2

bytedance/particle-sfm

mbzuai-oryx/VideoGPT-plus

lixiaoyu2000/Poly-MOT

LeonHLJ/FouriScale

CaraJ7/CoMat

G-U-N/Motion-I2V

rongyaofang/PUMA

wwsource/SceneTracker

jianghd1996/Camera-control

Mawiszus/World-GAN

OmicsML/CellPLM

wwsource/SplatFlow

wwsource/SplatFlow3D