OrcustD

OrcustD's Stars

RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 43.9k 232 1.7k 4.9k
ultralytics/ultralytics
Ultralytics YOLO11 🚀
Language: Python 39.2k 190 10.8k 7.6k
nilaoda/BBDown
Bilibili Downloader. A command-line Bilibili downloader.
Language: C# 11.3k 56 6991.4k
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language: Python 7.4k 50 225565
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Language: Python 6.7k 61 142491
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
Language: Python 6.1k 53 187654
facebookresearch/sapiens
High-resolution models for human tasks.
Language: Python 4.9k 46 198291
obss/sahi
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
Language: Python 4.5k 47 0 641
GuyTevet/motion-diffusion-model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
Language: Python 3.4k 71 228382
guoqincode/Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
Language: Python 2.9k 77 130241
cvlab-columbia/zero123
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
Language: Python 2.9k 43 129205
schrodingercatss/tuning_playbook_zh_cn
A playbook that systematically teaches you how to maximize the performance of deep learning models.
2.8k 16 5256
IDEA-Research/T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Language: Python 2.4k 39 89160
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Language: Python 1.8k 20 139196
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
Language: Python 1.6k 21 149205
piergiaj/pytorch-i3d
Language: Python 1k 11 81256
yohanshin/WHAM
Language: Python 827 31 11393
dyfcalid/CameraCalibration
Fisheye or Normal Camera Intrinsic and Extrinsic Calibration. Surround Camera Bird Eye View Generator.
Language: Python 699 10 20187
zju3dv/mvpose
Code for "Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views" (CVPR 2019, T-PAMI 2021)
Language: Jupyter Notebook 521 23 7479
google-deepmind/clrs
Language: Jupyter Notebook 478 16 17101
facebookresearch/ViewDiff
ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).
Language: Python 363 4 1722
yufu-wang/tram
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos
Language: Python 362 14 2935
nv-tlabs/vid2player3d
Official implementation for SIGGRAPH 2023 paper "Learning Physically Simulated Tennis Skills from Broadcast Videos"
Language: Python 257 19 1028
SilvioGiancola/SoccerNetv2-DevKit
Development Kit for the SoccerNet Challenge
Language: Python 184 8 5840
Dou-Yiming/Pose_to_SMPL
A tool to fit SMPL parameters from 3D-pose datasets that contain key-points of human body.
Language: Python 125 3 813
yastrebksv/TrackNet
Unofficial PyTorch implementation of TrackNet
Language: Python 110 3 826
sithu31296/pose-estimation
Easy to use SOTA Top-Down Multi-person Pose Estimation Models in PyTorch
Language: Python 51 1 913
MiraPurkrabek/RePoGen
The official repository of the RePoGen paper
Language: Python 47 3 86
NumesSanguis/Blender-ZMQ-add-on
Blender 3.6, 3.3, 2.93 & 2.8x add-on that allows streaming of data into Blender without freezing the interface (using ZeroMQ sockets).
Language: Python 35 5 15
catboyjeans/Blender-Socket-Communication
Blender Socket implementation with Matlab/Simulink for data visualization purposes
Language: Python 61