OrcustD's Stars
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ultralytics/ultralytics
Ultralytics YOLO11 🚀
nilaoda/BBDown
Bilibili Downloader. A command-line Bilibili downloader.
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
facebookresearch/sapiens
High-resolution models for human tasks.
obss/sahi
Framework-agnostic sliced/tiled inference + interactive UI + error analysis plots
GuyTevet/motion-diffusion-model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
guoqincode/Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
cvlab-columbia/zero123
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
schrodingercatss/tuning_playbook_zh_cn
A playbook that systematically teaches you how to maximize the performance of deep learning models.
IDEA-Research/T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
piergiaj/pytorch-i3d
yohanshin/WHAM
dyfcalid/CameraCalibration
Fisheye or Normal Camera Intrinsic and Extrinsic Calibration. Surround Camera Bird's-Eye View Generator.
zju3dv/mvpose
Code for "Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views" (CVPR 2019, T-PAMI 2021)
google-deepmind/clrs
facebookresearch/ViewDiff
ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).
yufu-wang/tram
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos
nv-tlabs/vid2player3d
Official implementation for SIGGRAPH 2023 paper "Learning Physically Simulated Tennis Skills from Broadcast Videos"
SilvioGiancola/SoccerNetv2-DevKit
Development Kit for the SoccerNet Challenge
Dou-Yiming/Pose_to_SMPL
A tool to fit SMPL parameters from 3D-pose datasets that contain key-points of human body.
yastrebksv/TrackNet
Unofficial PyTorch implementation of TrackNet
sithu31296/pose-estimation
Easy to use SOTA Top-Down Multi-person Pose Estimation Models in PyTorch
MiraPurkrabek/RePoGen
The official repository of the RePoGen paper
NumesSanguis/Blender-ZMQ-add-on
Blender 3.6, 3.3, 2.93 & 2.8x add-on that allows streaming of data into Blender without freezing the interface (using ZeroMQ sockets).
catboyjeans/Blender-Socket-Communication
Blender Socket implementation with Matlab/Simulink for data visualization purposes