JeffSoong

JeffSoong's Stars

RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 33.4k 204 1.2k 3.8k
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Language: Jupyter Notebook 8.9k 89 333842
LargeWorldModel/LWM
Language: Python 7.1k 66 71549
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language: Jupyter Notebook 7k 74 205440
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language: Python 4.9k 49 441373
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language: Python 4.5k 47 191446
jamesmh/coravel
Near-zero config .NET library that makes advanced application features like Task Scheduling, Caching, Queuing, Event Broadcasting, and more a breeze!
Language: C# 3.8k 65 220252
cocopon/tweakpane
🎛️ Compact GUI for fine-tuning parameters and monitoring value changes
Language: TypeScript 3.6k 29 25690
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Language: Python 3.2k 29 347313
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Language: Python 2.9k 30 111185
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Language: Python 2.8k 47 53258
modelscope/swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Language: Python 2.7k 19 758245
modelscope/modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
Language: Python 2.6k 37 203302
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
Language: Python 1.9k 24 90121
QwenLM/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
Language: Python 1.4k 25 67105
NVlabs/FoundationPose
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Language: Python 1.4k 29 226185
apple/ml-neuman
Official repository of NeuMan: Neural Human Radiance Field from a Single Video (ECCV 2022)
Language: Python 1.3k 34 93141
yyyujintang/Awesome-Mamba-Papers
Awesome Papers related to Mamba.
1.1k 28 1361
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Language: Python 976 15 3847
lizhe00/AnimatableGaussians
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
Language: Python 895 42 4759
OpenGVLab/VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
Language: Python 796 12 8960
roboterax/humanoid-gym
Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer https://arxiv.org/abs/2404.05695
Language: Python 699 13 26115
ShenhanQian/GaussianAvatars
[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
Language: Python 542 29 5885
nomic-ai/contrastors
Train Models Contrastively in Pytorch
Language: Python 512 12 3937
OpenGVLab/Vision-RWKV
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Language: Python 346 5 3714
niuzaisheng/ScreenAgent
ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)
Language: Python 271 6 3126
JeffWang987/WorldDreamer
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
192 21 44
NationalGAILab/HoT
[CVPR 2024 🔥] Official implementation of the paper "⏳ Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation"
Language: Python 179 3 918
ZhengyiLuo/SMPLSim
Simulating SMPL humanoid, supporting PHC/PHC-MJX/PULSE/SimXR code bases.
Language: Python 116 8 46
OpenGVLab/Hulk
An official implementation of "Hulk: A Universal Knowledge Translator for Human-Centric Tasks"
Language: Python 86 2 174