LinMu7177's Stars
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
LargeWorldModel/LWM
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
xinghaochen/awesome-hand-pose-estimation
Awesome work on hand pose estimation/tracking
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
JonathonLuiten/Dynamic3DGaussians
weihaox/awesome-digital-human
A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.
lxtGH/OMG-Seg
OMG-LLaVA and OMG-Seg codebase
Meituan-AutoML/MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
NUS-HPC-AI-Lab/Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
YuelangX/Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
X-T-E-R/Uni-TTS
本项目意图在于让使用各类语音合成引擎的方式变得统一,支持多种语音合成引擎适配器,允许直接作为模组使用或启动后端服务
snap-research/Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
TIGER-AI-Lab/AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks"
AIRI-Institute/HairFastGAN
Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"
showlab/VideoSwap
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
DaiShiResearch/TransNeXt
[CVPR 2024] Code release for TransNeXt model
google-research/syn-rep-learn
Learning from synthetic data - code and models
UX-Decoder/FIND
patienceFromZhou/simpleHand
This is the project page for paper "A Simple Baseline for Efficient Hand Mesh Reconstruction, CVPR2024"
HaiminLuo/GaussianHair
A novel explicit hair representation. It enables comprehensive modeling of hair geometry and appearance from images, fostering innovative illumination effects and dynamic animation capabilities.
casper9429-kth/Siamese-Masked-Autoencoders---Learning-and-Exploration
Course: DD2412 Deep Learning Advanced at KTH Project by Casper, Magnus, and Friso Focus: Self-supervised learning and computer vision with SiamMAE. Replicating core results and potential research extensions.
GuoQiushan/regiongpt.github.io