lzzbutphh's Stars
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
gpt-omni/mini-omni2
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
apachecn/ituring-math-stat-book
:books: 图灵数学统计学丛书
Tencent/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
zmliao/EfficientGS
Efficient 3D Gaussian Splatting
ossrs/ffmpeg-webrtc
Support WebRTC(WHIP) for FFmpeg.
johndpope/MegaPortrait
Implementation of Megaportrait
yerfor/MimicTalk
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
yerfor/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
johndpope/VASA-1-hack
Using Claude Sonnet 3.5 to forward (reverse) engineer code from VASA white paper - WIP - (this is for La Raza 🎷)
dongxiaoke/VASA-1
Implementation of VASA-1
jdh-algo/MHAD-Dataset
Multimodal Home Activity Dataset with Multi-Angle Videos and Synchronized Physiological Signals
jdh-algo/JoyVASA
hehao13/CameraCtrl
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
uniBruce/Mead
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]
MRzzm/HDTF
the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
johndpope/Emote-hack
Emote Portrait Alive - using ai to reverse engineer code from white paper. (abandoned)
weijie-chen/Linear-Algebra-With-Python
Lecture Notes for Linear Algebra Featuring Python. This series of lecture notes will walk you through all the must-know concepts that set the foundation of data science or advanced quantitative skillsets. Suitable for statistician/econometrician, quantitative analysts, data scientists and etc. to quickly refresh the linear algebra with the assistance of Python computation and visualization.
nerfstudio-project/gsplat
CUDA accelerated rasterization of gaussian splatting
hbb1/torch-splatting
A pure pytorch implementation of 3D gaussian Splatting
limacv/GaussianSplattingViewer
Tiny Gaussian Splatting Viewer
mkkellogg/GaussianSplats3D
Three.js-based implementation of 3D Gaussian splatting
playcanvas/supersplat
3D Gaussian Splat Editor
Florian-Barthel/splatviz
Full python interactive 3D Gaussian Splatting viewer for real-time editing and analyzing.
hbb1/2d-gaussian-splatting
[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields