lc150303's Stars
meta-llama/llama
Inference code for Llama models
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
alibaba-damo-academy/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
TencentARC/T2I-Adapter
T2I-Adapter
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
xianfei/SysMocap
A real-time motion capture system for 3D virtual character animating.
sicxu/Deep3DFaceRecon_pytorch
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
radekd91/emoca
Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild
Kedreamix/Awesome-Talking-Head-Synthesis
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
joonson/syncnet_python
Out of time: automated lip sync in the wild
marsbroshok/VAD-python
Voice Activity Detector in Python
Danial-Kord/DigiHuman
Automatic 3D Character animation using Pose Estimation and Landmark Generation techniques
DaddyJin/awesome-faceReenactment
papers about Face Reenactment/Talking Face Generation
endink/Mediapipe4u-plugin
psyai-net/EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
Advocate99/DiffGesture
[CVPR 2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
Aubrey-ao/HumanBehaviorAnimation
theEricMa/DiffSpeaker
This is the official repository for DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer
PiSugar/sugar-wifi-conf
A BLE service on raspberry pi for wifi configuration and wireless control. 使用微信小程序随时随地设置树莓派wifi连接,控制树莓派
Qaanaaq/Face_Landmark_Link
creates live link app blendshape data formated in csv from video, for facial motion capture
ZhuiyiTechnology/GAU-alpha
基于Gated Attention Unit的Transformer模型(尝鲜版)
Hallway-Inc/AvatarWebKit
Web-first SDK that provides real-time ARKit-compatible 52 blend shapes from a camera feed, video or image at 60 FPS using ML models.