lc150303

computer science student, screeper

University of Science and Technology of ChinaChina

lc150303's Stars

meta-llama/llama
Inference code for Llama models
Language:Python55.2k 515 9539.4k
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language:Python36.9k 431 1.6k3.2k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language:Jupyter Notebook35.1k 322 4304.1k
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language:Python25.3k 220 4572.9k
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python25.2k 174 1304.7k
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Language:Python18.1k 92 2161.9k
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python10.7k 125 217785
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Language:Python6.5k 63 79360
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook5.8k 70 982749
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook4.9k 62 368316
alibaba-damo-academy/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python4k 48 841456
TencentARC/T2I-Adapter
T2I-Adapter
Language:Python3.4k 40 109198
k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust
Language:C++3k 47 444342
xianfei/SysMocap
A real-time motion capture system for 3D virtual character animating.
Language:JavaScript2.5k 35 55410
sicxu/Deep3DFaceRecon_pytorch
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
Language:Python1.6k 26 175307
radekd91/emoca
Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild
Language:Python685 16 8089
Kedreamix/Awesome-Talking-Head-Synthesis
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
663 24 534
joonson/syncnet_python
Out of time: automated lip sync in the wild
Language:Python637 15 61142
marsbroshok/VAD-python
Voice Activity Detector in Python
Language:Python470 21 15133
Danial-Kord/DigiHuman
Automatic 3D Character animation using Pose Estimation and Landmark Generation techniques
Language:C#459 14 1274
DaddyJin/awesome-faceReenactment
papers about Face Reenactment/Talking Face Generation
441 35 145
endink/Mediapipe4u-plugin
Language:Dockerfile357 13 15250
psyai-net/EmoTalk_release
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
Language:Python331 12 2927
Advocate99/DiffGesture
[CVPR 2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
Language:Python222 12 2415
Aubrey-ao/HumanBehaviorAnimation
Language:Python171 17 1314
theEricMa/DiffSpeaker
This is the official repository for DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer
Language:Python132 7 915
PiSugar/sugar-wifi-conf
A BLE service on raspberry pi for wifi configuration and wireless control. 使用微信小程序随时随地设置树莓派wifi连接，控制树莓派
Language:JavaScript129 11 944
Qaanaaq/Face_Landmark_Link
creates live link app blendshape data formated in csv from video, for facial motion capture
Language:Python119 5 519
ZhuiyiTechnology/GAU-alpha
基于Gated Attention Unit的Transformer模型（尝鲜版）
Language:Python94 4 39
Hallway-Inc/AvatarWebKit
Web-first SDK that provides real-time ARKit-compatible 52 blend shapes from a camera feed, video or image at 60 FPS using ML models.
82 2 518

lc150303

lc150303's Stars

meta-llama/llama

LAION-AI/Open-Assistant

suno-ai/bark

Vision-CAIR/MiniGPT-4

svc-develop-team/so-vits-svc

kaixindelele/ChatPaper

InstantID/InstantID

mit-han-lab/streaming-llm

pyannote/pyannote-audio

tencent-ailab/IP-Adapter

alibaba-damo-academy/FunASR

TencentARC/T2I-Adapter

k2-fsa/sherpa-onnx

xianfei/SysMocap

sicxu/Deep3DFaceRecon_pytorch

radekd91/emoca

Kedreamix/Awesome-Talking-Head-Synthesis

joonson/syncnet_python

marsbroshok/VAD-python

Danial-Kord/DigiHuman

DaddyJin/awesome-faceReenactment

endink/Mediapipe4u-plugin

psyai-net/EmoTalk_release

Advocate99/DiffGesture

Aubrey-ao/HumanBehaviorAnimation

theEricMa/DiffSpeaker

PiSugar/sugar-wifi-conf

Qaanaaq/Face_Landmark_Link

ZhuiyiTechnology/GAU-alpha

Hallway-Inc/AvatarWebKit