chenyibo89's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
JushBJJ/Mr.-Ranedeer-AI-Tutor
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
s0md3v/roop
one-click face swap
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
microsoft/ai-edu
AI education materials for Chinese students, teachers and IT professionals.
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
InstantID/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, try Sync Labs.
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
ZheC/Realtime_Multi-Person_Pose_Estimation
Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
VainF/Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
lucidrains/soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch
minivision-ai/Silent-Face-Anti-Spoofing
Silent liveness detection (Silent-Face-Anti-Spoofing)
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch
minivision-ai/Silent-Face-Anti-Spoofing-APK
OpenLLMAI/OpenLLMWiki
OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction, and domain/task adaptation of open-source ChatGPT alternatives/implementations. PiXiu-貔貅 means fortune.
kenkawakenkenke/stickfigure-recorder
Website to generate stick figure gifs from webcam video.
chenjshnn/Object-Detection-for-Graphical-User-Interface
Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination?
adrianagaler/Snoring-Detection
Tiny Machine Learning Snoring Detection Model for Embedded devices - Adriana Rotaru
anpengjin/query_repeat_part_by_audio
Ad duplicate detection; audio fingerprinting; inverted index; audio deduplication; audio retrieval
DragonKing2014/SnoringDetection
This web project and MATLAB project are used for snoring detection. I use MATLAB to implement my algorithm and JavaEE to configure and build my software.
Nyceane/Jetson-SnoreAI
Jetson Nano for Snore Detection