wepe's Stars
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
deepfakes/faceswap
Deepfakes Software For All
iperov/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
babysor/MockingBird
🚀 AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
THUDM/CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
erikbern/ann-benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
IDEA-CCNL/Fengshenbang-LM
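Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.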
fundamentalvision/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
PennyLaneAI/pennylane
PennyLane is a cross-platform Python library for quantum computing, quantum machine learning, and quantum chemistry. Train a quantum computer the same way as a neural network.
LC1332/Chat-Haruhi-Suzumiya
Chat-Haruhi-Suzumiya (Chat凉宫春日), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.
aerophile/awesome-deepfakes
Everything Deepfakes
bilibili/LastOrder-Dota2
Dota2 AI bot
reczoo/BARS
BARS: Towards Open Benchmarking for Recommender Systems https://openbenchmark.github.io/BARS
huawei-noah/benchmark
guocheng18/Sequential-Recommendation-Datasets
Download and preprocess popular sequential recommendation datasets
princeton-nlp/EntityQuestions
EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535