wangqq2018's Stars
EbookFoundation/free-programming-books
:books: Freely available programming books
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
babysor/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
mli/paper-reading
深度学习经典、新论文逐段精读
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
microsoft/nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
ninja-build/ninja
a small build system with a focus on speed
PaddlePaddle/PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
microsoft/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
mdnice/markdown-nice
支持主题设计的 Markdown 编辑器,让排版变 Nice
ARM-software/ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
facebook/Ax
Adaptive Experimentation Platform
ARM-software/armnn
Arm NN ML Software. The code here is a read-only mirror of https://review.mlplatform.org/admin/repos/ml/armnn
JiehangXie/PaddleBoBo
基于飞桨开发的虚拟主播
VKCOM/YouTokenToMe
Unsupervised text tokenizer focused on computational efficiency
wafer9/transducer-net