Pinned Repositories
optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
awesome-speaker-embedding
A curated list of speaker-embedding, speaker-verification, and speaker-identification resources.
lectures
Companion materials for Bilibili video courses
mandarin-tts
Chinese Mandarin text-to-speech (TTS) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets
nlpcc2023-shared-task-diaASQ
First-place solution for the NLPCC 2023 DiaASQ shared task (conversational fine-grained sentiment recognition).
pinyin2hanzi
Pinyin to hanzi: convert pinyin to Chinese characters using deep networks
quantizations
A collection of quantization recipes for various large models including Llama-2-70B, QWen-14B, Baichuan-2-13B, and more.
speaker-verification
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
ranchlai's Repositories
ranchlai/pinyin2hanzi
Pinyin to hanzi: convert pinyin to Chinese characters using deep networks
ranchlai/clip.paddle
OpenAI CLIP implementation in PaddlePaddle
ranchlai/wav2vec-2.0
Wav2vec2 English speech recognition in PaddlePaddle
ranchlai/VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
ranchlai/dcase2021_task1b
ranchlai/waveglow
A Flow-based Generative Network for Speech Synthesis
ranchlai/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
ranchlai/dcase-2021-baseline
Baseline system for Automated Audio Captioning task (Task 6) in DCASE 2021 Challenge
ranchlai/DeepLearningForAudioWithPython
Code and slides for the "Deep Learning (For Audio) With Python" course on TheSoundOfAI Youtube channel.
ranchlai/einops
Deep learning operations reinvented (for pytorch, tensorflow, jax and others)
ranchlai/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
ranchlai/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)
ranchlai/spec
This repository contains the specifications.
ranchlai/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference