Pinned Repositories
aft-pytorch
Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.
algorithm
ar-vits
text to speech using autoregressive transformer and VITS
Automatic-Prosody-Annotation
Awesome-pytorch-list
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
BERT-NER
Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).
Electra_CRF_NER
We start a company-name recognition task with a small scale and low quality training data, then using skills to enhanced model training speed and predicting performance with least artificial participation. The methods we use involve lite pre-training models such as Albert-small or Electra-small with financial corpus, knowledge of distillation and multi-stage learning. The result is that we improve the recall rate of company names recognition task from 0.73 to 0.92 and get 4 times as fast as BERT-Bilstm-CRF model.
learningtensorflow.com
select_feature
VoiceFlow-TTS
This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
zshy1205's Repositories
zshy1205/VoiceFlow-TTS
This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
zshy1205/ar-vits
text to speech using autoregressive transformer and VITS
zshy1205/Automatic-Prosody-Annotation
zshy1205/big_dataloader
zshy1205/CUCVAE-TTS
zshy1205/dataspeech
zshy1205/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
zshy1205/espnet
End-to-End Speech Processing Toolkit
zshy1205/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
zshy1205/fish-speech
Brand new TTS solution
zshy1205/flash-attention
Fast and memory-efficient exact attention
zshy1205/FlatTN
Chinese Text Normalization and Dataset
zshy1205/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
zshy1205/g2pW
Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音
zshy1205/HierSpeech
zshy1205/LEBERT
Code for the ACL2021 paper "Lexicon Enhanced Chinese Sequence Labelling Using BERT Adapter"
zshy1205/MSMC-TTS
Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS
zshy1205/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
zshy1205/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
zshy1205/Phone-Level-Mixture-Density-Network-for-TTS
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
zshy1205/pits
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
zshy1205/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
zshy1205/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
zshy1205/StyleSpeech
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
zshy1205/SyntaSpeech
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
zshy1205/TorchBlocks
A PyTorch-based toolkit for natural language processing
zshy1205/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
zshy1205/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
zshy1205/ttts
Train the next generation of TTS systems.
zshy1205/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E