AnitaLiu98's Stars
google-research/leaf-audio
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a very small number of parameters.
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
HoiM/UIR-Loss-selected-code
Selected code for Unknown Identity Rejection Loss
amajee11us/UPR-FSL
Code for UNSUPERVISED PROTOTYPE RECTIFICATION FOR FEW-SHOT LEARNING
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
joerick/pyinstrument
🚴 Call stack profiler for Python. Shows you why your code is slow!
rkern/line_profiler
(OLD REPO) Line-by-line profiling for Python - Current repo ->
coder/code-server
VS Code in the browser
ludlows/PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
Lukelluke/MCD-MEL-CEPSTRAL-DISTANCE-MCD-application
Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav files
microsoft/NeuralSpeech
hbuschme/TextGridTools
Read, write, and manipulate Praat TextGrid files with Python
keonlee9420/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
syang1993/gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
mmorise/kiritan_singing
東北きりたん歌唱データベースの最新ラベルデータ
michetonu/gradient_reversal_keras_tf
Keras implementation of a gradient reversal layer for the Tensorflow backend
Deepest-Project/AlignTTS
Implementation of the AlignTTS
datawhalechina/pumpkin-book
《机器学习》(西瓜书)公式详解
wenet-e2e/speech-synthesis-paper
List of speech synthesis papers.
Theano/Theano
Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor: www.github.com/pymc-devs/pytensor
openai/improved-gan
Code for the paper "Improved Techniques for Training GANs"
sanghviyashiitb/GANS-VanillaAndMinibatchDiscrimination
random-weights/Minibatch-Discrimination
Verifying it for MNSIT dataset
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
open-speech/speech-aligner
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
tensorflow/addons
Useful extra functionality for TensorFlow 2.x maintained by SIG-addons
ivanvovk/durian-pytorch
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
espnet/espnet
End-to-End Speech Processing Toolkit
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)