AnitaLiu98's Stars

google-research/leaf-audio
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a very small number of parameters.
Language: Python 49451
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language: Python, 10.9k stars, 1.8k forks
HoiM/UIR-Loss-selected-code
Selected code for Unknown Identity Rejection Loss
Language: Python 43
amajee11us/UPR-FSL
Code for UNSUPERVISED PROTOTYPE RECTIFICATION FOR FEW-SHOT LEARNING
61
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
Language: Python, 22.9k stars, 5.4k forks
joerick/pyinstrument
🚴 Call stack profiler for Python. Shows you why your code is slow!
Language: Python, 6.5k stars, 228 forks
rkern/line_profiler
(OLD REPO) Line-by-line profiling for Python - Current repo ->
Language: Python, 3.6k stars, 254 forks
coder/code-server
VS Code in the browser
Language: TypeScript, 67.7k stars, 5.6k forks
ludlows/PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
Language: C 51997
Lukelluke/MCD-MEL-CEPSTRAL-DISTANCE-MCD-application
Mel cepstral distortion (MCD) computations in Python. Uses the Merlin toolkit to convert .wav files to .gcm files. Works with all forms of .wav files.
Language: Shell 193
microsoft/NeuralSpeech
Language: Python, 1.4k stars, 184 forks
hbuschme/TextGridTools
Read, write, and manipulate Praat TextGrid files with Python
Language: Python 12330
keonlee9420/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
Language: Python 18844
syang1993/gst-tacotron
A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
Language: Python, 368 stars, 110 forks
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language: Python, 6.7k stars, 1.2k forks
mmorise/kiritan_singing
Latest label data for the Tohoku Kiritan singing database
13913
michetonu/gradient_reversal_keras_tf
Keras implementation of a gradient reversal layer for the TensorFlow backend
Language: Python 9024
Deepest-Project/AlignTTS
Implementation of the AlignTTS
Language: Jupyter Notebook 7612
datawhalechina/pumpkin-book
Detailed explanations of the formulas in "Machine Learning" (the "Watermelon Book")
23.8k stars, 4.7k forks
wenet-e2e/speech-synthesis-paper
List of speech synthesis papers.
990 stars, 120 forks
Theano/Theano
Theano was a Python library that allowed you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor: www.github.com/pymc-devs/pytensor
Language: Python, 9.9k stars, 2.5k forks
openai/improved-gan
Code for the paper "Improved Techniques for Training GANs"
Language: Python, 2.3k stars, 621 forks
sanghviyashiitb/GANS-VanillaAndMinibatchDiscrimination
Language: Jupyter Notebook 141
random-weights/Minibatch-Discrimination
Verifying it for the MNIST dataset
Language: Python 1
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language: Python, 1.9k stars, 501 forks
open-speech/speech-aligner
speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.
Language: C++, 393 stars, 105 forks
tensorflow/addons
Useful extra functionality for TensorFlow 2.x maintained by SIG-addons
Language: Python, 1.7k stars, 610 forks
ivanvovk/durian-pytorch
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
Language: Python 18248
espnet/espnet
End-to-End Speech Processing Toolkit
Language: Python, 8.3k stars, 2.2k forks
TensorSpeech/TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
Language: Python, 3.8k stars, 810 forks