Pinned Repositories
adaptive_voice_conversion
adversarial-disentangling-autoencoder-for-spk-representation
Software presented in the article "Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation".
AGAIN-VC
This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization.
Alibaba-MIT-Speech
Alibaba speech technology
asv-subtools
Open Source Tools for Speaker Recognition
auorange
Audio LPC (linear prediction code) using mel spectrogram, compatible with LPCNet
CLUB
Code for the ICML 2020 paper "CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information"
fb029ed
scrcpy-opencv-SQ
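A C++ refactoring of scrcpy that provides frames as OpenCV Mat images for easier secondary development; also includes an automatic course-watching script for 智慧树知到 (Zhihuishu).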
yolov5_cpp_openvino
A C++ implementation of YOLOv5 deployment with OpenVINO.
fb029ed's Repositories
fb029ed/yolov5_cpp_openvino
A C++ implementation of YOLOv5 deployment with OpenVINO.
fb029ed/scrcpy-opencv-SQ
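A C++ refactoring of scrcpy that provides frames as OpenCV Mat images for easier secondary development; also includes an automatic course-watching script for 智慧树知到 (Zhihuishu).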
fb029ed/asv-subtools
Open Source Tools for Speaker Recognition
fb029ed/fb029ed
fb029ed/adversarial-disentangling-autoencoder-for-spk-representation
Software presented in the article "Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation".
fb029ed/auorange
Audio LPC (linear prediction code) using mel spectrogram, compatible with LPCNet
fb029ed/CLUB
Code for the ICML 2020 paper "CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information"
fb029ed/conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
fb029ed/ConvS2S-VC
fb029ed/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
fb029ed/g2p
g2p: English Grapheme To Phoneme Conversion
fb029ed/gmm-torch
Gaussian mixture models in PyTorch.
fb029ed/GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
fb029ed/leetcode-master
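《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, more than 50 mind maps, and solutions in C++, Java, Python, Go, JavaScript, and other languages. No more feeling lost when learning algorithms! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀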
fb029ed/lidbox
End-to-end spoken language identification out of the box. Rewrite in progress for first release (version 1).
fb029ed/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
fb029ed/NeMo
NeMo: a toolkit for conversational AI
fb029ed/openTSNE
Extensible, parallel implementations of t-SNE
fb029ed/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
fb029ed/phonemizer
Simple text to phones converter for multiple languages
fb029ed/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
fb029ed/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
fb029ed/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
fb029ed/SC-WaveRNN
Official PyTorch implementation of Speaker Conditional WaveRNN
fb029ed/snowfall
fb029ed/STL
The ITU-T Software Tool Library (G.191)
fb029ed/TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
fb029ed/VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021
fb029ed/WaveRNN
WaveRNN Vocoder + TTS
fb029ed/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit