Pinned Repositories
esp-skainet
Espressif intelligent voice assistant
esp-sr
Speech recognition
alignment-handbook
Robust recipes to align language models with human and AI preferences
asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
Audiomer-PyTorch
A Convolutional Transformer for Keyword Spotting
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
deepvac
PyTorch Project Specification.
edge-tts
Microsoft Edge's TTS
feizi's Repositories
feizi/alignment-handbook
Robust recipes to align language models with human and AI preferences
feizi/asteroid
The PyTorch-based audio source separation toolkit for researchers || Pretrained models available
feizi/Audiomer-PyTorch
A Convolutional Transformer for Keyword Spotting
feizi/Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
feizi/deepvac
PyTorch Project Specification.
feizi/edge-tts
Microsoft Edge's TTS
feizi/esp-box
The ESP-BOX is a new generation AIoT development platform released by Espressif Systems.
feizi/esp-idf
Espressif IoT Development Framework. Official development framework for ESP32.
feizi/esp-skainet
Espressif intelligent voice assistant
feizi/esp-sr
Speech recognition
feizi/feizi.github.io
feizi/flatcc
FlatBuffers Compiler and Library in C for C
feizi/flite
A small fast portable speech synthesis system
feizi/gcc-nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
feizi/huxpro.github.io
My Blog / Jekyll Themes / PWA
feizi/iSTFTNet-pytorch
iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
feizi/kaldi
This is the official location of the Kaldi project.
feizi/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
feizi/pb_chime5
Speech enhancement system for the CHiME-5 dinner party scenario
feizi/porcupine
On-device wake word detection powered by deep learning.
feizi/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
feizi/README
An explanation of README file syntax, i.e. an introduction to GitHub Flavored Markdown syntax
feizi/sound-separation
feizi/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
feizi/toolbox-for-speech-signal-processing
A collection of some tools for research on speech signal processing
feizi/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
feizi/unified2021
A unified speech enhancement front-end for online dereverberation, acoustic echo cancellation, and source separation
feizi/UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
feizi/WavAugment
A library for speech data augmentation in time-domain
feizi/wer_are_we
Attempt at tracking the state of the art and recent results (bibliography) on speech recognition.