jingxuan9862

jingxuan9862's Stars

vb000/LookOnceToHear
A novel human-interaction method for real-time speech extraction on headphones.
Language:Python51154
ewan-xu/pyaec
simple and efficient python implemention of a series of adaptive filters. including time domain adaptive filters(lms、nlms、rls、ap、kalman)、nonlinear adaptive filters(volterra filter、functional link adaptive filters)、frequency domain adaptive filters(frequency domain adaptive filter、frequency domain kalman filter) for acoustic echo cancellation.
Language:Python29997
magenta/mt3
MT3: Multi-Task Multitrack Music Transcription
Language:Python1.4k182
jzi040941/PercepNet
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
Language:C++31591
google/visqol
Perceptual Quality Estimator for speech and audio
Language:C++647118
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Language:Python2.1k162
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.2k2.5k
jingxuan9862/PaddleSpeech
An Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
1
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python10.7k1.8k
FederatedAI/FATE
An Industrial Grade Federated Learning Framework
Language:Python5.6k1.5k
bytedance/byteps
A high performance and generic framework for distributed DNN training
Language:Python3.6k487
qiuqiangkong/panns_transfer_to_gtzan
Language:Python9638
jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
Language:Python2.4k617
cvondrick/soundnet
SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
Language:Lua46093
qiuqiangkong/audioset_tagging_cnn
Language:Python1.3k247
deezer/spleeter
Deezer source separation library including pretrained models.
Language:Python25.3k2.8k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python11k2.3k
abisee/pointer-generator
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
Language:Python2.2k812
google/sparrowhawk
Language:Shell20458
wenet-e2e/WeTextProcessing.deprecated
Language:C++615
speechio/chinese_text_normalization
Chinese text normalization for speech processing
Language:Python604146
BUTSpeechFIT/speakerbeam
Language:Jupyter Notebook9218
magenta/ddsp
DDSP: Differentiable Digital Signal Processing
Language:Python2.8k332
nanahou/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Language:MATLAB689149
asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language:Python2.2k421
nay0648/unified2021
A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION
Language:MATLAB10955
etzinis/sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
Language:Jupyter Notebook30234
clovaai/voxceleb_trainer
In defence of metric learning for speaker recognition
Language:Python1k272
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.3k1.3k
lenovo-voice/THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
Language:Shell5026