Pinned Repositories
3D-convolutional-speaker-recognition
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
algorithm-visualizer
:fireworks:Interactive Online Platform that Visualizes Algorithms from Code
amazon-chime-sdk-pstn-audio-voice-focus
argus-freesound
Kaggle | 1st place solution for Freesound Audio Tagging 2019
athena
an open-source implementation of sequence-to-sequence based speech processing engine
aubio
a library for audio and music analysis
Audio-Effects
Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.
audio-super-res
Audio super resolution using neural networks
Audio_based_identification_beehive_states
bandwidth_extension
simpleishappy's Repositories
simpleishappy/amazon-chime-sdk-pstn-audio-voice-focus
simpleishappy/athena
an open-source implementation of sequence-to-sequence based speech processing engine
simpleishappy/Audio-Effects
Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.
simpleishappy/audio-super-res
Audio super resolution using neural networks
simpleishappy/brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
simpleishappy/ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
simpleishappy/conv_arithmetic
A technical report on convolution arithmetic in the context of deep learning
simpleishappy/DCCRN-with-various-loss-functions
DCCRN with various loss functions
simpleishappy/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
simpleishappy/diart
Lightweight python library for streaming speaker diarization in real-time implemented in pytorch
simpleishappy/fdnToolbox
Matlab Toolbox for Feedack Delay Networks (FDNs)
simpleishappy/gnuradio
GNU Radio – the Free and Open Software Radio Ecosystem
simpleishappy/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
simpleishappy/ILRMA
MATLAB script of Independent Low-Rank Matrix Analysis (ILRMA)
simpleishappy/lyra
A Very Low-Bitrate Codec for Speech Compression
simpleishappy/Merlion
Merlion: A Machine Learning Framework for Time Series Intelligence
simpleishappy/MRSP_Tutorials
Multirate Signal Processing Tutorials
simpleishappy/Neural-Speech-Dereverberation
Machine and Deep Learning models for speech dereverberation
simpleishappy/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
simpleishappy/PCS-FIR-Filter
A time-domain extension to "Perceptual Contrast Stretching on Target Feature for Speech Enhancement"
simpleishappy/PercepNet
(Under construct) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
simpleishappy/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
simpleishappy/PyTorch_Tutorial
《Pytorch模型训练实用教程》中配套代码
simpleishappy/RNNoise_Wrapper
A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for training.
simpleishappy/SkipConvNet
Speech Dereverberation using Fully Convolutional Networks
simpleishappy/SoundZone_Tools
Signal Processing Tools for MATLAB
simpleishappy/speechbrain
A PyTorch-based Speech Toolkit
simpleishappy/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
simpleishappy/TorchSnooper
Debug PyTorch code using PySnooper
simpleishappy/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.