Pinned Repositories
basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
de-ess
De-essing software to reduce sibilance in speech
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
dnn_wpe
DSP_SibilanceDetection
A DSP algorithm designed to detect sibilance
fdndlp
A speech dereverberation algorithm, also called wpe
SpeechEnhanceDemo
SpeechEnhancement
Combining Weighted Multi-resolution STFT Loss and Distance Fusion to Optimize Speech Enhancement Generative Adversarial Networks
SpeechEnhancement-Model-s-Deployment
SpeechEnhancement model deployment with C++
Voice-Preprocessing-Toolkit
some voice preprocessing tools in it
Nitin4525's Repositories
Nitin4525/SpeechEnhancement
Combining Weighted Multi-resolution STFT Loss and Distance Fusion to Optimize Speech Enhancement Generative Adversarial Networks
Nitin4525/SpeechEnhancement-Model-s-Deployment
SpeechEnhancement model deployment with C++
Nitin4525/Voice-Preprocessing-Toolkit
some voice preprocessing tools in it
Nitin4525/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Nitin4525/de-ess
De-essing software to reduce sibilance in speech
Nitin4525/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Nitin4525/dnn_wpe
Nitin4525/DSP_SibilanceDetection
A DSP algorithm designed to detect sibilance
Nitin4525/fdndlp
A speech dereverberation algorithm, also called wpe
Nitin4525/hrbeuthesis
哈尔滨工程大学本硕博学位论文LaTex模板
Nitin4525/SpeechEnhanceDemo
Nitin4525/nara_wpe
去混响
Nitin4525/noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Nitin4525/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
Nitin4525/reverb-algorithms
A set of scripts implementing popular reverberation audio effect algorithms.
Nitin4525/reverse-interview
Questions to ask the company during your interview
Nitin4525/semetrics
Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
Nitin4525/speech_dereverbaration_using_lp_residual
This is a single channel speech dereverberation method based on DOI: 10.1109/TSA.2005.858066; implemented in MATLAB
Nitin4525/spleeter
Deezer source separation library including pretrained models.
Nitin4525/SRGAN
A PyTorch implementation of SRGAN based on CVPR 2017 paper "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network"
Nitin4525/steerable-nafx
Steerable discovery of neural audio effects
Nitin4525/TFG-PitchCorrection
This repository contains all the materials generated for my end of studies project.
Nitin4525/uavs3e
AVS3 encoder which supports AVS3-P2 baseline profile.
Nitin4525/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit