Pinned Repositories
Add_noise_and_rir_to_speech
This code base adds noise from the MUSAN dataset to a clean speech signal at a specified signal-to-noise ratio, and generates far-field speech data using room impulse responses from the BUT Speech@FIT Reverb Database.
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
cml
♾️ CML - Continuous Machine Learning | CI/CD for ML
cognitive-services-speech-sdk
Sample code for the Microsoft Cognitive Services Speech SDK
DNS-Challenge-2020
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
google-research
Google Research
honk
PyTorch implementations of neural network models for keyword spotting
kaldi-aslp
ziggy1209.github.io
BLOG
ziggy1209's Repositories
ziggy1209/ziggy1209.github.io
BLOG
ziggy1209/Add_noise_and_rir_to_speech
This code base adds noise from the MUSAN dataset to a clean speech signal at a specified signal-to-noise ratio, and generates far-field speech data using room impulse responses from the BUT Speech@FIT Reverb Database.
ziggy1209/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
ziggy1209/audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
ziggy1209/cml
♾️ CML - Continuous Machine Learning | CI/CD for ML
ziggy1209/cognitive-services-speech-sdk
Sample code for the Microsoft Cognitive Services Speech SDK
ziggy1209/DNS-Challenge-2020
This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of this challenge will use the scripts from this repo to create data to train their noise suppressors. They will compare their method with our baseline noise suppressor and report the results.
ziggy1209/google-research
Google Research
ziggy1209/honk
PyTorch implementations of neural network models for keyword spotting
ziggy1209/kaldi-aslp
ziggy1209/kaldi-io-for-python
Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python.
ziggy1209/kaldi_egs_CGN
Kaldi recipe for creating Dutch ASR from CGN
ziggy1209/kaldifeat
Kaldi-compatible online and offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd. Provides C++ and Python APIs.
ziggy1209/KWS_pytorch
Keyword spotting and speech wake-up in PyTorch: DNN, CNN, TDNN, DFSMN, LSTM
ziggy1209/lite.ai.toolkit
🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN, including YOLOX, YOLOP, YOLOv6, YOLOR, MODNet, YOLOv7 and YOLOv5.
ziggy1209/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
ziggy1209/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
ziggy1209/pykaldi
A Python wrapper for Kaldi
ziggy1209/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
ziggy1209/shennong
A Python toolbox for speech features extraction
ziggy1209/switch-cuda
A simple bash script for switching between installed versions of CUDA.
ziggy1209/usingcli-book
Using the command line like a hacker
ziggy1209/vision
Datasets, Transforms and Models specific to Computer Vision
ziggy1209/WebRTC_AGC
Automatic Gain Control Module Port From WebRTC
ziggy1209/WebRTC_NS
Noise Suppression Module Port From WebRTC
ziggy1209/wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
ziggy1209/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
ziggy1209/wespeaker
Research and Production Oriented Speaker Recognition Toolkit
ziggy1209/z3
The Z3 Theorem Prover