Pinned Repositories
asteroid
The PyTorch-based audio source separation toolkit for researchers || Current highlight : we got our WHAMR results check it out here !
audfprint
Landmark-based audio fingerprinting
audioset_tagging_cnn
awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
awesome-cpp
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
berkeley-stat-157
Homepage for STAT 157 at UC Berkeley
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
chromaprint
C library for generating audio fingerprints used by AcoustID
CTC-speech-recognition
This is a working example of using CTC for phone recognition on TIMIT
NumpyDL
Deep Learning Library. For education. Based on pure Numpy. Support CNN, RNN, LSTM, GRU etc.
opencvbaby's Repositories
opencvbaby/NumpyDL
Deep Learning Library. For education. Based on pure Numpy. Support CNN, RNN, LSTM, GRU etc.
opencvbaby/audfprint
Landmark-based audio fingerprinting
opencvbaby/CTC-speech-recognition
This is a working example of using CTC for phone recognition on TIMIT
opencvbaby/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
opencvbaby/deeplearning-models
A collection of various deep learning architectures, models, and tips
opencvbaby/dejavu
Audio fingerprinting and recognition in Python
opencvbaby/dejavu_fingerprinter_cpp
A port of the Dejavu fingerprint.py program to C++
opencvbaby/DNN-for-speech-enhancement
DNN-for-speech-enhancement
opencvbaby/espnet
End-to-End Speech Processing Toolkit
opencvbaby/hexo_weibo_image
hexo使用新浪微博图床自动发布
opencvbaby/iNCML-DNNLM
A CUDA-C implementation of FOFE and FSMN
opencvbaby/kaldi
Customized Kaldi
opencvbaby/kaldi-io-for-python
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
opencvbaby/kaldi-python
Python wrappers for Kaldi data
opencvbaby/kapre
kapre: Keras Audio Preprocessors
opencvbaby/ML-KWS-for-MCU
opencvbaby/onssen
An open-source speech separation and enhancement library
opencvbaby/pix2pix-tensorflow
Tensorflow port of Image-to-Image Translation with Conditional Adversarial Nets https://phillipi.github.io/pix2pix/
opencvbaby/PRML
PRML algorithms implemented in Python
opencvbaby/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
opencvbaby/segan
Speech Enhancement Generative Adversarial Network in TensorFlow
opencvbaby/SignalGraph
Matlab-based deep learning toolkit that supports arbitrary directed acyclic graphs (DAG). Support DNN, LSTM, CNN layers and many signal processing layers. Include recipes/examples of using the tool for various tasks.
opencvbaby/specGAN
Noisy to clean spectral mapping of speech using adversarial training
opencvbaby/speech
Various scripts and tools for speech recognition model building
opencvbaby/TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners with Latest APIs
opencvbaby/tensorflow-speech-recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
opencvbaby/tensorflow_speech_recognition_solution
code for 3rd place kaggle tensorflow competition
opencvbaby/tf_kaldi_io
A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.
opencvbaby/vim
vim配置文件和插件(不再维护)
opencvbaby/Youtube-8M-WILLOW
Kaggle Youtube 8M WILLOW approach