Pinned Repositories
100DaysOfMLCode
asr-server
FastCGI support for Kaldi ASR
audio-visual-speech-enhancement
Automatic_Speech_Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
awesome-deep-learning-music
List of articles related to deep learning applied to music
awesome-scalability
Scalable, Available, Stable, Performant, and Intelligent System Design Patterns
stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
tensorspace
Neural network 3D visualization framework, build interactive and intuitive model in browsers, support pre-trained deep learning models from TensorFlow, Keras, TensorFlow.js
faiss
A library for efficient similarity search and clustering of dense vectors.
00001101-xt's Repositories
00001101-xt/100DaysOfMLCode
00001101-xt/asr-server
FastCGI support for Kaldi ASR
00001101-xt/Automatic_Speech_Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
00001101-xt/awesome-scalability
Scalable, Available, Stable, Performant, and Intelligent System Design Patterns
00001101-xt/tensorspace
Neural network 3D visualization framework, build interactive and intuitive model in browsers, support pre-trained deep learning models from TensorFlow, Keras, TensorFlow.js
00001101-xt/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
00001101-xt/awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
00001101-xt/awesome-speech-recognition-speech-synthesis-papers
Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
00001101-xt/DeepAFx
Third-party audio effects plugins as differentiable layers within deep neural networks.
00001101-xt/DeOldify
A Deep Learning based project for colorizing and restoring old images
00001101-xt/dropclass_speaker
DropClass and DropAdapt - repository for the paper submitted to Speaker Odyssey 2020
00001101-xt/google-images-download
Python Script to download hundreds of images from 'Google Images'. It is a ready-to-run code!
00001101-xt/keras-applications
Reference implementations of popular deep learning models.
00001101-xt/madmom
Python audio and music signal processing library
00001101-xt/MakeItTalk
00001101-xt/minimp3
Minimalistic MP3 decoder single header library
00001101-xt/moby
Moby Project - a collaborative project for the container ecosystem to assemble container-based systems
00001101-xt/mss_pytorch
Singing Voice Separation via Recurrent Inference and Skip-Filtering Connections - PyTorch Implementation. Demo:
00001101-xt/musegan
An AI for Music Generation
00001101-xt/PyramidBox-1
This repo implements PyramidBox with pytorch
00001101-xt/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
00001101-xt/qmc-decoder
Fastest & best convert qmc 2 mp3 | flac tools
00001101-xt/Self-Attentive-tensorflow
Tensorflow implementation of "A Structured Self-Attentive Sentence Embedding"
00001101-xt/Speaker_Verification
Tensorflow implementation of generalized end-to-end loss for speaker verification
00001101-xt/tensorflow-triplet-loss
Implementation of triplet loss in TensorFlow
00001101-xt/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
00001101-xt/the-art-of-command-line
Master the command line, in one page
00001101-xt/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
00001101-xt/VBDiarization
Speaker diarization based on Kaldi x-vectors using pretrained model from http://kaldi-asr.org/models/0003_sre16_v2_1a.tar.gz
00001101-xt/voice-vector
A deep neural network for finding text-independent speaker embedding written in tensorflow