d2sys's Stars
Avik-Jain/100-Days-Of-ML-Code
100 Days of ML Coding
google-research/bert
TensorFlow code and pre-trained models for BERT
google-research/google-research
Google Research
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
google-deepmind/sonnet
TensorFlow-based neural network library
espnet/espnet
End-to-End Speech Processing Toolkit
WillKoehrsen/Data-Analysis
Data Science Using Python
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
facebookresearch/UnsupervisedMT
Phrase-Based & Neural Unsupervised Machine Translation
NVIDIA/deepops
Tools for building GPU clusters
facebookarchive/loop
A method to generate speech across multiple speakers
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
bshall/knn-vc
Voice Conversion With Just Nearest Neighbors
CrowdCurio/audio-annotator
A JavaScript interface for annotating and labeling audio files.
OpenNewsLabs/autoEdit_2
Fast text-based video editing: a Node/Electron OS X desktop app with a Backbone front end.
petewarden/open-speech-recording
Web application to record speech for an open data set
sagiebenaim/DistanceGAN
PyTorch implementation of "One-Sided Unsupervised Domain Mapping" NIPS 2017
bigpon/vcc20_baseline_cyclevae
Voice Conversion Challenge 2020 CycleVAE baseline system
neulab/word-embeddings-for-nmt
Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018
OpenHypervideo/FrameTrail
FrameTrail is open source software that lets you experience, manage and edit interactive video directly in your web browser. It enables you to hyperlink filmic contents, include additional multimedia documents (e.g. text overlays, images or interactive maps) and to add supplementing materials (annotations) at specific points.
oseledets/nla2020
GitHub repository for NLA2020 course
iamyuanchung/speech2vec-pretrained-vectors
Speech2vec pre-trained word vectors
tomgrek/ml-deployment-demo
ML Deployment, Two Ways
emilio-molina/audio_degrader
Audio degradation toolbox in Python, with a command-line tool. It is useful for applying controlled degradations to audio, e.g. for data augmentation or evaluation in noisy conditions.
a-nagrani/ffmpeg-commands
Collection of useful FFMPEG commands for processing audio and video files.
uclanlp/NamedEntityLanguageModel
oseledets/nla2016
Repository for 2016 NLA course @ Skoltech