d2sys

d2sys's Stars

Avik-Jain/100-Days-Of-ML-Code
100 Days of ML Coding
44.5k 2.4k 010.5k
google-research/bert
TensorFlow code and pre-trained models for BERT
Language:Python37.8k 999 1.1k9.6k
google-research/google-research
Google Research
Language:Jupyter Notebook33.8k 751 1.2k7.8k
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Language:Python22.5k 1.3k 1003.6k
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell14.1k 696 1.6k5.3k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python11.5k 201 2.2k2.4k
google-deepmind/sonnet
TensorFlow-based neural network library
Language:Python9.7k 423 1931.3k
espnet/espnet
End-to-End Speech Processing Toolkit
Language:Python8.3k 182 2.3k2.2k
WillKoehrsen/Data-Analysis
Data Science Using Python
Language:Jupyter Notebook5.2k 354 613.6k
google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Language:Python1.6k 101 87319
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Language:Jupyter Notebook1.5k 45 255339
facebookresearch/UnsupervisedMT
Phrase-Based & Neural Unsupervised Machine Translation
Language:Python1.5k 121 101262
NVIDIA/deepops
Tools for building GPU clusters
Language:Shell1.3k 51 434326
facebookarchive/loop
A method to generate speech across multiple speakers
Language:Python871 68 75158
githubharald/CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.
Language:Python816 25 23182
bshall/knn-vc
Voice Conversion With Just Nearest Neighbors
Language:Python447 16 3864
CrowdCurio/audio-annotator
A JavaScript interface for annotating and labeling audio files.
Language:JavaScript433 17 1084
OpenNewsLabs/autoEdit_2
Fast text based video editing, node Electron Os X desktop app, with Backbone front end.
Language:JavaScript420 39 7356
petewarden/open-speech-recording
Web application to record speech for an open data set
Language:HTML420 25 7160
sagiebenaim/DistanceGAN
Pytorch implementation of "One-Sided Unsupervised Domain Mapping" NIPS 2017
Language:Python195 12 440
bigpon/vcc20_baseline_cyclevae
Voice Conversion Challenge 2020 CycleVAE baseline system
Language:Python133 6 917
neulab/word-embeddings-for-nmt
Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018
Language:Python119 9 519
OpenHypervideo/FrameTrail
FrameTrail is an open source software that let's you experience, manage and edit interactive video directly in your web browser. It enables you to hyperlink filmic contents, include additional multimedia documents (e.g. text overlays, images or interactive maps) and to add supplementing materials (annotations) at specific points.
Language:JavaScript114 18 3937
oseledets/nla2020
Github repository for NLA2020 course
Language:Jupyter Notebook82 12 140
iamyuanchung/speech2vec-pretrained-vectors
Speech2vec pre-trained word vectors
77 2 311
tomgrek/ml-deployment-demo
ML Deployment, Two Ways
Language:Jupyter Notebook57 3 125
emilio-molina/audio_degrader
Audio degradation toolbox in python, with a command-line tool. It is useful to apply controlled degradations to audio: e.g. data augmentation, evaluation in noisy conditions, etc.
Language:Python56 3 2610
a-nagrani/ffmpeg-commands
Collection of useful FFMPEG commands for processing audio and video files.
43 0 09
uclanlp/NamedEntityLanguageModel
Language:Python32 7 05
oseledets/nla2016
Repository for 2016 NLA course @ Skoltech
Language:Jupyter Notebook20 8 419