sushantakpani
(NLP | TTS | ASR | Voice Biometrics | Machine Learning | Deep Learning)
Michigan State UniversityRedmond, WA
Pinned Repositories
coref
BERT for Coreference Resolution
1D-Triplet-CNN
PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals by A. Chowdhury, and A. Ross.
Algorithms-Collection-Python
Collection of Algorithms implemented in Python
allennlp
An open-source NLP research library, built on PyTorch.
awesome-deep-learning-papers
The most cited deep learning papers
ba-dls-deepspeech
berkeley-coreference-analyser
A tool for classifying errors in coreference resolution
e2e-coref
End-to-end Neural Coreference Resolution
long-doc-coref
Code for the EMNLP 2020 paper "Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks"
sushantakpani.github.io
Sushanta's Website
sushantakpani's Repositories
sushantakpani/Algorithms-Collection-Python
Collection of Algorithms implemented in Python
sushantakpani/long-doc-coref
Code for the EMNLP 2020 paper "Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks"
sushantakpani/sushantakpani.github.io
Sushanta's Website
sushantakpani/coref-hoi
PyTorch implementation of the end-to-end coreference resolution model with different higher-order inference methods.
sushantakpani/cse812
sushantakpani/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
sushantakpani/data-structures-algorithms-python
This tutorial playlist covers data structures and algorithms in python. Every tutorial has theory behind data structure or an algorithm, BIG O Complexity analysis and exercises that you can practice on.
sushantakpani/DeepTalk
sushantakpani/espnet
End-to-End Speech Processing Toolkit
sushantakpani/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
sushantakpani/flite
A small fast portable speech synthesis system
sushantakpani/IndicWav2Vec
Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
sushantakpani/Machine-Learning-Collection
A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)
sushantakpani/Machine-Learning-From-Scratch
Implementation of popular ML algorithms from scratch
sushantakpani/model_search
sushantakpani/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
sushantakpani/odia-s2t
sushantakpani/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
sushantakpani/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
sushantakpani/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
sushantakpani/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
sushantakpani/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
sushantakpani/SemanticHearing
Real-time binaural target sound extraction model.
sushantakpani/speechbrain
A PyTorch-based Speech Toolkit
sushantakpani/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
sushantakpani/transformers-text-classification-for-nlp-using-bert-2478096
This repo is for the Linkedin Learning course: Transformers: Text Classification for NLP using BERT
sushantakpani/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
sushantakpani/TTS-1
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
sushantakpani/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
sushantakpani/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone