vectominist
PhD Candidate @ MIT CSAIL. Speech Processing and Balloon Arts.
Massachusetts Institute of TechnologyCambridge, MA
Pinned Repositories
End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
espnet
End-to-End Speech Processing Toolkit
ML2021-Spring
**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2021 Spring
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
awesome-self-supervised-speech-representation-learning
A comprehensive list of awesome self-supervised speech representation learning papers.
End-to-end-ASR-Pytorch-DLHLP
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)
Face-Image-Morphing
🧒🏻👨🏼👱🏾♀️👶🏻 Face Image Morphing: an OpenCV and NumPy Implementation
MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
spin
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"
Switchboard-WSJ-Utils
Utilities for preprocessing the Switchboard and WSJ corpora in Python3
vectominist's Repositories
vectominist/MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
vectominist/spin
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"
vectominist/End-to-end-ASR-Pytorch-DLHLP
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)
vectominist/awesome-self-supervised-speech-representation-learning
A comprehensive list of awesome self-supervised speech representation learning papers.
vectominist/Face-Image-Morphing
🧒🏻👨🏼👱🏾♀️👶🏻 Face Image Morphing: an OpenCV and NumPy Implementation
vectominist/Switchboard-WSJ-Utils
Utilities for preprocessing the Switchboard and WSJ corpora in Python3
vectominist/Course-Map-Visualization
A simple website for visualizing course maps 🎓🗺.
vectominist/SBCSAE-preprocess
Preprocessing and downloading scripts for the Santa Barbara Corpus of Spoken American English (SBCSAE).
vectominist/awesome-self-supervised-learning
A curated list of awesome self-supervised methods
vectominist/FRAIG
Final Project of Data Structure and Programming 2018 Fall, NTUEE
vectominist/MedNLP
Mandarin Medical Dialogue Analysis with Pytorch.
vectominist/ZJ-Solutions-in-Python
💻 Solutions to ZeroJudge in Python
vectominist/Algorithms2019Fall
💻 Solutions to the three programming assignments of the course Algorithms 2019 Fall, NTU EE.
vectominist/End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.
vectominist/espnet
End-to-End Speech Processing Toolkit
vectominist/eval-word-vectors
Easy to use scripts for evaluating word vectors on a variety of tasks.
vectominist/GeoRect-Demo
Demo of Deep Learning-based Image Geometric Rectification
vectominist/ICG2020Spring-HW1
🎨 HW1 (shading and transformation) of the course Interactive Computer Graphics, NTU CSIE.
vectominist/ml-ta-helper
vectominist/phone-seg-ssl
Phoneme segmentation using pre-trained speech models
vectominist/receptive-field-calculator
A simple receptive field calculator for convolutional neural networks (CNN).
vectominist/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
vectominist/spectra-review-paper-competition
Competition for best expository article on cutting-edge ML research
vectominist/TaiwanMahjongGame
Taiwan Mahjong Game - Final Project of Computer Programming 2017 Fall, NTUEE
vectominist/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
vectominist/vectominist
vectominist/vectominist.github.io
vectominist/vectominist.github.io.old
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
vectominist/website
vectominist/zr-2021vg_baseline
Baselines for the Zero-Resources Speech Challenge using VisuallyGrounded Models of Spoken Language, 2021 edition