fauxneticien

Stanford UniversityBerkeley, CA

Pinned Repositories

tidylex
Tidy lexicographical data in backslash-coded formats
Language:JavaScript5 5 20
vad-sli-asr
A pipeline to isolate and transcribe one language in mixed-language speech
Language:Python18 3 03
vyov
Visualise your own vowels: A short introduction to Praat for complete beginners
Language:HTML2 5 00
yinarlingi
R package for testing Warlpiri dictionary data structures
Language:R0 6 31
bnf_cnn_qbe-std
Query by example spoken term detection using bottleneck features and a convolutional neural network
Language:Python8 2 11
fastconformer_standalone
Language:Python6 1 10
qbe-std_feats_eval
Evaluation of feature extraction methods for query-by-example spoken term detection with low resource languages
Language:Perl11 5 12
wav2vec2-codebook-indices
Language:Python3 2 01
qbestdocks
A set of pre-configured Docker containers for deploying a Query-by-Example Spoken Term Detection service.
Language:JavaScript0 2 01

fauxneticien's Repositories

fauxneticien/fastconformer_standalone
Language:Python6 1 10
fauxneticien/wav2vec2-codebook-indices
Language:Python3 2 01
fauxneticien/lnl-examples
PyTorch Lightning and Lhotse examples
Language:Jupyter Notebook2 3 01
fauxneticien/phd-dissertation
Language:TeX1 1 0
fauxneticien/PTL2-DS2ish
A toy repository for using PyTorch Lightning 2.x to train an adapted DeepSpeech 2 model
Language:Python1 2 0
fauxneticien/scriptable_hubert_encoder
Development repository for experimenting with a scriptable (and crammable) HuBERT encoder
Language:Jupyter Notebook1 2 0
fauxneticien/ssl-harness
Language:Python1 1 0
fauxneticien/w2v2-10min-exps
Experiments with wav2vec 2.0 models involving only 10 minutes of transcribed speech
Language:Jupyter Notebook1 2 1
fauxneticien/w2v2-batch-size
Code for paper "The effect of batch size on contrastive self-supervised speech representation learning"
Language:Python1 0 0
fauxneticien/w2v2-cpt-transfer
Language:Jupyter Notebook1 1 0
fauxneticien/asr-dataset-prep
Scripts for preparing datasets for automatic speech recognition
Language:Jupyter Notebook2 0
fauxneticien/best-rq-pytorch
Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.
fauxneticien/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
Language:Python0 0
fauxneticien/byt5-mt
2 0
fauxneticien/conformer
Implementation of the convolutional module from the Conformer paper, for use in Transformers
fauxneticien/ctc_decoder_files
Language:Python1 0
fauxneticien/E2E-language-diarization-transfer
Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>
Language:Python1 2
fauxneticien/fauxneticien
2 0
fauxneticien/fauxneticien.github.io
Language:JavaScript
fauxneticien/hubert-exploration
Language:Python1 0
fauxneticien/jupyter-book
Create beautiful, publication-quality books and documents from computational content.
Language:Python1 0
fauxneticien/lightning-speech-sampling
Try out different samplers for speech data with PyTorch Lightning
Language:Python2 0
fauxneticien/multi_quantization
Language:Python0 0
fauxneticien/OPUS-MT-train
Training open neural machine translation models
Language:Makefile1 0
fauxneticien/slasr-scripts
Data processing scripts for SLASR project
1 0
fauxneticien/w2v2-10min-replication
Replicate training wav2vec 2.0 model on just 10 minutes of Librispeech data
Language:Python2 0
fauxneticien/w2v2-fairseq-pretrain
Language:Python2 0
fauxneticien/w2v2-hf-pretrain-test
Testing wav2vec 2.0 pre-training with HuggingFace
Language:Python2 0
fauxneticien/w2v2-pretrain-dynamic-batch
Language:Python2 0
fauxneticien/Wav2vec2.0
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.
Language:Python0 0