Pinned Repositories
.emacs.d
My emacs configuration files
61a-sp14-website
Published files displayed on CS61A website for SP14
CS103
Labs and assignments completed for CSCI 103 Introduction to Programming at USC
CS246
Mining Massive Data Sets from Stanford
CS61B-2
This code is the property of Paul Hilfinger. I do not claim any ownership
kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors
SoftClip
JUCE Audio Plugin that implements a range of soft clipping algorithms. This plugin uses a GenericAudioProcessorEditor and has two parameters (input gain and algorithm choice)
Speaker-Recognition
PyTorch implementation using high-level framework Catalyst of the paper Utterance-level Aggregation For Speaker Recognition In The Wild
Wings-of-the-Points
twistedmove's Repositories
twistedmove/AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.
twistedmove/aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
twistedmove/asr_project_template
Template for ASR project
twistedmove/ASV-Anti-Spoofing-DADA
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
twistedmove/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
twistedmove/byol-a
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
twistedmove/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
twistedmove/Duality-Temporal-Channel-Frequency-Attention-Enhanced-Speaker-Representation-Learning
Unofficial implementation of https://arxiv.org/abs/2110.06565 (for speaker verification)
twistedmove/ECAPATDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
twistedmove/EPSANet
EPSANet
twistedmove/hyperion
Python toolkit for speech processing
twistedmove/LAGConv
lagconv
twistedmove/Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
twistedmove/MST-GCN
This is the official implemntation for "Multi-scale spatial temporal graph convolutional network for skeleton-based action recognition" AAAI-2021
twistedmove/New-Grad-Positions-2022
A collection of New Grad full time roles in SWE, Quant, and PM.
twistedmove/RawBoost-antispoofing
This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".
twistedmove/solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
twistedmove/sr_labs_book
The project is related to the development of labs for the ITMO Speaker Recognition Course.
twistedmove/ssl-for-slr
Collection of self-supervised models for speaker and language recognition tasks.
twistedmove/SSL_Anti-spoofing
This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".
twistedmove/StreamingSpeakerDiarization
Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"
twistedmove/TorchSSL
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
twistedmove/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
twistedmove/TVConv
[CVPR 2022] TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
twistedmove/TWIST
Official codes: Self-Supervised Learning by Estimating Twin Class Distribution
twistedmove/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
twistedmove/VisionXformer
Vision Xformers
twistedmove/WAEN
Wavelet Attention Embedding Networks for Video Super-Resolution (ICPR 2020) - Official Repository
twistedmove/WaveletAttention
Wavelet-Attention CNN for Image Classification
twistedmove/WaveMix
2D discrete Wavelet Transform for Image Classification