twistedmove

Pinned Repositories

.emacs.d
My emacs configuration files
Language:Emacs Lisp1 1 00
61a-sp14-website
Published files displayed on CS61A website for SP14
Language:Python1 1 012
CS103
Labs and assignments completed for CSCI 103 Introduction to Programming at USC
Language:C++1 2 02
CS246
Mining Massive Data Sets from Stanford
Language:MATLAB5 1 00
CS61B-2
This code is the property of Paul Hilfinger. I do not claim any ownership
Language:Java1 1 04
kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
Language:Shell1 1 00
pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors
Language:Perl1 1 00
SoftClip
JUCE Audio Plugin that implements a range of soft clipping algorithms. This plugin uses a GenericAudioProcessorEditor and has two parameters (input gain and algorithm choice)
Language:C++1 1 00
Speaker-Recognition
PyTorch implementation using high-level framework Catalyst of the paper Utterance-level Aggregation For Speaker Recognition In The Wild
Language:Python1 1 00
Wings-of-the-Points
Language:HTML1 1 00

twistedmove's Repositories

twistedmove/AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.
Language:Python0 0
twistedmove/aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
Language:Python0 0
twistedmove/asr_project_template
Template for ASR project
Language:Python0 0
twistedmove/ASV-Anti-Spoofing-DADA
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
Language:Python0 0
twistedmove/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
0 0
twistedmove/byol-a
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
Language:Python1 0
twistedmove/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Language:Python0 0
twistedmove/Duality-Temporal-Channel-Frequency-Attention-Enhanced-Speaker-Representation-Learning
Unofficial implementation of https://arxiv.org/abs/2110.06565 (for speaker verification)
Language:Python1 0
twistedmove/ECAPATDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Language:Python0 0
twistedmove/EPSANet
EPSANet
Language:Python1 0
twistedmove/hyperion
Python toolkit for speech processing
Language:Python1 0
twistedmove/LAGConv
lagconv
Language:Python0 0
twistedmove/Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
Language:Python0 0
twistedmove/MST-GCN
This is the official implemntation for "Multi-scale spatial temporal graph convolutional network for skeleton-based action recognition" AAAI-2021
Language:Python0 0
twistedmove/New-Grad-Positions-2022
A collection of New Grad full time roles in SWE, Quant, and PM.
0 0
twistedmove/RawBoost-antispoofing
This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Speaker Verification Anti-Spoofing".
Language:Python1 0
twistedmove/solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
Language:Python0 0
twistedmove/sr_labs_book
The project is related to the development of labs for the ITMO Speaker Recognition Course.
Language:Jupyter Notebook0 0
twistedmove/ssl-for-slr
Collection of self-supervised models for speaker and language recognition tasks.
Language:Jupyter Notebook1 0
twistedmove/SSL_Anti-spoofing
This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation".
Language:Python0 0
twistedmove/StreamingSpeakerDiarization
Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"
Language:Python0 0
twistedmove/TorchSSL
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
Language:Python0 0
twistedmove/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Jupyter Notebook0 0
twistedmove/TVConv
[CVPR 2022] TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
Language:Python0 0
twistedmove/TWIST
Official codes: Self-Supervised Learning by Estimating Twin Class Distribution
Language:Python0 0
twistedmove/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Language:Python0 0
twistedmove/VisionXformer
Vision Xformers
Language:Python0 0
twistedmove/WAEN
Wavelet Attention Embedding Networks for Video Super-Resolution (ICPR 2020) - Official Repository
Language:Python0 0
twistedmove/WaveletAttention
Wavelet-Attention CNN for Image Classification
Language:Python1 0
twistedmove/WaveMix
2D discrete Wavelet Transform for Image Classification
Language:Python0 0