catherine-qian's Stars
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
google/lyra
A Very Low-Bitrate Codec for Speech Compression
xialeiliu/Awesome-Incremental-Learning
Awesome Incremental Learning
google-research/uda
Unsupervised Data Augmentation (UDA)
JDAI-CV/FaceX-Zoo
A PyTorch Toolbox for Face Recognition
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
qiuqiangkong/audioset_tagging_cnn
google-research/fixmatch
A simple method to perform semi-supervised learning with limited data.
clovaai/voxceleb_trainer
In defence of metric learning for speaker recognition
AndreyGuzhov/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
YU1ut/MixMatch-pytorch
Code for "MixMatch - A Holistic Approach to Semi-Supervised Learning"
craffel/mir_eval
Evaluation functions for music/audio information retrieval/signal processing algorithms.
raoyongming/DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
EndlessSora/DeeperForensics-1.0
[CVPR 2020] A Large-Scale Dataset for Real-World Face Forgery Detection
chaiyujin/glow-pytorch
pytorch implementation of openai paper "Glow: Generative Flow with Invertible 1×1 Convolutions"
sony/ai-research-code
TaoRuijie/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
lucidrains/routing-transformer
Fully featured implementation of Routing Transformer
mczhuge/Kaleido-BERT
💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
rishikksh20/MLP-Mixer-pytorch
Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision
Data-Science-kosta/Speech-Emotion-Classification-with-PyTorch
This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.
ildoonet/unsupervised-data-augmentation
Unofficial PyTorch Implementation of Unsupervised Data Augmentation.
MihawkHu/DCASE2020_task1
Code for DCASE 2020 task 1a and task 1b.
gemengtju/SpEx_Plus
SpEx+(tied) source code
DavidDiazGuerra/Cross3D
Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
UttaranB127/speech2affective_gestures
This is the official implementation of the paper "Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning".
sharathadavanne/seld-dcase2021
Baseline method for sound event localization task of DCASE 2021 challenge
l3das/L3DAS21
sunits/Reverberated_WSJ_2MIX
Code to simulate a reverberated, noisy version of the WSJ-2MIX dataset
kantologist/deep-clustering-1
deep clustering method for single-channel speech separation