catherine-qian's Stars

xmu-xiaoma666/External-Attention-pytorch
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding the papers in more depth.⭐⭐⭐
Language: Python 11.3k 104 801.9k
google/lyra
A Very Low-Bitrate Codec for Speech Compression
Language: C++ 3.8k 113 125356
xialeiliu/Awesome-Incremental-Learning
Awesome Incremental Learning
3.7k 132 46562
google-research/uda
Unsupervised Data Augmentation (UDA)
Language: Python 2.2k 44 113312
JDAI-CV/FaceX-Zoo
A PyTorch Toolbox for Face Recognition
Language: Python 1.9k 41 158433
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Language: Python 1.8k 20 180187
qiuqiangkong/audioset_tagging_cnn
Language: Python 1.3k 14 68249
google-research/fixmatch
A simple method to perform semi-supervised learning with limited data.
Language: Python 1.1k 19 63172
clovaai/voxceleb_trainer
In defence of metric learning for speaker recognition
Language: Python 1k 29 173272
AndreyGuzhov/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
Language: Python 746 17 091
YU1ut/MixMatch-pytorch
Code for "MixMatch - A Holistic Approach to Semi-Supervised Learning"
Language: Python 632 12 37129
craffel/mir_eval
Evaluation functions for music/audio information retrieval/signal processing algorithms.
Language: Python 600 27 251112
raoyongming/DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
Language: Jupyter Notebook 555 10 4470
EndlessSora/DeeperForensics-1.0
[CVPR 2020] A Large-Scale Dataset for Real-World Face Forgery Detection
Language: Python 534 36 1470
chaiyujin/glow-pytorch
PyTorch implementation of the OpenAI paper "Glow: Generative Flow with Invertible 1×1 Convolutions"
Language: Python 504 15 2779
sony/ai-research-code
Language: Python 347 31 3965
TaoRuijie/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Language: Python 296 8 6766
lucidrains/routing-transformer
Fully featured implementation of Routing Transformer
Language: Python 282 12 3029
mczhuge/Kaleido-BERT
💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
Language: Python 264 3 1519
rishikksh20/MLP-Mixer-pytorch
Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision
Language: Python 207 2 327
Data-Science-kosta/Speech-Emotion-Classification-with-PyTorch
This repository contains PyTorch implementations of 4 different models for classifying emotions in speech.
Language: Jupyter Notebook 187 7 735
ildoonet/unsupervised-data-augmentation
Unofficial PyTorch Implementation of Unsupervised Data Augmentation.
Language: Python 147 13 149
MihawkHu/DCASE2020_task1
Code for DCASE 2020 task 1a and task 1b.
Language: Python 85 7 2228
gemengtju/SpEx_Plus
SpEx+(tied) source code
Language: Python 72 1 717
DavidDiazGuerra/Cross3D
Code repository for the paper "Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks"
Language: Python 67 3 111
UttaranB127/speech2affective_gestures
This is the official implementation of the paper "Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning".
Language: Python 44 2 239
sharathadavanne/seld-dcase2021
Baseline method for the sound event localization task of the DCASE 2021 challenge
Language: Python 39 3 1018
l3das/L3DAS21
Language: Python 36 6 18
sunits/Reverberated_WSJ_2MIX
Code to simulate a reverberated, noisy version of the WSJ-2MIX dataset
Language: Python 20 0 14
kantologist/deep-clustering-1
Deep clustering method for single-channel speech separation
Language: Python 1 0 0