Sanyuan-Chen's Stars
ytdl-org/youtube-dl
Command-line program to download videos from YouTube.com and other video sites
Anduin2017/HowToCook
Programmer's guide to cooking at home (Simplified Chinese only).
wandb/wandb
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
lixin4ever/Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
tomgoldstein/loss-landscape
Code for visualizing the loss landscape of neural nets
FLHonker/Awesome-Knowledge-Distillation
Awesome Knowledge-Distillation. A categorized collection of knowledge distillation papers (2014-2021).
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
htqin/awesome-model-quantization
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and the project is continuously being improved. PRs adding works (papers, repositories) that the repo has missed are welcome.
hithesis/hithesis
Hi! thesis! LaTeX thesis template for Harbin Institute of Technology
microsoft/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
JusperLee/Speech-Separation-Paper-Tutorial
Must-read papers on speech separation based on neural networks
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
ddlBoJack/Speech-Resources
Speech-related labs, companies, resources, internships, and more; recommendations and self-recommendations are welcome
sacmehta/delight
DeLighT: Very Deep and Light-Weight Transformers
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
YuanGongND/ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
cywang97/StreamingTransformer
AtmaHou/MetaDialog
Platform for few-shot natural language processing: Text Classification, Sequence Labeling.
AtmaHou/FewShotTagging
Code for ACL2020 paper: Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network
chenzhuo1011/libri_css
Libri-CSS: dataset and evaluation pipeline
Sanyuan-Chen/CSS_with_Conformer
Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.
AtmaHou/Seq2SeqDataAugmentationForLU
This repo is code for the COLING 2018 paper: Sequence-to-sequence Data Augmentation for Dialogue Language Understanding.
nlpapereading/nlpapereading
magic282/PlutoThesis
LaTeX template for bachelor's, master's, and doctoral theses at Harbin Institute of Technology (HIT)
kugwzk/DiDE
Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”
Sanyuan-Chen/C2C-DA
Code for the AAAI-2021 paper: C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling
Sanyuan-Chen/CSS_with_TSTransformer
Code for the INTERSPEECH-2021 paper: Ultra Fast Speech Separation Model with Teacher Student Learning.
Sanyuan-Chen/CSS_with_EETransformer
Code for the ICASSP-2021 paper: Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer