Pinned Repositories
audioset_tagging_cnn
CED
Source code for Consistent ensemble distillation for audio tagging
diff_pattern_mining
divide_lm
google-research
Google Research
icefall
knowledge_distillation
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
lhotse
Tools for handling speech data in machine learning projects.
maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
marcoyang1998's Repositories
marcoyang1998/icefall
marcoyang1998/audioset_tagging_cnn
marcoyang1998/CED
Source code for Consistent ensemble distillation for audio tagging
marcoyang1998/diff_pattern_mining
marcoyang1998/divide_lm
marcoyang1998/google-research
Google Research
marcoyang1998/knowledge_distillation
marcoyang1998/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
marcoyang1998/lhotse
Tools for handling speech data in machine learning projects.
marcoyang1998/maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
marcoyang1998/MLMI
MPhil Machine Learning and Machine Intelligence @ University of Cambridge
marcoyang1998/models
Models and examples built with TensorFlow
marcoyang1998/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
marcoyang1998/numpy-ml
Machine learning, in numpy
marcoyang1998/panns_transfer_to_gtzan
marcoyang1998/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
marcoyang1998/shadowsocks-heroku
一键部署,随处可用的 免费shadowsocks-heroku
marcoyang1998/sherpa
Speech-to-text server framework with next-gen Kaldi
marcoyang1998/sherpa-ncnn
Real-time (online/streaming) speech recognition using next-gen Kaldi with ncnn. Support embedded systems
marcoyang1998/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
marcoyang1998/whisper
Robust Speech Recognition via Large-Scale Weak Supervision