20050710212's Stars
facebookresearch/Noresqa
This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.
MihawkHu/DCASE2020_task1
Code for DCASE 2020 task 1a and task 1b.
csteinmetz1/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
JorenSix/Olaf
Olaf: Overly Lightweight Acoustic Fingerprinting is a portable acoustic fingerprinting system.
rwightman/efficientdet-pytorch
A PyTorch impl of EfficientDet faithful to the original Google impl w/ ported weights
nltk/nltk
NLTK Source
microsoft/CNTK
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
louis-she/torchscript-demos
A brief of TorchScript by MNIST
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
qiuqiangkong/audioset_tagging_cnn
lukemelas/EfficientNet-PyTorch
A PyTorch implementation of EfficientNet
facebookresearch/ConvNeXt
Code release for ConvNeXt model
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
onnx/onnx
Open standard for machine learning interoperability
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
YuanGongND/psla
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
facebookresearch/vissl
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
microsoft/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
GoogleCloudPlatform/ml-design-patterns
Source code accompanying O'Reilly book: Machine Learning Design Patterns
mynameisfiber/high_performance_python
Code for the book "High Performance Python" by Micha Gorelick and Ian Ozsvald with OReilly
cs231n/cs231n.github.io
Public facing notes page
pythonprofilers/memory_profiler
Monitor Memory usage of Python code
facebookresearch/SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Spijkervet/CLMR
Official PyTorch implementation of Contrastive Learning of Musical Representations
music-classification/tutorial
2021 ISMIR tutorial - music classification
pohanchi/AALBERT
The official repository for Audio ALBERT
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit