Pinned Repositories
Alarm-phase-recognition
Cover the gru-svm project
ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
audioset_tagging_cnn
Awesome-Cybersecurity-Datasets
A curated list of amazingly awesome Cybersecurity datasets
Awesome-PyTorch-Chinese
[Practical guide] The most comprehensive collection of PyTorch learning resources
HotItem
interview-question
LeetCodeProject
Notes documenting the process of completing LeetCode projects.
MachineLearning
Machine learning implementations accompanying the book "Statistical Learning Methods" (《统计学习方法》). Machine Learning.
patch-augmentation
pppku's Repositories
pppku/interview-question
pppku/LeetCodeProject
Notes documenting the process of completing LeetCode projects.
pppku/patch-augmentation
pppku/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
pppku/conformer
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
pppku/consistency_models
Official repo for consistency models.
pppku/CS-Notes
📚 Essential fundamentals for technical interviews: Leetcode, operating systems, computer networks, and system design
pppku/deit
Official DeiT repository
pppku/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
pppku/espnet
End-to-End Speech Processing Toolkit
pppku/FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
pppku/jimmy_tools
pppku/leetcode-master
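LeetCode problem-solving guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, so that learning algorithms is no longer confusing! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀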
pppku/leetcode_summary
pppku/lyra
A Very Low-Bitrate Codec for Speech Compression
pppku/mtg-jamendo-dataset
Metadata, scripts and baselines for the MTG-Jamendo dataset
pppku/Muskits
An open-source music processing toolkit
pppku/onnx-simplifier
Simplify your onnx model
pppku/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch
pppku/PaSST
Efficient Training of Audio Transformers with Patchout
pppku/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXt, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
pppku/SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
pppku/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
pppku/SVS_system
A system for singing voice synthesis
pppku/T2T-ViT
pppku/Twins
Two simple and effective vision transformer designs that are on par with the Swin Transformer
pppku/vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
pppku/viewer
ML models and internal tensors 3D visualizer
pppku/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
pppku/whisper
Robust Speech Recognition via Large-Scale Weak Supervision