water1905

water1905's Stars

facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.8k 424 4.2k6.4k
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Language:Python21.4k 155 2703.1k
mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Language:Jupyter Notebook9.5k 185 5661.3k
facebookresearch/ConvNeXt
Code release for ConvNeXt model
Language:Python5.8k 32 130701
pengzhiliang/MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Language:Python2.6k 24 97342
pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Language:Python2.6k 73 944664
michuanhaohao/reid-strong-baseline
Bag of Tricks and A Strong Baseline for Deep Person Re-identification
Language:Python2.3k 52 218575
CoinCheung/pytorch-loss
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Language:Python2.2k 23 39374
bytedance/music_source_separation
Language:Python1.3k 27 64197
haoheliu/voicefixer
General Speech Restoration
Language:Python1.1k 18 59132
KinWaiCheuk/nnAudio
Audio processing by using pytorch 1D convolution network
Language:Python1k 20 6390
ildoonet/pytorch-gradual-warmup-lr
Gradually-Warmup Learning Rate Scheduler for PyTorch
Language:Python982 11 18125
justinsalamon/audio_to_midi_melodia
Extract the melody from an audio file and export to MIDI
Language:Python588 32 24104
wenet-e2e/WenetSpeech
A 10000+ hours dataset for Chinese speech recognition
Language:Shell511 6 2549
andreasveit/densenet-pytorch
A PyTorch Implementation for Densely Connected Convolutional Networks (DenseNets)
Language:Python467 4 9144
macosforge/alac
The Apple Lossless Audio Codec (ALAC) is a lossless audio codec developed by Apple and deployed on all of its platforms and devices.
Language:C++363 29 2264
meinardmueller/libfmp
libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)
Language:Python199 4 1118
mimbres/neural-audio-fp
Language:Python186 7 4225
facebookresearch/BinauralSpeechSynthesis
N/A
Language:Python170 20 319
DTennant/reid_baseline_with_syncbn
Reimplementation of Bag of Tricks and A Strong Baseline for Deep Person Re-identification
Language:Python159 3 2035
wq2012/VoiceIdentityBook
《声纹技术：从核心算法到工程实践》
157 5 819
SoundScapeRenderer/ssr
Main source code repository for the SoundScape Renderer
Language:C++134 18 9954
Apm5/ImageNet_ResNet_Tensorflow2.0
Train ResNet on ImageNet in Tensorflow 2.0; ResNet 在ImageNet上完整训练代码
Language:Python84 2 532
seongmin-kye/meta-SR
Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
Language:Python73 8 1019
polarch/Array-Response-Simulator
A set of routines that simulate array responses for sensors with arbitrary geometry and directional characteristics.
Language:Matlab49 7 016
zafarrafii/CQHC-Python
Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.
Language:Jupyter Notebook26 1 11
JensAhrens/soundfieldsynthesis
Matlab code for the book "Analytic Methods of Sound Field Synthesis"
Language:MATLAB24 2 111
AME430/Towards-Training-Explainable-Singing-Quality-Assessment-Network-with-Augmented-Data
Codes for paper -- Towards Training Explainable Singing Quality Assessment Network with Augmented Data
Language:Python131
seongmin-kye/CAP
Cross attentive pooling for speaker verification (IEEE SLT, 2021)
Language:Python12 1 16
shanwangshan/Low-latency_deep_clustering_for_speech_separation
Language:Python3 1 11

water1905

water1905's Stars

facebookresearch/fairseq

lucidrains/vit-pytorch

mozilla/TTS

facebookresearch/ConvNeXt

pengzhiliang/MAE-pytorch

pytorch/audio

michuanhaohao/reid-strong-baseline

CoinCheung/pytorch-loss

bytedance/music_source_separation

haoheliu/voicefixer

KinWaiCheuk/nnAudio

ildoonet/pytorch-gradual-warmup-lr

justinsalamon/audio_to_midi_melodia

wenet-e2e/WenetSpeech

andreasveit/densenet-pytorch

macosforge/alac

meinardmueller/libfmp

mimbres/neural-audio-fp

facebookresearch/BinauralSpeechSynthesis

DTennant/reid_baseline_with_syncbn

wq2012/VoiceIdentityBook

SoundScapeRenderer/ssr

Apm5/ImageNet_ResNet_Tensorflow2.0

seongmin-kye/meta-SR

polarch/Array-Response-Simulator

zafarrafii/CQHC-Python

JensAhrens/soundfieldsynthesis

AME430/Towards-Training-Explainable-Singing-Quality-Assessment-Network-with-Augmented-Data

seongmin-kye/CAP

shanwangshan/Low-latency_deep_clustering_for_speech_separation