Pinned Repositories
ACA-Code
Matlab scripts accompanying the book "An Introduction to Audio Content Analysis" (www.AudioContentAnlysis.org)
Baby-Crying-Detect
Using Spectrum Energy Matching algorithms to detect baby's crying. Running on ARM9 demoboard and the performance is not bad.
midi_merge
Merge two midi file into a single file with two tracks.
super_random
True random number generator based on EUR-USD daily exchange rate.
tensorflow-template
A template of tensorflow projects to maximize code reuse.
waveglow_vocoder
A vocoder that can convert audio to Mel-Spectrogram and reverse with WaveGlow, with GPU.
yata
Yet Another Tools for Audio deep learning(for myself).
HudsonHuang's Repositories
HudsonHuang/waveglow_vocoder
A vocoder that can convert audio to Mel-Spectrogram and reverse with WaveGlow, with GPU.
HudsonHuang/yata
Yet Another Tools for Audio deep learning(for myself).
HudsonHuang/ACA-Code
Matlab scripts accompanying the book "An Introduction to Audio Content Analysis" (www.AudioContentAnlysis.org)
HudsonHuang/midi_merge
Merge two midi file into a single file with two tracks.
HudsonHuang/aishell-3-baseline-fc-1
The code for aishell-3 baseline acoustic model
HudsonHuang/AutoSpeech
The 1st place solution for AutoSpeech 2019.
HudsonHuang/autospeech19
3rd place solution of autospeech 2019
HudsonHuang/AutoSpeech2019
Solution for AutoSpeech Challenge 2019
HudsonHuang/autotuner
HudsonHuang/DCASE2020-Task1
Jupyter notebook for DCASE 2020 challenge Task 1
HudsonHuang/DCASE2020_task1
Code for DCASE 2020 task 1a and task 1b.
HudsonHuang/g2pM
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
HudsonHuang/Gender-Classification
Gender Classification of Speech Signals
HudsonHuang/Lenia
Lenia - Mathematical Life Forms
HudsonHuang/Markdown-Resume-Template
BAT程序员自己的简历模板分享出来了 。技术简历追求简单明了,避免没有必要的花哨修饰,大家可以fork到自己仓库中,基于这个模板进行修改。
HudsonHuang/NLNL-Negative-Learning-for-Noisy-Labels
NLNL: Negative Learning for Noisy Labels
HudsonHuang/nnAudio
Audio processing by using pytorch 1D convolution network
HudsonHuang/OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
HudsonHuang/pase
Problem Agnostic Speech Encoder
HudsonHuang/pitch_jitter_shimmer
Using praat to get pitch, jitter and shimmer parameters of voice file.
HudsonHuang/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
HudsonHuang/Realtime_AudioDenoise_EchoCancellation
HudsonHuang/ShadowsocksBio
记录一下SS的前世今生,以及一个简单的教程总结
HudsonHuang/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
HudsonHuang/spleeter
Deezer source separation library including pretrained models.
HudsonHuang/Spleeter_Android_iOS
Spleeter (Audio Seperation) NN models for Android / iOS APP
HudsonHuang/tacotron2
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
HudsonHuang/videoprocess
CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.
HudsonHuang/wechat-chatgpt
HudsonHuang/zhrtvc
中文语音克隆兼语音合成系统。Zhongwen real time voice cloning and Chinese TTS.