Pinned Repositories
2020AIChallengeSpeechRecognition
Speech recognition code for the 2020 AI Challenge
audioset_augmentor
Sound augmentation using Large-scale audio dataset (Audioset)
chatgpt-streamlit
Simple demo project with OpenAI's API and TTS
FastSpeech2
Refactored version of https://github.com/ming024/FastSpeech2
multiband_melgan
An unofficial implementation of https://arxiv.org/abs/2005.05106
pytorch_sound
A PyTorch repository for boosting sound-related deep learning tasks
PytorchDockerExample
Provides examples of PyTorch-based deep learning training models
PytorchSR
Pytorch based phoneme recognition (TIMIT phoneme classification)
source_separation
Deep learning based speech source separation using Pytorch
tacotron2-pytorch
Pytorch implementation of Tacotron 2 (https://arxiv.org/pdf/1712.05884.pdf)
AppleHolic's Repositories
AppleHolic/pytorch_sound
A PyTorch repository for boosting sound-related deep learning tasks
AppleHolic/multiband_melgan
An unofficial implementation of https://arxiv.org/abs/2005.05106
AppleHolic/audioset_augmentor
Sound augmentation using Large-scale audio dataset (Audioset)
AppleHolic/chatgpt-streamlit
Simple demo project with OpenAI's API and TTS
AppleHolic/FastSpeech2
Refactored version of https://github.com/ming024/FastSpeech2
AppleHolic/recording_studio_web
Sound Recording Studio Web Front Page
AppleHolic/SpeechInterface
A Speech Interface Toolkit for Neural Speech Synthesis
AppleHolic/music_source_separation
AppleHolic/voicefixer_main
General Speech Restoration
AppleHolic/Appleholic
AppleHolic/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
AppleHolic/AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
AppleHolic/cert-manager
Automatically provision and manage TLS certificates in Kubernetes
AppleHolic/dvector
Speaker embedding (d-vector) trained with GE2E loss
AppleHolic/fastapi-azure-auth
Easy and secure implementation of Azure AD for your FastAPI APIs 🔒 Single- and multi-tenant support.
AppleHolic/grpc-vpn
VPN supporting authentication such as Google OpenID Connect or AWS IAM ..., over GRPC.
AppleHolic/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
AppleHolic/jsii
jsii allows code in any language to naturally interact with JavaScript classes. It is the technology that enables the AWS Cloud Development Kit to deliver polyglot libraries from a single codebase!
AppleHolic/ksponspeech
Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.
AppleHolic/kubeflow
Machine Learning Toolkit for Kubernetes
AppleHolic/melgan
MelGAN implementation with Multi-Band and Full Band support...
AppleHolic/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
AppleHolic/metavoice-src
Foundational model for human-like, expressive TTS
AppleHolic/norbert
Painless Wiener filters for audio separation
AppleHolic/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
AppleHolic/seewav
Audio waveform visualisation, converts any audio to a nice video
AppleHolic/training-operator
Training operators on Kubernetes.
AppleHolic/voicefixer
General Speech Restoration
AppleHolic/WavEncoderCodes
Simple repository for handling WAV files as raw (short) data in JavaScript and Kotlin (will be added?)
AppleHolic/wavenet_vocoder
WaveNet vocoder