Pinned Repositories
A-Simple-Heuristic-based-PT-algorithm
AdversarialAudioSeparation
Code accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"
AFRCNN-For-Speech-Separation
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
AlignmentDuration
Lyrics-to-audio alignment system. Decoding with Viterbi forced alignment. Note-duration-aware decoding.
annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
ASP
Audio Signal Processing Python Tools
kaldi-online2-gmm-decode
Kaldi online2 decoding currently accepts only WAV input; modified to support streaming audio.
pyemd
Accurate, efficient Earth Mover's Distance for Python (and MATLAB).
SherlockMidi
Android MIDI synthesizer (SoundFont 2)
Sigmatizm
A virtual additive synthesizer for Linux and Windows with MIDI support.
agangzz's Repositories
agangzz/arranger
An AI for Automatic Instrumentation
agangzz/audio2midi
agangzz/AudioFile
A simple C++ library for reading and writing audio files.
agangzz/audioseal
Localized watermarking for AI-generated speech audio, with state-of-the-art robustness and a very fast detector
agangzz/automatic_melody_harmonization
Melody harmonization using orderless NADE, chord balancing, and blocked Gibbs sampling
agangzz/Catch-A-Waveform
Official PyTorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
agangzz/charsiu
Charsiu: A neural phonetic aligner.
agangzz/clpcnet
Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)
agangzz/ConvNeXt
Code release for ConvNeXt model
agangzz/DeepAFx-ST
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
agangzz/deepperformer
Deep Performer: Score-to-audio music performance synthesis
agangzz/denoising-historical-recordings
A two-stage U-Net for high-fidelity denoising of historical recordings
agangzz/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
agangzz/e2e_lfmmi
E2E system with LF-MMI; word N-gram for Mandarin
agangzz/FullSubNet-plus
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
agangzz/mctx
Monte Carlo tree search in JAX
agangzz/MelSpecVAE
Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis
agangzz/MidiTok
A convenient MIDI tokenizer for Deep Learning networks, with multiple encoding strategies
agangzz/MixCycle
agangzz/MuseMorphose
PyTorch implementation of MuseMorphose, a Transformer-based model for music style transfer.
agangzz/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
agangzz/RapidASR
A cross-platform implementation of WeNet ASR inference, based on ONNX Runtime and WeNet. It provides a set of simpler APIs for calling WeNet models.
agangzz/RAVE
Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder
agangzz/RAVE-audition
VST/AU Plugin for Auditioning RAVE Models in Real-time
agangzz/rVAD
MATLAB and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".
agangzz/ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
agangzz/streaming-source-separation
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
agangzz/Supervised-Learning-for-Multi-Zone-Sound-Field-Reproduction-under-Harsh-Environmental-Conditions
This repository provides the source code that was used to create the data for the paper "Supervised Learning for Multi Zone Sound Field Reproduction under Realistic Conditions".
agangzz/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
agangzz/you-only-hear-once