agangzz

Pinned Repositories

A-Simple-Heuristic-based-PT-algorithm
Language:MATLAB0 2 00
AdversarialAudioSeparation
Code accompanying the paper "Semi-supervised adversarial audio source separation applied to singing voice extraction"
Language:Python00
AFRCNN-For-Speech-Separation
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
Language:Python0 1 00
AlignmentDuration
lyrics-to-audio-alignement system. Decoding with Viterbi forced alignment. Note duration aware decoding
Language:Python04
annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Language:C++10
ASP
Audio Signal Processing Python Tools
Language:Python1 2 00
kaldi-online2-gmm-decode
目前kaldi online2的解码均为wav形式，修改支持语音流的形式
Language:C++1 2 04
pyemd
Accurate, efficient Earth Mover's Distance for Python (and MATLAB).
Language:C1 2 00
SherlockMidi
android MIDI synthesizer (soundfont2)
Language:Java10 2 03
Sigmatizm
A virtual additive synthesizer for Linux and Windows with MIDI support.
Language:HTML4 2 01

agangzz's Repositories

agangzz/arranger
An AI for Automatic Instrumentation
agangzz/audio2midi
Language:Python1 0
agangzz/AudioFile
A simple C++ library for reading and writing audio files.
Language:C++1 0
agangzz/audioseal
Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector
Language:Python0 0
agangzz/automatic_melody_harmonization
melody harmoniztion using orderless NADE, chord balancing and blocked Gibbs sampling
Language:Python1 0
agangzz/Catch-A-Waveform
Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
Language:Python1 0
agangzz/charsiu
Charsiu: A neural phonetic aligner.
Language:Jupyter Notebook0 0
agangzz/clpcnet
Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)
Language:Python0 0
agangzz/ConvNeXt
Code release for ConvNeXt model
Language:Python1 0
agangzz/DeepAFx-ST
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
Language:Python0 0
agangzz/deepperformer
Deep Performer: Score-to-audio music performance synthesis
0 0
agangzz/denoising-historical-recordings
A two-stage U-Net for high-fidelity denoising of historical recordings
Language:Python0 0
agangzz/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Language:Python0 0
agangzz/e2e_lfmmi
E2E system with LF-MMI; word N-gram for Mandarin
Language:Python0 0
agangzz/FullSubNet-plus
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
Language:Python0 0
agangzz/mctx
Monte Carlo tree search in JAX
Language:Python0 0
agangzz/MelSpecVAE
Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis
Language:Jupyter Notebook0 0
agangzz/MidiTok
A convenient MIDI tokenizer for Deep Learning networks, with multiple encoding strategies
Language:Python1 0
agangzz/MixCycle
agangzz/MuseMorphose
PyTorch implementation of MuseMorphose, a Transformer-based model for music style transfer.
Language:Python1 0
agangzz/Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
Language:Jupyter Notebook0 0
agangzz/RapidASR
A Cross platform implementation of Wenet ASR inference. It's based on ONNXRuntime and Wenet. We provide a set of easier APIs to call wenet models.
Language:C++0 0
agangzz/RAVE
Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder
Language:Python0 0
agangzz/RAVE-audition
VST/AU Plugin for Auditioning RAVE Models in Real-time
Language:C++0 0
agangzz/rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Language:MATLAB0 0
agangzz/ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
Language:Python0 0
agangzz/streaming-source-separation
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
agangzz/Supervised-Learning-for-Multi-Zone-Sound-Field-Reproduction-under-Harsh-Environmental-Conditions
This repository provides the source code that was used to create the data for the paper "Supervised Learning for Multi Zone Sound Field Reproduction under Realistic Conditions".
Language:MATLAB1 0
agangzz/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Language:C++0 0
agangzz/you-only-hear-once
Language:Jupyter Notebook0 0