Pinned Repositories
3dti_AudioToolkit
3D Tune-In Toolkit is a custom open-source C++ library developed within the EU-funded project 3D Tune-In. The Toolkit provides a high level of realism and immersiveness within binaural 3D audio simulations, while allowing for the emulation of hearing aid devices and of different typologies of hearing loss.
AirPodsMotionAPI
Test Swift's AirPods Motion API in this sample project
AlignmentDuration
lyrics-to-audio-alignement system. Decoding with Viterbi forced alignment. Note duration aware decoding
ASH-IR-Dataset
An impulse response dataset for binaural synthesis of spatial audio systems on headphones
audioread
cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python
awesome-python-scientific-audio
Curated list of python software and packages related to scientific research in audio
dialogue_enhancement
collect source of dialogue enhancement
speaker-recognition
A Speaker Recognition System
stereoenhance
upmixing, downmixing, virutal surround, hrtf
Upmixer-VST-Plugin
"Upmixer" is a VST Plug-In that I am developing for my Master Thesis, using the JUCE framework.
dltaixlt's Repositories
dltaixlt/stereoenhance
upmixing, downmixing, virutal surround, hrtf
dltaixlt/dialogue_enhancement
collect source of dialogue enhancement
dltaixlt/speaker-recognition
A Speaker Recognition System
dltaixlt/3dti_AudioToolkit
3D Tune-In Toolkit is a custom open-source C++ library developed within the EU-funded project 3D Tune-In. The Toolkit provides a high level of realism and immersiveness within binaural 3D audio simulations, while allowing for the emulation of hearing aid devices and of different typologies of hearing loss.
dltaixlt/AirPodsMotionAPI
Test Swift's AirPods Motion API in this sample project
dltaixlt/ASH-IR-Dataset
An impulse response dataset for binaural synthesis of spatial audio systems on headphones
dltaixlt/audioread
cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python
dltaixlt/bear
Binaural EBU ADM Renderer
dltaixlt/ciglet
Lightweight signal processing library for audio and speech applications
dltaixlt/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
dltaixlt/dspguide
Chinese translation and reading notes
dltaixlt/DynamicAudioNormalizer
Dynamic Audio Normalizer
dltaixlt/eqMac
macOS System-wide Audio Equalizer & Volume Mixer 🎧
dltaixlt/hello-world
Just another repository.
dltaixlt/Jupyter_notebooks_AMS
MT-AMS Experimental Teaching
dltaixlt/libllsm2
Low Level Speech Model (version 2) for high quality speech analysis-synthesis
dltaixlt/libwav
A simple C library for reading/writing PCM wave (.wav) files
dltaixlt/madmom
Python audio and music signal processing library
dltaixlt/meetups
dltaixlt/mir_learn
dltaixlt/MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
dltaixlt/Neural-Network-Quadraphonic-Upmix
The simplest way to demix stereo content with decent quality and low latency.
dltaixlt/pybw64
Reader and writer for BW64 file type.
dltaixlt/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
dltaixlt/pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
dltaixlt/sap-voicebox
Speech Processing Toolbox for MATLAB
dltaixlt/Singing-voice-analysis
A pytorch model for singing-voice-analysis. (38 artists selected)
dltaixlt/speaker-recognition-collection
some collections of speaker recognition including article, code, blog, book.
dltaixlt/World
A high-quality speech analysis, manipulation and synthesis system
dltaixlt/xlt_sad