dltaixlt

Pinned Repositories

3dti_AudioToolkit
3D Tune-In Toolkit is a custom open-source C++ library developed within the EU-funded project 3D Tune-In. The Toolkit provides a high level of realism and immersiveness within binaural 3D audio simulations, while allowing for the emulation of hearing aid devices and of different typologies of hearing loss.
Language:C++0 0 00
AirPodsMotionAPI
Test Swift's AirPods Motion API in this sample project
Language:Swift0 0 00
AlignmentDuration
Lyrics-to-audio alignment system. Decoding with Viterbi forced alignment. Note-duration-aware decoding.
Language:Python0 0 00
ASH-IR-Dataset
An impulse response dataset for binaural synthesis of spatial audio systems on headphones
Language:HTML0 0 00
audioread
cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python
Language:Python0 0 00
awesome-python-scientific-audio
Curated list of Python software and packages related to scientific research in audio
0 0 00
dialogue_enhancement
Collected sources on dialogue enhancement
Language:MATLAB1 0 00
speaker-recognition
A Speaker Recognition System
Language:C++1 0 00
stereoenhance
Upmixing, downmixing, virtual surround, HRTF
Language:Jupyter Notebook4 1 00
Upmixer-VST-Plugin
"Upmixer" is a VST Plug-In that I am developing for my Master Thesis, using the JUCE framework.
Language:C++4 0 06

dltaixlt's Repositories

dltaixlt/stereoenhance
Upmixing, downmixing, virtual surround, HRTF
Language:Jupyter Notebook4 1 00
dltaixlt/dialogue_enhancement
Collected sources on dialogue enhancement
Language:MATLAB1 0 00
dltaixlt/speaker-recognition
A Speaker Recognition System
Language:C++1 0 00
dltaixlt/3dti_AudioToolkit
3D Tune-In Toolkit is a custom open-source C++ library developed within the EU-funded project 3D Tune-In. The Toolkit provides a high level of realism and immersiveness within binaural 3D audio simulations, while allowing for the emulation of hearing aid devices and of different typologies of hearing loss.
Language:C++0 0 00
dltaixlt/AirPodsMotionAPI
Test Swift's AirPods Motion API in this sample project
Language:Swift0 0 00
dltaixlt/ASH-IR-Dataset
An impulse response dataset for binaural synthesis of spatial audio systems on headphones
Language:HTML0 0 00
dltaixlt/audioread
cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python
Language:Python0 0 00
dltaixlt/bear
Binaural EBU ADM Renderer
Language:C++0 0
dltaixlt/ciglet
Lightweight signal processing library for audio and speech applications
Language:C0 0
dltaixlt/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.
Language:Python0 01
dltaixlt/dspguide
Chinese translation and reading notes
Language:Jupyter Notebook0 0
dltaixlt/DynamicAudioNormalizer
Dynamic Audio Normalizer
Language:C++0 0
dltaixlt/eqMac
macOS System-wide Audio Equalizer & Volume Mixer 🎧
Language:Swift0 0
dltaixlt/hello-world
Just another repository.
dltaixlt/Jupyter_notebooks_AMS
MT-AMS Experimental Teaching
Language:HTML0 0
dltaixlt/libllsm2
Low Level Speech Model (version 2) for high quality speech analysis-synthesis
Language:C0 0
dltaixlt/libwav
A simple C library for reading/writing PCM wave (.wav) files
Language:C0 0
dltaixlt/madmom
Python audio and music signal processing library
Language:Python1 0
dltaixlt/meetups
1 0
dltaixlt/mir_learn
Language:Jupyter Notebook1 0
dltaixlt/MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.
dltaixlt/Neural-Network-Quadraphonic-Upmix
The simplest way to demix stereo content with decent quality and low latency.
dltaixlt/pybw64
Reader and writer for BW64 file type.
Language:Python1 0
dltaixlt/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
Language:Python0 0
dltaixlt/pysptk
A Python wrapper for the Speech Signal Processing Toolkit (SPTK).
Language:Python0 0
dltaixlt/sap-voicebox
Speech Processing Toolbox for MATLAB
Language:MATLAB0 0
dltaixlt/Singing-voice-analysis
A PyTorch model for singing voice analysis (38 artists selected).
Language:Python0 0
dltaixlt/speaker-recognition-collection
A collection of speaker recognition resources, including articles, code, blogs, and books.
0 01
dltaixlt/World
A high-quality speech analysis, manipulation and synthesis system
Language:C++0 0
dltaixlt/xlt_sad
Language:C1 0