Pezz89
Researching application of audio signal processing to improve outcomes for hearing impaired and cochlear implant users.
Southampton, UK
Pezz89's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Textualize/textual
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
charmbracelet/gum
A tool for glamorous shell scripts
pavlobu/deskreen
Deskreen turns any device with a web browser into a secondary screen for your computer. ⭐️ Star to support our work!
PySimpleGUI/PySimpleGUI
Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.
marceloprates/prettymaps
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.
TomSchimansky/CustomTkinter
A modern and customizable python UI-library based on Tkinter
ange-yaghi/engine-sim
Combustion engine simulator that generates realistic audio.
vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
spotify/pedalboard
A Python library for audio.
pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
rentruewang/koila
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
rdbende/Sun-Valley-ttk-theme
A gorgeous theme for Tkinter/ttk, based on the Sun Valley visual style ✨
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and its generalization abilities.
dynamicslab/pysindy
A package for the sparse identification of nonlinear dynamical systems from data
ahmedkhalf/project.nvim
The superior project management solution for neovim.
facebookresearch/svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers, in which we present a new method for separating a mixed audio sequence in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while keeping the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
AckslD/nvim-neoclip.lua
Clipboard manager neovim plugin with telescope integration
pgjones/tozo
tky823/ssspy
A Python toolkit for sound source separation.
google/audio-to-tactile
Feeling sound with tactile interfaces.
PyTTaMaster/PyTTa
Python in Technical Acoustics and Vibration
maj4e/pyrirtool
Measuring room impulse responses with python and sounddevice
litanli/wavenet-time-series-forecasting
Akascape/TkDial
Tkinter Dial-Knob widgets
PierreKieffer/tag
Git utility to create tags in order to identify specific releases
mohead/Bimodal-Cochlear-Implant-Speech-Intelligibility-Model
This is a functional model that predicts the speech reception thresholds of bimodal cochlear implant users in dB SNR