Pezz89

Researching application of audio signal processing to improve outcomes for hearing impaired and cochlear implant users.

Southampton, UK

Pezz89's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python64.9k 542 07.6k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python32.2k 273 1.1k3.9k
Textualize/textual
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
Language:Python24.4k 165 1.8k751
charmbracelet/gum
A tool for glamorous shell scripts 🎀
Language:Go17.4k 55 316332
pavlobu/deskreen
Deskreen turns any device with a web browser into a secondary screen for your computer. ⭐️ Star to support our work!
Language:TypeScript15.5k 243 160835
PySimpleGUI/PySimpleGUI
Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.
Language:Python13.3k 230 3.6k1.8k
marceloprates/prettymaps
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.
Language:Jupyter Notebook11k 81 82516
TomSchimansky/CustomTkinter
A modern and customizable python UI-library based on Tkinter
Language:Python10.9k 107 1.3k1k
ange-yaghi/engine-sim
Combustion engine simulator that generates realistic audio.
Language:C++8.6k 122 404786
vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Language:Python8.2k 144 1.2k590
Uberi/speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Language:Python8.2k 281 6012.4k
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language:Jupyter Notebook7.5k 119 1.5k1k
spotify/pedalboard
🎛 🔊 A Python library for audio.
Language:C++5k 57 175250
pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Language:Python2.4k 73 921641
rentruewang/koila
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
Language:Python1.8k 11 2863
rdbende/Sun-Valley-ttk-theme
A gorgeous theme for Tkinter/ttk, based on the Sun Valley visual style ✨
Language:Tcl1.8k 32 103107
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Language:Python1.6k 38 149299
dynamicslab/pysindy
A package for the sparse identification of nonlinear dynamical systems from data
Language:Python1.4k 33 341304
ahmedkhalf/project.nvim
The superior project management solution for neovim.
Language:Lua1.3k 5 107121
facebookresearch/svoice
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
Language:Python1.2k 25 88175
AckslD/nvim-neoclip.lua
Clipboard manager neovim plugin with telescope integration
Language:Lua944 2 7620
pgjones/tozo
Language:TypeScript133 6 610
tky823/ssspy
A Python toolkit for sound source separation.
Language:Python118 6 10310
google/audio-to-tactile
Feeling sound with tactile interfaces.
Language:C94 8 116
PyTTaMaster/PyTTa
Python in Technical Acoustics and Vibration
Language:Python89 12 1931
maj4e/pyrirtool
Measuring room impulse responses with python and sounddevice
Language:Python65 1 115
litanli/wavenet-time-series-forecasting
Language:Python40 3 17
Akascape/TkDial
Tkinter Dial-Knob widgets
Language:Python39 1 47
PierreKieffer/tag
Git utility to create tags in order to identify specific releases
Language:Shell24 2 26
mohead/Bimodal-Cochlear-Implant-Speech-Intelligibility-Model
This is a functional model that predicts the speech reception thresholds of bimodal cochlear implant users in dB SNR
Language:MATLAB3

Pezz89

Pezz89's Stars

openai/whisper

coqui-ai/TTS

Textualize/textual

charmbracelet/gum

pavlobu/deskreen

PySimpleGUI/PySimpleGUI

marceloprates/prettymaps

TomSchimansky/CustomTkinter

ange-yaghi/engine-sim

vaexio/vaex

Uberi/speech_recognition

alphacep/vosk-api

spotify/pedalboard

pytorch/audio

rentruewang/koila

rdbende/Sun-Valley-ttk-theme

facebookresearch/denoiser

dynamicslab/pysindy

ahmedkhalf/project.nvim

facebookresearch/svoice

AckslD/nvim-neoclip.lua

pgjones/tozo

tky823/ssspy

google/audio-to-tactile

PyTTaMaster/PyTTa

maj4e/pyrirtool

litanli/wavenet-time-series-forecasting

Akascape/TkDial

PierreKieffer/tag

mohead/Bimodal-Cochlear-Implant-Speech-Intelligibility-Model