cyrta
Audio Researcher (Speech & Music Technology) & Data Scientist. I do machine learning in the audio domain, speech recognition, and high-performance computing.
Metamedia Technologies
Warsaw, Poland
Pinned Repositories
50languages
Corpus: a dataset of speech recordings in 50 languages
awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
broadcast-news-videos-dataset
Collection of broadcast news video clips
dscaper
A library for soundscape synthesis and augmentation - extension to add speech, dialogue and room acoustics.
GenChanSim
Generic Channel Simulator for VHF/UHF (WBHF) voice channel - in air radio voice distortion generator
ICPC2015-dataset
ICPC2015 - Dataset of International Chopin Piano Competition 2015
JsFlashC
Javascript <-> Flash (ActionScript) <-> Alchemy (C/C++) communication
nlp_workshops
Let's dive into text analysis
UrbanSounds
SONYC project - UrbanSounds - dataset of sound recordings from New York; an attempt to reproduce papers that classify the sound sources in the stream
voxceleb
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
cyrta's Repositories
cyrta/voxceleb
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
cyrta/awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
cyrta/broadcast-news-videos-dataset
Collection of broadcast news video clips
cyrta/50languages
Corpus: a dataset of speech recordings in 50 languages
cyrta/docker-kaldi
Kaldi ASR (speech-to-text engine) Docker Images
cyrta/GenChanSim
Generic Channel Simulator for VHF/UHF (WBHF) voice channel - in air radio voice distortion generator
cyrta/asteroid
The PyTorch-based audio source separation toolkit for researchers
cyrta/awesome-deep-learning-papers-reading-notes
Notes and reading list on machine learning and deep learning research publications
cyrta/awesome-linuxaudio
A list of software and resources for professional audio/video/live events production on Linux.
cyrta/cheat-scripts
because you can't remember everything
cyrta/dictaphone
Free phonetic dictionaries for automatic speech recognition
cyrta/dscaper
A library for soundscape synthesis and augmentation - extension to add speech, dialogue and room acoustics.
cyrta/nlp_workshops
Let's dive into text analysis
cyrta/sandbox
some code experiments, checks, drafts - mostly shit
cyrta/bielik_vlm
cyrta/cyrta.github.io
cyrta/docker
Docker files for development environment for python, audio analysis, computer vision and neural networks
cyrta/dockerfiles
Compilation of Dockerfiles with automated builds enabled on the Docker Registry
cyrta/drender
dRender - render dialogue using acoustics simulation and mixing to its audio form
cyrta/fftscarf
FFTScarf - A wrapper for FFT implementations dedicated to audio processing
cyrta/kaldi
This is the official location of the Kaldi project.
cyrta/kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.
cyrta/polbert
Polish BERT
cyrta/sages-python-wzorce-projektowe
Sages Python design patterns
cyrta/sdialog
Synthetic Dialog Generation and Analysis with LLMs
cyrta/textnormalizer
text normalization and cleaning - cli and python package [WIP]
cyrta/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
cyrta/travis-ci-test
cyrta/wavenet_vocoder
WaveNet vocoder
cyrta/yang_vocoder