Pinned Repositories
atco2-corpus
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
audiveris-v5
New generation of Audiveris OMR
BeamformIt
BeamformIt acoustic beamforming software
CQT_toolbox_python
Constant-Q Transform Toolbox for Python/MATLAB
cylimiter
A C++/Cython audio limiter for Python.
eesen
The official repository of the Eesen project
espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
kaldi
Karel's development fork of official kaldi repo.
kaldi-io-for-python
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
KarelVesely84's Repositories
KarelVesely84/kaldi-io-for-python
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
KarelVesely84/kaldi
Karel's development fork of official kaldi repo.
KarelVesely84/atco2-corpus
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
KarelVesely84/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
KarelVesely84/CQT_toolbox_python
Constant-Q Transform Toolbox for Python/MATLAB
KarelVesely84/cylimiter
A C++/Cython audio limiter for Python.
KarelVesely84/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
KarelVesely84/fixwav
Quick utility to fix WAV files with incorrect lengths
KarelVesely84/GigaSpeech
Large, modern dataset for speech recognition
KarelVesely84/gpt4all
gpt4all: open-source LLM chatbots that you can run anywhere
KarelVesely84/greek_podcasts_asr
KarelVesely84/grive2
Google Drive client with support for new Drive REST API and partial sync
KarelVesely84/icefall
KarelVesely84/json
JSON for Modern C++
KarelVesely84/k2
FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar
KarelVesely84/kaldi-model-server
Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone
KarelVesely84/kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
KarelVesely84/kaldi_native_io
python wrapper for kaldi's native I/O
KarelVesely84/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
KarelVesely84/kaldilm
Python wrapper for kaldi's arpa2fst
KarelVesely84/lhotse
Tools for handling speech data in machine learning projects.
KarelVesely84/libriheavy
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
KarelVesely84/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
KarelVesely84/personalVAD
An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.
KarelVesely84/pocolm
Small language toolkit for creation, interpolation and pruning of ARPA language models
KarelVesely84/sherpa
Speech-to-text server framework with next-gen Kaldi
KarelVesely84/soundslike_icefall
Icefall recipe for the SoundsLike project under JSALT 2023 (voxpopuli recipe)
KarelVesely84/vocode-python
🤖 Build voice-based LLM agents. Modular + open source.
KarelVesely84/w2v2-air-traffic
KarelVesely84/wikiextractor
A tool for extracting plain text from Wikipedia dumps