tomschelsen

Pinned Repositories

IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Language: Python 460 16 13582
enum-derived
Derive new functionality for Rust macros
Language: Rust 3 1 11
w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
Language: Jupyter Notebook 27 1 55
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook 5.3k 64 966706
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Language: Python 2.4k 73 913634
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language: Python 3k 41 188328
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Language: Python 00
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook 00
zellij
A terminal workspace with batteries included
Language: Rust 18.6k 101 1.8k 582

tomschelsen's Repositories

tomschelsen/IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Language: Python 00
tomschelsen/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language: Jupyter Notebook 00