Pinned Repositories
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
enum-derived
Derive new functionality for Rust macros
w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
zellij
A terminal workspace with batteries included
tomschelsen's Repositories
tomschelsen/IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
tomschelsen/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding