speech-dataset
There are 24 repositories under speech-dataset topic.
aishoot/Speech_Feature_Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
hetpandya/youtube_tts_data_generator
A python library to generate speech dataset from Youtube videos
fjxmlzn/RNN-SM
[T-IFS] RNN-SM: Fast Steganalysis of VoIP Streams Using Recurrent Neural Network
manankshastri/Trigger-Word-Detection
Construct a speech dataset and implement an algorithm for trigger word detection (sometimes also called keyword detection, or wakeword detection).
gauthelo/kallaama-speech-dataset
A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.
Rumeysakeskin/Speech-Datasets-for-ASR
Download speech datasets (English and non-English) for Automatic Speech Recognition
petrichorwq/DECRO-dataset
Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.
MahtaFetrat/ManaTTS-Persian-Speech-Dataset
ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
revsic/speechset
Numpy-librosa implementation of Speech dataset pipeline
ina-foss/InaGVAD
Voice activity detection and speaker gender segmentation audiovisual corpus
KanishkNavale/Speech-Emotion-Recognition
A simple CNN-LSTM deep neural model using Tensorflow to classify emotions from a speech dataset
Ralireza/PSDR
Persian spoken digit recognition
neuralwork/speech-collector
A full-stack webapp for collecting and managing speech datasets.
MahtaFetrat/GPTInformal-Persian-Speech-Dataset
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
cyrta/50languages
Corpus, dataset of speech recording in 50 languages
nafiuny/voice_conversion_dataset
top dataset for voice conversion models
PanosAntoniadis/fast-recorder
Simple script that creates a speech dataset quickly
seanpm2001/AI2001_Category-Audio-SC-Speeches
🧠️🖥️2️⃣️0️⃣️0️⃣️1️⃣️🎼️🎷️ The audio:speeches category for AI2001, containing speech datasets
MahtaFetrat/VirgoolInformal-Speech-Dataset
A dataset of informal Persian audio and text chunks, along with a fully open processing pipeline, suitable for ASR and TTS tasks. Created from crawled content on virgool.io.
Nexdata-AI/2-People-New-Zealand-English-Average-Tone-Speech-Synthesis-Corpus
2-People-New-Zealand-English-Average-Tone-Speech-Synthesis-Corpus
Nexdata-AI/393-Hours-Korean-Children-Speech-Data-by-Mobile-Phone
393-Hours-Korean-Children-Speech-Data-by-Mobile-Phone