speech-analysis
There are 129 repositories under the speech-analysis topic.
jianchang512/clone-voice
A voice cloning tool with a web interface; use your own voice or any other sound to record audio.
praat/praat
Praat: Doing Phonetics By Computer
mmorise/World
A high-quality speech analysis, manipulation and synthesis system
haoheliu/voicefixer
General Speech Restoration
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 and 2024 conferences. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
gemengtju/Tutorial_Separation
This repo summarizes tutorials, datasets, papers, code, and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.
speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems, including speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
jcvasquezc/DisVoice
Feature extraction from speech signals.
Shahabks/my-voice-analysis
My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need for a transcription. It breaks utterances and detects syllable boundaries, fundamental frequency contours, and formants.
haoheliu/voicefixer_main
General Speech Restoration
Shahabks/myprosody
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.
HidekiKawahara/legacy_STRAIGHT
A vocoder framework that has been widely used in the speech research community since 1999.
philipperemy/tensorflow-ctc-speech-recognition
Application of Connectionist Temporal Classification (CTC) to speech recognition (TensorFlow 1.0, compatible with 2.0).
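As a rough, framework-agnostic illustration of the CTC training objective such a project optimizes (not the repository's TensorFlow code; this sketch uses PyTorch's `nn.CTCLoss` as a stand-in, and every shape and tensor below is made up):

```python
import torch
import torch.nn as nn

# T = time steps from the acoustic model, N = batch size, C = label set size (incl. blank).
T, N, C = 50, 4, 28                                            # e.g. 26 letters + space + blank
logits = torch.randn(T, N, C, requires_grad=True)              # stand-in for network output
log_probs = logits.log_softmax(dim=-1)                         # CTCLoss expects log-probabilities

targets = torch.randint(1, C, (N, 10), dtype=torch.long)       # padded label sequences (0 = blank)
input_lengths = torch.full((N,), T, dtype=torch.long)          # valid frames per utterance
target_lengths = torch.randint(5, 11, (N,), dtype=torch.long)  # valid labels per utterance

ctc = nn.CTCLoss(blank=0, zero_infinity=True)
loss = ctc(log_probs, targets, input_lengths, target_lengths)
loss.backward()                                                # gradients flow back to the logits
print("CTC loss:", loss.item())
```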
at16k/at16k
Trained models for automatic speech recognition (ASR), and a library to quickly build applications that require speech-to-text conversion.
JusperLee/Calculate-SNR-SDR
A script to calculate SNR and SDR using Python.
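For reference only (not a copy of this repository's script), both metrics can be sketched directly in NumPy; the signals below are synthetic, and the SDR here is the simple projection-based, non-scale-invariant variant:

```python
import numpy as np

def snr_db(clean, noisy):
    """Signal-to-noise ratio in dB, treating (noisy - clean) as the noise."""
    noise = noisy - clean
    return 10.0 * np.log10(np.sum(clean ** 2) / np.sum(noise ** 2))

def sdr_db(reference, estimate):
    """Simple signal-to-distortion ratio in dB: project the estimate onto the
    reference and treat the residual as distortion."""
    alpha = np.dot(estimate, reference) / np.dot(reference, reference)
    target = alpha * reference
    distortion = estimate - target
    return 10.0 * np.log10(np.sum(target ** 2) / np.sum(distortion ** 2))

# Toy usage with a synthetic tone plus additive noise.
rng = np.random.default_rng(0)
clean = np.sin(2 * np.pi * 220 * np.arange(16000) / 16000)
noisy = clean + 0.1 * rng.standard_normal(clean.shape)
print(f"SNR: {snr_db(clean, noisy):.2f} dB, SDR: {sdr_db(clean, noisy):.2f} dB")
```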
LimingShi/Bayesian-Pitch-Tracking-Using-Harmonic-model
Pitch detection and pitch tracking with voiced/unvoiced detection (VAD).
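The repository implements a Bayesian harmonic-model tracker; purely to illustrate what per-frame pitch detection with a voiced/unvoiced decision produces, here is a naive autocorrelation estimator (the thresholds and function name are arbitrary assumptions, not the repository's method):

```python
import numpy as np

def autocorr_pitch(frame, sr, fmin=60.0, fmax=400.0, energy_thresh=1e-4):
    """Naive per-frame pitch estimate via autocorrelation.
    Returns 0.0 for frames judged unvoiced (low energy or weak periodicity)."""
    frame = frame - np.mean(frame)
    if np.mean(frame ** 2) < energy_thresh:            # crude unvoiced decision
        return 0.0
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lag_min, lag_max = int(sr / fmax), int(sr / fmin)  # search plausible F0 range
    lag = lag_min + int(np.argmax(ac[lag_min:lag_max]))
    if ac[lag] < 0.3 * ac[0]:                          # weak periodicity -> unvoiced
        return 0.0
    return sr / lag

# Toy usage: a 200 Hz tone should come out voiced at ~200 Hz.
sr = 16000
t = np.arange(int(0.03 * sr)) / sr
print(autocorr_pitch(np.sin(2 * np.pi * 200 * t), sr))
```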
google/localized-narratives
Localized Narratives
CSTR-Edinburgh/magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
Speech-Interaction-Technology-Aalto-U/itsp
Introduction to Speech Processing
RichardHladik/outotune
An open-source harmonizer implementation leveraging the DISTRHO Plugin Framework.
mjpyeon/wavenet-classifier
Keras Implementation of DeepMind's WaveNet for Supervised Learning Tasks
hyeonsangjeon/computing-Korean-STT-error-rates
A package of Python functions for computing the character error rate (CER) and word error rate (WER) of Korean STT (speech-to-text) recognizer output.
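Not this package's API, but as a minimal sketch of how both rates are usually defined: each is a Levenshtein edit distance normalized by the reference length, computed over words for WER and over characters for CER.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (lists or strings)."""
    dp = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        prev, dp[0] = dp[0], i
        for j, h in enumerate(hyp, 1):
            prev, dp[j] = dp[j], min(dp[j] + 1,        # deletion
                                     dp[j - 1] + 1,    # insertion
                                     prev + (r != h))  # substitution
    return dp[-1]

def wer(ref_sentence, hyp_sentence):
    ref_words = ref_sentence.split()
    return edit_distance(ref_words, hyp_sentence.split()) / len(ref_words)

def cer(ref_sentence, hyp_sentence):
    ref_chars = list(ref_sentence.replace(" ", ""))
    return edit_distance(ref_chars, list(hyp_sentence.replace(" ", ""))) / len(ref_chars)

print(wer("the cat sat", "the cat sit"))   # 1 substitution / 3 words  = 0.333...
print(cer("the cat sat", "the cat sit"))   # 1 substitution / 9 chars  = 0.111...
```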
lennes/spect
SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/
HidekiKawahara/SparkNG
MATLAB real-time/interactive speech tools. This series is obsolete; SP3ARK is (or will be) the up-to-date series.
jcvasquezc/NeuroSpeech
Toolkit to assess speech impairments in patients with neurological disorders.
MontrealCorpusTools/PolyglotDB
Language data store and linguistic query API
alessandroragano/scoreq
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
msalhab96/SNR-Estimation-Using-Deep-Learning
An implementation of frame-level speech signal-to-noise ratio (SNR) estimation using deep learning.
HidekiKawahara/YANGstraight_source
Analytic signal-based source information analysis for YANGstraight and real-time interactive tools
tabahi/WebSpeechAnalyzer
JS speech analyzer for fast speech analysis and labeling
praaline/Praaline
Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora
praweshd/speech_emotion_recognition
In this project, speech emotion recognition performance is compared between two methods (SVM vs. Bi-LSTM RNN). Conventional classifiers based on classical machine-learning algorithms have been used for decades to recognize emotions from speech. In recent years, however, deep-learning methods have taken center stage, gaining popularity for their ability to perform well without hand-crafted input features. Emotions in recordings from the RAVDESS corpus are classified with a conventional Support Vector Machine (SVM), and its performance is compared to that of a bidirectional long short-term memory (Bi-LSTM) network.
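Purely as a sketch of the conventional SVM baseline described above (not the project's actual pipeline): mean MFCCs as features and an RBF-kernel SVM as the classifier. The audio below is synthetic and the labels are placeholders; `librosa` and `scikit-learn` are assumed to be available, and in practice the clips and emotion labels would come from the RAVDESS files.

```python
import numpy as np
import librosa                                   # MFCC extraction (assumed available)
from sklearn.svm import SVC
from sklearn.model_selection import train_test_split

def mfcc_features(y, sr, n_mfcc=13):
    """Summarize a clip as the per-coefficient mean of its MFCCs."""
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
    return mfcc.mean(axis=1)

# Synthetic stand-in for RAVDESS clips: tones at two pitch ranges plus noise.
sr = 16000
rng = np.random.default_rng(0)
clips = [np.sin(2 * np.pi * f * np.arange(sr) / sr) + 0.05 * rng.standard_normal(sr)
         for f in (120, 125, 130, 135, 240, 245, 250, 255)]
labels = ["low", "low", "low", "low", "high", "high", "high", "high"]  # placeholder labels

X = np.stack([mfcc_features(y, sr) for y in clips])
X_train, X_test, y_train, y_test = train_test_split(X, labels, test_size=0.25,
                                                    random_state=0, stratify=labels)
clf = SVC(kernel="rbf", C=10.0)                  # the conventional SVM baseline
clf.fit(X_train, y_train)
print("held-out accuracy:", clf.score(X_test, y_test))
```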
operrotin/GFM-IAIF
Glottal Flow Model-based Iterative Adaptive Inverse Filtering
ringabout/scim
[WIP] A speech recognition toolbox written in Nim, based on Arraymancer.
LinkonBSMRSTU/Speech-To-Text-App-iOS
A simple iOS app that converts speech/voice into text. Only English is supported for now. Built with Swift 5, AVKit, and the Speech framework.
type-a/speechnet
Automatic Speech Recognition