speech-analysis

There are 129 repositories under the speech-analysis topic.

  • jianchang512/clone-voice

    A sound cloning tool with a web interface that uses your voice or any other sound to record audio

    Language: Python, 7.5k stars
  • praat/praat

    Praat: Doing Phonetics By Computer

    Language: C, 1.5k stars
  • mmorise/World

    A high-quality speech analysis, manipulation and synthesis system

    Language: C++, 1.2k stars
  • haoheliu/voicefixer

    General Speech Restoration

    Language: Python, 1k stars
  • DmitryRyumin/INTERSPEECH-2023-24-Papers

    INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

  • gemengtju/Tutorial_Separation

    This repo summarizes the tutorials, datasets, papers, code, and tools for speech separation and speaker extraction tasks. Pull requests are welcome.

    Language: MATLAB, 448 stars
  • speechbrain/speechbrain.github.io

    The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

    Language: HTML, 364 stars
  • jcvasquezc/DisVoice

    Feature extraction from speech signals

    Language: Jupyter Notebook, 355 stars
  • Shahabks/my-voice-analysis

    My-Voice Analysis is a Python library for analyzing voice (simultaneous speech, high entropy) without the need for a transcription. It breaks up utterances and detects syllable boundaries, fundamental frequency contours, and formants.

    Language: Python, 298 stars
  • haoheliu/voicefixer_main

    General Speech Restoration

    Language: Python, 276 stars
  • Shahabks/myprosody

    A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.

    Language: Python, 237 stars
  • HidekiKawahara/legacy_STRAIGHT

    A vocoder framework that has been widely used in the research community since 1999.

    Language: MATLAB, 176 stars
  • philipperemy/tensorflow-ctc-speech-recognition

    Application of Connectionist Temporal Classification (CTC) for Speech Recognition (TensorFlow 1.0 but compatible with 2.0).

    Language: Python, 130 stars
  • at16k/at16k

    Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.

    Language: Python, 128 stars
  • JusperLee/Calculate-SNR-SDR

    Script to calculate SNR and SDR using Python

    Language: Python, 90 stars
  • LimingShi/Bayesian-Pitch-Tracking-Using-Harmonic-model

    Pitch detection and pitch tracking, voiced/unvoiced detection (VAD)

    Language: MATLAB, 90 stars
  • google/localized-narratives

    Localized Narratives

    Language: HTML, 83 stars
  • CSTR-Edinburgh/magphase

    MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

    Language: Python, 78 stars
  • Speech-Interaction-Technology-Aalto-U/itsp

    Introduction to Speech Processing

    Language: Jupyter Notebook, 72 stars
  • RichardHladik/outotune

    An open-source harmonizer implementation leveraging the DISTRHO Plugin Framework.

    Language: C++, 66 stars
  • mjpyeon/wavenet-classifier

    Keras Implementation of DeepMind's WaveNet for Supervised Learning Tasks

    Language: Python, 64 stars
  • hyeonsangjeon/computing-Korean-STT-error-rates

    A Python function package for computing the character error rate (CER) and word error rate (WER) of the output scripts of Korean sentence STT recognizers

    Language: Python, 58 stars
  • lennes/spect

    SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/

    Language: HTML, 56 stars
  • HidekiKawahara/SparkNG

    MATLAB real-time/interactive speech tools. This series is obsolete; SP3ARK will be the up-to-date series.

    Language: MATLAB, 55 stars
  • jcvasquezc/NeuroSpeech

    Toolkit to assess speech impairments in patients with neurological disorders

    Language: C++, 51 stars
  • MontrealCorpusTools/PolyglotDB

    Language data store and linguistic query API

    Language: Python, 39 stars
  • alessandroragano/scoreq

    SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

    Language: Python, 37 stars
  • msalhab96/SNR-Estimation-Using-Deep-Learning

    An implementation of frame-level speech signal-to-noise ratio estimation using deep learning

    Language: Jupyter Notebook, 33 stars
  • HidekiKawahara/YANGstraight_source

    Analytic signal-based source information analysis for YANGstraight and real-time interactive tools

    Language: MATLAB, 32 stars
  • tabahi/WebSpeechAnalyzer

    JS speech analyzer for fast speech analysis and labeling

    Language: JavaScript, 32 stars
  • praaline/Praaline

    Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora

    Language: C, 27 stars
  • praweshd/speech_emotion_recognition

    This project compares the performance of two speech emotion recognition methods (SVM vs. Bi-LSTM RNN). Conventional classifiers that use machine learning algorithms have been used for decades to recognize emotions from speech. In recent years, however, deep learning methods have taken center stage and gained popularity for their ability to perform well without hand-crafted input features. Speech emotion on sets obtained from the RAVDESS corpus is classified using a conventional Support Vector Machine (SVM), and its performance is compared to that of a bidirectional long short-term memory (LSTM) network.

    Language: Jupyter Notebook, 26 stars
  • operrotin/GFM-IAIF

    Glottal Flow Model-based Iterative Adaptive Inverse Filtering

    Language: MATLAB, 23 stars
  • ringabout/scim

    [WIP] Speech recognition toolbox written in Nim. Based on Arraymancer.

    Language: Nim, 23 stars
  • LinkonBSMRSTU/Speech-To-Text-App-iOS

    A simple iOS app that converts speech into text. Only English is supported for now. Built with Swift 5, AVKit, and Speech.

    Language: Swift, 22 stars
  • type-a/speechnet

    Automatic Speech Recognition

    Language: Python, 20 stars