mel-spectrogram

There are 77 repositories under mel-spectrogram topic.

Sharad24/Neural-Voice-Cloning-with-Few-Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
Language:Python254 13 455
BShakhovsky/PolyphonicPianoTranscription
Recurrent Neural Network for generating piano MIDI-files from audio (MP3, WAV, etc.)
Language:Jupyter Notebook243 9 1241
tiberiu44/TTS-Cube
End-2-end speech synthesis with recurrent neural networks
Language:Python226 19 2445
Data-Science-kosta/Speech-Emotion-Classification-with-PyTorch
This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.
Language:Jupyter Notebook197 8 836
spotify/realbook
Easier audio-based machine learning with TensorFlow.
Language:Python117 7 16
CVxTz/audio_classification
CNN 1D vs 2D audio classification
Language:Jupyter Notebook104 4 426
MycroftAI/sonopy
A simple audio feature extraction library
Language:Python79 8 221
echocatzh/torch-mfcc
A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.
Language:Python74 2 211
zzw922cn/LPC_for_TTS
Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.
Language:Python68 6 210
rednafi/urban-sound-classification
Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)
Language:Jupyter Notebook58 4 115
zafarrafii/Zaf-Python
Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
Language:Jupyter Notebook55 1 111
zafarrafii/Zaf-Matlab
Zafar's Audio Functions in Matlab for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
Language:Jupyter Notebook47 3 014
skanderhamdi/attention_cnn_lstm_covid_mel_spectrogram
Attention-based Hybrid CNN-LSTM and Spectral Data Augmentation for COVID-19 Diagnosis from Cough Sound
Language:Python29 2 24
adasegroup/OSM-one-shot-multispeaker
Framework for one-shot multispeaker system based on Deep Learning
Language:Python19 5 04
yoyolicoris/wavenet-like-vocoder
Basic wavenet and fftnet vocoder model.
Language:Python19 3 22
ddman1101/EDM-subgenre-classifier
Code for "Deep Learning Based EDM Subgenre Classification using Mel-Spectrogram and Tempogram Features" arXiv:2110.08862, 2021.
Language:Python18 2 01
Friedrich-M/Audio-signal-classification-and-identification
基于梅尔频谱的信号分类和识别
Language:Python17 1 05
monetjoe/pianos
This study converts piano recordings to mel spectrogram and classifies them by SOTA pre-trained neural network backbones in CV. Comparative experiments show that SqueezeNet achieves a best classification accuracy of 92.37%.|该项目将钢琴录音转为为mel频谱图，使用微调后的前沿计算机视觉领域预训练深度学习骨干网络对其进行分类，对比实验可知SqueezeNet作为最优网络正确率可达92.37%
Language:Python16 2 60
renesemela/masters-thesis-music-autotagging
Master's Thesis: Automatic Tagging of Musical Compositions Using Machine Learning Methods
Language:Python16 1 03
VisionBrain/Neural_Voice_Cloning
Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)
Language:Python16 3 16
Keerthiraj-Nagaraj/cough-detection-with-transfer-learning
Cough detection with Log Mel Spectrogram, Wavelet Transform, Deep learning and Transfer learning techniques
Language:Python15 1 15
goepfert/audio_features
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
Language:JavaScript13 2 04
mariamkhmahran/gunshot-detection-system
This repository contains the Python code for a audio classification system designed to detect gunshots in urban settings.
Language:Jupyter Notebook13 2 03
mikex86/SonopyJava
Java Implementation of the Sonopy Audio Feature Extraction Library by MycroftAI
Language:Java13 1 11
baggepinnen/LPVSpectral.jl
Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.
Language:Julia12 4 66
KanikeSaiPrakash/Speech-Emotion-Recognition
Speech Emotion Recognition using Deep Learning
Language:Jupyter Notebook11 2 02
ricardokleinklein/deepMultiSpeech
Deep Multi-Speech model
Language:Python11 5 15
sh3r4zhassan/Sound-Prediction-and-Cancellation-Model
This Model analyzes and predicts the input sound and then using pretrained ANC systems cancels the input sound.
Language:Jupyter Notebook9 1 05
amirragab-ds/Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MFCCs)
Language:Jupyter Notebook8 1 00
Rumeysakeskin/dtw-compare-audio-files
Compute the MFCCs and measure (dis)similarity between two audio files using DTW
Language:Python8 1 00
zafarrafii/Zaf-Julia
Zafar's Audio Functions in Julia for audio signal analysis: STFT, inverse STFT, CQT kernel, CQT spectrogram, CQT chromagram, MFCC, DCT, DST, MDCT, inverse MDCT.
Language:Jupyter Notebook8 1 01
anirudhs123/Music-Instrument-Classification
In this project we use a Lightweight-CNN based model to classify instruments from the Freesound audio data set. We make use of Mel-Spectrogram features from the input audio data as the input to the CNN model. To add robustness to the model, we use a novel data augmentation technique based on the Cut-Mix algorithm.
Language:Jupyter Notebook7 2 02
cschen1205/cs-mel-spectrogram
Convert audio file to melgram (that is, mel-spectrogram) in .NET
Language:C#7 3 13
RBGTOP/Music-Genre-Recognition
Music genre classification using deep learning
7 1 0
SimpleKidd/Fault-Diagnosis-of-a-Rotor-Bearing-System-using-ML
Analyzing Vibrational Data of the System using Machine Learning
Language:Jupyter Notebook6 2 00
awal-ahmed/AudioViT
This repository contains different CNN methods for audio classification. It starts with canceling noise from audio. Then it converts the audio into a mel-spectrogram and trains with CNN models.
Language:Python5 1 00

mel-spectrogram

Sharad24/Neural-Voice-Cloning-with-Few-Samples

BShakhovsky/PolyphonicPianoTranscription

tiberiu44/TTS-Cube

Data-Science-kosta/Speech-Emotion-Classification-with-PyTorch

spotify/realbook

CVxTz/audio_classification

MycroftAI/sonopy

echocatzh/torch-mfcc

zzw922cn/LPC_for_TTS

rednafi/urban-sound-classification

zafarrafii/Zaf-Python

zafarrafii/Zaf-Matlab

skanderhamdi/attention_cnn_lstm_covid_mel_spectrogram

adasegroup/OSM-one-shot-multispeaker

yoyolicoris/wavenet-like-vocoder

ddman1101/EDM-subgenre-classifier

Friedrich-M/Audio-signal-classification-and-identification

monetjoe/pianos

renesemela/masters-thesis-music-autotagging

VisionBrain/Neural_Voice_Cloning

Keerthiraj-Nagaraj/cough-detection-with-transfer-learning

goepfert/audio_features

mariamkhmahran/gunshot-detection-system

mikex86/SonopyJava

baggepinnen/LPVSpectral.jl

KanikeSaiPrakash/Speech-Emotion-Recognition

ricardokleinklein/deepMultiSpeech

sh3r4zhassan/Sound-Prediction-and-Cancellation-Model

amirragab-ds/Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs

Rumeysakeskin/dtw-compare-audio-files

zafarrafii/Zaf-Julia

anirudhs123/Music-Instrument-Classification

cschen1205/cs-mel-spectrogram

RBGTOP/Music-Genre-Recognition

SimpleKidd/Fault-Diagnosis-of-a-Rotor-Bearing-System-using-ML

awal-ahmed/AudioViT