eryk-urbanski/plap

MFCC From Existing Libraries

Opened this issue · 0 comments

Creation of a module that allows the calculation of Mel-Frequency Cepstral Coefficients using existing open-source libraries. The libraries to use in order of importance: Librosa, Spafe, pyAudioAnalysis. The goal of this module is to allow our own implementation's results to be compared with values obtained from other libraries. This module should be written in a way so the values from different calculations are easily accessible. For example a function called mfcc_librosa returns a numpy array containing mfcc values. This module will be expanded in the future as we implement more features and so it should be constructed as modular and extendible component,

Steps:

  1. Simple version: audio file loaded using i.e. librosa, mfcc results saved to variables, usage of different libraries for the calculations.
  2. Functional enhancement: seperate function for audio input, seperate functions for mfcc calculations for each used library

Important!
Different libraries may perform mfcc calculation differently. They may take as input only a frame (a part of the whole signal) and some may take the whole signal. These details should be checked and documented in some way (i.e. comments in code), so it is much more clear on how the results should be compared with our own implementation.