jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
PythonMIT
Issues
- 2
Does python speech features have a c port?
#56 opened by dmoham1476 - 0
Pypi not updating release?
#89 opened by tjysdsg - 3
[Question:] How to capture intensity or perceived loudness of a given audio file at regular intervals
#61 opened by StanSilas - 2
High CPU Utilization
#98 opened by divyeshrajpura4114 - 3
inconsistency with librosa
#41 opened by chananshgong - 2
missing frequencies in Mel FillterBank
#83 opened by saddlekiller - 1
Error in fbank
#100 opened by tadangkhoa1999 - 1
Std of log mel-filterbank will be close to zero in some dimension when nfilt == 80.
#85 opened by TeaPoly - 0
Use another augmented assignment statement
#99 opened by elfring - 2
logfbank functionstrange winstep size
#79 opened by LeeYongHyeok - 0
viseme generation
#96 opened by AhmadManzoor - 0
[Question:] inverse fbank back to wav
#95 opened by matdtr - 1
Data augmentation using VTLP
#58 opened by bernardohenz - 2
- 0
Minor issue on round vs. floor
#93 opened by keithchugg - 0
Can I use this MFCC function on edf file
#92 opened by tayyabafaisal - 1
Frame length is greater than FFT size
#91 opened by jiwidi - 0
Reason for not windowing by default?
#90 opened by mrullmi - 5
How to ignore the NFFT warning
#88 opened by igo312 - 0
Cannot read audio file sklearn.
#87 opened by ScarletMcLearn - 1
sample audio link is invalid
#86 opened by belm0 - 1
logfbank interface exist error, lack of winfunc
#81 opened by guker - 0
Reading Spectogram
#82 opened by 5y - 0
I do not found code about inverse DFT
#80 opened by Joefi - 2
ImportError: cannot import name 'sigproc'
#72 opened by MarcosBarraza - 2
Regarding: How to cite
#78 opened - 2
How to turn wav in to (N,1)?
#51 opened by KeyKy - 3
WARNING:root
#74 opened by Sunilaryal18 - 3
The max number of numcep
#75 opened by mmmmayi - 14
Question about hamming window length
#73 opened by leeeeeeo - 1
- 2
logfbank missing energy
#59 opened by bhigy - 0
Missing requirements in setup.py
#64 opened by mhsmith - 1
rfft vs fft
#65 opened by mohjaba - 2
Question about length of mfcc output array
#66 opened by pkolb - 3
num_frames = 1 + math.floor(...) ?
#68 opened by lezasantaizi - 3
- 6
Functions hanging on OS X
#57 opened by 96imranahmed - 0
The parameter of the window type
#60 opened by Racial - 3
can't get same result as compute-mfcc-feats.
#50 opened by bjtommychen - 1
- 1
What is the window functions?
#54 opened by song-qun - 2
39 dimensional MFCC
#53 opened by dmoham1476 - 1
Import Error
#49 opened by ngragaei - 7
obtain the noise data
#38 opened by YueWenWu - 2
Length of MFCC arrays
#40 opened by Klervix - 1
Inverse MFCC to signal and rate
#44 opened by WeiJenLee - 2
MFCC extraction using Hann window
#45 opened by radmar10 - 1
Duplicate operations, ineficient implementation for power spectrum (np.square after np.absolute)
#46 opened by radmar10 - 2
Why am I getting double the frames that I am expecting? [Answer: I was using stereo audio [facepalm]]
#42 opened by collinalexbell