Using pre-trained models with our own audio files.
vsakkas96 opened this issue · 2 comments
vsakkas96 commented
Hi, thanks for making this open source project.
I'm trying to use the pre-trained models you provide on my own audio files in order to extract the emotion and sentiment labels, but baseline.py does not seem to provide a way to use my own files.
Moreover, baseline.py loads .pkl files instead of .wav or .mp4 files. How would I go about generating similar .pkl files from my own audio so that they can be used with the pre-trained models?
Thanks.
sanzgiri commented
I am also interested in doing this. Would you be willing to share the code that extracts the text embeddings (GloVe + dimensionality reduction) and the audio embeddings (openSMILE + dimensionality reduction)? Thanks!
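
For anyone landing here with the same question, below is a minimal sketch of the general shape such a step could take: read each .wav file, compute a feature vector, and pickle the result. It is not the project's extraction code. The layout that baseline.py expects inside its .pkl files is not described in this thread, so the dict keyed by file name, the two placeholder features (RMS energy and zero-crossing rate, standing in for real openSMILE features), and the function names `placeholder_features` and `build_feature_pickle` are all assumptions made for illustration.

```python
import math
import pickle
import struct
import wave
from pathlib import Path


def placeholder_features(wav_path):
    """Return [rms_energy, zero_crossing_rate] for a 16-bit PCM WAV file.

    Assumption: these two numbers only stand in for real acoustic
    features; they are not what the pre-trained models were trained on.
    """
    with wave.open(str(wav_path), "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM audio")
        n_channels = wav.getnchannels()
        raw = wav.readframes(wav.getnframes())

    # WAV stores 16-bit samples as little-endian signed integers.
    samples = struct.unpack("<%dh" % (len(raw) // 2), raw)
    samples = samples[::n_channels]  # keep the first channel only
    if not samples:
        return [0.0, 0.0]

    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    crossings = sum(
        1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0)
    )
    return [rms / 32768.0, crossings / len(samples)]


def build_feature_pickle(wav_dir, out_path):
    """Pickle a dict mapping each WAV file's stem to its feature vector.

    Assumption: a plain {name: features} dict is a hypothetical layout,
    not necessarily the one baseline.py reads.
    """
    features = {
        path.stem: placeholder_features(path)
        for path in sorted(Path(wav_dir).glob("*.wav"))
    }
    with open(out_path, "wb") as handle:
        pickle.dump(features, handle)
    return features
```

Calling `build_feature_pickle("my_audio", "my_audio_features.pkl")` (both paths hypothetical) would write one entry per .wav file in the directory. To feed the pre-trained models, the features and the pickle layout would still have to match whatever the original pipeline produced.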