SoundNet_Pytorch

Converting the pretrained TensorFlow SoundNet model to PyTorch

Introduction

This code converts the pretrained TensorFlow SoundNet model to a PyTorch model, so it contains no training code for SoundNet. The pretrained PyTorch SoundNet model can be found here.

Prerequisites

TensorFlow (CPU or GPU)
Python 3.6 with NumPy
PyTorch 0.4+
weight file: Google Drive: https://drive.google.com/drive/folders/1zjNiuLgZ1cjCzF80P4mlYe4KSGGOFlta?usp=sharing; Baidu Netdisk: link: https://pan.baidu.com/s/1v_K2pJvo0KE38EZ__WZJWg, extraction code: iz4h

How to use

Prepare the code:

git clone https://github.com/smallflyingpig/SoundNet_Pytorch.git
cd SoundNet_Pytorch

Prepare the TensorFlow SoundNet model parameters: download sound8.npy, which is provided by eborboihuc, and save it in the current folder.
Install the prerequisites.
Run the conversion:

python tf2pytorch.py --tf_param_path ./sound8.npy --pytorch_param_path ./sound8.pth
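The sketch below illustrates the kind of work such a conversion involves; it is not the code in tf2pytorch.py. It assumes the .npy file holds a pickled dictionary of the form {layer_name: {param_name: array}} and that convolution kernels use TensorFlow's (height, width, in_channels, out_channels) layout. The key names "weights" and "biases", the helper names, and the toy file are all hypothetical.

```python
import os
import tempfile

import numpy as np


def load_tf_params(path):
    """Load a pickled {layer_name: {param_name: array}} dict from a .npy file."""
    # allow_pickle is needed because the file stores a Python dict, not a plain array.
    return np.load(path, allow_pickle=True, encoding="latin1").item()


def to_pytorch_layout(tf_params):
    """Return a flat {"layer.param": array} dict with conv kernels transposed."""
    converted = {}
    for layer, params in tf_params.items():
        for name, value in params.items():
            if name == "weights":  # hypothetical key name for conv kernels
                # TensorFlow kernel (H, W, in, out) -> PyTorch kernel (out, in, H, W).
                value = np.transpose(value, (3, 2, 0, 1))
            converted[f"{layer}.{name}"] = np.ascontiguousarray(value)
    return converted


# Toy stand-in for sound8.npy: one conv layer, 64x1 kernel, 1 -> 16 channels.
toy = {"conv1": {"weights": np.zeros((64, 1, 1, 16), dtype=np.float32),
                 "biases": np.zeros(16, dtype=np.float32)}}
toy_path = os.path.join(tempfile.mkdtemp(), "toy_params.npy")
np.save(toy_path, toy, allow_pickle=True)

state = to_pytorch_layout(load_tf_params(toy_path))
print({key: value.shape for key, value in state.items()})
```

Each converted array could then be wrapped with torch.from_numpy and stored with torch.save; how tf2pytorch.py names the resulting entries is not covered by this sketch.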

Test the result:

Download the input demo data demo.npy and save it to the current folder. We calculate the average feature error at each convolution block (7 conv blocks in total) and at the predictions for object/scene classification (2 layers), so 9 errors are output in total.

python check_layer.py --tf_param_path ./sound8.npy --pytorch_param_path ./sound8.pth --input_demo_data ./demo.npy

The expected output:

layer error:
[-1.3113022e-06, 0.0, 0.0, 0.0, 1.4901161e-08, 0.0, -6.9849193e-10, 4.7683716e-07, 7.1525574e-07]

Errors this close to zero indicate that the model conversion succeeded.
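As a rough illustration of the check above, the following sketch computes one average error per layer from two lists of feature arrays. It assumes the error is the signed mean difference between the TensorFlow and PyTorch features, which the negative values in the expected output suggest; the actual metric in check_layer.py may differ. The function name and the random toy features are hypothetical.

```python
import numpy as np


def layer_errors(reference_feats, converted_feats):
    """Signed mean difference per layer between two lists of feature arrays."""
    errors = []
    for ref, conv in zip(reference_feats, converted_feats):
        if ref.shape != conv.shape:
            raise ValueError(f"shape mismatch: {ref.shape} vs {conv.shape}")
        # Accumulate in float64 so the averaging itself adds no float32 noise.
        diff = ref.astype(np.float64) - conv.astype(np.float64)
        errors.append(float(np.mean(diff)))
    return errors


# Toy features standing in for 7 conv blocks plus 2 prediction layers.
rng = np.random.default_rng(0)
tf_feats = [rng.standard_normal((1, 16, 32)).astype(np.float32) for _ in range(9)]
pt_feats = [feat.copy() for feat in tf_feats]
pt_feats[0] = pt_feats[0] + np.float32(1e-6)  # simulate a tiny numerical drift

errors = layer_errors(tf_feats, pt_feats)
print(errors)
```

A signed mean can hide differences that cancel out, so a stricter comparison might also report the maximum absolute difference per layer.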

Extract features: once the PyTorch model has been obtained (saved as ./sound8.pth), run the following command:

python example.py

Acknowledgments

The code for the SoundNet TensorFlow model is ported from soundnet_tensorflow. Thanks to its author for the work!

FAQs

Feel free to email me (jiguo.li@vipl.ict.ac.cn or jgli@pku.edu.cn) if you have any questions about this project.

Reference

Yusuf Aytar, Carl Vondrick, and Antonio Torralba. "SoundNet: Learning Sound Representations from Unlabeled Video." Advances in Neural Information Processing Systems. 2016.