DTaoo/VGGish

VGGish: A VGG-like audio classification model

This repository provides a VGGish model implemented in Keras with the TensorFlow backend. It is based on the VGGish model released for AudioSet. For more details, see the original TF-Slim version.

Pretrained weights in Keras HDF5 (.h5) format:

Model with the top fully connected layers
Model without the top fully connected layers
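
To make the difference between the two weight files concrete, here is a rough sketch of the output shape each variant would produce. It is plain Python, not this repository's code: the layer sizes (96x64 log-mel input, four conv blocks each followed by 2x2 pooling, and 4096/4096/128 fully connected layers) are assumptions taken from the original TF-Slim VGGish release, and the names `conv_stack_shape` and `output_shape` are hypothetical.

```python
# Sketch only: output shapes of a VGGish-style network with and without the
# top fully connected layers. All sizes are assumptions based on the original
# TF-Slim VGGish release, not values read from this repository.

INPUT_SHAPE = (96, 64, 1)          # (frames, mel bands, channels)
CONV_BLOCKS = [64, 128, 256, 512]  # output channels of each conv block
TOP_UNITS = [4096, 4096, 128]      # fully connected layers on top


def conv_stack_shape(input_shape=INPUT_SHAPE, blocks=CONV_BLOCKS):
    """Shape after the conv blocks; each block ends in a 2x2, stride-2 pool."""
    height, width, channels = input_shape
    for out_channels in blocks:
        # 3x3 'same' convolutions keep height and width unchanged;
        # the pooling step halves both (they divide evenly here).
        height //= 2
        width //= 2
        channels = out_channels
    return (height, width, channels)


def output_shape(include_top):
    """Per-example output shape of the chosen model variant."""
    if not include_top:
        # Without the top, the model ends at the last pooling layer.
        return conv_stack_shape()
    # With the top, the feature map is flattened and passed through the
    # dense layers; the last one gives the embedding size.
    return (TOP_UNITS[-1],)


print(output_shape(include_top=False))  # (6, 4, 512)
print(output_shape(include_top=True))   # (128,)
```

Under these assumptions, the model without the top layers is the one to pick when you want to attach your own classifier to the convolutional features, while the model with the top layers yields a compact 128-dimensional embedding per input patch.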

References:

Gemmeke, J. et al., Audio Set: An ontology and human-labeled dataset for audio events, ICASSP 2017
Hershey, S. et al., CNN Architectures for Large-Scale Audio Classification, ICASSP 2017