W-TALC: Weakly-supervised Temporal Activity Localization and Classification

Primary language: Python. License: MIT.

Overview

This package is a PyTorch implementation of the paper W-TALC: Weakly-supervised Temporal Activity Localization and Classification, by Sujoy Paul, Sourya Roy and Amit K Roy-Chowdhury, published at ECCV 2018. The TensorFlow implementation can be found here.

Dependencies

This package depends on the following packages:

  1. PyTorch 0.4.1, Tensorboard Logger 0.1.0
  2. Python 3.6
  3. numpy, scipy among others

Data

The features for the Thumos14 and ActivityNet1.2 datasets can be downloaded here. The annotations are included with this package.

Running

This code can be run using two different datasets: Thumos14 and Thumos14reduced. The latter contains only the Thumos14 data points that have temporal boundaries. Two feature options (I3D and UNT) are available, but only for Thumos14reduced. The dataset name, along with other parameters, can be changed in options.py. The file to be executed is main.py. The results can be viewed using tensorboard logger or the text file named .log generated during execution. The options for I3D features are the ones set in options.py. For UNT features, the options to be used are as follows:

python main.py --max-seqlen 1200 --lr 0.00001 --feature-type UNT
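
For readers unfamiliar with how such flags are consumed, the following is a minimal sketch of an argparse-based options parser that accepts the three flags used in the command above. It is not the contents of options.py: the function name build_parser, the help strings and the default values are assumptions made for illustration, and the actual defaults (the I3D settings) should be read from options.py itself.

```python
import argparse


def build_parser():
    """Illustrative parser; NOT the repository's options.py."""
    parser = argparse.ArgumentParser(description="W-TALC options (sketch)")
    # Default values below are placeholders (assumptions), not the real defaults.
    parser.add_argument("--lr", type=float, default=1e-4,
                        help="learning rate")
    parser.add_argument("--max-seqlen", type=int, default=750,
                        help="maximum number of feature vectors kept per video")
    parser.add_argument("--feature-type", type=str, default="I3D",
                        choices=["I3D", "UNT"],
                        help="which pre-extracted features to load")
    return parser


# Parse the same arguments as the UNT command shown above.
args = build_parser().parse_args(
    ["--max-seqlen", "1200", "--lr", "0.00001", "--feature-type", "UNT"]
)
# argparse converts dashes in flag names to underscores in attribute names.
print(args.max_seqlen, args.lr, args.feature_type)
```

Any flag omitted on the command line falls back to its default, which is why the I3D configuration needs no extra arguments while the UNT configuration overrides three of them.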

Citation

Please cite the following work if you use this package.

@inproceedings{paul2018w,
  title={W-TALC: Weakly-supervised Temporal Activity Localization and Classification},
  author={Paul, Sujoy and Roy, Sourya and Roy-Chowdhury, Amit K},
  booktitle={Proceedings of the European Conference on Computer Vision (ECCV)},
  pages={563--579},
  year={2018}
}

Contact

Please contact the first author of the associated paper - Sujoy Paul (supaul@ece.ucr.edu) for any further queries.