Attentive spatio-temporal representation learning for diving classification

(under preparation)