/coot-videotext

COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers