simon-ging/coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
PythonApache-2.0
Stargazers
- aashiqmuhamedCarnegie Mellon University
- alex-movilaFortech
- Bloodflake
- cfoster0
- cxxzHewlett Packard Enterprise
- dicksondicksonDICKSON
- FedolodicThe University of Texas at Dallas
- fly51flyPRIS
- FruityWelsh
- jayleicnMeta AI
- jianjieluoSun Yat-sen University
- johnny7861532Taiwan
- junyuGaoNational Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences
- KaiserLew
- lizekang@ictnlp @hust-diangroup
- mehdidcJuelich Supercomputing Center (JSC), Forschungszentrum Jülich GmbH, LAION
- mengliu1991
- mzolfaghariUniversity of Freiburg, Zebracat AI
- navigatingbots
- nithinreddyy
- onlyonewater
- Oskop
- parindam
- popeye007
- raijinspecialThe Milky Way
- rm-rf-meBIT
- runvncMcAllen, TX
- simon-gingUniversity of Freiburg, Department of Computer Science
- Sy-ZhangAmazon
- TimPchelintsev@108systems
- udaypk
- weeoooweeooo
- ycxioooongZJU, CUHK, OpenMMLab
- youngfly11ShanghaiTech
- youssefavx
- zhyj3038JDJR