simon-ging/coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
PythonApache-2.0
Stargazers
- lizekangBeijing
- popeye007
- mzolfaghariGermany
- navigatingbots
- simon-gingGermany
- jayleicnSeattle
- jianjieluoGuangzhou & Beijing, China
- fly51flyBeiJing
- ycxioooongHong Kong
- rm-rf-me
- mengliu1991
- junyuGaoChina
- KaiserLew
- Bloodflake
- FruityWelsh
- runvncMcAllen, TX
- aashiqmuhamedPittsburgh
- TimPchelintsevMayapur, West Bengal, India
- youssefavx
- nithinreddyy
- raijinspecialThe Milky Way
- cfoster0
- parindam
- cxxzPalo Alto, CA
- dicksondickson
- FedolodicRichardson, Texas
- mehdidcGermany
- johnny7861532Taiwan
- Oskop
- Sy-ZhangSanta Clara, CA
- alex-movilaIasi
- onlyonewater
- zhyj3038beijing
- youngfly11Shanghai China
- udaypk
- weeoooweeooo