Pinned Repositories
ChiQA
The implementations of various baselines in our CIKM 2022 paper: ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding.
ego-env
Human-centric environment representations from egocentric video
GroundNLQ
The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023
ego4d_asl
code for Ego4D Workshop@CVPR 2023 - 1st in MQ & 2nd in NLQ challenge
TA2V
test
test
Exo2Ego-V
VectorFusion-pytorch
[CVPR 2023] Unofficial implementation for "VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models"
Ego4d_TalkNet_ASD
AlignEgoExo
Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment" (NeurIPS 2023)