QinghongLin
PhD student at ShowLab @ NUS. Vision+Language & Video-Understanding
National University of Singapore, Singapore
Pinned Repositories
awesome-egocentric-vision
A curated list of egocentric (first-person) vision and related area resources
DSAH
[CIKM2022] Deep Self-Adaptive Hashing for Image Retrieval
EgoVLP_episodic_memory
EgoVLP solution for NLQ & MQ, Ego4D challenges.
QinghongLin.github.io
Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
computer_use_ootb
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
EgoVLP
[NeurIPS2022] Egocentric Video-Language Pretraining
ShowUI
Repository for ShowUI: One Vision-Language-Action Model for GUI Visual Agent
UniVTG
[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
VLog
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
QinghongLin's Repositories
QinghongLin/EgoVLP_episodic_memory
EgoVLP solution for NLQ & MQ, Ego4D challenges.
QinghongLin/QinghongLin.github.io
QinghongLin/DSAH
[CIKM2022] Deep Self-Adaptive Hashing for Image Retrieval
QinghongLin/awesome-egocentric-vision
A curated list of egocentric (first-person) vision and related area resources
QinghongLin/EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
QinghongLin/QinghongLin
Config files for my GitHub profile.
QinghongLin/Video_Feature_Extractor
QinghongLin/computer_use_ootb
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
QinghongLin/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite