stdKonjac
I'm currently studying for an M.Sc. degree at Tsinghua University. My research interests include multimedia retrieval and computer vision, among other areas.
Tsinghua University
Shenzhen, Guangdong, China
Pinned Repositories
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
DeepComplexCRN
GluttonousSnake
Snake game built with Qt
TVTS
Turning to Video for Transcript Sorting