Seerkfang

:rocket: free and curious mind :dragon:

ZJU -> UCSD -> Nvidia Research
Santa Clara, United States

Pinned Repositories

haosulab.github.io
Language: HTML | 4 8 00
minisql
Language: C++ | 0 2 00
verify_cot
Language: Python | 128 5 38
VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Language: Python | 2.7k 37 151213
mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
Language: Python | 3.6k 49 465598
K3C
K3C OPENWRT A1/B1/B1G/B2/C1/S1
Language: C | 0 0 00
Seerkfang.github.io
Language: Python | 0 1 00
verify_cot
Language: Python | 0 0 00
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language: Python | 0 0 00
large_vlm_distillation_ood
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)
Language: Python | 55 1 24

Seerkfang's Repositories

Seerkfang/K3C
K3C OPENWRT A1/B1/B1G/B2/C1/S1
Language: C | 0 0 00
Seerkfang/Seerkfang.github.io
Language: Python | 0 1 00
Seerkfang/verify_cot
Language: Python | 0 0 00
Seerkfang/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language: Python | 0 0 00