lcxrocks's Stars
gaopengcuhk/Tip-Adapter
OatmealLiu/FineR
[ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models
MCG-NJU/VideoEval
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
MCG-NJU/SPLAM
[ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
jam3scampbell/ProctorAI
The AI to keep you focused 😈
mc-lan/ClearCLIP
[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
test-time-training/ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
wusize/CLIPSelf
[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
isekai-portal/Link-Context-Learning
OI-wiki/OI-wiki
:star2: Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring dazzling arithmetic magic)
NVlabs/Bongard-HOI
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
MaxZanella/MTA
[CVPR 2024] Zero-shot method for Vision-Language Models based on a robust formulation of the MeanShift algorithm for Test-time Augmentation (MTA).
streamlit/streamlit
Streamlit — A faster way to build and share data apps.
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
yossigandelsman/second_order_lens
Official PyTorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"
yossigandelsman/clip_text_span
Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
pumpkin805/FALIP
[ECCV2024] FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
zzh-tech/InterpAny-Clearer
[ECCV2024 Oral] Clearer anytime frame interpolation & Manipulated interpolation of anything
CrossmodalGroup/LAPS
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024
muzairkhattak/PromptSRC
[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without Forgetting".
MCG-NJU/VFIMamba
[NeurIPS 2024] VFIMamba: Video Frame Interpolation with State Space Models
OpenGVLab/CaFo
[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
sarahpratt/CuPL
nnaisense/bayesian-flow-networks
This is the official code release for Bayesian Flow Networks.
MCG-NJU/PRVG
[CVIU 2024] End-to-end dense video grounding via parallel regression
sachit-menon/classify_by_description_release
karpathy/LLM101n
LLM101n: Let's build a Storyteller
mayneyao/eidos
Offline alternative to Notion. Eidos is an extensible framework for managing your personal data throughout your lifetime in one place.
OpenGVLab/LCL
[NeurIPS 2024] Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning
UCSC-VLAA/Recap-DataComp-1B
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"