linhuixiao
Ph.D. in Institution of Automation, Chinese Academy of Sciences
UCAS, Chinese Academy of SciencesBeijing, China
Pinned Repositories
adapter-transformers
Huggingface Transformers + Adapters = ❤️
adapting-CLIP
ALBEF
Code for ALBEF: a new vision-language pre-training method
AlphabetRecognizer
Simple Optical Character Recognizer (english-ocr-image-to-text-recognition-sample-trainig-alphabet-photo-data-database-dataset)
apollo
An open autonomous driving platform
Awesome-Visual-Grounding
A Survey on Visual Grounding
CLIP-VG
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
DenseRelationalCaptioning
Code of Dense Relational Captioning
HiVG
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
TransVG
linhuixiao's Repositories
linhuixiao/CLIP-VG
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
linhuixiao/HiVG
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
linhuixiao/Awesome-Visual-Grounding
A Survey on Visual Grounding
linhuixiao/adapter-transformers
Huggingface Transformers + Adapters = ❤️
linhuixiao/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
linhuixiao/awesome-open-vocabulary-object-detection
linhuixiao/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
linhuixiao/awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.
linhuixiao/Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
linhuixiao/Books
My book list
linhuixiao/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
linhuixiao/CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
linhuixiao/DataOptim
linhuixiao/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
linhuixiao/DN-DETR
[CVPR 2022 Oral]Official implementation of DN-DETR
linhuixiao/GLIP
Grounded Language-Image Pre-training
linhuixiao/Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Speech Inputs
linhuixiao/GroundingDINO
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
linhuixiao/llama
Inference code for Llama models
linhuixiao/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
linhuixiao/mdetr
linhuixiao/ml-ferret
linhuixiao/mmdetection
OpenMMLab Detection Toolbox and Benchmark
linhuixiao/NLP-Interview-Notes
该仓库主要记录 NLP 算法工程师相关的面试题
linhuixiao/open_clip
An open source implementation of CLIP.
linhuixiao/OV-DETR
[Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)
linhuixiao/ovr-cnn
A new framework for open-vocabulary object detection, based on maskrcnn-benchmark
linhuixiao/paper-reading
深度学习经典、新论文逐段精读
linhuixiao/Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
linhuixiao/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities