SikaStar
I am a fifth-year PhD student at the National Engineering Lab for Video Technology, Peking University, Beijing, China.
SikaStar's Stars
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
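A minimal sketch of single-image inference with MMDetection, assuming the 2.x-style `mmdet.apis` entry points; the config and checkpoint paths below are placeholders, not files shipped with this listing.

```python
# Hypothetical paths: substitute any config/checkpoint pair from the model zoo.
from mmdet.apis import init_detector, inference_detector

config = "configs/faster_rcnn/faster_rcnn_r50_fpn_1x_coco.py"   # placeholder
checkpoint = "checkpoints/faster_rcnn_r50_fpn_1x_coco.pth"      # placeholder

# Build the detector and load pretrained weights.
model = init_detector(config, checkpoint, device="cuda:0")

# Run detection on one image; the result holds per-class box arrays.
result = inference_detector(model, "demo.jpg")
```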
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
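A minimal sketch of what PEFT does, wrapping a pretrained model so that only small LoRA adapter weights are trained; the base model and `target_modules` choice here are illustrative, not prescribed by the library.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Any Hugging Face causal LM works; GPT-2 is used here for illustration.
base = AutoModelForCausalLM.from_pretrained("gpt2")

# LoRA config: low-rank adapters injected into the attention projections.
config = LoraConfig(r=8, lora_alpha=16, target_modules=["c_attn"], lora_dropout=0.05)

model = get_peft_model(base, config)
model.print_trainable_parameters()  # only the adapter parameters are trainable
```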
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
OpenGVLab/InternGPT
InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, and more. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
ishan0102/vimGPT
Browse the web with GPT-4V and Vimium
microsoft/promptbench
A unified evaluation framework for large language models
IDEA-Research/T-Rex
[ECCV 2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
czczup/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
muzairkhattak/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
longzw1997/Open-GroundingDino
This is a third-party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
OpenGVLab/all-seeing
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World"
jianghaojun/Awesome-Parameter-Efficient-Transfer-Learning
A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.
shoumikchow/bbox-visualizer
Make drawing and labeling bounding boxes easy as cake
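A minimal sketch of bbox-visualizer (imported as `bbox_visualizer`); the image path and box coordinates are made up for illustration.

```python
import cv2
import bbox_visualizer as bbv

img = cv2.imread("demo.jpg")           # placeholder image
bbox = [50, 40, 200, 180]              # [x_min, y_min, x_max, y_max]

# Draw the box, then attach a text label above it.
img = bbv.draw_rectangle(img, bbox)
img = bbv.add_label(img, "cat", bbox)

cv2.imwrite("demo_labeled.jpg", img)
```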
lzw-lzw/GroundingGPT
[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model
LijieFan/LaCLIP
[NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"
amazon-science/prompt-pretraining
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
baaivision/CapsFusion
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
BAAI-DCAI/Visual-Instruction-Tuning
SVIT: Scaling up Visual Instruction Tuning
Surrey-UP-Lab/RegionSpot
Recognize Any Regions
CVMI-Lab/CoDet
[NeurIPS 2023] CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Koorye/DePT
[CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning"
LeapLabTHU/Rank-DETR
[NeurIPS 2023] Rank-DETR for High Quality Object Detection
yuxiaochen1103/FDT
ArsenalCheng/Meta-Adapter
[NeurIPS 2023] Meta-Adapter
Hodasia/Awesome-Vision-Language-Finetune
Awesome List of Vision Language Prompt Papers
cv516Buaa/OV-VG