cpperrpr's Stars
VainF/DeepLabV3Plus-Pytorch
Pretrained DeepLabv3 and DeepLabv3+ for Pascal VOC & Cityscapes
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
simtony/BART-word-orderer
Code for "On the Role of Pre-trained Language Models in Word Ordering: A Case Study with BART, COLING 2022"
allenschmaltz/word_ordering
This repository includes code for replicating the results in the paper "Word Ordering Without Syntax" (2016).
yeyupiaoling/AudioClassification-Pytorch
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
ymcui/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
allenai/visprog
Official code for VisProg (CVPR 2023 Best Paper!)
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
microsoft/GLIP
Grounded Language-Image Pre-training
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
facebookresearch/SLIP
Code release for SLIP Self-supervision meets Language-Image Pre-training
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Yushi-Hu/PromptCap
natual language guided image captioning
microsoft/MM-REACT
Official repo for MM-REACT
OptimalScale/DetGPT
luogen1996/LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
sauradip/STALE
[ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "
joeyz0z/ConZIC
Official implementation of "ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing"
zhenyuw16/UniDetector
Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".
fundamentalvision/Uni-Perceiver
jianjieluo/OpenAI-CLIP-Feature
An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.