cpperrpr

cpperrpr's Stars

VainF/DeepLabV3Plus-Pytorch
Pretrained DeepLabv3 and DeepLabv3+ for Pascal VOC & Cityscapes
Language: Python · 2k stars · 451 forks
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python · 20.5k stars · 2.3k forks
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
Language: Jupyter Notebook · 6.6k stars · 553 forks
simtony/BART-word-orderer
Code for "On the Role of Pre-trained Language Models in Word Ordering: A Case Study with BART, COLING 2022"
Language:Python61
allenschmaltz/word_ordering
This repository includes code for replicating the results in the paper "Word Ordering Without Syntax" (2016).
Language:HTML207
yeyupiaoling/AudioClassification-Pytorch
A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.
Language:Python42385
ymcui/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
Language: Python · 9.7k stars · 1.4k forks
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language: Jupyter Notebook · 9.3k stars · 833 forks
allenai/visprog
Official code for VisProg (CVPR 2023 Best Paper!)
Language:Python69665
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Language: Python · 14k stars · 2.1k forks
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python · 37.1k stars · 4.6k forks
microsoft/GLIP
Grounded Language-Image Pre-training
Language: Python · 2.2k stars · 196 forks
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook · 10k stars · 975 forks
facebookresearch/SLIP
Code release for SLIP: Self-supervision meets Language-Image Pre-training
Language:Python75169
kaixindelele/ChatPaper
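Use ChatGPT to summarize arXiv papers. Accelerates the full research workflow by using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.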
Language: Python · 18.5k stars · 1.9k forks
Yushi-Hu/PromptCap
Natural language guided image captioning
Language:Python787
microsoft/MM-REACT
Official repo for MM-REACT
Language:Python93769
OptimalScale/DetGPT
Language:Jupyter Notebook75771
luogen1996/LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
Language:Python51038
ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Language: Python · 1.7k stars · 102 forks
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Language: Python · 1.2k stars · 65 forks
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Language: Jupyter Notebook · 2.9k stars · 278 forks
amusi/CVPR2024-Papers-with-Code
Collection of CVPR 2024 papers and open-source projects
18.4k stars · 2.6k forks
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language: Jupyter Notebook · 4.9k stars · 648 forks
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.7k stars · 171 forks
sauradip/STALE
[ECCV 2022] Official PyTorch implementation of the paper "Zero-Shot Temporal Action Detection via Vision-Language Prompting"
Language:Python9910
joeyz0z/ConZIC
Official implementation of "ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing"
Language:Python7317
zhenyuw16/UniDetector
Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".
Language:Python54724
fundamentalvision/Uni-Perceiver
Language:Python26921
jianjieluo/OpenAI-CLIP-Feature
Easy-to-use, efficient code for extracting OpenAI CLIP (Global/Grid) features from images and text.
Language:Python1126