prefixRAINSTARsuffix

prefixRAINSTARsuffix's Stars

facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python30k 385 3.5k7.4k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.5k 299 1.4k2.5k
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
14.9k 194 241.4k
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook11.5k 96 3421.5k
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Language:Python6.2k 67 247904
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Language:Jupyter Notebook5.7k 76 2191.1k
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.7k 47 174277
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Language:Python3.2k 58 95318
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Language:Python2.3k 24 168177
IDEA-Research/DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
Language:Python2.2k 31 260234
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Language:Python1.9k 29 48150
Yuliang-Liu/Monkey
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Language:Python1.8k 22 126122
ytongbai/LVM
Language:Python1.7k 121 2254
microsoft/i-Code
Language:Jupyter Notebook1.7k 40 73161
eric-ai-lab/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
Language:Python842 12 4252
Yuliang-Liu/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
Language:Python431 13 2829
wenhuchen/Table-Fact-Checking
Data and Code for ICLR2020 Paper "TabFact: A Large-scale Dataset for Table-based Fact Verification"
Language:Python371 10 1152
HaozheZhao/MIC
MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU
Language:Python321 10 3115
HCIILAB/Scene-Text-Recognition-Recommendations
Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining
Language:Python318 15 937
SHI-Labs/Rethinking-Text-Segmentation
[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach
Language:Python242 17 3728
tingxueronghua/ChartLlama-code
Language:Python178 11 2317
ZhangYuanhan-AI/visual_prompt_retrieval
[NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"
Language:Python160 4 117
LukeForeverYoung/UReader
Language:Python110 3 155
IST-DASLab/OBC
Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".
Language:Python95 6 1014
lfy79001/TableQAKit
A Toolkit for Table-based Question Answering
Language:Python93 6 24
Mountchicken/Text-Recognition-on-Cross-Domain-Datasets
Improved Text recognition algorithms on different text domains like scene text, handwritten, document, Chinese/English, even ancient books
Language:Python66 1 511
MAEHCM/ICL-D3IE
Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”
Language:Python50 2 38
simplify23/MRN
Official Pytorch implementations of MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition (ICCV 2023)
Language:Python49 3 89
yale-nlp/DocMath-Eval
Language:Python11 2 1
zyuh/BDR-main
Language:Python6 1 00