Pinned Repositories
DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
LabelLLM
The Open-Source Data Annotation Platform
labelU
Data annotation toolbox supports image, audio and video data.
magic-doc
magic-html
MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
OmniDocBench
A Comprehensive Benchmark for Document Parsing and Evaluation
PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
WanJuan1.0
万卷1.0多模态语料
OpenDataLab's Repositories
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
opendatalab/labelU
Data annotation toolbox supports image, audio and video data.
opendatalab/DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
opendatalab/LabelLLM
The Open-Source Data Annotation Platform
opendatalab/magic-html
opendatalab/magic-doc
opendatalab/OmniDocBench
A Comprehensive Benchmark for Document Parsing and Evaluation
opendatalab/UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
opendatalab/LOKI
[ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”
opendatalab/opendatalab-datasets
datasets resource
opendatalab/VHM
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
opendatalab/labelU-Kit
Data annotation component library --provided as NPM packages
opendatalab/MLS-BRN
[CVPR 2024] 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
opendatalab/OHR-Bench
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
opendatalab/CLIP-Parrot-Bias
ECCV2024_Parrot Captions Teach CLIP to Spot Text
opendatalab/skydiffusion
The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
opendatalab/CHARM
[ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs
opendatalab/FakeVLM
FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis
opendatalab/UrBench
[AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios”
opendatalab/LEGION
The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"
opendatalab/Miner-PDF-Benchmark
MPB (Miner-PDF-Benchmark) is an end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios.
opendatalab/WanJuan3.0
WanJuan3.0(“万卷·丝路”)一个作为综合性的纯文本语料库,采集了多个国家地区的网络公开信息、文献、专利等资料,数据总规模超1.2TB,Token总数超过300B,处于国际领先水平,首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成,每个子集的数据规模均超过150GB
opendatalab/ProverGen
[ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation"
opendatalab/CrossViewDiff
The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"
opendatalab/dsdl-sdk
opendatalab/PM4Bench
opendatalab/OpenHuEval
opendatalab/CRaFT
[AAAI25] Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning
opendatalab/.github