wangbinDL

@ Shanghai AI Laboratory, China

Pinned Repositories

DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Language: Python 967 7 9874
MinerU
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.
Language: Python 28.9k 146 1.2k 2.2k
OmniDocBench
A Comprehensive Benchmark for Document Parsing and Evaluation
Language: Python 288 9 3226
PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Language: Python 7.1k 51 162489
UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Language: Python 286 9 4329
VIGC
AAAI 2024: Visual Instruction Generation and Correction
Language: Python 91 5 153
Academic-project-page-template
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Language: JavaScript 0 0 0 0
Automatic-Speech-Recognition-from-Scratch
A minimal Seq2Seq example of Automatic Speech Recognition (ASR) based on Transformer
Language: Python 0 0 0 0
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
0 0 0 0
MinerU
MinerU is a one-stop, open-source, high-quality data extraction tool that supports PDF/webpage/e-book extraction.
Language: Python 1 0 0 0

wangbinDL's Repositories

wangbinDL/MinerU
MinerU is a one-stop, open-source, high-quality data extraction tool that supports PDF/webpage/e-book extraction.
Language: Python 1 0 0 0
wangbinDL/Academic-project-page-template
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Language: JavaScript 0 0 0 0
wangbinDL/Automatic-Speech-Recognition-from-Scratch
A minimal Seq2Seq example of Automatic Speech Recognition (ASR) based on Transformer
Language: Python 0 0 0 0
wangbinDL/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
0 0 0 0
wangbinDL/BERT-pytorch
Google AI 2018 BERT pytorch implementation
Language: Python 0 0 0 0
wangbinDL/HA-DPO
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
Language: Python 0 0 0 0
wangbinDL/InternLM
Official release of InternLM2 7B and 20B base and chat models. 200K context support
Language: Python 0 0 0 0
wangbinDL/InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
Language: Python 0 0 0 0
wangbinDL/VIGC-demo
Language: Python 0 0 0 0
wangbinDL/wangbinDL.github.io
Homepage
Language: SCSS 0 1 0 0
wangbinDL/DocLayout-YOLO
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
wangbinDL/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Language: Python
wangbinDL/learn-rst
How hard is it to move from Markdown to reStructuredText?
Language: Python 0 0
wangbinDL/OmniDocBench
A Comprehensive Benchmark for Document Parsing and Evaluation
Language: Python
wangbinDL/streamlit_quick_start
Language: Python 1 0
wangbinDL/texify
Math OCR model that outputs LaTeX and markdown
Language: Python 0 0
wangbinDL/thesisuestc
ThesisUESTC: graduation thesis template for the University of Electronic Science and Technology of China (UESTC)
Language: TeX 0 0
wangbinDL/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)