layout-analysis

There are 32 repositories under layout-analysis topic.

  • Layout-Parser/layout-parser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

    Language:Python4.6k71145444
  • UglyToad/PdfPig

    Read and extract text and other content from PDFs in C# (port of PDFBox)

    Language:C#1.5k43439223
  • breezedeus/Pix2Text

    An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.

    Language:Jupyter Notebook1.5k1159146
  • mittagessen/kraken

    OCR engine for all the languages

    Language:Python66425462123
  • BobLd/DocumentLayoutAnalysis

    Document Layout Analysis resources repos for development with PdfPig.

    Language:C#53534160
  • mindspore-lab/mindocr

    A toolbox of OCR models, algorithms, and pipelines based on MindSpore

    Language:Python167149145
  • andreagemelli/doc2graph

    Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.

    Language:Jupyter Notebook10671918
  • NormXU/Layout2Graph

    An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"

    Language:Python7111210
  • JPLeoRX/detectron2-publaynet

    Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset

    Language:Python45336
  • MaitySubhajit/SelfDocSeg

    [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)

    Language:Python30432
  • dell-research-harvard/HJDataset

    A Large Dataset of Historical Japanese Documents with Complex Layouts

    Language:Jupyter Notebook28324
  • BobLd/PdfPigMLNetBlockClassifier

    Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.

    Language:C#22406
  • CaseDrive/publaynet-models

    Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset

    Language:Python19201
  • MBAigner/PDFSegmenter

    This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified and returned. Tables are retrieved formatted as a CSV.

    Language:Python19103
  • jiangnanboy/layout_analysis4j

    利用java-yolov8实现版面检测(Chinese layout detection),java-yolov8 is used to detect the layout of Chinese document images

    Language:Java17117
  • VRI-UFPR/ocrd-gbn

    OCR-D compliant toolset for optical layout recognition on historical german-language documents published in Brazil

    Language:Python9300
  • yoshihikoueno/pdfminer-layout-scanner

    A more complete example of programming with PDFMiner, which continues where the default documentation stops

    Language:Python8204
  • pleb631/PdfDet

    PdfDet aims to simplify PDF layout detect tasks for users.

    Language:Python6100
  • ppaanngggg/yolo-doclaynet

    YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis

    Language:Python61
  • VRI-UFPR/page-xml-draw

    A powerful CLI tool for visualization and encoding of PAGE-XML files

    Language:Python65102
  • os-climate/crrf-det

    A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.

    Language:C++5503
  • calfa-co/rasam-dataset

    An Open Dataset for the Recognition and Analysis of Scripts in Arabic Maghrebi (ICDAR 2021)

  • empressabyss/nordrassil

    Nordrassil is a keyboard layout that provides an elegant and balanced typing experience by its use of a thumb-alpha, emphasis on middle fingers, and de-prioritisation of pinkies.

  • heshiming/paddlefish

    A Python + C implementation for image-based PDF page layout analysis and content extraction.

    Language:C++2100
  • diegosiqueir4/deepdoctection

    A Repo For Document AI

    Language:Python1100
  • ixalodecte/filestruct

    A python package to structure files using visual and style informations

    Language:Python10
  • matthewleechen/BritishPatents

    Curating a dataset of British patents

    Language:Jupyter Notebook10
  • calfa-co/chi-know-po

    HTR ground truth of the Chi-Know-Po project (Collex Persée)

  • eustro/michael

    BA-thesis in history.

    Language:Python0100
  • majeek/scribble-segmentation

    This repository presents the code of the paper titled "Scribble Based Interactive Page Layout Segmentation Using Gabor Filter" published in ICFHR2016.

    Language:C++0110
  • calfa-co/Patrologia-Graeca

    Main repository of the CGPG project for OCR and Text Analysis of the Patrologia Graeca

  • VRI-UFPR/ocrd-page-xml-draw

    OCR-D wrapper for page-xml-draw

    Language:Python40