/notebooks

Repository for deepdoctection tutorial notebooks

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Deep Doctection Logo

A Document AI Package - Jupyter notebook tutorials

In this repo you will find jupyter notebooks that used to be in the main repo deepdoctection.

In addition, the repo contains a folder with examples that are used in the notebooks.

The following notebooks are stored:

Get_Started.ipynb:

  • Introduction to deepdoctection
  • Analyzer
  • Output structure.

Datasets_and_Eval.ipynb:

  • Creation of custom datasets
  • Evaluation
  • Fine tuning models

Custom_Pipeline.ipynb:

  • Model catalog and registries
  • Predictors
  • Pipeline backbones
  • Pipelines.

Using_LayoutLM_for_sequence_classification.ipynb:

  • Fine tuning LayoutLM for sequence classification on a custom dataset
  • Evaluation Building and running a production pipeline

Running_pre_trained_models_from_other_libraries.ipynb

  • Installing and running pre-trained models provided by Layout-Parser
  • Adding new categories

To use the notebooks deepdoctection must be installed.