Pinned Repositories
alto-ocr-confidence
calculate OCR confidence per page in ALTO
alto-ocr-text
extract text from ALTO file
alto-tools
Python tools for performing various operations on ALTO XML files
hip21_ocrevaluation
A Survey of OCR Evaluation Tools and Metrics (HIP'21)
layout_gallery
vDHd2021 experiment
ner-corpora
Named Entity Recognition corpus for (historical) Dutch, French, German
ocr-conversion
Conversions between various OCR formats
ocr-gt
OCR & Ground Truth Resources
page-to-text
extract text from PAGE file
cneud's Repositories
cneud/ocr-gt
OCR & Ground Truth Resources
cneud/ocr-conversion
Conversions between various OCR formats
cneud/alto-tools
Python tools for performing various operations on ALTO XML files
cneud/alto-ocr-text
extract text from ALTO file
cneud/hip21_ocrevaluation
A Survey of OCR Evaluation Tools and Metrics (HIP'21)
cneud/ner-corpora
Named Entity Recognition corpus for (historical) Dutch, French, German
cneud/alto-ocr-confidence
calculate OCR confidence per page in ALTO
cneud/layout_gallery
vDHd2021 experiment
cneud/page-to-text
extract text from PAGE file
cneud/warcbase
Warcbase is an open-source platform for managing and analyzing web archives
cneud/cneud.github.io
cneud/newspaper-page-classification
Fast classification of newspaper pages using fastai
cneud/ocr-data
cneud/web-wf-design
Forked from http://code.google.com/p/taverna/source/browse/portal/web-wf-design/trunk/web-wf-design/ for experimenting
cneud/interoperability-framework
Interoperability layer supporting the loose coupling of software components developed during the IMPACT project
cneud/alto-editor
Browser based post correction tool for Alto XML files
cneud/altoedit-2.0
edit the alto directly in the xml
cneud/deep-wittgenstein
Classification of Wittgenstein's remarks
cneud/dta_emb
Train word embeddings on DTA texts using fastText
cneud/EN-data_mining
Data Mining Historical Newspaper Metadata (METS/ALTO formats)
cneud/hack4europe
Javascript based portal for searching Europeana collections and creating enrichments on the metadata
cneud/ner-app
Named Entity Recognition tool for Europeana Newspapers
cneud/oldlab
cneud/sbbulk
cneud/scape-tavernahadoop-demonstrator
SCAPE demonstrator project for Taverna and Hadoop
cneud/stringmetric
:dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).