Pinned Repositories
projet_OBBC_AppPy
Application web python-Flask sur un corpus de chants populaires bretons réalisée dans le cadre du cours de développement applicatif du Master TNAH-ENC
KaMI-app
Web application to evaluate transcription task (HTR/OCR) based on Python KaMI-lib package.
KaMi-lib
HTR / OCR models evaluation agnostic Python package, originally based on the Kraken transcription system.
2TAT
2TAT (Tiny Text Annotation Template) : a customized text annotator canvas using Flask-RecogitoJS
L-TERRIEL_memoireDeStage_M2TNAH_ENC
Mémoire de stage et annexes pour le Master 2 Technologies numériques appliquées à l'histoire (TNAH) de l'École nationale des chartes.
semanticat
Annotation tool (NER) for XML documents (TEI, EAD) - WIP
spacyfishing
A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata
text_cleaner_cli
tool to clean textual data
xslt-transformation-d
XSLT transformation and validation Dataiku DSS plugin
Lucaterre's Repositories
Lucaterre/spacyfishing
A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata
Lucaterre/semanticat
Annotation tool (NER) for XML documents (TEI, EAD) - WIP
Lucaterre/2TAT
2TAT (Tiny Text Annotation Template) : a customized text annotator canvas using Flask-RecogitoJS
Lucaterre/L-TERRIEL_memoireDeStage_M2TNAH_ENC
Mémoire de stage et annexes pour le Master 2 Technologies numériques appliquées à l'histoire (TNAH) de l'École nationale des chartes.
Lucaterre/text_cleaner_cli
tool to clean textual data
Lucaterre/xslt-transformation-d
XSLT transformation and validation Dataiku DSS plugin
Lucaterre/cours-data-processing
Introduction to web scraping and text data pre-processing
Lucaterre/Creative-Commons-Markdown
Markdown-formatted Creative Commons licenses
Lucaterre/ehri-entity-matcher
A small single page application to bulk-match names again database entities and export structured data
Lucaterre/entity-fishing
A machine learning tool for fishing entities
Lucaterre/Gobrief
Lucaterre/htr-united
Ground Truth Resources for the HTR of patrimonial documents
Lucaterre/inception
INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.
Lucaterre/inception-external-recommender
Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-service compatible with the external recommender API of INCEpTION.
Lucaterre/kraken-ocr-data
Data used or made up in an (successful) OCR attempt using kraken.
Lucaterre/kraken-ocr-htr-app-st
A mini streamlit app to test segmentation & recognition with kraken OCR/HTR engine
Lucaterre/Lucaterre
My repo
Lucaterre/Lucaterre.github.io
My Website
Lucaterre/nlp-pie-taggers
Extension for pie to include taggers with their models and pre/postprocessors
Lucaterre/Notebooks_stats_NER_corpus
Notebooks collection to make basic statitics on NER corpus during campaign (reusable)
Lucaterre/projet_OBBC_AppPy
Application web python-Flask sur un corpus de chants populaires bretons réalisée dans le cadre du cours de développement applicatif du Master TNAH-ENC
Lucaterre/quote_generator
Quote generator, a Flask project exercise to generate a random quote
Lucaterre/Rock_n_Rocket_Game
Space game in Python 3
Lucaterre/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Lucaterre/SyncBackup
Lightweight CLI solution for backup data (sync + versioning)
Lucaterre/template-software-paper-dh
Attempt at creating a discussion or creating a template for Software Paper in DH
Lucaterre/Tutoriels
Lucaterre/WebAnnoTSV-converter
CLI prototype to read, write and transform webanno tsv 3.2 format files.
Lucaterre/word_founder
Small tool to help find the start of the offset of a word and its length in a very long text quickly
Lucaterre/xmi2conll
Simple CLI to convert any annotated document in UIMA CAS XMI to CONLL format (IOB schema)