Pinned Repositories
CLEF-HIPE-2020
Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at CLEF 2020.
imp-dhSegment
Impresso version of the 'Generic framework for historical document processing'
impresso-frontend
The frontend application for http://impresso-project.ch/
impresso-pycommons
Python module with bits of code (objects, functions) that are highly reusable within impresso.
impresso-schemas
Repository of JSON schemas used in the Impresso project.
impresso-text-acquisition
Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
impresso.github.io
llm-transcript-postcorrection
A repository for preliminary work on HTR/OCR/ASR post-correction based on GPT models.
named-entity-tutorial-dh2019
Tutorial on NE processing for Digital Humanities - DH Utrecht 2019
NZZ-black-letter-ground-truth
Media Monitoring of the Past's Repositories
impresso/CLEF-HIPE-2020
Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at CLEF 2020.
impresso/llm-transcript-postcorrection
A repository for preliminary work on HTR/OCR/ASR post-correction based on GPT models.
impresso/impresso-text-acquisition
Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
impresso/impresso-frontend
The frontend application for http://impresso-project.ch/
impresso/impresso-interface-review
Survey of digitized newspaper interfaces
impresso/impresso-pycommons
Python module with bits of code (objects, functions) that are highly reusable within impresso.
impresso/impresso-schemas
Repository of JSON schemas used in the Impresso project.
impresso/impresso.github.io
impresso/impresso-datalab-notebooks
Collection of notebooks to do NER tasks
impresso/impresso-datalab-starter-pack
This repository provides a basic Python notebook setup with some preinstalled Python libraries. It includes a Dockerfile for building a Docker image containing the necessary environment and a requirements.txt file listing the required Python libraries.
impresso/impresso-docker-stack
Docker stack for the impresso app
impresso/eldorado
The Eldorado workshop, supported by the impresso project, will bring together a group of historians, librarians, computer scientists and designers to discuss how digitisation is changing historical research practices.
impresso/epfl-shs-class
Set of instructions for using data in the context of the EPFL SHS class
impresso/impresso-language-identification
impresso/impresso-user-admin
Basic Django admin to manage user-related data in Impresso's Master DB.
impresso/newsagency-classification
Recognition of news agency mentions in historical news articles (BERT-based token classification).
impresso/transmedia
Website for the Transmedia History Conference
impresso/.github
Special repository to add a README to the public organisation profile.
impresso/digital-history-ch-2024
Extended abstract for the Digital History Switzerland conference using the official template
impresso/epfl-shs-hum-475
impresso/fakenews-quiz
impresso/impresso-data-sanitycheck
Code to perform sanity checks on the acquired newspaper data.
impresso/impresso-datalab
Astro-powered website for the impresso datalab space
impresso/impresso-essentials
Python package of highly reusable modules and functions within impresso.
impresso/impresso-jscommons
Reusable components for impresso-frontend and impresso-middle-layer
impresso/impresso-middle-layer
Middle layer API
impresso/impresso-passim
This repository contains code and sample data related to running the impresso corpus through the text reuse detection software passim.
impresso/impresso-py
Python module to play with the impresso public API
impresso/impresso-text-embedder
Multilingual text vectorizer for semantic search and comparison
impresso/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.