Media Monitoring of the Past
Media Monitoring of the Past - Beyond Borders: Connecting Historical Newspapers and Radio.
Switzerland
Pinned Repositories
CLEF-HIPE-2020
Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at CLEF 2020.
imp-dhSegment
Impresso version of the 'Generic framework for historical document processing'
impresso-datalab-notebooks
🔬 Impresso Datalab Notebooks
impresso-frontend
🚀 The frontend application of the Impresso WebApp http://impresso-project.ch/app
impresso-pycommons
Python module with bits of code (objects, functions) highly reusable within impresso.
impresso-schemas
Repository of JSON schemas used in the Impresso project.
impresso-text-acquisition
🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
impresso.github.io
named-entity-tutorial-dh2019
Tutorial on NE processing for Digital Humanities - DH Utrech 2019
NZZ-black-letter-ground-truth
Media Monitoring of the Past's Repositories
impresso/CLEF-HIPE-2020
Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at CLEF 2020.
impresso/impresso-text-acquisition
🛠️ Python library to import OCR data in various formats into the canonical JSON format defined by the Impresso project.
impresso/impresso-frontend
🚀 The frontend application of the Impresso WebApp http://impresso-project.ch/app
impresso/impresso-datalab-notebooks
🔬 Impresso Datalab Notebooks
impresso/impresso-interface-review
Survey of digitized newspaper interfaces
impresso/impresso-pycommons
Python module with bits of code (objects, functions) highly reusable within impresso.
impresso/impresso-schemas
Repository of JSON schemas used in the Impresso project.
impresso/impresso.github.io
impresso/impresso-datalab-starter-pack
This repository provides a basic Python notebook setup with some preinstalled Python libraries. It includes a Dockerfile for building a Docker image containing the necessary environment and a requirements.txt file listing the required Python libraries.
impresso/impresso-docker-stack
Docker stack for impresso app
impresso/llm-transcript-postcorrection
Work on OCR/ASR/HTR post-correction.
impresso/paraphrasus
impresso/dataset-challenge-lid
Ground truth dataset with language identification information for challenging news articles
impresso/impresso-language-identification
impresso/impresso-user-admin
Basic Django admin to manage user-related data in Impresso's Master DB.
impresso/newsagency-classification
Recognition of news agency mentions in historical news articles (BERT-based token classification).
impresso/transmedia
Website for the Transmedia History Conference
impresso/impresso-datalab
Impresso Datalab static Astro website
impresso/.github
Special repository to add a README to the public organisation profile.
impresso/digital-history-ch-2024
Extended abstract for the Digital History Switzerland conference using the official template
impresso/epfl-shs-hum-475
impresso/impresso-data-sanitycheck
Code to perform sanity checks on the acquired newspaper data.
impresso/impresso-essentials
⚙️ Python package highly reusable modules and functions within impresso.
impresso/impresso-jscommons
Reusable components for impresso-frontend and impresso-middle-layer
impresso/impresso-linguistic-processing
Code for running spaCy on rebuilt impresso data.
impresso/impresso-middle-layer
Middle layer API
impresso/impresso-passim
This repository contains code and sample data related to running the impresso corpus through the text reuse detection software passim.
impresso/impresso-py
Impresso Python Library to interact with the Impresso Public API
impresso/impresso-text-embedder
multilingual text vectorizer for semantic search and comparison
impresso/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.