tei
There are 261 repositories under tei topic.
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
freedict/fd-dictionaries
hand-written dictionaries from the FreeDict project
eeditiones/tei-publisher-app
The main TEI Publisher app
clarin-eric/ParlaMint
ParlaMint: Comparable Parliamentary Corpora
karlb/wikdict-gen
Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project
kitodo/kitodo-presentation
Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Library Suite.
jhellingman/tei2html
XSLT stylesheets to convert TEI to HTML and ePub format.
ARTFL-Project/PhiloLogic4
PhiloLogic4
ebeshero/DHClass-Hub
a repository to help introduce and orient students to the GitHub collaboration environment, and to support DH classes.
RJP43/LiliElbe_EngagedLearners
Lili Elbe Digital Archive practicum - learning markup via an engaged markdown community. Visit our wiki!
clefourrier/EtymDB
[LREC 2020] EtymDB, an Etymological DataBase (v2.1)
d-flood/criticus
A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis.
DevsHero/db2vec
db2vec: High-performance Rust CLI to parse database dumps (.sql, .surql), generate vector embeddings via Ollama, TEI, Gemini, and load into vector databases (Pinecone, Redis, Chroma, Milvus, Qdrant, SurrealDB). Optimized for speed on large datasets.
open-editions/corpus-joyce-ulysses-tei
James Joyce's novel Ulysses in TEI XML. Work-in-progress.
open-editions/corpus-joyce-portrait-TEI
The Open Scholarly Edition of James Joyce's A Portrait of the Artist as a Young Man
BCDH/TEI-Completer
A highly customizable plugin for setting up and activating remote-driven autocompletions of attribute values in the oXygen XML Editor.
recogito/recogito-studio
Self hosting code for Recogito-Studio
deutschestextarchiv/dtabf
DTA Base Format (DTABf)
ebeshero/UpTransformation
a repository for materials related to teaching and writing on technologies of up-conversion and project development with the XML family of languages, featuring regex, XPath, XQuery, XSLT, and Schematron.
pruizf/disco
Diachronic Spanish Sonnet Corpus. Canonical and minor authors in Spanish (Europe, America and Asia): 15th to 20th century
philipallfrey/teihub
Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today!
TEI4HTR/page2tei
A repository for illustrating the transformation of a PAGE XML file into XML-TEI format, resulting from experimentations made for the LECTAUREP project.
THREAD-project/THREAD
Tools for Humanities Research and Editing of Ancient Documents
Edirom/SMuFL-Browser
A web based browser for the Standard Music Font Layout
synopsx/synopsx
SynopsX is a lightweight XML publishing framework
Edirom/WeGA-ODD
ODD files for documenting the Digital Edition of the Carl-Maria-von-Weber-Gesamtausgabe
JoshuaAPhillips/tei-iiif
TEI-IIIF converts TEI-XML into conformant IIIF Annotation manifests.
pfefferniels/probstuecke-digital
A digital edition of the 24 Probstücke of the Oberclasse by Johann Mattheson.
pierpaolosichera/NormaTEI
Analyze the content of one or more XML files. NormaTEI is designed mainly for two uses: control of encoding uniformity and encoding analysis
dracor-org/gerdracor
German Drama Corpus
d-flood/apparatus-explorer
app for viewing XML collation files, editing edges, and exporting a critical apparatus as a docx file
michmech/tei-dictionary.xsl
An XSLT stylesheet for TEI-encoded dictionaries
recogito/tei-standoffconverter-js
Convert between TEI/XML and plaintext without losing markup context.
slub/mets-mods2tei
Convert bibliographic meta data in MODS format to TEI headers