HathiTrust Research Center
Code and artifact repository for the HTRC platform
Indiana University / University of Illinois
Pinned Repositories
ACS-TT
ACS: The Trace of Theory
Bibframe-Transform
Transform MARC records to BIBFRAME and add links
ht-text-prep
HTRC-BookwormGUI
GUI for a Bookworm web app
HTRC-DataCapsules
Secure environment for at-scale text analysis of sensitive digitized content
htrc-feature-reader
Tools for working with HTRC Feature Extraction files
HTRC-Portal
HTRC Portal
HTRC-Useful-Datasets
HTRC-WorksetToolkit
Python SDK for Data API and Solr API access
scwared
Home for general documentation about HTRC’s Mellon-funded SCWAReD project.
HathiTrust Research Center's Repositories
htrc/ht-text-prep
htrc/HTRC-FeatureExtractor
Extracts features (token counts, POS tags, etc.) from a list of HT volumes, to aid in non-consumptive research.
htrc/torchlite-backend
Backend API service for Torchlite web dashboard
htrc/HTRC-RightsAPI
htrc/HTRC-Tools-ScalaUtils
Set of utility functions and routines that reduce the boilerplate needed to accomplish some common tasks in Scala.
htrc/HTRC-Tools-SparkUtils
Library that adds useful error handling and non-serializable object management capabilities to Apache Spark applications.
htrc/torchlite-frontend
Torchlite web interface
htrc/torchlite-handbook
Hackathon Handbook
htrc/handbook
Editable files for TORCHLITE Handbook
htrc/htrc-ef-api
Extracted Features API service
htrc/HTRC-Redis-Ingester
htrc/htrc-torchlite-efapi
API access to aggregated EF data for Torchlite
htrc/torchlite-documentation
Documentation for the TORCHLITE application
htrc/HT-Bookworm-Dash
Web app for browsing HathiTrust Bookworm.
htrc/HTRC-MetadataService
A simple service for retrieving metadata for a given set of volume IDs
htrc/HTRC-Tools-HathifilesAuthorTitleMatch
Searches Hathifiles for volumes matching given author, title pairs
htrc/HTRC-Tools-RunningHeaders
Utility library that can be used for performing header/body/footer identification over a set of pages from a volume.
htrc/Metadata-bibframe-entities
Used to extract entities from BIBFRAME-XML so they can be enriched from external sources
htrc/Metadata-bibframe2jsonld
Used to convert enriched BIBFRAME-XML to HTRC metadata JSONLD
htrc/Metadata-combine-seq-files
Utility for combining sequence files
htrc/Metadata-entities-lookup
Used to look up (resolve) entities via external sources such as VIAF, LOC, and WorldCat
htrc/Metadata-extract-seq-files
Tool for extracting files out of sequence files
htrc/Metadata-extract-seqfiles-key
htrc/Metadata-marcjson2bibframexml
Converts MARC-in-JSON records to BIBFRAME XML
htrc/MicroK8-sDocumentation
htrc/scwared-black-fantastic
htrc/torchlite-argocd
ArgoCD config deployment manifests for Torchlite
htrc/torchlite-hackathon
Informational site for HTRC’s 2024 TORCHLITE Hackathon event
htrc/torchlite-notebooks
Jupyter notebooks demonstrating features of Torchlite
htrc/torchlite-publication-info
Jupyter notebook for viewing and analyzing publication information with HTRC TORCHLITE data and APIs.