Pinned Repositories
data2cdf
databank_processor_deprecated
REST API for Illinois Data Bank background file processing tasks
datafiles
analyze dataset datafile features
getSolrMetadata
Read volume id list, gets metadata from HTRC Solr Proxy API, writes XML file.
HtBibWrapper
RESTful Web Service returns HathiTrust MARCXML string from given VolumeId -- jersey-based Java servlet for Tomcat
htdapiclient-getvol
Client that uses HathiTrust's Data API to serve aggregate zip files containing the images and text of a volume.
HTRC-FE-metadata-formatter
Reads a hathifiles text file one line at a time and creates HTRC Feature Extraction metadata json files
HTRC-FE-RsyncScriptGeneratorAlpha
Generates an rsync script to get Feature Extraction files from HathiTrust Reserach Center
HTRC-RegExt-Workset-Getter
Reads workset list, for each workset gets and writes workset file
mfall3's Repositories
mfall3/HTRC-FE-metadata-formatter
Reads a hathifiles text file one line at a time and creates HTRC Feature Extraction metadata json files
mfall3/data2cdf
mfall3/databank_processor_deprecated
REST API for Illinois Data Bank background file processing tasks
mfall3/datafiles
analyze dataset datafile features
mfall3/getSolrMetadata
Read volume id list, gets metadata from HTRC Solr Proxy API, writes XML file.
mfall3/HtBibWrapper
RESTful Web Service returns HathiTrust MARCXML string from given VolumeId -- jersey-based Java servlet for Tomcat
mfall3/htdapiclient-getvol
Client that uses HathiTrust's Data API to serve aggregate zip files containing the images and text of a volume.
mfall3/HTRC-FE-RsyncScriptGeneratorAlpha
Generates an rsync script to get Feature Extraction files from HathiTrust Reserach Center
mfall3/HTRC-RegExt-Workset-Getter
Reads workset list, for each workset gets and writes workset file
mfall3/HTRC-Solr-query-to-volume-list
Takes a Lucene query string, gets the volume ids from HTRC's Solr Proxy API, then writes the set of volume id to a file.
mfall3/HTRC-Terms2Vols
Reads keyword terms from file, gets volume ids from HTRC Solr Proxy API, then writes volume ids to file.
mfall3/pairtree-subsetter
create a subset of pairtree files, maintaining the pairtree structure
mfall3/setfinder
set of scripts to support discovering datasets related to articles with creators with a specified affiliation
mfall3/sql-novice-survey
Software Carpentry introduction to SQL for novices.
mfall3/uiuc-swc-2017-12-12
mfall3/VolGetter
Reads volume ids from given list in file, gets the aggregate zip files for the volumes, and writes them in a HathiTrust-style pairtree structure. Designed specifically to work with the client for HathiTrust's Data API at https://github.com/mfall3/htdapiclient-getvol .