tedunderwood
Professor of Information Sciences and English Literature at the University of Illinois, Urbana-Champaign.
University of IllinoisUrbana-Champaign
Pinned Repositories
BrowseLDA
R scripts that browse the results of LDA
character
Data and code for analyzing language associated with fictional characters.
DataMunging
Scripts that clean up OCR and munge Hathi metadata.
fiction
Project on the history of genre.
fictional-time-with-GPT4
An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.
genredistance
Exploring textual and social measures of distance between genres.
LDA
A Java package that does basic LDA, without hyperparameter optimization. Folder settings are local. Ymmv.
LIS590DSH
noveltmmeta
Code and data supporting "NovelTM Data Sets for English-Language Fiction."
paceofchange
Code and data to support the article, "How quickly do literary standards change?"
tedunderwood's Repositories
tedunderwood/DataMunging
Scripts that clean up OCR and munge Hathi metadata.
tedunderwood/fictional-time-with-GPT4
An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.
tedunderwood/fiction
Project on the history of genre.
tedunderwood/noveltmmeta
Code and data supporting "NovelTM Data Sets for English-Language Fiction."
tedunderwood/paceofchange
Code and data to support the article, "How quickly do literary standards change?"
tedunderwood/character
Data and code for analyzing language associated with fictional characters.
tedunderwood/genredistance
Exploring textual and social measures of distance between genres.
tedunderwood/horizon
Data and code to support Distant Horizons (University of Chicago Press, 2019).
tedunderwood/plot
Initial exploratory research on patterns of change across narrative time.
tedunderwood/nehuncertainty
Code used in "Broadening Access to Text Analysis by Describing Uncertainty."
tedunderwood/period-cohort
Code and data for an experiment on the relation between individual change and cohort succession in literary history.
tedunderwood/badpublicity
A presentation at MLA 2020 in Seattle, "No Such Thing as Bad Publicity: Toward a Distant Reading of Reception."
tedunderwood/is417
IS 417, Data Science in the Humanities.
tedunderwood/reviews
Parsing periodical indexes and finding book reviews, 1800-2007.
tedunderwood/hathimetadata
Metadata for English-language fiction and poetry beyond 1923 in HathiTrust Digital Library.
tedunderwood/measureperspective
Code and data to support "Machine Learning and Human Perspective."
tedunderwood/moments
Data and code to support "Why Is Literary Time Measured in Minutes?"
tedunderwood/meta2018
A temporary workspace for novelTM metadata reviewed and analyzed in summer 2018.
tedunderwood/next-twist
Code and data supporting the blog post "Can language models predict the next twist in a story?"
tedunderwood/riseandfall
Code and data supporting The Rise and Fall of Genre Differentiation in English-Language Fiction.
tedunderwood/asymmetry
Research on information-theoretic asymmetries in literary history.
tedunderwood/avant
Was the avant-garde really ahead of its time?
tedunderwood/noise
Data and code for measuring consequences of noise in digital libraries.
tedunderwood/oralarg
Code and results related to oral argument in the Supreme Court. Work in progress: Tonja Jacobi, Matthew Sag, and Ted Underwood.
tedunderwood/overlappingcategories
Python 3 code for training models in a multilabel environment where classes overlap. Based on code in the fiction repo, but with bug fixes and improvements.
tedunderwood/roles
Code for a topic modeling variant that allows for character level 'roles' as well as book-level 'themes.'
tedunderwood/time
Further research on narrative pace.
tedunderwood/biographies
Metadata and code for research on biographies, and especially contrasting biographical "character" to fiction.
tedunderwood/class
An exploratory project on representations of class (and age) in fiction.
tedunderwood/post2015