wilkens's Stars
marcotcr/lime
Lime: Explaining the predictions of any machine learning classifier
stanfordnlp/CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
saffsd/langid.py
Stand-alone language identification system
JeffSackmann/tennis_atp
ATP Tennis Rankings, Results, and Stats
UKPLab/emnlp2017-bilstm-cnn-crf
BiLSTM-CNN-CRF architecture for sequence tagging
MIND-Lab/OCTIS
OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at the EACL 2021 demo track)
freelawproject/courtlistener
A fully-searchable and accessible archive of court data including growing repositories of opinions, oral arguments, judges, judicial financial records, and federal filings.
dbamman/book-nlp
Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)
martingerlach/hSBM_Topicmodel
Using stochastic block models for topic modeling
lmullen/gender
Predict Gender from Names Using Historical Data
carsonfarmer/python_geospatial
Geospatial Data in Python Tutorial Materials
tedunderwood/DataMunging
Scripts that clean up OCR and munge Hathi metadata.
recrm/ArchiveTools
A collection of tools for archiving and analysing the internet.
dbamman/anlp21
Data and code to support "Applied Natural Language Processing" (INFO 256, Fall 2021, UC Berkeley)
jmhessel/FightingWords
Quick implementation of Monroe et al.'s algorithm for comparing languages
htrc/htrc-feature-reader
Tools for working with HTRC Feature Extraction files
kshirley/LDAtools
R package to fit LDA topic models (deprecated)
tedunderwood/LIS590DSH
tedunderwood/character
Data and code for analyzing language associated with fictional characters.
htrc/ht-text-prep
htrc/HTRC-WorksetToolkit
Python SDK for Data API and Solr API access
tedunderwood/genre
Code for Understanding Genre in a Collection of a Million Volumes.
dbamman/comphumF20
gyauney/shakespeare-and-company-social-readership
Code to accompany "The Afterlives of Shakespeare and Company in Online Social Readership"
sandeepsoni/mobility-books
Mobility of characters in fiction
htrc/ACS-TT
ACS: The Trace of Theory
mimno/info-3350-fall-2019
tedunderwood/hathimetadata
Metadata for English-language fiction and poetry beyond 1923 in HathiTrust Digital Library.
tedunderwood/pmla-scripts
Data for the 1924-2006 PMLA model, plus scripts to turn it into a Gephi network.
SangrinLee/Oldbook_Project
Associated with the Oldbook Correction Project at Northwestern University, M.S. Computer Science (2017-2018)