JoshuaMathias
NLP Data Scientist. Computational Linguistics at UW, Computer Science and Spanish Translation at BYU
USA
JoshuaMathias's Stars
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
iterative/dvc
🦉 Data Versioning and ML Experiments
arangodb/arangodb
🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.
nmslib/hnswlib
Header-only C++/Python library for fast approximate nearest neighbors
facebook/duckling
Language, engine, and tooling for expressing, testing, and evaluating composable language rules on input strings.
facebookresearch/StarSpace
Learning embeddings for classification, retrieval and ranking.
ploomber/ploomber
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
salesforce/ctrl
Conditional Transformer Language Model for Controllable Generation
datamade/parserator
🔖 A toolkit for making domain-specific probabilistic parsers
alvations/pywsd
Python Implementations of Word Sense Disambiguation (WSD) Technologies.
dbamman/litbank
Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.
rspeer/langcodes
A Python library for working with and comparing language codes.
markriedl/WikiPlots
A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor.
dbamman/book-nlp
Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)
NIHOPA/NLPre
Python library for Natural Language Preprocessing (NLPre)
vilcans/screenplain
Write your screenplay in plain text and run it through this program to make it look good
ringgaard/sling
SLING - A natural language frame semantics parser
JonathanReeve/chapterize
A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books for computational text analysis.
alexc-hollywood/screenplay-parser
Generates auto-breakdowns from Final Draft, Adobe Story, Celtx, and Fountain screenplay files.
alexc-hollywood/screen-json
Programmable/cross-platform screenplay files as JSON data documents: a screenwriting data model for the web designed to replace Final Draft Pro/Celtx/Fountain/FadeIn/Adobe Story format chaos.
drwiner/ScreenPy
Automated Screenplay Annotation for Extracting Storytelling Knowledge
akaihola/ipython_pytest
Pytest magic for IPython notebooks
ppapalampidi/SUMMER
Screenplay Summarization using Latent Narrative Structure
julianbrooke/GutenTag
Tagirijus/fountain
A Python Fountain script parser
czcorpus/InterText_server
Collaborative on-line editor for aligned parallel texts.
muzny/quoteannotator
mbwsims/literary-information-propagation
chickendude/InterText
Tool for creating and translating interlinear texts
Sn0oze/Harry-Potter-Social-Structures
Social Graphs and Interactions Final Project