Pinned Repositories
freebase-python
Python client library for old Freebase API - (deprecated) master is on Google Code. This clone is being updated to work with new Freebase v1 APIs
freebase-python-samples
Clone of Google Code project freebase-python-samples
Names
A comprehensive database of name variants
openlibrary-utils
Utilities for working with OpenLibrary
pdf2table
PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz
Places
Place-finder for genealogy
simh
The Computer History Simulation Project
simile-vicino-original
Automatically exported from code.google.com/p/simile-vicino
tfmorris's Repositories
tfmorris/openlibrary-utils
Utilities for working with OpenLibrary
tfmorris/dedupe
A free Python library for accurate and scalable deduplication and entity-resolution. *Under construction*
tfmorris/awesome-public-datasets
An awesome list of high-quality open datasets in public domains (on-going). By everyone, for everyone!
tfmorris/healthNER
tfmorris/acs4_py
Python interface to ACS4
tfmorris/apd-core
Core repo for
tfmorris/arvados
An open source platform for managing and analyzing biomedical big data
tfmorris/bamdst
A lightweight BAM file depth statistics tool
tfmorris/BLCSBGLKP_2020
Code for analysis of SARS-CoV-2 sequencing based diagnostic testing data
tfmorris/bookreader
The Internet Archive BookReader
tfmorris/boston-eatery-temporary-permit-suspensions
Boston Eating Establishment Temporary Permit Suspensions
tfmorris/cc-batch-vs-index-warc
A benchmark to explore the speed of reading WARC entries in bulk vs individually.
tfmorris/cc-crawl-statistics
Statistics of Common Crawl monthly archives mined from URL index files
tfmorris/cce-renewals
Tab-delimited versions of Catalog of Copyright Entries renewals
tfmorris/conciliator
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
tfmorris/dkpro-c4corpus
DKPro C4CorpusTools is a collection of tools for processing the CommonCrawl corpus, including Creative Commons license detection, boilerplate removal, language detection, and near-duplicate removal.
tfmorris/freebase-triples
A methodology to process triples data from the Freebase data dumps.
tfmorris/genscrape
JavaScript library that aids in scraping person data off of genealogy websites
tfmorris/GLnexus
Scalable gVCF merging and joint variant calling for population sequencing projects
tfmorris/googleAuthR
Google API Client Library for R. Easy authentication and help to build Google API R libraries with OAuth2. Shiny compatible.
tfmorris/graphd
The Metaweb graph repository server
tfmorris/hathimetadata
Metadata for English-language fiction and poetry beyond 1923 in HathiTrust Digital Library.
tfmorris/infogami
tfmorris/Level
An Android bubble level (native) application available on Google Play.
tfmorris/lifelines
Official lifelines repository
tfmorris/open-cravat
A modular annotation tool for genomic variants
tfmorris/openlibrary-client
Python Client Library for the Archive.org OpenLibrary API
tfmorris/OpenRefine-testing-extension
Just testing setup for OpenRefine extension
tfmorris/pophistory-tutorial
Tutorial on using popular tools for learning about population history
tfmorris/Refine-NER-Extension
Named-Entity Recognition extension for Google Refine / OpenRefine