tanseyem's Stars
nytimes/covid-19-data
A repository of data on coronavirus cases and deaths in the U.S.
pdfminer/pdfminer.six
Community maintained fork of pdfminer - we fathom PDF
euske/pdfminer
Python PDF Parser (Not actively maintained). Check out pdfminer.six.
iipc/awesome-web-archiving
An Awesome List for getting started with web archiving
DocNow/twarc
A command line tool (and Python library) for archiving Twitter JSON
webrecorder/pywb
Core Python Web Archiving Toolkit for replay and recording of web archives
timClicks/slate
The simplest way to extract text from PDFs in Python
LibraryOfCongress/bagit-python
Work with BagIt packages from Python.
edgi-govdata-archiving/overview
🎈 Start here for current projects, how to get involved, and joining community calls, a resource for new and veteran members
ePADD/epadd
ePADD is a software package developed by Stanford University's Special Collections & University Archives that supports archival processes around the appraisal, ingest, processing, discovery, and delivery of email archives.
SAA-SDT/EAD3
justinlittman/fbarc
A commandline tool and Python library for archiving data from Facebook using the Graph API.
uc-borndigital-ckg/uc-guidelines
To improve the clarity and usefulness of finding aids and to promote consistency across campuses, a working group of digital archivists under the aegis of the UC Born-Digital Content Common Knowledge Group (CKG) have collaborated to develop a UC-wide descriptive standard for born-digital archival material.
dhamaniasad/WARCTools
A list of tools related to W(eb)ARC(hive)
Georgetown-University-Libraries/File-Analyzer
A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects
DataAccessioner/DataAccessioner
uvalib/transmog
A web application to transmogrify word documents into well-structured XML.
waharnum/inlibraries.com
Flask web application that runs the inlibraries.com website
smith-special-collections/sc-documentation
Smith Special Collections staff user documentation on operations and services.
acocciolo/archives_finder
The objective of this script is to allow archivists to find groups of records that may be inactive because of their age.
hist3907b-winter2015/syllabus
syllabus (start here!)
tw4l/brunnhilde-gui
Graphical user interface for Brunnhilde
datarefuge/bagit-how-to
Tools and workflows for the bagging team
uclibs/derivator
Generate access derivatives from TIF files using PowerShell and ImageMagick
helrond/tansey-test
Enter a URL for a web page to find out if it is actually about archives or not.