Pinned Repositories
awesome-web-archiving
An Awesome List for getting started with web archiving
conventoarchiver
Repository for collecting scripts to help capture MyConvento newsroom press-releases from the MyConvento PR management suite. The README provides an analysis of the MyConvento URL architecture for users hoping to develop a solution for themselves.
httpreserve
Digital Preservation of HTTP in documentary heritage.
linkscanner
A helper package to tokenize textual content and retrieve hyperlinks
linkstat
CLI implementation of httpreserve that can test links and retrieve internet archive replacements
million-dollar-webpage
HTTPreserve Analysis of Million Dollar Web Page
phantomjsscreenshot
A wrapper for phantom.js commands for headless screenshots.
tikalinkextract
Tika based link (URL) extractor for httpreserve
wadl-2017
Resources for WADL 2017
workbench
Client app for httpreserve pkg that generates CSV, JSON, HTTP, and BoltDB
httpreserve suite's Repositories
httpreserve/httpreserve
Digital Preservation of HTTP in documentary heritage.
httpreserve/linkstat
CLI implementation of httpreserve that can test links and retrieve internet archive replacements
httpreserve/tikalinkextract
Tika based link (URL) extractor for httpreserve
httpreserve/linkscanner
A helper package to tokenize textual content and retrieve hyperlinks
httpreserve/awesome-web-archiving
An Awesome List for getting started with web archiving
httpreserve/workbench
Client app for httpreserve pkg that generates CSV, JSON, HTTP, and BoltDB
httpreserve/conventoarchiver
Repository for collecting scripts to help capture MyConvento newsroom press-releases from the MyConvento PR management suite. The README provides an analysis of the MyConvento URL architecture for users hoping to develop a solution for themselves.
httpreserve/mementoqa
QA Mementos using Screenshots
httpreserve/phantomjsscreenshot
A wrapper for phantom.js commands for headless screenshots.
httpreserve/wayback
A restrictied API in Golang for the (semi)-exposed functions of the internet archive.
httpreserve/million-dollar-webpage
HTTPreserve Analysis of Million Dollar Web Page
httpreserve/wadl-2017
Resources for WADL 2017
httpreserve/eaccession-research
A repository to store data associated with HTTPreserve research on Archive NZ's born digital material.
httpreserve/gnomescreenshot
Wrapper for gnome-web-photo for httpreserve demos
httpreserve/simplerequest
Minimal HTTP requests for Golang
httpreserve/urlgetter
Script to disambiguate domain names from where they actually point to.