DavidUnderdown
Now a Data Engineer in @digital-preservation at @nationalarchives - formerly Senior Digital Archivist and prior to that senior DBA
@digital-preservation @nationalarchives Kew, London
DavidUnderdown's Stars
alan-turing-institute/rse-course
Materials for The Alan Turing Institute's Research Software Engineering course
nationalarchives/da-tre-sample-data
Sample data before and after transformation
digipres/awesome-digital-preservation
Carefully curated list of awesome digital preservation resources.
nationalarchives/DiAGRAM
Repository for the Digital Archiving Graphical Risk Assessment Model - DiAGRAM
cmagovuk/selene-core
A framework for efficient, consistent and maintainable webscraping.
machow/siuba
Python library for using dplyr like syntax with pandas and SQL
usnationalarchives/Electronic-Records-Accessioning-Support-Tools
This repository shares NARA-created open source software to support federal agencies in their preparation of metadata and permanent electronic records for transfer to NARA.
usnationalarchives/digital-preservation
NARA digital preservation file format risk analysis and preservation plans
UnquietCode/runbook.py
a tool for defining repeatable processes in code
Lotte-W/Digital-Preservation-Headaches
Digital Preservation Headaches
dbdipview/dbdipview
Viewer for archived databases
the-danish-national-archives/concept-model
Presenting the Danish National Archives' Concept Model for Development of Preservation Plans.
zoho/hawking
A Natural Language Date Time Parser that Extract date and time from text with context and parse to the required format
best-practice-and-impact/govcookiecutter
A cookiecutter template for data science projects within His Majesty's Government and wider public sector.
MIT-Informatics/PreservationSimulation
Code for preservation simulation/modeling project
Digital-Preservation-Finland/dpres-siptools
Pre-Ingest Tool for creating submission information packages
carj/asset-search
Query Preservica v6 for assets using the search API
digital-preservation/pronom-research-week-2019
A repository to capture submissions and share samples during PRONOM Research Week 2019
OCR-D/core
Collection of OCR-related python tools and wrappers from @OCR-D
aloctavodia/BAP
Bayesian Analysis with Python (Second Edition)
nationalarchives/tdr-dev-documentation
Documentation for developers for the TDR project
moj-analytical-services/dataengineeringutils
A python package containing functions that help manage our data management processes on AWS
ericmjl/bayesian-stats-modelling-tutorial
How to do Bayesian statistical modelling using numpy and PyMC3
acocciolo/fixityberry
Environmentally Sustainable Digital Preservation for Very Low Resourced Cultural Heritage Institutions. FixityBerry is software that runs on a Raspberry Pi computer that monitors file fixity of digital archival content held on USB hard drives.
tdda/tdda
Test-Driven Data Analysis Functions
keleshev/schema
Schema validation just got Pythonic
mahmoud/glom
☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️
nationalarchives/python
Course documentation
alphagov/data-standards-demo-content
This is demo content that we are producing to show some of the potential of introducing a common framework.
dstl/baleen
Entity Extraction Text Processor