DIYHistory-transcribe

A crowdsourcing manuscript transcription platform built with Omeka

DIY Transcription Site

DIYHistory is built upon the open-source generosity of George Mason University's Center for History and New Media, so we feel it's only right to share our source code with organizations and individuals interested in implementing their own crowdsourcing initiatives. Anyone who can install Omeka and MediaWiki on a web server can get started with their own branded version of DIYHistory|transcribe with little to no programmer time. If you find our code useful and build upon it, we'd love to receive your pull requests at github.com/ui-libraries.

##Overview

DIYHistory|transcribe is a tool for engaging users in transcribing handwritten documents, making them more searchable and enhancing them for research. DIYHistory|transcribe is built on the Omeka content management system and uses the Scripto plugin to facilitate transcription. Scripto uses MediaWiki, which allows users to continually improve upon work that has already been done. In building this site, we made significant additions to the Scripto plugin, created a new Omeka theme, and customized other Omeka plugins to style and scale for a library production environment.

##Requirements

DIYHistory|transcribe requires the following:

  • Omeka 2
  • Daniel-KM/CsvImport
  • ui-libraries/plugin-Scripto + MediaWiki
  • Scribe

##Features

DIYHistory|transcribe introduces the following features to plugin-Scripto:

  • Track completion status of document pages (i.e., ‘Not Started’, ‘Needs Review’, ‘Completed’)
  • Track completion progress of documents based on page statuses.
  • Sort documents within their collection by completion progress, floating the least-completed documents to the top.
  • Initialize document page text entry box with pre-existing text, if available (helpful if using Scripto to correct OCR for typescript pages).
  • On every page action, automatically import transcriptions from MediaWiki as file metadata.
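
The status tracking and sorting described above can be sketched in a few lines. This is an illustration only, written in Python rather than the plugin's own code: the `Document` class, the function names, and the numeric weight given to each status are assumptions made for the example, not how plugin-Scripto actually scores progress. Only the three status labels come from the feature list.

```python
from dataclasses import dataclass
from typing import List

# Status labels come from the feature list above. The weights are an
# assumption for this sketch, not the plugin's actual scoring.
STATUS_WEIGHT = {"Not Started": 0.0, "Needs Review": 0.5, "Completed": 1.0}


@dataclass
class Document:
    """Hypothetical stand-in for an Omeka item with one status per page."""
    title: str
    page_statuses: List[str]


def completion(doc: Document) -> float:
    """Return the share of the document considered done, from 0.0 to 1.0."""
    if not doc.page_statuses:
        return 0.0
    total = sum(STATUS_WEIGHT[status] for status in doc.page_statuses)
    return total / len(doc.page_statuses)


def sort_least_completed_first(docs: List[Document]) -> List[Document]:
    """Order documents so the ones needing the most work come first."""
    return sorted(docs, key=completion)


docs = [
    Document("Diary", ["Completed", "Completed"]),
    Document("Letters", ["Not Started", "Needs Review"]),
    Document("Ledger", ["Not Started", "Not Started"]),
]
ordered = sort_least_completed_first(docs)
print([d.title for d in ordered])
```

Under these assumed weights, a document with no work done sorts ahead of a partly reviewed one, which in turn sorts ahead of a finished one, so volunteers see the neediest documents first.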

The Scribe theme focuses on guiding users to easy transcription tasks rather than on collection management features, offering a clean, thumbnail-oriented transcription view for any number of Omeka image collections.

By default, any member of the public is allowed to edit and save transcription data, but only users with an account can track their progress. Approved account holders can also be granted administrator (or deputy) status, allowing them to finalize documents as “complete”.
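
The default access rules in the paragraph above might be modelled roughly as follows. This is a hedged sketch in Python, not the Omeka or Scripto permission system: the `Role` names, the action strings, and the `can` helper are all hypothetical and exist only to make the three tiers concrete.

```python
from enum import Enum


class Role(Enum):
    """Hypothetical role names for the three tiers described above."""
    ANONYMOUS = "anonymous"  # any member of the public
    ACCOUNT = "account"      # signed-in user
    ADMIN = "admin"          # administrator or deputy


# Assumed action names; each tier adds to the one before it.
PERMISSIONS = {
    Role.ANONYMOUS: {"edit", "save"},
    Role.ACCOUNT: {"edit", "save", "track_progress"},
    Role.ADMIN: {"edit", "save", "track_progress", "mark_complete"},
}


def can(role: Role, action: str) -> bool:
    """Return True if the given role is allowed to perform the action."""
    return action in PERMISSIONS[role]


print(can(Role.ANONYMOUS, "edit"))
print(can(Role.ACCOUNT, "mark_complete"))
```

Keeping the rules in a single table makes it easy to see that editing stays open to everyone while finalizing a document is reserved for administrators and deputies.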

##Installation

Follow the documentation at each source code repository to install Omeka 2, Daniel-KM/CsvImport, ui-libraries/plugin-Scripto + MediaWiki, and Scribe.