Primary language: JavaScript. License: MIT.

NYC Space/Time Directory ETL module: OldNYC

ETL module for NYPL’s NYC Space/Time Directory. This Node.js module downloads, parses, and/or transforms OldNYC data, and creates an NYC Space/Time Directory dataset.

Data

The dataset created by this ETL module’s transform step can be found in the data section of the NYC Space/Time Directory website.

Details

ID: oldnyc
Title: OldNYC
Description: OldNYC provides an alternative way of browsing the NYPL's incredible Photographic Views of New York City, 1870s-1970s collection. Its goal is to help you discover the history behind the places you see every day.
License: CC BY-NC
Contributors:
  • Bert Spaan (wrangler)
  • Dan Vanderkam (author)
Sources: OldNYC's data file
Homepage: https://www.oldnyc.org/

JSON Schema of Object data:

{
  "$schema": "http://json-schema.org/draft-04/schema#",
  "type": "object",
  "additionalProperties": false,
  "required": [
    "uuid",
    "imageId",
    "url",
    "imageUrl",
    "nyplUrl"
  ],
  "properties": {
    "uuid": {
      "type": "string",
      "description": "NYPL UUID of item"
    },
    "imageId": {
      "type": "string",
      "description": "NYPL Image ID"
    },
    "text": {
      "type": "string",
      "description": "Text from back of photo, extracted using OCR"
    },
    "folder": {
      "type": "string",
      "description": "Normalized title, used on OldNYC website"
    },
    "url": {
      "type": "string",
      "description": "URL of webpage of photo in OldNYC"
    },
    "imageUrl": {
      "type": "string",
      "description": "URL of image of photo in OldNYC"
    },
    "nyplUrl": {
      "type": "string",
      "description": "URL to item in NYPL Digital Collections"
    }
  }
}
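
To make the schema's constraints concrete, here is a minimal sketch of a hand-written check in plain Node.js. It is not part of this module and does not use a JSON Schema library; the function name `checkObjectData` and the sample values are hypothetical placeholders, and only the property names come from the schema above.

```javascript
// Property names copied from the schema's "required" and "properties" keys.
const REQUIRED = ['uuid', 'imageId', 'url', 'imageUrl', 'nyplUrl']
const ALLOWED = [...REQUIRED, 'text', 'folder']

// Hypothetical helper: returns a list of problems; an empty list means
// the data satisfies the constraints checked here.
function checkObjectData (data) {
  if (data === null || typeof data !== 'object' || Array.isArray(data)) {
    return ['data must be an object']
  }
  const errors = []
  // "required": every listed property must be present
  for (const key of REQUIRED) {
    if (!Object.prototype.hasOwnProperty.call(data, key)) {
      errors.push(`missing required property: ${key}`)
    }
  }
  // "additionalProperties": false, and every property has "type": "string"
  for (const [key, value] of Object.entries(data)) {
    if (!ALLOWED.includes(key)) {
      errors.push(`unexpected property: ${key}`)
    } else if (typeof value !== 'string') {
      errors.push(`property must be a string: ${key}`)
    }
  }
  return errors
}

// Placeholder values only; these are not real OldNYC records.
const sample = {
  uuid: 'example-uuid',
  imageId: 'example-image-id',
  url: 'https://example.org/photo',
  imageUrl: 'https://example.org/photo.jpg',
  nyplUrl: 'https://example.org/item'
}

console.log(checkObjectData(sample))
console.log(checkObjectData({ ...sample, year: 1920 }))
```

The second call reports a problem because `year` is not listed in the schema and `additionalProperties` is `false`. For anything beyond a quick sanity check, a full JSON Schema validator is the safer choice.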

Available steps

  • download
  • transform

Usage

git clone https://github.com/nypl-spacetime/etl-oldnyc.git /path/to/etl-modules
cd /path/to/etl-modules/etl-oldnyc
npm install

spacetime-etl oldnyc[.<step>]
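
The `oldnyc[.<step>]` notation means the step suffix is optional. The sketch below illustrates how such an argument could be read; it is not spacetime-etl's actual parsing code. The function name `parseSpec` is hypothetical, and the behavior for a missing step (run every available step, in the order listed above) is an assumption.

```javascript
// Steps listed under "Available steps" in this README.
const STEPS = ['download', 'transform']

// Hypothetical helper: splits "module[.step]" into its parts.
// Assumption: when no step is given, all steps are selected in order.
function parseSpec (spec) {
  const [moduleId, step, ...rest] = spec.split('.')
  if (!moduleId || rest.length > 0) {
    throw new Error(`invalid spec: ${spec}`)
  }
  if (step === undefined) {
    return { moduleId, steps: [...STEPS] }
  }
  if (!STEPS.includes(step)) {
    throw new Error(`unknown step: ${step}`)
  }
  return { moduleId, steps: [step] }
}

console.log(parseSpec('oldnyc'))
console.log(parseSpec('oldnyc.transform'))
```

Under these assumptions, `spacetime-etl oldnyc.download` would select only the download step, while `spacetime-etl oldnyc` would select both.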

See http://github.com/nypl-spacetime/spacetime-etl for information about Space/Time's ETL tool. More Space/Time ETL modules can be found on GitHub.

This README file is generated by generate-etl-readme.