/indexer

A parser which indexes unstructured collections of data representing William Branham's complete sermon library and structures them for loading into a data ingester

Primary LanguageJavaScriptMIT LicenseMIT

Indexer

Build Status Codacy Badge

A parser which indexes unstructured collections of data representing William Branham's complete sermon library and structures them for loading into a data ingester.

This project is part of a three-part system which collectively stores, indexes, and then outputs a collection of sermons as JSON files:

  1. Original Sources: Sermon metadata from various online data sources
  2. Indexer: This project, takes the information from the above source and processes it as output into the next source
  3. Golden Dataset: JSON files which are generated by the indexer and manually uploaded to the repository

Run this Project

To run this project and generate the intended output, you will need the latest version of NodeJS. Once that is installed, run these commands:

git clone https://github.com/branham-player/indexer.git
git clone https://github.com/branham-player/original-sources.git

cd indexer
npm install
npm run all

The result of a successful execution is the presence of three new files in the root folder:

  • full.json: A complete dataset which contains all of the information the program could gather from the original sources
  • condensed.json: A shortened version of the full.json file which contains the most essential pieces of information for the everyday user
  • months.json: Counts the number of years and months which are present in full.json