watermaze-pdf-parser

A project for the Department of Experimental Neurology at Charité that automatically converts PDF data into a normalized CSV format.

Primary language: JavaScript

Parse PDF -> JSON -> CSV


How to use

Before the first run, make sure you have the following tools installed:

Dependencies

To run this code you need the following dependencies:

Afterwards, you should be able to install the project-specific Node packages with yarn or npm i.

Running the parser

For each experiment, update the config file with the name of the experiment's input directory:

{
    EXPERIMENT_ID: YOUR_DIR_NAME
}

Then run one of these commands in your terminal from the project root directory: yarn run-parser or npm run run-parser (npm needs the explicit run for custom scripts).
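As a rough illustration of how a value like EXPERIMENT_ID from the config snippet above could be turned into an input path, here is a minimal sketch. The function name resolveInputDir, the "input" base folder, and the example ID are all assumptions made for this example and are not taken from the project's code.

```javascript
const path = require("path");

// Hypothetical config object mirroring the snippet above; the ID is made up.
const config = { EXPERIMENT_ID: "experiment-01" };

// Assumption: experiment directories live under an "input" folder in the
// project root. The real project may use a different layout.
function resolveInputDir(config, baseDir = "input") {
  const id = config.EXPERIMENT_ID;
  // Fail early with a readable message if the config was not updated.
  if (typeof id !== "string" || id.trim() === "") {
    throw new Error("EXPERIMENT_ID must be set to the experiment input directory name");
  }
  return path.join(baseDir, id);
}

console.log(resolveInputDir(config));
```

Validating the ID before building the path means a forgotten config update shows up as a clear error rather than a missing-directory failure later in the run.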

Background

This tool helps employees of the Department of Experimental Neurology convert data from an auto-generated PDF into CSV, a task that was previously done by hand. Because the PDF is auto-generated, the code is very static and depends heavily on the structure of that PDF staying consistent.
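The JSON -> CSV step of the pipeline can be sketched roughly as follows. This is not the project's actual implementation: the helper names (escapeCsvField, recordsToCsv) and the record fields (animal, trial, latencySeconds) are hypothetical and stand in for whatever the PDF parsing step really produces.

```javascript
// Hypothetical helper: quote a single CSV field when needed.
function escapeCsvField(value) {
  const text = value === null || value === undefined ? "" : String(value);
  // Quote fields containing a comma, a quote, or a line break,
  // and double any embedded quotes.
  return /[",\r\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Hypothetical helper: turn an array of flat objects into a CSV string.
function recordsToCsv(records) {
  if (records.length === 0) return "";
  // Use the keys of the first record as the header row; this assumes every
  // record has the same shape, which matches a fixed PDF layout.
  const columns = Object.keys(records[0]);
  const lines = [columns.map(escapeCsvField).join(",")];
  for (const record of records) {
    lines.push(columns.map((column) => escapeCsvField(record[column])).join(","));
  }
  return lines.join("\n");
}

// Made-up example records standing in for the parsed JSON.
const exampleRecords = [
  { animal: "A1", trial: 1, latencySeconds: 42.5 },
  { animal: "A2", trial: 1, latencySeconds: 17 },
];

console.log(recordsToCsv(exampleRecords));
```

Taking the header from the first record relies on the same assumption the tool itself makes: every record has the same fields because the source PDF always has the same structure.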