/IGR204-DataViz-Dream-Bank-Project

Télécom Paris | MS Big Data | IGR 204 | Data Visualization Project with Plotly Dash for improving the accessibility of the +20,000 Dream Bank dataset

Primary LanguageJupyter NotebookMIT LicenseMIT

IGR204 Data Visualization Project - Decrypting Dreambank

Motivation

Our goal is to help identify groups of people having similar dreams, and allows them to dig deeper into dreams by selecting specific dreams or groups of dreams to get information. This would allow researchers to draw parallels and formulate hypotheses regarding the process of dreaming with several examples from different people. They may also be able to detect patterns in dreams and draw links between types of people and themes in dreams.

Instructions

Required dependencies:

In addition to the standard Python packages, the following dependencies are required:

pip install dash
pip install dash-bootstrap-components

To run our application:

Once the repository is cloned, run the following commands in the Console Prompt:

cd IGR204-DataViz-Dream-Bank-Project
python app.py

A localhost will appear where the application can be displayed.

Data Preprocessing

We used baseline NLP approaches relying on word count and TF-IDF to preprocess Dream Bank. Data preprocessing steps are detailed in the following Jupyter Notebook:

Dream Bank Data Preprocessing

Dimensionality Reduction was done with the UMAP algorithm:

Dream Bank UMAP Dimensionality Reduction