/Amazon_Vine_Analysis

Using PySpark to perform the ETL process to extract the dataset, transform the data, connect to an AWS RDS instance, and load the transformed data into pgAdmin. Then using PySpark, Pandas, & SQL to determine if there is any bias toward favorable reviews from Vine members in the dataset.

Primary LanguageJupyter Notebook

Watchers