Deving789/Amazon_Vine_Analysis
Uses PySpark to run the ETL process: extract the dataset, transform the data, connect to an AWS RDS instance, and load the transformed data into a PostgreSQL database managed through pgAdmin. PySpark, Pandas, and SQL are then used to determine whether reviews from Vine members in the dataset are biased toward favorable ratings.
Jupyter Notebook
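
The bias check described above can be sketched as a SQL aggregation that compares the share of 5-star reviews between Vine and non-Vine reviewers. This is a minimal illustration, not the repository's actual code: it uses Python's built-in `sqlite3` as a stand-in for the PostgreSQL database on RDS, and the table name `vine_table`, the columns `review_id`, `star_rating`, and `vine`, and the sample rows are all assumptions made up for the example.

```python
import sqlite3

# Hypothetical sample rows (review_id, star_rating, vine flag); not real data.
rows = [
    ("R1", 5, "Y"),
    ("R2", 4, "Y"),
    ("R3", 5, "N"),
    ("R4", 5, "N"),
    ("R5", 2, "N"),
    ("R6", 5, "N"),
]


def five_star_share(rows):
    """Return per-group review counts and the percentage of 5-star reviews."""
    # In-memory SQLite stands in for the PostgreSQL instance (assumption).
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE vine_table (review_id TEXT, star_rating INTEGER, vine TEXT)"
    )
    conn.executemany("INSERT INTO vine_table VALUES (?, ?, ?)", rows)

    # Count all reviews and 5-star reviews for each value of the vine flag.
    query = """
        SELECT vine,
               COUNT(*) AS total_reviews,
               SUM(CASE WHEN star_rating = 5 THEN 1 ELSE 0 END) AS five_star_reviews
        FROM vine_table
        GROUP BY vine
    """
    result = {}
    for vine, total, five in conn.execute(query):
        result[vine] = {
            "total": total,
            "five_star": five,
            "pct_five_star": 100.0 * five / total,
        }
    conn.close()
    return result


shares = five_star_share(rows)
for flag, stats in sorted(shares.items()):
    print(flag, stats)
```

A noticeably higher 5-star percentage for the `Y` group than for the `N` group would be one sign of positivity bias among Vine reviews. A real analysis would likely also filter out reviews with few votes before comparing the groups.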