ManagingBigData

For this project we used the AWS ecosystem to manage the Amazon reviews dataset (~50 GB) and perform sentiment analysis on the reviews.

Primary language: Pig Latin

Tools and environments used:

Amazon EMR, Amazon Athena, Amazon S3, Apache Pig, Apache Spark.

File formats supported:

Parquet, Avro, CSV, JSON.
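
To illustrate what handling more than one of these formats can look like, here is a minimal Python sketch that parses review records from CSV and from line-delimited JSON into the same record shape. It is not the project's code: the function `read_reviews`, the field names (`review_id`, `star_rating`, `review_body`), and the sample records are all hypothetical, and the one-object-per-line JSON layout is an assumption. Parquet and Avro are binary formats that need third-party readers, so this sketch only rejects them.

```python
import csv
import io
import json


def read_reviews(text, fmt):
    """Parse review records from CSV or line-delimited JSON text.

    Returns a list of dicts. The field names used here are hypothetical.
    """
    if fmt == "csv":
        # The first CSV line is treated as the header row.
        rows = list(csv.DictReader(io.StringIO(text)))
    elif fmt == "json":
        # Assumes one JSON object per non-empty line.
        rows = [json.loads(line) for line in text.splitlines() if line.strip()]
    else:
        # Parquet and Avro are binary and not handled by this sketch.
        raise ValueError(f"unsupported format in this sketch: {fmt}")
    # Convert the rating to int so both formats yield the same record shape.
    for row in rows:
        row["star_rating"] = int(row["star_rating"])
    return rows


# Hypothetical sample data: the same review in both formats.
csv_text = "review_id,star_rating,review_body\nR1,5,Great product\n"
json_text = '{"review_id": "R1", "star_rating": 5, "review_body": "Great product"}\n'

csv_rows = read_reviews(csv_text, "csv")
json_rows = read_reviews(json_text, "json")
```

Normalizing every format into one record shape early means later steps, such as sentiment scoring, do not need to know which file format a review came from.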