PySpark implementation of a data analysis tool (API level)
To install the packages required to run the program, follow these steps:
- git clone the repo
- cd gda
- pip install pipenv
- pipenv install
After that, each time you want to run the program, go to the repo folder and run:
- pipenv shell
- python example.py > test.txt
The output is written to test.txt in the same folder (the `>` redirect sends it to the file instead of printing it to the terminal).
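
If you would rather capture the output from Python than use a shell redirect, the run step can be sketched as below. This is a minimal sketch, assuming example.py prints its results to standard output; `run_and_capture` is a hypothetical helper and is not part of this repo.

```python
import subprocess
import sys
from pathlib import Path


def run_and_capture(script, out_path):
    """Run `script` with the current interpreter and write its stdout to `out_path`.

    Hypothetical helper (not part of the repo); roughly equivalent to
    `python <script> > <out_path>` in a shell.
    """
    # sys.executable is the interpreter running this code, so inside
    # `pipenv shell` it is the virtualenv's Python with the installed packages.
    with open(out_path, "w") as out:
        # check=True raises CalledProcessError if the script exits non-zero.
        subprocess.run([sys.executable, str(script)], stdout=out, check=True)
    # Return the captured text so the caller can inspect it directly.
    return Path(out_path).read_text()
```

Inside `pipenv shell`, calling `run_and_capture("example.py", "test.txt")` from the repo folder should produce the same test.txt as the redirect above, and it also returns the text for further inspection.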