Scrapping Agriculture data from eNam website using Selenium library, then clean that raw data using Pandas library in Python and upload it to MongoDB
- The Code is given in jupyter Notebook.ipynb and .py format.
- Directory path of
Gramoday_Datasets_market.xlsx, cmdtyMapData.xlsx, apmcMapData.xlsx file
need to be changed while running on other machine. Gramoday_Datasets_market.xlsx, cmdtyMapData.xlsx, apmcMapData.xlsx file
file is attached.
Python version 3.7.9 (default, Aug 31 2020, 17:10:11) [MSC v.1916 64 bit (AMD64)] Version info. sys.version_info(major=3, minor=7, micro=9, releaselevel='final', serial=0)
open Anaconda prompt, change file directory path i.e., cd path_to_folder
conda create -n eNamScrapping python=3.7
conda activate eNamScrapping
conda install spyder
pip install -r path_of_requirments.txt
pymongo==3.11.4
requests==2.25.1
pandas==1.2.0
- After above installation type
spyder
in Anaconda prompt. Now Spyder IDE will open in sometime. - open
eNamDataPullClean.py
in spyder