Clone the repo and run the following in a terminal:
virtualenv .env
source .env/bin/activate
pip install -r requirements.txt
You will need to install a version of the chromedriver that matches the installed version on your machine from https://chromedriver.chromium.org/downloads and place it in the same directory.
Then you can load the script interactively or modify it!
original dataset is testing_ds_20200529.csv
-
AddressTreeExtract.py has already generated the AddressTree.json
-
run parseAddresses.py to extract an address_obj and produce a new csv called new_extracted.csv
Improved to almost 80% addresses match...