Group 10 Repository for the ESDC Integrity Hackathon with Simon Fraser University: Canadian Companies Deep Dive.
Authors : Debananda Sarkar, Rafael Pilliard Hellwig, Selma Duric, Tanmay Sharma, Zeliha Ural Merpez
- Append meaningful information to a business that could be used for analytics.
- Generate business profiles for integrity and service improvements.
- Generate insights of value for integrity and service improvements.
- Generate visualisation tools to explore business characteristics/insights.
- Scope: a minimum dataset of City of Vancouver business licences is provided, up to a maximum of all Canadian businesses.
- Executing scripts and downloading all required file sources to run app:
- To execute scripts, in command line type
make all
from command line to execute scripts. - To clear all generated files found in
data-processed
anddata
directories, typemake clean
from command line.
-
Creating csv and comparison of Business Licences data and GoogleMaps data:
Examples to run from GitBash:
-
python src/get_google_data.py -i "data-raw/license_data.csv" -o "data-processed"
-
python src/get_google_data.py -i "data-raw/license_data.csv" -o "data-processed" --name "First Memorial Services Ltd"
-
python src/get_google_data.py -i "data-raw/license_data.csv" -o "data-processed" --type "Casino"
Note: In order to access GoogleMaps data, this code requires secret API_Key.
Data used within the app includes:
-
City of Vancouver Business Licence Data (1997-2012, 2013-2021) under the Open Government Licence – Vancouver Licence.
-
Statistics Canada Inter-corporate Ownership: fourth quarter 2019 under the Statistics Canada Open Licence.
-
GoogleMaps API.