The hnyc_standardize repo consists of processes to standardize addresses and occupations. Each process has its own folder, which contains the code, an input folder, a working files folder and an output folder.
The goal is to standardize addresses, occupations and names.
Names for example ..
to ..
Occupation for example ..
to ..
Address for example ..
to ..
(A high-level approach at the root readme file)
- /Address contains all work related to address standardization.
  - readme.md has information on the method/approach taken to standardize, the reference data and their sources, and how effective the approach is.
- /Occupation contains all work related to occupation standardization.
  - readme.md has information on the method/approach taken to standardize, the reference data and their sources, and how effective the approach is.