- Data cleaning is performed, including gender and age categorization and data consistency checks.
- The `SaleDate` field is standardized, and a new field, `SaleDateConverted`, is added.
- Property addresses are populated, with a focus on resolving missing data.
- Property addresses are split into individual columns for better data organization.
- The 'Sold as Vacant' field is updated to change 'Y' to 'Yes' and 'N' to 'No' for consistency.
- Duplicates in the data are identified and then deleted, retaining only the row with the minimum `UniqueID`.
- Finally, unused columns such as `TaxDistrict`, `OwnerAddress`, and `PropertyAddress` are dropped from the dataset.
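
The SQL-oriented steps in the list above (date standardization through column removal) can be sketched roughly as follows. This is a minimal illustration, not the project's actual queries: it assumes SQLite driven from Python's standard `sqlite3` module, a hypothetical table named `housing`, a hypothetical `ParcelID` column used to match rows that share an address, addresses written as "street, city", and duplicates defined as rows that agree on `ParcelID`, `PropertyAddress`, and `SaleDate`. The sample rows are invented purely for demonstration.

```python
import sqlite3


def make_sample():
    """Build an in-memory table with invented rows (illustrative only)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE housing (UniqueID INTEGER, ParcelID TEXT, PropertyAddress TEXT, "
        "OwnerAddress TEXT, TaxDistrict TEXT, SaleDate TEXT, SoldAsVacant TEXT)"
    )
    conn.executemany(
        "INSERT INTO housing VALUES (?, ?, ?, ?, ?, ?, ?)",
        [
            (1, "P1", "101 MAIN ST, NASHVILLE", "x", "d", "2013-04-09 00:00:00", "N"),
            (2, "P1", None, "x", "d", "2014-06-10 00:00:00", "Y"),
            (3, "P1", "101 MAIN ST, NASHVILLE", "x", "d", "2013-04-09 00:00:00", "N"),
            (4, "P2", "5 OAK AVE, GOODLETTSVILLE", "y", "d", "2015-01-02 00:00:00", "Yes"),
        ],
    )
    return conn


def clean_housing(conn):
    cur = conn.cursor()
    # Standardize SaleDate into a new date-only column.
    cur.execute("ALTER TABLE housing ADD COLUMN SaleDateConverted TEXT")
    cur.execute("UPDATE housing SET SaleDateConverted = date(SaleDate)")
    # Fill missing addresses from another row with the same (assumed) ParcelID.
    cur.execute("""
        UPDATE housing
        SET PropertyAddress = (
            SELECT MIN(b.PropertyAddress) FROM housing AS b
            WHERE b.ParcelID = housing.ParcelID AND b.PropertyAddress IS NOT NULL
        )
        WHERE PropertyAddress IS NULL
    """)
    # Split "street, city" into two new columns.
    cur.execute("ALTER TABLE housing ADD COLUMN PropertyStreet TEXT")
    cur.execute("ALTER TABLE housing ADD COLUMN PropertyCity TEXT")
    cur.execute("""
        UPDATE housing
        SET PropertyStreet = trim(substr(PropertyAddress, 1, instr(PropertyAddress, ',') - 1)),
            PropertyCity   = trim(substr(PropertyAddress, instr(PropertyAddress, ',') + 1))
        WHERE instr(PropertyAddress, ',') > 0
    """)
    # Normalize 'Y'/'N' to 'Yes'/'No'; other values are left unchanged.
    cur.execute("""
        UPDATE housing SET SoldAsVacant = CASE SoldAsVacant
            WHEN 'Y' THEN 'Yes' WHEN 'N' THEN 'No' ELSE SoldAsVacant END
    """)
    # Delete duplicates, keeping the minimum UniqueID in each group.
    cur.execute("""
        DELETE FROM housing WHERE UniqueID NOT IN (
            SELECT MIN(UniqueID) FROM housing
            GROUP BY ParcelID, PropertyAddress, SaleDate
        )
    """)
    # Drop unused columns by rebuilding the table without them
    # (avoids relying on ALTER TABLE ... DROP COLUMN support).
    cur.execute("""
        CREATE TABLE housing_clean AS
        SELECT UniqueID, ParcelID, SaleDate, SaleDateConverted,
               PropertyStreet, PropertyCity, SoldAsVacant
        FROM housing
    """)
    cur.execute("DROP TABLE housing")
    cur.execute("ALTER TABLE housing_clean RENAME TO housing")
    conn.commit()


conn = make_sample()
clean_housing(conn)
rows = conn.execute(
    "SELECT UniqueID, SaleDateConverted, PropertyStreet, PropertyCity, SoldAsVacant "
    "FROM housing ORDER BY UniqueID"
).fetchall()
for row in rows:
    print(row)
```

On other database engines the same steps would typically use that engine's own date conversion and string functions, and the columns could be removed directly with `ALTER TABLE ... DROP COLUMN` where it is supported.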