Create a dataset validator
ManuelAlvarezC opened this issue · 0 comments
Description
We need a way to ensure that the datasets we are gathering follow the Data Model as closely as possible.
Implementation proposal
It should be a function in task_geo.tests with the following prototype:
def validate_dataset(dataset):
    """Validates a dataset against the Data Model.

    Arguments:
        dataset(pandas.DataFrame)

    Returns:
        None
    """
It should not return any value. It should simply assert a series of validations against the dataset and raise a descriptive message if any of them fails, so the developer knows what to fix.
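As a rough illustration of the behaviour described above, here is a minimal sketch. The `EXPECTED_COLUMNS` mapping and the column names `country` and `date` are placeholders invented for this example; they are not taken from the actual Data Model, and the real checks would need to be derived from it.

```python
import pandas as pd

# Hypothetical subset of the Data Model: column name -> dtype check.
# These entries are placeholders, not the real Data Model.
EXPECTED_COLUMNS = {
    "country": pd.api.types.is_string_dtype,
    "date": pd.api.types.is_datetime64_any_dtype,
}


def validate_dataset(dataset):
    """Validates a dataset against the (assumed) Data Model.

    Arguments:
        dataset(pandas.DataFrame)

    Returns:
        None
    """
    assert isinstance(dataset, pd.DataFrame), "dataset must be a pandas.DataFrame"

    # Collect every problem first, so the developer sees all of them at once.
    errors = []
    for column, dtype_check in EXPECTED_COLUMNS.items():
        if column not in dataset.columns:
            errors.append(f"missing column '{column}'")
        elif not dtype_check(dataset[column]):
            errors.append(
                f"column '{column}' has unexpected dtype {dataset[column].dtype}"
            )

    assert not errors, "Dataset does not follow the Data Model:\n" + "\n".join(errors)
```

Collecting the failures before asserting is one possible design choice: a single run then reports every mismatch instead of stopping at the first one. Calling `validate_dataset(df)` on a conforming DataFrame returns nothing, while a non-conforming one raises an `AssertionError` listing what to fix.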