CoronaWhy/task-geo

Create a dataset validator

ManuelAlvarezC opened this issue · 0 comments

Description

We need a way to ensure that the datasets we are gathering follow as much as possible the Data Model

Implementation proposal

It should be a function on task_geo.tests with the following prototype:

def validate_dataset(dataset)
    """Validates a dataset against the Data Model.

        Arguments:
            dataset(pandas.DataFrame)
        Returns:
            None
    """

It should not return any values simply assertdifferent validations against the dataset and raise a message if something fails, so the developer can know what to fix.