Missing value codes are documented
jeanetteclark opened this issue · 0 comments
Status: ⌛ Not Started
Description
Check that missing value codes, and values in the data that appear to be missing value codes, are documented correctly in the metadata.
Priority
- Data Quality: Required
Issues
- need a list of common missing value codes
  - NA, NULL, NaN, NAN, -999, characters in a numeric column, others?
- how might we come up with other ways to detect oddball missing value codes (like -1 in a column of otherwise positive numbers)?
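
One possible starting point for the detection question above, as a rough sketch rather than a settled design. It assumes column values arrive as strings, uses only the codes named in this issue as the starter list, and adds two heuristics: leftover text in a mostly numeric column, and a single repeated negative value among otherwise non-negative numbers. The function name `candidate_missing_codes` and the `numeric_share` threshold are hypothetical.

```python
import math

# Starter list taken from the codes named in this issue (not exhaustive).
COMMON_CODES = {"NA", "NULL", "NaN", "NAN", "-999"}


def _is_number(text):
    """True if text parses as a finite number (nan/inf count as non-numeric)."""
    try:
        return math.isfinite(float(text))
    except ValueError:
        return False


def candidate_missing_codes(values, numeric_share=0.8):
    """Return a sorted list of values in one column that look like missing value codes."""
    cells = [str(v).strip() for v in values]
    found = {c for c in cells if c in COMMON_CODES}

    rest = [c for c in cells if c not in COMMON_CODES]
    numeric = [c for c in rest if _is_number(c)]

    # Only apply the numeric heuristics when the column is mostly numbers.
    if rest and len(numeric) / len(rest) >= numeric_share:
        # Heuristic 1: characters in a numeric column.
        found.update(c for c in rest if not _is_number(c))

        # Heuristic 2: one distinct negative value among non-negative numbers
        # (for example -1 in a column of otherwise positive numbers).
        negative_values = {float(c) for c in numeric if float(c) < 0}
        has_non_negative = any(float(c) >= 0 for c in numeric)
        if len(negative_values) == 1 and has_non_negative:
            found.update(c for c in numeric if float(c) < 0)

    return sorted(found)
```

For example, `candidate_missing_codes(["1.5", "2", "NA", "-1", "3", "x", "4", "7"])` returns `["-1", "NA", "x"]`. The negative-value heuristic would give false positives for columns such as temperature, so it probably only makes sense as a warning, not a failure.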
Procedure
- retrieve the EML entity for the file
- compare values in each column to the described missing value codes for that column
- look for potential missing value codes that are not described
- pass if every missing value code found in the data is described; fail if an obvious one is not
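
The procedure above could be sketched roughly as follows. This is an illustration under stated assumptions, not the check's actual implementation: it assumes the EML entity is available as an XML string with unqualified `attribute`, `attributeName`, and `missingValueCode/code` elements, that the data file is CSV with a header row matching the attribute names, and that "obvious" codes means the short list from this issue. The function names `documented_codes` and `check_missing_value_codes` are hypothetical.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Assumed definition of "obvious" codes: the ones listed in this issue.
COMMON_CODES = {"NA", "NULL", "NaN", "NAN", "-999"}


def documented_codes(entity_xml):
    """Map each attributeName in the EML entity to its documented missing value codes."""
    root = ET.fromstring(entity_xml)
    codes = {}
    for attribute in root.iter("attribute"):
        name = attribute.findtext("attributeName", default="").strip()
        codes[name] = {
            (code.text or "").strip()
            for code in attribute.findall("missingValueCode/code")
        }
    return codes


def check_missing_value_codes(entity_xml, csv_text):
    """Return ("pass" | "fail", {column: undocumented codes found in that column})."""
    described = documented_codes(entity_xml)
    undocumented = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        for column, cell in row.items():
            cell = (cell or "").strip()
            # Flag an obvious code that the metadata does not describe for this column.
            if cell in COMMON_CODES and cell not in described.get(column, set()):
                undocumented.setdefault(column, set()).add(cell)
    status = "fail" if undocumented else "pass"
    return status, undocumented
```

Returning the undocumented codes per column alongside the status would let the check report which columns need a `missingValueCode` entry, not only that the check failed.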