NCEAS/metadig-checks

Missing value codes are documented

jeanetteclark opened this issue · 0 comments

Status: ⌛ Not Started

Description

Check that missing value codes, and values in the data that appear to be missing value codes, are documented in the metadata.

Priority

  • Data Quality: Required

Issues

  • need a list of common missing value codes
    • NA, NULL, NaN, NAN, -999, characters in a numeric column, others?
  • how might we detect oddball missing value codes that are not on a common list (like -1 in a column of otherwise positive numbers)?
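
One possible starting point for the detection question above, as a rough Python sketch rather than a proposed implementation. The function names (`to_number`, `candidate_missing_codes`) and both heuristics are assumptions made for illustration: non-numeric cells are flagged only when more than half of the remaining cells parse as numbers, and a negative value is flagged only when it is the single distinct negative in the column.

```python
import math

# Codes listed in this issue; a starting point, not a standard.
COMMON_CODES = {"NA", "NULL", "NaN", "NAN", "-999"}


def to_number(text):
    """Return text as a float, or None if it does not parse as a number."""
    try:
        number = float(text)
    except ValueError:
        return None
    return None if math.isnan(number) else number


def candidate_missing_codes(values):
    """Return the raw strings in one column that look like missing value codes."""
    # Empty cells are skipped here; whether they count as missing is left open.
    cells = [v.strip() for v in values if v.strip()]
    candidates = {c for c in cells if c in COMMON_CODES}
    rest = [c for c in cells if c not in COMMON_CODES]
    numbers = {c: to_number(c) for c in rest}
    numeric = [c for c in rest if numbers[c] is not None]

    # Heuristic (assumption): characters in a mostly numeric column.
    if len(numeric) > len(rest) / 2:
        candidates.update(c for c in rest if numbers[c] is None)

    # Heuristic (assumption): one distinct negative value among
    # otherwise non-negative numbers, like -1 in a column of counts.
    negatives = {numbers[c] for c in numeric if numbers[c] < 0}
    distinct = {numbers[c] for c in numeric}
    if len(negatives) == 1 and len(distinct) > 1:
        candidates.update(c for c in numeric if numbers[c] < 0)
    return candidates


column = ["3.2", "4.1", "-1", "NA", "n/a", "5.0", "-1", "7"]
print(sorted(candidate_missing_codes(column)))  # ['-1', 'NA', 'n/a']
```

The thresholds are deliberately crude. A column with genuine negative measurements (two or more distinct negative values) is left alone, which trades some missed codes for fewer false positives.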

Procedure

  • retrieve the EML entity that describes the data file
  • compare the values in each column to the missing value codes described for that column
  • look for potential missing value codes in the data that are not described
  • pass if every missing value code found is described, fail if an obvious one (such as a code from the common list above) is not
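
The procedure above could look roughly like the following Python sketch. It assumes the entity is an EML `dataTable` whose attributes carry `missingValueCode/code` elements, that the data file is a CSV with a header row matching the `attributeName` values, and that "obvious" means a code from the common list in this issue. The function names, the sample entity, and the `"pass"`/`"fail"` return values are hypothetical, not part of any existing check.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Codes listed in this issue; treated here as the "obvious" ones (assumption).
COMMON_CODES = {"NA", "NULL", "NaN", "NAN", "-999"}


def documented_codes(entity):
    """Map each attributeName to the set of codes described for it."""
    codes = {}
    for attribute in entity.iter("attribute"):
        name = attribute.findtext("attributeName", default="").strip()
        codes[name] = {
            (code.text or "").strip()
            for code in attribute.findall("missingValueCode/code")
        }
    return codes


def check_missing_value_codes(entity, csv_text):
    """Return ("pass" or "fail", {column: undocumented codes found})."""
    documented = documented_codes(entity)
    undocumented = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        for column, value in row.items():
            if column is None:  # extra cells beyond the header row
                continue
            value = (value or "").strip()
            if value in COMMON_CODES and value not in documented.get(column, set()):
                undocumented.setdefault(column, set()).add(value)
    return ("fail" if undocumented else "pass"), undocumented


# Hypothetical entity and data, for illustration only.
EML_ENTITY = """
<dataTable>
  <attributeList>
    <attribute>
      <attributeName>depth</attributeName>
      <missingValueCode>
        <code>-999</code>
        <codeExplanation>not measured</codeExplanation>
      </missingValueCode>
    </attribute>
    <attribute>
      <attributeName>temp</attributeName>
    </attribute>
  </attributeList>
</dataTable>
""".strip()
CSV_TEXT = "depth,temp\n1.5,10.2\n-999,NA\n"

status, undocumented = check_missing_value_codes(ET.fromstring(EML_ENTITY), CSV_TEXT)
print(status, undocumented)  # fail {'temp': {'NA'}}
```

In this example `-999` in `depth` is described, so it passes, while `NA` in `temp` has no matching `missingValueCode` and causes the failure.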