anvilproject/client-apis

Identifier cross checks

Closed this issue · 0 comments

It would be helpful to do cross checks for identifiers:

Mandatory tables and their checks

  • All sample_ids in sequencing table are also in sample table (and vice versa)
  • All subject_ids in sample table are also in subject table (and vice versa)
  • All sequences files listed in sequencing table are present in bucket (and vice versa)

Optional tables and their checks

  • All family_ids in family table are also in subject table
  • All subject_ids in discovery table are also in subject_table

The above is focused on CMG, but optional tables can be expanded. Any optional table should still have a key identifier that is cross checked with a required table.