Identifier cross checks
Closed this issue · 0 comments
kwuichet commented
It would be helpful to do cross checks for identifiers:
Mandatory tables and their checks
- All sample_ids in the sequencing table are also in the sample table (and vice versa)
- All subject_ids in the sample table are also in the subject table (and vice versa)
- All sequence files listed in the sequencing table are present in the bucket (and vice versa)
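
A minimal sketch of the two-way checks above, assuming each table has already been loaded as a list of dicts keyed by column name. The function name `cross_check` and the example rows are hypothetical and not taken from any existing tool.

```python
def cross_check(left_ids, right_ids):
    """Compare two collections of identifiers in both directions.

    Returns a tuple (missing_from_right, missing_from_left) of sorted lists.
    """
    left, right = set(left_ids), set(right_ids)
    return sorted(left - right), sorted(right - left)


# Hypothetical example rows; real tables would be read from the submitted files.
sequencing_table = [{"sample_id": "S1"}, {"sample_id": "S2"}]
sample_table = [{"sample_id": "S2"}, {"sample_id": "S3"}]

not_in_sample, not_in_sequencing = cross_check(
    (row["sample_id"] for row in sequencing_table),
    (row["sample_id"] for row in sample_table),
)
print(not_in_sample)      # ['S1']
print(not_in_sequencing)  # ['S3']
```

The same function could compare the file paths listed in the sequencing table against a listing of the bucket contents, however that listing is obtained.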
Optional tables and their checks
- All family_ids in the family table are also in the subject table
- All subject_ids in the discovery table are also in the subject table
The checks above focus on CMG, but the set of optional tables can be expanded. Every optional table should still have a key identifier that is cross-checked against a required table.
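
One way to keep the optional checks expandable is to drive them from a small mapping, sketched below. The mapping `OPTIONAL_CHECKS`, the function `check_optional_tables`, and the example rows are all hypothetical. The sketch also assumes the subject table carries a `family_id` column, which the family check above implies but does not state.

```python
# Hypothetical mapping: optional table -> (key column, required table to check against).
OPTIONAL_CHECKS = {
    "family": ("family_id", "subject"),
    "discovery": ("subject_id", "subject"),
}


def check_optional_tables(tables):
    """Return {optional table name: sorted IDs absent from its required table}."""
    problems = {}
    for name, (key, required) in OPTIONAL_CHECKS.items():
        if name not in tables:
            continue  # optional table was not supplied, so there is nothing to check
        known = {row[key] for row in tables[required]}
        missing = sorted({row[key] for row in tables[name]} - known)
        if missing:
            problems[name] = missing
    return problems


# Hypothetical example rows; the discovery table is omitted to show it is skipped.
tables = {
    "subject": [
        {"subject_id": "P1", "family_id": "F1"},
        {"subject_id": "P2", "family_id": "F1"},
    ],
    "family": [{"family_id": "F1"}, {"family_id": "F2"}],
}
problems = check_optional_tables(tables)
print(problems)  # {'family': ['F2']}
```

Adding a new optional table would then only require a new entry in the mapping rather than a new check function.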