airr-community/common-repo-wg

Review CRWG Recommendations document

Closed this issue · 11 comments

It is probably time to review these as we approach the May meeting.

I have created a branch that we can use to make changes...

https://github.com/airr-community/common-repo-wg/blob/issue-27/recommendations.md

It appears to me that Recommendation 8 and 9 should be reviewed...

There also appears to be a gap in what I would consider as our main goal of this year - defining an API that makes it possible to perform consistent queries across AIRR compliant repositories.

My two cents on the current version:

  • Rec. 10: PII and PHI are - best to my knowledge - US legal terms. We should use more generic wording that also captures European GDPR.
  • Rec. 13: We should make sure that IEDB can capture all metadata required by MiAIRR. Otherwise this recommendation should be rephrased and recommend deposition at IEDB as a secondary submission (cross-referencing to and cross-referenced by the primary repository).
  • Rec. 14: Do we still have any contact with ImmPort? What is the level of implementation there?

It appears to me that Recommendation 8 and 9 should be reviewed...

@laserson would you be able to have a look at Recommendation 8 here: https://github.com/airr-community/common-repo-wg/blob/issue-27/recommendations.md

I have removed mention of Avro, Thrift, and Protocol Buffers and left in the generic text. Our recollection was that you had suggested these originally.

lgtm

@lgcowell, I did more reading regarding Rec. 10 and we should probably have someone with a legal background go over this:

  • Rec. 10: PII and PHI are - best to my knowledge - US legal terms. We should use more generic wording that also captures European GDPR.

Do you happen to know what the terms are?

  • PII: The term seems to be a bit fuzzy already in the US, depending on the exact translation (e.g. identifiable vs. identifying). The main issue is that this term does not exist in GDPR, which typically refers to the much broader term of "personal data" [Art. 4 (1) GDPR].
  • PHI: This could be described as ''genetic or health data" [Art. 4 (13) & (15) GDPR]]. However, Rec. 10 refers to PHI in regard to metadata only, while from the GDPR perspective the sequencing data itself would already qualify as "genetic data". Note that both genetic and health data are considered "special categories" that underlie enhanced protection [Art. 9 {1} GDPR], typically requiring an informed consent to work with this data [Art. 9 (2) (a) GDPR].

Wasn't this cleaned up and ratified at the last AIRR Community meeting, so we can close?

Yes, I believe so...

Closing...