Improve annotation of cross-references
cthoyt opened this issue · 5 comments
I found that CLO makes cross-references using the rdfs:seeAlso
predicate instead of something a bit more fit-for-purpose like oboInOwl:hasDbXref
or skos:exactMatch
.
I also found that these rdfs:seeAlso
annotations point to strings that potentially contain multiple CURIEs, with varying degrees of heterogeneity in how they're written.
Can you give a bit of insight into why the cross-references were encoded this way?
I wrote a script that attempts to parse and standardize them using the Bioregistry. I posted the output in an SSSOM file in this gist. Would you be interested for me to contribute these back in a more standardized way to CLO?
@cthoyt That's great! We would like to get your contribution.
Sorry for the delayed reply since I just came back from a two-week travel. We have recently had Dr. Jie Zheng join our group. I would suggest having her involved as well. Do you have a time for a meeting on this?
Thanks.
Hi @yongqunh and @zhengj2007, I am also just back from vacation and at a project meeting this week. Next week I am relatively free and on east coast time. It would be great to plan a video conference. For me, around 11AM is best, but I'm flexible. My email is cthoyt@gmail.com if you would rather coordinate on that channel, too (or on slack)
I am available around 11:00 EST Monday, Tuesday, and Thursday next week. It would be great to discuss it with you. @cthoyt
See related discussion on what annotation property should be used to indicate the mapped/equivalent terms