Erroneous/duplicative HGNC names `ZHX1-C8ORF76` and `ZHX1-C8orf76`
jsstevenson opened this issue · 0 comments
Describe the bug
The hgnc
table contains entries for both ZHX1-C8ORF76
and ZHX1-C8orf76
. The view tx_exon_aln_v
contains alignments for each.
To Reproduce
SELECT * FROM tx_exon_aln_v WHERE hgnc = 'ZHX1-C8ORF76';
SELECT * FROM tx_exon_aln_v WHERE hgnc = 'ZHX1-C8orf76';
Expected behavior
The all-upper-case value is erroneous/an artifact of earlier data/something to that effect -- in HGNC symbols, I believe the open read frame "orf" is always supposed to be lower case, and they don't have an entry for the all upper case version: https://www.genenames.org/tools/search/#!/?query=ZHX1-C8ORF76
Additional context
I'm guessing this is just an old value getting stuck somewhere in a table that UTA pulls from, but wanted to put this up in case it's still an issue in the in-progress latest data release.