nestauk/old_nesta_daps

[DAPS] GRID alias lookup was unaware of aliases in the main table

jaklinger opened this issue · 0 comments

The current implementation of the GRID alias lookup does not account for the small number of "alias" institutes that GRID lists as separate entities in its main table.

For example, University of Korea has two entries in the main GRID table, one based in Seoul and the other in Tokyo. The proposed implementation (upcoming PR) treats such entries as aliases of one another, so it will be incorrect whenever two distinct institutes have coincidentally similar names. I'm hesitant to implement a fix for that case right now, as it would require rewiring the arXlive (and other?) pipeline(s) and so is a bit less light-touch.
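To make the failure mode concrete, here is a minimal sketch of a lookup keyed on name alone. It is not the actual DAPS code: the table rows, the placeholder ids, and the helper names `build_name_lookup` and `is_transnational` are all hypothetical, and the rule "more than one country means transnational" is an assumption about how the flag arises.

```python
from collections import defaultdict

# Hypothetical rows from the main GRID table: (id, name, country).
# The ids are placeholders, not real GRID identifiers.
INSTITUTES = [
    ("grid.example.1", "University of Korea", "South Korea"),
    ("grid.example.2", "University of Korea", "Japan"),
]


def build_name_lookup(institutes):
    """Map each lower-cased name to the set of institute ids sharing it."""
    lookup = defaultdict(set)
    for inst_id, name, _country in institutes:
        lookup[name.lower()].add(inst_id)
    return dict(lookup)


def is_transnational(name, lookup, institutes):
    """Assumed rule: a name is transnational if its ids span several countries."""
    country_by_id = {inst_id: country for inst_id, _name, country in institutes}
    ids = lookup.get(name.lower(), set())
    return len({country_by_id[i] for i in ids}) > 1


name_lookup = build_name_lookup(INSTITUTES)
# Both entries collapse onto one name, so the name spans two countries.
flagged = is_transnational("University of Korea", name_lookup, INSTITUTES)
```

Under these assumptions the name-only lookup cannot tell a genuine alias from a coincidental name clash, which is the case left unfixed here.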

The downstream effect for arXlive is that (e.g.) University of Korea will now be marked as a transnational organisation, whereas it wasn't previously. Handling these main-table aliases is absolutely required in the upcoming CB-GRID matching PR, which matches organisations based on country and name (University of Korea is a concrete example).
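As a rough illustration of matching on country and name together, the sketch below keys the lookup on a `(name, country)` pair. It is only an assumption about what such matching could look like, not the CB-GRID PR itself; the rows, placeholder ids, and the helpers `build_country_name_lookup` and `match_org` are hypothetical.

```python
# Hypothetical rows from the main GRID table: (id, name, country).
# The ids are placeholders, not real GRID identifiers.
GRID_ROWS = [
    ("grid.example.1", "University of Korea", "South Korea"),
    ("grid.example.2", "University of Korea", "Japan"),
]


def build_country_name_lookup(rows):
    """Key each institute on (lower-cased name, lower-cased country)."""
    return {(name.lower(), country.lower()): inst_id
            for inst_id, name, country in rows}


def match_org(name, country, lookup):
    """Return the institute id for this name in this country, or None."""
    return lookup.get((name.lower(), country.lower()))


lookup = build_country_name_lookup(GRID_ROWS)
seoul_id = match_org("University of Korea", "South Korea", lookup)
tokyo_id = match_org("University of Korea", "Japan", lookup)
```

With the country in the key, the two same-named entries resolve to different ids, so an organisation record that carries a country can be matched to the right one.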