UoMResearchIT/RSESkillsGraph

Remove / amend non-Wikipedia entries in the skills graph

Closed this issue · 2 comments

As per Mike's findings, there are items in the RSE Skills graph that do not have a Wikipedia main article heading.

Should these be removed / amended, or should there be a different policy on what gets included?

There are also some with spelling variations. Does this affect the functionality of the skills graph? E.g. if there is a typo, does that stop it being linked with the correct spelling?

Hi Josh, sorry - only just saw this! Yes, we should amend these. See #50. I have also added a test for this, which is shown in each PR. We should probably add a list of extra topics that are not Wikipedia entries but that we still want to include.
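For illustration only, here is a rough sketch of how a check like this could work. It is not the test from the PR. It assumes the public MediaWiki `action=query` endpoint with `formatversion=2`, where pages that do not exist carry a `missing` flag. The function names and the `extra_topics` allow-list are hypothetical.

```python
from typing import Dict, Iterable, List, Set


def build_titles_query(titles: Iterable[str]) -> Dict[str, str]:
    """Parameters for a MediaWiki 'query' request on a batch of titles."""
    return {
        "action": "query",
        "titles": "|".join(titles),  # the API takes pipe-separated titles
        "format": "json",
        "formatversion": "2",  # pages come back as a list of dicts
    }


def missing_titles(payload: dict, extra_topics: Set[str] = frozenset()) -> List[str]:
    """Titles the API flagged as missing, minus an allow-list.

    `extra_topics` is a hypothetical allow-list of topics we want to keep
    even though they have no Wikipedia article.
    """
    pages = payload.get("query", {}).get("pages", [])
    return [
        page["title"]
        for page in pages
        if page.get("missing") and page["title"] not in extra_topics
    ]


# Hand-written sample response (not real API output) to show the shape.
sample = {
    "query": {
        "pages": [
            {"pageid": 1, "ns": 0, "title": "Python (programming language)"},
            {"ns": 0, "title": "Pyhton programing", "missing": True},
            {"ns": 0, "title": "Research software engineering at UoM", "missing": True},
        ]
    }
}

flagged = missing_titles(sample, extra_topics={"Research software engineering at UoM"})
print(flagged)
```

Anything left in `flagged` would need amending or adding to the allow-list. The API may also normalize titles (for example capitalizing the first letter), so a real check would need to map normalized titles back to the names stored in the graph.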

If there is a typo, it would just appear as an additional entry. When you click on a topic, the code seems to ask Wikipedia for the first 40 results of a search for that topic name, and takes those as the "related" topics to show in the graph. If someone has a typo in their topic name, then they will not show up in this graph.
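To make the lookup described above concrete, here is a minimal sketch. It assumes the lookup goes through the MediaWiki search API (`list=search` with `srlimit=40`); I have not confirmed that this is how the skills graph code actually does it. The function names and the `User-Agent` string are made up for the example.

```python
import json
import urllib.parse
import urllib.request
from typing import Dict, List

API_URL = "https://en.wikipedia.org/w/api.php"


def build_search_params(topic: str, limit: int = 40) -> Dict[str, str]:
    """Parameters for a MediaWiki full-text search on `topic`."""
    return {
        "action": "query",
        "list": "search",
        "srsearch": topic,
        "srlimit": str(limit),  # first 40 results by default
        "format": "json",
    }


def titles_from_search(payload: dict) -> List[str]:
    """Extract result titles from a search response, in ranked order."""
    return [hit["title"] for hit in payload.get("query", {}).get("search", [])]


def fetch_related(topic: str, limit: int = 40) -> List[str]:
    """Query Wikipedia and return titles to treat as 'related' topics."""
    url = API_URL + "?" + urllib.parse.urlencode(build_search_params(topic, limit))
    request = urllib.request.Request(
        url, headers={"User-Agent": "skills-graph-sketch/0.1"}  # hypothetical UA
    )
    with urllib.request.urlopen(request) as response:
        return titles_from_search(json.load(response))


def related_in_graph(related: List[str], graph_topics: List[str]) -> List[str]:
    """Keep only the related titles that exactly match a topic in the graph."""
    known = set(graph_topics)
    return [title for title in related if title in known]
```

With exact title matching like this, a misspelled topic such as "Pyhton" would never equal a title in the search results, which matches the behaviour described above: the typo sits in the graph as its own entry and is not linked to the correctly spelled topic.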

Duplicate of #50.