See:
- Report about Actors
- Report about Actor's name
- Report about Actor's text properties
The goal here is to identify which SYMOGIH actors already exist in Geovistory. To help with that, we found a record linkage library designed for exactly this task, which is used by the UK Ministry of Justice. The library is called SPLINK. We think that identifying the SYMOGIH actors is a good use case for testing it.
The learning, tests, and results are available here.
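As background, probabilistic record linkage tools of this kind are generally built on the Fellegi-Sunter model, in which each compared field adds or subtracts evidence that two records describe the same actor. The sketch below illustrates that idea in plain Python. It does not use the SPLINK API, and the field names (`name`, `birth_year`), the m/u probabilities, and the prior are hypothetical values chosen for illustration, not estimates from SYMOGIH or Geovistory data.

```python
import math

# Hypothetical parameters, for illustration only:
# m = P(field agrees | records are the same actor)
# u = P(field agrees | records are different actors)
PARAMS = {
    "name": (0.90, 0.01),
    "birth_year": (0.95, 0.05),
}


def match_weight(agreements):
    """Sum the log2 Bayes factors over all compared fields.

    `agreements` maps a field name to True (values agree) or False.
    """
    total = 0.0
    for field, agrees in agreements.items():
        m, u = PARAMS[field]
        if agrees:
            total += math.log2(m / u)
        else:
            total += math.log2((1 - m) / (1 - u))
    return total


def match_probability(weight, prior=0.001):
    """Turn a match weight into a probability, given a prior match rate."""
    prior_odds = prior / (1 - prior)
    posterior_odds = prior_odds * 2 ** weight
    return posterior_odds / (1 + posterior_odds)


both = match_weight({"name": True, "birth_year": True})
name_only = match_weight({"name": True, "birth_year": False})
neither = match_weight({"name": False, "birth_year": False})
```

A pair that agrees on both fields receives a higher weight than a pair that agrees on the name only, and a threshold on `match_probability` then decides which pairs are proposed as the same actor.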
While doing this analysis, we found that it may be necessary to first run record linkage within the SYMOGIH actors themselves, because we have already found some duplicates.
With the same strategy as described above, we tried to identify duplicated actors inside the BHP itself. Unfortunately, the results were not as good as expected (see issue #1 and the report), so we are trying a more traditional approach.
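What a traditional approach looks like is not detailed here. One common deterministic pattern, shown purely as an assumption and not as the method actually used, is to normalize names, group records by a blocking key, and flag pairs whose names are nearly identical. The record layout (`id`, `name`, `birth_year`), the threshold, and the sample records below are all made up for the example.

```python
import unicodedata
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def normalize(name):
    """Lowercase, strip accents, and collapse repeated whitespace."""
    decomposed = unicodedata.normalize("NFKD", name)
    no_accents = "".join(c for c in decomposed if not unicodedata.combining(c))
    return " ".join(no_accents.lower().split())


def find_duplicates(actors, threshold=0.9):
    """Return (id_a, id_b, score) for likely duplicate pairs.

    Records are only compared within the same birth year (the blocking
    key), which avoids comparing every record with every other record.
    """
    blocks = defaultdict(list)
    for actor in actors:
        blocks[actor["birth_year"]].append(actor)

    pairs = []
    for group in blocks.values():
        for a, b in combinations(group, 2):
            score = SequenceMatcher(
                None, normalize(a["name"]), normalize(b["name"])
            ).ratio()
            if score >= threshold:
                pairs.append((a["id"], b["id"], round(score, 2)))
    return pairs


# Invented sample records, not real SYMOGIH or BHP data.
actors = [
    {"id": 1, "name": "Jean Dupont", "birth_year": 1650},
    {"id": 2, "name": "Jean  Dupônt", "birth_year": 1650},
    {"id": 3, "name": "Pierre Martin", "birth_year": 1650},
    {"id": 4, "name": "Jean Dupont", "birth_year": 1702},
]
duplicates = find_duplicates(actors)
```

In this sketch, records 1 and 2 are flagged because their normalized names are identical, while record 4 is never compared with them because it falls in a different block. Blocking on birth year is a trade-off: it keeps the number of comparisons small but misses duplicates whose dates were recorded differently.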