ebanalyse/fuzzup

Improve fuzzy clustering algorithm

Opened this issue · 0 comments

  • Currently, for each string, it is assigned all strings that are within the cutoff and then is removed from the pool. It does not take into account, if there is a seperate cluster within the same cutoff that would be better suited.

  • Instead of only traversing column, traverse both row & column to more intelligently cluster.