Improve fuzzy clustering algorithm
Opened this issue · 0 comments
websorz commented
-
Currently, for each string, it is assigned all strings that are within the cutoff and then is removed from the pool. It does not take into account, if there is a seperate cluster within the same cutoff that would be better suited.
-
Instead of only traversing column, traverse both row & column to more intelligently cluster.