ResidentMario/missingno

Distance metric in dendrogram


Hi,
The dendrogram function relies on scipy, whose default metric for computing the distance between pairs of points is Euclidean. Since we are dealing with binary nullity values, wouldn't it be more appropriate to use a metric designed for binary data, such as Hamming distance (i.e. the proportion of entries that disagree) or Jaccard distance?
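
For illustration, here is a rough sketch of what I have in mind. The toy DataFrame and variable names are made up, and I am assuming the nullity matrix is built roughly like this (one 0/1 vector per column); I have not checked exactly how missingno constructs its input. The only scipy call used is `scipy.cluster.hierarchy.linkage` with its `metric` argument.

```python
import numpy as np
import pandas as pd
from scipy.cluster import hierarchy

# Hypothetical toy data; NaN marks a missing value.
df = pd.DataFrame({
    "a": [1.0, np.nan, 3.0, np.nan],
    "b": [1.0, np.nan, 3.0, 4.0],
    "c": [np.nan, 2.0, np.nan, 4.0],
})

# One row per column of df, one 0/1 nullity flag per record (assumed layout).
nullity = df.isnull().astype(int).values.T

# scipy's default metric is Euclidean.
z_euclidean = hierarchy.linkage(nullity, method="average")

# Same clustering with metrics intended for binary vectors.
z_hamming = hierarchy.linkage(nullity, method="average", metric="hamming")
z_jaccard = hierarchy.linkage(nullity.astype(bool), method="average", metric="jaccard")

# Column 2 of each linkage matrix holds the merge distances.
print(z_euclidean[:, 2], z_hamming[:, 2], z_jaccard[:, 2])
```

With Hamming, the distance between two columns is just the fraction of records where one is missing and the other is not, which seems easier to read off the dendrogram axis.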

Thanks!