parameter in HIM distance
jkbren opened this issue · 4 comments
The paper does not give much guidance on what a default value of ζ should be. In its absence I think using 1.0 is fine. Maybe we could add a little more documentation, i.e., explicitly say if ζ=0 it's Hamming and as ζ approaches infinity it becomes IM, but I'm not sure what we can do past that.
I don't think there's much more to say other than the fact that HIM is the square root (why?) of the weighted average of H² and IM², and ξ is the ratio of the weights. ξ=1 corresponds to equal weights.
Does anyone know (1) why they take the square root, or (2) why @sdmccabe wrote ζ instead of ξ?
My guess is that the square root is to make it (proportional to) the Euclidean length of a vector with components H and (√ξ)IM. Another way to read it that looks a bit better is (1+ξ)(HIM)²=H²+ξ(IM)².