Clustering features
ArkadiyD opened this issue · 2 comments
ArkadiyD commented
In the line
Line 418 in fa76cb5
linnanwang commented
Thank you. Yes, f(x) is included; otherwise we would split by x alone. Here we want to split x based on f(x), so the joint features [x, f(x)] are needed. I will clarify that in the final revision.
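To make the point about [x, f(x)] concrete, here is a minimal sketch of a two-group K-means split on the joint features. It is not the repository's code: `kmeans_split`, the toy data, and the rule that the cluster with the higher mean f(x) counts as "good" are all assumptions for illustration.

```python
import numpy as np
from sklearn.cluster import KMeans


def kmeans_split(X, fX, seed=0):
    """Hypothetical helper: cluster the joint features [x, f(x)] into two groups."""
    X = np.asarray(X, dtype=float)
    fX = np.asarray(fX, dtype=float).reshape(-1, 1)
    # Append f(x) as an extra feature column, so the clustering sees [x, f(x)].
    features = np.hstack([X, fX])
    labels = KMeans(n_clusters=2, n_init=10, random_state=seed).fit_predict(features)
    # Assumption: larger f(x) is better, so the cluster with the higher
    # mean f(x) is treated as the "good" one.
    cluster_means = [fX[labels == k].mean() for k in (0, 1)]
    good_cluster = int(np.argmax(cluster_means))
    return labels == good_cluster


# Toy data: 40 points in 2-D whose values fall into two well-separated levels.
rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(40, 2))
fX = np.where(np.arange(40) < 20, 10.0, -10.0)
is_good = kmeans_split(X, fX)
```

Because f(x) is one of the clustered columns, the two groups here are separated by value rather than by position alone. If x and f(x) live on very different scales, some rescaling may be needed before clustering.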
However, there is a possible improvement here. Instead of using K-means to label samples into two groups, we can learn a regressor R on [x, f(x)]. The split then becomes: samples with R(x) > f(x) go to the good partition, and samples with R(x) < f(x) go to the bad partition.
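A rough sketch of the regressor-based split could look like the following. The comment does not say which regressor to use, so a least-squares linear model stands in as an assumption, and `regressor_split` is a hypothetical name. The direction of the comparison follows the comment as written; ties with R(x) = f(x), which the comment does not cover, fall into the bad partition here.

```python
import numpy as np


def regressor_split(X, fX):
    """Hypothetical helper: fit R(x) to f(x), then label each sample by comparing the two."""
    X = np.asarray(X, dtype=float)
    fX = np.asarray(fX, dtype=float)
    # Assumption: R is a linear model with an intercept, fitted by least squares.
    A = np.hstack([X, np.ones((len(X), 1))])
    coef, *_ = np.linalg.lstsq(A, fX, rcond=None)
    predicted = A @ coef  # R(x) for every sample
    # As stated in the comment: R(x) > f(x) goes to the good partition.
    return predicted > fX


# Toy data: values alternate around a gently decreasing trend.
X = np.array([[0.0], [1.0], [2.0], [3.0]])
fX = np.array([1.0, -1.0, 1.0, -1.0])
is_good = regressor_split(X, fX)
print(is_good)  # [False  True False  True]
```

Unlike the K-means labels, this split depends on the sign of the residual R(x) - f(x), so the choice of regressor directly shapes the partition.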
ArkadiyD commented
Thanks!