Statistical Distance Features for Test data

Question

Statistical Distance Features for Test data

zhangxiangnick opened this issue 8 years ago · 1 comments

How do you generate the statistical distance features (described in Sect. 3.2.2 of your notes) for test data? There is no median_relevance labels for test data. How could it possible to group the test data by median_relevance?

Answer 1 · 2016-04-16T05:56:41.000Z

Group the training data by (query, median_relveance) and compute the statistical distance between each sample of test data and the corresponding group of the same query. You can have a look at the code.