/webspam

Groudtruth files (cosine similarity) of the webspam dataset.

Reference

Webspam dataset is a large-scale sparse dataset.

The groudtruths is the bruteforce top-1024 KNN graph result for the first 10,000 datapoints against the rest.