When given several thousand HTML product pages from various retail websites, this program uses a locality-sensitive hashing algorithm to determine which pages are talking about the same products.
DavidYourNeighbor/ProductPageClustering
When given several thousand HTML product pages from various retail websites, this program uses a locality-sensitive hashing algorithm to determine which pages are talking about the same products
Java