/Data_Science_Project

Various sampling methods to speed up clusterring

Primary LanguageJupyter Notebook

CS328 - Introduction to Data Science

Sampling Techniques for better performance of KMeans Algorithm We have implemented the following algorithms in a fairly successful manner:

  • Uniform Sampling
  • Leverage Score based Sampling
  • Volume Sampling
  • Light Weight Coresets based Sampling