Clustering-of-Yelp-Restaurnts

• Parsed large JSON dataset in python and employed sophisticated techniques using pandas and sklearn libraries to analyze clusters to find different culinary districts in Las Vegas.

• Cluster was measured based on:

– Analysis of reviews written by users to pull out user defined categories.

– Categories defined by business owner.

• Cluster methods used:

– KMeans++

– Hierarchical

– GMM