/rladies-clusters

This repo contains the materials for the R Ladies Edinburgh session on Cluster Analysis in R. In the exercise we identify and characterise energy inefficiency clusters in the private rental sector in Greater London.

Cluster analysis in R

Energy efficiency in the private rental sector in Greater London

Like many countries globally, the private rental sector in England contains some of the poorest quality and least energy efficient properties. In this exercise we will use cluster analysis (specifically a k-means clustering approach) to identify patterns in energy inefficiency in the private rental sector across Greater London. We will use new energy performance certificate (EPC) data that offers detailed information about the energy efficiency of properties.

The tutorial builds on an established tradition of geodemographic research in geography. Here, k-means clustering is applied to identify clusters of data points with similar attributes. The technique has been widely applied to understand a range of phenomena including COVID-19, digital infrastructure, cities, ecology, and poverty.

Access the practical exercise: The pdf guide is available here.

Download the accompanying dataset: The geopackage is available from dropbox.

cluster_jpeg