- Data scraped from Weather Underground.
- Plotted using Rstudio/ggplot2.
All data taken from the city's nearest airport.
Total downloaded file size is about 30MB. Play with weather.R
. Remember to set the correct directory using the setwd()
command at the top of the code. I have not included the code I used to scrape Wunderground, mostly because:
- (a) it would probably tick off the website owner if I initiated a Hug of Death,
- (b) the
.csv
files are already present so what's the point, - (c) it only works for Linux (scrapes via
wget
), and - (d) it's very poorly written garbage that nobody should use anyway.
weather-indiv.R
is for individual plots, but make sure you read the commented lines.
A few of these plots on the Temperature plot will have "Grey areas" which indicate that the data is out-of-range. So basically too hot or too cold. You can fix this by changing the scale limits, but it makes the other cities a little less beautiful, so I decided against.
If you're wondering how it's possible to have humidity below water's freezing temperature, you should read this paper [PDF].
I picked the 24 cities based on a balancing act of the following criteria:
- Cities with data-rich sources (some of the best historic data happens to be provided by international airports in the US, so I exclusively used those)
- Cities with high populations (helps with the international airport thing too)
- Cities that are evenly spread throughout the geography of the US. Here's a map of the US cities I used.
- Cool (and, well, hot) anomalies like Alaska, Hawaii, and Siberia.
Below are individualized plots for 24 of the cities on this file, plus a few more I peppered in after planning to use them and then deciding otherwise. If you want to request me to do a city, send me a message and I'll see what I can do when I have the time. Feel free to repost these to your city's subreddit; just give attribution.
Anchorage, Atlanta, Baltimore, Billings, Bismarck, Boston, Charleston, Chicago, Cleveland, Denver, El Paso, Honolulu, Houston, Los Angeles, Miami, Minneapolis, Nashville, New Orleans, New York City, Oklahoma City, Philadelphia, Phoenix, Portland, Rochester, Salt Lake City, San Francisco, Seattle, St. Louis, Topeka
In addition to these plots, I was able to get some alternate plotting done:
- I was going to do an alt-plot that adjusted for wind chill and heat index, but it looked like it might have been painfully time consuming to tabulate. Perhaps another Redditor can use the data to mess with Texas.
- Cloud cover for US cities, as measured in Oktas. Looks like the only data available in "Okta" units is after 1972.
- Precipitation for US cities, measured in inches. Shows some very strange rainy seasons in the 70s and the 90s.
- For international version, Click here
- For Reddit thread, Click here