Jovi Koh (jlk668)
John
Diana Zhao (dz1371)
Cleaning the raw data is fairly simple: the program reads the raw data file and goes through it row by row, deciding whether to keep each one. It stores the first header and ignores all subsequent headers. For each row that starts with a year, the program splits the row on spaces, converts every data point to a float, and writes the row to the clean output csv file.
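The cleaning pass described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names (is_data_row, clean_lines, clean_file) are hypothetical, a "year" is assumed to be a leading four-digit token, the header is assumed to be the first non-blank row that does not start with a year, and every value in a data row is assumed to parse as a float (a missing-value marker such as "***" would need extra handling).

```python
import csv


def is_data_row(line):
    """True when the first whitespace-separated token looks like a 4-digit year (assumption)."""
    tokens = line.split()
    return bool(tokens) and len(tokens[0]) == 4 and tokens[0].isdigit()


def clean_lines(lines):
    """Return (header, rows): the first header's tokens and one list of floats per data row."""
    header = None
    rows = []
    for line in lines:
        if is_data_row(line):
            # Every data point, including the year, is stored as a float.
            rows.append([float(token) for token in line.split()])
        elif header is None and line.strip():
            # Keep only the first header; later headers and other rows are skipped.
            header = line.split()
    return header, rows


def clean_file(raw_path, clean_path):
    """Read the raw text file and write the kept rows to a CSV file."""
    with open(raw_path) as src:
        header, rows = clean_lines(src)
    with open(clean_path, "w", newline="") as dst:
        writer = csv.writer(dst)
        if header is not None:
            writer.writerow(header)
        writer.writerows(rows)
```

With the paths listed at the end of this file, it would be called as clean_file("data/raw_data.txt", "data/clean_data.csv").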
The analysis program is also simple. Each decade contains 120 monthly data points (12 months for each of 10 years), and averaging all 120 of them uses every measurement available for that decade, which we expect to give the most accurate result. For each decade, the program reads those 120 data points and sums them, divides the sum by 120 to get the average temperature fluctuation within the decade, and then prints the averages to an output text file.
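The decade averaging can be sketched as follows. This is an illustration under stated assumptions, not the project's actual code: decade_averages and write_results are hypothetical names, each cleaned row is assumed to hold the year followed by 12 monthly values (any further columns are ignored), and the output line format is made up. The sketch divides by the number of values actually read, which equals 120 for a complete decade, rather than by a hard-coded 120.

```python
def decade_averages(rows):
    """rows: lists like [year, m1, ..., m12]; returns {decade_start: average}."""
    sums = {}
    counts = {}
    for row in rows:
        decade = int(row[0]) // 10 * 10   # e.g. 1887 -> 1880
        monthly = row[1:13]               # assumed: 12 monthly values after the year
        sums[decade] = sums.get(decade, 0.0) + sum(monthly)
        counts[decade] = counts.get(decade, 0) + len(monthly)
    # For a complete decade, counts[decade] is 120.
    return {decade: sums[decade] / counts[decade] for decade in sorted(sums)}


def write_results(averages, out_path):
    """Write one line per decade to a text file (line format is an assumption)."""
    with open(out_path, "w") as out:
        for decade, average in averages.items():
            out.write(f"{decade}s: {average:.4f}\n")
```

Dividing by the observed count keeps the average meaningful if the first or last decade in the data is incomplete; for full decades it is the same as dividing by 120.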
Raw data: data/raw_data.txt
Cleaned up data in csv: data/clean_data.csv
Analysis results in text file: analysis_results.txt