Climate analysis is needed for opening a new surf store that also sells ice-cream in Oahu, Hawaii. This project relates to advanced data storage and retrieval.
Python, Flask, Jupyter Notebook, SQLite, SQLAlchemy using Python, pandas, numpy, matplotlib, Jupyter notebook, VS Code
We need to write queries to filter the data from Measurements table in the hawaii.sqlite database to retrieve all the temperatures for the months of June and December. We will then convert those temperatures to a list, create a DataFrame from the list, and generate the summary statistics. We will write two additional queries to gather more weather data for these two months.
Click here for a link to the code: SurfsUp_Challenge.ipynb
1a. A query that filters the date column from the Measurement table to retrieve all the temperatures for the month of June:
2a. A query that filters the date column from the Measurement table to retrieve all the temperatures for the month of December:
There are 1700 observations from the month of June as compared to 1517 observations from the month of December.
Maximum temperatures in June and December vary by only a couple degrees (85 deg in June vs. 83 deg in Dec).
Minimum temperatures in June and December vary by 8 degrees (64 deg in June vs. 56 deg in Dec).
Mean temperatures in June and Dec vary only by 4 degrees (~75 deg in June vs. ~71 deg in Dec).
After looking carefully at the analysis results it can be summarized that opening a surf store that also serves ice-cream, is totally safe in Oahu because sales will happen the same whether it is June or December. The high temperatures in June vs December are quite similar and are within the range of 83 to 85 deg. Also, the average temperatures are within the range of 71-75 deg. Some additional queries were made to assist in deciding the best location on the island.
A. A query is written that filters the Measurement table to retrieve the Station IDs, number of observations, and temperatures for the month of June. It is then grouped by the Station ID and ordered by the number of observations, then put into a pandas dataframe:
B. A query is written that filters the Measurement table to retrieve the Station IDs, number of observations, and temperatures for the month of December. It is then grouped by the Station ID and ordered by the number of observations, then put into a pandas dataframe:
- There is no relation between the number of observations and average temperatures recorded at a certain station location.
- The location of following stations will be best suited for opening a new store:
- USC00519397 (Most no. of observations and the highest average temp in the month of June)
- USC00514830 (Highest Average temp in the month of December)