This project produced a presentation for a hypothetical audience at a brewing company. The goal was to address the nine questions and tasks listed below.
- How many breweries are present in each state?
- Merge the beer data with the breweries data. Print the first six and the last six observations to check the merged file.
- Address the missing values in each column.
- Compute the median alcohol content (ABV) and International Bitterness Units (IBU) for each state. Plot a bar chart to compare.
- Which state has the beer with the highest alcohol content (ABV)? Which state has the most bitter (IBU) beer?
- Comment on the summary statistics and distribution of the ABV variable.
- Is there an apparent relationship between the bitterness of the beer and its alcoholic content? Draw a scatter plot. Make your best judgment of a relationship and EXPLAIN your answer.
- Budweiser would also like to investigate the difference with respect to IBU and ABV between IPAs (India Pale Ales) and other types of Ale (any beer with “Ale” in its name other than IPA). You decide to use KNN classification to investigate this relationship. Provide statistical evidence one way or the other. You can of course assume your audience is comfortable with percentages.
- Find one other useful inference from the data that you feel Budweiser may be able to find value in. You must convince them why it is important and back up your conviction with appropriate statistical evidence.
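
The first few tasks (counting breweries per state, merging the two tables, checking missing values, and computing per-state medians) can be sketched as follows. This is a minimal illustration in plain Python, not the project's actual code: the rows are made-up toy data, and the field names (`brew_id`, `state`, `abv`, `ibu`) are assumptions about how the columns might be labeled.

```python
from collections import Counter, defaultdict
from statistics import median

# Hypothetical toy rows; the real project data is not reproduced here.
breweries = [
    {"brew_id": 1, "name": "Brewery A", "state": "CO"},
    {"brew_id": 2, "name": "Brewery B", "state": "CO"},
    {"brew_id": 3, "name": "Brewery C", "state": "TX"},
]
beers = [
    {"beer": "Beer 1", "brew_id": 1, "abv": 0.050, "ibu": 40},
    {"beer": "Beer 2", "brew_id": 1, "abv": 0.070, "ibu": None},
    {"beer": "Beer 3", "brew_id": 2, "abv": 0.060, "ibu": 60},
    {"beer": "Beer 4", "brew_id": 3, "abv": None, "ibu": 20},
    {"beer": "Beer 5", "brew_id": 3, "abv": 0.045, "ibu": 30},
]

# Breweries per state.
breweries_per_state = Counter(b["state"] for b in breweries)

# Merge: attach brewery name and state to each beer via the shared brewery id.
by_id = {b["brew_id"]: b for b in breweries}
merged = [
    {**beer,
     "brewery": by_id[beer["brew_id"]]["name"],
     "state": by_id[beer["brew_id"]]["state"]}
    for beer in beers
]

# Count missing values in the numeric columns.
missing = {
    col: sum(1 for row in merged if row[col] is None)
    for col in ("abv", "ibu")
}

def median_by_state(rows, col):
    """Median of `col` per state, skipping rows where the value is missing."""
    groups = defaultdict(list)
    for row in rows:
        if row[col] is not None:
            groups[row["state"]].append(row[col])
    return {state: median(values) for state, values in groups.items()}

median_abv = median_by_state(merged, "abv")
median_ibu = median_by_state(merged, "ibu")

print(breweries_per_state)
print(merged[:6], merged[-6:])  # first and last six rows of the merged data
print(missing, median_abv, median_ibu)
```

Here missing values are simply skipped when computing medians. That is only one way to address them; imputing or reporting them separately may suit the presentation better.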