/nonparametric-analysis-US-dairy-production-consumption

Project for the Nonparametric Statistics course of the MSc in Mathematical Engineering @ Polimi (A.Y. 2022-2023).

Primary LanguageTeX

Nonparametric Analysis of US Dairy Production and Consumption

This project was developed for the course of "Nonparametric Statistics" held at Politecnico di Milano in the academic year 2022-2023.

🏆 This was the highest ranking project in its year batch.

We present a nonparametric analysis of the evolution of dairy production and consumption in the United States from 1980 to 2021. Starting from the needs of a stakeholders which is about to enter the market, we present a step-by-step analysis and the findings to provide meaningful information to optimize production and pricing. We also provide an outlier analysis in terms of years and, in the last part, a spatial analysis and a clustering of the types of cheese.

Table of contents

Installation

How to clone the repository

git clone https://github.com/SmearyTundra/nonparametric-analysis-US-dairy-production-consumption

How to install the packages

Install the required packages from CRAN

packages_list <-
    c(
        "tidyverse",
        "ggplot2",
        "mgcv",
        "rgl",
        "splines",
        "conformalInference",
        "pbapply",
        "parallel",
        "DepthProc",
        "progress",
        "dbscan",
        "beadplexr",
        "robustbase",
        "readxl",
        "tidyr",
        "car",
        "sp",
        "visreg",
        "mgcViz",
        "usmap",
        "raster",
        "sf",
        "maps",
        "ggspatial",
        "BNPTSclust",
        "roahd",
        "fda.usc",
        "npsp"
    )
install.packages(packages_list)

Analysis and results

The repository contains different files to perform the analysis, here we report their explanation together with a hyperlink to the knitted PDF version:

The final presentations can be found here:

The final report can be found here:

Authors