Deky and Tono
This repository was created as part of an assignment for the Big Data Analysis course. It analyzes a database of rodents collected from 1977 to 2002 in southern Arizona to determine the relationship between weight and hindfoot length, using RStudio.
RStudio program

    install.packages("tidyverse")
    install.packages("gridExtra")
    install.packages("ggpubr")
    library(tidyverse)
    library(gridExtra)
    library(ggpubr)

This repository consists of:
- R script (*.R). The raw script written by the authors (Deky and Tono), covering every command used in the analysis.
- R Markdown (*.Rmd). A summary of the R analysis performed.
- PDF file (*.pdf). The knitted output of the R Markdown file.
- HTML file (*.html). The knitted output of the R Markdown file.
- Readme.md. An explanation of this repository.
- The data folder containing the data files being analyzed.
- scripts.Rproj. It is the project file for RStudio.
To replicate the process:
- Install RStudio
- Install the packages
- Load all the libraries with library()
- Download all the files and folders in this repository
- Open scripts.Rproj, which launches the project in RStudio
- Open the *.R file (script file)
- Open the *.Rmd file (R Markdown file)
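
Before running the scripts, it can help to check which of the packages named in this README are still missing. The snippet below is a minimal sketch using base R only. The variable names `required` and `missing_pkgs` are illustrative and do not come from the repository's scripts.

```r
# Packages named in this README (tidyverse is loaded via library(tidyverse)).
required <- c("gridExtra", "ggpubr", "tidyverse")

# Names of all packages installed in the local library paths.
installed <- rownames(installed.packages())

# Packages that still need install.packages() before the scripts can run.
missing_pkgs <- setdiff(required, installed)

if (length(missing_pkgs) > 0) {
  message("Still to install: ", paste(missing_pkgs, collapse = ", "))
} else {
  message("All required packages are installed.")
}
```

Any package reported as missing can then be installed with `install.packages()`.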
Analyses conducted in this repository include:
- Adding a trendline to the plot to show the R² value
- Linear model analysis to examine the relationship between two variables
- F-test (statistical test) to compare the variances of two populations
- T-test (statistical test) to compare the means of two populations
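
The analyses listed above can be sketched with base R functions (`lm()`, `var.test()`, `t.test()`). This is only an illustration under stated assumptions: the data are synthetic, the column names `weight`, `hindfoot_length`, and `sex` are assumed rather than taken from the repository's data files, and modelling weight as the response is a choice made for the example.

```r
set.seed(42)  # reproducible synthetic data, NOT the repository's dataset

n <- 200
rodents <- data.frame(
  sex = rep(c("F", "M"), each = n / 2),
  hindfoot_length = rnorm(n, mean = 36, sd = 1.5)
)
# Assumed linear relationship plus noise, for illustration only.
rodents$weight <- 5 + 1.1 * rodents$hindfoot_length + rnorm(n, sd = 3)

# Linear model: weight as a function of hindfoot length.
fit <- lm(weight ~ hindfoot_length, data = rodents)
r_squared <- summary(fit)$r.squared  # the R² shown with a trendline

# F-test: compare the variance of weight between the two sexes.
f_res <- var.test(weight ~ sex, data = rodents)

# Welch two-sample t-test: compare the mean weight between the two sexes.
t_res <- t.test(weight ~ sex, data = rodents)

cat("R squared:     ", round(r_squared, 3), "\n")
cat("F-test p-value:", round(f_res$p.value, 3), "\n")
cat("t-test p-value:", round(t_res$p.value, 3), "\n")
```

By default `t.test()` runs the Welch version, which does not assume equal variances. If the F-test gives no evidence of unequal variances, `var.equal = TRUE` can be passed to `t.test()` to run the pooled-variance test instead.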
Procedures performed in this repository include:
- Cleaning up the incomplete data
- Plotting general information about the data: the number of samples for each species and genus, and the proportion of each sex (female and male) in each category
- Plotting the distribution of weight and hindfoot length for the overall sample, and the distribution of mean weight and mean hindfoot length for each species
- Plotting the relationship between weight and hindfoot length for the complete dataset
- Fitting a linear model of weight and hindfoot length for the merriami species
- Performing a t-test to see whether the mean weight and hindfoot length of females differ significantly from those of males.
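
The cleaning and summary steps above might look roughly like the following. It is a sketch in base R rather than the tidyverse code the repository itself uses. The small `surveys` data frame is made up for illustration, and its column names and the second species name are assumptions, not values taken from the data folder.

```r
# Tiny made-up sample with some missing values (NA), for illustration only.
surveys <- data.frame(
  species = c("merriami", "merriami", "merriami", "ordii", "ordii", "ordii"),
  sex = c("F", "M", NA, "F", "M", "M"),
  weight = c(42, 45, 44, NA, 50, 52),
  hindfoot_length = c(35, 36, 37, 35, NA, 38),
  stringsAsFactors = FALSE
)

# Cleaning: keep only rows with no missing value in any column.
complete <- surveys[complete.cases(surveys), ]

# Number of samples per species, split by sex.
counts <- table(complete$species, complete$sex)

# Mean weight and mean hindfoot length for each species.
means <- aggregate(cbind(weight, hindfoot_length) ~ species,
                   data = complete, FUN = mean)

print(counts)
print(means)
```

The cleaned data frame can then be subset to one species, for example `complete[complete$species == "merriami", ]`, before fitting a model or running a t-test by sex.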