Data Analysis of the US murders dataset
dslabs
tidyverse
statip
ggrepel
ggthemes
US Murders data
- find out about the different parameters
- know variable count
- find out the class and characteristics of variables
- checking for data integrity
Fixing; - Duplicate data - incomplete data - inaccurate data - inconsistent data
- renaming variables
- creating new columns;
- status = if state is safe to live in or not
- rate = the death rate of gun murders
- central tendency
- spread
- Data component:
variables picked based of geometric component
- Geometric component:
boxplot, scatterplot
- Aeshetic mapping;
colors = different regions
text identification
- Scale component
specific axises in log10 scale
ranges are axises depend on the data
- Labels, titles, Legends
- facets