Project Information

The fundamental purpose of this project is to analyze data from big retail grocery chains in order to extract patterns and key insights. The visuals are produced in RStudio with the ggplot2 library and in Tableau Public. The project also includes an Excel file exported from the tidy data after the analysis.

Languages

R

Environment

RStudio

Installation

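# Set the working directory to the project folder
setwd('C:/YourPathGoesHere/')
# Point R at the folder where packages are installed
.libPaths('C:/Users/')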

Libraries

library(readr)
library(tidyverse)
library(magrittr)
library(writexl)

Problems Faced

Common Problems
  • Renaming Variables
  • Dealing with NA values
  • Dropping Columns
  • Deleting Rows
Other Problems
  • Each product category belongs to a more generic product category
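
The problems listed above can be sketched on a small toy data frame. This is only an illustration, not the project's actual cleaning code: the data, the `Notes` column, the `lookup` table and the `Generic_Category` name are all hypothetical, and replacing NA with 0 is just one possible choice. Only `Category_Name` and `AB` are borrowed from the plotting code in this README.

```r
library(dplyr)
library(tidyr)

# Hypothetical toy data; the values and the Notes column are made up.
raw <- tibble(
  `Category Name` = c("Milk", "Bread", NA, "Apples"),
  AB              = c(120, NA, 30, 75),
  Notes           = c("a", "b", "c", "d")
)

tidy <- raw %>%
  rename(Category_Name = `Category Name`) %>%  # renaming variables
  select(-Notes) %>%                           # dropping columns
  filter(!is.na(Category_Name)) %>%            # deleting rows without a category
  mutate(AB = replace_na(AB, 0))               # dealing with NA values (assumed: NA means 0)

# Hypothetical lookup table: each category maps to a more generic category.
lookup <- tibble(
  Category_Name    = c("Milk", "Bread", "Apples"),
  Generic_Category = c("Dairy", "Bakery", "Fruit")
)

# Attach the generic category to every row.
tidy <- tidy %>% left_join(lookup, by = "Category_Name")
```

A lookup table joined with `left_join()` keeps the category hierarchy in one place, so a change to the mapping does not require touching the cleaning steps.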

Reading Files

Markets <- read_csv("WorkingData/markets_all.csv")
Categories <- read_csv("WorkingData/categories.csv")
bar <- read_csv("WorkingData/bar.csv")
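
Once the files are read and tidied, the Excel export mentioned in the project information could look roughly like the sketch below. It assumes the writexl package loaded above is used for the export. The `tidy` data frame and the temporary output path are hypothetical stand-ins, not the project's real data or file name.

```r
library(writexl)

# Hypothetical tidy data frame standing in for the project's cleaned data.
tidy <- data.frame(
  Category_Name = c("Milk", "Bread"),
  AB            = c(120, 45)
)

# Write to a temporary file here; a real run would choose its own path.
out_path <- tempfile(fileext = ".xlsx")
write_xlsx(tidy, path = out_path)
```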

Visuals

Visuals created via

  • RStudio and
  • Tableau

What Consumers buy the most from each Market

[Figure: Sheet 3]

Code Samples

ggplot(AB, aes(Category_Name, AB)) +
  geom_bar(stat = 'identity') +
  labs(x = 'Category Name',
       y = 'Total Sum',
       title = 'Which is the most preferred category from Food',
       subtitle = 'Market: AB') +
  coord_flip()

[Figure: Rplot]