Case study 1 for Doing Data Science at SMU

DDSCaseStudy1

This is the Case Study 1 project for Doing Data Science (DDS) at SMU.

Instructions for running the project

  • Clone the repository.
  • Go to the DDSCaseStudy1 folder and open Makefile.Rmd in RStudio, with the R working directory set to the root of the project.
  • Knit Makefile.Rmd.
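If you prefer the R console to the RStudio Knit button, the last step can probably be reproduced with `rmarkdown::render`. The sketch below is only an illustration: the helper name `knit_project` is hypothetical and not part of this repository, and it assumes the `rmarkdown` package is installed.

```r
# Hypothetical helper (not part of the repository): knit Makefile.Rmd
# from the project root without using the RStudio Knit button.
knit_project <- function(root = ".") {
  makefile <- file.path(root, "Makefile.Rmd")
  if (!file.exists(makefile)) {
    stop("Makefile.Rmd not found in: ", root)
  }
  # knit_root_dir makes relative paths such as data/ and source/
  # resolve against the project root while the document is knitted.
  rmarkdown::render(makefile, knit_root_dir = normalizePath(root))
}
```

Calling `knit_project(".")` from the DDSCaseStudy1 folder should then build the project the same way knitting in RStudio does.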

Files and folders

  • Makefile.Rmd - builds the project in a Makefile style
  • Analysis.Rmd - contains all analysis of the data
  • data - a folder containing the data files
  • source - a folder containing all scripts used for downloading and cleaning
  • source/download.R - a script used for downloading the csv data files
  • source/clean_gdp.R - a script used for cleaning the gdp data
    • cleans data from gdp.csv and creates a tidy gdp_tidy.csv file
  • source/clean_income_groups.R - a script used for cleaning the income groups data
    • cleans data from income_groups.csv and creates a tidy income_groups_tidy.csv file
  • source/merge.R - a script used for merging the two datasets
    • merges the data from gdp_tidy.csv and income_groups_tidy.csv files and creates merged_data.csv
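As a rough illustration of the merge step described above, the sketch below joins two tidy tables on a shared key with base R's `merge`. It is not the code in source/merge.R: the function name `merge_tidy`, the key column `CountryCode`, the other column names, and the toy rows are all assumptions made for the example.

```r
# Hypothetical sketch of an inner join between the two tidy datasets.
# The key column name "CountryCode" is an assumption, not taken from the repo.
merge_tidy <- function(gdp, income_groups, key = "CountryCode") {
  # all = FALSE keeps only the rows whose key appears in both tables
  merge(gdp, income_groups, by = key, all = FALSE)
}

# Toy data for illustration only (not real GDP figures).
gdp_toy <- data.frame(
  CountryCode = c("AAA", "BBB", "CCC"),
  GDP = c(100, 200, 300),
  stringsAsFactors = FALSE
)
income_toy <- data.frame(
  CountryCode = c("AAA", "BBB", "DDD"),
  IncomeGroup = c("High income", "Low income", "Low income"),
  stringsAsFactors = FALSE
)

merged_toy <- merge_tidy(gdp_toy, income_toy)
print(merged_toy)
```

With the real files, the two inputs would presumably come from `read.csv()` on gdp_tidy.csv and income_groups_tidy.csv, and the result would be written out with `write.csv(..., row.names = FALSE)` as merged_data.csv. An inner join drops countries that appear in only one file; passing `all = TRUE` to `merge` would keep them instead.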