EDA-H1B-application

This project was created to showcase the skills learnt in the DA-5020 Collecting, Storing and Retrieving Data course in the Spring 2017 semester. The main aim of the project was to identify errors in the data and tidy them as much as possible. The tidied data was then stored in a SQL database, some basic low-level queries were run against it, and the data was extracted back from the database. Finally, the extracted data was plotted as graphs. All of these steps were carried out in the R programming language using the dplyr, ggplot2 and RSQLite packages.
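The tidy, store, query and plot steps described above can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the toy data frame, the column names `case_status` and `year`, the table name `h1b`, and the in-memory database are all assumptions made for the example.

```r
library(DBI)
library(dplyr)
library(ggplot2)

# Hypothetical toy data standing in for the raw H-1B records.
# The column names are assumptions, not the project's schema.
h1b_raw <- data.frame(
  case_status = c("CERTIFIED", "certified ", "DENIED", "CERTIFIED"),
  year        = c(2016L, 2016L, 2017L, 2017L),
  stringsAsFactors = FALSE
)

# Tidy: strip stray whitespace and normalise the case of the status labels.
h1b_tidy <- h1b_raw %>%
  mutate(case_status = toupper(trimws(case_status)))

# Store the tidied data in an in-memory SQLite database (assumed here;
# a file path could be used instead of ":memory:").
con <- dbConnect(RSQLite::SQLite(), ":memory:")
dbWriteTable(con, "h1b", h1b_tidy)

# Run a basic query and pull the result back into R as a data frame.
counts <- dbGetQuery(con, "
  SELECT year, case_status, COUNT(*) AS n
  FROM h1b
  GROUP BY year, case_status
  ORDER BY year, case_status
")
dbDisconnect(con)

# Plot the extracted data as a grouped bar chart.
p <- ggplot(counts, aes(x = factor(year), y = n, fill = case_status)) +
  geom_col(position = "dodge") +
  labs(x = "Year", y = "Applications", fill = "Case status")
```

Calling `print(p)` renders the chart. An in-memory database keeps the sketch self-contained; a real run would more likely write to a database file on disk.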

Updates 08/09/2018

Functions were created to read, transform and combine the data into a single data table. Reading and transformation are significantly faster, because each file is now transformed as it is read, rather than transforming the combined data frame as in the previous iteration.
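The read, transform and combine pattern from this update might look something like the sketch below. The function names `read_and_transform` and `combine_files`, the `case_status` column and the temporary CSV files are hypothetical and exist only for illustration. The sketch uses base R data frames, which may differ from what the project uses.

```r
# Hypothetical helper: read one file and transform it immediately,
# so each file is cleaned while it is still small.
read_and_transform <- function(path) {
  df <- read.csv(path, stringsAsFactors = FALSE)
  df$case_status <- toupper(trimws(df$case_status))
  df
}

# Hypothetical helper: apply the per-file step to every path,
# then bind the already-clean pieces into a single table.
combine_files <- function(paths) {
  do.call(rbind, lapply(paths, read_and_transform))
}

# Usage with two small temporary files (made-up data).
tmp_dir <- tempfile("h1b_")
dir.create(tmp_dir)
write.csv(data.frame(case_status = c("certified ", "DENIED")),
          file.path(tmp_dir, "2016.csv"), row.names = FALSE)
write.csv(data.frame(case_status = " Certified"),
          file.path(tmp_dir, "2017.csv"), row.names = FALSE)

paths <- list.files(tmp_dir, pattern = "\\.csv$", full.names = TRUE)
combined <- combine_files(paths)
```

Transforming per file means the cleaning logic only ever sees one file's rows at a time, and the final bind operates on data that needs no further passes.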