/dsc130-datasci-R

Course repository for an introduction to data science with R.

MIT LicenseMIT

Introduction to Data Science with R

Course description

This course is an introduction to data science using the R programming language with a focus on data acquisition and wrangling, exploratory data analysis, data visualization, inference, modeling, and effective communication of results. Students will create visualizations and models, and use them to gain insights and make predictions. In this model-based course, students are introduced to statistical analysis and models through examples. Students will perform meaningful analysis on real data. This course assumes no prior experience with programming and requires no specific statistical or mathematical knowledge beyond high school algebra.

Open Resources

Resources below are free electronically.

Learning Outcomes

After successful completion of this course the student will be able to:

  • Acquire and wrangle data;
  • Use effective and appropriate methods for visualizing and describing data;
  • Demonstrate "best practice" coding;
  • Demonstrate proficiency building basic statistical models, test hypothesis, and use models for interpretation and prediction; and
  • Communicate results (via interactive dashboard, i.e., RShiny).