The focus of this document is on using R for data processing, programming, modeling, visualization, and presentation of results. It contains exercises for additional practice, and most of the content has been translated to Python and is available via Jupyter notebooks.
-
Understanding Basic R Approaches to Gathering and Processing Data
- Overview of Data Structures
- Getting data in and out
- Indexing
-
Getting Acquainted with Other Approaches to Data Processing
- Pipes, and how to use them
- tidyverse
- data.table
- Misc.
-
Using R more fully
- Dealing with objects
- Iterative programming
- Writing functions
-
Going further
- Code style
- Vectorization
- Regular expressions
-
Model Exploration
- Key concepts
- Understanding and fitting models
- Overview of extensions
-
Model Criticism
- Model Assessment
- Model Comparison
-
Machine Learning
- Concepts
- Demonstration of techniques
-
Thinking Visually
- Visualizing Information
- Color
- Contrast
- and more...
-
Using ggplot2
- Aesthetics
- Layers
- Themes
- and more...
-
Adding Interactivity
- Package demos
- Shiny
- Building Better Data-Driven Products
- Reproducibility concepts
- Starting out with R markdown
- Standard documents
- Customization and more
- Themes, CSS, etc.