CalPolySLO_STAT431_AdvR

Primary language: HTML. License: GNU General Public License v3.0 (GPL-3.0).

STAT 431: Advanced Statistical Computing with R @ Cal Poly, San Luis Obispo

Advanced techniques for efficient use of computers to perform statistical computations and to analyze large amounts of data. Includes version control systems; tools supporting reproducibility; functional programming; randomization and bootstrapping; dynamic data visualizations; and R package development. 4 lectures.

Tentative Weekly Schedule

Unit 1

Week 1: Review of STAT 331

  • GitHub
  • Base R and the Tidyverse

Week 2: Graphics (and Output?)

  • ggplot2
  • gganimate?
  • leaflet
  • geosphere
  • circlize
  • DT
  • kable

Week 3: Data Sources

  • File types? (CSV, TSV, XML, JSON, TXT...)
  • rvest?
  • APIs
  • Databases

Week 4: Package-Based Workflow

  • Function Writing
  • Documentation/Packaging (roxygen)
  • (in the context of OLS)
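
As a rough illustration of what the Week 4 topics might look like together, here is a minimal sketch of a roxygen-documented function that fits OLS. The function name `ols_coef` and its arguments are hypothetical and chosen only for this example; they are not taken from the course materials.

```r
#' Fit ordinary least squares via the normal equations
#'
#' Hypothetical helper for illustration; not part of the course materials.
#'
#' @param X Numeric design matrix (include a column of 1s for an intercept).
#' @param y Numeric response vector with one entry per row of `X`.
#' @return A named numeric vector of estimated coefficients.
#' @examples
#' X <- cbind(intercept = 1, wt = mtcars$wt)
#' ols_coef(X, mtcars$mpg)
#' @export
ols_coef <- function(X, y) {
  # Basic input checks so that bad inputs fail early with a clear error
  stopifnot(is.matrix(X), is.numeric(y), nrow(X) == length(y))
  # Solve (X'X) b = X'y directly rather than forming an explicit inverse
  beta <- solve(crossprod(X), crossprod(X, y))
  # Return a plain named vector instead of a one-column matrix
  drop(beta)
}

# Usage: regress mpg on wt using the built-in mtcars data
X <- cbind(intercept = 1, wt = mtcars$wt)
fit <- ols_coef(X, mtcars$mpg)
```

The estimates should agree with `coef(lm(mpg ~ wt, data = mtcars))` up to numerical tolerance, which is a convenient check when writing a function like this inside a package.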

Week 5: Wrap-Up and Midterm Exam

Unit 2

Weeks 6 & 7: Multiple Regression

  • Matrix Algebra

Weeks 7 & 8: Gradient Descent

  • Iteration & Convergence
  • Catching Errors
  • Memory
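
The iteration, convergence, and error-catching topics above could be sketched as follows. This is one possible illustration, assuming a least-squares objective and a fixed step size; the function name `gd_ols`, its defaults, and the simulated data are all hypothetical.

```r
# Gradient descent for least squares: minimize sum((y - X b)^2) / (2 n).
# Hypothetical sketch; the step size and tolerance are illustrative defaults.
gd_ols <- function(X, y, step = 0.1, tol = 1e-8, max_iter = 10000) {
  beta <- rep(0, ncol(X))
  for (i in seq_len(max_iter)) {
    # Gradient of the objective at the current estimate
    grad <- drop(crossprod(X, X %*% beta - y)) / nrow(X)
    beta_new <- beta - step * grad
    # Signal an error if the iterates blow up (for example, step too large)
    if (any(!is.finite(beta_new))) stop("diverged; try a smaller step")
    # Convergence check: stop once the update is negligibly small
    if (max(abs(beta_new - beta)) < tol) {
      return(list(coef = beta_new, iterations = i, converged = TRUE))
    }
    beta <- beta_new
  }
  warning("max_iter reached without convergence")
  list(coef = beta, iterations = max_iter, converged = FALSE)
}

# Simulated data (made up for this example)
set.seed(431)
x <- rnorm(200)
X <- cbind(1, x)
y <- 2 + 3 * x + rnorm(200, sd = 0.5)

# tryCatch() turns a divergence error into a message and a NULL result
fit <- tryCatch(
  gd_ols(X, y),
  error = function(e) {
    message("Gradient descent failed: ", conditionMessage(e))
    NULL
  }
)
```

With a well-scaled predictor like this one, the result should be close to `coef(lm(y ~ x))`. A much larger step, such as `step = 10`, is expected to trigger the divergence error instead.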

Weeks 9 & 10: Monte Carlo, Wrap-Up, Miscellaneous Project Topics