/LinearModel-RViz-Summer2019

Primary LanguageHTMLGNU General Public License v3.0GPL-3.0

Linear Model R Visualization Package

Summer Research, 2019

Students: Jack Langston

Faculty advisor: Hunter Glanz

Objective

The objective of this summer research is to create an R package that allows user to visualize "all" visualizable linear models, with ease.

Deliverables

A GitHub repository which contains the following:

  1. An R package for the linear model visualization functions.

  2. A log of hours spent on the summer research by each student, which includes date, hours, and activity summary.

  3. A presentation to the Statistics Department.

  4. A presentation at the CSM annual research conference.

  5. A manuscript to submit to the R Journal detailing the work and submit an abstract to the RStudio Conference.

Specific Aims

1. Utilize GitHub to collaborate on project materials and updates.

2. Adhere to good programming practices.

3. Create an R package that contains visualization functions. At a minimum, this should be downloadable through devtools; as time allows, consider putting it on CRAN.

4. Provide documentation for the R package.

5. Review existing R packages and functionality for linear model visualization.

  • geom_smooth() in ggplot2?

  • How do you plot a regression line in plotly?

  • Many approaches out there just plot predictions of fitted regression model. Find and review these!

6. Create functions (or identify best existing ones) for items that are not currently easy to achieve in R. (Make sure that these cannot be accomplished in the existing R packages).

  • Simple linear regression line

  • Regression model with one categorical variable and one quantitative variable (parallel lines model)

  • Regression model with one categorical variable, one quantitative variable, and the interaction term (different intercepts and slopes)

  • Regression model with one quantitative variable and the interaction with a categorical variable (different slopes, but common intercept)

  • Can these all accommodate polynomial models that include higher order terms of the one quantitative variable?