Project The purpose of the data project is for you to conduct a reproducible analysis with a data set of your choosing. There are two components to the project, the proposal, which will be graded on a pass/fail basis, and the final report. The outline for each of these are provided in the templates. When submitting the assignments, include the R Markdown file (change the name to include your last name, for example Bryer-Proposal.Rmd and Bryer-Project.Rmd) along with any supplementary files necessary to run the R Markdown file (e.g. data files, screenshots, etc.). Suggestions for possible data sources are included below, however you are free to use data not listed below. The only requirement is that you are allowed to share the data. Projects will be shared with others on this website so should be presented in a way that other students can reproduce your analysis.
Checklist / Suggested Outline Abstract (no more than 300 words) Overview slide Context on the data collection Description of the dependent variable (what is being measured) Description of the independent variable (what is being measured; include at least 2 variables) Research question Summary statistics Include appropriate data visualizations. Statistical output Include the appropriate statistics for your method used. For null hypothesis tests (e.g. t-test, chi-squared, ANOVA, etc.), state the null and alternative hypotheses along with relevant statistic and p-value (and confidence interval if appropriate). For regression models, include the regression output and interpret the R-squared value. Conclusion Why is this analysis important? Limitations of the analysis?
Feedback from final project:
- Add other variables and conduct a multiple linear regression instead