The goals of this workshop are to inform MD/PhD Students about how informatics-based approaches can inform their work. We will divide the workshop into multiple parts.
We are curious about how sleep affects cardiovascular disease risk. How can we use open and accessible data as preliminary data for a research grant to help us answer this question? What steps do we need to take?
Sleep Heart Health Study data from https://sleepdata.org
- What data is out there? (20-30 min)
- Problem Formulation (15 min)
- Specifying predictive models using knowledge for a research question (30 min)
- Mapping our model into available public data (15 min)
- Break (15 minutes)
- Assessing association of identified covariates with outcome (30 min)
- Building the model using logistic regression (1.5 hrs)
- Communicating our results to others (30 min)
There are some restrictions about using the sleep study data. Ideally, we would use RStudio.cloud to simplify installations. However, existing data restrictions prevent us from doing this.
- Installation of R/RStudio on personal computer
- Installation of project/data using
usethis::use_project()
- Signed Data Use Agreement with http://sleepdata.org