/learning-datascience

Data science fellow programme run by UCLH. This repository contains documentation and training materials.

Primary LanguageHTML

UCLH Data Science Fellows training August 2021

in development

This repository contains the training materials for a two day course introducing R for the manipulation and visualisation of patient data. It assumes no knowledge of R, but is structured so that those know some R can also gain insights.

The main aim of the course is to develop the capacity of clinicians to use R and other modern data tools to manipulate hospital generated data.

The course uses patient data from UCLH related to the covid pandemic.

In the longer term the aim is to help clinicians develop modern, reproducible data workflows – ‘good enough’ scientific computing - to make their work more efficient. We will be working to improve the interfaces between these practices and the hospital data systems.

Learning objectives for the course

  1. Become familiar with R and RStudio
  2. Troubleshoot inevitable issues and find solutions with Google
  3. Be able to read data into R
  4. Learn about good data practice
  5. Be able to work with R objects, particularly vectors and dataframes
  6. Understand what are R functions and packages
  7. Know how to use RStudio projects
  8. Manipulate data with dplyr
  9. Visualise data with ggplot2
  10. Brief exposure to rmarkdown for reproducible reports and shiny for web applications
  11. Get a feeling for the potential of R and motivation to learn more

See the Instructions folder for more information.

Licensing

This course is licensed under a Creative Commons share-alike licence. Creative Commons

The course has been adapted from the ClinicianCoders and Data Science for Doctors courses.