/Covid19PredictionStudies

Development and validation OHDSI network studies for the covid19 prediction topic

Primary LanguageR

Development and validation of complex and simple patient-level prediction models for predicting various outcomes in COVID patients: a rapid network study to inform the management of COVID-19

Study Status: Results Available

The objective of this study is to develop and validate various patient-level prediction models for COVID-19 patients.

Background

The Corona Virus Disease 2019 (COVID-19), which started in late 2019 as an epidemic in Wuhan, Hubei Province, China, has been classified as a pandemic and a public health emergency of international concern by the World Health Organisation (WHO) in January 2020.

The OHDSI community has initiated a study-athon to attempt to provide evidence to healthcare providers, governments and patients to best aid in the understanding and treatment of the pandemic.

This specific repository contains the work of the Patient-Level Prediction group in aiding this effort.

Patient-Level Prediction Studies

There are various packages contained here:

1) Predicting which patients with signs/diagnosis will require hospitalization

The objective of this study is to inform the triage and early management of patients with diagnosed or suspected COVID-19, by developing and validating a patient-level prediction model to identify adult patients at risk of hospitalization after presenting with flu or flu-like symptoms at a general practitioner (GP), outpatient (OP) or emergency room (ER) visit.

When should this model be used? When a patient first has a diagnosis of symptom of covid-19

Protocol: link

Packages

  • OHDSI model development: link
  • OHDSI full model validation: link
  • Existing model validation: link
  • OHDSI simple model validation: link

Results

2) Predicting which patients sent home after being seen at outpatient for flu or flu-like symptoms end up in hospital 2-30 days later

The objective of this study is to identify the most patients at risk of being hospitalized amongst those who have been sent home after presenting with flu and COVID-19 or symptoms.

When should this model be used? When a patient is about to be sent home after being seen with suspected covid-19

Protocol: link

Packages

  • OHDSI model development: link
  • OHDSI full model validation: link

Results

3) Predicting which patients admitted to hospital for pneumonia will be more severe (e.g., require ventilator or ICU)

The objective of this study is to identify the most high risk patients amongst those who have been admitted to hospital with pneumonia and COVID-19.

When should this model be used? When a patient is first admitted to hospital with suspected covid-19

Protocol: link

Packages

  • OHDSI model development: link
  • OHDSI full model validation: link
  • OHDSI simple model validation: link

Results

4) Predicting survival in patients who recieve intensive care for pneumonia and ARDS

The objective of this study is to identify which critically ill COVID-19 patients are most likely to survive if provided intensive care resources. This would be used within a prioritization scheme for resource.

When should this model be used? When a patient is first admitted to hospital with suspected or known covid-19

Protocol: link

Packages

  • OHDSI simple model validation package: link

Results

Instructions for participation

To run these studies you need the following software installed:

  • [Required] R (version 3.5.0 or higher).
  • [Required] Java (Java can be downloaded from http://www.java.com)

You need to install the latest version of the PatientLevelPrediction R package (version 3.0.15):

install.packages('devtools')
devtools::install_github("OHDSI/FeatureExtraction")
devtools::install_github('ohdsi/PatientLevelPrediction')

You need to install OhdsiSharing to send the results:

devtools::install_github("OHDSI/OhdsiSharing")

To install and run all the validation studies run the code here where you will need to enter the CDM data connection and details, the outputFolder location and sharing settings such as minCellCount.

To submit the results

Once you have sucessfully executed the study you will find a compressed folder in the location specified by '[outputFolder]/allExport.zip'. The study should remove sensitive data but we encourage researchers to also check the contents of this folder.

To send the compressed folder results please message one of the leads (jreps , RossW) and we will give you the privateKeyFileName and userName. You can then run the following R code to share the results:

# If you don't have the R package OhdsiSharing then install it using github (uncomment the line below)
# install_github("ohdsi/OhdsiSharing")

library("OhdsiSharing")
privateKeyFileName <- "message us for this"
userName <- "message us for this"
fileName <- file.path(outputFolder, 'allExport.zip')
sftpUploadFile(privateKeyFileName, userName, fileName)

Result object

After running the study you will get multiple 'validationResult.rds' objects. You can load these objects using the R function readRDS:

result <- readRDS('[location to rds file]/validationResult.rds')

the 'result' object is a list containing the following:

Object Description Edited by minCellCount
result$inputSetting The outcome and cohort ids and the databaseName No
result$executionSummary Information about the R version, PatientLevelPrediction version and execution platform info No
result$model Information about the model (name and type) No
result$analysisRef Used to store a unique reference for the study No
result$covariateSummary A dataframe with summary information about how often the covariates occured for those with and without the outcome Yes
result$performanceEvaluation$evaluationStatistics Performance metrics and sizes No
result$performanceEvaluation$thresholdSummary Operating characteristcs @ 100 thresholds No
result$performanceEvaluation$demographicSummary Calibration per age group Yes
result$performanceEvaluation$calibrationSummary Calibration at risk score deciles No
result$performanceEvaluation$predictionDistribution Distribution of risk score for those with and without the outcome No

When packaging the results all cell counts (that could contain sensitive data) less than minCellCount are replaced by -1.