Additive pre-diagnostic and diagnostic value of routine blood-based biomarkers in the detection of colorectal cancer in the UK Biobank cohort
by Gizem Tanriver, Ece Kocagoncu
This repository is the official implementation of the paper, published in Scientific Reports in January 2023.
All source code used to generate the results and figures in the paper can be found in the respective folders.
preprocessing
folder contains codes used to preprocess and featurise the raw datasetstats
folder contains codes used for statistics on the baseline datacox
folder contains the codes for cox regression modelgpboost
folder contains the codes for gpboost model with feature selection using RFE
The models were run inside Jupyter notebooks.
- Python 3.7
- Lifelines 0.27.1
- GPBoost 0.7.9
- PDPbox 0.2.1
Approval for the study and permission to access the data was granted by the UK Biobank Resource. UK Biobank is an open access resource and bona fide researchers can access the UK Biobank dataset by registering and applying at https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access. This research has been conducted using the UK Biobank Resource under application number 87991 for the project titled ‘Validation of an AI-powered online search strategy for finding optimal biomarker combinations’.
Distributed under GNU General Public License v3.0. See LICENSE for more information.
If this work is helpful, please cite as:
@article{Tanriver2022.11.10.22282166,
author = {Tanriver, Gizem and Kocagoncu, Ece},
title = {Additive pre-diagnostic and diagnostic value of routine bloodbased biomarkers in the detection of colorectal cancer in the UK Biobank cohort},
elocation-id = {2022.11.10.22282166},
year = {2022},
doi = {10.1101/2022.11.10.22282166},
publisher = {Cold Spring Harbor Laboratory Press},
URL = {https://www.medrxiv.org/content/early/2022/11/11/2022.11.10.22282166},
journal = {medRxiv}
}