
DNA methylation-based classification of central nervous system tumours


Collection of scripts used to train and validate the classifier presented in "DNA methylation-based classification of central nervous system tumours".

classifier training and cross-validation

Reads the raw data and performs normalization, basic filtering, and batch effect adjustment. The normalized and filtered data are stored in ./results.


Trains the Random Forest classifier and stores the final classifier in ./results.


Performs nested cross-validation and stores the results in ./CV.
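The fold structure behind a nested cross-validation can be sketched as follows. This is a minimal, illustrative Python sketch and not the repository's own code: the function names `make_folds` and `nested_cv_splits`, the 3x3 fold layout, the sample count, and the seed handling are all assumptions, and the folds here are plain random splits without stratification by tumour class. Each outer fold holds out a test set, and the inner folds are drawn only from the remaining outer training samples.

```python
import random


def make_folds(indices, k, seed):
    """Shuffle the given sample indices and split them into k nearly equal folds."""
    rng = random.Random(seed)
    shuffled = list(indices)
    rng.shuffle(shuffled)
    return [shuffled[i::k] for i in range(k)]


def nested_cv_splits(n_samples, outer_k=3, inner_k=3, seed=0):
    """Yield (outer_id, inner_id, train, test) tuples of sample indices.

    inner_id is None for the outer split itself; inner splits are built
    only from the training samples of their outer fold.
    """
    outer_folds = make_folds(range(n_samples), outer_k, seed)
    for i, outer_test in enumerate(outer_folds):
        outer_train = [s for j, fold in enumerate(outer_folds) if j != i for s in fold]
        yield i, None, sorted(outer_train), sorted(outer_test)

        # Hypothetical choice: a different seed per outer fold for the inner shuffle.
        inner_folds = make_folds(outer_train, inner_k, seed + i + 1)
        for m, inner_test in enumerate(inner_folds):
            inner_train = [s for n, fold in enumerate(inner_folds) if n != m for s in fold]
            yield i, m, sorted(inner_train), sorted(inner_test)


# Toy example with 18 samples (an arbitrary number chosen for illustration).
splits = list(nested_cv_splits(n_samples=18))
for outer_id, inner_id, train, test in splits:
    label = "outer" if inner_id is None else f"inner {inner_id}"
    print(f"outer fold {outer_id}, {label}: {len(train)} train / {len(test)} test")
```

Because the inner splits never touch the outer test samples, scores produced in the inner loop can be used to fit a downstream model that is then evaluated on the outer test set without leakage.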


Evaluates the cross-validation results, fits the calibration model (stored in ./results), and compiles a final report of classifier performance metrics, CVresults.html.

tumor purity estimation

For the training of a Random Forest regression model that predicts ABSOLUTE purity of TCGA brain tumor samples, see purity.html or purity.Rmd.