/sex_classifier

Training and validating a logistic regression classification model to predict acute myeloid leukemia patients sex (male vs. female) from gene expression data.

Primary LanguageJupyter Notebook

Description:

Traning and validating logistic regression model to classify the sex (male vs. female) of 1,956 aml patients from gene expression data (2,761 expression arrays) in scikit-learn. The model has 96.01 (+/- 0.08) traning/validation accuracy. The data, code, and supplementary materials can all be found here.

Confusion Matrix: AML_Sex_LR-Classification_Trained_on_44754_ProbSets_from_1956_Subjects-Confusion_Matrix

ROC Curve: AML_Sex_LR-Classification_Trained_on_44754_ProbSets_from_1956_Subjects-ROC_Curve