/FYP-Alzheimer-Harmonisation

This project aims to improve the diagnosis of Alzheimer's disease (AD) and the prediction of Mild Cognitive Impairment (MCI) to AD conversion by comparing the efficiency of four post-hoc harmonisation techniques on a cohort of 1358 participants. We use neuroimaging data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset.

Primary LanguageJupyter NotebookMIT LicenseMIT

Empirical comparison of neuro-imaging dataset harmonisation for Alzheimer's disease prediciton

Status GitHub Issues GitHub Pull Requests License


This project aims to improve the diagnosis of Alzheimer's disease (AD) and the prediction of Mild Cognitive Impairment (MCI) to AD conversion by comparing the efficiency of four post-hoc harmonisation techniques on a cohort of 1358 participants. We use neuroimaging data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset, and build two classifiers using cortical thickness data.

Abstract

Alzheimer’s disease (AD) is a progressive neurodegenerative disease, affecting more than 55 million people worldwide. Multiple studies have shown promising results for the use of neuroimaging data for automatic prediction and diagnosis of AD. However, variations in imaging protocols and scanning devices across different studies and sites can introduce significant noise and bias in the data, making the development of reliable models challenging. Post-hoc dataset harmonisation could be a strategy to reduce variability and improve performance.

In this study, we compare the efficiency of four harmonisation techniques: ComBat, CovBat, ComBat-GAM, and NeuroHarmony, using the performance of two classifiers when trained on data harmonised with these algorithms as a metric. We assess the statistical significance of the results using the McNemar test.

Our findings show that post-hoc harmonisation is a valuable method to increase the sensitivity of AD diagnosis, but its effects are not as significant for MCI to AD conversion.

🏁 Getting Started

To get started, simply run the notebooks.

  • Classifiers_explo.ipynb : Exploration of the dataset, building of test classifiers (!DRAFT)
  • Classifiers.ipynb : Building of draft classifiers only using demographic and genetic data (DRAFT!)
  • Harmo_Compare.impnb : MAIN PROJECT FILE: Comparison of the main harmonisation methods
  • Exploration.ipynb : Data exploration (!DRAFT)

Installing

This project requires multiple python packages to run: sklearn, numpy, pandas, and the harmonisation packages. You will also need to get approval from ADNI (https://adni.loni.usc.edu/) to get access to their datasets. Put all of the .csv files inside a new folder called \data inside the \main folder

🎈 Usage

This code can be used by other researchers to build their own framework to compare post-hoc harmonisation methods. Another potential use is to simply reuse the code to try to reproduce the results.

✍️ Authors

Harmonisation packages

Language Name Link Year
Python NeuroComBat https://github.com/Jfortin1/neuroCombat 2020
Python CovBat https://github.com/andy1764/CovBat_Harmonization 2018
Python ComBat-Gam https://github.com/rpomponio/neuroHarmonize 2021
Python NeuroHarmony https://github.com/garciadias/Neuroharmony 2020
R ComBatFamily https://github.com/andy1764/ComBatFamily 2023

Citations

For the citations, see the project paper.