Supervised Autoencoder

This is the code from : Learning a confidence score and the latent space of a new Supervised Autoencoder for diagnosis and prognosis in clinical metabolomic studies.

In this repository, you will find the code to replicate the statistical study described in the paper.

When using this code, please cite:

Barlaud, M., Guyard, F.: Learning sparse deep neural networks using efficient structured projections on convex constraints for green ai. ICPR 2020 Milan Italy (2020)

and

David Chardin, Cyprien Gille, Thierry Pourcher and Michel Barlaud : Learning a confidence score and the latent space of a new Supervised Autoencoder for diagnosis and prognosis in clinical metabolomic studies. BMC Bioinformatics (2022)

Repository Contents

File/Folder	Description
`script_autoencoder.py`	Main script to train and evaluate the SAE
`script_NN.py`	Script to train and evaluate the neural network classifier
`script_PLSDA_RF_SVM.py`	Script to fit and evaluate classical methods
`datas`	Contains the three databases used in the paper
`functions`	Contains dedicated functions for the three main scripts

Installation

To run this code, you will need :

A version of python, 3.8 or newer. If you are new to using python, we recommend downloading anaconda (here) and using Spyder (available by default from the anaconda navigator) to run the code.
Pytorch.
The following packages, all of which except captum and shap are usually included in the anaconda distribution : numpy, matplotlib, scikit-learn, pandas, shap, captum. To install any package, you can use the anaconda navigator's built-in environment manager.

See requirements.txt for the exact package versions on which this code was developed.

How to use

Everything is ready, you can just run the script you want using for example the 'run code' button of your Spyder IDE. Alternatively, you can run the command python [script_name].py in the Anaconda Prompt from the root of this folder (i.e. where you downloaded and unzipped this repository).

Each script will produce results (statistical metrics, top features...) in a results folder.

You can change the database used, and other parameters, near the start of each script.

CyprienGille/Supervised-Autoencoder

Supervised Autoencoder

Table of Contents

Repository Contents

Installation

How to use