ML4Science Project: Machine Learning replaces Radiative Transfer 🌌

Repository for the second project of the course CS-433 Machine Learning @ EPFL. The team is composed by:

Matteo Calafà (@Teocala)
Giulia Mescolini (@giuliamesc)
Paolo Motta (@paolomotta)

We have worked in the framework of ML4Science projects and collaborated with the LASTRO (Laboratory of Astrophysics) of EPFL Lausanne, under the supervision of Dr. Michele Bianco (@micbia).

The aim of the project is to enhance Machine Learning usage in the study of the radiation behavior in the universe during the Epoch of Reionisation, that is, the period of formation of the first galaxies and stars.

Data Loading

Download the data from here
Store them in a folder called dataset, on the same level of your local reionisation_ML repository

Packages Needed

We have projected our Neural Network with torch, version 1.10.0.

The packages that are needed for the project are:

Core

numpy
matplotlib
torch
sklearn

Utilities

pickle
gc
time

Structure

The code structure is the following:

main.py, the Python script to run
FNN.py, importable script containing the definition of the Fully Connected Neural Network
CNN.py, importable script containing the definition of the Convolutional Neural Network
neigh_generation.py, which preprocesses the input
parameters.py, which contains a list of parameters that you can set here without modifying the main each time (eg.: batch size, number of epochs)
plotting.py, which generates some plots used for accuracy evaluation of the NNs
Weekly Meetings folder, containing our presentations to the tutor with the updates of our work of the week
best_models folder, containing the best and last-epoch trained models for CNN and FNN
Report.pdf is the final 4-pages report delivered

Auto-saving

Our code contains a feature which enables to stop the training and then to restart it from the point on which we interrupted it; thanks to this strategy, the net can be trained on a laptop without being forced to wait until a very long training is completed, and at the same time ensuring a backup in case something goes wrong.

In the first epoch, the status of the net, the losses and the R² score are saved
In the following epochs, the above information are stored 1) for the best model 2) for the last epoch, in order to continue from it the next time we restart the training

All these files are stored in a folder automatically created and called checkpoints.

Instructions for training

To train your net, the following steps need to be taken:

Check that the dataset is locally stored (see Data Loading description)
Set the parameters.py variables
Run neigh_generation.py to generate the neighbors - or download an already generated folder from here, unzip it, name it as cubes and place it on the same level of your main.py
Run main.py to do the training
Run plotting.py to generate and save the loss plot

Some further information:

In case you need to restart the training from where you interrupted, you just need to open parameters.py and switch the "first_run" variable to False. Then you can run main.py again.
All the plots are saved in checkpoints while all the neighbors are saved in cubes. These folders are automatically created and later automatically resetted at the beginning of the following training.

giuliamesc/reionisation_ML