/Air-regression

University-project on Machine Learning in cooperation with ATR Software GmbH

Primary LanguageJupyter Notebook

Air Regression

The goal of the project is to go through the CRISP-DM cycle of a Machine Learning Project. For this project in particular, that means:

  • Analyse the dataset
  • Discover and present patterns that you find
  • Train a regressor on humidity and minimize the mean squared error
  • Use XAI tools to explain your classifier to stakeholders and debug it

The datasets

The dataset is public from the UCI ML repository.

Feature No. Description
0 Date (DD/MM/YYYY)
1 Time (HH.MM.SS)
2 True hourly averaged concentration CO in mg/m^3 (reference analyzer)
3 PT08.S1 (tin oxide) hourly averaged sensor response (nominally CO targeted)
4 True hourly averaged overall Non Metanic HydroCarbons concentration in microg/m^3 (reference analyzer)
5 True hourly averaged Benzene concentration in microg/m^3 (reference analyzer)
6 PT08.S2 (titania) hourly averaged sensor response (nominally NMHC targeted)
7 True hourly averaged NOx concentration in ppb (reference analyzer)
8 PT08.S3 (tungsten oxide) hourly averaged sensor response (nominally NOx targeted)
9 True hourly averaged NO2 concentration in microg/m^3 (reference analyzer)
10 PT08.S4 (tungsten oxide) hourly averaged sensor response (nominally NO2 targeted)
11 PT08.S5 (indium oxide) hourly averaged sensor response (nominally O3 targeted)
12 Temperature in °C
13 Relative Humidity (%)
14 AH Absolute Humidity