/XGBoostLSS

An extension of XGBoost to probabilistic forecasting

Primary LanguagePythonApache License 2.0Apache-2.0

XGBoostLSS - An extension of XGBoost to probabilistic forecasting

We propose a new framework of XGBoost that predicts the entire conditional distribution of a univariate response variable. In particular, XGBoostLSS models all moments of a parametric distribution, i.e., mean, location, scale and shape (LSS), instead of the conditional mean only. Choosing from a wide range of continuous, discrete, and mixed discrete-continuous distribution, modelling and predicting the entire conditional distribution greatly enhances the flexibility of XGBoost, as it allows to create probabilistic forecasts from which prediction intervals and quantiles of interest can be derived.

News

💥 [2021-11-14] XGBoostLSS v0.1.0 is released!

Features

✅ XGBoostLSS supports simultaneous training and updating of all distributional parameters.
✅ Automated hyper-parameter search is done via Optuna.
✅ The output of XGBoostLSS is explained using SHapley Additive exPlanations.
✅ XGBoostLSS is available in Python.

Work in Progress

🚧 Calling XGBoostLSS from R via the reticulate package.
🚧 Function that facilitates the choice of a suitable distribution amongst all of the implemented.

Available Distributions

Currently, XGBoostLSS supports the following distributions. More continuous distributions, as well as discrete, mixed discrete-continuous and zero-inflated distributions are to come soon.

image

Installation

$ pip install git+https://github.com/StatMixedML/XGBoostLSS.git

Quick Start

We refer to the examples section for an example notebook.

Reference Paper

März, Alexander (2019) "XGBoostLSS - An extension of XGBoost to probabilistic forecasting".