feature-selection

This repository contains implementation of permutation importance based feature selection. To run the pipeline for training an evaluation of the framework run:

$ python3 main.py --test_no=1

It needs a file located in ./LIME/data/constituents.csv with the stock set it will build the feature selection on.

Outputs

A metric file found in ./LIME/data/LOOC_metrics_cr_{test_no}.csv that has the following format:

MSE_baseline	MAE_baseline	walk	ticker	MSE_pi	MAE_pi	model	selection_error	removed_FI	removed_error	removed_column
0.00019	0.01085	1.0	FP.PA	0.00027	0.01210	pi		-1.8409e-05	0.00827	Returns_3

with one line for each tuple (walk,ticker,data_type)

Example command

$ python3 main.py 
--data_type pi --start_date 2007-01-01 --end_date 2017-01-01 --no_walks 7
--no_features 1 --train_length 4 --test_length 1
--num_rounds 50 --test_no 11

thefatbandit/xai_statarb_MTP

feature-selection

Outputs

Example command