Install the deps
pip install optuna joblib pyarrow pandas optuna-dashboard
Then you can run the optimization script
python optuna_dataset_optimization.py
Whiel that's running, you can open the dashboard to look at some cool data about the optimization (run this command from this directory!)
optuna-dashboard sqlite:///optuna.sqlite3