dbbert

DB-BERT is a database tuning tool that exploits natural language text as additional input. It extracts recommendations for database parameter settings from tuning-related text via natural language analysis, and it optimizes parameter settings for a given workload and performance metric using reinforcement learning.

Quickstart

  • DB-BERT was tested on an Amazon EC2 p3.2xlarge instance using the Deep Learning AMI (Ubuntu 18.04) Version 43.0.
  • Install Postgres 13.2 and the TPC-H benchmark database.
  • Install required Python packages, including Hugging Face Transformers, Stable-Baselines3, psycopg2, NumPy, PyTorch, and Streamlit.
  • DB-BERT uses configuration files; an example can be found under config/pg_tpch_manydocs.ini (a minimal sketch of its structure follows this list).
  • For a first try, update the database credentials in the [DBMS] section. You may adapt the tuning time and other parameters in the [BENCHMARK] section.
  • Execute src/run/tuning_run.py, passing the configuration file as the first (and only) argument, e.g. (from the main directory): PYTHONPATH='src' python3 src/run/tuning_run.py config/pg_tpch_manydocs.ini
  • DB-BERT will parse the input text (drawn from 100 Web documents), initialize the RL algorithm, and start tuning Postgres for TPC-H.
  • Try config/pg_tpch_onedoc.ini for tuning using a single, TPC-H-specific text document (a forum discussion on TPC-H tuning).
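
The shape of such a configuration file is sketched below. Only the [DBMS] and [BENCHMARK] section names come from the steps above; every key and value shown is a hypothetical placeholder, so consult config/pg_tpch_manydocs.ini for the actual key names.

[DBMS]
; connection credentials -- placeholder values, adapt to your setup
user = postgres
password = secret
database = tpch
host = localhost

[BENCHMARK]
; tuning budget in seconds -- key name is illustrative, not the real format
tuning_time = 600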

GUI

  • To start the GUI, run streamlit run src/run/interface.py from the root directory (see the launch sketch after this list).
  • If accessing DB-BERT on a remote EC2 server, make sure to enable inbound traffic to port 8501.
  • Enter the URL shown in the console into your Web browser to access the interface.
  • You can select settings to read from configuration files in the demo_configs folder.
  • Select a collection of tuning text documents for extraction (e.g., from the demo_docs folder).
  • You may change parameters related to database access, learning, and tuning goals.
  • Click on the Start Tuning button to start the tuning process.
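
As a concrete example for remote use, the command below launches the interface on port 8501; the --server.port flag is standard Streamlit (not DB-BERT-specific) and 8501 is Streamlit's default, so the flag is shown only to make the port explicit. Inbound traffic to that port must be allowed in the EC2 security group, as noted above.

# run from the repository root; 8501 is Streamlit's default port
streamlit run src/run/interface.py --server.port 8501

Streamlit prints both a local and a network URL to the console; on EC2, use the network URL (or the instance's public DNS name with port 8501) in your browser.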

Ongoing Work

  • Switching the language model from BERT to BART (despite the project name).
  • Refined text document pre-processing: extracting the most relevant passage for each parameter.

How to Cite

A video talk introducing the vision behind this project is available online.

@inproceedings{trummer2022dbbert,
  title={DB-BERT: a database tuning tool that "reads the manual"},
  author={Trummer, Immanuel},
  booktitle={SIGMOD},
  year={2022}
}

@article{trummer2021case,
  title={The case for NLP-enhanced database tuning: towards tuning tools that "read the manual"},
  author={Trummer, Immanuel},
  journal={Proceedings of the VLDB Endowment},
  volume={14},
  number={7},
  pages={1159--1165},
  year={2021},
  publisher={VLDB Endowment}
}