These scripts installs PostgreSQL 10, PostGIS 2.4 and Oltpbenchmark on AWS Ubuntu 18.04 or 16.04 instances. It also upload benchmark runs and experiment data to server.
- Download the script
sudo apt-get update
sudo apt-get install bsdtar -y
curl -L0k https://github.com/debjyoti385/workload_scripts/archive/master.zip -o master.zip
bsdtar -xf master.zip -s'|[^/]*/||'
rm master.zip
chmod +x install.sh
- Run Benchmarks: (Use
screen
)
TPC-H Benchmark: Scale Factor 3 GB
screen -S install
sudo ./install.sh --benchmark=tpch --pgdata=/home/ubuntu/data/postgres_data --data=/home/ubuntu/data/tpch_data --sf=3
TPC-DS Benchmark: Scale Factor 5 GB
screen -S install
sudo ./install.sh --benchmark=tpcds --pgdata=/home/ubuntu/data/postgres_data --data=/home/ubuntu/data/tpcds_data --sf=5
Spatial Benchmark:
screen -S install
sudo ./install.sh --benchmark=spatial --pgdata=/home/ubuntu/data/postgres_data --data=/home/ubuntu/data/spatial_data
OSM Benchmark:
screen -S install
sudo ./install.sh --benchmark=osm --pgdata=/home/ubuntu/data/postgres_data --data=/home/ubuntu/data/osm_data --osmdb=los_angeles_county
sudo ./install.sh
-h --help
--benchmark=tpch # Benchmark: tpch (default), tpcds, spatial, osm
--pgdata=/home/ubuntu/data/postgres_data # PostgreSQL Data Directory (default)
--data=/home/ubuntu/data/benchmark_data # Benchmark Data Directory (default)
--sf=1 # Scale Factor in GBs (default 1 GB)
--install=1 # install=1 to install prerequisite
# install=0 only run benchmark skip install
--import=1 # import=1 perform data import
# import=0 for no data import
--osmdb=los_angeles_county # options los_angeles_county, new_york_county, salt_lake_county
# los_angeles_city, new_york_city, salt_lake_city
More Run Examples:
sudo ./install.sh --benchmark=osm --pgdata=/dev/shm/data/postgres_data --data=/dev/data/osm_data --osmdb=los_angeles_county --rerun=500 --install=1 --import=1 --epoch=100
If you are using this tool, please cite the following paper.
@article{10.14778/3503585.3503600,
author = {Paul, Debjyoti and Cao, Jie and Li, Feifei and Srikumar, Vivek},
title = {Database Workload Characterization with Query Plan Encoders},
year = {2021},
issue_date = {December 2021},
publisher = {VLDB Endowment},
volume = {15},
number = {4},
issn = {2150-8097},
url = {https://doi.org/10.14778/3503585.3503600},
doi = {10.14778/3503585.3503600},
journal = {Proc. VLDB Endow.},
month = {dec},
pages = {923–935},
numpages = {13}
}