awslabs/amazon-redshift-utils

Missing COUNT(*) in DDL of CloudDataWarehouseBenchmark/TPC-H

yota-p opened this issue · 0 comments

Issue

The README.md of the Cloud Data Warehouse Benchmark Derived from TPC-H says that ddl.sql executes SELECT COUNT(*) against all tables at the end to show that all data was loaded.

However, ddl.sql does not contain any SELECT COUNT(*) queries (e.g. 10GB/ddl.sql), so users cannot validate the number of rows loaded into the database.

On the other hand, the TPC-DS ddl.sql does include SELECT COUNT(*) queries along with the expected number of rows (e.g. 1TB/ddl.sql).

Proposal

Add SELECT COUNT(*) queries for all tables at the end of ddl.sql.
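
As a rough sketch of what could be appended, assuming ddl.sql creates the eight standard TPC-H tables under their usual names (customer, lineitem, nation, orders, part, partsupp, region, supplier), something like the following would report one row count per table. The table names and the `table_name` / `row_count` column aliases are my assumptions, not taken from the current file, and expected counts are left out because they depend on the scale factor.

```sql
-- Validation sketch: one row count per table after the load finishes.
-- Assumes the standard TPC-H table names; adjust if ddl.sql differs.
select 'customer' as table_name, count(*) as row_count from customer
union all
select 'lineitem', count(*) from lineitem
union all
select 'nation', count(*) from nation
union all
select 'orders', count(*) from orders
union all
select 'part', count(*) from part
union all
select 'partsupp', count(*) from partsupp
union all
select 'region', count(*) from region
union all
select 'supplier', count(*) from supplier
order by table_name;
```

A single UNION ALL query keeps the counts in one labeled result set. Separate `select count(*)` statements with the expected count in a trailing comment, as the TPC-DS file appears to do, would work just as well.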