Missing COUNT(*) in DDL of CloudDataWarehouseBenchmark/TPC-H
yota-p opened this issue · 0 comments
Issue
The README.md of the Cloud Data Warehouse Benchmark Derived from TPC-H says that ddl.sql executes SELECT COUNT(*) against all tables at the end to show that all data was loaded.
However, ddl.sql does not contain any SELECT COUNT(*) queries (e.g. 10GB/ddl.sql), so users cannot validate the number of rows loaded into the database.
On the other hand, the TPC-DS ddl.sql does include SELECT COUNT(*) queries along with the expected number of rows (e.g. 1TB/ddl.sql).
Proposal
Add SELECT COUNT(*) queries against all tables at the end of ddl.sql.
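For illustration, the addition to 10GB/ddl.sql could look roughly like the sketch below. It assumes the tables use the standard TPC-H names (customer, lineitem, nation, orders, part, partsupp, region, supplier) and that the data was generated at scale factor 10. The expected counts in the comments come from the TPC-H specification for that scale factor, not from the repository, so they should be checked against the actual dataset before being committed.

```sql
-- Sketch only: table names are assumed to match the standard TPC-H schema.
-- Expected row counts are the TPC-H specification values for scale factor 10
-- (assumption: the 10GB dataset follows the specification exactly).
select count(*) from customer;  -- 1500000
select count(*) from lineitem;  -- 59986052
select count(*) from nation;    -- 25 (fixed at every scale factor)
select count(*) from orders;    -- 15000000
select count(*) from part;      -- 2000000
select count(*) from partsupp;  -- 8000000
select count(*) from region;    -- 5 (fixed at every scale factor)
select count(*) from supplier;  -- 100000
```

The ddl.sql files for the other dataset sizes would need the same queries with their own expected counts.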