/SparkSummaryCsv

Primary LanguagePythonApache License 2.0Apache-2.0

SparkSummaryCsv

Performs a summarization of performance parameters relative to spark runs (more precisely relative to stages of such jobs).
This repo requires bash and python 2.7 or compatible technologies.

How to use the scripts

1. Execute run.sh and pass the absolute path of the root directory, following Cineca structure*

*Internal knowledge