This repository contains a number of example applications that can be built and run on PNDA. Each application directory contains more detailed information.
- Examples of consuming data from Kafka and populating both HBase and OpenTSDB with simple Scala-based Spark Streaming applications.
- Example of consuming data ingested by Gobblin on a batch basis and producing Parquet datasets, optimized for consumption by Impala.
- Example of a notebook for manipulating network data.
- Application that runs the H2O data science platform on PNDA.
- Example of a package containing multiple application component types, in this case a Spark app and a related Jupyter notebook.