The generator comes from yangjun.wang. I use its API to develop a Samza benchmark, although the methods may not be identical to the original ones. If the final results are good, I will push a commit to Mr. Wang's repo.
AdvClick: creates two streams, Advertisement and AdvClick.
FileToStream: reads a specified file and produces its contents to a stream.
KMeansPoints: creates one stream of two-dimensional points, used as input for the KMeans app.
UniformWordCount: creates one stream of words drawn from a uniform distribution.
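
As a rough illustration of what a generator such as UniformWordCount does, the sketch below draws words uniformly at random and emits them at a fixed interval. It is not the code from this repository: the class name `UniformWordSketch`, the vocabulary, and the method names are hypothetical, and it prints to standard output where a real generator would send each message to a Kafka topic. It also assumes the interval is a pause in milliseconds between messages.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

class UniformWordSketch {
    // Hypothetical vocabulary; the word source of the real generator is not shown here.
    static final String[] VOCABULARY = {"alpha", "beta", "gamma", "delta"};

    // Builds one message of wordsPerMessage words, each picked uniformly at random.
    static String nextMessage(Random rng, int wordsPerMessage) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < wordsPerMessage; i++) {
            if (i > 0) {
                sb.append(' ');
            }
            sb.append(VOCABULARY[rng.nextInt(VOCABULARY.length)]);
        }
        return sb.toString();
    }

    // Generates count messages, pausing intervalMillis between them (assumed meaning of "interval").
    static List<String> generate(long seed, int count, int wordsPerMessage, long intervalMillis) {
        Random rng = new Random(seed);
        List<String> messages = new ArrayList<>();
        for (int i = 0; i < count; i++) {
            messages.add(nextMessage(rng, wordsPerMessage));
            if (intervalMillis > 0) {
                try {
                    Thread.sleep(intervalMillis);
                } catch (InterruptedException e) {
                    // Restore the interrupt flag and stop generating early.
                    Thread.currentThread().interrupt();
                    break;
                }
            }
        }
        return messages;
    }

    public static void main(String[] args) {
        long interval = args.length > 0 ? Long.parseLong(args[0]) : 0L;
        for (String message : generate(42L, 5, 3, interval)) {
            // A real generator would produce this message to a Kafka topic instead.
            System.out.println(message);
        }
    }
}
```

A fixed seed makes the output reproducible between runs, which can be convenient when comparing benchmark results.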
Kafka (see the official website for installation instructions, or install it through hello-samza)
Run the following commands; there is no need to build a tar archive.
mvn clean package
java -cp generator*.jar fi.aalto.dmg.generator.GeneratorClass (interval)