SparkStreaming-Kafka-example: A Scala repository from wy36101299

SparkStreaming-Kafka-example

build scala-archetype-simple

mvn archetype:generate -B
-DarchetypeGroupId=net.alchim31.maven -DarchetypeArtifactId=scala-archetype-simple -DarchetypeVersion=1.5
-DgroupId=com.hpds -DartifactId=SparkStreaming-Kafka-example -Dversion=0.1-SNAPSHOT -Dpackage=com.hpds

package this example

mvn clean package

how to run

start zookeeper cluster

bin/zkServer.sh start

start kafka cluster every node

JMX_PORT=999x bin/kafka-server-start.sh config/server.properties

start spark cluster

sbin/start-all.sh

kafka create a topic

create topic : bin/kafka-topics.sh --create \
               --replication-factor 3 \
               --partition 3 \
               --topic test_topic \
               --zookeeper ip1:2181,ip2:2181,ip3:2181 ...

run the producer

java -cp SparkStreaming-Kafka-example-0.1-SNAPSHOT-jar-with-dependencies.jar com.hpds.ScalaProducerExample 10000 test_topic localhost:9092 (the port is depend on your kafka server.properties)

run the consumer

./spark-submit --class com.hpds.ScalaConsumerExample --master spark://master:7077 SparkStreaming-Kafka-example-0.1-SNAPSHOT-jar-with-dependencies.jar localhost:2181 test_topic test_topic 1

reference

scala-archetype-simple
kafka-example-in-scala
official example-SparkStreaming kafkaWordCount

wy36101299/SparkStreaming-Kafka-example

build scala-archetype-simple

package this example

how to run

start zookeeper cluster

start kafka cluster every node

start spark cluster

kafka create a topic

run the producer

run the consumer

reference