/twitter-spark-processor

it consumes message from multiple kafka topic, parses the required attribute and serialize it into AVRO format and stores it into local filesystem.

Primary LanguageJava

twitter-spark-processor

Twitter Spark Processor Application cosnumes messages from multiple kafka topic and thier partition
It serailizes and stores only the specific attributes from tweet JSON, serialzes and writes it to AVRO.

When Shutdown Hook is called, It gracefully shuts downs the SparkSession and flushes the Avro to a file in local system.