Pinned Repositories
cuesheet
A framework for writing Spark 2.x applications in a pretty way
hello-world
First repository
kafka-spark-consumer
High Performance Kafka Consumer for Spark Streaming. Compatible with every Spark and Kafka versions including latest Spark 2.2.0 and Kafka 0.11.0. Now supports Kafka Security. Offset management in Zookeeper. Reliable No-Dataloss gurantee. No dependency on HDFS or Checkpointing and WAL. In-built PID rate controller. Support Message Interceptor . Offset Lag checker.
kafka-spark-consumer_TEST
kafka-spark-streaming-example
Simple examle for Spark Streaming over Kafka topic
parquet-examples
Example programs and scripts for accessing parquet files
Spark
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .
spark-streaming-with-kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
spark-streamingsql
Manipulate Spark-streaming by SQL
MusicPlaylist
hokmanto's Repositories
hokmanto/cuesheet
A framework for writing Spark 2.x applications in a pretty way
hokmanto/hello-world
First repository
hokmanto/kafka-spark-consumer
High Performance Kafka Consumer for Spark Streaming. Compatible with every Spark and Kafka versions including latest Spark 2.2.0 and Kafka 0.11.0. Now supports Kafka Security. Offset management in Zookeeper. Reliable No-Dataloss gurantee. No dependency on HDFS or Checkpointing and WAL. In-built PID rate controller. Support Message Interceptor . Offset Lag checker.
hokmanto/kafka-spark-consumer_TEST
hokmanto/kafka-spark-streaming-example
Simple examle for Spark Streaming over Kafka topic
hokmanto/parquet-examples
Example programs and scripts for accessing parquet files
hokmanto/Spark
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .
hokmanto/spark-streaming-with-kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
hokmanto/spark-streamingsql
Manipulate Spark-streaming by SQL