Kafka->HDFS pipeline from LInkedIn. It is a mapreduce job that does distributed data loads out of Kafka.
Primary LanguageJava
No one’s watching this repository yet.