Kafka->HDFS pipeline from LinkedIn. It is a MapReduce job that performs distributed data loads out of Kafka.
Primary Language: Java