Pinned Repositories
Beetest
A super simple utility for testing Apache Hive scripts locally for non-Java developers.
CoAnSys
COntent ANalysis SYStem is a framework for mining scientific publications using Apache Hadoop.
CombineWholeFileInputFormat
hadoop-apps-cdh-sources
Exemplary applications built with CDH and Maven
hadoop-common
Mirror of Apache Hadoop common
HEEUT
Hadoop Ecosystem Exemplary Unit Tests
Pigitos
Pigitos is a set of tiny, but highly useful UDFs for Apache Pig.
RichImportTsv
Enhanced version of ImportTsv.
zlatanitor
kawaa's Repositories
kawaa/Beetest
A super simple utility for testing Apache Hive scripts locally for non-Java developers.
kawaa/Pigitos
Pigitos is a set of tiny, but highly useful UDFs for Apache Pig.
kawaa/RichImportTsv
Enhanced version of ImportTsv.
kawaa/zlatanitor
kawaa/CoAnSys
COntent ANalysis SYStem is a framework for mining scientific publications using Apache Hadoop.
kawaa/HEEUT
Hadoop Ecosystem Exemplary Unit Tests
kawaa/CombineWholeFileInputFormat
kawaa/hadoop-apps-cdh-sources
Exemplary applications built with CDH and Maven
kawaa/hadoop-common
Mirror of Apache Hadoop common
kawaa/hdfs-file-slurper
Utility to easily copy files into HDFS
kawaa/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
kawaa/MapReduceExamples
kawaa/snakebite
A pure Python HDFS client
kawaa/SparkHCat
kawaa/tez-autobuild
A Tez dev-setup for HDP2 sandbox
kawaa/workflow-tests
Workflow Tests