Pinned Repositories
avro-maven
A simple example of how to use the Avro Maven plugin to generate Avro sources.
avro-sorting
Examples of built-in and customizable sorting in Avro and Hadoop.
hadoop-book
Source code to accompany the book "Hadoop in Practice", published by Manning.
hadoop-utils
A set of Hadoop utilities to make working with Hadoop a little easier.
hdfs-file-slurper
Utility to easily copy files into HDFS
hiped2
Source code that accompanies the book "Hadoop in Practice, Second Edition".
hsync
HDFS rsync-like utility to replicate data between HDFS clusters
htuple
A library to simplify compound field partitioning, sorting and grouping in MapReduce.
json-mapreduce
InputFormat that can split multi-line JSON
vagrant-hadoop-spark-hive
Vagrant project to spin up a single virtual machine running current versions of Hadoop, Hive and Spark
alexholmes's Repositories
alexholmes/hadoop-book
Source code to accompany the book "Hadoop in Practice", published by Manning.
alexholmes/hiped2
Source code that accompanies the book "Hadoop in Practice, Second Edition".
alexholmes/vagrant-hadoop-spark-hive
Vagrant project to spin up a single virtual machine running current versions of Hadoop, Hive and Spark
alexholmes/hdfs-file-slurper
Utility to easily copy files into HDFS
alexholmes/json-mapreduce
InputFormat that can split multi-line JSON
alexholmes/avro-maven
A simple example of how to use the Avro Maven plugin to generate Avro sources.
alexholmes/hadoop-utils
A set of Hadoop utilities to make working with Hadoop a little easier.
alexholmes/hsync
HDFS rsync-like utility to replicate data between HDFS clusters
alexholmes/htuple
A library to simplify compound field partitioning, sorting and grouping in MapReduce.
alexholmes/avro-sorting
Examples of built-in and customizable sorting in Avro and Hadoop.
alexholmes/blog
alexholmes/filecrush
Remedy small files by combining them into larger ones.
alexholmes/hadoop-book-mvn-repo
alexholmes/java-external-sort
sort large files in Java
alexholmes/props4j
Use Java Annotations to load properties into your beans
alexholmes/redline
Pure Java Rpm Library
alexholmes/storm-trending-words
Quick and dirty trending words example on Storm.
alexholmes/camus
alexholmes/hdfscompact
A HDFS file compacter.
alexholmes/mleap
MLeap: Deploy ML Pipelines to Production
alexholmes/spark
Apache Spark - A unified analytics engine for large-scale data processing