Pinned Repositories
bash-emr
Simple bash functions for manipulating Amazon Elastic MapReduce clusters
bigdata
cascading
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on a Hadoop cluster. Please see https://github.com/cwensel/cascading for access to all WIP branches.
DataGenerator
DataGenerator is a Java library for systematically producing large volumes of data. DataGenerator frames data production as a modeling problem, with a user providing a model of dependencies among variables and the library traversing the model to produce relevant data sets.
HivePartitionCopier
Utility for copying Hive Partitions via direct connection to the Metastore Database.
Impatient
source examples to support the "Cascading for the Impatient" blog post series
IntelSpringSocial
An Implementation of the Intel Cloud Services APIs in Spring Social
SciHadoop
Integration code to enable Hadoop processing of data in NetCDF format
Typical-Spring-Social-Security-With-Roo
The goal of this project is to demonstrate a consolidated implementation of Spring-Security combined Spring-Social (facebook & twitter) using Spring Roo as the scaffolding for the local user and connection store
winsparkle
WinSparkle is Windows version of the venerable Sparkle software update framework used by many Mac OS X apps.
stmcpherson's Repositories
stmcpherson/Typical-Spring-Social-Security-With-Roo
The goal of this project is to demonstrate a consolidated implementation of Spring-Security combined Spring-Social (facebook & twitter) using Spring Roo as the scaffolding for the local user and connection store
stmcpherson/bash-emr
Simple bash functions for manipulating Amazon Elastic MapReduce clusters
stmcpherson/HivePartitionCopier
Utility for copying Hive Partitions via direct connection to the Metastore Database.
stmcpherson/winsparkle
WinSparkle is Windows version of the venerable Sparkle software update framework used by many Mac OS X apps.
stmcpherson/bigdata
stmcpherson/cascading
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on a Hadoop cluster. Please see https://github.com/cwensel/cascading for access to all WIP branches.
stmcpherson/DataGenerator
DataGenerator is a Java library for systematically producing large volumes of data. DataGenerator frames data production as a modeling problem, with a user providing a model of dependencies among variables and the library traversing the model to produce relevant data sets.
stmcpherson/Impatient
source examples to support the "Cascading for the Impatient" blog post series
stmcpherson/IntelSpringSocial
An Implementation of the Intel Cloud Services APIs in Spring Social
stmcpherson/SciHadoop
Integration code to enable Hadoop processing of data in NetCDF format
stmcpherson/spark
Scala framework for iterative and interactive cluster computing.
stmcpherson/tutorials
Tutorials for Cascading, Lingual, Pattern and other projects
stmcpherson/usergrid-stack
Turnkey Platform Stack For Mobile & Rich Client Applications