Pinned Repositories
fire
Financial Regulation (FIRE) Data Standard
fire-spark
Mapping fire datamodel to spark execution
hadoop-hive
Some hive stuff (Spring)
hadoop-mapreduce
MapReduce stuff
hadoop-primitive-clustering
Hadoop implementation of Canopy Clustering using Levenshtein distance
lastfm-mapreduce
ml-registry
Enabling continuous delivery and improvement of Spark pipeline models through devops methodology and ML governance
pathogen
The rooster crows immediately before sunrise, the rooster causes the sun to rise
spark-gdelt
Binding the GDELT universe in a Spark environment
texata-r2-2017
This project has been created in a 4h time for the purpose of the Texata Big Data world championship.
aamend's Repositories
aamend/hadoop-mapreduce
MapReduce stuff
aamend/hadoop-primitive-clustering
Hadoop implementation of Canopy Clustering using Levenshtein distance
aamend/hadoop-hive
Some hive stuff (Spring)
aamend/lastfm-mapreduce
aamend/Mastering-Spark-for-Data-Science
Mastering Spark for Data Science, published by Packt
aamend/algoritm-dsa
Some algorithm stuff
aamend/canopy-clustering
Automatically exported from code.google.com/p/canopy-clustering
aamend/elasticsearch-utils
aamend/hadoop-recordreader
Custom RecordReader to allow custom delimiter (used for legacy 1.2.1 version)
aamend/jlibsvm
Efficient training of Support Vector Machines in Java
aamend/texata-r1-2017
Texata 2017
aamend/vagrant
Vagrant scripts to spin up hadoop instance