Pinned Repositories
ansible-vagrant-dse-spark
Run Cassandra Datastax Enterprise 4.6 with OpsCenter 5.1 and Spark 1.2
geomesa-docker
Set of docker images for running geomesa
learning-hunk
mapreduce-testing
stuff for testing mapreduce locally on Java
seregasheypak's Repositories
seregasheypak/ansible-vagrant-dse-spark
Run Cassandra Datastax Enterprise 4.6 with OpsCenter 5.1 and Spark 1.2
seregasheypak/geomesa-docker
Set of docker images for running geomesa
seregasheypak/airflow
Airflow is a system to programmatically author, schedule and monitor data pipelines.
seregasheypak/akka-persistence-jdbc
Asynchronously writes journal and snapshot entries to configured JDBC databases so that Akka Actors can recover state
seregasheypak/ansible-marathon
Ansible Marathon Playbook
seregasheypak/ansible-mesos
Mesos Playbook for Ansible
seregasheypak/ansible-zookeeper
Ansible playbook for ZooKeeper
seregasheypak/blockchain
seregasheypak/bottledwater-pg
Change data capture from PostgreSQL into Kafka
seregasheypak/databricks-rest-client
seregasheypak/docker-geoserver-geomesa
seregasheypak/docker-osm
A docker compose project to setup an OSM PostGIS database with automatic updates from OSM periodically
seregasheypak/drake
Data workflow tool, like a "Make for data"
seregasheypak/geomesa-cloudera
seregasheypak/hadoop-mini-clusters
hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE
seregasheypak/intellij-zeppelin
Edit code in IntelliJ, eval/run in Zeppelin notebook
seregasheypak/j-text-utils
Automatically exported from code.google.com/p/j-text-utils
seregasheypak/java-ascii-table
Automatically exported from code.google.com/p/java-ascii-table
seregasheypak/mesos-school
Experiments with Mesos
seregasheypak/metrics-influxdb
A reporter for metrics which announces measurements to an InfluxDB server.
seregasheypak/music-api
Simple Rest API to get information about an specific music artist<
seregasheypak/ninja-phoenix-hbase
seregasheypak/pig
Mirror of Apache Pig
seregasheypak/pipeline
End-to-End, Real-time, Advanced Analytics Big Data Reference Pipeline using Spark, Spark SQL, Spark ML, GraphX, Spark Streaming, Kafka, Cassandra, ElasticSearch, Redis, Tachyon, HDFS, Zeppelin, Spark-Notebook, iPython/Jupyter Notebook, Tableau. See https://github.com/fluxcapacitor/pipeline/wiki for Setup Instructions.
seregasheypak/play-swagger
Swagger spec generator for play framework
seregasheypak/sampleproject
A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"
seregasheypak/scoozie
Scala DSL on top of Oozie XML
seregasheypak/spark
Apache Spark
seregasheypak/testcontainers-java
Testcontainers is a Java library that supports JUnit tests, providing lightweight, throwaway instances of common databases, Selenium web browsers, or anything else that can run in a Docker container.
seregasheypak/vaex
Out-of-Core DataFrames for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀