Pinned Repositories
scala-best-practices
A collection of Scala best practices
spark
Apache Spark enhanced with native Kubernetes scheduler back-end. NOTE: this repository is being ARCHIVED, as all new development for the Kubernetes scheduler back-end is now at https://github.com/apache/spark/
presto
The official home of the Presto distributed SQL query engine for big data
awesome-algorithms
A curated list of awesome places to learn and/or practice algorithms.
ipython-notebooks
Collection of IPython Notebooks
K8s-presto
Presto on Kubernetes
Twitter-LDA-
Topic generation on Twitter user links
satybald's Repositories
satybald/ipython-notebooks
Collection of IPython Notebooks
satybald/K8s-presto
Presto on Kubernetes
satybald/auto-complete
Auto-complete
satybald/beam
Apache Beam is a unified programming model for Batch and Streaming
satybald/calc_es_lda
Calculate the LDA taking data from Elasticsearch
satybald/calcite-examples
Apache Calcite experiments
satybald/ClickHouse
ClickHouse is a free analytic DBMS for big data.
satybald/druid-io.github.io
Druid Project Website
satybald/flink
Apache Flink
satybald/fluence
Fluence is a decentralized database that securely stores structured data
satybald/game-reco-engine
Simple game recommendation engine
satybald/incubator-druid
Apache Druid (Incubating) - Column-oriented distributed data store ideal for powering interactive applications
satybald/incubator-openwhisk
Apache OpenWhisk is a serverless event-based programming service and an Apache Incubator project.
satybald/kafka
Mirror of Apache Kafka
satybald/ksql
KSQL - the Streaming SQL Engine for Apache Kafka
satybald/librdkafka
The Apache Kafka C/C++ library
satybald/linkerd
Resilient service mesh for cloud native apps
satybald/nakadi
A distributed event bus that implements a RESTful API abstraction instead of Kafka-like queues
satybald/new-hope
Training camp for doing Clojure exercises
satybald/presto
Distributed SQL query engine for big data
satybald/presto-server
Docker Image for Presto Server
satybald/pytorch_tutorial
satybald/raspi-spark-streaming-mqtt
A Raspberry Pi sends temperature events to a server, where Spark Streaming processes the data
satybald/recsys-101-workshop
A Recommender Systems interactive workshop
satybald/satybald.github.io
Personal Blog Site
satybald/scala-best-practices
A collection of Scala best practices
satybald/scheduler
satybald/spark
Apache Spark enhanced with native Kubernetes scheduler back-end
satybald/sparklens
Qubole Sparklens tool for performance tuning Apache Spark
satybald/zmon-worker
ZMON Python Worker