Pinned Repositories
airpal
Web UI for PrestoDB.
akka-http-cache
Implementing akka-http-cache
alunarbeach.github.io
My Page
amazonaccess
Amazon Employee Access Challenge
bdutil
bootstrap-tag-cloud
A simple Twitter Bootstrap style tag cloud generator. Initial version was written by Collective Push.com
hudi
Upserts, Deletes And Incremental Processing on Big Data.
spark
Apache Spark - A unified analytics engine for large-scale data processing
spark-avro
Avro Data Source for Apache Spark
streamx
kafka-connect-s3: Ingest data from Kafka to object stores (S3)
alunarbeach's Repositories
alunarbeach/airpal
Web UI for PrestoDB.
alunarbeach/akka-http-cache
Implementing akka-http-cache
alunarbeach/alunarbeach.github.io
My Page
alunarbeach/bdutil
alunarbeach/cockroach
A Scalable, Geo-Replicated, Transactional Datastore
alunarbeach/creative-scala-template
Template for those following Creative Scala
alunarbeach/datacatalog-tag-manager
Python package to manage Google Cloud Data Catalog tags, loading metadata from external sources -- currently supports the CSV file format
alunarbeach/dataproc-initialization-actions
Run on all nodes of your cluster before the cluster starts - lets you customize your cluster
alunarbeach/desk
A lightweight workspace manager for the shell
alunarbeach/docker-images
Dockerfiles for Confluent Stream Data Platform
alunarbeach/duckdb_gsheets
DuckDB extension to read and write Google Sheets using SQL
alunarbeach/example-spark
Spark, Spark Streaming and Spark SQL unit testing strategies
alunarbeach/friendly-cicd-helper
alunarbeach/hoodie
Spark Library for Hadoop Upserts And Incrementals
alunarbeach/kafka-connect-hdfs
Kafka Connect HDFS connector
alunarbeach/kafka-exactly-once
alunarbeach/MiddlewareMagicDemos
Middleware Technology Related Demos
alunarbeach/mongodb-hadoop-workshop
MongoDB-Hadoop Workshop Exercises
alunarbeach/practice-project
This is a practice project that people can use to practice using GitHub and get comfortable with the mechanics of contributing
alunarbeach/proposal-template
A product/engineering proposal template that I've used at multiple companies
alunarbeach/python-confluent-schemaregistry
A client for the Confluent Schema Registry API implemented in Python
alunarbeach/schema-registry
Schema registry for Kafka
alunarbeach/sentiment-analysis-for-chatbots
Jupyter notebooks for "Sentiment analysis for chatbots" training
alunarbeach/solidity-examples
Example Ethereum Solidity Contracts
alunarbeach/spark
Mirror of Apache Spark
alunarbeach/spark-avro
Avro support for Spark, SQL, and DataFrames
alunarbeach/spark-custom-datasource-example
A sample implementation of the Spark Datasource API
alunarbeach/spark-stateful-example
A full example for my blog post on Spark's stateful streaming (http://asyncified.io/2016/07/31/exploring-stateful-streaming-with-apache-spark/).
alunarbeach/structured-streaming-avro-demo
Spark 2.2 Structured Streaming Kafka Avro Demo
alunarbeach/zeppelin
Zeppelin is a data analytics environment