rangadi

@databricks

Pinned Repositories

armeria
Asynchronous RPC/REST library built on top of Java 8, Netty, HTTP/2, Thrift and gRPC
Language:Java00
beam
Mirror of Apache Beam
Language:Java00
DataflowPythonSDK
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Language:Python10
elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
Language:Java2 3 03
hadoop-lzo
Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
Language:Shell10
hive
Mirror of Apache Hive
Language:Java11
kafka
Mirror of Apache Kafka
Language:Scala10
presto
Distributed SQL query engine for running interactive analytic queries against big data sources.
Language:Java10
shaded-protobuf-classes
A tiny project to create shaded Protobuf Java classes suitable for Spark's Protobuf connector
31
zkclient
a zookeeper client, that makes life a little easier.
Language:Java1 2 00

rangadi's Repositories

rangadi/shaded-protobuf-classes
A tiny project to create shaded Protobuf Java classes suitable for Spark's Protobuf connector
31
rangadi/elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
Language:Java2 3 03
rangadi/DataflowPythonSDK
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Language:Python10
rangadi/hadoop-lzo
Patched, refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20
Language:Shell10
rangadi/hive
Mirror of Apache Hive
Language:Java11
rangadi/kafka
Mirror of Apache Kafka
Language:Scala10
rangadi/presto
Distributed SQL query engine for running interactive analytic queries against big data sources.
Language:Java10
rangadi/zkclient
a zookeeper client, that makes life a little easier.
Language:Java1 2 00
rangadi/armeria
Asynchronous RPC/REST library built on top of Java 8, Netty, HTTP/2, Thrift and gRPC
Language:Java00
rangadi/beam
Mirror of Apache Beam
Language:Java00
rangadi/cascading
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on a Hadoop cluster.
Language:Java00
rangadi/DataflowJavaSDK
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Language:Java2 0
rangadi/dataproc-initialization-actions
Run in all nodes of your cluster before the cluster starts - let's you customize your cluster
Language:Shell
rangadi/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
rangadi/flink-dataflow
Google Dataflow Runner for Apache Flink
Language:Java2 0
rangadi/hadoop-common
Mirror of Apache Hadoop common
Language:Java
rangadi/lzo-split
Language:Java
rangadi/misc
Language:Java
rangadi/modeldb
A system to manage machine learning models
Language:JavaScript
rangadi/scio
A Scala API for Google Cloud Dataflow
Language:Scala
rangadi/scribe
Scribe is a server for aggregating log data streamed in real time from a large number of servers. It is designed to be scalable, extensible without client-side modification, and robust to failure of the network or any specific machine.
Language:C++2 0
rangadi/snakebite
A pure python HDFS client
Language:Python
rangadi/spark
Apache Spark
Language:Scala1
rangadi/summingbird
Streaming MapReduce with Scalding and Storm
Language:Scala
rangadi/trevni
a column file format
Language:Java
rangadi/zookeeper
Mirror of Apache Hadoop ZooKeeper
Language:Java

rangadi

Pinned Repositories

armeria

beam

DataflowPythonSDK

elephant-bird

hadoop-lzo

hive

kafka

presto

shaded-protobuf-classes

zkclient

rangadi's Repositories

rangadi/shaded-protobuf-classes

rangadi/elephant-bird

rangadi/DataflowPythonSDK

rangadi/hadoop-lzo

rangadi/hive

rangadi/kafka

rangadi/presto

rangadi/zkclient

rangadi/armeria

rangadi/beam

rangadi/cascading

rangadi/DataflowJavaSDK

rangadi/dataproc-initialization-actions

rangadi/delta

rangadi/flink-dataflow

rangadi/hadoop-common

rangadi/lzo-split

rangadi/misc

rangadi/modeldb

rangadi/scio

rangadi/scribe

rangadi/snakebite

rangadi/spark

rangadi/summingbird

rangadi/trevni

rangadi/zookeeper