Pinned Repositories
avro
Mirror of Apache Avro
beam
Apache Beam
beam-ordered-processing-example
ExcelFileInputFormat
A FileInputFormat implementation for Excel (xls, xlsx).
hadoop-inputformat-io-beam-example
An example Apache Beam pipeline that reads ORC files from Google Cloud Storage using HadoopInputFormatIO and writes Avro GenericRecord objects using AvroIO
impala-sql-pivot
An example showing how to pivot table rows into columns and vice versa in Impala SQL
kite-multipleoutput
DatasetKeyMultipleOutput - A MultipleOutputs implementation using Kite APIs
MRWordCountKite
oozieloop
Loops in Oozie
spark-unzip
sabhyankar's Repositories
sabhyankar/spark-unzip
sabhyankar/hadoop-inputformat-io-beam-example
An example Apache Beam pipeline that reads ORC files from Google Cloud Storage using HadoopInputFormatIO and writes Avro GenericRecord objects using AvroIO
sabhyankar/oozieloop
Loops in Oozie
sabhyankar/avro
Mirror of Apache Avro
sabhyankar/beam
Apache Beam
sabhyankar/beam-ordered-processing-example
sabhyankar/ExcelFileInputFormat
A FileInputFormat implementation for Excel (xls, xlsx).
sabhyankar/impala-sql-pivot
An example showing how to pivot table rows into columns and vice versa in Impala SQL
sabhyankar/kite-multipleoutput
DatasetKeyMultipleOutput - A MultipleOutputs implementation using Kite APIs
sabhyankar/MRWordCountKite
sabhyankar/bqtop
Visualizing BigQuery query jobs with Cloud Functions, Firebase and Pub/Sub
sabhyankar/DataflowTemplates
Google-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks
sabhyankar/DynamicTemplatesExamples
sabhyankar/hadoop
Mirror of Apache Hadoop
sabhyankar/HiveKudu-Handler
Hive on Kudu
sabhyankar/hoodie
Spark Library for Hadoop Upserts And Incrementals
sabhyankar/kafka
Mirror of Apache Kafka
sabhyankar/kite
Kite SDK
sabhyankar/kudu
Kudu is the engine behind git/hg deployments, WebJobs, and various other features in Azure Web Sites. It can also run outside of Azure.
sabhyankar/kudu-1
Kudu
sabhyankar/oozie
Mirror of Apache Oozie
sabhyankar/professional-services
sabhyankar/scalafy
sabhyankar/spark
Mirror of Apache Spark
sabhyankar/SparkOnHBase
A simple API to interact with HBase from Spark
sabhyankar/SparkOnKudu
Based on the design of SparkOnHBase. This repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.
sabhyankar/sqoop
Mirror of Apache Sqoop
sabhyankar/terraform-google-splunk-enterprise
Terraform templates for Splunk Enterprise on GCP