jeromebanks

Tatari.tv

Pinned Repositories

satisfaction
The Next Generation Hadoop Scheduler
Language:JavaScript7 22 03
ambrose
A platform for visualization and real-time monitoring of data workflows
Language:Java1 2 00
brickhouse
Hive UDF's for the data warehouse
Language:Java8 3 0577
experimental_bigdata-interop
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Language:Java1 2 00
reair
ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.
Language:Java0 2 00
satisfaction
The Next Generation Hadoop Scheduler
Language:Scala1 2 150
sbt-satisfy
SBT Plugin for Satisfaction
Language:Scala10

jeromebanks's Repositories

jeromebanks/brickhouse
Hive UDF's for the data warehouse
Language:Java8 3 0577
jeromebanks/experimental_bigdata-interop
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Language:Java1 2 00
jeromebanks/satisfaction
The Next Generation Hadoop Scheduler
Language:Scala1 2 150
jeromebanks/reair
ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.
Language:Java0 2 00
jeromebanks/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language:Python1 0
jeromebanks/artemis-corpus-test-framework
A test framework for working with test corpora for unit tests.
Language:Java1 0
jeromebanks/aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions
Language:Java1 0
jeromebanks/boilerpipe
Work in progress transmit from Google Code
Language:Java1 0
jeromebanks/Chat-with-Github-Repo
This repository contains two Python scripts that demonstrate how to create a chatbot using Streamlit, OpenAI GPT-3.5-turbo, and Activeloop's Deep Lake.
Language:Python0 0
jeromebanks/classutil
Scala-friendly, fast class-finder library (using ASM under the covers)
Language:Scala2 0
jeromebanks/docker-spark-k8s-aws
Docker image for running Spark 3 on Kubernetes on AWS
1 0
jeromebanks/document-api-python
Create and modify Tableau workbook and datasource files
Language:Python1 0
jeromebanks/experimental_spark-bigquery
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Language:Scala2 0
jeromebanks/experimental_spark-bigquery-1
Google BigQuery support for Spark, SQL, and DataFrames
Language:Scala2 0
jeromebanks/generalized-kmeans-clustering
This project generalizes the Spark MLLIB Batch and Streaming K-Means clusterers in every practical way.
Language:Scala2 0
jeromebanks/incubator-hivemall
Mirror of Apache Hivemall (incubating)
Language:Java2 0
jeromebanks/influxdb-java
Java client for InfluxDB
Language:Java2 0
jeromebanks/js-murmur3-128
A JavaScript implementation of the 128bit variant of Murmur3 (that is compatible with Guava)
Language:JavaScript
jeromebanks/nutch
Apache Nutch
Language:Java
jeromebanks/okhttp
An HTTP+HTTP/2 client for Android and Java applications.
Language:Java2 0
jeromebanks/reactive-kafka
Reactive Streams API for Apache Kafka
Language:Scala2 0
jeromebanks/redshift-auto-schema
Redshift Auto Schema is a Python library that takes a delimited flat file or parquet file as input, parses it, and provides a variety of functions that allow for the creation and validation of tables within Amazon Redshift.
Language:Python1 0
jeromebanks/sbt-google-cloud-storage
A SBT resolver and publisher for Google Cloud Storage
Language:Scala1 0
jeromebanks/scala.rx
An experimental library for Functional Reactive Programming in Scala
Language:Scala2 0
jeromebanks/spark
Language:Scala2 0
jeromebanks/spark-glue
Spark releases with AWS Glue support
Language:Dockerfile1 0
jeromebanks/spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Language:Go1 0
jeromebanks/spark-on-kubernetes-docker
Spark on Kubernetes infrastructure Docker images repo
Language:Shell1 0
jeromebanks/spark-on-kubernetes-helm
Spark on Kubernetes infrastructure Helm charts repo
Language:HTML1 0
jeromebanks/terrapin
Serving system for batch generated data sets
Language:Java2 0

jeromebanks

Pinned Repositories

satisfaction

ambrose

brickhouse

experimental_bigdata-interop

reair

satisfaction

sbt-satisfy

jeromebanks's Repositories

jeromebanks/brickhouse

jeromebanks/experimental_bigdata-interop

jeromebanks/satisfaction

jeromebanks/reair

jeromebanks/airflow

jeromebanks/artemis-corpus-test-framework

jeromebanks/aws-glue-data-catalog-client-for-apache-hive-metastore

jeromebanks/boilerpipe

jeromebanks/Chat-with-Github-Repo

jeromebanks/classutil

jeromebanks/docker-spark-k8s-aws

jeromebanks/document-api-python

jeromebanks/experimental_spark-bigquery

jeromebanks/experimental_spark-bigquery-1

jeromebanks/generalized-kmeans-clustering

jeromebanks/incubator-hivemall

jeromebanks/influxdb-java

jeromebanks/js-murmur3-128

jeromebanks/nutch

jeromebanks/okhttp

jeromebanks/reactive-kafka

jeromebanks/redshift-auto-schema

jeromebanks/sbt-google-cloud-storage

jeromebanks/scala.rx

jeromebanks/spark

jeromebanks/spark-glue

jeromebanks/spark-on-k8s-operator

jeromebanks/spark-on-kubernetes-docker

jeromebanks/spark-on-kubernetes-helm

jeromebanks/terrapin