Pinned Repositories
arrow
Mirror of Apache Arrow
carbondata
Mirror of Apache CarbonData
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
flink
Mirror of Apache Flink
hive
Mirror of Apache Hive
koalas
Koalas: Pandas API on Apache Spark
mlflow
Open source platform for the machine learning lifecycle
spark
Mirror of Apache Spark
spark-sql-perf
spark-website
Mirror of Apache Spark Website
gatorsmile's Repositories
gatorsmile/koalas
Koalas: Pandas API on Apache Spark
gatorsmile/spark
Mirror of Apache Spark
gatorsmile/spark-sql-perf
gatorsmile/mlflow
Open source platform for the machine learning lifecycle
gatorsmile/carbondata
Mirror of Apache CarbonData
gatorsmile/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
gatorsmile/flink
Mirror of Apache Flink
gatorsmile/spark-website
Mirror of Apache Spark Website
gatorsmile/arrow
Mirror of Apache Arrow
gatorsmile/aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions
gatorsmile/hive
Mirror of Apache Hive
gatorsmile/azure-cosmosdb-spark
Apache Spark Connector for Azure Cosmos DB
gatorsmile/calcite
Mirror of Apache Calcite
gatorsmile/druid
Column oriented distributed data store ideal for powering interactive applications
gatorsmile/gporca
A modular query optimizer for big data
gatorsmile/HANAVora-Extensions
Spark extensions for business contexts
gatorsmile/httl.github.com
HTTL Home Page.
gatorsmile/Impala
Real-time Query for Hadoop; mirror of Apache Impala
gatorsmile/parquet-mr
Mirror of Apache Parquet
gatorsmile/presto
Distributed SQL query engine for running interactive analytic queries against big data sources.
gatorsmile/pyspark-ai
English SDK for Apache Spark
gatorsmile/spark-sql-kenel-architecture
gatorsmile/sql-query
sql-query
gatorsmile/tensorflow
Computation using data flow graphs for scalable machine learning
gatorsmile/tpcds-kit
TPC-DS benchmark kit with some modifications/additions