Pinned Repositories
alerting-kibana-plugin
Open Distro for Elasticsearch Kibana Alerting Plugin
apicurio-registry
An API/Schema registry - stores APIs and Schemas.
Azure_Synapse_Toolbox
Repository of tools/queries for managing and monitoring Azure Synapse.
brickhouse
Hive UDF's for the data warehouse
cdap
An open source framework for building data analytic applications.
citus
Scalable PostgreSQL for multi-tenant and real-time analytics workloads
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
hadoop
Mirror of Apache Hadoop
hive
Mirror of Apache Hive
kafka
Mirror of Apache Kafka
rsahu's Repositories
rsahu/alerting-kibana-plugin
Open Distro for Elasticsearch Kibana Alerting Plugin
rsahu/apicurio-registry
An API/Schema registry - stores APIs and Schemas.
rsahu/Azure_Synapse_Toolbox
Repository of tools/queries for managing and monitoring Azure Synapse.
rsahu/cdap
An open source framework for building data analytic applications.
rsahu/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
rsahu/hadoop
Mirror of Apache Hadoop
rsahu/hive
Mirror of Apache Hive
rsahu/kafka
Mirror of Apache Kafka
rsahu/helm-nifi
Helm Chart for Apache Nifi
rsahu/hudi
Upserts, Deletes And Incremental Processing on Big Data.
rsahu/kafka-connect-jdbc
Kafka Connect connector for JDBC-compatible databases
rsahu/ksql
The database purpose-built for stream processing applications.
rsahu/kylo
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on Apache Hadoop and Spark. Kylo is licensed under Apache 2.0 and contributed by Think Big, A Teradata Company
rsahu/mmlspark
Microsoft Machine Learning for Apache Spark
rsahu/opendistro-build
Open Distro for Elasticsearch Build Scripts
rsahu/OpenSearch
🔎 Open source distributed and RESTful search engine.
rsahu/pdfbox
Mirror of Apache PDFBox
rsahu/rapidminer-studio
Easy-to-use visual environment for predictive analytics. No programming required. RapidMiner is easily the most powerful and intuitive graphical user interface for the design of analysis processes. Forget sifting through code! You can also choose to run in batch mode. Whatever you prefer, RapidMiner has it all.
rsahu/rapidprom-source
Current development of the RapidProM Extension
rsahu/registry
Schema Registry
rsahu/schema-registry
Confluent Schema Registry for Kafka
rsahu/security
Open Distro for Elasticsearch Security plugin
rsahu/security-advanced-modules
Advanced modules for the Open Distro for Elasticsearch security plugin
rsahu/security-kibana-plugin
Open Distro for Elasticsearch Security Kibana Plugin
rsahu/security-ssl
rsahu/spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
rsahu/strimzi-kafka-operator
Apache Kafka running on Kubernetes
rsahu/timescaledb
An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.
rsahu/TransmogrifAI
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
rsahu/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)