rkkalluri's Stars
JDBraun/isolake
Isolake is a simple and specialized Databricks workspace deployment design on AWS that isolates users and workloads from the public internet, utilizing Unity Catalog and AWS PrivateLink as its foundational architectural components
databricks/terraform-databricks-examples
Examples of using Terraform to deploy Databricks resources
databrickslabs/ucx
Automated migrations to Unity Catalog
databricks/terraform-provider-databricks
Databricks Terraform Provider
databrickslabs/overwatch
Capture deep metrics on one or all assets within a Databricks workspace
apache/incubator-xtable
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
infracost/infracost
Cloud cost estimates for Terraform in pull requests 💰📉 Shift FinOps Left!
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
apache/pinot
Apache Pinot - A realtime distributed OLAP datastore
apache/kafka
Mirror of Apache Kafka
yugabyte/yugabyte-db
YugabyteDB - the cloud native distributed SQL database for mission-critical applications.
rkkalluri/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
apache/hudi
Upserts, Deletes And Incremental Processing on Big Data.
xebia-os/hands-on-serverless-guide
A hands-on guide for building Serverless applications
hashicorp/terraform-foundational-policies-library
Sentinel is a language and framework for policy built to be embedded in existing software to enable fine-grained, logic-based policy decisions. This repository contains a library of Sentinel policies, developed by HashiCorp, that can be consumed directly within the Terraform Cloud platform.
acantril/aws-sa-associate-saac02
Course Files for AWS Certified Solutions Architect Certification Course (SAAC02) - Adrian Cantrill
Azure/azure-quickstart-templates
Azure Quickstart Templates
MicrosoftDocs/architecture-center
Open Source documentation for the Azure Architecture Center on Microsoft Docs
Azure/caf-terraform-landingzones
This solution, offered by the Open-Source community, will no longer receive contributions from Microsoft. Customers are encouraged to transition to Microsoft Azure Verified Modules for continued support and updates from Microsoft. Please note, this repository is scheduled for decommissioning and will be removed on July 1, 2025.
ExpediaGroup/waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
ExpediaGroup/beeju
JUnit integration for testing the Apache Hive Metastore and HiveServer2 Thrift APIs
ExpediaGroup/circus-train
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
jeffellin/aws
MrPowers/spark-spec
Test suite to document the behavior of Spark
timothyrenner/kafka-streams-ex
A collection of examples and use-cases for Kafka Streams
pershyn/storm-metrics-opentsdb
storm-metrics-opentsdb is a module for Apache Storm that converts and reports metrics to OpenTSDB
databricks/reference-apps
Spark reference applications
IntersysConsulting/ingestive
An example of how to use Kafka and Storm to ingest events
t3rmin4t0r/notes
Random implementation notes
HubSpot/hbase-support
Supporting configs and tools for HBase at HubSpot