sidharthbolar's Stars
binhnguyennus/awesome-scalability
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
lauris/awesome-scala
A community driven list of useful Scala libraries, frameworks and software.
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
microsoft/SynapseML
Simple and Distributed Machine Learning
Azure/azure-sdk-for-python
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-python.
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
awslabs/diagram-maker
A library to display an interactive editor for any graph-like data.
feathr-ai/feathr
Feathr – A scalable, unified data and AI engineering platform for enterprise
scopt/scopt
command line options parsing for Scala
softwaremill/tapir
Rapid development of self-documenting APIs
fineanmol/Hacktoberfest2022
Make your first Pull Request on Hacktoberfest 2022. Don't forget to spread love and if you like give us a ⭐️
graphframes/graphframes
awslabs/python-deequ
Python API for Deequ
YotpoLtd/metorikku
A simplified, lightweight ETL Framework based on Apache Spark
ing-bank/popmon
Monitor the stability of a Pandas or Spark dataframe ⚙︎
microsoft/knack
Knack - A Python command line interface framework
Azure/azure-event-hubs-spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
uhub/awesome-scala
A curated list of awesome Scala frameworks, libraries and software.
scalacenter/scaladex
The Scala Package Index
SETL-Framework/setl
A simple Spark-powered ETL framework that just works 🍺
Azure-Samples/azure-python-labs
Labs demonstrating how to use Python with Azure, Visual Studio Code, GitHub, Windows Subsystem for Linux, and more!
dask/dask-cloudprovider
Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...
Azure/azure-functions-durable-python
Python library for using the Durable Functions bindings.
scalacenter/sprees
Scala Open Source Sprees: join us and learn how to contribute to open source!
Azure/spark-cdm-connector
starlake-ai/starlake
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
avensolutions/spark-sql-etl-framework
Multi-stage, config driven, SQL based ETL framework using PySpark
microsoft/azure-synapse-spark-metrics
Azure Synapse Spark Metrics provides easy metrics monitoring functions for Synapse services, especially, Apache Spark pool instances, by leveraging Prometheus, Grafana and Azure APIs.
mouachan/droolsParser
damavis/apache-drools-demo
Demo about how to use Apache Drools with the scala programming language using a database and a template