Dixit-Pathak's Stars
microsoft/python-package-template
Template for Python Projects
awslabs/python-deequ
Python API for Deequ
Netflix/metaflow
Open Source AI/ML Platform
great-expectations/great_expectations
Always know what to expect from your data.
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
apache/arrow
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
fastapi/fastapi
FastAPI framework, high performance, easy to learn, fast to code, ready for production
big-data-europe/docker-spark
Apache Spark docker image
microsoft/vscode-python
Python extension for Visual Studio Code
audreyfeldroy/cookiecutter-pypackage
Cookiecutter template for a Python package.
gettyimages/docker-spark
Docker build for Apache Spark
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
tmalaska/SparkUnitTestingExamples
This project is a collection of Spark unit test examples to help new Spark users learn how to unit test their code for Spark Core, Spark SQL, and Spark Streaming.
hhimanshu/Unit-Testing-In-Scala
Learn to use ScalaTest in your own projects
Azure/azure-powershell
Microsoft Azure PowerShell
databricks-academy/dbacademy
Internal library used to develop and test Databricks Academy courseware
mspnp/azure-databricks-streaming-analytics
Stream processing with Azure Databricks
krishnaik06/Feature-Engineering-Live-sessions
larribas/docker-production-mlflow
This repository builds a production-ready Docker image for running an MLflow cluster in production.
spirom/LearningSpark
Scala examples for learning to use Spark
microsoft/vscode-docs
Public documentation for Visual Studio Code
Azure/azure-quickstart-templates
Azure Quickstart Templates
solliancenet/microsoft-learning-paths-databricks-notebooks
Contains notebooks used in the Microsoft Azure Databricks Learning Paths modules.
MicrosoftDocs/mslearn-aml-labs
Azure Machine Learning Lab Notebooks
MicrosoftDocs/ml-basics
Exercise notebooks for Machine Learning modules on Microsoft Learn
mlflow/mlflow-example
An example MLflow project
dotnet/spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
srivatsan88/Mastering-Apache-Spark
This is the repository for my YouTube course on end-to-end Apache Spark, from the AIEngineering YouTube channel.