Pinned Repositories
aws-cloudformation-templates
A collection of useful CloudFormation templates
big-data-rosetta-code
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
brickflow
Pythonic Programming Framework to orchestrate jobs in Databricks Workflow
business-analytics-and-mathematics-python-book
Advanced Business Analytics and Mathematics with Python
complete-guide-to-step-functions-examples
Examples for the "Complete Guide to Step Functions" course
corda
Corda is an open source blockchain project, designed for business from the start. Only Corda allows you to build interoperable blockchain networks that transact in strict privacy. Corda's smart contract technology allows businesses to transact directly, with value.
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
dribble
terraform-aws-elasticsearch
pariksheet's Repositories
pariksheet/dribble
pariksheet/terraform-aws-elasticsearch
pariksheet/aws-cloudformation-templates
A collection of useful CloudFormation templates
pariksheet/big-data-rosetta-code
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
pariksheet/brickflow
Pythonic Programming Framework to orchestrate jobs in Databricks Workflow
pariksheet/business-analytics-and-mathematics-python-book
Advanced Business Analytics and Mathematics with Python
pariksheet/complete-guide-to-step-functions-examples
Examples for the "Complete Guide to Step Functions" course
pariksheet/corda
Corda is an open source blockchain project, designed for business from the start. Only Corda allows you to build interoperable blockchain networks that transact in strict privacy. Corda's smart contract technology allows businesses to transact directly, with value.
pariksheet/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
pariksheet/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
pariksheet/enceladus
Dynamic Conformance Engine
pariksheet/former2
Generate CloudFormation / Terraform / Troposphere templates from your existing AWS resources.
pariksheet/hadoop-tutorials
hadoop-tutorials
pariksheet/koalas
Koalas: pandas API on Apache Spark
pariksheet/ludwig
Ludwig is a toolbox built on top of TensorFlow that allows to train and test deep learning models without the need to write code.
pariksheet/metaflow
Build and manage real-life data science projects with ease.
pariksheet/minio
MinIO is a high performance object storage server compatible with Amazon S3 APIs
pariksheet/nakadi
A distributed event bus that implements a RESTful API abstraction on top of Kafka-like queues
pariksheet/nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
pariksheet/quinn
pyspark methods to enhance developer productivity 📣 👯 🎉
pariksheet/redash
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
pariksheet/soda-sql
Metric collection, data testing and monitoring for SQL accessible data
pariksheet/spark-daria
Essential Spark extensions and helper methods ✨😲
pariksheet/spark-style-guide
Spark style guide
pariksheet/spline
Data Lineage Tracking and Visualization tool for Apache Spark ™
pariksheet/spring-security-react-ant-design-polls-app
Full Stack Polls App built using Spring Boot, Spring Security, JWT, React, and Ant Design
pariksheet/terraform-bootcamp
pariksheet/terraform-course
Course files for my Udemy course about Terraform
pariksheet/troposphere
troposphere - Python library to create AWS CloudFormation descriptions
pariksheet/zipkin
Zipkin is a distributed tracing system