Pinned Repositories
airflow-site
Apache Airflow Website
awesome-cto
A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups
blogs
Technology blogging website from Siby Abin. Talks about dataengineering, aws, spark, python, airflow and more
boto3
AWS SDK for Python
chispa
PySpark test helper methods with beautiful error messages
data_engg_cookbook
The Data Engineering Cookbook
k8s-tools
Tools for using k8s
order-analytics
Data engineering project solution for order data analytics using python and sqllite
pg-dock-sql
Refine your SQL skills using pg-dock-sql, a PostgreSQL Docker environment for hands-on coding challenges and solutions. This helps to elevate your proficiency in SQL through handson experience.
sibyabin.github.io
Portfolio Website of Siby Abin Thomas - Senior Data Engineer
sibyabin's Repositories
sibyabin/k8s-tools
Tools for using k8s
sibyabin/order-analytics
Data engineering project solution for order data analytics using python and sqllite
sibyabin/pg-dock-sql
Refine your SQL skills using pg-dock-sql, a PostgreSQL Docker environment for hands-on coding challenges and solutions. This helps to elevate your proficiency in SQL through handson experience.
sibyabin/airflow-site
Apache Airflow Website
sibyabin/awesome-cto
A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups
sibyabin/blogs
Technology blogging website from Siby Abin. Talks about dataengineering, aws, spark, python, airflow and more
sibyabin/boto3
AWS SDK for Python
sibyabin/chispa
PySpark test helper methods with beautiful error messages
sibyabin/dbt-data-reliability
Data anomalies monitoring as dbt tests and dbt artifacts uploader.
sibyabin/dbt-fundamentals
Sample repository for DBT fundamentals course in https://courses.getdbt.com/
sibyabin/delta
This connector allows Apache Spark™ to read from and write to Delta Lake.
sibyabin/sibyabin.github.io
Portfolio Website of Siby Abin Thomas - Senior Data Engineer
sibyabin/DataLakeBootcamp_2Days
Data Lake Bootcamp: Building Reliable Data Lakes
sibyabin/delta-io-website
Delta Lake Website
sibyabin/delta-sharing
An open protocol for secure data sharing
sibyabin/gh-actions-examples
Repo to check various GH actions
sibyabin/great_expectations
Always know what to expect from your data.
sibyabin/jaffle_shop
A self-contained dbt project for testing purposes
sibyabin/jekyll-algolia
Add fast and relevant search to your Jekyll site
sibyabin/lakeFS
Git-like capabilities for your object storage
sibyabin/learning-k8s
k8s learning notes
sibyabin/LearningSparkV2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
sibyabin/mage-ai
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
sibyabin/sample-app
sibyabin/sibyabin
Siby Abin - Senior Data Engineer
sibyabin/spark
Apache Spark - A unified analytics engine for large-scale data processing
sibyabin/spark-website
Apache Spark Website
sibyabin/styleguide
Style guides for Google-originated open-source projects
sibyabin/superset
Apache Superset is a Data Visualization and Data Exploration Platform
sibyabin/unitycatalog
Open, Multi-modal Catalog for Data & AI