shyamk136's Stars
lakehq/sail
LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.
microsoft/python-package-template
Template for Python Projects
Avaiga/taipy
Turns Data and AI algorithms into production-ready web applications in no time.
ScrapeGraphAI/ScrapeSchema
Python library for extracting entities, relationships, and schemas from unstructured data
databrickslabs/remorph
Cross-compiler and Data Reconciler into Databricks Lakehouse
davidzajac1/zillacode
Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
dslp/dslp
The Data Science Lifecycle Process is a process for taking data science teams from Idea to Value repeatedly and sustainably. The process is documented in this repo.
Data-Engineer-Camp/dbt-dimensional-modelling
Step-by-step tutorial on building a Kimball dimensional model with dbt
professorshabs/data_modeling
Kimball data engineering
MrChadMWood/application_tracker
A simple CRUD webapp built with Streamlit and FastAPI, using Postgres backend. This is designed to track job applications. Project started 2024/08/20
dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
databricks/genai-cookbook
dashbook/dashtool
1997mahadi/dbt-dlt-ingestion-pipeline
Pulsweb/MyScripts
igorbarinov/awesome-data-engineering
A curated list of data engineering tools for software developers
openai/openai-cookbook
Examples and guides for using the OpenAI API
dbt-checkpoint/dbt-checkpoint
🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
yati1002/Power-BI-DatabricksSQL-QuickStart-Samples
Repo for Power BI Demos and Templates
run-llama/llama_deploy
Deploy your agentic workflows to production
unitycatalog/unitycatalog
Open, Multi-modal Catalog for Data & AI
CodyAustinDavis/dbsql_sme
DBSQL SME Repo contains demos, tutorials, blog code, advanced production helper functions and more!
danielbeach/dataEngineeringTemplate
Template for Data Engineering and Data Pipeline projects
kerski/fabric-dataops-patterns
Templates for weaving DataOps into Microsoft Fabric
NatVanG/PBI-Inspector
A rules-based Power BI report layout testing tool.
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
MicrosoftLearning/dp-203-azure-data-engineer
Exercise files for Microsoft Data Engineer curriculum
nicholasyager/dbt-loom
A dbt-core plugin to weave together multi-project dbt-core deployments
dbt-labs/dbt-meshify
A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.
awslabs/python-deequ
Python API for Deequ