ndswaef
📈 Data Analytics Consultant             🎓 Business Engineering: Data Analytics from Ghent University
Belgium
ndswaef's Stars
CloudFormations/CF.Cumulus
A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away the complexity, giving you the power to build, scale, and manage your dataflows with ease, accelerating data delivery.
Azure/bicep-registry-modules
Bicep registry modules
rebremer/blog-datapipeline-cicd
Data pipeline project using Data Factory, Databricks and Cosmosdb Graph, deployed using Azure DevOps, secured using firewalls and Azure AD
K0p1-Git/cloudflare-ddns-updater
Dynamic DNS (DDNS) service based on Cloudflare! Access your home network remotely via a custom domain name without a static IP! Written in pure BASH~
Azure/azure-quickstart-templates
Azure Quickstart Templates
adidas/lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
pamelafox/simple-fastapi-azure-function
Simple HTTP API using FastAPI framework, deployed to Azure Functions using Azure Developer CLI.
marclelijveld/Power-BI-Automation
Automate tasks in Power BI based on the Power BI Powershell cmdlets and the Power BI REST API
microsoft/Fabric-Readiness
A collection of useful materials for presenters interested in topics related to Microsoft Fabric
bhakthan/awesome-microsoft-fabric
A curated list of awesome Microsoft Fabric resources, updates, blogs, videos and more
dataplat/dbops
âš™ dbops - Powershell module that provides continuous database deployments on any scale
DbUp/DbUp
DbUp is a .NET library that helps you to deploy changes to SQL Server databases. It tracks which SQL scripts have been run already, and runs the change scripts that are needed to get your database up to date.
PowerBI-tips/TabularEditor-Scripts
Scripts for Tabular Editor 2 & 3. Community driven to make your Tabular Editor experience as fast as possible.
dbt-labs/dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
databrickslabs/dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
Stefen-Taime/stream-ingestion-redpanda-minio
In this article, you will learn how to set up a real-time data processing and analytics environment using Docker, MySQL, Redpanda, MinIO, and Apache Spark.
David-Summers/Azure-Design
My Azure stencil collection for Visio. Highly functional and always up to date.
mrpaulandrew/ContentCollateral
Images, icons, diagram, etc for various content.
palantir/pyspark-style-guide
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
MingChen0919/learning-apache-spark
Notes on Apache Spark (pyspark)
opendatadiscovery/awesome-data-catalogs
📙 Awesome Data Catalogs and Observability Platforms.
JohnMiner3/community-work
Presentations given to the Data Platform community.
dbt-labs/dbt-external-tables
dbt macros to stage external sources
laurensv-aivix/youtube
DBT testcase Youtube
mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
great-expectations/great_expectations
Always know what to expect from your data.
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
andkret/Cookbook
The Data Engineering Cookbook
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.