altifern's Stars
PostHog/posthog
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
gunnarmorling/1brc
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
bruin-data/ingestr
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
RamiKrispin/vscode-python
A Tutorial for Setting Python Development Environment with VScode and Docker
codeproject/CodeProject.AI-Server
CodeProject.AI Server is a self contained service that software developers can include in, and distribute with, their applications in order to augment their apps with the power of AI.
DataExpert-io/cumulative-table-design
This repository helps teach people how to correctly define and create cumulative tables!
RamiKrispin/awesome-ds-setting
A tutorial for setting a new machine with core data science tools
josephmachado/efficient_data_processing_spark
Code for "Efficient Data Processing in Spark" Course
microsoft/bobsql
demos, scripts, samples, and code from the two bobs who work at Microsoft on SQL Server
EcZachly/little-book-of-pipelines
This repository goes over how to handle massive variety in data engineering
tpvasconcelos/ridgeplot
Beautiful ridgeline plots in Python
josephmachado/data_engineering_best_practices
Sample project to demonstrate data engineering best practices
microsoft/fabric-toolbox
Fabric toolbox is a repository of tools, accelerators, scripts, and samples to accelerate your success with Microsoft Fabric, brought to you by Fabric CAT.
Armaan1Gohil/dataengineering-tech-stack
Local Environment to Practice Data Engineering
MicrosoftDocs/fabric-docs
Public repo for the fabric-docs-pr pair
djouallah/Fabric_Notebooks_Demo
Fabric Python Notebooks examples
cnstlungu/portable-data-stack-sqlmesh
A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset
jplane/pyspark-devcontainer
A simple VS Code devcontainer setup for local PySpark development
sanchitvj/sports_betting_analytics_engine
A data and analytics engineering platform designed for real-time sports betting analytics.
symerio/postal-codes-data
Mirror of the database of postal codes from GeoNames
dbrownems/SparkDataEngineeringForSQLServerProfessionals
MicrosoftDocs/mslearn-ingest-data-with-microsoft-fabric-notebooks
Code samples for Ingest data with Microsoft Fabric notebooks
ADefWebserver/FabricDataExplorer
Import display and edit data in Microsoft Fabric
7effrey89/streamlit_azuresqldb_fabric
Demo of a Custom Data Entry App you can build for your Azure SQL Database and Microsoft Fabric Warehouse and Lakehouse using python framework called Streamlit.
dquoctri/mssql-compose
MSSQL (Microsoft SQL Server) powered by docker-compose
d-swapnil/Reddit_data_pipeline
edkreuk/data-factory-testing-framework
A stand-alone test framework that allows to write unit tests for Data Factory pipelines on Microsoft Fabric, Azure Data Factory and Azure Synapse Analytics.
migue1neto/Idealista
Web scraping and data analysis of all Idealista listings in Portugal.
shivasai780/Real-Time-Using-Snowflake
sweetkobem/airflow_dag_generator