MartinArens's Stars
n8n-io/n8n
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
modularml/mojo
The Mojo Programming Language
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
RSS-Bridge/rss-bridge
The RSS feed for websites missing it
dpkp/kafka-python
Python client for Apache Kafka
sqlchat/sqlchat
Chat-based SQL Client and Editor for the next decade
jitsucom/jitsu
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
lightdash/lightdash
Self-serve BI to 10x your data team ⚡️
tchiotludo/akhq
Kafka GUI for Apache Kafka to manage topics, topics data, consumers group, schema registry, connect and more...
dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
sfu-db/connector-x
Fastest library to load data from DB to DataFrames in Rust and Python
rholder/retrying
Retrying is an Apache 2.0 licensed general-purpose retrying library, written in Python, to simplify the task of adding retry behavior to just about anything.
tableflowhq/csv-import
The open-source CSV importer, maintained by @tableflowhq
intuit/wasabi
Wasabi A/B Testing service is an open source project that is no longer under active development or being supported
grouparoo/grouparoo
🦘 The Grouparoo Monorepo - open source customer data sync framework
Datavault-UK/automate-dv
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
FriendlyCaptcha/friendly-challenge
The widget and docs for the proof of work challenge used in Friendly Captcha. Protect your websites and online services from spam and abuse with Friendly Captcha, a privacy-first anti-bot solution.
ActivitySchema/ActivitySchema
Repository for the ActivitySchema spec and supporting materials
elbwalker/walkerOS
Open-source event collection and tag management (gtag.js/GTM alternative)
confluentinc/confluent-kafka-python
Confluent's Kafka Python Client
dbeatty10/dbt-mysql
dbt-mysql contains all of the code enabling dbt to work with MySQL and MariaDB
transferwise/pipelinewise-tap-mysql
Singer.io Tap for MySQL - PipelineWise compatible
timodechau/analytics_alternatives
This directory should help everyone who is looking for a tracking and analytics solution
memiiso/debezium-server-bigquery
Replicates any database (CDC events) to Bigquery in real time
openintegrationhub/Connectors
Concepts and guidelines for connector development