dgea005's Stars
explosion/wheelwright
🎡 Automated build repo for Python wheels and source packages
pingcap/awesome-database-learning
A list of learning materials to understand databases internals
okbob/pspg
Unix pager (with very rich functionality) designed for work with tables. Designed for PostgreSQL, but MySQL is supported too. Works well with pgcli too. Can be used as CSV or TSV viewer too. It supports searching, selecting rows, columns, or block and export selected area to clipboard.
trustly/pg_badplan
A PostgreSQL extension for logging queries where the expected/actual rows ratio exceeds a defined value
localstack/localstack-python-client
🐍 A lightweight Python client for LocalStack
ksindi/managers-playbook
:book: Heuristics for effective management
mhart/kinesalite
An implementation of Amazon's Kinesis built on LevelDB
altaurog/pgcopy
fast data loading with binary copy
python-streamz/streamz
Real-time stream processing for python
forcedotcom/SalesforcePy
An absurdly simple package for making Salesforce Rest API calls.
zendesk/maxwell
Maxwell's daemon, a mysql-to-json kafka producer
uber/Python-Sample-Application
uber/storagetapper
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
JarvusInnovations/lapidus
Stream your PostgreSQL, MySQL or MongoDB databases anywhere, fast.
linkedin/brooklin
An extensible distributed system for reliable nearline data streaming at scale
moiot/gravity
A Data Replication Center
airbnb/SpinalTap
Change Data Capture (CDC) service
kyleconroy/pgoutput
Postgres logical replication in Go
psycopg/psycopg2
PostgreSQL database adapter for the Python programming language
2ndQuadrant/pglogical
Logical Replication extension for PostgreSQL 17, 16, 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
pgjdbc/pgjdbc
Postgresql JDBC Driver
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
eulerto/wal2json
JSON output plugin for changeset extraction
etcd-io/etcd
Distributed reliable key-value store for the most critical data of a distributed system
jepsen-io/jepsen
A framework for distributed systems verification, with fault injection
Teradata/kylo
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
facebook/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
Alluxio/alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
aws/aws-sdk-pandas
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).