fpj
Distributed Systems, ZooKeeper, BookKeeper, Kafka, @apache. In a previous life: Yahoo! Research, Microsoft Research, @confluentinc, and Dell.
QbeastBarcelona
Pinned Repositories
bk-ledger-load
Creates and deletes many ledgers to stress bookies.
bookkeeper-tutorial
distributedlog
A high performance replicated log service.
hbase
Mirror of Apache Hadoop HBase
HBASE-2315
HBase and BookKeeper
omid
Transactional Support for HBase
pravega-1
Pravega - Streaming as a new software defined storage primitive
presto-connector
Pravega connector for Presto
s4
S4 repository
zookeeper-book-example
This is a code example that complements the material in the ZooKeeper O'Reilly book.
fpj's Repositories
fpj/presto-connector
Pravega connector for Presto
fpj/bookkeeper
Mirror of Apache Bookkeeper
fpj/codetest
... for git test purposes.
fpj/community
Pravega community content, governance, etc.
fpj/datafusion
Apache DataFusion SQL Query Engine
fpj/dbt-tutorial
fpj/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
fpj/eo-ingestion
Sample exactly-once ingestion
fpj/flink
Mirror of Apache Flink
fpj/flink-connectors
... where the two coolest projects in the globe meet.
fpj/getting-started-k8s
Code and YAML files for Getting Started with Kubernetes video course on Pluralsight
fpj/iceberg
Apache Iceberg
fpj/incubator-xtable
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
fpj/lst-bench
LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.
fpj/mvt
MVT (Mobile Verification Toolkit) helps with conducting forensics of mobile devices in order to find signs of a potential compromise.
fpj/nessie
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
fpj/pravega
Pravega - Streaming as a new software defined storage primitive
fpj/pravega-ingest-gateway
A simple HTTP server that can be used to write JSON events to a Pravega stream
fpj/pravega-multistreamtxn-2pc
Simple implementation of 2PC to support transactions across streams
fpj/pravega-schema-registry
Pravega Schema Registry repository
fpj/pulsar
Apache Pulsar - distributed pub-sub messaging system
fpj/qbeast-spark
Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!
fpj/queryparser
Parsing and analysis of Vertica, Hive, and Presto SQL.
fpj/simple-pravega-producer
fpj/simplereader
Reads events from a Pravega stream using a Flink source.
fpj/spark-connectors
Apache Spark connectors for Pravega.
fpj/spark-sql-perf
fpj/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
fpj/vast-db-connectors
fpj/vast-trino-connector-legacy