julienledem
Apache Parquet co-creator. OpenLineage and Marquez (LFAI&Data) Apache Arrow, Iceberg, 🐖 PMC.
@AstronomerBerkeley
Pinned Repositories
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
parquet-format
Apache Parquet Format
parquet-java
Apache Parquet Java
blog
blog examples
brennus
Builder pattern to generate java classes
map-reduce-console
Browser based map-reduce console to quickly prototype hadoop jobs
Pig-scripting-examples
Examples of use of pig scripting languages capabilities
redelm
an anagram
marquez
Collect, aggregate, and visualize a data ecosystem's metadata
OpenLineage
An Open Standard for lineage metadata collection
julienledem's Repositories
julienledem/redelm
an anagram
julienledem/Pig-scripting-examples
Examples of use of pig scripting languages capabilities
julienledem/brennus
Builder pattern to generate java classes
julienledem/arrow
Mirror of Apache Arrow
julienledem/parquet-mr
Mirror of Apache Parquet
julienledem/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
julienledem/arrow-site
Mirror of Apache Arrow site
julienledem/calcite
Mirror of Apache Calcite
julienledem/contributor_covenant
Pledge your respect and appreciation for contributors of all kinds to your open source project.
julienledem/dagre
:no_entry: [DEPRECATED] - Directed graph layout for JavaScript
julienledem/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
julienledem/dropwizard-sentry
Dropwizard integration for error logging to Sentry.
julienledem/egeria
Open Metadata and Governance
julienledem/grpc-java
The Java gRPC implementation. HTTP/2 based RPC
julienledem/homebrew-thrift
julienledem/iceberg
Iceberg is a table format for large, slow-moving tabular data
julienledem/incubator-heron
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
julienledem/julienledem.github.io
julien.ledem.net
julienledem/marquez
julienledem/marquez-airflow
Airflow support for Marquez
julienledem/marquez-chart
Helm Chart for Marquez
julienledem/marquez-java
Java client for Marquez
julienledem/marquez-python
Python client for Marquez
julienledem/marquez-web
Marquez Web UI ALPHA
julienledem/old-parquet-mr
Java implementation to use with Map-Reduce
julienledem/parquet-cpp
Mirror of Apache Parquet
julienledem/parquet-format
Mirror of Apache Parquet
julienledem/proposing-projects
This repo contains the LF AI Project Proposal Process and Project Lifecycle. They explain the process to host new projects in LF AI and provide a proposal template.
julienledem/spark-1
Apache Spark - A unified analytics engine for large-scale data processing
julienledem/stantz
Raytracing