Pinned Repositories
avro-js
Apache Avro implementation in JavaScript
iceberg
Apache Iceberg
jupyter-zeppelin
Conversion utility from Zeppelin notes to Jupyter notebooks.
jvm-repr
API for converting JVM objects to representations by MIME type, for the Jupyter ecosystem.
marker
A markup parser that outputs html and text. Syntax is similar to MediaWiki.
parquet-avro-protobuf
Example: Convert Protobuf to Parquet using parquet-avro and avro-protobuf
parquet-cli
Parquet Command-line Tools
parquet-mr
Mirror of Apache Parquet
s3committer
Hadoop output committers for S3
spark
Mirror of Apache Spark
rdblue's Repositories
rdblue/s3committer
Hadoop output committers for S3
rdblue/jupyter-zeppelin
Conversion utility from Zeppelin notes to Jupyter notebooks.
rdblue/avro-js
Apache Avro implementation in JavaScript
rdblue/brotli-codec
Hadoop Codec for Brotli
rdblue/avro-php
Apache Avro implementation in PHP
rdblue/iceberg
Apache Iceberg
rdblue/iceberg-python
Apache PyIceberg
rdblue/spark
Mirror of Apache Spark
rdblue/SQL
BNF Grammars for SQL-92, SQL-99 and SQL-2003
rdblue/jvm-repr
API for converting JVM objects to representations by MIME type, for the Jupyter ecosystem.
rdblue/parquet-mr
Mirror of Apache Parquet
rdblue/arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
rdblue/async-file-io
rdblue/avro-interop
Test data for Avro interoperability
rdblue/avro-java
Apache Avro implementation in Java
rdblue/avro-ruby
Apache Avro implementation in Ruby
rdblue/avro-shared
rdblue/babar
Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.
rdblue/coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
rdblue/duckdb_iceberg
rdblue/genie
Federated Big Data Orchestration Service
rdblue/github-issue-templates
:symbols: A collection of GitHub issue and pull request templates
rdblue/iceberg-docs
Apache Iceberg Documentation Site
rdblue/incubator-toree
Fork of Apache Toree
rdblue/jvm-magics
A plugin system for magic function implementations across JVM kernels.
rdblue/parquet-format
Mirror of Apache Parquet
rdblue/spark-website
Apache Spark Website
rdblue/strava-uploader
utility to migrate Runkeeper data (GPX and CSV) to Strava
rdblue/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
rdblue/Vegas
The missing MatPlotLib for Scala + Spark