/arrow-datafusion

Apache Arrow DataFusion SQL Query Engine

Primary LanguageRustApache License 2.0Apache-2.0

DataFusion

Coverage Status

logo

DataFusion is a very fast, extensible query engine for building high-quality data-centric systems in Rust, using the Apache Arrow in-memory format. Python Bindings are also available.

DataFusion offers SQL and Dataframe APIs, excellent performance, built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and a great community.

https://arrow.apache.org/datafusion/ contains the project's documentation.

Using DataFusion

The example usage section in the user guide and the datafusion-examples code in the crate contain information on using DataFusion.

Contributing to DataFusion

The developer’s guide contains information on how to contribute.