/unitycatalog-rs

Open, Multi-modal Catalog for Data & AI, written in Rust

Primary LanguageRustApache License 2.0Apache-2.0

unitycatalog-rs

CI crates.io crates.io crates.io

Unity Catalog Rust (UC-R)

A native Rust implementation of the Unity Catalog: an Open and Multimodal Catalog for Data & AI.

Prerequisites

  • Install Rust, e.g. as described here

[WIP] Quick Start

Use two terminal windows: one to run the Unity Catalog Rust server (UC-R) another to explore the content of the UC-R server using varios client tools.

Run the server

In the first terminal window, start the UC server (from the repository root directory)

cargo run

[TBD] Query with DuckDB

  • Install DuckDB
  • Start DuckDB (run duckdb in the terminal)
  • Run the following commands in the DuckDB shell to install the required extensions
install uc_catalog from core_nightly;
load uc_catalog;
install delta;
load delta;
  • Attach the unity catalog to DuckDB
ATTACH 'unity' AS unity (TYPE UC_CATALOG);
  • Run the queries, for example
SHOW ALL TABLES;
SELECT * from unity.default.numbers;

APIs and Compatibility

  • Open API specification: The Unity Catalog Rest API is documented here.
  • Compatibility and stability: The APIs are currently evolving and should not be assumed to be stable.

Compiling and Testing

cargo test

Why Rust?

Rust was chosen as a language for the alternative implementation for a few reasons:

  • Performance: Rust compiles to native bytecode, enabling it to be used in performance-critical and constrained environments.
  • FFI: As Rust uses the C ABI, bindings can be exposed to any language, like C, Go, etc.
  • Python & Rust: Python and Rust are a strong pairing, especially in the AI ecosystem (tokenizers, pydantic, etc).
  • Ecosystem: Rust has a rich ecosystem of libraries, especially related to big data processing (delta-rs, datafusion, etc).
  • Safety: Rust's strong type system and ownership model make it difficult to write code that is unsafe or has undefined behavior. When dealing with data assets, especially when it comes to access control, this helps us ensure that we are not introducing security vulnerabilities.
  • We want to write it!: Most importantly, we like writing Rust!

Getting Involved

We encourage you to reach out and share your feedback or ideas by raising an issue.

See CONTRIBUTING.md if you are looking to contribute.