/gravitino

A high-performance, geo-distributed and federated metadata lake

Primary LanguageJavaApache License 2.0Apache-2.0

Gravitino

GitHub Actions Build GitHub Actions Integration Test

Introduction

Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages the metadata directly in different sources, types, and regions. It also provides users with unified metadata access for data and AI assets.

Gravitino Architecture

Gravitino aims to provide several key features:

  • Single Source of Truth for multi-regional data with geo-distributed architecture support.
  • Unified Data and AI asset management for both users and engines.
  • Security in one place, centralizing the security for different sources.
  • Built-in data management and data access management.

Contributing to Gravitino

Gravitino is open source software available under the Apache 2.0 license. For information of how to contribute to Gravitino please see the Contribution guidelines.

Online documentation

You can find the latest Gravitino documentation here in the doc folder, The README file only contains basic setup instructions.

Building Gravitino

Gravitino builds Gradle. To build Gravitino, please run:

./gradlew clean build -x test

If you want to build a distribution package, please run:

./gradlew compileDistribution -x test

to build a distribution package.

Or:

./gradlew assembleDistribution -x test

to build a compressed distribution package.

The generated binary distribution package locates in distribution directory.

For the details of building and testing Gravitino, please see How to build Gravitino.

Quick start

Configure and start the Gravitino server

If you already have a binary distribution package, please decompress the package (if required) and go to the directory where the package locates.

Before starting the Gravitino server, please configure the Gravitino server configuration file. The configuration file, gravitino.conf, located in the conf directory and follows the standard property file format. You can modify the configuration within this file.

To start the Gravitino server, please run:

./bin/gravitino.sh start

To stop the Gravitino server, please run:

./bin/gravitino.sh stop

Using Trino with Gravitino

Gravitino provides a Trino connector to access the metadata in Gravitino. To use Trino with Gravitino, please follow the trino-gravitino-connector doc.

Development guide

  1. How to build Gravitino
  2. How to test Gravitino
  3. How to publish Docker images

License

Gravitino is under the Apache License Version 2.0, See the LICENSE for the details.

Apache®, Apache Hadoop®, Apache Hive™, Apache Iceberg™, Apache Kafka®, Apache Spark™, Apache Submarine™, Apache Thrift™ and Apache Zeppelin™ are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.