/hive-metastore

Apache Hive Metastore as a Standalone server in Docker

Primary LanguageShellApache License 2.0Apache-2.0

hive-metastore

Apache Hive Metastore as a Standalone server in Docker. Can be used in a modular fashion with Presto, Trino, Spark, and many other BigData tools.

There are numerous Docker images that attempt to do this, but yet to see something actually work as advertised with minimal bloat.

Setup

Usage

docker-compose build
docker-compose up -d

Configuration

Controlled via ENVironment variables

Key Required? Description
DATABASE_TYPE_JDBC No, defaults to postgresql Database type1 for JDBC connection
DATABASE_TYPE No, defaults to postgres Database type1 for migration tool
DATABASE_DRIVER No, defaults to org.postgresql.Driver Database class used for JDBC connection
DATABASE_HOST Yes Database host
DATABASE_PORT No, defaults to 5432 Database port
DATABASE_DB Yes Database name
DATABASE_USER Yes Database user
DATABASE_PASSWORD Yes Database password
S3_ENDPOINT_URL No Custom S3 endpoint URL; useful for minio integration
S3_BUCKET Yes S3 bucket name
S3_PREFIX Yes S3 bucket prefix

1 Though you have the ability to modify DATABASE_TYPE_JDBC/DATABASE_TYPE, we presently only install Postgres driver. You'd have to extend this image and install a non-Postgres driver to change the Database type.

Development

This project has most of the batteries included to test and verify that the app works

  1. Install docker and docker-compose

  2. Launch dev environment

    $ make env-up
  3. Run test(s)

    $ docker-compose run test