hive-metastore
Apache Hive Metastore as a Standalone server in Docker. Can be used in a modular fashion with Presto, Trino, Spark, and many other BigData tools.
There are numerous Docker images that attempt to do this, but yet to see something actually work as advertised with minimal bloat.
Setup
Usage
docker-compose build
docker-compose up -d
Configuration
Controlled via ENVironment variables
Key | Required? | Description |
---|---|---|
DATABASE_TYPE_JDBC | No, defaults to postgresql | Database type1 for JDBC connection |
DATABASE_TYPE | No, defaults to postgres | Database type1 for migration tool |
DATABASE_DRIVER | No, defaults to org.postgresql.Driver | Database class used for JDBC connection |
DATABASE_HOST | Yes | Database host |
DATABASE_PORT | No, defaults to 5432 | Database port |
DATABASE_DB | Yes | Database name |
DATABASE_USER | Yes | Database user |
DATABASE_PASSWORD | Yes | Database password |
S3_ENDPOINT_URL | No | Custom S3 endpoint URL; useful for minio integration |
S3_BUCKET | Yes | S3 bucket name |
S3_PREFIX | Yes | S3 bucket prefix |
1 Though you have the ability to modify
DATABASE_TYPE_JDBC
/DATABASE_TYPE
, we presently only install Postgres driver. You'd have to extend this image and install a non-Postgres driver to change the Database type.
Development
This project has most of the batteries included to test and verify that the app works
-
Install docker and docker-compose
-
Launch dev environment
$ make env-up
-
Run test(s)
$ docker-compose run test