/vertica-containers

Docker containers for building and testing Vertica extensions

Primary LanguageShell

License

Vertica Containers

This repository has the sources for container-based projects using Vertica.

For Vertica on Kubernetes containers and resources, see vertica-kubernetes.

IMPORTANT: To build the projects in this repository, you must have a licensed Vertica RPM or DEB file.

The One-Node CE directory provides instructions to build the containerized version of the Vertica Community Edition (CE), a free, limited license that Vertica provides as a hands-on introduction to the platform. For an overview, see the Vertica documentation.

Vertica publishes the binary version of this container on DockerHub as the vertica/vertica-ce container.

The UDx-container directory packages in a container the following resources required to build User-Defined eXtensions (UDxs):

  • C++-compiler
  • Libraries
  • Google protobuf compiler
  • Python interpreter
  • Tools to invoke the UDx

The kafka-scheduler directory provides tools to maintain the official vertica/kafka-scheduler container, or build a custom containerized version of the Vertica Kafka Scheduler, a standalone Java application that automatically consumes data from one or more Kafka topics and then loads the structured data into Vertica.

The Kafka Scheduler provides the following advantages over manually loading data with COPY statements:

  • Streamed data automatically loads in your database according to the frame duration.
  • The Kafka Scheduler manages offsets to ensure an exactly-once message consumption process from Kafka topics.
  • You can configure backup schedulers to provide high-availability. If the primary scheduler fails, the backup scheduler begins loading Kafka data where the failed scheduler left off.