witboost


Designed by Agile Lab, Witboost is a versatile platform that addresses a wide range of sophisticated data engineering challenges. It enables businesses to discover, enhance, and productize their data, fostering the creation of automated data platforms that adhere to the highest standards of data governance. Want to know more about Witboost? Check it out here or contact us!

This repository is part of our Starter Kit meant to showcase Witboost's integration capabilities and provide a "batteries-included" product.

ADLS Gen2 Storage

Overview

This project allows to provision an Azure Storage Account and one or more ADLS Gen2 container inside it. The Terraform configuration files can be found inside the files folder.

It's a module ready to be used by our OSS Terraform Specific Provisioner.

Configuring

The Resource Group where to publish the Storage Account must already exist.

Environment variables

To authenticate on Azure using a Service Principal with a Client Secret, the following env variables need to be defined:

  • ARM_CLIENT_ID - The Client ID of the Service Principal.
  • ARM_CLIENT_SECRET - The Client Secret of the Service Principal.
  • ARM_SUBSCRIPTION_ID - The Subscription ID in which the Storage Account exists.
  • ARM_TENANT_ID - The Tenant ID in which the Subscription exists.

Example configuration for the Terraform SP

  terraform {
    "urn:dmb:utm:azure-storage-adlsgen2-template:0.0.0" {
      repositoryPath: "/tf/adlsgen2"
      descriptorToVariablesMapping: {
        dp_domain = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.component.dpDomain"
        dp_name_major_version = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.component.dpNameMajorVersion"
        component_name = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.component.name"
        resource_group = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.resourceGroup"
        environment = "$.dataProduct.environment"
        account_tier = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.performance"
        account_replication_type = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.redundancy"
        access_tier = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.accessTier"
        infrastructure_encryption_enabled = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.infrastructureEncryptionEnabled"
        allow_nested_items_to_be_public = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.allowNestedItemsToBePublic"
        containers = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.containers"
      }
      principalMappingPlugin {
           pluginClass: "it.agilelab.plugin.principalsmapping.impl.azure.AzureMapperFactory"
           azure: {
              tenantId: ${?PRINCIPAL_MAPPING_TENANT_ID}
              clientId: ${?PRINCIPAL_MAPPING_CLIENT_ID}
              clientSecret: ${?PRINCIPAL_MAPPING_CLIENT_SECRET}
           }
      }
      backendConfigs: {
        stateKey = "key"
        configs = {
          key = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.state.key"
        }
      }
    }
  }

To learn more about the configuration parameters, please refer to the documentation of the Terraform Specific Provisioner.

Terraform State

We recommend to have a different Storage Account where Terraform can save the remote state for this module.

Deploying

The module is packaged as an Helm Chart. It must be deployed as part of an Umbrella Chart along with the Terraform Specific Provisioner and, eventually, other modules.

Values

Key Type Default Description
enabled bool true Include the TF module to create the Storage Account
state object {"container_name":"","resource_group_name":"","storage_account_name":""} Terraform Remote State configuration
state.container_name string "" The Name of the Storage Container within the Storage Account
state.resource_group_name string "" The Name of the Resource Group in which the Storage Account exists
state.storage_account_name string "" The Name of the Storage Account

License

This project is available under the Apache License, Version 2.0; see LICENSE for full details.

About us


Agile Lab


Agile Lab creates value for its Clients in data-intensive environments through customizable solutions to establish performance driven processes, sustainable architectures, and automated platforms driven by data governance best practices.

Since 2014 we have implemented 100+ successful Elite Data Engineering initiatives and used that experience to create Witboost: a technology-agnostic, modular platform, that empowers modern enterprises to discover, elevate and productize their data both in traditional environments and on fully compliant Data mesh architectures.

Contact us or follow us on: