Designed by Agile Lab, Witboost is a versatile platform that addresses a wide range of sophisticated data engineering challenges. It enables businesses to discover, enhance, and productize their data, fostering the creation of automated data platforms that adhere to the highest standards of data governance. Want to know more about Witboost? Check it out here or contact us!
This repository is part of our Starter Kit meant to showcase Witboost's integration capabilities and provide a "batteries-included" product.
This project provisions an Azure Storage Account and one or more ADLS Gen2 containers inside it.
The Terraform configuration files can be found inside the files folder.
It's a module ready to be used by our OSS Terraform Specific Provisioner.
The Resource Group in which the Storage Account is created must already exist.
To authenticate on Azure using a Service Principal with a Client Secret, the following env variables need to be defined:
- ARM_CLIENT_ID - The Client ID of the Service Principal.
- ARM_CLIENT_SECRET - The Client Secret of the Service Principal.
- ARM_SUBSCRIPTION_ID - The Subscription ID in which the Storage Account exists.
- ARM_TENANT_ID - The Tenant ID in which the Subscription exists.
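Before starting a provisioning run, it can help to confirm that all four variables are present. The following is a minimal sketch, not part of this repository: the helper name `missing_arm_vars` is hypothetical, and only the variable names come from the list above.

```python
import os

# Variables named in this README for Service Principal authentication.
REQUIRED_ARM_VARS = (
    "ARM_CLIENT_ID",
    "ARM_CLIENT_SECRET",
    "ARM_SUBSCRIPTION_ID",
    "ARM_TENANT_ID",
)


def missing_arm_vars(env=None):
    """Return the required variables that are unset or empty.

    Hypothetical helper for illustration; `env` defaults to the
    process environment and can be replaced by any mapping.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_ARM_VARS if not env.get(name)]
```

Calling `missing_arm_vars()` with no arguments inspects the current process environment; an empty list means every variable is defined.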
terraform {
  "urn:dmb:utm:azure-storage-adlsgen2-template:0.0.0" {
    repositoryPath: "/tf/adlsgen2"
    descriptorToVariablesMapping: {
      dp_domain = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.component.dpDomain"
      dp_name_major_version = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.component.dpNameMajorVersion"
      component_name = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.component.name"
      resource_group = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.resourceGroup"
      environment = "$.dataProduct.environment"
      account_tier = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.performance"
      account_replication_type = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.redundancy"
      access_tier = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.accessTier"
      infrastructure_encryption_enabled = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.infrastructureEncryptionEnabled"
      allow_nested_items_to_be_public = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.allowNestedItemsToBePublic"
      containers = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.containers"
    }
    principalMappingPlugin {
      pluginClass: "it.agilelab.plugin.principalsmapping.impl.azure.AzureMapperFactory"
      azure: {
        tenantId: ${?PRINCIPAL_MAPPING_TENANT_ID}
        clientId: ${?PRINCIPAL_MAPPING_CLIENT_ID}
        clientSecret: ${?PRINCIPAL_MAPPING_CLIENT_SECRET}
      }
    }
    backendConfigs: {
      stateKey = "key"
      configs = {
        key = "$.dataProduct.components[?(@.id == '{{componentIdToProvision}}')].specific.state.key"
      }
    }
  }
}
To learn more about the configuration parameters, please refer to the documentation of the Terraform Specific Provisioner.
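To make the descriptorToVariablesMapping entries above more concrete, the sketch below mimics what the JSONPath expressions select, using plain Python instead of a JSONPath engine. It is an illustration under assumptions, not the provisioner's implementation: the function names, the example descriptor, its component id, and every value in it are hypothetical. Only the field names are taken from the configuration.

```python
def component_specific(descriptor, component_id):
    """Mimic `$.dataProduct.components[?(@.id == '<id>')].specific`."""
    matches = [
        component["specific"]
        for component in descriptor["dataProduct"]["components"]
        if component.get("id") == component_id
    ]
    if len(matches) != 1:
        raise ValueError(
            f"expected one component with id {component_id!r}, found {len(matches)}"
        )
    return matches[0]


def terraform_variables(descriptor, component_id):
    """Collect the Terraform variables listed in the mapping above."""
    specific = component_specific(descriptor, component_id)
    return {
        "dp_domain": specific["component"]["dpDomain"],
        "dp_name_major_version": specific["component"]["dpNameMajorVersion"],
        "component_name": specific["component"]["name"],
        "resource_group": specific["resourceGroup"],
        "environment": descriptor["dataProduct"]["environment"],
        "account_tier": specific["performance"],
        "account_replication_type": specific["redundancy"],
        "access_tier": specific["accessTier"],
        "infrastructure_encryption_enabled": specific["infrastructureEncryptionEnabled"],
        "allow_nested_items_to_be_public": specific["allowNestedItemsToBePublic"],
        "containers": specific["containers"],
    }


# Hypothetical descriptor: all ids and values are invented for illustration.
EXAMPLE_COMPONENT_ID = "urn:dmb:cmp:finance:sales:0:raw-storage"
EXAMPLE_DESCRIPTOR = {
    "dataProduct": {
        "environment": "development",
        "components": [
            {
                "id": EXAMPLE_COMPONENT_ID,
                "specific": {
                    "component": {
                        "dpDomain": "finance",
                        "dpNameMajorVersion": "sales_0",
                        "name": "raw-storage",
                    },
                    "resourceGroup": "example-rg",
                    "performance": "Standard",
                    "redundancy": "LRS",
                    "accessTier": "Hot",
                    "infrastructureEncryptionEnabled": False,
                    "allowNestedItemsToBePublic": False,
                    "containers": ["raw", "curated"],
                },
            }
        ],
    }
}
```

With this descriptor, `terraform_variables(EXAMPLE_DESCRIPTOR, EXAMPLE_COMPONENT_ID)` returns one entry per key of descriptorToVariablesMapping. Note that `environment` is read from the data product itself, while every other variable comes from the `specific` section of the component being provisioned.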
We recommend using a separate Storage Account to store the Terraform remote state for this module.
The module is packaged as a Helm Chart. It must be deployed as part of an Umbrella Chart along with the Terraform Specific Provisioner and, optionally, other modules.
Key | Type | Default | Description |
---|---|---|---|
enabled | bool | true | Include the TF module to create the Storage Account |
state | object | {"container_name":"","resource_group_name":"","storage_account_name":""} | Terraform Remote State configuration |
state.container_name | string | "" | The Name of the Storage Container within the Storage Account |
state.resource_group_name | string | "" | The Name of the Resource Group in which the Storage Account exists |
state.storage_account_name | string | "" | The Name of the Storage Account |
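The three state values in the table above share their names with settings of Terraform's azurerm backend, which also takes a `key` identifying the state file. As a rough sketch of how they fit together, the hypothetical helper below turns them into `-backend-config` arguments for `terraform init`. How the Terraform Specific Provisioner actually wires these values is not shown in this README, so treat this as an assumption; the example values in the usage note are invented.

```python
def backend_config_args(state, state_key):
    """Build `terraform init` arguments for an azurerm backend.

    `state` mirrors the chart's `state` object; `state_key` is the
    per-component state file key. Hypothetical helper for illustration.
    """
    settings = {
        "resource_group_name": state["resource_group_name"],
        "storage_account_name": state["storage_account_name"],
        "container_name": state["container_name"],
        "key": state_key,
    }
    # The chart defaults are empty strings, so reject unset values early.
    empty = [name for name, value in settings.items() if not value]
    if empty:
        raise ValueError("empty backend settings: " + ", ".join(empty))
    return [f"-backend-config={name}={value}" for name, value in settings.items()]
```

For example, passing `{"container_name": "tfstate", "resource_group_name": "rg-state", "storage_account_name": "sttfstate"}` together with a key such as `"sales/raw.tfstate"` yields four arguments that could be appended to `terraform init`. Leaving any chart value at its default empty string raises a `ValueError`.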
This project is available under the Apache License, Version 2.0; see LICENSE for full details.
Agile Lab creates value for its Clients in data-intensive environments through customizable solutions to establish performance-driven processes, sustainable architectures, and automated platforms driven by data governance best practices.
Since 2014 we have implemented 100+ successful Elite Data Engineering initiatives and used that experience to create Witboost: a technology-agnostic, modular platform that empowers modern enterprises to discover, elevate and productize their data both in traditional environments and on fully compliant Data Mesh architectures.