Modern Data Platform environment created using terraform
This release of the terraform-modern-data-platform provides a basic Azure Databricks instance and three Azure v2 Storage Accounts to serve as the Bronze, Silver, Gold storage model popular in the Data Lake House model. The ADB instance is secured in this release by removing the public IP address requiring all traffic to go through the Loadbalancer and no direct external traffic allowed. We have also implemented the first of many Azure Policies, restricting the geopolitical location of the Databricks instance to East US 2 and Central US.
- Core Resource Group
- Managed Resource Group (auto generated by Azure Databricks)
- Vnet - in the Core Resource Group but affects resources in the managed resource group
- Public Subnet
- Private Subnet
- NSG that protects both the public & private subnets
- Loadbalancer that points to the Public Subnet
- Public IP address for the Load Balancer
- 3 v2 Storage Accounts for the Modern Data Platform
- The Azure Databricks Service Instance
- VMs spun up when a Databricks cluster is spun up
- Azure Policy restricting the Regions to eastus2 & centralus
- An Azure Subscription
- Terraform
(once per environment)
az account login
./ConfigureAzureForSecureTerraformAccess.sh
(per project/environment switch)
source ../terraform-azure-bootstrap/TerraformAzureBootstrap.sh -f env/dev.tfvars
terraform apply -var-file env/dev.tfvars
terraform apply --var-file env/dev.tfvars