This stage allows creation and management of a fleet of GKE multitenant clusters, optionally leveraging GKE Hub to configure additional features. It's designed to be replicated once for every homogeneous set of clusters, either per environment or with more granularity as needed (e.g. teams or sets of teams sharing similar requirements).
The following diagram illustrates the high-level design of created resources, which can be adapted to specific requirements via variables:
The detailed architecture of the underlying resources is explained in the documentation of GKE multitenant module.
This stage creates a project containing and as many clusters and node pools as requested by the user through the variables explained below. The GKE clusters are created with the with the following setup:
- All clusters are assumed to be private, therefore only VPC-native clusters are supported.
- Logging and monitoring configured to use Cloud Operations for system components and user workloads.
- GKE metering enabled by default and stored in a bigquery dataset created within the project.
- Optional GKE Fleet support with the possibility to enable any of the following features:
- Support for Config Sync, Hierarchy Controller, and Policy Controller when using Anthos Config Management.
- Groups for GKE can be enabled to facilitate the creation of flexible RBAC policies referencing group principals.
- Support for application layer secret encryption.
- Support to customize peering configuration of the control plane VPC (e.g. to import/export routes to the peered network)
- Some features are enabled by default in all clusters:
This stage is meant to be executed after "foundational stages" (i.e., stages 00-bootstrap
, 01-resman
, 02-networking (either VPN or NVA) and 02-security
) have been run.
It's of course possible to run this stage in isolation, by making sure the architectural prerequisites are satisfied (e.g., networking), and that the Service Account running the stage is granted the roles/permissions below:
- on the organization or network folder level
roles/xpnAdmin
or a custom role which includes the following permissionscompute.organizations.enableXpnResource
,compute.organizations.disableXpnResource
,compute.subnetworks.setIamPolicy
,
- on each folder where projects are created
roles/logging.admin
roles/owner
roles/resourcemanager.folderAdmin
roles/resourcemanager.projectCreator
roles/xpnAdmin
- on the host project for the Shared VPC
roles/browser
roles/compute.viewer
- on the organization or billing account
roles/billing.admin
The VPC host project, VPC and subnets should already exist.
If you're running this on top of FAST, you should run the following commands to create the providers file, and populate the required variables from the previous stage.
# Variable `outputs_location` is set to `~/fast-config` in stage 01-resman
$ cd fabric-fast/stages/03-gke-multitenant/dev
ln -s ~/fast-config/providers/03-gke-dev-providers.tf .
There are two broad sets of variables you will need to fill in:
- variables shared by other stages (organization id, billing account id, etc.), or derived from a resource managed by a different stage (folder id, automation project id, etc.)
- variables specific to resources managed by this stage
To avoid the tedious job of filling in the first group of variables with values derived from other stages' outputs, the same mechanism used above for the provider configuration can be used to leverage pre-configured .tfvars
files.
If you configured a valid path for outputs_location
in the bootstrap and networking stage, simply link the relevant terraform-*.auto.tfvars.json
files from this stage's outputs folder (under the path you specified), where the *
above is set to the name of the stage that produced it. For this stage, a single .tfvars
file is available:
# Variable `outputs_location` is set to `~/fast-config`
ln -s ~/fast-config/tfvars/00-bootstrap.auto.tfvars.json .
ln -s ~/fast-config/tfvars/01-resman.auto.tfvars.json .
ln -s ~/fast-config/tfvars/02-networking.auto.tfvars.json .
If you're not using FAST, refer to the Variables table at the bottom of this document for a full list of variables, their origin (e.g., a stage or specific to this one), and descriptions explaining their meaning.
This stage is designed with multi-tenancy in mind, and the expectation is that GKE clusters will mostly share a common set of defaults. Variables are designed to support this approach for both clusters and node pools:
- the
cluster_default
variable allows defining common defaults for all clusters - the
clusters
variable is used to declare the actual GKE clusters and allows overriding defaults on a per-cluster basis - the
nodepool_defaults
variable allows definining common defaults for all node pools - the
nodepools
variable is used to declare cluster node pools and allows overriding defaults on a per-cluster basis
There are two additional variables that influence cluster configuration: authenticator_security_group
to configure Google Groups for RBAC, dns_domain
to configure Cloud DNS for GKE.
Fleet management is entirely optional, and uses three separate variables:
fleet_features
: specifies the GKE fleet features you want activatefleet_configmanagement_templates
: defines configuration templates for specific sets of features (Config Management currently)fleet_configmanagement_clusters
: specifies which clusters are managed by fleet features, and the optional Config Management template for each clusterfleet_workload_identity
: to enables optional centralized Workload Identity
Leave all these variables unset (or set to null
) to disable fleet management.
Once the provider and variable configuration is complete, you can apply this stage:
terraform init
terraform apply
name | description | modules | resources |
---|---|---|---|
main.tf | GKE multitenant for development environment. | multitenant-fleet |
|
outputs.tf | Output variables. | google_storage_bucket_object · local_file |
|
variables.tf | Module variables. |
name | description | type | required | default | producer |
---|---|---|---|---|---|
automation | Automation resources created by the bootstrap stage. | object({…}) |
✓ | 00-bootstrap |
|
billing_account | Billing account id and organization id ('nnnnnnnn' or null). | object({…}) |
✓ | 00-bootstrap |
|
clusters | map(object({…})) |
✓ | |||
folder_ids | Folders to be used for the networking resources in folders/nnnnnnnnnnn format. If null, folder will be created. | object({…}) |
✓ | 01-resman |
|
host_project_ids | Host project for the shared VPC. | object({…}) |
✓ | 02-networking |
|
nodepools | map(map(object({…}))) |
✓ | |||
prefix | Prefix used for resources that need unique names. | string |
✓ | ||
vpc_self_links | Self link for the shared VPC. | object({…}) |
✓ | 02-networking |
|
authenticator_security_group | Optional group used for Groups for GKE. | string |
null |
||
cluster_defaults | Default values for optional cluster configurations. | object({…}) |
{…} |
||
dns_domain | Domain name used for clusters, prefixed by each cluster name. Leave null to disable Cloud DNS for GKE. | string |
null |
||
fleet_configmanagement_clusters | Config management features enabled on specific sets of member clusters, in config name => [cluster name] format. | map(list(string)) |
{} |
||
fleet_configmanagement_templates | Sets of config management configurations that can be applied to member clusters, in config name => {options} format. | map(object({…})) |
{} |
||
fleet_features | Enable and configue fleet features. Set to null to disable GKE Hub if fleet workload identity is not used. | object({…}) |
null |
||
fleet_workload_identity | Use Fleet Workload Identity for clusters. Enables GKE Hub if set to true. | bool |
false |
||
group_iam | Project-level authoritative IAM bindings for groups in {GROUP_EMAIL => [ROLES]} format. Use group emails as keys, list of roles as values. | map(list(string)) |
{} |
||
iam | Project-level authoritative IAM bindings for users and service accounts in {ROLE => [MEMBERS]} format. | map(list(string)) |
{} |
||
labels | Project-level labels. | map(string) |
{} |
||
nodepool_defaults | object({…}) |
{…} |
|||
outputs_location | Path where providers, tfvars files, and lists for the following stages are written. Leave empty to disable. | string |
null |
||
project_services | Additional project services to enable. | list(string) |
[] |
name | description | sensitive | consumers |
---|---|---|---|
cluster_ids | Cluster ids. | ||
clusters | Cluster resources. | ✓ | |
project_id | GKE project id. |