Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Defining a field in both forProvider and initProvider may cause an infinite update loop #299

Open
Tracked by #4952
mergenci opened this issue Nov 15, 2023 · 0 comments
Labels
bug Something isn't working reconciler v2
Milestone

Comments

@mergenci
Copy link
Member

mergenci commented Nov 15, 2023

What happened?

I discovered this bug while working on supporting initProvider-related behavior in no-fork external client. An infinite update loop occurs when the following conditions hold at the same time:

  1. A field is defined in both initProvider and forProvider,
  2. The field corresponds to a Terraform schema.TypeSet, such as managed_policy_arns,
  3. The field in initProvider has more elements than corresponding one in forProvider.

Provider reports “Cannot observe external resource” and an event is published with message: “cannot run refresh: refresh failed: Invalid index: Elements of a set are identified only by their value and don't have any separate index or key to select with, so it's only possible to perform operations across all elements of the set.”

The problem stems from the fact that Upjet puts initProvider-exclusive fields into Terraform's ignore_changes lifecycle meta argument. Even though not explicitly specified, Terraform documentation suggests that ignore_changes doesn't support sets:

Map and list elements can be referenced using index notation, like tags["Name"] and list[0] respectively.

Currently, we use index notation, like set[0], for sets as well.

I can't think of an easy resolution. We cannot ignore changes in the whole set, because doing so would prevent the values defined in forProvider to take effect. We can let forProvider overwrite initProvider, in which case we should decide whether we do so for lists and maps as well.

How can we reproduce it?

We will create an IAM Role to reproduce the issue. Because we will use managed_policy_arns that refers to IAM Policy resources, we will begin with creating IAM Policies.

  1. Create two IAM Policies by applying the following configuration:
apiVersion: iam.aws.upbound.io/v1beta1
kind: Policy
metadata:
  annotations:
    meta.upbound.io/example-id: iam/v1beta1/policy
  labels:
    testing.upbound.io/example-name: policy
  name: test-upjet-bug-iam-policy-1
spec:
  forProvider:
    policy: |
      {
        "Version": "2012-10-17",
        "Statement": [
          {
              "Sid": "VisualEditor0",
              "Effect": "Allow",
              "Action": "elastic-inference:Connect",
              "Resource": "*"
          }
        ]
      }
---
apiVersion: iam.aws.upbound.io/v1beta1
kind: Policy
metadata:
  annotations:
    meta.upbound.io/example-id: iam/v1beta1/policy
  labels:
    testing.upbound.io/example-name: policy
  name: test-upjet-bug-iam-policy-2
spec:
  forProvider:
    policy: |
      {
        "Version": "2012-10-17",
        "Statement": [
          {
              "Sid": "VisualEditor0",
              "Effect": "Allow",
              "Action": "elastic-inference:Connect",
              "Resource": "*"
          }
        ]
      }
  1. After the policies are created, get their ARN:
$ kubectl get policy.iam test-upjet-bug-iam-policy-1 -o=jsonpath='{.status.atProvider.arn}'
arn:aws:iam::<redacted>:policy/test-upjet-bug-iam-policy-1

$ kubectl get policy.iam test-upjet-bug-iam-policy-2 -o=jsonpath='{.status.atProvider.arn}'
arn:aws:iam::<redacted>:policy/test-upjet-bug-iam-policy-2
  1. Finally, create the IAM role, as follows, using the ARNs you got in the previous step. Note that it's important for initProvider.managedPolicyArns to have more elements than forProvider.managedPolicyArns, to be able to reproduce the bug.
apiVersion: iam.aws.upbound.io/v1beta1
kind: Role
metadata:
  labels:
    testing.upbound.io/example-name: realtimelogconfig
  name: test-upjet-bug-iam-role
spec:
  initProvider:
    managedPolicyArns:
      - "arn:aws:iam::<redacted>:policy/test-upjet-bug-iam-policy-1"
      - "arn:aws:iam::<redacted>:policy/test-upjet-bug-iam-policy-2"
  forProvider:
    managedPolicyArns:
      - "arn:aws:iam::<redacted>:policy/test-upjet-bug-iam-policy-1"
    assumeRolePolicy: |
      {
        "Version": "2012-10-17",
        "Statement": [
          {
            "Action": "sts:AssumeRole",
            "Principal": {
              "Service": "cloudfront.amazonaws.com"
            },
            "Effect": "Allow"
          }
        ]
      }
  1. Check out provider logs to see the update loop.
  2. Check out events to see:
NAMESPACE   LAST SEEN   TYPE      REASON                          OBJECT                               MESSAGE
default     94s         Warning   CannotObserveExternalResource   role/test-upjet-bug-iam-role         cannot run refresh: refresh failed: Invalid index: Elements of a set are identified only by their value and don't have any separate index or key to select with, so it's only possible to perform operations across all elements of the set.

Related Issues

#298 (duplicate of crossplane-contrib/provider-upjet-aws#946), #295

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working reconciler v2
Projects
None yet
Development

No branches or pull requests

2 participants