Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] org.opensearch.recovery.RecoveryWhileUnderLoadIT.testRecoverWhileUnderLoadAllocateReplicasRelocatePrimariesTest {p0={"cluster.indices.replication.strategy":"DOCUMENT"}} is flaky #13820

Closed
reta opened this issue May 24, 2024 · 2 comments
Labels
bug Something isn't working flaky-test Random test failure that succeeds on second run Storage Issues and PRs relating to data and metadata storage

Comments

@reta
Copy link
Collaborator

reta commented May 24, 2024

Describe the bug

The test case org.opensearch.recovery.RecoveryWhileUnderLoadIT.testRecoverWhileUnderLoadAllocateReplicasRelocatePrimariesTest {p0={"cluster.indices.replication.strategy":"DOCUMENT"}} is flaky:

java.lang.AssertionError: All incoming requests on node [node_s4] should have finished. Expected 0 but got 368; pending tasks [[{
  "node" : "node_s4",
  "id" : 76,
  "type" : "direct",
  "action" : "indices:admin/seq_no/retention_lease_sync[p]",
  "status" : {
    "phase" : "primary"
  },
  "description" : "retention_lease_sync shardId=[test][3]",
  "start_time" : "2024-05-24T17:25:25.617Z",
  "start_time_in_millis" : 1716571525617,
  "running_time" : "1m",
  "running_time_in_nanos" : 65752135745,
  "cancellable" : false,
  "cancelled" : false,
  "parent_task_id" : "MWBqDHQmSkefpKFbNwRdHg:75",
  "headers" : { },
  "resource_stats" : {
    "average" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "total" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "min" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "max" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "thread_info" : {
      "thread_executions" : 0,
      "active_threads" : 0
    }
  }
},{
  "node" : "node_s4",
  "id" : 57,
  "type" : "transport",
  "action" : "internal:index/shard/recovery/start_recovery",
  "description" : "",
  "start_time" : "2024-05-24T17:25:25.549Z",
  "start_time_in_millis" : 1716571525549,
  "running_time" : "1m",
  "running_time_in_nanos" : 65820120649,
  "cancellable" : false,
  "cancelled" : false,
  "headers" : { },
  "resource_stats" : {
    "average" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "total" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "min" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "max" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "thread_info" : {
      "thread_executions" : 0,
      "active_threads" : 0
    }
  }
},{
  "node" : "node_s4",
  "id" : 75,
  "type" : "transport",
  "action" : "retention_lease_sync",
  "status" : {
    "phase" : "starting"
  },
  "description" : "retention_lease_sync shardId=[test][3]",
  "start_time" : "2024-05-24T17:25:25.617Z",
  "start_time_in_millis" : 1716571525617,
  "running_time" : "1m",
  "running_time_in_nanos" : 65752504066,
  "cancellable" : false,
  "cancelled" : false,
  "headers" : { },
  "resource_stats" : {
    "average" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "total" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "min" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "max" : {
      "cpu_time_in_nanos" : 0,
      "memory_in_bytes" : 0
    },
    "thread_info" : {
      "thread_executions" : 0,
      "active_threads" : 0
    }
  }
}]]

Related component

Storage

To Reproduce

 ./gradlew ':server:internalClusterTest' --tests "org.opensearch.recovery.RecoveryWhileUnderLoadIT" -Dtests.method="testRecoverWhileUnderLoadAllocateReplicasRelocatePrimariesTest {p0={"cluster.indices.replication.strategy":"DOCUMENT"}}" -Dtests.seed=FE4B39551EA78350

Expected behavior

The test must always pass

Additional Details

Plugins
Please list all plugins currently enabled.

Screenshots
Standard

Host/Environment (please complete the following information):

  • CI

Additional context

@reta reta added bug Something isn't working untriaged flaky-test Random test failure that succeeds on second run labels May 24, 2024
@github-actions github-actions bot added the Storage Issues and PRs relating to data and metadata storage label May 24, 2024
@sachinpkale
Copy link
Member

[Storage Triage - attendees 1 2 3 4 5 6 7 8 9 10 ]

Moving to This Quarter

@sachinpkale sachinpkale moved this from 🆕 New to Now(This Quarter) in Storage Project Board May 30, 2024
@sachinpkale
Copy link
Member

Not able to repro on local even after 1000+ iterations. Closing.

@github-project-automation github-project-automation bot moved this from Now(This Quarter) to ✅ Done in Storage Project Board Jan 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working flaky-test Random test failure that succeeds on second run Storage Issues and PRs relating to data and metadata storage
Projects
Status: ✅ Done
Development

No branches or pull requests

3 participants