HDDS-15223. Stabilize TestReconAndAdminContainerCLI.testMissingContainer #10258

Open
arunsarin85 wants to merge 2 commits into apache:master from arunsarin85:HDDS-15223

@arunsarin85 arunsarin85 commented May 13, 2026

What changes were proposed in this pull request?

  • The test now waits longer for Replication Manager and Recon to report the same unhealthy container numbers.
  • It only passes when those numbers match on two polls in a row, not a single check.
  • The @Flaky("HDDS-15223") tag was removed from testMissingContainer.

Please describe your PR in detail:
Replication Manager and Recon do not update at exactly the same moment. Under CI load, the test could see them briefly disagree (or “cross” each other) while both sides are still catching up. The old wait was short (40 seconds) and treated a single matching snapshot as success, so the test could either time out before the two sides converged or pass on a momentary match that did not reflect a settled state.

This change only hardens the test:

  • 90 second cap on waiting (still polling every second).
  • Two consecutive successful full comparisons before the test continues.
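
The wait described above can be sketched roughly as follows. This is a minimal, standalone illustration and not the code from the patch: the class name `StableWait`, the method `waitForStable`, and its parameters are hypothetical, and the actual test may be structured differently (for example, around an existing wait utility in the code base). The point is that a single failed poll resets the streak, so only back-to-back matches count.

```java
import java.util.concurrent.TimeoutException;
import java.util.function.BooleanSupplier;

/** Hypothetical helper: wait until a check passes N times in a row. */
public final class StableWait {
  private StableWait() { }

  /**
   * Polls {@code check} every {@code pollMillis} until it has returned true
   * {@code requiredConsecutive} times in a row, or throws once
   * {@code timeoutMillis} has elapsed without reaching that streak.
   */
  public static void waitForStable(BooleanSupplier check, int requiredConsecutive,
      long pollMillis, long timeoutMillis)
      throws TimeoutException, InterruptedException {
    final long start = System.nanoTime();
    final long timeoutNanos = timeoutMillis * 1_000_000L;
    int streak = 0;
    while (true) {
      // A failed comparison resets the streak, so a momentary match
      // followed by a mismatch does not count as success.
      streak = check.getAsBoolean() ? streak + 1 : 0;
      if (streak >= requiredConsecutive) {
        return;
      }
      if (System.nanoTime() - start >= timeoutNanos) {
        throw new TimeoutException("Condition was not stable for "
            + requiredConsecutive + " consecutive polls within "
            + timeoutMillis + " ms");
      }
      Thread.sleep(pollMillis);
    }
  }
}
```

With the values from this PR, a call would look like `StableWait.waitForStable(countsMatch, 2, 1_000, 90_000)`, where `countsMatch` stands for whatever full comparison of the Replication Manager and Recon unhealthy container numbers the test performs.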

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-15223

How was this patch tested?

  1. Triggered the flaky-test-check workflow:
    https://github.com/arunsarin85/ozone/actions/runs/25793920652

@adoroszlai adoroszlai requested a review from devmadhuu May 17, 2026 11:34