Enable watch termination grace period (redux).#2200
Conversation
This should mostly eliminate aligned informer re-lists during kube-apiserver rollout.
|
Skipping CI for Draft Pull Request. |
|
No actionable comments were generated in the recent review. 🎉 ℹ️ Recent review info⚙️ Run configurationConfiguration used: Repository: openshift/coderabbit/.coderabbit.yaml Review profile: CHILL Plan: Enterprise Run ID: 📒 Files selected for processing (1)
WalkthroughAdds a new ChangesKubeAPIServer Default Config Update
Estimated code review effort🎯 1 (Trivial) | ⏱️ ~2 minutes 🚥 Pre-merge checks | ✅ 13 | ❌ 2❌ Failed checks (2 warnings)
✅ Passed checks (13 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
/payload-aggregate periodic-ci-openshift-release-master-ci-5.0-upgrade-from-stable-4.22-e2e-aws-ovn-upgrade 10 |
|
@benluddy: trigger 0 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command |
|
/payload 5.0 ci blocking |
|
@benluddy: trigger 5 job(s) of type blocking for the ci release of OCP 5.0
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/50caa6f0-69c8-11f1-94f7-de45d6e2e667-0 |
|
xref #1862 (comment) |
|
From https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-cluster-kube-apiserver-operator-2200-ci-5.0-upgrade-from-stable-4.22-e2e-aws-ovn-upgrade/2066992765111111680, it appears the disruption samplers don't honor 429 responses with Retry-After the same way client-go's REST clients do. Since this change is allowing a few seconds to drain watches after the shutdown delay duration has elapsed, we're hitting the WithRetryAfter filter that is meant to shunt clients to other API servers. |
|
/payload-with-prs 5.0 ci blocking openshift/origin#31311 |
|
@benluddy: trigger 5 job(s) of type blocking for the ci release of OCP 5.0
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/ac034dc0-6a58-11f1-9862-86893dca838b-0 |
|
/test images |
| shutdown-send-retry-after: | ||
| - "true" | ||
| shutdown-watch-termination-grace-period: | ||
| - 5s # shorter than the pod graceful termination period on single-node |
There was a problem hiding this comment.
Being higher extends the overall termination period? I guess if we find that 5s is insufficient here we can explore something like the controller that adjusts goaway-chance based on SNO vs HA.
This should mostly eliminate aligned informer re-lists during kube-apiserver rollout.
Summary by CodeRabbit