-
Notifications
You must be signed in to change notification settings - Fork 38.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Azure] Issue 4230: remove readiness check for cache exclusion #119128
[Azure] Issue 4230: remove readiness check for cache exclusion #119128
Conversation
002c0d7
to
5175752
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/triage accepted
/lgtm
LGTM label has been added. Git tree hash: 2526a48418f0c4c1db5c81ca63e6c78d2ad90063
|
@bridgetkromhout could you help to approve this fix? |
/approve |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: alexanderConstantinescu, bridgetkromhout, cheftako, feiskyer The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/priority important-soon |
from SIG meeting today, please do not backport to 1.25 |
Please do not backport to 1.25 (1.26 is fine) |
/milestone 1.28 |
@gracenng: The provided milestone is not valid for this repository. Milestones in this repository: [ Use In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
woops |
/retest |
So do we need to back-port it here for this to get into AKS 1.26 / 1.27, @cheftako / @feiskyer ? I asked on the original PR in cloud-provider-azure, here: kubernetes-sigs/cloud-provider-azure#4234 (comment) but didn't get an answer. It's been back-ported in that repo, but I have no clue what the dependency tree is for AKS. |
Yes, backport is needed and I did it for you. Please check. #119931 #119932 |
…19128-upstream-release-1.27 Automated cherry pick of #119128: Issue 4230: remove readiness check for cache exclusion
…19128-upstream-release-1.26 Automated cherry pick of #119128: Issue 4230: remove readiness check for cache exclusion
What type of PR is this?
/kind bug
What this PR does / why we need it:
Which issue(s) this PR fixes:
Fixes kubernetes-sigs/cloud-provider-azure#4230
Issue kubernetes-sigs/cloud-provider-azure#4230 explains things in greater depth, but here's the TL;DR
#109706 stopped syncing load balancers when changes are observed to the readiness state of the Node object. Load balancers are essentially almost only re-synced whenever a Node is: added, deleted, has the exclusion label added. The service controller expectation is that the cloud-providers just accept the list of nodes provided through the
UpdateLoadBalancerHost
call and try to configure the load balancers with the entire set of nodes provided. Given that Azure performs additional filtering of the nodes and excludesNotReady
nodes, there will be situations where the node which just transitioned toReady
isn't added to the load balancer node set, because the service controller doesn't re-sync load balancers due to the readiness state change.This patch removes this additional filtering of the readiness state so that all
NotReady
nodes get added to the load balancer set. The health check probes used by the load balancer will then determine which nodes should be used for traffic load balancing.Please have a look at the referenced enhancement proposal which merged in 1.26: kubernetes/enhancements#3458
Special notes for your reviewer:
Does this PR introduce a user-facing change?
Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:
/sig cloud-provider
/assign @feiskyer