Adding metrics for MaxUnavailable feature in StatefulSet #130951


Open · wants to merge 2 commits into master

Conversation

@Edwinhr716 commented Mar 20, 2025

What type of PR is this?

/kind feature

What this PR does / why we need it:

Adds a metric to track how many times a maxUnavailable violation has occurred, a requirement for the beta graduation of kubernetes/enhancements#961.
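
For context, here is a minimal sketch of how such a counter could be declared with k8s.io/component-base/metrics. The subsystem, metric name, and help text below are illustrative assumptions; only the namespace/name labels and the BETA stability level reflect what is discussed in this PR.

package metrics

import (
	"k8s.io/component-base/metrics"
	"k8s.io/component-base/metrics/legacyregistry"
)

// MaxUnavailableViolations counts how many times a StatefulSet exceeded its
// configured maxUnavailable during an update. The name, subsystem, and help
// text are placeholders, not necessarily the values used by this PR.
var MaxUnavailableViolations = metrics.NewCounterVec(
	&metrics.CounterOpts{
		Subsystem:      "statefulset_controller",
		Name:           "max_unavailable_violations_total",
		Help:           "Number of times a StatefulSet exceeded its maxUnavailable setting.",
		StabilityLevel: metrics.BETA,
	},
	[]string{"namespace", "name"},
)

func init() {
	// Expose the counter on the controller-manager's /metrics endpoint.
	legacyregistry.MustRegister(MaxUnavailableViolations)
}

The controller then increments it with MaxUnavailableViolations.WithLabelValues(set.Namespace, set.Name).Inc() whenever it detects a violation (see the diff excerpt further down in this thread).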

Which issue(s) this PR fixes:

Part of kubernetes/enhancements#961

Special notes for your reviewer:

This is a follow-up to the discussion on the KEP update PR kubernetes/enhancements#4474 (comment).

The general consensus seems to be that this metric should live in-tree rather than in kube-state-metrics.

Open question:

  • Should the metric be generic, like the one exposed by Deployments?

cc @atiratree @dgrisonnet @wojtek-t who were part of the original discussion.

Does this PR introduce a user-facing change?

Adds a metric for the StatefulSet MaxUnavailable feature

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:


@k8s-ci-robot k8s-ci-robot added kind/feature Categorizes issue or PR as related to a new feature. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. do-not-merge/release-note-label-needed Indicates that a PR should not merge because it's missing one of the release note labels. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. labels Mar 20, 2025
@k8s-ci-robot k8s-ci-robot added sig/apps Categorizes an issue or PR as relevant to SIG Apps. and removed do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Mar 20, 2025
@github-project-automation github-project-automation bot moved this to Needs Triage in SIG Apps Mar 20, 2025
@k8s-ci-robot k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. and removed do-not-merge/release-note-label-needed Indicates that a PR should not merge because it's missing one of the release note labels. labels Mar 20, 2025
@Edwinhr716 (Author)

/assign @janetkuo @soltysh

@k8s-ci-robot k8s-ci-robot added area/stable-metrics Issues or PRs involving stable metrics sig/instrumentation Categorizes an issue or PR as relevant to SIG Instrumentation. labels Mar 20, 2025
@janetkuo (Member)

/triage accepted

@k8s-ci-robot k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Mar 20, 2025
@Edwinhr716 (Author)

/retest

@k8s-triage-robot

This PR may require stable metrics review.

Stable metrics are guaranteed to not change. Please review the documentation for the requirements and lifecycle of stable metrics and ensure that your metrics meet these guidelines.

@dims (Member) commented Mar 24, 2025

cc @xiaohongchen1991

@janetkuo (Member)

LGTM in general, once the presubmit check failure is fixed. @soltysh, would you like to take a look as well?

@janetkuo (Member) left a comment

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Apr 14, 2025
@k8s-ci-robot (Contributor)

LGTM label has been added.

Git tree hash: 30677f32c92a34a1acfbc1b72625aa1bb7a63803

@dgrisonnet (Member)

/assign

@soltysh (Contributor) left a comment

I believe this is a good starting point. I see Damien will look at it from the instrumentation point of view.

/lgtm
/approve

@soltysh (Contributor) commented Apr 17, 2025

/triage accepted
/priority important-longterm

@k8s-ci-robot k8s-ci-robot added the priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. label Apr 17, 2025
@k8s-ci-robot (Contributor)

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: Edwinhr716, janetkuo, soltysh
Once this PR has been reviewed and has the lgtm label, please ask for approval from dgrisonnet. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot removed the needs-priority Indicates a PR lacks a `priority/foo` label and requires one. label Apr 17, 2025
Comment on lines 748 to +753
	"statefulSet", klog.KObj(set),
	"unavailablePods", unavailablePods,
	"maxUnavailable", maxUnavailable)
if unavailablePods > maxUnavailable {
	metrics.MaxUnavailableViolations.WithLabelValues(set.Namespace, set.Name).Inc()
}
@atiratree (Member) commented Apr 21, 2025

This is always triggered by Parallel StatefulSets multiple times during the initial rollout (depending on the number of replicas, minReadySeconds), so I am not sure how useful this metric is.

IMO, we should remove the logging here when we graduate to beta, or at least make it less verbose (log level 4?), to prevent the spam.

Also, during the OrderedReady rollout there is a period of time where we have unavailable pods, but we don't log that. We also don't notice loss of availability in future StatefulSet updates.

@Edwinhr716 (Author)

This is always triggered by Parallel StatefulSets multiple times during the initial rollout

By this you mean if PodManagementPolicy is set to Parallel? If so, could you expand on this? I don't see how it is triggered multiple times. There will be at most maxUnavailable unavailable pods, no? Even when using minReadySeconds?

IMO, we should remove the logging here when we graduate to beta, or at least make it less verbose (log level 4?), to prevent the spam.

Makes sense, especially if we keep the metric.

Also, during the OrderedReady rollout there is a period of time where we have unavailable pods, but we don't log that

Are you suggesting we log it? Wouldn't that just be logging it every time there is an unavailable pod?

Contributor

This is always triggered by Parallel StatefulSets multiple times during the initial rollout (depending on the number of replicas, minReadySeconds), so I am not sure how useful this metric is.

This part is being fixed in #130909, where we're only missing a unit test to properly account for unavailable pods with minReadySeconds taken into account.

IMO, we should remove the logging here when we graduate to beta, or at least make it less verbose (log level 4?), to prevent the spam.

That seems reasonable.

Also, during the OrderedReady rollout there is a period of time where we have unavailable pods, but we don't log that. We also don't notice loss of availability in future StatefulSet updates.

We don't strive to log how long the pods will be unavailable. As you pointed out in your first question, this will vary from one StatefulSet to another, and by the nature of StatefulSets it's hard to use that as a reasonable metric. This was discussed several times in the past.

Member

+1 to remove those logs. Even if we're keeping the logs, we don't need to log when unavailablePods == maxUnavailable given that it's a valid case.

@atiratree (Member) commented Apr 28, 2025

By this you mean if PodManagementPolicy is set to Parallel? If so, could you expand on this? I don't see how it is triggered multiple times. There will be at most maxUnavailable unavailable pods, no? Even when using minReadySeconds?

Yes, in case PodManagementPolicy is set to Parallel and the MaxUnavailableStatefulSet feature gate is enabled. It happens before the StatefulSet reconciles to the final state.

Yes, but it takes time for all the pods to reach minReadySeconds.

This part is being fixed in #130909, where we're only missing a unit test to properly account for unavailable pods with minReadySeconds taken into account.

Even when I test it with #130909, it still happens. The Parallel policy hits this point roughly 18 times for a StatefulSet with 5 pods for me. How many reconciles we hit depends on the kubelet/apiserver and other variables.

Also, during the OrderedReady rollout there is a period of time where we have unavailable pods, but we don't log that

Are you suggesting we log it? Wouldn't that just be logging it every time there is an unavailable pod?

We don't strive to log how long the pods will be unavailable. As you pointed out in your first question, this will vary from one StatefulSet to another, and by the nature of StatefulSets it's hard to use that as a reasonable metric. This was discussed several times in the past.

I do not think we necessarily have to log that. I'm just saying there is a difference between OrderedReady and Parallel.

@Edwinhr716 (Author) commented May 15, 2025

Yes, but it takes time for all the pods to reach minReadySeconds.

Sure, but we take that into account when we determine how many pods are unavailable, since they are only counted as available once minReadySeconds has passed. So when we determine how many pods to delete

podsToDelete := maxUnavailable - unavailablePods
it shouldn't delete more pods (causing them to become unavailable) unless more than maxUnavailable pods are available after minReadySeconds has passed.
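
To make the arithmetic above concrete, here is a small, self-contained sketch. The pod struct and deletionBudget helper are illustrative stand-ins, not the controller's real types; only the podsToDelete := maxUnavailable - unavailablePods expression comes from the code quoted above.

package main

import "fmt"

// pod is a simplified stand-in for a StatefulSet pod; available is true only
// once the pod has been ready for at least minReadySeconds (assumption: the
// real controller derives this from pod conditions).
type pod struct {
	name      string
	available bool
}

// deletionBudget mirrors the quoted arithmetic: pods may only be taken down
// while the number of unavailable pods stays below maxUnavailable.
func deletionBudget(pods []pod, maxUnavailable int) int {
	unavailablePods := 0
	for _, p := range pods {
		if !p.available {
			unavailablePods++
		}
	}
	if unavailablePods >= maxUnavailable {
		return 0 // no budget left; wait for pods to pass minReadySeconds
	}
	return maxUnavailable - unavailablePods
}

func main() {
	pods := []pod{
		{"web-0", true}, {"web-1", true}, {"web-2", false},
		{"web-3", true}, {"web-4", true},
	}
	// With maxUnavailable=2 and one pod still unavailable, only one more
	// pod may be deleted in this reconcile.
	fmt.Println(deletionBudget(pods, 2)) // prints 1
}

In other words, as long as the availability check honors minReadySeconds, the controller's own deletions should never push the unavailable count past maxUnavailable.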

@atiratree (Member)

/hold
for #130951 (comment)

@k8s-ci-robot k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Apr 21, 2025
@k8s-ci-robot k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Apr 23, 2025
@k8s-ci-robot (Contributor)

New changes are detected. LGTM label has been removed.

@Edwinhr716 (Author)

/retest

@k8s-ci-robot (Contributor)

@Edwinhr716: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name                Commit   Details  Required  Rerun command
pull-kubernetes-e2e-gce  e9b66a0  link     true      /test pull-kubernetes-e2e-gce

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@soltysh (Contributor) commented Apr 28, 2025

/hold
for #130951 (comment)

I believe the comment is being addressed in #130909.

/hold cancel

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Apr 28, 2025
@atiratree (Member)

Added additional info: #130951 (comment)

@richabanker (Contributor)

/assign

@soltysh (Contributor) commented Jun 5, 2025

Added additional info: #130951 (comment)

Filip's comment is on point; this will need to be addressed.

@soltysh (Contributor) commented Jun 5, 2025

/assign

@richabanker any feedback from the sig-instrumentation pov?

@richabanker (Contributor)

/assign

@richabanker any feedback from the sig-instrumentation pov?

Whoops, super sorry for the late reply. I think I was just curious: why is the new metric starting off at the BETA stabilityLevel?

Labels
area/stable-metrics Issues or PRs involving stable metrics cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/feature Categorizes issue or PR as related to a new feature. priority/important-longterm Important over the long term, but may not be staffed and/or may need multiple releases to complete. release-note Denotes a PR that will be considered when it comes time to generate release notes. sig/apps Categorizes an issue or PR as relevant to SIG Apps. sig/instrumentation Categorizes an issue or PR as relevant to SIG Instrumentation. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. triage/accepted Indicates an issue or PR is ready to be actively worked on.
Projects
Status: Needs Triage
Development

Successfully merging this pull request may close these issues.

9 participants