Retain default group membership when configuring additional paths on probe health groups #40268

tharakadesilva · 2024-04-09T12:58:01Z

When configuring additional paths for Spring Boot actuator health groups ('readiness', 'liveness', etc.), unrelated health indicators to those groups unexpectedly start failing. This behavior presents risks for production environments, potentially leading to unnecessary restarts or pod terminations.

Impact:

This issue introduces the risk of unnecessary pod restarts or service disruptions within environments using health endpoints for availability checks (e.g., Kubernetes). A failing health indicator on one path could lead to cascading failures within liveness or readiness probes.

Steps to reproduce:

Bootstrap a new project with SpringBoot 3.2.4.

Create a custom health indicator that always reports "DOWN":

@Bean
HealthIndicator myHealthIndicator() {
  return () -> Health.down().build();
}

Add an additional paths to the readiness and liveness groups:

management.endpoint.health.group.readiness.additional-path=server:/readyz
management.endpoint.health.group.liveness.additional-path=server:/livez

Expected behavior (and this works without the additional path configs in step (3)):

/actuator/health - should fail
/actuator/health/readiness - should pass
/readyz - should pass
/actuator/health/liveness - should pass
/livez - should pass

Actual behavior:

/actuator/health - fails
/actuator/health/readiness - fails (unexpectedly)
/readyz - fails (unexpectedly)
/actuator/health/liveness - fails (unexpectedly)
/livez - fails (unexpectedly)

I would have expected readiness and livenss to have failed if I only set the following props:

management.endpoint.health.group.readiness.include=my
management.endpoint.health.group.liveness.include=my

Demo Project (see tests):

demo.zip

Potential Workaround (Temporary):

Avoid using additional-path on health groups where this side-effect could be disruptive.

The text was updated successfully, but these errors were encountered:

philwebb · 2024-04-13T19:17:41Z

This is a little confusing, but what's happening here is when you declare the following:

management.endpoint.health.group.readiness.additional-path=server:/readyz
management.endpoint.health.group.liveness.additional-path=server:/livez

Spring Boot is creating two new groups called readiness and liveness which by default include all health indicators. If you want to only include the liveness and readiness probes you have two options. You can either set the following property to create /readyz and /livez additional paths:

management.endpoint.health.probes.add-additional-paths=true

or you can update the properties to include the correct health indicators:

management.endpoint.health.group.readiness.include=readinessState
management.endpoint.health.group.readiness.additional-path=server:/readyz
management.endpoint.health.group.liveness.include=livenessState
management.endpoint.health.group.liveness.additional-path=server:/livez

philwebb · 2024-04-13T19:18:47Z

Flagging for a team discussion since I wonder if we should do more to improve things. Perhaps if management.endpoint.health.probes.enabled=true is set and management.endpoint.health.group.liveness doesn't change the include or exclude we should keep the probe defaults.

philwebb · 2024-04-17T15:39:18Z

We're going to look to see if we can improve the default behavior so that liveness and readiness groups have sensible memberships unless the user has specifically configured them otherwise. This is a breaking change so we can't consider it a bug.

tharakadesilva · 2024-04-17T18:35:40Z

Thanks @philwebb!! I've applied the workaround (1) that you recommended and it is working as expected, thank you.

spring-projects-issues added the status: waiting-for-triage An issue we've not yet triaged label Apr 9, 2024

philwebb added for: team-meeting An issue we'd like to discuss as a team to make progress type: enhancement A general enhancement and removed status: waiting-for-triage An issue we've not yet triaged for: team-meeting An issue we'd like to discuss as a team to make progress labels Apr 13, 2024

philwebb changed the title ~~Actuator health groups fail for unrelated indicators when additional paths are configured~~ Retain default group membership when configuring additional paths on probe health groups Apr 17, 2024

philwebb added this to the 3.x milestone Apr 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Retain default group membership when configuring additional paths on probe health groups #40268

Retain default group membership when configuring additional paths on probe health groups #40268

tharakadesilva commented Apr 9, 2024 •

edited

philwebb commented Apr 13, 2024

philwebb commented Apr 13, 2024

philwebb commented Apr 17, 2024

tharakadesilva commented Apr 17, 2024

Retain default group membership when configuring additional paths on probe health groups #40268

Retain default group membership when configuring additional paths on probe health groups #40268

Comments

tharakadesilva commented Apr 9, 2024 • edited

philwebb commented Apr 13, 2024

philwebb commented Apr 13, 2024

philwebb commented Apr 17, 2024

tharakadesilva commented Apr 17, 2024

tharakadesilva commented Apr 9, 2024 •

edited