Add new config_file_service_registration token #15828

pglass · 2022-12-16T23:08:20Z

Description

This adds a new agent token: config_file_service_registration. This token is used to register services and checks that are defined in local config files (including when defined in flags, such as with -hcl).

This adds:

The config field acl.tokens.config_file_service_registration
The PUT /agent/token/config_file_service_registration HTTP API request
The consul acl set-agent-token config_file_service_registration <token> command

The precedence of tokens when registering a service from a service definition or a check from a check definition is:

Inline service token: The token from the token field in the service/check definition is used, if set
Config File Service Registration token: otherwise, the config file registration token is used, if set
Default token: otherwise, the default token is used, if set
Anonymous token: otherwise, the anonymous token is used

Testing & Reproduction steps

Updated unit tests
Also, manually tested:
- Setting acl.tokens.config_file_service_registration and seeing successful registration of a service definition
- Setting acl.tokens.config_file_service_registration and seeing successful registration of a check definition
- Defining an inline token in the service definition and validating that token is used instead of the acl.tokens.config_file_service_registration
- Defining an inline token in the check definition and validating that token is used instead of the acl.tokens.config_file_service_registration
- Unsetting acl.tokens.config_file_service_registration and setting acl.tokens.default to check that checks and services fall back to the default token
- Registering services via the HTTP ensure that the config_file_service_registration token is only used for registering services sourced from config files
- Running consul acl set-agent-token config_file_service_registration and checking that <data-dir>/acl-tokens.json is updated when token persistence is enabled, and that the updated token is used for subsequent service registrations

Links

#4478

PR Checklist

updated test coverage
external facing docs updated
not a security concern

agent/local/state.go

kisunji · 2022-12-19T15:57:17Z

agent/local/state.go

+//
+// The fallback function will return the config file registration token if the
+// given service was sourced from a service definition in a config file.
+func (l *State) RegistrationTokenFallback(key structs.ServiceID) func() string {


nit: I don't think this is used outside package state so it could be private.

Alternatively, what do you think about merging this logic into the body of aclTokenForServiceSync since it already does a lookup of l.services[key]?

Passing around a lock-guarded map in a closure makes me a little cautious; in this PR the codepaths are synchronized but this could be accidentally misused in the future.

I don't think this is used outside package state so it could be private.

I agree. I had to export the method because the test package is different (local vs local_test). Which seems unusual to me. But, without exporting it I can't call it in unit tests.

Alternatively, what do you think about merging this logic into the body of aclTokenForServiceSync since it already does a lookup of l.services[key]?

I opted against this because aclTokenForServiceSync is also used in deleteService.

So, do we want deleteService to also incorporate the config file registration token in its list of fallback tokens? Generally, it seems better to me if it does not.

The main concern to me is if the config_file_registration token has been deleted, then it would fail to deregister the service and we'd see errors in logs. Also, it should fallback to using the agent token anyway. (It is able to use the agent token for service deregistrations because the Catalog.Deregister RPC accepts a token with the relevant node:write permissions).

Because the agent token must have node:write permissions (or else it could not have registered it's node into the catalog) and because the agent token is probably less likely to have been deleted (because agent lifecycle is longer than service instance lifecycle), it seems like the agent could skip straight to using the agent token without considering the config_file_registration token for the deregistration, rather than incorporating the agent token in the list of "fallback" tokens. And that would be faster and would not generate a deceptive log message. There's some relevant discussion on this here: #8078

That's why I opted against inlining the config_file_registration token fallback into aclTokenForServiceSync. Although it does leave me a question: why does deleteService try using the service token and then fallback to agent token, instead of unilaterally using the agent token for deregistrations?

My understanding is that the agent token typically only has node:write on itself. The agent token generally wouldn't have service:write needed to deregister a service.

My understanding is that the agent token typically only has node:write on itself. The agent token generally wouldn't have service:write needed to deregister a service.

Right, but the Consul servers will accept a deregistration if the token contains node:write for the node containing that service. See #5217 and

consul/agent/consul/catalog_endpoint.go

Lines 441 to 448 in 275a0b8

// Allow service deregistration if the token has write permission for the node.

// This accounts for cases where the agent no longer has a token with write permission

// on the service to deregister it.

nodeWriteErr := authz.ToAllowAuthorizer().NodeWriteAllowed(subj.Node, &authzContext)

if nodeWriteErr == nil {

return nil

}

And all services registered with an agent must be registered to that agent's node. (Or from a perms perspective, only to the nodes which the agent has permission to update)

Ah, interesting!

Although it does leave me a question: why does deleteService try using the service token and then fallback to agent token, instead of unilaterally using the agent token for deregistrations?

I've tested this.

The agent only falls back to the agent token if the service token is unset for that particular service (i.e. token field is empty or absent from the service definition in any config files). It does not fallback to other tokens on failure to deregister (i.e. it will not try the deregistration with the service token and, on failure, then try the agent token).

If the original service token has been deleted from the servers, because the agent has stored that service token in its local state, it continues to use that original service token to deregister that service during each state sync - which will repeatedly fail each time the state sync is retried. This feels like a bit of a gotcha.

Basically, I'm weighing two options:

The existing behavior is "good", so we should have deleteService include the config_file_registration token in its list of fallbacks. If set, the config_file_registration would be used instead of the agent token. And if the config_file_registration token was deleted, then the deregistration would fail forever.

Or, the existing behavior isn't great, and it would be better for it to only use the agent token for service deregistrations because that will "just work" because of the node:write "bypass" for service deregistrations.

Thoughts @jkirschner-hashicorp?

Is it accurate to say that the agent basically isn't functional if it lacks a token with node:write on itself?

If so... Is there any downside to approach 2? If I understand correctly, approach 2 would always work assuming the node is still functional (has a token the node:write). Why use approach 1 (which has some edge cases) if approach 2 must necessarily work?

Is there a separate case where a service is being deregistered directly from the server agents rather than from the node that owns the service, in cases where the node no longer exists but the service was never deregistered (but needs to be cleaned up)?

Happy to have a quick Zoom about this tomorrow.

nit: I don't think this is used outside package state so it could be private.

I reworked these tests so the methods are unexported.

agent/token/store.go

kisunji

Left some minor comments but LGTM. What do you think about renaming config_file_registration to something shorter like registration or static_registration? It feels a little verbose but on the other hand maybe the detailed name makes it more clear.

jkirschner-hashicorp · 2023-01-03T17:23:39Z

What do you think about renaming config_file_registration to something shorter like registration or static_registration? It feels a little verbose but on the other hand maybe the detailed name makes it more clear.

In my experience, Consul agent token names are commonly misunderstood, such as agent and default. I personally prefer that we have an unambiguous name that is 3 words (config_file_registration) rather than a shorter name with ambiguity, especially since this isn't something that will be typed all the time (like partition in enterprise to refer to an administrative partition). I feel like registration is too ambiguous, as HTTP API calls are also methods to perform "registration", but wouldn't use this token. static_registration resolves that ambiguity, though because config files are the only way to perform static registration (AFAIK), using config_file_ rather than static_ seems best to me, as there's no interpretation required on the part of the user (to figure out what "static" refers to in this context).

vercel · 2023-01-05T23:39:41Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Updated
consul	🔄 Building (Inspect)		Jan 6, 2023 at 10:25PM (UTC)
consul-ui-staging	🔄 Building (Inspect)		Jan 6, 2023 at 10:25PM (UTC)

pglass · 2023-01-06T20:06:21Z

I've updated this, so I've requested a re-review @kisunji

Rename config_file_registration to config_file_service_registration
Checks in config files also use the config_file_service_registration token. This is because checks can be inlined in service definitions and service-level checks do not need additional permissions, so it makes sense for service-level checks to be registered with this token as well.
Updated the external docs
Added changelog
Rebased onto main

kisunji

LGTM; left mostly docs suggestions

kisunji · 2023-01-06T20:36:00Z

agent/config/config.go

+	AgentRecovery          *string `mapstructure:"agent_recovery"`
+	Default                *string `mapstructure:"default"`
+	Agent                  *string `mapstructure:"agent"`
+	ConfigFileRegistration *string `mapstructure:"config_file_service_registration"`


Is the _service_ part omitted for brevity?

Yes. With Golang's general style preference for shorter variables, I thought ConfigFileRegistration was already long enough but still clear enough.

kisunji · 2023-01-06T20:37:42Z

agent/local/state.go

-		Token:   token,
+		Service:          service,
+		Token:            token,
+		IsLocallyDefined: isLocal,
 	})
 	return nil
 }

 // AddServiceWithChecks adds a service entry and its checks to the local state atomically
 // This entry is persistent and the agent will make a best effort to
 // ensure it is registered


Can we update the godoc to describe what isLocal should represent?
I can imagine someone confusing the "local" concept with peered services.

Yep. I updated this.

fwiw, I used "local" in order match ConfigSourceLocal.

kisunji · 2023-01-06T20:38:03Z

agent/local/state.go

+			return tok
+		}
+	}
+	return ""
 }

 // AddCheck is used to add a health check to the local state.
 // This entry is persistent and the agent will make a best effort to
 // ensure it is registered


Same comment here about updating godocs

agent/local/state_internal_test.go

kisunji · 2023-01-06T20:46:48Z

agent/token/store.go

+	// configFileRegistrationToken is used to register services defined
+	// with a service definitions in a config file.


Suggested change

// configFileRegistrationToken is used to register services defined

// with a service definitions in a config file.

// configFileRegistrationToken is used to register services and checks

// defined with a service/check definition in a config file.

kisunji · 2023-01-06T20:51:37Z

website/content/commands/acl/set-agent-token.mdx

@@ -46,6 +46,13 @@ The token types are:
  operations. This token will need to be configured with read access to
  whatever data is being replicated.

+- `config_file_service_registration` - This is the token that the agent uses to
+  register services and checks defined in config files. This token needs to be
+  configured with permission for the service or checks being registered. If not


Suggested change

configured with permission for the service or checks being registered. If not

configured with write permissions for the service or checks being registered. If not

kisunji · 2023-01-06T20:54:17Z

website/content/docs/agent/config/config-files.mdx

+      [check definitions](/docs/discovery/checks) foudn in configuration files or in configuration
+      strings passed to the agent using the `-hcl` flag.


Suggested change

[check definitions](/docs/discovery/checks) foudn in configuration files or in configuration

strings passed to the agent using the `-hcl` flag.

[check definitions](/docs/discovery/checks) found in configuration files or in configuration

strings passed to the agent using the `-hcl` flag.

Would this be more concise and still convey the same information? @jkirschner-hashicorp

Suggested change

[check definitions](/docs/discovery/checks) foudn in configuration files or in configuration

strings passed to the agent using the `-hcl` flag.

[check definitions](/docs/discovery/checks) on startup.

Does registration also happen on reload?

If so, we could do something like:

Suggested change

[check definitions](/docs/discovery/checks) foudn in configuration files or in configuration

strings passed to the agent using the `-hcl` flag.

[check definitions](/docs/discovery/checks) loaded by the agent on startup and reload.

nit: If we keep the -hcl flag mention, would we also need to include -json?

From what I can tell, -json doesn't exist. The consul agent command only has -hcl for config fragments.

The -hcl flag enables operators to specify agent configuration values on the CLI. There is currently no equivalent -json flag for allowing agent configuration to be provided in JSON format. If we wanted to support that, it would be a new feature that requires additional development.

I don't want to suggest we support that. For some reason I thought I had seen an invocation of a Consul agent with that recently, but it seems like I misremembered (e.g., perhaps it was a config file being passed in with JSON format).

I think I prefer to be more elaborate / specific here to help reduce confusion?

If we say "configuration passed at startup", I feel like "startup" leaves room for interpretation. Is sending an HTTP request passing configuration to the agent? Does that include if I send a service registration request while the agent is "starting up"?

I wanted to be clear about what services/checks the token is specifically used for (those services/checks in files or -hcl config fragments).

kisunji · 2023-01-06T20:57:52Z

website/content/docs/agent/config/config-files.mdx

+      If an inline token is defined in the service or check definition, then the inline token is
+      used to register that service or check instead. If the `config_file_service_registration` token is not
+      defined and if a service or check has no inline token, then the agent uses the
+      [`default`](#acl_tokens_default) token to register the service or check.


I think "inline" might be confusing to users

Suggested change

If an inline token is defined in the service or check definition, then the inline token is

used to register that service or check instead. If the `config_file_service_registration` token is not

defined and if a service or check has no inline token, then the agent uses the

[`default`](#acl_tokens_default) token to register the service or check.

If the `token` field is defined in the service or check definition, then that token is

used to register that service or check instead. If the `config_file_service_registration` token is not

defined and if a service or check has no defined `token` field, then the agent uses the

[`default`](#acl_tokens_default) token to register the service or check.

kisunji · 2023-01-06T20:59:45Z

website/content/docs/agent/config/config-files.mdx

+      `config_file_service_registration` token needs multiple `service:write` permissions in order for
+      the agent to register those services.


I think "multiple" could be interpreted as needing N>1 perms.

Suggested change

`config_file_service_registration` token needs multiple `service:write` permissions in order for

the agent to register those services.

`config_file_service_registration` token needs `service:write` permissions for all services

in order for the agent to register them.

On that note, what happens if a config_file_service_registration token has permissions for a partial set of services? Does it fail to write all services or does it skip only the service with the missing perm?

Could make the behavior clear in the docs here.

I rewrote this using two named services "A" and "B" as an example. I also elaborated a bit more on the failure case (maybe too much?). Let me know what you think!

pglass requested review from a team, skpratt and kisunji and removed request for a team December 16, 2022 23:08

github-actions bot added theme/api theme/cli theme/config labels Dec 16, 2022

pglass commented Dec 16, 2022

View reviewed changes

agent/local/state.go Outdated Show resolved Hide resolved

kisunji reviewed Dec 19, 2022

View reviewed changes

agent/token/store.go Outdated Show resolved Hide resolved

kisunji approved these changes Dec 19, 2022

View reviewed changes

pglass requested a review from a team as a code owner January 5, 2023 20:57

vercel bot deployed to Preview – consul January 5, 2023 21:02 View deployment

Paul Glass added 12 commits January 6, 2023 13:38

Add new config_file_registration token

68920ef

Fix tests

bfa728c

Add 'consul acl set-agent-token config_file_registration <token>'

0f6f071

Fix config_file_registration token persistence

df3a359

Refactor token store update methods

dd9e242

Use config_file_registration token for checks

99ed90c

Update tests for config_file_registration token

7a5d0d1

Update changelog

9cd5b39

docs: Update docs for config_file_registration token

cb7d7de

Fix tests

ac4a9b1

Add forgotten test

afb3b9d

pglass force-pushed the pglass/NET-1768-config-file-registration-token branch from 159e516 to afb3b9d Compare January 6, 2023 19:38

pglass requested a review from kisunji January 6, 2023 19:38

jkirschner-hashicorp mentioned this pull request Jan 6, 2023

Agent doesn't use acl_agent_token for service registration from static json config #4478

Closed

Rename config_file_registration > config_file_service_registration

651e9e0

pglass changed the title ~~Add new config_file_registration token~~ Add new config_file_service_registration token Jan 6, 2023

vercel bot deployed to Preview – consul January 6, 2023 20:03 View deployment

kisunji reviewed Jan 6, 2023

View reviewed changes

Address feedback

1441c2e

vercel bot deployed to Preview – consul January 6, 2023 22:32 View deployment

skpratt approved these changes Jan 9, 2023

View reviewed changes

pglass merged commit f5231b9 into main Jan 10, 2023

pglass deleted the pglass/NET-1768-config-file-registration-token branch January 10, 2023 16:24

skpratt pushed a commit that referenced this pull request Jan 25, 2023

Add new config_file_service_registration token (#15828)

04634fb

hessamalipour mentioned this pull request Sep 14, 2023

consul connect envoy doesn't respect acl.tokens.default #9392

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new config_file_service_registration token #15828

Add new config_file_service_registration token #15828

pglass commented Dec 16, 2022 •

edited

Loading

kisunji Dec 19, 2022

pglass Jan 3, 2023

jkirschner-hashicorp Jan 3, 2023

pglass Jan 3, 2023 •

edited

Loading

jkirschner-hashicorp Jan 3, 2023

pglass Jan 3, 2023

jkirschner-hashicorp Jan 4, 2023 •

edited

Loading

pglass Jan 6, 2023 •

edited

Loading

kisunji left a comment •

edited

Loading

jkirschner-hashicorp commented Jan 3, 2023 •

edited

Loading

vercel bot commented Jan 5, 2023 •

edited

Loading

pglass commented Jan 6, 2023

kisunji left a comment

kisunji Jan 6, 2023

pglass Jan 6, 2023 •

edited

Loading

kisunji Jan 6, 2023

pglass Jan 6, 2023

kisunji Jan 6, 2023

kisunji Jan 6, 2023

kisunji Jan 6, 2023

kisunji Jan 6, 2023

jkirschner-hashicorp Jan 6, 2023

jkirschner-hashicorp Jan 6, 2023 •

edited

Loading

pglass Jan 6, 2023

blake Jan 6, 2023

jkirschner-hashicorp Jan 6, 2023

pglass Jan 6, 2023

kisunji Jan 6, 2023

kisunji Jan 6, 2023

kisunji Jan 6, 2023

pglass Jan 6, 2023

	// Allow service deregistration if the token has write permission for the node.
	// This accounts for cases where the agent no longer has a token with write permission
	// on the service to deregister it.
	nodeWriteErr := authz.ToAllowAuthorizer().NodeWriteAllowed(subj.Node, &authzContext)
	if nodeWriteErr == nil {
	return nil
	}

		// configFileRegistrationToken is used to register services defined
		// with a service definitions in a config file.

	configured with permission for the service or checks being registered. If not
	configured with write permissions for the service or checks being registered. If not

		[check definitions](/docs/discovery/checks) foudn in configuration files or in configuration
		strings passed to the agent using the `-hcl` flag.

		`config_file_service_registration` token needs multiple `service:write` permissions in order for
		the agent to register those services.

Add new config_file_service_registration token #15828

Add new config_file_service_registration token #15828

Conversation

pglass commented Dec 16, 2022 • edited Loading

Description

Testing & Reproduction steps

Links

PR Checklist

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pglass Jan 3, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jkirschner-hashicorp Jan 4, 2023 • edited Loading

Choose a reason for hiding this comment

pglass Jan 6, 2023 • edited Loading

Choose a reason for hiding this comment

kisunji left a comment • edited Loading

Choose a reason for hiding this comment

jkirschner-hashicorp commented Jan 3, 2023 • edited Loading

vercel bot commented Jan 5, 2023 • edited Loading

pglass commented Jan 6, 2023

kisunji left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pglass Jan 6, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jkirschner-hashicorp Jan 6, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pglass commented Dec 16, 2022 •

edited

Loading

pglass Jan 3, 2023 •

edited

Loading

jkirschner-hashicorp Jan 4, 2023 •

edited

Loading

pglass Jan 6, 2023 •

edited

Loading

kisunji left a comment •

edited

Loading

jkirschner-hashicorp commented Jan 3, 2023 •

edited

Loading

vercel bot commented Jan 5, 2023 •

edited

Loading

pglass Jan 6, 2023 •

edited

Loading

jkirschner-hashicorp Jan 6, 2023 •

edited

Loading