Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: Apply execution control to taskset nodes that are not part of exit handler and only delete agent pod if exists. #13016

Draft
wants to merge 4 commits into
base: main
Choose a base branch
from

Conversation

jswxstw
Copy link
Contributor

@jswxstw jswxstw commented May 7, 2024

Fixes #12659
Fixes #13060

Motivation

  1. This PR feat: Support argo plugin stop. Fixes #12333 #12441 has support plugin nodes shutdown, but HTTP node still not.
  2. Fixes controller sending many pod delete requests that result in 404 response #12659
  3. Fixes onExit steps executed after deadline exceeded are stopped if an onExit pod spends >=10 seconds in Pending state #13060

Modifications

Modification 1

Execution control has been applied to the nodes with created pods after pod reconciliation. However, pending and suspended nodes do not have created pods, and taskset nodes use the agent pod. Mark these nodes failed when shutting down or exceeding deadline since pod reconciliation does not take effect on them.

Modification 2

Agent pod will only be created when taskset nodes exist, so do not delete it if it does not exist.

Modification 3

Skip execution control for exit nodes.

Verification

@agilgur5 agilgur5 self-assigned this May 7, 2024
@agilgur5 agilgur5 added the area/agent Argo Agent that runs for HTTP and Plugin templates label May 7, 2024
jswxstw added 2 commits May 9, 2024 12:06
Signed-off-by: oninowang <oninowang@tencent.com>
Signed-off-by: oninowang <oninowang@tencent.com>
@jswxstw jswxstw force-pushed the fix-taskset-nodes-with-agent-pod branch from 1399849 to 6cba1c9 Compare May 9, 2024 06:01
@tooptoop4
Copy link
Contributor

📦

Signed-off-by: jswxstw <jswxstw@gmail.com>
@jswxstw jswxstw force-pushed the fix-taskset-nodes-with-agent-pod branch from 99f79a7 to 56b8628 Compare May 21, 2024 11:48
Signed-off-by: jswxstw <jswxstw@gmail.com>
@jswxstw jswxstw force-pushed the fix-taskset-nodes-with-agent-pod branch from 5d5c40c to 21de6e7 Compare May 21, 2024 13:40
@jswxstw jswxstw changed the title fix: Apply execution control to taskset nodes and only delete agent pod if exists. fix: Apply execution control to taskset nodes that are not part of exit handler and only delete agent pod if exists. May 21, 2024
@jswxstw jswxstw closed this May 21, 2024
@jswxstw jswxstw reopened this May 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/agent Argo Agent that runs for HTTP and Plugin templates
Projects
None yet
3 participants