CRI: Sandbox IP not present after containerd restart #7843

dcantah · 2022-12-20T07:02:06Z

Description

First reported on the CNCF Slack: https://cloud-native.slack.com/archives/C4RJZ9Z6Y/p1671470742340569

Before containerd process restart:

# crictl -r unix:///run/k0s/containerd.sock inspectp 94c43ab108db3 | jq .status.network
{
 "additionalIps": [],
 "ip": "10.244.0.24"
}

After restarting containerd:

# crictl -r unix:///run/k0s/containerd.sock inspectp 94c43ab108db3 | jq .status.network
{
 "additionalIps": [],
 "ip": ""
}

This means that kubelet sees the sandbox as changed and thus will restart each pod.

Steps to reproduce the issue

1. For local testing, create a pod using crictl.
2. Restart containerd.
3. Inspect the pod status after the restart: crictl -r unix:///run/k0s/containerd.sock inspectp $podID | jq .status.network
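
The before/after comparison in these steps could be automated roughly as follows. This is a minimal sketch, not part of the original report: the helper names (sandbox_ip, ip_preserved, inspect_pod) are hypothetical, the socket path is the k0s one quoted above, and restarting containerd between the two captures is left to the caller.

```python
import json
import subprocess

# Endpoint taken from the report above; adjust it for your installation.
ENDPOINT = "unix:///run/k0s/containerd.sock"


def sandbox_ip(inspect_output: str) -> str:
    """Return .status.network.ip from `crictl inspectp` JSON, or "" if absent."""
    data = json.loads(inspect_output)
    network = (data.get("status") or {}).get("network") or {}
    return network.get("ip") or ""


def ip_preserved(before: str, after: str) -> bool:
    """True when the sandbox had an IP before the restart and still reports the same one."""
    ip_before = sandbox_ip(before)
    return ip_before != "" and ip_before == sandbox_ip(after)


def inspect_pod(pod_id: str, endpoint: str = ENDPOINT) -> str:
    """Run `crictl -r <endpoint> inspectp <pod_id>` and return its stdout."""
    result = subprocess.run(
        ["crictl", "-r", endpoint, "inspectp", pod_id],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout
```

A typical use would be to call inspect_pod(pod_id) once before restarting containerd and once after, then pass both outputs to ip_preserved. On an affected version the second capture reports an empty IP, so the check returns False.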

Describe the results you received and expected

Expected: containerd correctly preserves the sandbox's IP/networking information across a restart. Received: the sandbox IP is empty after the restart. This behavior regressed in 1.6.9 and may be related to #7456.

What version of containerd are you using?

1.6.12

Any other relevant information

No response

Show configuration if it is related to CRI plugin.

Default containerd config

dcantah · 2022-12-20T07:02:36Z

cc @samuelkarp @MikeZappa87 as we were discussing on Slack

qiutongs · 2022-12-20T17:13:16Z

Let me take a look.

brandond · 2023-01-04T18:09:07Z

This needs to be backported to v1.6 ASAP. All releases since 1.6.9 have this critical regression, which breaks containerd's guarantees around non-disruptive restarts. Unaffected versions are affected by CVEs, so users are currently forced to pick between CVEs or having all their pods restarted whenever containerd restarts.

klueska · 2023-01-11T14:24:15Z

I believe the issue I reported here is also resolved by this fix:
https://cloud-native.slack.com/archives/CGEQHPYF4/p1667586414682319

dcantah added kind/bug area/cri Container Runtime Interface (CRI) labels Dec 20, 2022

dcantah mentioned this issue Dec 20, 2022

CRI: Fix no CNI info for pod sandbox on restart #7845

Merged

ncopa mentioned this issue Dec 20, 2022

Backport fix for containerd k0sproject/k0s#2541

Closed

16 tasks

samuelkarp closed this as completed in #7845 Dec 20, 2022

samuelkarp added a commit that referenced this issue Dec 20, 2022

Merge pull request #7845 from dcantah/fix-noip-onrestart

3233d5d

Fixes #7843

brandond mentioned this issue Jan 4, 2023

rke2 1.24.9 cannot restart without node disruption rancher/rke2#3723

Closed

brandond mentioned this issue Jan 4, 2023

Bump containerd to v1.6.14-k3s1 to fix issue with pod network info not being written to metadata store k3s-io/k3s#6692

Closed

dereknola mentioned this issue Jan 4, 2023

Containerd restart testlet k3s-io/k3s#6696

Merged

bk201 mentioned this issue Jan 9, 2023

Bump RKE2 to v1.24.9+rke2r1 harvester/harvester-installer#397

Closed

xhejtman mentioned this issue Jan 10, 2023

Failed to get sandbox runtime: no runtime for nvidia is configured NVIDIA/gpu-operator#432

Open

16 tasks

rbrtbnfgl mentioned this issue Jan 23, 2023

Pod can't access clusterip service for another pod with endpoint on the same node flannel-io/flannel#1702

Closed

brandond mentioned this issue Jan 23, 2023

[release-1.23] Bump containerd to v1.5.16-k3s2 to fix issue with pod network info not being written to metadata store k3s-io/k3s#6809

Closed

renovate bot mentioned this issue Mar 13, 2023

feat(github-release): update k3s-io/k3s to v1.27.2+k3s1 lenaxia/home-ops-dev#70

Merged

1 task

aavbsouza mentioned this issue May 5, 2023

Documentation clarification about containerd tweaks NVIDIA/gpu-operator#519

Open
