Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ceph HEALTH_ERR: Module 'devicehealth' has failed: table Device already exists #1298

Open
nirs opened this issue Mar 27, 2024 · 0 comments
Open
Labels
bug Something isn't working test Testing related issue

Comments

@nirs
Copy link
Member

nirs commented Mar 27, 2024

Seen at least 5 times in 188 builds.

The visible error is timeout waiting for cephblockpool or timeout waiting for mirroring daemon health.

When inspecting the cluster we see:

ceph status:

  cluster:
    id:     dbf6c8b8-dd8b-4117-933e-93778b1a7274
    health: HEALTH_ERR
            Module 'devicehealth' has failed: table Device already exists

dr health:

Info: running mirroring daemon health
health: UNKNOWN
daemon health: UNKNOWN
image health: OK
images: 0 total

Retrying drenv start does not help. the cluster is stuck in this state for hours.

Seen after upgrading csi-addons to latest version (0.8.0).

Ceph bug: https://tracker.ceph.com/issues/65494

@nirs nirs added bug Something isn't working test Testing related issue labels Mar 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working test Testing related issue
Projects
None yet
Development

No branches or pull requests

1 participant