Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

submariner test stuck for hours #1293

Open
2 tasks
nirs opened this issue Mar 27, 2024 · 0 comments
Open
2 tasks

submariner test stuck for hours #1293

nirs opened this issue Mar 27, 2024 · 0 comments
Labels
bug Something isn't working test Testing related issue

Comments

@nirs
Copy link
Member

nirs commented Mar 27, 2024

Seen once, submariner test stuck on cleanup after successful run.

kubectl delete -k never completed - needs a timeout.

Why it was stuck? maybe we need to wait when unexporting a service?
2024-03-26 03:21:28,485 ERROR   Command failed
Traceback (most recent call last):
  File "/home/nsoffer/ramen/test/drenv/__main__.py", line 72, in main
    func(env, args)
  File "/home/nsoffer/ramen/test/drenv/__main__.py", line 167, in cmd_start
    execute(
  File "/home/nsoffer/ramen/test/drenv/__main__.py", line 252, in execute
    f.result()
  File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result
    return self.__get_result()
           ^^^^^^^^^^^^^^^^^^^
  File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result
    raise self._exception
  File "/usr/lib64/python3.12/concurrent/futures/thread.py", line 58, in run
    result = self.fn(*self.args, **self.kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/nsoffer/ramen/test/drenv/__main__.py", line 282, in start_cluster
    execute(
  File "/home/nsoffer/ramen/test/drenv/__main__.py", line 252, in execute
    f.result()
  File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 449, in result
    return self.__get_result()
           ^^^^^^^^^^^^^^^^^^^
  File "/usr/lib64/python3.12/concurrent/futures/_base.py", line 401, in __get_result
    raise self._exception
  File "/usr/lib64/python3.12/concurrent/futures/thread.py", line 58, in run
    result = self.fn(*self.args, **self.kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/nsoffer/ramen/test/drenv/__main__.py", line 428, in run_worker
    run_addon(addon, worker["name"], hooks=hooks, allow_failure=allow_failure)
  File "/home/nsoffer/ramen/test/drenv/__main__.py", line 459, in run_addon
    run_hook(hook, addon["args"], name, allow_failure=allow_failure)
  File "/home/nsoffer/ramen/test/drenv/__main__.py", line 470, in run_hook
    run(hook, *args, name=name)
  File "/home/nsoffer/ramen/test/drenv/__main__.py", line 485, in run
    for line in commands.watch(*cmd):
  File "/home/nsoffer/ramen/test/drenv/commands.py", line 155, in watch
    raise Error(args, error, exitcode=p.returncode)
drenv.commands.Error: Command failed:
   command: ('addons/submariner/test', 'hub', 'dr1', 'dr2')
   exitcode: 1
   error:
      Traceback (most recent call last):
        File "/home/nsoffer/ramen/test/addons/submariner/test", line 268, in <module>
          wait_for_delete(NS2, clusters[1], clusters[0])
        File "/home/nsoffer/ramen/test/addons/submariner/test", line 79, in wait_for_delete
          kubectl.delete(
        File "/home/nsoffer/ramen/test/drenv/kubectl.py", line 127, in delete
          _watch("delete", *args, input=input, context=context, log=log)
        File "/home/nsoffer/ramen/test/drenv/kubectl.py", line 157, in _watch
          for line in commands.watch(*cmd, input=input):
        File "/home/nsoffer/ramen/test/drenv/commands.py", line 155, in watch
          raise Error(args, error, exitcode=p.returncode)
      drenv.commands.Error: Command failed:
         command: ('kubectl', 'delete', '--context', 'dr2', '--kustomize=ns2/dst', '--wait=true', '--ignore-not-found')
         exitcode: 1
         error:

Tasks

  • Add timeout to all kubectl delete calls in all addons, ramenctl, basic-test
  • Find why the delete fails in submariner test - maybe we need to wait after calling subctl unexport?
@nirs nirs added bug Something isn't working test Testing related issue labels Mar 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working test Testing related issue
Projects
None yet
Development

No branches or pull requests

1 participant