[🚀 Feature]: Automatically drain node after n-failed session attempts #13865

krishtoautomate · 2024-04-24T01:55:10Z

Feature and motivation

User might have n number of nodes and if sessions are randomly allocated during the tests and are failing on few nodes due to issue with docker node. It will be great if this feature is implemented: Automatically drain node after n-failed session attempts.

Usage example

when new RemoteWebDriver session is creating on n-number of nodes with 100-200 test cases in parallel, if I have issues with any docker nodes, those nodes will drain automatically after n(5-10) failed session attempts. the total impact on test execution is less and I end up re-executing only few tests which failed initially.

github-actions · 2024-04-24T01:55:27Z

@krishtoautomate, thank you for creating this issue. We will troubleshoot it as soon as we can.

Info for maintainers

Triage this issue by using labels.

If information is missing, add a helpful comment and then I-issue-template label.

If the issue is a question, add the I-question label.

If the issue is valid but there is no time to troubleshoot it, consider adding the help wanted label.

If the issue requires changes or fixes from an external project (e.g., ChromeDriver, GeckoDriver, MSEdgeDriver, W3C), add the applicable G-* label, and it will provide the correct link and auto-close the issue.

After troubleshooting the issue, please add the R-awaiting answer label.

Thank you!

diemol · 2024-04-24T10:11:36Z

Why is the Docker node failing? How are you monitoring that? Can you share more details? It sounds like we want to fix something with a workaround instead of a proper fix.

krishtoautomate · 2024-04-24T11:00:16Z

Failed Reasons can be any, proxy or internet or chrome crashing or not launching due to driver mismatch. Provided as an example to explain use case. Node can be appium node as well.

diemol · 2024-04-24T13:51:03Z

But that is a Node misconfiguration. Infrastructure needs to be tested before being made available for use. Implementing something like this would hide issues. It is an incomplete workaround.

krishtoautomate · 2024-04-24T16:21:09Z

if we have too many nodes, sometimes our nodes might fall behind for different reasons and we dont want our tests to impact due to nodes which are not working. is their any way this feature can be implemented with some cli argument to make it active and default as off

diemol · 2024-04-26T14:35:00Z

If you have too many nodes, how are you monitoring them? Why would there be a driver mismatch? Are you not testing the changes before sharing the modified infrastructure with the rest of the users?

If a Node is not working, testing the changes done to it should alert you even before you run any regular tests.

Suppose this feature request is implemented, and you have a driver mismatch or network issues with your nodes. What ends up happening is that all of them get shut down. How would you diagnose the actual problem? The result is that the infrastructure went down, and you need to figure out why.

krishtoautomate added I-enhancement needs-triaging labels Apr 24, 2024

diemol added R-awaiting answer and removed needs-triaging labels Apr 24, 2024

github-actions bot added needs-triaging and removed R-awaiting answer labels Apr 24, 2024

diemol added R-awaiting answer and removed needs-triaging labels Apr 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[🚀 Feature]: Automatically drain node after n-failed session attempts #13865

[🚀 Feature]: Automatically drain node after n-failed session attempts #13865

krishtoautomate commented Apr 24, 2024

github-actions bot commented Apr 24, 2024

diemol commented Apr 24, 2024

krishtoautomate commented Apr 24, 2024

diemol commented Apr 24, 2024

krishtoautomate commented Apr 24, 2024

diemol commented Apr 26, 2024

[🚀 Feature]: Automatically drain node after n-failed session attempts #13865

[🚀 Feature]: Automatically drain node after n-failed session attempts #13865

Comments

krishtoautomate commented Apr 24, 2024

Feature and motivation

Usage example

github-actions bot commented Apr 24, 2024

diemol commented Apr 24, 2024

krishtoautomate commented Apr 24, 2024

diemol commented Apr 24, 2024

krishtoautomate commented Apr 24, 2024

diemol commented Apr 26, 2024