Benchmark for CW32 with high backpressure, at times no throughput and frequent gateway restarts #10059
Comments
@deepthidevaki mentioned this looks similar to #9862 |
Terminated gateway nodes were restarted due to imminent node shutdown, at least the ones that I could look at |
We see a high frequency of this bug: #10014. This is expected, because the bug was not yet fixed in the commit used for the benchmark. |
Current working theory is:
Things not explained:
|
Had a chat with Simon. We found no explanation for why gateways were being restarted more frequently after 4:00 PM on August 10th, or why this behavior stopped around 9:30 AM on August 11th. |
@Zelldon There are a few errors reported about this. It appears to be unrelated to the above. |
We assume this was fixed by a bug fix. @oleschoenburg, could you please link the corresponding issue/PR and delete the benchmark? |
Issues were caused by #10014 which is fixed. I'll delete the benchmark. |
The benchmark for CW32 shows severely degraded performance: http://34.77.165.228/d/NzsO1mUnk/zeebe-overview?orgId=1&var-DS_PROMETHEUS=Prometheus&var-namespace=medic-cw-32-be18e23b78-benchmark&var-pod=All&var-partition=All&from=1660132800000&to=1660212000000
Throughput is very low:
Frequent restarts of Gateway:
Back pressure is high:
Processing shows a cliff edge:
Snapshots are growing after the cliff edge: