HashedWheelTimer task scheduling behavior changed in 4.1.85 #13018

lhotari · 2022-11-25T19:20:42Z

Expected behavior

The expectation is that task scheduling behavior of HashedWheelTimer doesn't change significantly between Netty 4.1.x releases.

Actual behavior

The HashedWheelTimer behavior changed in some way that makes multiple integration tests to fail in Apache Pulsar. The only change in HashedWheelTimer in 4.1.85.Final is the PR #12888 .

Steps to reproduce

There are steps to reproduce by running a specific test in Apache Pulsar project. The instructions are in a PR in the Apache Pulsar repo:
apache/pulsar#18599 (comment)

Netty version

4.1.85.Final

JVM version (e.g. `java -version`)

openjdk version "17.0.5" 2022-10-18
OpenJDK Runtime Environment Temurin-17.0.5+8 (build 17.0.5+8)
OpenJDK 64-Bit Server VM Temurin-17.0.5+8 (build 17.0.5+8, mixed mode, sharing)

OS version (e.g. `uname -a`)

Linux x86_64

The text was updated successfully, but these errors were encountered:

lhotari · 2022-11-25T20:32:09Z

@chrisvest I investigated the issue and didn't find a behavior change in HashedWheelTimer. I used HashedWheelTimerTest and made several variations of testExecutionOnTime.

There might be some subtle change that causes the issue, and we'll just have to deal with that in Pulsar.

needmorecode · 2022-11-27T01:12:15Z

@lhotari After reading your comment, I reviewed my code and realised I made a mistake in #12888.
My optimisation based on the assumption that the tasks in a bucket are originally in the order of execution time, which are in fact not.
So breaking the loop at execRound > currRound may cause latter tasks not respond in time.

needmorecode · 2022-11-27T01:17:58Z

Sorry for making that trouble. I already made a PR #13021 to revert it. @lhotari @chrisvest

lhotari · 2022-11-27T18:56:23Z

@lhotari After reading your comment, I reviewed my code and realised I made a mistake in #12888.
My optimisation based on the assumption that the tasks in a bucket are originally in the order of execution time, which are in fact not.
So breaking the loop at execRound > currRound may cause latter tasks not respond in time.

@needmorecode Thanks for the quick confirmation and investigation. Your explanation makes sense. I missed that case when I was trying to add a unit test that would prove an issue.

Motivation: The code I commited in #12888 may cause unexpected task scheduling problems. Modification: Revert this commit. Result: Fixes #13018 .

lhotari mentioned this issue Nov 25, 2022

Improve the performance of expireTimeouts() in HashedWheelTimer #12888

Merged

needmorecode mentioned this issue Nov 27, 2022

Revert #12888 for potential task scheduling problems #13021

Merged

lhotari mentioned this issue Nov 27, 2022

[improve][misc] Upgrade Netty to 4.1.86.Final and Netty Tcnative to 2.0.54.Final apache/pulsar#18599

Merged

4 tasks

normanmaurer closed this as completed in #13021 Nov 28, 2022

normanmaurer pushed a commit that referenced this issue Nov 28, 2022

Revert#12888 for potential scheduling problems (#13021)

b64a6e2

Motivation: The code I commited in #12888 may cause unexpected task scheduling problems. Modification: Revert this commit. Result: Fixes #13018 .

normanmaurer pushed a commit that referenced this issue Nov 28, 2022

Revert#12888 for potential scheduling problems (#13021)

1a4093a

Motivation: The code I commited in #12888 may cause unexpected task scheduling problems. Modification: Revert this commit. Result: Fixes #13018 .

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HashedWheelTimer task scheduling behavior changed in 4.1.85 #13018

HashedWheelTimer task scheduling behavior changed in 4.1.85 #13018

lhotari commented Nov 25, 2022

lhotari commented Nov 25, 2022

needmorecode commented Nov 27, 2022

needmorecode commented Nov 27, 2022

lhotari commented Nov 27, 2022

HashedWheelTimer task scheduling behavior changed in 4.1.85 #13018

HashedWheelTimer task scheduling behavior changed in 4.1.85 #13018

Comments

lhotari commented Nov 25, 2022

Expected behavior

Actual behavior

Steps to reproduce

Netty version

JVM version (e.g. java -version)

OS version (e.g. uname -a)

lhotari commented Nov 25, 2022

needmorecode commented Nov 27, 2022

needmorecode commented Nov 27, 2022

lhotari commented Nov 27, 2022

JVM version (e.g. `java -version`)

OS version (e.g. `uname -a`)