Retry map and cache partition destroy operations #12686

mmedenjak · 2018-03-22T09:07:11Z

When the partition is migrating, the operations might fail with a
PartitionMigratingException. We then need to retry the operation as
otherwise we are leaking memory. As for other partition operations,
we wait for the migration to complete, using the default try count and
wait count.

Fixes:
https://github.com/hazelcast/hazelcast-enterprise/issues/930
https://github.com/hazelcast/hazelcast-mono/issues/1427

EE PR: https://github.com/hazelcast/hazelcast-enterprise/pull/1991

vbekiaris · 2018-03-22T11:07:34Z

Test failed: ClientRegressionWithMockNetworkTest.testOperationRedo

https://hazelcast-l337.ci.cloudbees.com/job/new-lab-fast-pr/14605

vbekiaris

Overall looks good, left some questions / comments. Also, the new execution utility lacks tests.

vbekiaris · 2018-03-22T12:38:04Z

hazelcast/src/main/java/com/hazelcast/internal/util/LocalRetryableExecution.java

+ * @see InvocationUtil#invokeLocallyWithRetry(NodeEngine, Operation)
+ */
+public class LocalRetryableExecution implements Runnable, OperationResponseHandler {
+    /** Number of times an operation is retried before being logger at WARNING level */


typo: "logger" -> "logged"

vbekiaris · 2018-03-22T12:39:56Z

hazelcast/src/main/java/com/hazelcast/internal/util/InvocationUtil.java

+     * @see Operation#getOperationResponseHandler()
+     * @see Operation#validatesTarget()
+     */
+    public static LocalRetryableExecution invokeLocallyWithRetry(NodeEngine nodeEngine, Operation operation) {


Maybe executeLocallyWithRetry since it's using OperationService.execute? Also would help avoid confusion with full-blown invocations of OperationService.invokeXXX.

vbekiaris · 2018-03-22T12:56:25Z

hazelcast/src/main/java/com/hazelcast/map/impl/operation/MapPartitionDestroyOperation.java

-    public int getPartitionId() {
-        return partitionContainer.getPartitionId();
+    public boolean returnsResponse() {
+        return true;


No need to override, true is the default in Operation.returnsResponse()

vbekiaris · 2018-03-22T13:01:30Z

hazelcast/src/main/java/com/hazelcast/map/impl/operation/MapPartitionDestroyOperation.java

+
+    @Override
+    public boolean isUrgent() {
+        return true;


Why turn this into an urgent operation? Could it interfere with cluster operations (eg join ops)?

I must have copied it over from CacheSegmentDestroyOperation. I removed it from both operations.

mmedenjak · 2018-03-22T14:53:14Z

@vbekiaris thanks for the review. Addressed all comments and added test for retry logic.

vbekiaris

Nice job!

mmedenjak · 2018-03-27T12:26:01Z

@ahmetmircik can you please finish the review? Is there anything that I need to address?

ahmetmircik · 2018-03-27T12:32:41Z

@mmedenjak seems ok to me

When the partition is migrating, the operations might fail with a PartitionMigratingException. We then need to retry the operation as otherwise we are leaking memory. As for other partition operations, we wait for the migration to complete, using the default try count and wait count. Fixes: https://github.com/hazelcast/hazelcast-enterprise/issues/930 Fixes: https://github.com/hazelcast/hazelcast-enterprise/issues/1933

devOpsHazelcast · 2024-04-22T12:41:35Z

Can one of the admins verify this patch?

devOpsHazelcast · 2024-04-22T12:41:35Z

Can one of the admins verify this patch?

devOpsHazelcast · 2024-04-22T12:41:36Z

Can one of the admins verify this patch?

mmedenjak added Team: Core Type: Test-Failure Module: IMap Module: ICache labels Mar 22, 2018

mmedenjak added this to the 3.10 milestone Mar 22, 2018

mmedenjak self-assigned this Mar 22, 2018

mmedenjak requested review from vbekiaris and ahmetmircik March 22, 2018 09:07

vbekiaris reviewed Mar 22, 2018

View reviewed changes

hazelcast deleted a comment from vbekiaris Mar 22, 2018

vbekiaris approved these changes Mar 22, 2018

View reviewed changes

mmedenjak force-pushed the HiDensityCacheStatsTest-fix branch from 9f97122 to d9e17bc Compare March 23, 2018 11:30

ahmetmircik approved these changes Mar 27, 2018

View reviewed changes

mmedenjak force-pushed the HiDensityCacheStatsTest-fix branch from d9e17bc to 890c299 Compare March 27, 2018 13:19

mmedenjak merged commit 9c10d20 into hazelcast:master Mar 27, 2018

mmedenjak deleted the HiDensityCacheStatsTest-fix branch March 27, 2018 16:11

mmedenjak added the Source: Internal PR or issue was opened by an employee label Apr 13, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Retry map and cache partition destroy operations #12686

Retry map and cache partition destroy operations #12686

mmedenjak commented Mar 22, 2018 •

edited

vbekiaris commented Mar 22, 2018

vbekiaris left a comment

vbekiaris Mar 22, 2018

vbekiaris Mar 22, 2018

vbekiaris Mar 22, 2018

vbekiaris Mar 22, 2018

mmedenjak Mar 22, 2018

mmedenjak commented Mar 22, 2018

vbekiaris left a comment

mmedenjak commented Mar 27, 2018

ahmetmircik commented Mar 27, 2018

devOpsHazelcast commented Apr 22, 2024

devOpsHazelcast commented Apr 22, 2024

devOpsHazelcast commented Apr 22, 2024

Retry map and cache partition destroy operations #12686

Retry map and cache partition destroy operations #12686

Conversation

mmedenjak commented Mar 22, 2018 • edited

vbekiaris commented Mar 22, 2018

vbekiaris left a comment

Choose a reason for hiding this comment

vbekiaris Mar 22, 2018

Choose a reason for hiding this comment

vbekiaris Mar 22, 2018

Choose a reason for hiding this comment

vbekiaris Mar 22, 2018

Choose a reason for hiding this comment

vbekiaris Mar 22, 2018

Choose a reason for hiding this comment

mmedenjak Mar 22, 2018

Choose a reason for hiding this comment

mmedenjak commented Mar 22, 2018

vbekiaris left a comment

Choose a reason for hiding this comment

mmedenjak commented Mar 27, 2018

ahmetmircik commented Mar 27, 2018

devOpsHazelcast commented Apr 22, 2024

devOpsHazelcast commented Apr 22, 2024

devOpsHazelcast commented Apr 22, 2024

mmedenjak commented Mar 22, 2018 •

edited