Reject duplicate parallel gateway activate command #9759

remcowesterhoud · 2022-07-12T07:50:53Z

Description

Parallel gateways get activated by taking a sequence flow and checking if the number of taken flows is greater than or equal to the number of incoming sequence flows. If this is the case, an activate command is sent. The number of taken sequence flows gets reset upon activation of the parallel gateway.

This proves troublesome when a "bad" model causes one of the incoming sequence flows to be taken twice. This could result in the activation command being sent twice. Imagine there is a parallel gateway with 2 incoming flows. What would happen is:

1. First flow is taken.
2. Second flow is taken. Incoming flows == taken flows so an activate command is sent.
3. Second flow is taken again. The first activate command has not been processed yet. The number of taken flows has not been reset. As a result incoming flows < taken flows. A second activate command is sent.
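
The steps above can be reproduced with a small simulation. This is only an illustrative sketch: `WriteTimeCheckSketch` and its methods are hypothetical names, not engine code, and a single counter stands in for the taken-flow state described above.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical, simplified model of the write-time check; not the engine's actual code. */
final class WriteTimeCheckSketch {
  private final int incomingFlows;
  private int takenFlows;
  final List<String> writtenCommands = new ArrayList<>();

  WriteTimeCheckSketch(int incomingFlows) {
    this.incomingFlows = incomingFlows;
  }

  /** Taking a flow bumps the counter and decides immediately whether to write the command. */
  void takeSequenceFlow() {
    takenFlows++;
    if (takenFlows >= incomingFlows) {
      writtenCommands.add("ACTIVATE_ELEMENT");
    }
  }

  /** The counter is only reset once an activate command is actually processed. */
  void processActivateCommand() {
    takenFlows = 0;
  }

  public static void main(String[] args) {
    WriteTimeCheckSketch gateway = new WriteTimeCheckSketch(2);
    gateway.takeSequenceFlow(); // first flow
    gateway.takeSequenceFlow(); // second flow: 2 >= 2, command written
    gateway.takeSequenceFlow(); // second flow again, nothing processed yet: 3 >= 2, command written
    System.out.println(gateway.writtenCommands.size()); // prints "2"
  }
}
```

Because `processActivateCommand` has not run between the second and third call, the counter is never reset and the duplicate command is written.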

This is solved by always sending an activate command when a sequence flow is taken. Once the `BpmnStreamProcessor` tries to process the record, it checks whether the state is valid. A check has been added there: when we receive an activate command for a parallel gateway, we first verify that all the incoming flows have been taken. If this is not the case, we reject the command.
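
A rough sketch of the process-time check, under stated assumptions: `ProcessTimeGuardSketch` is a hypothetical class, and the per-flow counters and the "decrement each flow by one on activation" step are assumptions made for illustration, not a description of the engine's state classes.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical model: always write the command, decide at processing time. */
final class ProcessTimeGuardSketch {
  private final Map<String, Integer> takenPerFlow = new LinkedHashMap<>();
  private int pendingCommands;
  final List<String> results = new ArrayList<>();

  ProcessTimeGuardSketch(List<String> incomingFlowIds) {
    for (String flowId : incomingFlowIds) {
      takenPerFlow.put(flowId, 0);
    }
  }

  /** Every taken flow writes an activate command, without any check. */
  void takeSequenceFlow(String flowId) {
    takenPerFlow.merge(flowId, 1, Integer::sum);
    pendingCommands++;
  }

  /** The guard runs when the command is processed: reject unless every flow was taken. */
  void processNextActivateCommand() {
    if (pendingCommands == 0) {
      throw new IllegalStateException("no pending activate command");
    }
    pendingCommands--;
    boolean allTaken = takenPerFlow.values().stream().allMatch(count -> count > 0);
    if (!allTaken) {
      results.add("REJECTED");
      return;
    }
    // Assumption: activation consumes one "taken" token per incoming flow.
    takenPerFlow.replaceAll((flowId, count) -> count - 1);
    results.add("ACTIVATED");
  }

  public static void main(String[] args) {
    ProcessTimeGuardSketch gateway = new ProcessTimeGuardSketch(List.of("flow1", "flow2"));
    gateway.takeSequenceFlow("flow1");
    gateway.takeSequenceFlow("flow2");
    gateway.takeSequenceFlow("flow2"); // the "bad" model takes flow2 twice
    for (int i = 0; i < 3; i++) {
      gateway.processNextActivateCommand();
    }
    System.out.println(gateway.results); // prints "[ACTIVATED, REJECTED, REJECTED]"
  }
}
```

In this sketch three commands are written, but only one of them activates the gateway; the other two are rejected because `flow1` has no remaining taken count.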

Related issues

closes #6778

Definition of Done

Not all items need to be done depending on the issue and the pull request.

Code changes:

The changes are backwards compatible with previous versions
If it fixes a bug then PRs are created to backport the fix to the last two minor versions. You can trigger a backport by assigning labels (e.g. backport stable/1.3) to the PR; in case that fails, you need to create backports manually.

Testing:

There are unit/integration tests that verify all acceptance criteria of the issue
New tests are written to ensure backwards compatibility with further versions
The behavior is tested manually
The change has been verified by a QA run
The impact of the changes is verified by a benchmark

Documentation:

The documentation is updated (e.g. BPMN reference, configuration, examples, get-started guides, etc.)
New content is added to the release announcement
If the PR changes how BPMN processes are validated (e.g. support new BPMN element) then the Camunda modeling team should be informed to adjust the BPMN linting.

Please refer to our review guidelines.

github-actions · 2022-07-12T08:05:43Z

Unit Test Results

  792 files ±    0   792 suites ±0 1h 42m 3s ⏱️ + 1m 24s
6 219 tests +219 6 211 ✔️ +219 8 💤 ±0 0 ❌ ±0
6 388 runs +219 6 380 ✔️ +219 8 💤 ±0 0 ❌ ±0

Results for commit aa9fab1. ± Comparison against base commit 275a7d2.

♻️ This comment has been updated with latest results.

remcowesterhoud · 2022-07-12T09:13:43Z

@korthout I have removed you as a reviewer for now, since the solution is flawed. Sorry!

...c/main/java/io/camunda/zeebe/engine/processing/bpmn/ProcessInstanceStateTransitionGuard.java

korthout · 2022-07-12T14:28:11Z

@remcowesterhoud I like the idea of rejecting the activate command for the parallel gateway when it isn't yet ready to activate. It would allow us to remove the logic that determines whether or not to write the activate command, and entirely move it to the processor.

This also makes it more in line with other concurrent behaviors (e.g. an element might be planned to be completed because a COMPLETE_ELEMENT was written for it on the log, but before it is processed a TERMINATE_ELEMENT is processed due to some interruption like a boundary event, we then reject the COMPLETE_ELEMENT command when we try to process it later).

To make this idea work for Start Process Instance Anywhere, we need something special. We'll need to increment the number of taken sequence flows before processing the ACTIVATE_ELEMENT of the parallel gateway:

for each parallel gateway that is targeted by a start instruction,
for each incoming sequence flow,
within the related flow scope instance.

We could do that by introducing a new event applier for ProcessInstanceCreation CREATED events, which is written (and applied) before the ACTIVATE_ELEMENT of the parallel gateway is processed.

The hard part is to find the related flow scope instance, because you don't have access to the key of the parallel gateway's element instance (i.e. the parallel gateway isn't created yet). There is no direct access to this, but you could traverse the process instance's process model using ElementInstanceState.getChildren(key).

@remcowesterhoud If you want we can talk about it in person.

…cess anywhere In the scenario where we start a process instance at a parallel gateway, we have to increment the number of taken sequence flows manually. If we don't, the new state transition guard will find no taken sequence flows while the gateway has N incoming flows, and the command is rejected. By incrementing the number of taken sequence flows for each incoming flow of the parallel gateway, we circumvent this rejection and activate the gateway as expected.
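
The seeding step described here could look roughly like the following. This is a hedged sketch only: `StartAtGatewaySketch`, `seedTakenFlows` and `guardAccepts` are hypothetical names, and the plain map stands in for whatever state the engine keeps per flow scope instance.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch; names are illustrative and not part of the engine. */
final class StartAtGatewaySketch {

  /** Marks every incoming flow of the targeted gateway as taken once. */
  static Map<String, Integer> seedTakenFlows(List<String> incomingFlowIds) {
    Map<String, Integer> taken = new HashMap<>();
    for (String flowId : incomingFlowIds) {
      taken.merge(flowId, 1, Integer::sum);
    }
    return taken;
  }

  /** Simplified guard: the activate command is accepted only if every incoming flow was taken. */
  static boolean guardAccepts(List<String> incomingFlowIds, Map<String, Integer> taken) {
    return incomingFlowIds.stream().allMatch(flowId -> taken.getOrDefault(flowId, 0) > 0);
  }

  public static void main(String[] args) {
    List<String> incoming = List.of("flow1", "flow2");
    // Without seeding, starting directly at the gateway would be rejected.
    System.out.println(guardAccepts(incoming, new HashMap<>())); // prints "false"
    // After seeding one taken flow per incoming flow, the guard lets the command through.
    System.out.println(guardAccepts(incoming, seedTakenFlows(incoming))); // prints "true"
  }
}
```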

This check has been moved to the ProcessInstanceStateTransitionGuard. We can always send the activate command. If a parallel gateway cannot be activated the command gets rejected. This might have a small impact on performance, as now we are sending commands and rejecting them, as opposed to not sending the commands at all.

remcowesterhoud · 2022-07-13T15:01:09Z

@korthout Thanks for your help ❤️

I have resolved the issues, please have a look!

korthout

🚀 Nice work @remcowesterhoud

🔧 Please make it clear in the PR description what this PR changes.
🔧 Please document the new Parallel Gateway behavior.

💭 This will probably not backport so easily to the older versions, because of the new event applier (those versions don't know about start instructions).

👍 LGTM

...main/java/io/camunda/zeebe/engine/processing/deployment/model/element/ExecutableProcess.java

.../main/java/io/camunda/zeebe/engine/state/appliers/ProcessInstanceCreationCreatedApplier.java

.../main/java/io/camunda/zeebe/engine/processing/bpmn/behavior/BpmnStateTransitionBehavior.java

engine/src/test/java/io/camunda/zeebe/engine/processing/bpmn/gateway/ParallelGatewayTest.java

remcowesterhoud · 2022-07-15T14:37:19Z

bors merge

zeebe-bors-camunda · 2022-07-15T14:44:28Z

Build failed:

Java checks

remcowesterhoud · 2022-07-15T14:53:58Z

bors retry

zeebe-bors-camunda · 2022-07-15T15:12:51Z

Build succeeded:

backport-action · 2022-07-15T15:14:05Z

Successfully created backport PR #9822 for stable/1.3.

backport-action · 2022-07-15T15:14:13Z

Successfully created backport PR #9823 for stable/8.0.

9822: [Backport stable/1.3] Reject duplicate parallel gateway activate command r=remcowesterhoud a=backport-action # Description Backport of #9759 to `stable/1.3`. relates to #6778 Co-authored-by: Remco Westerhoud <remco@westerhoud.nl>

9823: [Backport stable/8.0] Reject duplicate parallel gateway activate command r=remcowesterhoud a=backport-action # Description Backport of #9759 to `stable/8.0`. relates to #6778 Co-authored-by: Remco Westerhoud <remco@westerhoud.nl>

9825: Joining parallel gateway doc r=remcowesterhoud a=remcowesterhoud ## Description  Add a bpmn diagram explaining the activation of a joining parallel gateway ## Related issues  relates to #9759 Co-authored-by: Remco Westerhoud <remco@westerhoud.nl>

korthout · 2022-08-02T10:12:26Z

@remcowesterhoud Please add this to the release notes

remcowesterhoud added backport stable/1.3 labels Jul 12, 2022

remcowesterhoud requested a review from korthout July 12, 2022 07:52

remcowesterhoud removed the request for review from korthout July 12, 2022 09:13

korthout reviewed Jul 12, 2022

...c/main/java/io/camunda/zeebe/engine/processing/bpmn/ProcessInstanceStateTransitionGuard.java (outdated, resolved)

remcowesterhoud added 2 commits July 13, 2022 16:41

remcowesterhoud requested a review from korthout July 13, 2022 15:00

korthout approved these changes Jul 14, 2022

korthout reviewed Jul 14, 2022

engine/src/test/java/io/camunda/zeebe/engine/processing/bpmn/gateway/ParallelGatewayTest.java (outdated, resolved)

refactor: general code improvements

1f5a1dd

test: add extra assertion

aa9fab1

Expand the test to also assert that we reject the command twice, and only activate it once.

remcowesterhoud force-pushed the 6778_negate_active_sequence_flow_count branch from 327fd61 to aa9fab1 Compare July 15, 2022 14:47

zeebe-bors-camunda bot merged commit 2c5304e into main Jul 15, 2022

zeebe-bors-camunda bot deleted the 6778_negate_active_sequence_flow_count branch July 15, 2022 15:12

backport-action mentioned this pull request Jul 15, 2022

[Backport stable/1.3] Reject duplicate parallel gateway activate command #9822

Merged

backport-action mentioned this pull request Jul 15, 2022

[Backport stable/8.0] Reject duplicate parallel gateway activate command #9823

Merged

skayliu mentioned this pull request Jul 16, 2022

Support diverging inclusive gateway #9747

Merged

10 tasks

korthout mentioned this pull request Jul 19, 2022

Joining parallel gateway doc #9825

Merged

10 tasks

npepinpe added version:1.3.13 release/8.0.5 labels Aug 1, 2022

npepinpe added the version:8.1.0-alpha4 label Aug 2, 2022

Zelldon added the version:8.1.0 Marks an issue as being completely or in parts released in 8.1.0 label Oct 4, 2022
