
ensure that only registered cluster workers are asked to report metrics #182

Open · wants to merge 4 commits into master
Conversation

@orestis commented Mar 26, 2018

This PR is related to #181 and is intended as a starting point for discussion.

Tests don't pass yet because I used spaces instead of tabs :)

@zbjornson (Collaborator) left a comment

The ordering issue (new AggregatorRegistry() vs. cluster.fork()) is a breaking change in this PR's current state. It might be sufficient for the worker to retry sending the init message until the master acknowledges, for up to some amount of time. Sort of adds a lot of code...
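Roughly the kind of thing I mean on the worker side (a sketch only; the message type names are made up here and not part of this PR):

```js
const RETRY_INTERVAL_MS = 200;
const RETRY_TIMEOUT_MS = 5000;

function registerWithMaster() {
	const started = Date.now();
	let timer;

	const onAck = message => {
		if (message && message.type === 'prom-client:register-ack') {
			clearInterval(timer); // master has seen us; stop retrying
			process.removeListener('message', onAck);
		}
	};
	process.on('message', onAck);

	timer = setInterval(() => {
		if (Date.now() - started > RETRY_TIMEOUT_MS) {
			// Give up after a bounded amount of time.
			clearInterval(timer);
			process.removeListener('message', onAck);
			return;
		}
		process.send({ type: 'prom-client:register' });
	}, RETRY_INTERVAL_MS);
}
```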

Didn't comment on the cleanup stuff (stray console.log, linter stuff).

lib/cluster.js Outdated
that.aliveWorkers.add(workerId);
console.log(that.aliveWorkers);
}
});
@zbjornson (Collaborator):
Instead of adding another cluster.on("message") listener, can you do this in addListeners?

@orestis (Author):
To do that I might have to move aliveWorkers to be a module-level variable. Would that be OK?
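Something like this, perhaps (just a sketch; the listenersAdded guard and the message-type constants are assumptions about lib/cluster.js, not the actual code):

```js
const cluster = require('cluster');

const GET_METRICS_RES = 'prom-client:getMetricsRes'; // existing response message (name assumed)
const WORKER_STARTED = 'prom-client:workerStarted';  // hypothetical registration message

// Module-level so both addListeners and the aggregation code can see it.
const aliveWorkers = new Set();
let listenersAdded = false;

function addListeners() {
	if (listenersAdded) return;
	listenersAdded = true;

	cluster.on('message', (worker, message) => {
		if (message.type === GET_METRICS_RES) {
			// ...existing handling of metrics responses...
		} else if (message.type === WORKER_STARTED) {
			aliveWorkers.add(worker.id);
		}
	});

	// Forget workers that have gone away so they are not asked for metrics.
	cluster.on('exit', worker => {
		aliveWorkers.delete(worker.id);
	});
}
```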

lib/cluster.js Outdated
worker.send(message)
}
}
that.aliveWorkers = new Set([...that.aliveWorkers].filter(x => !failedWorkers.has(x)));
@zbjornson (Collaborator):
Can you just do this.aliveWorkers.delete(...) in the loop, instead of making a new Set instance each time?

@orestis (Author):
I'll have to check the semantics of mutating the Set while iterating, but yes will change.
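For reference, deleting from a Set while iterating it with for...of is well-defined (removed entries simply won't be visited), and the method is Set.prototype.delete, so no intermediate Set is needed:

```js
const aliveWorkers = new Set([1, 2, 3, 4]);
const failedWorkers = new Set([2, 4]);

// Deleting entries during iteration is safe in JS.
for (const workerId of aliveWorkers) {
	if (failedWorkers.has(workerId)) {
		aliveWorkers.delete(workerId);
	}
}

console.log([...aliveWorkers]); // [ 1, 3 ]
```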


 * Options are:
 *   coordinated: If false (default), request metrics from all cluster workers.
 *                If true, request metrics only from workers that have required prom-client.
 * @param {object?} options object
@zbjornson (Collaborator):
Is the only reason this needs to be an option because setting it to true makes the order of forking vs. new AggregatorRegistry() matter? I'd rather go for an implementation that isn't sensitive to that, e.g. the worker repeatedly attempts to register with the master until the master acks the registration.

Having a heterogeneous pool of workers (not all of them setting up prom-client) is unusual, but I think the behavior achieved when this is true is what should happen by default.

@orestis (Author):
Yes, the only reason to make it an option is that changing the default behaviour would break backwards compatibility.

Given that a metrics client should be as unobtrusive as possible, I would be wary of adding a prolonged discovery phase. I would certainly prefer in my consuming codebase to keep things simple and accept that ordering matters.

In the case of a homogeneous cluster, the previous implementation cleanly sidesteps a lot of the issues with "garbage collecting" the workers, and nobody else has complained so far :)
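To illustrate the ordering requirement on the master side (assuming, per the JSDoc above, that the option is passed to the AggregatorRegistry constructor; the rest is illustrative):

```js
const cluster = require('cluster');
const { AggregatorRegistry } = require('prom-client');

if (cluster.isMaster) {
	// Construct the registry first so its message listeners are installed
	// before any worker announces itself.
	const aggregatorRegistry = new AggregatorRegistry({ coordinated: true });

	// Forking before the registry exists would mean early workers' registration
	// messages are lost, so they would never be asked for metrics.
	for (let i = 0; i < 4; i++) {
		cluster.fork();
	}

	// Later, e.g. from a /metrics handler:
	// aggregatorRegistry.clusterMetrics(...)
} else {
	// Workers just require prom-client as usual.
	require('prom-client');
}
```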

@zbjornson (Collaborator):
FWIW, we will be breaking backwards compatibility pretty hard soon-ish (see #177, #178 and #180), so don't let semver stop you from writing the code you'd like to write :D

help: 'Number of connected cluster workers reporting to prometheus',
registers: [registry]
});
g.set(request.workerCount);
@zbjornson (Collaborator):
Might want to move this new metric to a separate PR. (If @siimon and @SimenB are okay with it, it's a simple change that could land before this PR is ironed out.)

@orestis (Author):
Noted. However, this metric is very specific to the coordinated option, so perhaps it should only be exposed when coordinated is true. Alternatively, a similar metric that counts all cluster workers could be added to the default metrics?
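Something like this could work as a default metric (the metric name and wiring here are hypothetical, not part of this PR):

```js
const cluster = require('cluster');
const { Gauge, register } = require('prom-client');

// Hypothetical default metric: counts every forked worker, whether or not it
// reports metrics to the aggregator.
const workerCount = new Gauge({
	name: 'nodejs_cluster_workers_total',
	help: 'Number of cluster workers currently forked',
	registers: [register]
});

if (cluster.isMaster) {
	cluster.on('fork', () => workerCount.inc());
	cluster.on('exit', () => workerCount.dec());
}
```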

@zbjornson (Collaborator) commented
(Sorry for the delay, busy week. Will try to send more feedback later today.)

@SimenB (Collaborator) commented Sep 19, 2018

Any news here?

@zbjornson (Collaborator) commented
Sorry to drop the ball.

I'm sorta hesitant to add the complexity to support an unusual use case (heterogeneous workers in a cluster), but this approach works.

My preference would be a version that isn't sensitive to the order of new AggregatorRegistry() vs. cluster.fork() in the client code, but since that only matters if you opt-in to coordinated and it's an advanced usage scenario, those advanced users could be expected to understand/deal with that requirement.

The PR also still has some formatting/linting issues that need to be addressed.
