
Calculate base_score based on input labels. #8107

Merged
34 commits merged on Sep 20, 2022

Conversation

trivialfis
Member

@trivialfis trivialfis commented Jul 22, 2022

#4321 .

This PR calculates base_score from the labels for L1 regression and saves it to the output model. Other objectives will be handled in follow-up work.

  • Configure model parameter for base_score.
  • Add estimation function in objective.
  • Change base_score to an array. At the moment only one element is used; the change prepares for multi-class and multi-output once some legacy code in the binary model can be removed.

Multi-target and multi-class are not yet supported due to the binary model parameter.
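The estimation step amounts to choosing the constant prediction that minimizes the objective over the training labels; for L1 (absolute error) regression that constant is the median. A minimal NumPy sketch of the idea (the function name is illustrative, not the actual XGBoost API):

```python
import numpy as np

def estimate_base_score_l1(labels: np.ndarray) -> float:
    """For L1 loss, the constant c minimizing sum(|y - c|)
    is the median of the labels."""
    return float(np.median(labels))

print(estimate_base_score_l1(np.array([1.0, 2.0, 2.0, 10.0])))  # 2.0
```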

@trivialfis trivialfis marked this pull request as draft July 27, 2022 06:27
@trivialfis
Member Author

The old binary format and the gpu_id configuration are probably too difficult to work around in this PR.

@trivialfis trivialfis marked this pull request as ready for review August 23, 2022 09:01
@trivialfis
Member Author

> The old binary format and the gpu_id configuration are probably too difficult to work around in this PR.

I worked around it by limiting base_score to a single scalar for now.

// - model loaded from new binary or JSON.
// - model is created from scratch.
// - model is configured second time due to change of parameter
CHECK(obj_);
Member

This configuration is very fragile.

Member Author

I agree. That's why I really want to remove the old model format.

Resolved review threads: src/learner.cc (outdated), src/objective/objective.cc
@trivialfis trivialfis marked this pull request as draft September 13, 2022 21:02
@trivialfis trivialfis marked this pull request as ready for review September 14, 2022 09:49
@trivialfis
Copy link
Member Author

I removed the use of NaN as the base-score flag to avoid breaking changes in downstream libraries (like treelite). Instead, a new base_score_estimated variable is introduced; since it is not read during model load, its encoding does not need to stay stable.
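The intent can be sketched as follows: only base_score is part of the persisted model, while base_score_estimated is transient bookkeeping that prevents re-estimation from clobbering a configured or loaded value. Class and method names here are hypothetical, not the actual XGBoost internals:

```python
import json

class ModelConfig:
    """Hypothetical sketch of the base_score_estimated semantics."""

    def __init__(self, base_score: float = 0.5):
        self.base_score = base_score
        # Transient flag: records whether base_score was already
        # estimated, so re-configuration does not overwrite it.
        self.base_score_estimated = False

    def maybe_estimate(self, labels):
        if not self.base_score_estimated:
            # Median of the labels as the L1 estimate.
            self.base_score = float(sorted(labels)[len(labels) // 2])
            self.base_score_estimated = True

    def save(self) -> str:
        # Only base_score belongs to the stable model format.
        return json.dumps({"base_score": self.base_score})

    @classmethod
    def load(cls, payload: str) -> "ModelConfig":
        cfg = cls(json.loads(payload)["base_score"])
        cfg.base_score_estimated = True  # never re-estimate a loaded model
        return cfg
```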

// average base score across all valid workers
rabit::Allreduce<rabit::op::Sum>(out.Values().data(), out.Values().size());
std::transform(linalg::cbegin(out), linalg::cend(out), linalg::begin(out),
[world](float v) { return v / world; });
Member

Better than it was before. I wonder whether it can be made more robust with weighted averaging; the MSE version will also need a weighted average. Small example:
Worker 0 labels: 0 0 0
Worker 1 labels: 1000
True median: 0 (mean abs error: 250)
Estimated median, current method: 500 (mean abs error: 500)
Estimated median, weighted average: 250 (mean abs error: 375)
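The numbers above can be reproduced with a short sketch (plain NumPy on the pooled labels, not the distributed code path):

```python
import numpy as np

w0 = np.array([0.0, 0.0, 0.0])   # worker 0 labels
w1 = np.array([1000.0])          # worker 1 labels
all_labels = np.concatenate([w0, w1])

true_median = float(np.median(all_labels))                 # 0.0
simple_avg = (np.median(w0) + np.median(w1)) / 2           # 500.0
weighted_avg = (np.median(w0) * len(w0)
                + np.median(w1) * len(w1)) / len(all_labels)  # 250.0

def mae(pred):
    """Mean absolute error of a constant prediction."""
    return float(np.mean(np.abs(all_labels - pred)))

print(mae(true_median))   # 250.0
print(mae(simple_avg))    # 500.0
print(mae(weighted_avg))  # 375.0
```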

Member Author

Thank you for the suggestion; I changed it to the weighted average and adapted your example into a Python test.

@trivialfis trivialfis merged commit fffb1fc into dmlc:master Sep 20, 2022
@trivialfis trivialfis deleted the init-estimation branch September 20, 2022 12:53