
[POC] Experimental support for l1 error. #7812

Merged
merged 127 commits into dmlc:master on Apr 26, 2022

Conversation

trivialfis
Member

@trivialfis trivialfis commented Apr 15, 2022

Support adaptive trees, a feature supported by both sklearn and lightgbm. Tree leaves are recomputed from the residuals between labels and predictions after the tree is constructed.

For l1 error, the optimal leaf value is the median (the 50th percentile) of the residuals.

This is marked as experimental support for the following reasons:

  • The value is not well defined for distributed training, where a leaf may be empty on some local workers. Right now I just use the original leaf value when computing the average with the other workers, which might introduce significant error.
  • The GPU implementation is not ideal due to sampling support. The partition is calculated from node positions; if a node doesn't contain any valid sample, it is missing from the partitioner and we have to fill it back in.
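The leaf recomputation described above can be sketched as follows. This is an illustration only, not XGBoost's actual implementation; `update_tree_leaves` is a hypothetical helper:

```python
import numpy as np

def update_tree_leaves(leaf_ids, labels, predictions, alpha=0.5):
    """Recompute each leaf value as the alpha-quantile (the median for l1)
    of the residuals of the samples that fall into that leaf.

    leaf_ids:     per-sample leaf index after tree construction
    labels:       per-sample targets
    predictions:  per-sample model outputs before this tree's contribution
    Returns a dict mapping leaf id -> new leaf value.
    """
    residuals = labels - predictions
    new_values = {}
    for leaf in np.unique(leaf_ids):
        new_values[leaf] = float(np.quantile(residuals[leaf_ids == leaf], alpha))
    return new_values

# Example: two leaves; each leaf value becomes the median residual of its samples.
leaf_ids = np.array([0, 0, 0, 1, 1])
labels = np.array([3.0, 5.0, 9.0, 1.0, 2.0])
preds = np.zeros(5)
update_tree_leaves(leaf_ids, labels, preds)  # {0: 5.0, 1: 1.5}
```

Note that an empty leaf simply never appears in `leaf_ids` here, which is exactly the distributed-training ambiguity described above: there is no local median to contribute to the cross-worker average.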

Member

@RAMitchell RAMitchell left a comment

Looks significantly better to me without trying to share the internal row representations inside the updaters.

auto p_begin = row_set.Data()->data();
ParallelFor(row_set.Size(), ctx->Threads(), [&](size_t i) {
auto const& node = row_set[i];
if (node.node_id < 0 || !tree[node.node_id].IsLeaf()) {
Member

Can this actually occur?

Member Author

On the CPU, all nodes are kept track of, including internal nodes. For node_id < 0 it should be an internal node, so we can skip the IsLeaf check. I will see if there's any edge case I missed.

Member Author

Turned the condition into a check.

src/tree/gpu_hist/row_partitioner.cuh
});
} else {
// Avoid copying nodes by using evaluator to get leaf weight on-the-fly, this is
// useful when tree leaf is not updated after tree construction.
Member

I don't think it's worth keeping this optimisation.

Member Author

Haven't run the full benchmark yet. Copying nodes is probably fine, but I'm not sure about the memory allocation.

Member Author

Removed.

info_->num_col_,
batch_param));
dh::safe_cuda(cudaSetDevice(ctx_->gpu_id));
info_->feature_types.SetDevice(ctx_->gpu_id);
Member

Lots of unrelated changes in this PR.

Member Author

The device_ -> ctx_->gpu_id change might not be strictly related; I made it when I ran into a segfault while testing with gpu_id set to 1. I can revert those changes if needed, but I think they are easy to skip over during review.

@trivialfis
Member Author

trivialfis commented Apr 25, 2022

This is an initial benchmark with gbm-bench. I haven't benchmarked the newly added l1 error yet, which can benefit from some optimizations in the quantile computation; I will run that benchmark once the related work is finished.

|                   | bosch Time | bosch AUC | higgs Time | higgs AUC | years Time | years MSE |
| ----------------- | ---------- | --------- | ---------- | --------- | ---------- | --------- |
| CPU Hist (master) | 79.09      | 0.690255  | 141.61     | 0.839929  | 30.87      | 79.7774   |
| CPU Hist (PR)     | 77.39      | 0.690255  | 137.96     | 0.839929  | 24.73      | 79.7774   |
| GPU Hist (master) | 21.84      | 0.688726  | 23.57      | 0.839535  | 10.98      | 80.3176   |
| GPU Hist (PR)     | 21.81      | 0.688726  | 24.51      | 0.839535  | 10.88      | 79.9101   |

@trivialfis trivialfis merged commit fdf533f into dmlc:master Apr 26, 2022
2.0 Roadmap automation moved this from 2.0 In Progress to 2.0 Done Apr 26, 2022
@trivialfis trivialfis deleted the adaptive-tree branch April 26, 2022 13:41
@trivialfis trivialfis removed this from 2.0 Done in 2.0 Roadmap Sep 28, 2022
@trivialfis trivialfis added this to In progress in 1.7 Roadmap via automation Sep 28, 2022
@trivialfis trivialfis moved this from In progress to Done in 1.7 Roadmap Sep 28, 2022
@s-banach

Hey @trivialfis, looks like great work!
Sorry I can't read C++ very well, so I hope I can ask for clarification here.
The 1.7.0 release notes claim you can optimize "without a valid hessian".
Does that mean the hessian is ignored completely when choosing splits?

It seems to be implicit in the xgboost paper that the hessian is supposed to be nonnegative.
I wonder if this approach would allow custom objectives where the hessian may sometimes be negative?

@trivialfis
Member Author

@s-banach

At the moment XGBoost uses the sample weight as the hessian (default 1) for l1, and recomputes the leaf values from the input labels after growing the tree.
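Since the sample weight stands in for the hessian, the recomputed leaf value is in effect a weighted quantile of the residuals. A minimal sketch of a weighted quantile under my own assumptions (not XGBoost's actual code, which uses sketch-based quantile estimation):

```python
import numpy as np

def weighted_quantile(values, weights, alpha=0.5):
    """Smallest value v such that the cumulative weight of samples <= v
    reaches alpha of the total weight (a weighted median when alpha=0.5)."""
    values = np.asarray(values, dtype=float)
    weights = np.asarray(weights, dtype=float)
    order = np.argsort(values)
    v, w = values[order], weights[order]
    cdf = np.cumsum(w) / np.sum(w)          # normalised cumulative weight
    return float(v[np.searchsorted(cdf, alpha)])

# With unit weights this is an (upper) median; a heavier sample pulls it over.
weighted_quantile([1.0, 2.0, 10.0], [1.0, 1.0, 1.0])  # 2.0
weighted_quantile([1.0, 2.0, 10.0], [1.0, 1.0, 5.0])  # 10.0
```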

I wonder if this approach would allow custom objectives where the hessian may sometimes be negative

We haven't exposed the feature to custom objectives yet, unless you are writing C++ extensions.

@kayhman

kayhman commented Jan 17, 2023

Hi @trivialfis,

This is a very interesting approach. How do you update the value after the tree is built?

The changelog mentions line search, but I see no mention of it in the code. I guess the objectives' UpdateTreeLeaf method does the job (in UpdateTreeLeafHost?), but I don't understand what mathematical operation is performed.

Can you provide more details?

Best regards,

@kayhman

kayhman commented Jan 17, 2023

OK, I see. The line search is done through the alpha parameter, and you use the alpha-quantile as the leaf value?

And as far as I can see, alpha is always set to 0.5.

Is that right?

@trivialfis
Member Author

For l1, the optimal leaf value is the median, which is the 0.5 quantile.
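That the median minimizes the l1 error can be checked numerically. A small self-contained illustration (not code from the PR):

```python
import numpy as np

# Residuals falling into a hypothetical leaf, with one large outlier.
y = np.array([1.0, 2.0, 3.0, 10.0, 20.0])

def l1_loss(c):
    """Total absolute error of predicting the constant c for every sample."""
    return np.abs(y - c).sum()

# Scan candidate constant leaf values: the l1 minimiser coincides with the
# median, while the mean (the l2 minimiser) is pulled toward the outlier.
candidates = np.linspace(0.0, 20.0, 2001)
best = candidates[np.argmin([l1_loss(c) for c in candidates])]
# best recovers np.median(y) == 3.0, whereas y.mean() is 7.2
```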

@kayhman

kayhman commented Jan 22, 2023

OK, thanks for the answer. So there's no need for a line search in this case.
