[R] Fix global feature importance and predict with 1 sample. #7394

trivialfis · 2021-11-04T11:07:43Z

Add implementation for tree index. The parameter is not documented in C API since we
should work on porting the model slicing to R instead of supporting more use of tree
index.
Fix the difference between "gain" and "total_gain".

* Add implementation for tree index. The parameter is not documented in C API since we should work on porting the model slicing to R instead of supporting more use of tree index. * Fix the difference between "gain" and "total_gain".

trivialfis · 2021-11-04T12:28:38Z

@hcho3 @hetong007 Please take a look when you are available.

hetong007 · 2021-11-04T12:47:21Z

Just to confirm, with this patch, xgboost won't break EIX/radiant.model, right?

trivialfis · 2021-11-04T14:24:56Z

I ran devtools::check() on radiant.model and it passed. For EIX, the vignettes failed to build and it doesn't have any test in its repository https://github.com/ModelOriented/EIX .

  installing the package to build vignettes
E  creating vignettes (26.5s)
   --- re-building ‘EIX.Rmd’ using rmarkdown
      [[ suppressing 19 column names 'satisfaction_level', 'last_evaluation', 'number_project' ... ]]
   Warning: ggrepel: 2 unlabeled data points (too many overlaps). Consider increasing max.overlaps
   Quitting from lines 157-165 (EIX.Rmd) 
   Error: processing vignette 'EIX.Rmd' failed with diagnostics:
   non-numeric matrix extent
   --- failed re-building ‘EIX.Rmd’
   
   --- re-building ‘titanic_data.Rmd’ using rmarkdown
   Warning: ggrepel: 4 unlabeled data points (too many overlaps). Consider increasing max.overlaps
   Quitting from lines 81-86 (titanic_data.Rmd) 
   Error: processing vignette 'titanic_data.Rmd' failed with diagnostics:
   non-numeric matrix extent
   --- failed re-building ‘titanic_data.Rmd’
   
   SUMMARY: processing the following files failed:
     ‘EIX.Rmd’ ‘titanic_data.Rmd’
   
   Error: Vignette re-building failed.
   Execution halted
Error in (function (command = NULL, args = character(), error_on_status = TRUE,  : 
  System command 'R' failed, exit status: 1, stdout + stderr (last 10 lines):

trivialfis · 2021-11-04T14:29:36Z

Same error as you have shared. Rerunning tests with 1.4

trivialfis · 2021-11-04T20:15:08Z

@hcho3 sorry, pushed a new commit for the fix in prediction leaf, where input is only one sample but we need to return a matrix instead of vector. I rewrote the prediction conditions to mimic the old code exactly and ran tests with those reverse dependencies.

trivialfis · 2021-11-04T20:23:28Z

@hetong007 I have tested both packages using devtools.

trivialfis · 2021-11-05T07:16:53Z

I will back port

* [R] Fix global feature importance. * Add implementation for tree index. The parameter is not documented in C API since we should work on porting the model slicing to R instead of supporting more use of tree index. * Fix the difference between "gain" and "total_gain". * debug. * Fix prediction.

…e. (#7394) (#7397) * [R] Fix global feature importance. * Add implementation for tree index. The parameter is not documented in C API since we should work on porting the model slicing to R instead of supporting more use of tree index. * Fix the difference between "gain" and "total_gain". * debug. * Fix prediction.

[R] Fix global feature importance.

c008ff3

* Add implementation for tree index. The parameter is not documented in C API since we should work on porting the model slicing to R instead of supporting more use of tree index. * Fix the difference between "gain" and "total_gain".

trivialfis mentioned this pull request Nov 4, 2021

1.5.0 Release Candidate #7260

Closed

8 tasks

debug.

3f07c34

hcho3 approved these changes Nov 4, 2021

View reviewed changes

Fix prediction.

7663e63

trivialfis changed the title ~~[R] Fix global feature importance.~~ [R] Fix global feature importance and predict with 1 sample. Nov 4, 2021

hcho3 approved these changes Nov 4, 2021

View reviewed changes

hetong007 merged commit c968217 into dmlc:master Nov 5, 2021

trivialfis deleted the fix-R-gfi branch November 5, 2021 07:16

trivialfis added this to 1.5.1 Done in 2.0 Roadmap Nov 5, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[R] Fix global feature importance and predict with 1 sample. #7394

[R] Fix global feature importance and predict with 1 sample. #7394

trivialfis commented Nov 4, 2021

trivialfis commented Nov 4, 2021

hetong007 commented Nov 4, 2021

trivialfis commented Nov 4, 2021

trivialfis commented Nov 4, 2021

trivialfis commented Nov 4, 2021 •

edited

trivialfis commented Nov 4, 2021

trivialfis commented Nov 5, 2021

[R] Fix global feature importance and predict with 1 sample. #7394

[R] Fix global feature importance and predict with 1 sample. #7394

Conversation

trivialfis commented Nov 4, 2021

trivialfis commented Nov 4, 2021

hetong007 commented Nov 4, 2021

trivialfis commented Nov 4, 2021

trivialfis commented Nov 4, 2021

trivialfis commented Nov 4, 2021 • edited

trivialfis commented Nov 4, 2021

trivialfis commented Nov 5, 2021

trivialfis commented Nov 4, 2021 •

edited