New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support more input types for categorical data. #7220
Conversation
Codecov Report
@@ Coverage Diff @@
## master #7220 +/- ##
==========================================
- Coverage 82.63% 82.60% -0.03%
==========================================
Files 13 13
Lines 4019 4024 +5
==========================================
+ Hits 3321 3324 +3
- Misses 698 700 +2
Continue to review full report at Codecov.
|
python-package/xgboost/core.py
Outdated
Set types for features. When `enable_categorical` is set to `True`, string | ||
"c" represents categorical data type. For numerical data, it can be one for | ||
the following: | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't understand what happens when I change the numerical types, is there more documentation somewhere?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not at the moment, it's only used for text model dump. Actually, I have been thinking if it's possible to remove them along with the fmap
parameter. (they are the same thing).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I removed the doc for numerical data. Let's just keep it as it's for now.
* Shorten the type name from "categorical" to "c". * Tests for np/cp array and scipy csr/csc/coo. * Specify the type for feature info.
7b629f3
to
9981295
Compare
That's interesting (and unrelated to this PR):
|
Remaining type: