Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pyspark] Cleanup data processing. #8088

Merged
merged 3 commits into from Jul 26, 2022
Merged

Commits on Jul 25, 2022

  1. pyspark] Cleanup data processing.

    - Use numpy stack for handling list of arrays.
    - Reuse concat function from dask.
    - Prepare for `QuantileDMatrix`.
    - Remove unused code.
    - Use iterator for prediction to avoid initializing xgboost model
    trivialfis committed Jul 25, 2022
    Copy the full SHA
    a7c98c6 View commit details
    Browse the repository at this point in the history
  2. Format.

    trivialfis committed Jul 25, 2022
    Copy the full SHA
    320d0e8 View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2022

  1. Format.

    trivialfis committed Jul 26, 2022
    Copy the full SHA
    5cb8589 View commit details
    Browse the repository at this point in the history