Minimal inclusive scan #6234
Conversation
This reverts commit 9a3e7fa.
This reverts commit 8e2e611.
I've got a conda error on the Windows test that seems unrelated; otherwise I think this is ready to go.
LGTM overall. How do we plan to test that DDQDM works with more than 2^31 elements?
I have this script, but it takes > 20 s, most of which is DMatrix construction time. I don't think that's fast enough for CI.

```python
import numpy as np
import xgboost as xgb
import cupy as cp
import time

# Test for integer overflow or out of memory exceptions
def test_large_input():
    # n * m = 2^31
    n = 1000
    m = (1 << 31) // n
    m //= 2
    X = cp.ones((m, n), dtype=np.float32)
    y = cp.ones(m)
    start = time.time()
    dmat = xgb.DeviceQuantileDMatrix(X, y)
    print(time.time() - start)
    start = time.time()
    xgb.train({"tree_method": "gpu_hist", "max_depth": 1}, dmat, 1)
    print(time.time() - start)
```
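For reference, the size arithmetic in that script works out to just under 2^30 total elements after the halving; a quick sanity check using the same expressions:

```python
# Reproduce the matrix-size arithmetic from the test script above
n = 1000
m = (1 << 31) // n   # rows before halving: 2,147,483
m //= 2              # halved: 1,073,741 rows
total = n * m        # 1,073,741,000 elements, just under 2^30
print(m, total)
```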
I tried it again on a V100 and the test takes 4 s, so I've enabled it, skipping when less than 15 GB of device memory is available.
@RAMitchell Thanks. I've recently changed the CI configuration so that each GPU worker runs only one job at a time, so you'll have access to all 16 GB of memory.
LGTM. Let's see if the large matrix test completes in a reasonable amount of time.
```python
# Test for integer overflow or out of memory exceptions
def test_large_input():
    available_bytes, _ = cp.cuda.runtime.memGetInfo()
```
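The skip condition itself could be factored into a small helper like the hypothetical `has_enough_memory` below (not from the PR; just illustrating the check against the free-bytes figure that `cp.cuda.runtime.memGetInfo()` returns first):

```python
def has_enough_memory(available_bytes, min_gb=15):
    """Return True when at least `min_gb` of device memory is free.

    `available_bytes` would be the first element of the tuple
    returned by cp.cuda.runtime.memGetInfo().
    """
    return available_bytes >= min_gb * (1024 ** 3)
```

A test could then call `pytest.skip(...)` whenever this returns False, so CI workers with smaller cards don't fail spuriously.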
Currently, this test will skip when `--use-rmm-pool` is set, as cuPy is not configured to use the RMM allocator. To make it work with RMM, we'd need to run `cupy.cuda.set_allocator(rmm.rmm_cupy_allocator)`. Do we want to enable this test with RMM?
Don't think this is necessary for now.
Reported duration of
I think we should keep the test.
Attempt at a small inclusive scan implementation to get us around the Thrust 2^31 limit (#6228).
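The usual way around a 32-bit scan size limit is to scan in segments and carry the running total across segment boundaries. A minimal pure-Python sketch of that idea (an illustration only, not the PR's CUDA implementation):

```python
def inclusive_scan(values, segment_size=4):
    """Inclusive prefix sum computed segment by segment.

    Each segment is scanned independently, then offset by the running
    total (carry) of all preceding segments, so no single scan pass
    ever touches more than `segment_size` elements.
    """
    out = []
    carry = 0
    for start in range(0, len(values), segment_size):
        running = carry
        for v in values[start:start + segment_size]:
            running += v
            out.append(running)
        carry = running
    return out

# inclusive_scan([1, 2, 3, 4, 5, 6]) -> [1, 3, 6, 10, 15, 21]
```

On the GPU the same carry trick would be applied between launches of a bounded-size device scan, which is what lets inputs exceed the 2^31-element limit of a single call.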