Releases · rapidsai/rmm

24 May 21:55

rapids-bot

v24.08.00a

bd5135e

[NIGHTLY] v24.08.00 Pre-release

Pre-release

🔗 Links

🛠️ Improvements

Use rapids-build-backend. (#1502) @bdice

Contributors

bdice

Assets 2

10 Apr 14:23

raydouglass

v24.04.00

8f19c9c

v24.04.00 Latest

Latest

🚨 Breaking Changes

Accept stream argument in DeviceMemoryResource allocate/deallocate (#1494) @wence-
Replace all internal usage of get_upstream with get_upstream_resource (#1491) @miscco
Deprecate rmm::mr::device_memory_resource::supports_streams() (#1452) @harrism
Remove deprecated rmm::detail::available_device_memory (#1438) @harrism
Make device_memory_resource::supports_streams() not pure virtual. Remove derived implementations and calls in RMM (#1437) @harrism
Deprecate rmm::mr::device_memory_resource::get_mem_info() and supports_get_mem_info(). (#1436) @harrism

🐛 Bug Fixes

Fix search path for torch allocator in editable installs and ensure CUDA support is available (#1498) @vyasr
Accept stream argument in DeviceMemoryResource allocate/deallocate (#1494) @wence-
Run STATISTICS_TEST and TRACKING_TEST in serial to avoid OOM errors. (#1487) @bdice

📖 Documentation

Pin to recent breathe, to prevent getting an unsupported sphinx version. (#1495) @bdice

🚀 New Features

Replace all internal usage of get_upstream with get_upstream_resource (#1491) @miscco
Add complete set of resource ref aliases (#1479) @nvdbaranec
Automate include grouping using clang-format (#1463) @harrism
Add get_upstream_resource to resource adaptors (#1456) @miscco
Deprecate rmm::mr::device_memory_resource::supports_streams() (#1452) @harrism
Remove duplicated memory_resource_tests (#1451) @miscco
Change rmm::exec_policy to take async_resource_ref (#1449) @miscco
Change device_scalar to take async_resource_ref (#1447) @miscco
Add device_async_resource_ref convenience alias (#1441) @harrism
Remove deprecated rmm::detail::available_device_memory (#1438) @harrism
Make device_memory_resource::supports_streams() not pure virtual. Remove derived implementations and calls in RMM (#1437) @harrism
Deprecate rmm::mr::device_memory_resource::get_mem_info() and supports_get_mem_info(). (#1436) @harrism
Support CUDA 12.2 (#1419) @jameslamb

🛠️ Improvements

Use conda env create --yes instead of --force (#1509) @bdice
Add upper bound to prevent usage of NumPy 2 (#1501) @bdice
Remove hard-coding of RAPIDS version where possible (#1496) @KyleFromNVIDIA
Requre NumPy 1.23+ (#1488) @jakirkham
Use rmm::device_async_resource_ref in multi_stream_allocation benchmark (#1482) @miscco
Update devcontainers to CUDA Toolkit 12.2 (#1470) @trxcllnt
Add support for Python 3.11 (#1469) @jameslamb
target branch-24.04 for GitHub Actions workflows (#1468) @jameslamb
[FEA]: Use std::optional instead of thrust::optional (#1464) @miscco
Add environment-agnostic scripts for running ctests and pytests (#1462) @trxcllnt
Ensure that ctest is called with --no-tests=error. (#1460) @bdice
Update ops-bot.yaml (#1458) @AyodeAwe
Adopt the rmm::device_async_resource_ref alias (#1454) @miscco
Refactor error.hpp out of detail (#1439) @lamarrr

Contributors

trxcllnt, harrism, and 10 other contributors

Assets 2

27 Mar 18:40

rapids-bot

v24.06.00a

fb23a13

[NIGHTLY] v24.06.00 Pre-release

Pre-release

🔗 Links

🚨 Breaking Changes

Refactor polymorphic allocator to use device_async_resource_ref (#1555) @harrism
Remove deprecated functionality (#1537) @harrism
Remove deprecated cuda_async_memory_resource constructor that takes thrust::optional parameters (#1535) @harrism
Remove deprecated supports_streams and get_mem_info methods. (#1519) @harrism

🐛 Bug Fixes

rmm needs to link to nvtx3::nvtx3-cpp to support installed nvtx3 (#1569) @robertmaynard
Make sure rmm wheel dependency on librmm is updated [skip ci] (#1565) @raydouglass
Don't ignore GCC-specific warning under Clang (#1557) @aaronmondal
Add publish jobs for C++ wheels (#1554) @vyasr
Explicitly use the current device resource in DeviceBuffer (#1514) @wence-

📖 Documentation

Allow specifying mr in DeviceBuffer construction, and document ownership requirements in Python/C++ interfacing (#1552) @wence-
Fix Python install instruction (#1547) @wence-
Update multi-gpu discussion for device_buffer and device_vector dtors (#1524) @wence-
Fix ordering / heading levels in README.md and python example in guide.md (#1513) @harrism

🚀 New Features

Add NVTX support and RMM_FUNC_RANGE() macro (#1558) @harrism
Always use a static gtest (#1532) @robertmaynard
Build C++ wheel (#1529) @vyasr
Remove deprecated supports_streams and get_mem_info methods. (#1519) @harrism

🛠️ Improvements

update copyright dates (#1564) @jameslamb
Overhaul ops-codeowners (#1561) @raydouglass
Adding support for cupy.cuda.stream.ExternalStream (#1559) @lilohuang
Refactor polymorphic allocator to use device_async_resource_ref (#1555) @harrism
add RAPIDS copyright pre-commit hook (#1553) @jameslamb
Enable warnings as errors for Python tests (#1551) @mroeschke
Remove header existence tests. (#1550) @bdice
Only use functions in the limited API (#1545) @vyasr
Migrate to {{ stdlib("c") }} (#1543) @hcho3
Fix cuda11.8 nvcc dependency (#1542) @trxcllnt
add --rm and --name to devcontainer run args (#1539) @trxcllnt
Remove deprecated functionality (#1537) @harrism
Remove deprecated cuda_async_memory_resource constructor that takes thrust::optional parameters (#1535) @harrism
Make thrust_allocator deallocate safe in multi-device setting (#1533) @wence-
Move rmm Python package to subdirectory (#1526) @vyasr
Remove a file not being used (#1521) @galipremsagar
Remove unneeded update-version.sh update (#1520) @AyodeAwe
Enable all tests for arm arch (#1510) @galipremsagar

Contributors

trxcllnt, robertmaynard, and 12 other contributors

Assets 2

12 Feb 20:53

raydouglass

v24.02.00

09b406c

v24.02.00

🚨 Breaking Changes

Make device_memory_resource::do_get_mem_info() and supports_get_mem_info() not pure virtual. Remove derived implementations and calls in RMM (#1430) @harrism
Deprecate detail::available_device_memory, most detail/aligned.hpp utilities, and optional pool_memory_resource initial size (#1424) @harrism
Require explicit pool size in pool_memory_resource and move some things out of detail namespace (#1417) @harrism
Remove HTML builds of librmm (#1415) @vyasr
Update to CCCL 2.2.0. (#1404) @bdice
Switch to scikit-build-core (#1287) @vyasr

🐛 Bug Fixes

Exclude tests from builds (#1459) @vyasr
Update CODEOWNERS (#1410) @raydouglass
Correct signatures for torch allocator plug in (#1407) @wence-
Fix Arena MR to support simultaneous access by PTDS and other streams (#1395) @tgravescs
Fix else-after-throw clang tidy error (#1391) @harrism

📖 Documentation

remove references to setup.py in docs (#1420) @jameslamb
Remove HTML builds of librmm (#1415) @vyasr
Update GPU support docs to drop Pascal (#1413) @harrism

🚀 New Features

Make device_memory_resource::do_get_mem_info() and supports_get_mem_info() not pure virtual. Remove derived implementations and calls in RMM (#1430) @harrism
Deprecate detail::available_device_memory, most detail/aligned.hpp utilities, and optional pool_memory_resource initial size (#1424) @harrism
Add a host-pinned memory resource that can be used as upstream for pool_memory_resource. (#1392) @harrism

🛠️ Improvements

Remove usages of rapids-env-update (#1423) @KyleFromNVIDIA
Refactor CUDA versions in dependencies.yaml. (#1422) @bdice
Require explicit pool size in pool_memory_resource and move some things out of detail namespace (#1417) @harrism
Update dependencies.yaml to support CUDA 12.*. (#1414) @bdice
Define python dependency range as a matrix fallback. (#1409) @bdice
Use latest cuda-python within CUDA major version. (#1406) @bdice
Update to CCCL 2.2.0. (#1404) @bdice
Remove RMM_BUILD_WHEELS and standardize Python builds (#1401) @vyasr
Update to fmt 10.1.1 and spdlog 1.12.0. (#1374) @bdice
Switch to scikit-build-core (#1287) @vyasr

Contributors

harrism, wence-, and 6 other contributors

Assets 2

06 Dec 15:04

raydouglass

v23.12.00

2db5cbb

v23.12.00

🚨 Breaking Changes

Document minimum CUDA version of 11.4 (#1385) @harrism
Store and set the correct CUDA device in device_buffer (#1370) @harrism
Use cuda::mr::memory_resource instead of raw device_memory_resource (#1095) @miscco

🐛 Bug Fixes

Update actions/labeler to v4 (#1397) @raydouglass
Backport arena MR fix for simultaneous access by PTDS and other streams (#1396) @bdice
Deliberately leak PTDS thread_local events in stream ordered mr (#1375) @wence-
Add missing CUDA 12 dependencies and fix dlopen library names (#1366) @vyasr

📖 Documentation

Document minimum CUDA version of 11.4 (#1385) @harrism
Fix more doxygen issues (#1367) @vyasr
Add groups to the doxygen docs (#1358) @vyasr
Enable doxygen XML and fix issues (#1348) @vyasr

🚀 New Features

Make internally stored default argument values public (#1373) @vyasr
Store and set the correct CUDA device in device_buffer (#1370) @harrism
Update rapids-cmake functions to non-deprecated signatures (#1357) @robertmaynard
Generate unified Python/C++ docs (#1324) @vyasr
Use cuda::mr::memory_resource instead of raw device_memory_resource (#1095) @miscco

🛠️ Improvements

Silence false gcc warning (#1381) @miscco
Build concurrency for nightly and merge triggers (#1380) @bdice
Update shared-action-workflows references (#1363) @AyodeAwe
Use branch-23.12 workflows. (#1360) @bdice
Update devcontainers to 23.12 (#1355) @raydouglass
Generate proper, consistent nightly versions for pip and conda packages (#1347) @vyasr
RMM: Build CUDA 12.0 ARM conda packages. (#1330) @bdice

Contributors

robertmaynard, harrism, and 6 other contributors

Assets 2

11 Oct 14:33

raydouglass

v23.10.00

f8ac6f8

v23.10.00

🚨 Breaking Changes

Update to Cython 3.0.0 (#1313) @vyasr

🐛 Bug Fixes

Compile cdef public functions from torch_allocator with C ABI (#1350) @wence-
Make doxygen only a conda dependency. (#1344) @bdice
Use conda mambabuild not mamba mambabuild (#1338) @wence-
Fix stream_ordered_memory_resource attempt to record event in stream from another device (#1333) @harrism

📖 Documentation

Clean up headers in CMakeLists.txt. (#1341) @bdice
Add pre-commit hook to validate doxygen (#1334) @vyasr
Fix doxygen warnings (#1317) @vyasr
Treat warnings as errors in Python documentation (#1316) @vyasr

🚀 New Features

Enable RMM Debug Logging via Python (#1339) @harrism

🛠️ Improvements

Update image names (#1346) @AyodeAwe
Update to clang 16.0.6. (#1343) @bdice
Update doxygen to 1.9.1 (#1337) @vyasr
Simplify wheel build scripts and allow alphas of RAPIDS dependencies (#1335) @divyegala
Use copy-pr-bot (#1329) @ajschmidt8
Add RMM devcontainers (#1328) @trxcllnt
Add Python bindings for limiting_resource_adaptor (#1327) @pentschev
Fix missing jQuery error in docs (#1321) @AyodeAwe
Use fetch_rapids.cmake. (#1319) @bdice
Update to Cython 3.0.0 (#1313) @vyasr
Branch 23.10 merge 23.08 (#1312) @vyasr
Branch 23.10 merge 23.08 (#1309) @vyasr

Contributors

trxcllnt, harrism, and 7 other contributors

Assets 2

09 Aug 16:38

raydouglass

v23.08.00

f3af0e8

v23.08.00

🚨 Breaking Changes

Stop invoking setup.py (#1300) @vyasr
Remove now-deprecated top-level allocator functions (#1281) @wence-
Remove padding from device_memory_resource (#1278) @vyasr

🐛 Bug Fixes

Fix typo in wheels-test.yaml. (#1310) @bdice
Add a missing '#include <array>' in logger.hpp (#1295) @valgur
Use gbench thread_index() accessor to fix replay bench compilation (#1293) @harrism
Ensure logger tests don't generate temp directories in build dir (#1289) @robertmaynard

🚀 New Features

Remove now-deprecated top-level allocator functions (#1281) @wence-

🛠️ Improvements

Switch to new CI wheel building pipeline (#1305) @vyasr
Revert CUDA 12.0 CI workflows to branch-23.08. (#1303) @bdice
Update linters: remove flake8, add ruff, update cython-lint (#1302) @vyasr
Adding identify minimum version requirement (#1301) @hyperbolic2346
Stop invoking setup.py (#1300) @vyasr
Use cuda-version to constrain cudatoolkit. (#1296) @bdice
Update to CMake 3.26.4 (#1291) @vyasr
use rapids-upload-docs script (#1288) @AyodeAwe
Reorder parameters in RMM_EXPECTS (#1286) @vyasr
Remove documentation build scripts for Jenkins (#1285) @ajschmidt8
Remove padding from device_memory_resource (#1278) @vyasr
Unpin scikit-build upper bound (#1275) @vyasr
RMM: Build CUDA 12 packages (#1223) @bdice

Contributors

robertmaynard, harrism, and 7 other contributors

Assets 2

07 Jun 14:51

raydouglass

v23.06.00

acaf3f5

v23.06.00

🚨 Breaking Changes

Update minimum Python version to Python 3.9 (#1252) @shwina

🐛 Bug Fixes

Ensure Logger tests aren't run in parallel (#1277) @robertmaynard
Pin to scikit-build<0.17.2. (#1262) @bdice

🛠️ Improvements

Require Numba 0.57.0+ & NumPy 1.21.0+ (#1279) @jakirkham
Align test_cpp.sh with conventions in other RAPIDS repos. (#1269) @bdice
Switch back to using primary shared-action-workflows branch (#1268) @vyasr
Update recipes to GTest version >=1.13.0 (#1263) @bdice
Support CUDA 12.0 for pip wheels (#1259) @bdice
Add build vars (#1258) @AyodeAwe
Enable sccache hits from local builds (#1257) @AyodeAwe
Revert to branch-23.06 for shared-action-workflows (#1256) @shwina
run docs builds nightly too (#1255) @AyodeAwe
Build wheels using new single image workflow (#1254) @vyasr
Update minimum Python version to Python 3.9 (#1252) @shwina
Remove usage of rapids-get-rapids-version-from-git (#1251) @jjacobelli
Remove wheel pytest verbosity (#1249) @sevagh
Update clang-format to 16.0.1. (#1246) @bdice
Remove uses-setup-env-vars (#1242) @vyasr
Move RMM_LOGGING_ASSERT into separate header (#1241) @ahendriksen
Use ARC V2 self-hosted runners for GPU jobs (#1239) @jjacobelli

Contributors

robertmaynard, vyasr, and 7 other contributors

Assets 2

12 Apr 13:25

raydouglass

v23.04.00

deb4729

v23.04.00

🐛 Bug Fixes

Remove MANIFEST.in use auto-generated one for sdists and package_data for wheels (#1233) @vyasr
Fix update-version.sh. (#1227) @vyasr
Specify include_package_data to setup (#1218) @vyasr
Revert changes overriding rapids-cmake repo. (#1209) @bdice
Synchronize stream in DeviceBuffer.c_from_unique_ptr constructor (#1100) @shwina

🚀 New Features

Use rapids-cmake parallel testing feature (#1183) @robertmaynard

🛠️ Improvements

Stop setting package version attribute in wheels (#1236) @vyasr
Add codespell as a linter (#1231) @bdice
Pass AWS_SESSION_TOKEN and SCCACHE_S3_USE_SSL vars to conda build (#1230) @ajschmidt8
Update to GCC 11 (#1228) @bdice
Fix some minor oversights in the conversion to pyproject.toml (#1226) @vyasr
Remove pickle compatibility layer in tests for Python < 3.8. (#1224) @bdice
Move external allocators into rmm.allocators module to defer imports (#1221) @wence-
Generate pyproject.toml dependencies using dfg (#1219) @vyasr
Run rapids-dependency-file-generator via pre-commit (#1217) @vyasr
Skip docs job in nightly runs (#1215) @AyodeAwe
CI: Remove specification of manual stage for check_style.sh script. (#1214) @csadorf
Use script rather than environment variable to modify package names (#1212) @vyasr
Reduce error handling verbosity in CI tests scripts (#1204) @AjayThorve
Update shared workflow branches (#1203) @ajschmidt8
Use date in build string instead of in the version. (#1195) @bdice
Stop using versioneer to manage versions (#1190) @vyasr
Update to spdlog>=1.11.0, fmt>=9.1.0. (#1177) @bdice
Migrate as much as possible to pyproject.toml (#1151) @jakirkham

Contributors

robertmaynard, wence-, and 8 other contributors

Assets 2

09 Feb 15:39

raydouglass

v23.02.00

48e8f2a

v23.02.00

🐛 Bug Fixes

pre-commit: Update isort version to 5.12.0 (#1197) @wence-
Revert "Upgrade to spdlog 1.10 (#1173)" (#1176) @bdice
Ensure UpstreamResourceAdaptor is not cleared by the Python GC (#1170) @shwina

📖 Documentation

Fix documentation author (#1188) @bdice

🚀 New Features

Add RMM PyTorch allocator (#1168) @shwina

🛠️ Improvements

Update shared workflow branches (#1201) @ajschmidt8
Fix update-version.sh (#1199) @raydouglass
Use CTK 118/cp310 branch of wheel workflows (#1193) @sevagh
Update build.yaml workflow to reduce verbosity (#1192) @AyodeAwe
Fix build.yaml workflow (#1191) @ajschmidt8
add docs_build step (#1189) @AyodeAwe
Upkeep/wheel param cleanup (#1187) @sevagh
Update workflows for nightly tests (#1186) @ajschmidt8
Build CUDA 11.8 and Python 3.10 Packages (#1184) @ajschmidt8
Build wheels alongside conda CI (#1182) @sevagh
Update conda recipes. (#1180) @bdice
Update PR Workflow (#1174) @ajschmidt8
Upgrade to spdlog 1.10 (#1173) @kkraus14
Enable codecov (#1171) @ajschmidt8
Add support for Python 3.10. (#1166) @bdice
Update pre-commit hooks (#1154) @bdice

Contributors

wence-, raydouglass, and 6 other contributors

Assets 2

Releases: rapidsai/rmm

[NIGHTLY] v24.08.00

🔗 Links

🛠️ Improvements

Contributors

v24.04.00

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

Contributors

[NIGHTLY] v24.06.00

🔗 Links

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

Contributors

v24.02.00

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

Contributors

v23.12.00

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

Contributors

v23.10.00

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

Contributors

v23.08.00

🚨 Breaking Changes

🐛 Bug Fixes

🚀 New Features

🛠️ Improvements

Contributors

v23.06.00

🚨 Breaking Changes

🐛 Bug Fixes

🛠️ Improvements

Contributors

v23.04.00

🐛 Bug Fixes

🚀 New Features

🛠️ Improvements

Contributors

v23.02.00

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

Contributors