Releases: rapidsai/rmm

[NIGHTLY] v24.08.00

24 May 21:55
bd5135e
Pre-release

🔗 Links

🛠️ Improvements

v24.04.00

10 Apr 14:23
v24.04.00
8f19c9c

🚨 Breaking Changes

  • Accept stream argument in DeviceMemoryResource allocate/deallocate (#1494) @wence-
  • Replace all internal usage of get_upstream with get_upstream_resource (#1491) @miscco
  • Deprecate rmm::mr::device_memory_resource::supports_streams() (#1452) @harrism
  • Remove deprecated rmm::detail::available_device_memory (#1438) @harrism
  • Make device_memory_resource::supports_streams() not pure virtual. Remove derived implementations and calls in RMM (#1437) @harrism
  • Deprecate rmm::mr::device_memory_resource::get_mem_info() and supports_get_mem_info(). (#1436) @harrism

🐛 Bug Fixes

  • Fix search path for torch allocator in editable installs and ensure CUDA support is available (#1498) @vyasr
  • Accept stream argument in DeviceMemoryResource allocate/deallocate (#1494) @wence-
  • Run STATISTICS_TEST and TRACKING_TEST in serial to avoid OOM errors. (#1487) @bdice

📖 Documentation

  • Pin to recent breathe, to prevent getting an unsupported sphinx version. (#1495) @bdice

🚀 New Features

  • Replace all internal usage of get_upstream with get_upstream_resource (#1491) @miscco
  • Add complete set of resource ref aliases (#1479) @nvdbaranec
  • Automate include grouping using clang-format (#1463) @harrism
  • Add get_upstream_resource to resource adaptors (#1456) @miscco
  • Deprecate rmm::mr::device_memory_resource::supports_streams() (#1452) @harrism
  • Remove duplicated memory_resource_tests (#1451) @miscco
  • Change rmm::exec_policy to take async_resource_ref (#1449) @miscco
  • Change device_scalar to take async_resource_ref (#1447) @miscco
  • Add device_async_resource_ref convenience alias (#1441) @harrism
  • Remove deprecated rmm::detail::available_device_memory (#1438) @harrism
  • Make device_memory_resource::supports_streams() not pure virtual. Remove derived implementations and calls in RMM (#1437) @harrism
  • Deprecate rmm::mr::device_memory_resource::get_mem_info() and supports_get_mem_info(). (#1436) @harrism
  • Support CUDA 12.2 (#1419) @jameslamb

🛠️ Improvements

[NIGHTLY] v24.06.00

27 Mar 18:40
v24.06.00a
fb23a13
Pre-release

🔗 Links

🚨 Breaking Changes

  • Refactor polymorphic allocator to use device_async_resource_ref (#1555) @harrism
  • Remove deprecated functionality (#1537) @harrism
  • Remove deprecated cuda_async_memory_resource constructor that takes thrust::optional parameters (#1535) @harrism
  • Remove deprecated supports_streams and get_mem_info methods. (#1519) @harrism

🐛 Bug Fixes

📖 Documentation

  • Allow specifying mr in DeviceBuffer construction, and document ownership requirements in Python/C++ interfacing (#1552) @wence-
  • Fix Python install instruction (#1547) @wence-
  • Update multi-gpu discussion for device_buffer and device_vector dtors (#1524) @wence-
  • Fix ordering / heading levels in README.md and python example in guide.md (#1513) @harrism

🚀 New Features

🛠️ Improvements

v24.02.00

12 Feb 20:53
v24.02.00
09b406c

🚨 Breaking Changes

  • Make device_memory_resource::do_get_mem_info() and supports_get_mem_info() not pure virtual. Remove derived implementations and calls in RMM (#1430) @harrism
  • Deprecate detail::available_device_memory, most detail/aligned.hpp utilities, and optional pool_memory_resource initial size (#1424) @harrism
  • Require explicit pool size in pool_memory_resource and move some things out of detail namespace (#1417) @harrism
  • Remove HTML builds of librmm (#1415) @vyasr
  • Update to CCCL 2.2.0. (#1404) @bdice
  • Switch to scikit-build-core (#1287) @vyasr

🐛 Bug Fixes

📖 Documentation

🚀 New Features

  • Make device_memory_resource::do_get_mem_info() and supports_get_mem_info() not pure virtual. Remove derived implementations and calls in RMM (#1430) @harrism
  • Deprecate detail::available_device_memory, most detail/aligned.hpp utilities, and optional pool_memory_resource initial size (#1424) @harrism
  • Add a host-pinned memory resource that can be used as upstream for pool_memory_resource. (#1392) @harrism

🛠️ Improvements

  • Remove usages of rapids-env-update (#1423) @KyleFromNVIDIA
  • Refactor CUDA versions in dependencies.yaml. (#1422) @bdice
  • Require explicit pool size in pool_memory_resource and move some things out of detail namespace (#1417) @harrism
  • Update dependencies.yaml to support CUDA 12.*. (#1414) @bdice
  • Define python dependency range as a matrix fallback. (#1409) @bdice
  • Use latest cuda-python within CUDA major version. (#1406) @bdice
  • Update to CCCL 2.2.0. (#1404) @bdice
  • Remove RMM_BUILD_WHEELS and standardize Python builds (#1401) @vyasr
  • Update to fmt 10.1.1 and spdlog 1.12.0. (#1374) @bdice
  • Switch to scikit-build-core (#1287) @vyasr

v23.12.00

06 Dec 15:04
v23.12.00
2db5cbb

🚨 Breaking Changes

  • Document minimum CUDA version of 11.4 (#1385) @harrism
  • Store and set the correct CUDA device in device_buffer (#1370) @harrism
  • Use cuda::mr::memory_resource instead of raw device_memory_resource (#1095) @miscco

🐛 Bug Fixes

  • Update actions/labeler to v4 (#1397) @raydouglass
  • Backport arena MR fix for simultaneous access by PTDS and other streams (#1396) @bdice
  • Deliberately leak PTDS thread_local events in stream ordered mr (#1375) @wence-
  • Add missing CUDA 12 dependencies and fix dlopen library names (#1366) @vyasr

📖 Documentation

🚀 New Features

  • Make internally stored default argument values public (#1373) @vyasr
  • Store and set the correct CUDA device in device_buffer (#1370) @harrism
  • Update rapids-cmake functions to non-deprecated signatures (#1357) @robertmaynard
  • Generate unified Python/C++ docs (#1324) @vyasr
  • Use cuda::mr::memory_resource instead of raw device_memory_resource (#1095) @miscco

🛠️ Improvements

v23.10.00

11 Oct 14:33
v23.10.00
f8ac6f8

🚨 Breaking Changes

🐛 Bug Fixes

  • Compile cdef public functions from torch_allocator with C ABI (#1350) @wence-
  • Make doxygen only a conda dependency. (#1344) @bdice
  • Use conda mambabuild not mamba mambabuild (#1338) @wence-
  • Fix stream_ordered_memory_resource attempt to record event in stream from another device (#1333) @harrism

📖 Documentation

🚀 New Features

🛠️ Improvements

v23.08.00

09 Aug 16:38
v23.08.00
f3af0e8

🚨 Breaking Changes

🐛 Bug Fixes

  • Fix typo in wheels-test.yaml. (#1310) @bdice
  • Add a missing '#include <array>' in logger.hpp (#1295) @valgur
  • Use gbench thread_index() accessor to fix replay bench compilation (#1293) @harrism
  • Ensure logger tests don't generate temp directories in build dir (#1289) @robertmaynard

🚀 New Features

  • Remove now-deprecated top-level allocator functions (#1281) @wence-

🛠️ Improvements

v23.06.00

07 Jun 14:51
acaf3f5

🚨 Breaking Changes

  • Update minimum Python version to Python 3.9 (#1252) @shwina

🐛 Bug Fixes

🛠️ Improvements

v23.04.00

12 Apr 13:25
deb4729

🐛 Bug Fixes

  • Remove MANIFEST.in use auto-generated one for sdists and package_data for wheels (#1233) @vyasr
  • Fix update-version.sh. (#1227) @vyasr
  • Specify include_package_data to setup (#1218) @vyasr
  • Revert changes overriding rapids-cmake repo. (#1209) @bdice
  • Synchronize stream in DeviceBuffer.c_from_unique_ptr constructor (#1100) @shwina

🚀 New Features

🛠️ Improvements

  • Stop setting package version attribute in wheels (#1236) @vyasr
  • Add codespell as a linter (#1231) @bdice
  • Pass AWS_SESSION_TOKEN and SCCACHE_S3_USE_SSL vars to conda build (#1230) @ajschmidt8
  • Update to GCC 11 (#1228) @bdice
  • Fix some minor oversights in the conversion to pyproject.toml (#1226) @vyasr
  • Remove pickle compatibility layer in tests for Python < 3.8. (#1224) @bdice
  • Move external allocators into rmm.allocators module to defer imports (#1221) @wence-
  • Generate pyproject.toml dependencies using dfg (#1219) @vyasr
  • Run rapids-dependency-file-generator via pre-commit (#1217) @vyasr
  • Skip docs job in nightly runs (#1215) @AyodeAwe
  • CI: Remove specification of manual stage for check_style.sh script. (#1214) @csadorf
  • Use script rather than environment variable to modify package names (#1212) @vyasr
  • Reduce error handling verbosity in CI tests scripts (#1204) @AjayThorve
  • Update shared workflow branches (#1203) @ajschmidt8
  • Use date in build string instead of in the version. (#1195) @bdice
  • Stop using versioneer to manage versions (#1190) @vyasr
  • Update to spdlog>=1.11.0, fmt>=9.1.0. (#1177) @bdice
  • Migrate as much as possible to pyproject.toml (#1151) @jakirkham

v23.02.00

09 Feb 15:39
48e8f2a

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements