Add manual cuda deps search logic (#90411) #90426

atalman · 2022-12-07T23:53:34Z

If PyTorch is packaged into a wheel that depends on nvidia-cublas-cu11, which is designated as PureLib while the torch wheel is not, the two can be installed into different site-packages directories, which can cause a torch_globals loading problem.

Fix that by searching for nvidia/cublas/lib/libcublas.so.11 and nvidia/cudnn/lib/libcudnn.so.8 across all sys.path folders.
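
For illustration only, a minimal sketch of the search described above. It is not the code from this PR: the helper name `find_cuda_deps` and its `search_paths` parameter are hypothetical, and only the two relative library paths come from the description.

```python
import os
import sys
from typing import Iterable, Optional, Tuple

# Relative library locations named in the description above.
CUBLAS_REL = os.path.join("nvidia", "cublas", "lib", "libcublas.so.11")
CUDNN_REL = os.path.join("nvidia", "cudnn", "lib", "libcudnn.so.8")


def find_cuda_deps(
    search_paths: Optional[Iterable[str]] = None,
) -> Optional[Tuple[str, str]]:
    """Hypothetical helper (not part of torch).

    Returns (cublas_path, cudnn_path) taken from the first search entry
    that contains both libraries, or None if no entry contains both.
    """
    if search_paths is None:
        search_paths = sys.path
    for path in search_paths:
        cublas_path = os.path.join(path, CUBLAS_REL)
        cudnn_path = os.path.join(path, CUDNN_REL)
        # Accept an entry only if both libraries are present in it.
        if os.path.exists(cublas_path) and os.path.exists(cudnn_path):
            return cublas_path, cudnn_path
    return None
```

A caller could pass each returned path to `ctypes.CDLL` so the libraries are already loaded before anything that links against them; whether and when to do that is an assumption of this sketch.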

Test plan:

docker pull amazonlinux:2
docker run --rm -t amazonlinux:2 bash -c 'yum install -y python3 python3-devel python3-distutils patch;python3 -m pip install torch==1.13.0;curl -OL https://patch-diff.githubusercontent.com/raw/pytorch/pytorch/pull/90411.diff; pushd /usr/local/lib64/python3.7/site-packages; patch -p1 </90411.diff; popd; python3 -c "import torch;print(torch.__version__, torch.cuda.is_available())"'

Fixes #88869

Pull Request resolved: #90411
Approved by: https://github.com/atalman

pytorch-bot · 2022-12-07T23:53:37Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/90426

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Failures, 1 Pending

As of commit cff9766:

The following jobs have failed:

linux-bionic-py3_7-clang8-xla / test (xla, 1, 1, linux.2xlarge)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

weiwangmeta

LGTM

albanD · 2022-12-08T17:32:34Z

torch/__init__.py

+        nvidia_path = os.path.join(path, 'nvidia')
+        if not os.path.exists(nvidia_path):
+            continue
+        cublas_path = os.path.join(nvidia_path, 'cublas', 'lib', 'libcublas.so.11')


These are very specific lib versions. Isn't that a problem?

atalman mentioned this pull request Dec 7, 2022

[v.1.13.1] Release Tracker #89855

Closed

julian-risch mentioned this pull request Dec 8, 2022

build: pin torch 1.13 for testing deepset-ai/haystack#3681

Closed

6 tasks

weiwangmeta approved these changes Dec 8, 2022

atalman merged commit 56de8a3 into pytorch:release/1.13 Dec 8, 2022

albanD reviewed Dec 8, 2022
