You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I created the following Dockerfile to test out whether the latest Horovod version (0.28.1) is compatible with the latest PyTorch version (2.1.0):
# Use the official Python image as the base image
FROM python:3.10-slim-buster
# Update the system and install necessary libraries
RUN apt-get update && apt-get install -y \
build-essential \
cmake \
git \
curl \
ca-certificates \
libjpeg-dev \
libpng-dev && \
rm -rf /var/lib/apt/lists/*
# Install PyTorch
RUN pip install torch==2.1.0
# Install Horovod
RUN HOROVOD_WITH_PYTORCH=1 pip install horovod==0.28.1
And it returned various errors that I am unable to decipher. Here is the full build trace: error.txt
The text was updated successfully, but these errors were encountered:
Hi there. I also encountered similiar issue. It seems there are some compatibility issues between torch and horovod. I have tried to use latest version torch (2.3.0 or 2.2.x or 2.1.x) but I got same error info like:
subprocess.CalledProcessError: Command '['cmake', '--build', '.', '--config', 'RelWithDebInfo', '--', '-j8', 'VERBOSE=1']' returned non-zero exit status 2.
A exeception is torch==2.0.1, I can install horovod with torch 2.0.1. Any suggestions ?
Environment:
Checklist:
Bug report:
I created the following Dockerfile to test out whether the latest Horovod version (0.28.1) is compatible with the latest PyTorch version (2.1.0):
And it returned various errors that I am unable to decipher. Here is the full build trace: error.txt
The text was updated successfully, but these errors were encountered: