
pip install -e . does not work #1065

Closed

wangkuiyi opened this issue Feb 7, 2024 · 15 comments

Labels: bug (Something isn't working)

@wangkuiyi (Contributor)

System Info

x86, H100, Ubuntu

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

On a system with an H100 GPU running CUDA 12.3, I installed the dependencies by running the scripts referenced by Dockerfile.multi:

```dockerfile
# https://www.gnu.org/software/bash/manual/html_node/Bash-Startup-Files.html
# The default values come from `nvcr.io/nvidia/pytorch`
ENV BASH_ENV=${BASH_ENV:-/etc/bash.bashrc}
ENV ENV=${ENV:-/etc/shinit_v2}
SHELL ["/bin/bash", "-c"]

FROM base as devel

COPY docker/common/install_base.sh install_base.sh
RUN bash ./install_base.sh && rm install_base.sh

COPY docker/common/install_cmake.sh install_cmake.sh
RUN bash ./install_cmake.sh && rm install_cmake.sh

COPY docker/common/install_ccache.sh install_ccache.sh
RUN bash ./install_ccache.sh && rm install_ccache.sh

# Download & install internal TRT release
ARG TRT_VER
ARG CUDA_VER
ARG CUDNN_VER
ARG NCCL_VER
ARG CUBLAS_VER
COPY docker/common/install_tensorrt.sh install_tensorrt.sh
RUN bash ./install_tensorrt.sh \
    --TRT_VER=${TRT_VER} \
    --CUDA_VER=${CUDA_VER} \
    --CUDNN_VER=${CUDNN_VER} \
    --NCCL_VER=${NCCL_VER} \
    --CUBLAS_VER=${CUBLAS_VER} && \
    rm install_tensorrt.sh

# Install latest Polygraphy
COPY docker/common/install_polygraphy.sh install_polygraphy.sh
RUN bash ./install_polygraphy.sh && rm install_polygraphy.sh

# Install mpi4py
COPY docker/common/install_mpi4py.sh install_mpi4py.sh
RUN bash ./install_mpi4py.sh && rm install_mpi4py.sh

# Install PyTorch
ARG TORCH_INSTALL_TYPE="skip"
COPY docker/common/install_pytorch.sh install_pytorch.sh
RUN bash ./install_pytorch.sh $TORCH_INSTALL_TYPE && rm install_pytorch.sh
```
I ran these scripts with ENV set to ~/.bashrc.

This allowed me to run the following command to build TensorRT-LLM from source code:

```shell
pip install -e . --extra-index-url https://pypi.nvidia.com
```

The build finishes very quickly, which does not look right: build_wheel.py usually takes about 40 minutes to build everything.

After the build, pip list shows that tensorrt-llm is installed:

```shell
$ pip list | grep tensorrt
tensorrt                 9.2.0.post12.dev5
tensorrt-llm             0.9.0.dev2024020600 /root/TensorRT-LLM
```

However, importing it fails:

```shell
$ python3 -c 'import tensorrt_llm'
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/root/TensorRT-LLM/tensorrt_llm/__init__.py", line 44, in <module>
    from .hlapi.llm import LLM, ModelConfig
  File "/root/TensorRT-LLM/tensorrt_llm/hlapi/__init__.py", line 1, in <module>
    from .llm import LLM, ModelConfig
  File "/root/TensorRT-LLM/tensorrt_llm/hlapi/llm.py", line 17, in <module>
    from ..executor import (GenerationExecutor, GenerationResult,
  File "/root/TensorRT-LLM/tensorrt_llm/executor.py", line 11, in <module>
    import tensorrt_llm.bindings as tllm
ModuleNotFoundError: No module named 'tensorrt_llm.bindings'
```
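One quick way to confirm this diagnosis is to check whether the compiled extension ever landed in the source tree. This is a hypothetical helper, not part of the repository; the path comes from the traceback above:

```python
import glob
import os


def bindings_built(repo_root: str) -> bool:
    """Return True if a compiled tensorrt_llm.bindings extension
    (bindings*.so) exists under the editable source tree."""
    pattern = os.path.join(repo_root, "tensorrt_llm", "bindings*.so")
    return len(glob.glob(pattern)) > 0


if __name__ == "__main__":
    # Path taken from the traceback above.
    if not bindings_built("/root/TensorRT-LLM"):
        print("bindings extension missing: the C++ build never ran")
```

If the shared object is absent, `pip install -e .` only registered the pure-Python sources and never invoked the C++ build.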

Expected behavior

My project requires me to build the main branch of TensorRT-LLM. It would be great if pip install could work, so I could declare TensorRT-LLM as a dependency in my project's pyproject.toml file.

Actual behavior

I had to build TensorRT-LLM by invoking build_wheel.py as in https://github.com/NVIDIA/TensorRT-LLM/blob/main/docs/source/build_from_source.md#build-tensorrt-llm

Additional notes

I was able to build vLLM, including its CUDA kernels, using pip install -e .. Perhaps their build setup could serve as a reference.

@wangkuiyi wangkuiyi added the bug Something isn't working label Feb 7, 2024
@TobyGE

TobyGE commented Feb 8, 2024

To install the main branch, you can use the following command:

```shell
pip3 install tensorrt_llm -U --pre --extra-index-url https://pypi.nvidia.com
```

Also check the README for more details.

@wangkuiyi
Contributor Author

That doesn't work. As described above, my project requires TensorRT-LLM built from the main branch's source, not a pre-built wheel.

@TobyGE

TobyGE commented Feb 8, 2024 via email

@wangkuiyi
Contributor Author

wangkuiyi commented Feb 8, 2024 via email

@wangkuiyi
Contributor Author

It looks like pip install -e . does not automatically trigger the building of the Python bindings for the C++ runtime.
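For reference, the usual way a setup.py can make `pip install -e .` trigger a native build is a custom build_ext command that delegates to the project's real build driver. The following is only a sketch of that pattern under the assumption that build_wheel.py is the driver; it is not TensorRT-LLM's actual setup:

```python
import subprocess
import sys

from setuptools import Extension
from setuptools.command.build_ext import build_ext


class BuildBindings(build_ext):
    """Delegate the C++/bindings build to the project's own build script,
    so that `pip install -e .` also produces tensorrt_llm.bindings."""

    def run(self):
        # Hypothetical: invoke the existing build driver instead of
        # letting setuptools compile the placeholder extension itself.
        subprocess.check_call(
            [sys.executable, "scripts/build_wheel.py",
             "--trt_root", "/usr/local/tensorrt"]
        )


# In setup.py, a placeholder extension forces setuptools to run build_ext:
#
# setup(
#     name="tensorrt_llm",
#     ext_modules=[Extension("tensorrt_llm.bindings", sources=[])],
#     cmdclass={"build_ext": BuildBindings},
# )
```

This is roughly the shape vLLM uses (a CMake-invoking build_ext), which is why its editable install builds the CUDA kernels too.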

@jdemouth-nvidia
Collaborator

@Shixiaowei02 , can you help with that issue, please?

@jdemouth-nvidia
Collaborator

@wangkuiyi , for your information, @Shixiaowei02 is based in China, so he won't be able to work on this issue until the Chinese New Year break ends.

@wangkuiyi
Contributor Author

Thank you @jdemouth-nvidia and @Shixiaowei02 ! No rush, please. After the Lunar New Year is totally fine.

@Shixiaowei02
Collaborator

I am working on fixing this issue now. Thanks for your support!

@ekagra-ranjan

Thanks! I am also facing this issue.

@andyluo7

I am facing the same issue.

@Shixiaowei02
Collaborator

Can you use these two commands to temporarily bypass this issue? We will fix it in the near future and synchronize the fix to the main branch. Thank you! @wangkuiyi

```shell
python3 scripts/build_wheel.py --trt_root /usr/local/tensorrt
pip3 install -e .
```

@lifelongeeek

lifelongeeek commented Mar 2, 2024

After building the wheel and doing the editable install, I still get the same error:

```shell
    import tensorrt_llm.bindings as tllm
ModuleNotFoundError: No module named 'tensorrt_llm.bindings'
```

@Shixiaowei02
Collaborator

Shixiaowei02 commented Apr 11, 2024

Currently, the calling relationship between build_wheel.py and setup.py is inverted, which results in an incomplete installation when users run pip install -e .. Meanwhile, setup.py has been deprecated, so as a stopgap we give a friendlier error here. We will come back and refactor when we have the bandwidth. Thank you!
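The "friendlier error" stopgap described here can be sketched as a wrapper that catches the failed bindings import and re-raises with actionable guidance. This is an illustrative sketch of the pattern, not the repository's exact code:

```python
def import_bindings():
    """Sketch of the stopgap: wrap the bindings import and re-raise
    with guidance aimed at editable (pip install -e .) installs."""
    try:
        import tensorrt_llm.bindings  # noqa: F401
    except ImportError as err:
        raise ImportError(
            "Import of the bindings module failed. Please check the "
            "package integrity. If you are using an editable installation "
            "(pip install -e .), run scripts/build_wheel.py first, then "
            "run `pip install -e .` again."
        ) from err
```

On a machine where the bindings were never built, calling `import_bindings()` raises the guided ImportError instead of the bare ModuleNotFoundError seen earlier in this thread.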

@felixslu

felixslu commented Apr 16, 2024

> a friendlier error

@Shixiaowei02 I hit this error with the recently released tensorrt-llm v0.9.0. Please give me some advice on how to fix it, thanks!

```shell
$ python3 -c "import tensorrt_llm; print(tensorrt_llm.__version__)"
Traceback (most recent call last):
  File "/opt/workspace/TensorRT-LLM_v0.9.0/tensorrt_llm/__init__.py", line 39, in <module>
    import tensorrt_llm.bindings  # NOQA
ImportError: /opt/workspace/TensorRT-LLM_v0.9.0/tensorrt_llm/bindings.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZNK12tensorrt_llm8executor25SpeculativeDecodingConfig22getAcceptanceThresholdEv

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/opt/workspace/TensorRT-LLM_v0.9.0/tensorrt_llm/__init__.py", line 41, in <module>
    raise ImportError(
ImportError: Import of the bindings module failed. Please check the package integrity. If you are attempting to use the pip development mode (editable installation), please execute build_wheels.py first, and then run `pip install -e .`
```
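As an aside, the undefined symbol in that message can be demangled with c++filt (from binutils, assuming it is installed) to see which C++ API the stale extension expects:

```shell
# Demangle the missing symbol from the ImportError above.
echo "_ZNK12tensorrt_llm8executor25SpeculativeDecodingConfig22getAcceptanceThresholdEv" | c++filt
# -> tensorrt_llm::executor::SpeculativeDecodingConfig::getAcceptanceThreshold() const
```

An undefined-symbol error like this usually means the checked-out sources and the prebuilt .so come from different commits, so rebuilding the bindings from the current tree should resolve it.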
