Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: Inference on Arc A770 segfaults after some number of tokens #24360

Open
richiejp opened this issue May 3, 2024 · 0 comments
Open

[Bug]: Inference on Arc A770 segfaults after some number of tokens #24360

richiejp opened this issue May 3, 2024 · 0 comments
Assignees
Labels
bug Something isn't working support_request

Comments

@richiejp
Copy link

richiejp commented May 3, 2024

OpenVINO Version

2024.1.0

Operating System

Other (Please specify in description)

Device used for inference

GPU

Framework

PyTorch

Model used

https://huggingface.co/fakezeta/openchat-3.5-0106-openvino-int8

Issue description

On Ubuntu 22.04.04 using the upstream kernel driver (Linux dell 6.5.0-1020-oem #21-Ubuntu SMP PREEMPT_DYNAMIC Wed Apr 3 14:54:32 UTC 2024 x86_64 x86_64 x86_64 GNU/Linux) with an Intel Arc A770 inference usually causes a segfault after some number of tokens.

Originally reported to LocalAI: mudler/LocalAI#2219

I had issues installing the out-of-tree driver from Intel so have not tried that. There are no problems using the iGPU on the same machine or the CPU.

c.c. @fakezeta

Step-by-step reproduction

It can be reproduced either in LocalAI or by running https://github.com/fakezeta/openchat.openvino/tree/main

All that is required is to ask (any?) model a longer question, such as to describe the output of lscpu. It will usually segfault.

Relevant log output

[ +34.787117] python[66644]: segfault at 1e ip 00007e9adf577063 sp 00007e98aa7ea280 error 4 in libopenvino_intel_gpu_plugin.so[7e9ade9d6000+de4000] likely on CPU 4 (core 8, socket 0)
[  +0.000010] Code: ff e8 81 2c c7 ff 48 8b b5 38 ce ff ff 4c 89 ff 80 8d 37 ce ff ff 80 e8 6b 2c c7 ff 48 8b 85 c0 da ff ff 80 8d 3f ce ff ff 80 <80> 38 00 0f 85 fc 0c 00 00 48 8b 85 f8 da ff ff 80 38 00 74 67 48

Already disassembled here: mudler/LocalAI#2219 (comment)



### Issue submission checklist

- [X] I'm reporting an issue. It's not a question.
- [X] I checked the problem with the documentation, FAQ, open issues, Stack Overflow, etc., and have not found a solution.
- [X] There is reproducer code and related data files such as images, videos, models, etc.
@richiejp richiejp added bug Something isn't working support_request labels May 3, 2024
@geunhwan geunhwan self-assigned this May 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working support_request
Projects
None yet
Development

No branches or pull requests

2 participants