ONNX version of open-ai/whisper large model #4784

Prashanth-dvs · 2023-01-18T04:18:00Z


I have exported the openai/whisper model to ONNX, but the resulting ONNX file is about 4.5GB. After quantization, the size drops to 1.3GB. Could someone suggest a way to reduce the ONNX size further? Also, when I run optimization, I get the following error:

KeyError: "ONNX Runtime doesn't support the graph optimization of whisper yet. Only ['bert', 'gpt2', 'bart'] are supported. If you want to support whisper please propose a PR or open up an issue in ONNX Runtime:https://github.com/microsoft/onnxruntime."
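Until whisper is supported, one workaround is to catch the `KeyError` and continue with the unoptimized model rather than aborting the whole pipeline. The sketch below is only an illustration: `fake_optimize` and `optimize_or_skip` are hypothetical names, and `fake_optimize` is a stand-in that mimics the error above, not a real library call. The supported list is copied from the error message.

```python
# Model types listed as supported in the error message above.
SUPPORTED_TYPES = ["bert", "gpt2", "bart"]


def fake_optimize(model_type: str) -> str:
    """Hypothetical stand-in for a graph-optimization call (not a real API)."""
    if model_type not in SUPPORTED_TYPES:
        raise KeyError(
            f"graph optimization of {model_type} is not supported; "
            f"only {SUPPORTED_TYPES} are supported"
        )
    return f"{model_type}-optimized"


def optimize_or_skip(model_type: str, optimize=fake_optimize) -> str:
    """Try graph optimization; fall back to the unoptimized model on KeyError."""
    try:
        return optimize(model_type)
    except KeyError as err:
        # Unsupported architecture: report it and keep the original export.
        print(f"Skipping graph optimization: {err}")
        return f"{model_type}-unoptimized"


if __name__ == "__main__":
    print(optimize_or_skip("bert"))
    print(optimize_or_skip("whisper"))
```

With a real optimizer, the callable passed as `optimize` would wrap whatever optimization call raised the error, and the fallback branch would return the path of the original export.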

Any help would be appreciated.
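For context on how much smaller the file could get, here is a rough back-of-the-envelope sketch. It assumes the 4.5GB export stores 32-bit float weights and that the file is almost entirely weights; neither is confirmed above, and the helper names are made up for illustration.

```python
# Rough size arithmetic; every figure here is an estimate, not a measurement.
BYTES_PER_WEIGHT = {"fp32": 4, "fp16": 2, "int8": 1}


def estimate_param_count(file_size_gb: float, dtype: str = "fp32") -> float:
    """Approximate parameter count, assuming the file is almost all weights."""
    return file_size_gb * 1e9 / BYTES_PER_WEIGHT[dtype]


def estimate_size_gb(param_count: float, dtype: str) -> float:
    """Approximate file size in GB for the given weight precision."""
    return param_count * BYTES_PER_WEIGHT[dtype] / 1e9


# Assumption: the 4.5GB export is fp32.
params = estimate_param_count(4.5, "fp32")

if __name__ == "__main__":
    for dtype in ("fp32", "fp16", "int8"):
        print(f"{dtype}: ~{estimate_size_gb(params, dtype):.2f} GB")
```

Under these assumptions, 8-bit weights alone would account for a little over 1.1GB, so the 1.3GB quantized file is already close to what 8-bit quantization can deliver. Going much lower would probably require a smaller Whisper checkpoint or lower-precision weights.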
