
Support KFServing API V2 predict protocol #899

Closed
parano opened this issue Jul 15, 2020 · 9 comments
Labels
feature — Feature requests or pull requests implementing a new feature
help-wanted — An issue currently lacks a contributor
Comments

@parano
Member

parano commented Jul 15, 2020

About KFServing API V2 predict protocol: https://github.com/kubeflow/kfserving/tree/master/docs/predict-api/v2

The Predict Protocol, version 2 is a set of HTTP/REST and GRPC APIs for inference / prediction servers. By implementing this protocol both inference clients and servers will increase their utility and portability by being able to operate seamlessly on platforms that have standardized around this protocol.

The protocol is composed of a required set of APIs that must be implemented by a compliant server. This required set of APIs is described in required_api.md. The GRPC proto specification for the required APIs is available.

This issue is to add an option to the BentoML API server that enables a set of endpoints compatible with the KFServing API V2 protocol. It would make the BentoML API server work much more smoothly with other tools in the Kubeflow ecosystem.
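For reference, a rough sketch of what a V2-style infer call against such an endpoint could look like (the model name, tensor name, shape, and host/port below are made up for illustration; the exact route and payload layout follow the V2 spec linked above):

```python
import requests

# Hypothetical request body: the V2 protocol describes inputs as named,
# typed, shaped tensors sent to POST /v2/models/<model_name>/infer.
payload = {
    "id": "example-request-1",
    "inputs": [
        {
            "name": "input__0",              # tensor name (model specific)
            "shape": [1, 4],                 # one instance with four features
            "datatype": "FP32",              # V2 datatype string
            "data": [5.1, 3.5, 1.4, 0.2],
        }
    ],
}

resp = requests.post(
    "http://localhost:5000/v2/models/my_model/infer",  # hypothetical BentoML server
    json=payload,
)
# The response is expected to carry an "outputs" list of named result tensors.
print(resp.json())
```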

@parano parano added the help-wanted and feature labels Jul 15, 2020
@yubozhao yubozhao added the MLH label Sep 25, 2020
@pncnmnp
Contributor

pncnmnp commented Oct 27, 2020

@yubozhao
Kishore and I are interested in contributing to this issue.

@parano
Member Author

parano commented Oct 27, 2020

Hi @pncnmnp @Kishore - it probably makes more sense to first implement gRPC support in BentoML (#703) before supporting KFServing V2's predict protocol.

@parano parano added this to Next major release in Roadmap via automation Nov 26, 2020
@parano parano moved this from Next major release to Mid-Long Term in Roadmap Nov 26, 2020
@stale

stale bot commented Feb 2, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale label Feb 2, 2021
@yubozhao yubozhao removed MLH labels Feb 2, 2021
@stale

stale bot commented Jun 2, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale label Jun 2, 2021
@parano parano removed the stale label Jun 14, 2021
@parano parano closed this as completed Jul 22, 2021
Roadmap automation moved this from Mid-Long Term to Done Jul 22, 2021
@yonil7

yonil7 commented Oct 23, 2021

Is this feature already supported?

@parano
Member Author

parano commented Oct 23, 2021

@yonil7 not yet, is this feature a blocker for you? would love to learn more

@yonil7

yonil7 commented Oct 24, 2021

I was just looking for a standard inference REST API and was surprised to see there is no such standard.
The closest I could find was the KServe predict protocol v2, but I think its infer API is unnecessarily complex and verbose.

On the other hand, the GCP Vertex AI predict API (which is almost identical to the GCP AI Platform predict API and the TensorFlow Serving predict API) offers the same capabilities with a much more elegant API: each instance/prediction, as well as the request parameters, is just a JSON value (number, null, bool, string, (nested) list, or (nested) object).
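To make the comparison concrete, here is a hedged side-by-side sketch of the two request shapes (all field values are illustrative only, and the "threshold" parameter is hypothetical):

```python
# KServe/KFServing V2 infer request: named, typed, shaped tensors.
kserve_v2_request = {
    "inputs": [
        {
            "name": "input__0",
            "shape": [2, 2],
            "datatype": "FP32",
            "data": [1.0, 2.0, 3.0, 4.0],
        }
    ],
}

# Vertex AI / TensorFlow Serving style request: a plain list of JSON
# instances plus optional free-form parameters.
vertex_style_request = {
    "instances": [[1.0, 2.0], [3.0, 4.0]],
    "parameters": {"threshold": 0.5},  # hypothetical parameter
}
```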

@parano
Member Author

parano commented Oct 25, 2021

@yonil7 BentoML is trying to establish such a standard, and it takes a very different approach compared to KServe's protocol. Essentially, BentoML defines how an HTTP request/response is converted to and from the Python object that a data scientist's code takes as input to its inference function.
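A minimal sketch of that approach, using the BentoML 1.x style service definition (the model tag, service name, and input shape are hypothetical; the IO descriptors are what map the HTTP request body to the Python object the function receives, and the return value back to the response):

```python
import bentoml
import numpy as np
from bentoml.io import NumpyNdarray

# Hypothetical saved model tag; to_runner() wraps it for serving.
runner = bentoml.sklearn.get("my_model:latest").to_runner()
svc = bentoml.Service("iris_classifier", runners=[runner])

# The NumpyNdarray descriptors declare how the JSON request body is decoded
# into the ndarray passed to predict(), and how its result is encoded back.
@svc.api(input=NumpyNdarray(), output=NumpyNdarray())
def predict(input_array: np.ndarray) -> np.ndarray:
    return runner.predict.run(input_array)
```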

@wolvever

wolvever commented Aug 9, 2023

The KServe predict protocol is supported in KServe, Seldon Core, and Triton Inference Server. I think it makes sense to support this protocol to allow users to switch between different frameworks. We are currently using Triton Inference Server to serve our own models for historical reasons, and we also want to provide BentoML to our collaborators to simplify serving and deployment. But our product relies on this predict protocol, so we can't offer BentoML right now.
