
How do we scale runner independently of the api server without Yatai? #3303


As far as I know, there is no option to increase the runner process count.

But here is one way to do what you want.

BentoML supports the following CLI commands:

  • bentoml start-runner-server
  • bentoml start-http-server
  • bentoml start-grpc-server

It is possible to deploy in the same way as the architecture (without a Unix socket).

Following the method below, the runner container and the http-server can be deployed separately in separate pods,
and the runner container can be scaled up.


docker run -d  --name iris-model1-runner --network="bento-test-network"  -p 3001:3000 iris-bento:latest start-runner-server --runner-name iris-model1

docker run -d  --name iris-model2-runner --network="bento-test-network"  -p 3001:3000 iris-bento:latest…
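
The runner containers above still need an API server that knows where to reach them. Here is a minimal dry-run sketch of how that command could be assembled. It assumes that `start-http-server` accepts repeated `--remote-runner NAME=ADDRESS` options (please check `bentoml start-http-server --help` for your version), that each runner listens on port 3000 inside a container named `<runner-name>-runner`, and that `iris-api-server` is just a hypothetical container name. The script only prints the command; it does not start anything.

```shell
#!/bin/sh
# Dry-run sketch: prints a docker command, starts no containers.
# Assumption: start-http-server accepts repeated --remote-runner NAME=ADDRESS
# options; verify with `bentoml start-http-server --help`.

# Build one --remote-runner flag per runner name. Each runner is assumed to
# run in a container called <name>-runner, listening on port 3000, on the
# same Docker network as the API server.
remote_runner_flags() {
  flags=""
  for name in "$@"; do
    flags="$flags --remote-runner ${name}=tcp://${name}-runner:3000"
  done
  # Drop the leading space left by the first iteration.
  printf '%s' "${flags# }"
}

# Print the command for the API server container.
# "iris-api-server" is a hypothetical container name.
api_server_cmd() {
  printf 'docker run -d --name iris-api-server --network="bento-test-network" -p 3000:3000 iris-bento:latest start-http-server %s\n' "$(remote_runner_flags "$@")"
}

api_server_cmd iris-model1 iris-model2
```

In Kubernetes the same idea would presumably apply with a Service name in place of the container name, so that the runner pods behind it can be scaled without touching the API server.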

Answer selected by cadmusthefounder