Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add latest_once version policy #2040

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

zhuyijie
Copy link

Hi, we are trying to open source our training/serving framework based on tensorflow. It includes several patches to tensorflow and tensorflow serving. We want to eliminate those patches by merging reasonable changes to the official repo.

This PR introduces a new version policy which is called latest_once. Models using this policy only loads the latest version once and skips the later polling. This is similar to the latest policy with file_system_poll_wait_seconds=0, except that it is model level setting rather than process level. We are unable to do it on a process level because we want to serve multi models with different policies in the single instance.

The use case we are applying this is online training giant recommendation models(>10T), which mainly contains large sparse embedding tables. The framework mentioned above contains a dynamic embedding table, which support serving time insertion/deletion/updating. When a model is published to serving, it loads the latest version and listens on deltas of new updates. The benefits are reduced memory(only one version) and reduced gap between serving and training(because deltas are small and fast).

@google-cla
Copy link

google-cla bot commented Aug 13, 2022

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant