Add latest_once version policy #2040

zhuyijie · 2022-08-13T00:36:47Z

Hi, we are trying to open source our training/serving framework based on TensorFlow. It includes several patches to TensorFlow and TensorFlow Serving. We want to eliminate those patches by merging reasonable changes into the official repos.

This PR introduces a new version policy called latest_once. A model using this policy loads the latest version once and skips all later polling. This is similar to the latest policy with file_system_poll_wait_seconds=0, except that it is a model-level setting rather than a process-level one. We cannot do this at the process level because we want to serve multiple models with different policies in a single instance.
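
To illustrate the intended semantics, here is a minimal Python sketch of how the two policies differ across repeated filesystem polls. It is a simulation only, not the code in this PR and not a TensorFlow Serving API: `ModelState` and `poll` are hypothetical names, and the behavior when no version exists yet on the first poll (keep polling until one appears) is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ModelState:
    """Hypothetical per-model bookkeeping; not a TensorFlow Serving type."""
    policy: str                   # "latest" or "latest_once"
    loaded: Optional[int] = None  # version currently being served, if any


def poll(state: ModelState, versions_on_disk: List[int]) -> Optional[int]:
    """Simulate one filesystem poll and return the version served afterwards."""
    if state.policy == "latest_once" and state.loaded is not None:
        # A version was already loaded once: ignore anything published later.
        return state.loaded
    if versions_on_disk:
        # "latest" behavior (and the first load of "latest_once"):
        # pick the highest version number found on disk.
        state.loaded = max(versions_on_disk)
    return state.loaded


# Two models in the same process, each with its own policy.
model_a = ModelState(policy="latest")
model_b = ModelState(policy="latest_once")
for disk in ([1], [1, 2], [1, 2, 3]):
    poll(model_a, disk)
    poll(model_b, disk)

print(model_a.loaded)  # 3: keeps following new versions
print(model_b.loaded)  # 1: stays on the first version it loaded
```

Because the policy lives on each model's state rather than on the poller, both behaviors can coexist in one instance, which is what a process-wide poll interval cannot express.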

Our use case is online training of giant recommendation models (>10T), which mainly consist of large sparse embedding tables. The framework mentioned above contains a dynamic embedding table that supports serving-time insertion, deletion, and updating. When a model is published to serving, it loads the latest version and then listens for deltas of new updates. The benefits are reduced memory (only one version is loaded) and a smaller gap between serving and training (because deltas are small and fast to apply).
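
As a rough picture of the load-once-then-apply-deltas flow, the sketch below uses a plain dictionary as the table. It is an illustrative assumption, not our framework's implementation: the class name `DynamicEmbeddingTable`, the delta encoding (a `None` vector means delete), and the method names are all hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Vector = List[float]
# Hypothetical delta encoding: (key, vector) inserts or updates a row,
# (key, None) deletes it.
Delta = Tuple[int, Optional[Vector]]


class DynamicEmbeddingTable:
    """Toy in-memory table: load one base version, then mutate it in place."""

    def __init__(self, base: Dict[int, Vector]) -> None:
        # Copy the base snapshot so later deltas do not alias the caller's data.
        self._rows: Dict[int, Vector] = {k: list(v) for k, v in base.items()}

    def apply(self, deltas: List[Delta]) -> None:
        """Apply a batch of deltas in order."""
        for key, vec in deltas:
            if vec is None:
                self._rows.pop(key, None)   # deletion (no error if absent)
            else:
                self._rows[key] = list(vec)  # insertion or update

    def lookup(self, key: int) -> Optional[Vector]:
        """Return the row for key, or None if it is not present."""
        return self._rows.get(key)

    def __len__(self) -> int:
        return len(self._rows)


# Load the latest full version once...
table = DynamicEmbeddingTable({1: [0.1, 0.2], 2: [0.3, 0.4]})
# ...then keep it fresh with small deltas instead of loading a new version.
table.apply([(2, [0.5, 0.6]), (3, [0.7, 0.8]), (1, None)])

print(len(table))       # 2
print(table.lookup(2))  # [0.5, 0.6]
print(table.lookup(1))  # None
```

Only one copy of the table is ever resident, and each update touches just the rows named in the delta.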

google-cla · 2022-08-13T00:36:50Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

add latest_once version policy

0161cbe

zhuyijie force-pushed the latest_once branch from 72a8680 to 0161cbe · August 13, 2022 00:50
