feat: support iteration level scheduling #383

Open
kemingy opened this issue Jun 13, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

@kemingy
Member

kemingy commented Jun 13, 2023

          Also https://www.usenix.org/conference/osdi22/presentation/yu

Originally posted by @VoVAllen in #382 (comment)

@kemingy kemingy added the enhancement New feature or request label Jun 13, 2023
@kemingy
Member Author

kemingy commented Jun 13, 2023

Although Orca couples the scheduler with the execution engine, there is still something we can learn from it.

GPT-like models can benefit from iteration-level scheduling in the following ways:

  1. a request that has finished can return to the client before the other requests in its batch are finished
  2. new requests can enter the batch without waiting for every request in the previous batch to finish
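
To make the two points above concrete, here is a minimal toy sketch of an iteration-level scheduling loop. It is not Orca's implementation and not an existing API in this project: `Request`, `tokens_needed`, and `run_iteration_level` are hypothetical names, and one "iteration" stands in for a single decoding step of the model.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Request:
    """Hypothetical request that needs `tokens_needed` decode iterations."""

    name: str
    tokens_needed: int
    generated: int = 0

    def step(self) -> None:
        # Stand-in for one forward pass that produces one token.
        self.generated += 1

    @property
    def done(self) -> bool:
        return self.generated >= self.tokens_needed


def run_iteration_level(requests, max_batch_size):
    """Return [(iteration, name), ...] in the order requests complete."""
    if max_batch_size < 1:
        raise ValueError("max_batch_size must be at least 1")
    waiting = deque(requests)
    running = []
    finished = []
    iteration = 0
    while waiting or running:
        # Point 2: admit new requests as soon as a slot is free,
        # without waiting for the whole previous batch to drain.
        while waiting and len(running) < max_batch_size:
            running.append(waiting.popleft())
        iteration += 1
        for req in running:
            req.step()
        # Point 1: hand back finished requests right after this iteration,
        # even though other requests in the batch are still running.
        still_running = []
        for req in running:
            if req.done:
                finished.append((iteration, req.name))
            else:
                still_running.append(req)
        running = still_running
    return finished


if __name__ == "__main__":
    reqs = [Request("a", 1), Request("b", 3), Request("c", 2)]
    print(run_iteration_level(reqs, max_batch_size=2))
    # [(1, 'a'), (3, 'b'), (3, 'c')]
```

In this toy run, `a` returns after iteration 1 and `c` takes its slot at iteration 2. With request-level batching under the same assumptions, `a` would be held until `b` finishes at iteration 3, and `c` would not finish until iteration 5.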

refer to:
