Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

simple API interface for initializing and running model inference #1329

Open
aniketmaurya opened this issue Apr 19, 2024 · 0 comments
Open
Labels
enhancement New feature or request

Comments

@aniketmaurya
Copy link
Contributor

We want to serve LLMs from LitGPT using LitServe, however the current model initialization step leaks a lot of complexity at the user code. Also, couldn't find a generator function to stream responses. So had to bring the generate function on user code side.

A simple API to do this kinds of thing in a few lines of code would be really appreciated!

cc: @lantiga

@aniketmaurya aniketmaurya added the enhancement New feature or request label Apr 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant