simple API interface for initializing and running model inference #1329

aniketmaurya · 2024-04-19T22:11:52Z

We want to serve LLMs from LitGPT using LitServe, however the current model initialization step leaks a lot of complexity at the user code. Also, couldn't find a generator function to stream responses. So had to bring the generate function on user code side.

A simple API to do this kinds of thing in a few lines of code would be really appreciated!

cc: @lantiga

aniketmaurya added the enhancement New feature or request label Apr 19, 2024

aniketmaurya mentioned this issue Apr 20, 2024

add llama3 example Lightning-AI/LitServe#51

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

simple API interface for initializing and running model inference #1329

simple API interface for initializing and running model inference #1329

aniketmaurya commented Apr 19, 2024

simple API interface for initializing and running model inference #1329

simple API interface for initializing and running model inference #1329

Comments

aniketmaurya commented Apr 19, 2024