Pull requests: huggingface/text-generation-inference
Pull requests list
reable xpu, broken by gptq and setuptool upgrade
#1988
opened May 31, 2024 by
sywangyi
5 tasks
router: send the input as chunks to the backend
#1981
opened May 30, 2024 by
danieldk
2 of 5 tasks
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
#1940
opened May 23, 2024 by
Narsil
5 tasks
Enhance AsyncClient for Flexible Session Management in Text Generation Client
Stale
#1784
opened Apr 21, 2024 by
BenHaimItay
4 tasks done