forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 8
Pull requests: neuralmagic/nm-vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Kernel] Add
w4a16
support for compressed_tensors
models
#295
opened Jun 10, 2024 by
dsikka
Loading…
[Rel Eng] Test Time: Some Kernel Tests in Parallel
#294
opened Jun 9, 2024 by
robertgshaw2-neuralmagic
Loading…
[Rel Eng] Dial In Accuracy Tests Phase 1
#289
opened Jun 8, 2024 by
robertgshaw2-neuralmagic
Loading…
[WIP] Please do not delete - comparing changes between branches
#203
opened Apr 23, 2024 by
afeldman-nm
Loading…
[WIP] Upstream encoder/decoder support based on multiple blocktables
#161
opened Apr 2, 2024 by
afeldman-nm
•
Draft
ProTip!
Mix and match filters to narrow down what you’re looking for.