neuralmagic / nm-vllm Public

forked from vllm-project/vllm

Fork 8
Star 217

Pull requests: neuralmagic/nm-vllm

17 Open 261 Closed

Pull requests list

[Kernel] Add w4a16 support for compressed_tensors models

#295 opened Jun 10, 2024 by dsikka

[Rel Eng] Test Time: Some Kernel Tests in Parallel

#294 opened Jun 9, 2024 by robertgshaw2-neuralmagic

[Rel Eng] Dial In Test Skipping

#293 opened Jun 9, 2024 by robertgshaw2-neuralmagic

[Rel Eng] Dial In Accuracy Tests Phase 1

#289 opened Jun 8, 2024 by robertgshaw2-neuralmagic

Upstream sync 2024 06 08

#288 opened Jun 8, 2024 by robertgshaw2-neuralmagic

Add nightly tag

#287 opened Jun 7, 2024 by dhuangnm

Ds w4a16

#269 opened May 29, 2024 by dsikka • Draft

Marlin moe integration

#266 opened May 24, 2024 by ElizaWszola • Draft

Lwilkinson/metrics expansion

#258 opened May 22, 2024 by LucasWilkinson • Draft

Create test_optional_libraries.py

#230 opened May 9, 2024 by mgoin

Torch compile fusion backend prototype

#209 opened Apr 25, 2024 by bnellnm • Draft

[WIP] Please do not delete - comparing changes between branches

#203 opened Apr 23, 2024 by afeldman-nm

[WIP] FLAN-T5 integration

#194 opened Apr 17, 2024 by afeldman-nm

[WIP] Upstream encoder/decoder support based on multiple blocktables

#161 opened Apr 2, 2024 by afeldman-nm • Draft

Support for compressed-tensors

#159 opened Apr 2, 2024 by dbogunowicz

[WiP] Whisper Implementation

#147 opened Mar 26, 2024 by dbogunowicz

[wip] holistic trace analysis

#146 opened Mar 25, 2024 by LucasWilkinson • Draft
