Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[feat]: Support weight only gemm with 2bit
triaged
Issue has been triaged by maintainers
#1568
opened May 9, 2024 by
gavinchen430
Loading…
Update customAllReduceKernels.cu - line 120's typo was edited
#1558
opened May 8, 2024 by
sjbae1999
Loading…
[fix] export failure with CUDA driver < 526 and pynvml>=11.5.0
#1537
opened May 3, 2024 by
CoderHam
Loading…
Loading Medusa Safetensors + AWQ Conversion correction
triaged
Issue has been triaged by maintainers
#1535
opened May 2, 2024 by
Tushar-ml
Loading…
Define hf_config explisitly for convert_hf_mpt_legacy
#1534
opened May 2, 2024 by
bloodeagle40234
Loading…
Add note on build Llama v3
neeed more info
triaged
Issue has been triaged by maintainers
#1522
opened Apr 29, 2024 by
sammcj
Loading…
fix: correct cudaSetDevice error when GPUs per node are fewer than their ranks in inter-node inference
#1495
opened Apr 24, 2024 by
littlefatfat
Loading…
Support internlm2
triaged
Issue has been triaged by maintainers
#1392
opened Apr 2, 2024 by
RunningLeon
Loading…
[feat]: Add Option to convert and run distil-whisper large-v3
#1337
opened Mar 22, 2024 by
IbrahimAmin1
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.