Pull requests: NVIDIA/TransformerEngine

[C] Allow bias support for sm80/86/89 for cuDNN 9+
#863 opened May 23, 2024 by cyanguwa
2 tasks
Avoid framework specific import from top level (labels: enhancement)
#862 opened May 22, 2024 by ksivaman Draft
6 of 11 tasks
[C/PyTorch/JAX] Build system improvements for rpath and C++11 ABI (labels: build, enhancement)
#858 opened May 20, 2024 by denera
9 of 11 tasks
[Common/PyTorch] Grouped GEMM via multi-stream cuBLAS
#853 opened May 17, 2024 by yaox12
8 of 11 tasks
Use correct FP8 group in multi-GPU docs (labels: 1.7.0, documentation)
#852 opened May 16, 2024 by timmoon10
4 of 11 tasks
Different dimension for attention
#833 opened May 3, 2024 by pggPL
8 of 11 tasks
[C/PyTorch] Add THD support for cuDNN attention
#832 opened May 2, 2024 by cyanguwa
8 of 11 tasks
Added comments about Llama3 weights to Llama tutorial (labels: 1.7.0, documentation)
#830 opened May 1, 2024 by pggPL
7 of 11 tasks
Draft of generation for Gemma
#829 opened May 1, 2024 by pggPL Draft
3 of 11 tasks
Find CXX component for MPI, fortran and C are not needed
#828 opened May 1, 2024 by aurianer
1 of 5 tasks
[PyTorch] Refactor FP8 workspaces in linear modules (labels: bug, enhancement)
#820 opened Apr 27, 2024 by timmoon10
[UB] Adding support for multinode nvlink [WIP]
#815 opened Apr 26, 2024 by shamisp
Bug fix in DGRAD->RS overlap
#802 opened Apr 23, 2024 by vasunvidia Draft
[PyTorch] Fix minor bug in computing num_gqa_groups_per_partition (labels: bug)
#777 opened Apr 13, 2024 by knowlsie
[C/PyTorch] Refactor and move userbuffers into TE/common
#760 opened Apr 8, 2024 by denera
6 of 13 tasks
Fix bhss bias format before sm90
#736 opened Mar 27, 2024 by zlsh80826
[PyTorch] Sequential fuser (labels: enhancement)
#707 opened Mar 9, 2024 by timmoon10
2 of 6 tasks
[Draft] show how to not use a global mesh. (labels: jax)
#549 opened Dec 1, 2023 by nouiz