Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[C] Allow bias support for sm80/86/89 for cuDNN 9+
#863
opened May 23, 2024 by
cyanguwa
Loading…
2 tasks
[C/PyTorch/JAX] Build system improvements for rpath and C++11 ABI
build
Build system
enhancement
New feature or request
#858
opened May 20, 2024 by
denera
Loading…
9 of 11 tasks
[Common/PyTorch] Grouped GEMM via multi-stream cuBLAS
#853
opened May 17, 2024 by
yaox12
Loading…
8 of 11 tasks
Use correct FP8 group in multi-GPU docs
1.7.0
documentation
Improvements or additions to documentation
#852
opened May 16, 2024 by
timmoon10
Loading…
4 of 11 tasks
[JAX] Rewrite the Format of FP8 Meta and Remove unused ShardingTypes.
#842
opened May 13, 2024 by
mingxu1067
Loading…
8 of 11 tasks
[C/PyTorch] Add THD support for cuDNN attention
#832
opened May 2, 2024 by
cyanguwa
Loading…
8 of 11 tasks
Added comments about Llama3 weights to Llama tutorial
1.7.0
documentation
Improvements or additions to documentation
#830
opened May 1, 2024 by
pggPL
Loading…
7 of 11 tasks
Find CXX component for MPI, fortran and C are not needed
#828
opened May 1, 2024 by
aurianer
Loading…
1 of 5 tasks
[Pytorch] Implement fp32 accumulation for attention with context parallel in both forward and backward pass.
#821
opened Apr 28, 2024 by
Yuxin-CV
Loading…
[PyTorch] Refactor FP8 workspaces in linear modules
bug
Something isn't working
enhancement
New feature or request
#820
opened Apr 27, 2024 by
timmoon10
Loading…
[PyTorch] Fix minor bug in computing num_gqa_groups_per_partition
bug
Something isn't working
#777
opened Apr 13, 2024 by
knowlsie
Loading…
[C/PyTorch] Refactor and move userbuffers into TE/common
#760
opened Apr 8, 2024 by
denera
Loading…
6 of 13 tasks
[PyTorch] Sequential fuser
enhancement
New feature or request
#707
opened Mar 9, 2024 by
timmoon10
Loading…
2 of 6 tasks
[PyTorch] Distributed intermediate/activation tensors for FSDP
1.7.0
#687
opened Feb 28, 2024 by
denera
Loading…
Remove now useless padding as it is now down automatically.
jax
#680
opened Feb 25, 2024 by
nouiz
Loading…
Add the examples and the tests in the installed packages.
#514
opened Nov 10, 2023 by
nouiz
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.