Pull requests: NVIDIA/TransformerEngine

Pull requests list

Bump transformers from 4.57.0 to 5.0.0rc3 in /docs/examples/te_llama (labels: dependencies, python)
#2850 opened Apr 8, 2026 by dependabot bot
Simplify FA3 discovery
#2849 opened Apr 8, 2026 by vcherepanov-nv
4 of 13 tasks
Skip activation kernels when tensor size is zero (label: bug)
#2848 opened Apr 8, 2026 by timmoon10
8 of 13 tasks
[Common] Multicast Fixes
#2847 opened Apr 8, 2026 by phu0ngng Draft
13 tasks
Add Megatron-FSDP E2E integration test to TE CI/CD (L1).
#2845 opened Apr 7, 2026 by cspades
3 of 13 tasks
[Core] Report CUDA versions when NVRTC compilation fails (label: enhancement)
#2842 opened Apr 7, 2026 by timmoon10
8 of 13 tasks
comm_gemm_test fixes
#2839 opened Apr 6, 2026 by almogsegal
13 tasks
Add grouped unswizzle functionality for MXFP8 scaling factors
#2837 opened Apr 5, 2026 by int-smart
8 of 13 tasks
Fix JAX extension build with NVTE_UB_WITH_MPI=1
#2835 opened Apr 4, 2026 by GaetanLepage
2 of 13 tasks
fix CUDA architectures cmake logic (label: community-contribution)
#2832 opened Apr 3, 2026 by GaetanLepage
2 of 13 tasks
Port softmax ops to libtorch stable ABI
#2830 opened Apr 3, 2026 by pstjohn
Cp thd swa with ag
#2829 opened Apr 3, 2026 by sudhakarsingh27 Draft
13 tasks
[Common] Reduced padding kernel compilation time
#2827 opened Apr 2, 2026 by Oleg-Goncharov
5 of 13 tasks
fix(CP, MLA): CP works fine with MLA in a2a cp_comm_type (label: community-contribution)
#2826 opened Apr 2, 2026 by zhujian19891203
5 of 13 tasks
[Common] Fix fused router for large top-K and expert counts
#2821 opened Apr 1, 2026 by harryzhou2000
7 of 13 tasks
[Pytorch][Common] Hybrid quantization
#2817 opened Mar 31, 2026 by negvet
1 of 13 tasks