Force workspace usage with MPI backends on MNNVL communicators when fabric allocated workspace is available. by romerojosh · Pull Request #108 · NVIDIA/cuDecomp

romerojosh · 2026-03-04T20:21:30Z

This PR detects and disables some of the transpose shortcut paths (e.g. alltoall directly from/to user input/output buffers bypassing the workspace) for MPI backends in situations where:

The workspace is fabric allocated (e.g. CUDECOMP_ENABLE_CUMEM=1)
The communicator is on an MNNVL-equipped system and contains multi-node NVLink connections.

The reason for this is that at the current time, MPI communication over MNNVL involving non-fabric allocated buffers uses a staging protocol to route communication via internal fabric-allocated buffers which is a bit less efficient than communication using already fabric allocated buffers, similar to the treatment of managed memory allocations. The shortcut paths present MPI with non-fabric allocated user input/output buffers which trigger this less efficient path, negating the benefits of the shortcut.

Similarly logic is applied to halo communication cases that would normally bypass work space staging.

…ors when fabric allocated workspace is available. Signed-off-by: Josh Romero <joshr@nvidia.com>

Signed-off-by: Josh Romero <joshr@nvidia.com>

romerojosh · 2026-03-04T20:27:24Z

/build

github-actions · 2026-03-04T20:27:36Z

🚀 Build workflow triggered! View run

github-actions · 2026-03-04T20:33:56Z

✅ Build workflow passed! View run

Signed-off-by: Josh Romero <joshr@nvidia.com>

romerojosh · 2026-03-04T20:59:23Z

/build

github-actions · 2026-03-04T20:59:33Z

🚀 Build workflow triggered! View run

github-actions · 2026-03-04T21:07:25Z

✅ Build workflow passed! View run

romerojosh added 2 commits March 4, 2026 11:54

Disable transpose shortcut paths for MPI backends on MNNVL communicat…

977cf0f

…ors when fabric allocated workspace is available. Signed-off-by: Josh Romero <joshr@nvidia.com>

Formatting.

90c62c5

Signed-off-by: Josh Romero <joshr@nvidia.com>

Similar logic to use fabric-allocated workspace for halos.

5b8b8f1

Signed-off-by: Josh Romero <joshr@nvidia.com>

romerojosh changed the title ~~Disable transpose shortcut paths for MPI backends on MNNVL communicators when fabric allocated workspace is available.~~ Force workspace usage with MPI backends on MNNVL communicators when fabric allocated workspace is available. Mar 4, 2026

romerojosh merged commit 3c68e4a into main Mar 4, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Force workspace usage with MPI backends on MNNVL communicators when fabric allocated workspace is available.#108

Force workspace usage with MPI backends on MNNVL communicators when fabric allocated workspace is available.#108
romerojosh merged 3 commits intomainfrom
mpi_mnnvl_no_shortcut

romerojosh commented Mar 4, 2026 •

edited

Loading

Uh oh!

romerojosh commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

romerojosh commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

romerojosh commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

romerojosh commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

romerojosh commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

romerojosh commented Mar 4, 2026 •

edited

Loading