Skip to content

Conversation

@amd-nithyavs
Copy link
Contributor

Adds some bug fixes to the acoll component.

  • gather is corrected to work for non-zero root ranks when three stage algorithm is used.
  • Busy waits in shared memory bcast and barrier are changed to avoid random hangs.
  • Overrides with command line arguments are properly taken care of in bcast algorithm selection.
  • Multinode path of reduce and allreduce is fixed to use the hierarchical algorithms of acoll.
  • Compile time warnings are removed.

Adds some bug fixes to the acoll component.
- gather is corrected to work for non-zero root ranks when three stage
  algorithm is used.
- Busy waits in shared memory bcast and barrier are changed to avoid
  random hangs.
- Overrides with command line arguments are properly taken care of in
  bcast algorithm selection.
- Multinode path of reduce and allreduce is fixed to use the
  hierarchical algorithms of acoll.
- Compile time warnings are removed.

Signed-off-by: Nithya V S <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants