
Conversation

@vpietila-amd
Contributor

Proposed changes

Added factories and dispatching for the bwd-weight conv algorithms from old CK that are currently exposed by the narrow build of CK. As part of the factory implementation, I did the following refactoring:

  • Removed direction-specific conv specializations since they were mostly identical. Now there is only one specialization enum, ConvSpecialization, and different algorithms support different sets of specializations. This change simplifies the code because we get rid of various std::variants.
  • Separated the algorithm descriptions (concepts) from the dispatching logic and moved the concepts to a separate file. Added some base concepts that are shared between different algorithm specializations.
  • Combined all algorithm specializations (reference, large tensor, two-stage multiple-D) under an enum ConvAlgorithmSpecialization. This allows us to handle all specializations in the same way.

Ville Pietilä added 30 commits December 18, 2025 04:36
shumway previously approved these changes Jan 7, 2026
Collaborator

@shumway left a comment


This is a huge PR, but overall looks good.

My only concern is the static_assert in unit tests, which should really be a runtime error (a failing static_assert breaks the build, so no tests will run at all).

I'm doing major refactoring on conv_traits.hpp that conflicts with this, but the changes here look minimal; I'm guessing they are just to keep the build from breaking.

Contributor

@spolifroni-amd left a comment


The readme needs clarification.

since `gfx9` architectures do not support WMMA. Hence, to also compile the WMMA builders, add e.g.
`gfx1121` to the list of supported architectures or add the flag `-D CK_USE_WMMA=ON`. One still needs
a Navi card to execute the Builder tests that use the GPU.

Contributor


This is really confusing.

If I got this right:

  1. WMMA isn't supported on gfx9* architectures (which seems strange since the prebuilt rocWMMA is supported on gfx908, gfx90a, gfx942, and gfx950)
  2. Because of this, the WMMA builder won't automatically be included in the tests
  3. So to compile the WMMA builders you need to add a non-gfx9* architecture to the list?
  4. And you need a navi card?

Let me know if I got this right, and if I do, I'll rewrite this for you real quick.

Contributor Author


I clarified the comment; the original one was a bit offhanded. The bottom line is this:

Tests for the WMMA builder are compiled only when the CMake variable CK_USE_WMMA evaluates to true. This can be achieved in one of the following ways:

  1. Use one of the gfx11 or gfx12 architectures as the GPU target, in which case CMake turns this flag on.
  2. Turn the flag on explicitly by passing the CMake option -D CK_USE_WMMA=ON.

Most of the builder tests do not run the instances. Instead, they check that the instance traits match the expected traits. There are a few end-to-end tests that actually execute an instance on the GPU. I haven't yet added such tests for the WMMA builders, but when they are added, they will require an actual Navi card.

I hope this clarifies the Readme comment.

@vpietila-amd
Contributor Author

@shumway, regarding your comment about using static_assert in the unit tests: it was an obsolete unit-test file that tested functionality I didn't include in this PR. I had initially forgotten to remove the file, but it is removed now.
