Skip to content

[WIP] columnwise quantize with tma#3157

Draft
nastya236 wants to merge 15 commits intoml-explore:mainfrom
nastya236:tma_load
Draft

[WIP] columnwise quantize with tma#3157
nastya236 wants to merge 15 commits intoml-explore:mainfrom
nastya236:tma_load

Conversation

@nastya236
Copy link
Collaborator

@nastya236 nastya236 commented Feb 23, 2026

Columnwise quantization with tma (mxfp8).

Size With tma (ms) Without tma (ms)
4096×4096 68.48 77.21
4096×8192 78.74 102.58
8192×4096 80.73 100.61
8192×8192 100.67 145.16
4096×16384 97.08 144.45
16384×4096 99.50 137.13

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant