Skip to content

Pull requests: radixark/miles

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

V4 mxfp8
#1340 opened Jun 13, 2026 by xiuhu17 Contributor Draft
[AMD CI] [1/N] Support Python 3.10 on ROCm base
#1339 opened Jun 12, 2026 by XinyuJiangCMU Contributor Loading…
[fix] quantizer_fp8: emit deepgemm ue8m0 scales only when dense GEMM …
#1338 opened Jun 12, 2026 by xiuhu17 Contributor Loading…
feat(rl): add CISPO advantage estimator (MiniMax-M1)
#1331 opened Jun 12, 2026 by EazyReal Loading…
fix(loss): first-class --pg-loss-divisor for Dr.GRPO
#1328 opened Jun 12, 2026 by EazyReal Loading…
fix(grpo): count each rollout once under fan-out
#1325 opened Jun 12, 2026 by EazyReal Loading…
feat: add FlashQLA backend for Qwen GDN linear-attention layers
#1318 opened Jun 11, 2026 by Zhichenzzz Contributor Loading…
fix: load Qwen 3.5 checkpoint with unfused experts
#1317 opened Jun 10, 2026 by lawrence-harmonic Contributor Loading…
[doc, CI] doc driven CI
#1312 opened Jun 9, 2026 by guapisolo Collaborator Loading…
fix(qwen3-vl): per-segment mRoPE + vision under CP + THD packing
#1308 opened Jun 8, 2026 by Zhichenzzz Contributor Loading…
fix(mtp): track megatron mtp_model_layer rename in raw converters
#1307 opened Jun 8, 2026 by Zhichenzzz Contributor Loading…
DO NOT MERGE: CI test run-ci-model-scripts Run model script smoke tests
#1306 opened Jun 8, 2026 by yueming-yuan Collaborator Loading…
[NPU] Feature add npu docker
#1305 opened Jun 8, 2026 by codemayq Loading…
ProTip! Updated in the last three days: updated:>2026-06-12.