Skip to content

[Klaud Cold][DO NOT MERGE] Test signoff Check 3 N/A on disagg-only PR#2031

Closed
functionstackx wants to merge 1 commit into
mainfrom
klaud/test-signoff-disagg-check3
Closed

[Klaud Cold][DO NOT MERGE] Test signoff Check 3 N/A on disagg-only PR#2031
functionstackx wants to merge 1 commit into
mainfrom
klaud/test-signoff-disagg-check3

Conversation

@functionstackx

Copy link
Copy Markdown
Collaborator

Summary

🤖 Generated with Claude Code

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
@github-actions

github-actions Bot commented Jul 4, 2026

Copy link
Copy Markdown
Contributor

Thanks for the contribution! Please reach out to respective companies' CODEOWNER to fill in the latest PR_REVIEW_CHECKLIST.md before pinging core maintainer on Slack for review.

For PR verification, add the full-sweep-enabled or full-sweep-fail-fast label to this PR — the benchmark sweep only runs on labeled PRs.

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. See GitHub's docs on re-running failed jobs


感谢你的贡献!请联系相应公司的 CODEOWNER 填写最新的 PR_REVIEW_CHECKLIST.md,然后再在 Slack 上联系核心维护者进行审阅。

如需进行 PR 验证,请为此 PR 添加 full-sweep-enabledfull-sweep-fail-fast 标签 — 基准测试 sweep 仅在带有标签的 PR 上运行。

PR 作者有责任确保合并后所有 GitHub Action 任务完全通过。 很多时候失败只是偶发抖动(flake),重新运行失败的任务即可解决。参见 GitHub 关于重新运行失败任务的文档

1 similar comment
@github-actions

github-actions Bot commented Jul 4, 2026

Copy link
Copy Markdown
Contributor

Thanks for the contribution! Please reach out to respective companies' CODEOWNER to fill in the latest PR_REVIEW_CHECKLIST.md before pinging core maintainer on Slack for review.

For PR verification, add the full-sweep-enabled or full-sweep-fail-fast label to this PR — the benchmark sweep only runs on labeled PRs.

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. See GitHub's docs on re-running failed jobs


感谢你的贡献!请联系相应公司的 CODEOWNER 填写最新的 PR_REVIEW_CHECKLIST.md,然后再在 Slack 上联系核心维护者进行审阅。

如需进行 PR 验证,请为此 PR 添加 full-sweep-enabledfull-sweep-fail-fast 标签 — 基准测试 sweep 仅在带有标签的 PR 上运行。

PR 作者有责任确保合并后所有 GitHub Action 任务完全通过。 很多时候失败只是偶发抖动(flake),重新运行失败的任务即可解决。参见 GitHub 关于重新运行失败任务的文档

@functionstackx

Copy link
Copy Markdown
Collaborator Author

As a PR reviewer and CODEOWNER, I have reviewed this and have:

  • Verified that as of the moment of typing this, this is the latest version of PR_REVIEW_CHECKLIST.md
  • Verified that the general code quality meets the InferenceX standard and does not make the code quality any worse.
  • Verified that this PR has passed PR validation. Please link to GitHub Action workflow that shows this.
  • Verified that this PR passes evals. Please link to GitHub Action workflow that shows this.
  • Verified that speculative decoding PRs uses chat templates to align the AL distribution to real world
  • Verified that the model architecture isn't changed with benchmark hacks like using --hf-overrides to skipping indexer for every x layers on models that don't natively support this. As a general rule, we won't accept optimizations that reduces the number of model architecture FLOPs. Anything that makes that same computation run faster is fair game; FLOPs at lower precisions is fine, given that the config passes private evals. As an general north star princple, we should only use optimizations which is used in production by customers that care about accuracy
  • If an company claims that they support vLLM/SGLang as first class LLM inference engines on their hardware, I have verified that the respective vLLM submission made using upstream https://hub.docker.com/u/vllm docker repo, upstream SGLang https://hub.docker.com/u/lmsysorg docker repo. The only exceptions are for new hardware, such as MI455X UALoE72, Vera Rubin NVL72, Rubin NVL8, etc., and for new model architectures where there is an actual reason why vLLM/SGLang does not fundamentally support them yet as supported by vLLM/SGLang community maintainers
  • If an company claims that they support vLLM/SGLang as first class upstream in-tree LLM inference engines on their hardware, I have have verified that the respective vLLM/SGLang submission has been made before additional frameworks (TRT-LLM, ATOM, etc.). The only exceptions are for new hardware, such as MI455X UALoE72, Vera Rubin NVL72, Rubin NVL8, etc., and for new model architectures where there is an actual reason why vLLM/SGLang does not fundamentally support them yet.
  • Verified that the single-node recipes are similar to the official vLLM recipes and/or theSGLang cookbook:
    • If they are not, I have verified that a PR has been opened in vLLM recipe repo or SGLang repo and linked it below in the additional detail section:
  • If any of the above criteria cannot reasonably be satisfied, I have provided additional reasoning below.

Additional detail section:

  • This is a Dis-agg submission, no recipe update required.

Signed: functionstackx

@Klaud-Cold

Copy link
Copy Markdown
Collaborator

❌❌❌ REJECTED ❌❌❌

@functionstackx — blocking: no passing sweep/eval ran on any commit in this PR, and no authorized /reuse-sweep-run command has been posted. (PR is also titled [DO NOT MERGE] — a verification test PR.)

✅ Check 0 (CODEOWNER): PASS — only changed path is catch-all-owned (* @InferenceX/core); signer is an org MEMBER with no specific-owner mismatch.
❌ Check 1 (sweep on in-PR commit): FAIL — no passing sweep/eval was found on any commit in this PR; the sole commit bfd0099 has no single-node */ or eval / check-runs at all.
❌ Check 2 (evals pass): FAIL — no eval run exists to verify (blocked by Check 1).
➖ Check 3 (recipe link): N/A — disaggregated/multi-node submission (benchmarks/multi_node/** vLLM-disagg only); the recipe-link requirement applies to single-node recipes only.
❌ Check 4 (reuse command): FAIL — no authorized /reuse-sweep-run command has been posted on this PR; an authorized maintainer must comment /reuse-sweep-run before merge via reuse.
✅ Check 5 (latest checklist): PASS — sign-off contains every item of the current docs/PR_REVIEW_CHECKLIST.md, all checked.
➖ Check 6 (upstream image / engine-first): N/A — no configs/*-master.yaml entries changed.
➖ Check 7 (no architecture hacks): N/A — diff adds only a trailing comment; no server args touched.
➖ Check 8 (spec-decode chat template): N/A — no speculative-decoding changes.

@functionstackx

Copy link
Copy Markdown
Collaborator Author

Verified — Check 3 now reports N/A on disagg-only submissions instead of FAIL. Closing.

@functionstackx functionstackx deleted the klaud/test-signoff-disagg-check3 branch July 4, 2026 02:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Development

Successfully merging this pull request may close these issues.

2 participants