fix(messaging): use native WebSocket credential rewrite by ericksoa · Pull Request #3323 · NVIDIA/NemoClaw

ericksoa · 2026-05-10T23:20:17Z

Summary

remove the in-sandbox Discord facade/proxy workaround and route Discord Gateway traffic through native OpenShell WebSocket policy
add hermetic fake Discord Gateway and fake Slack API E2E helpers that explicitly allow Docker private host ranges via OpenShell allowed_ips
keep messaging tokens as OpenShell placeholders in the sandbox and verify host-side token injection only at egress
add schema/test coverage for allowed_ips and tighten gateway-state handling when OpenShell status output includes transport errors

Dependency

This PR depends on OpenShell PR #1286: NVIDIA/OpenShell#1286

Required OpenShell head tested locally: eab184f20bb27c1db8b62deb33717590b018a24a.

The dependency is required because NemoClaw now uses openshell policy update --add-endpoint ... allowed-ip=... for hermetic fake provider endpoints, and native WebSocket credential rewrite must work for Discord Gateway traffic. Without OpenShell #1286, the fake providers remain blocked by private-IP policy and WebSocket IDENTIFY token rewrite is not proven.

Validation

npm run build:cli
npx vitest run test/validate-config-schemas.test.ts test/policies.test.ts test/generate-openclaw-config.test.ts test/generate-hermes-config.test.ts test/hermes-decode-proxy.test.ts test/gateway-state.test.ts
bash -n test/e2e/test-messaging-providers.sh test/e2e/test-hermes-discord-e2e.sh test/e2e/lib/*.sh
node --check test/e2e/lib/fake-discord-gateway.cjs test/e2e/lib/fake-slack-api.cjs
NEMOCLAW_OPENSHELL_BIN=/Users/aerickson/Documents/NemoDev/0510-fix-messaging/OpenShell/target/debug/openshell bash test/e2e/test-hermes-discord-e2e.sh - 24 passed, 0 failed
NEMOCLAW_OPENSHELL_BIN=/Users/aerickson/Documents/NemoDev/0510-fix-messaging/OpenShell/target/debug/openshell bash test/e2e/test-messaging-providers.sh - 60 passed, 0 failed, 5 skipped
NemoClaw pre-push hooks passed when publishing fix/native-messaging-websocket

Notes

This branch should not be merged before OpenShell #1286 lands or NemoClaw pins/installs an OpenShell build containing that PR.

Summary by CodeRabbit

Chores
- Updated OpenShell version requirement to 0.0.38 for enhanced messaging protocol support
- Refined network policy configurations for improved credential handling and security in Discord and Slack messaging integration

coderabbitai · 2026-05-10T23:20:31Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

This PR removes Hermes local Discord facade, decode-proxy, and Slack token rewriter components, replacing them with OpenShell native WebSocket (L7) credential-rewrite support. Policy and schema are extended for websocket protocol. OpenShell is pinned to 0.0.38 with messaging capability validation. Hermetic fake Discord Gateway and Slack API E2E helpers are added. Hermes startup, runtime, recovery, and onboarding are simplified. Tests and CI are updated throughout.

Changes

Messaging Bridge Simplification & Native WebSocket Migration

Layer / File(s)	Summary
Schema and policy infrastructure for WebSocket support `schemas/policy-preset.schema.json`, `schemas/sandbox-policy.schema.json`, `nemoclaw-blueprint/policies/presets/discord.yaml`, `nemoclaw-blueprint/policies/presets/slack.yaml`, `agents/hermes/policy-additions.yaml`, `agents/hermes/policy-permissive.yaml`, `nemoclaw-blueprint/policies/openclaw-sandbox-permissive.yaml`, `agents/openclaw/policy-permissive.yaml`	Extend endpoint schema to support `websocket` protocol alongside `rest`; add `websocket_credential_rewrite` and `request_body_credential_rewrite` boolean fields; extend allow methods to include `WEBSOCKET_TEXT`; update Discord and Slack policies to use `protocol: websocket` with credential rewrite and explicit allow rules, replacing prior `access: full` + `tls: skip` L4 tunnel configurations.
Remove local Discord facade, decode-proxy, ws-proxy-fix, and Slack rewriter `agents/hermes/discord-facade.py`, `agents/hermes/decode-proxy.py`, `nemoclaw-blueprint/scripts/ws-proxy-fix.js`, `nemoclaw-blueprint/scripts/ws-proxy-fix.ts`, `agents/hermes/discord-preload/sitecustomize.py`, `scripts/nemoclaw-start.sh`, `test/seccomp-guard.test.ts`, `test/service-env.test.ts`	Delete `discord-facade.py`, `decode-proxy.py`, ws-proxy-fix scripts; remove facade URL conditional logic from aiohttp patch; delete `install_slack_token_rewriter()` and constants; remove NODE_OPTIONS preload injection and permission validation; update test harnesses to remove related environment variables and assertions.
Simplify Hermes startup and messaging configuration `agents/hermes/start.sh`, `agents/hermes/config/messaging-config.ts`, `agents/hermes/Dockerfile`	Remove `start_decode_proxy()` and `start_discord_facade()` functions; remove DISCORD_PROXY, NEMOCLAW_DISCORD_FACADE_URL exports, and Discord preload PYTHONPATH injection; simplify root and non-root Hermes launch to only set HERMES_HOME; update Dockerfile comments and staging to remove decode-proxy/facade/preload; apply proxy_url only to telegram channel account.
Update Hermes runtime configuration and recovery `src/lib/agent/runtime.ts`, `src/lib/agent/runtime.test.ts`	Simplify `hermesGatewayEnvPrefix()` to emit only HERMES_HOME; remove `hermesDecodeProxyRecoveryCommand()`; remove decode-proxy recovery steps from recovery script and manual recovery command; assert absence of Discord-proxy/facade identifiers and bridge ports in tests.
Enhance gateway lifecycle and cleanup handling `src/lib/onboard.ts`, `test/onboard.test.ts`, `test/gateway-liveness-probe.test.ts`	Add `gatewayCliSupportsLifecycleCommands()` capability probe; update `destroyGateway()` to choose strategy based on Docker-driver vs lifecycle support and return boolean; clear registry only on success; gate container running checks behind lifecycle support; update sandbox readiness polling; remove Slack Socket Mode env injection; add regression tests asserting registry state depends on cleanup success.
Pin OpenShell to 0.0.38 and add messaging capability checks `scripts/install-openshell.sh`, `test/install-openshell-version-check.test.ts`, `test/runner.test.ts`, `nemoclaw-blueprint/blueprint.yaml`	Update version constants to 0.0.38; parameterize PIN_VERSION/TAG/REPO from environment; add `openshell_has_required_messaging_features()` using `strings` probe for credential-rewrite markers; gate "already installed" on version bounds and messaging-feature markers; re-verify markers post-install; update test expectations and add stub `strings()` function to test stubs.
Improve gateway connection detection `src/lib/state/gateway.ts`, `test/gateway-state.test.ts`	Update `isGatewayConnected()` to validate string input, strip ANSI codes, detect error patterns; add test fixtures and assertions for "Connection refused" output with ANSI wrapping.
Add hermetic fake Discord Gateway E2E infrastructure `test/e2e/lib/discord-gateway-proof.sh`, `test/e2e/lib/fake-discord-gateway.cjs`	Create shell helpers for lifecycle management and dual Node/Python client implementations validating WebSocket upgrade, HELLO, IDENTIFY, READY, HEARTBEAT_ACK sequences; create Node.js fake gateway implementing handshake, token validation, heartbeat echo, and event capture.
Add hermetic fake Slack API E2E infrastructure `test/e2e/lib/slack-api-proof.sh`, `test/e2e/lib/fake-slack-api.cjs`	Create shell helpers for fake Slack API lifecycle, policy application, and Node-based HTTP request execution; create Node.js fake Slack API implementing token validation, placeholder detection, JSONL capture, and error responses.
Migrate Hermes Discord E2E test to hermetic gateway `test/e2e/test-hermes-discord-e2e.sh`	Switch from in-sandbox facade checks to host-side hermetic fake gateway; validate WebSocket upgrade/IDENTIFY/READY/heartbeat sequence; assert capture contains real token and excludes raw placeholder marker; replace facade-residue phase with bridge-residue scan; add NEMOCLAW_FRESH and NEMOCLAW_OPENSHELL_BIN support; use openshell --version instead of command -v.
Migrate messaging-providers E2E tests to hermetic helpers `test/e2e/test-messaging-providers.sh`	Replace native Discord/Slack credential probing with hermetic fake implementations; add unresolved-placeholder negative controls; update Socket Mode validation; add NEMOCLAW_E2E_KEEP_SANDBOX support; broaden Telegram connection error classification; force NEMOCLAW_FRESH=1; update terminology to "alias substitution".
Update startup, runtime, and payload tests `test/nemoclaw-start.test.ts`, `src/lib/agent/runtime.test.ts`, `test/sandbox-init.test.ts`	Assert absence of Discord-proxy, decode-proxy, facade identifiers, and bridge ports in recovery/startup scripts; remove _WS_FIX_SCRIPT/_SLACK_REWRITER_SCRIPT from environment blocks; reorder test suites for Slack secrets-on-disk tripwire; verify direct OpenShell egress without local proxy participation.
Add policy and schema validation tests `test/policies.test.ts`, `test/validate-blueprint.test.ts`, `test/validate-config-schemas.test.ts`	Add YAML-parsing tests for messaging WebSocket presets validating protocol/enforcement/websocket_credential_rewrite/allow-rules; add REST credential-rewrite tests; extend schema validation with positive/negative WebSocket and credential-rewrite endpoint cases.
Remove local component test coverage `test/hermes-discord-facade.test.ts`, `test/hermes-decode-proxy.test.ts`, `test/slack-token-rewriter-sync.test.ts`, `test/slack-token-rewriter.test.ts`	Delete test suites for Discord facade, decode-proxy, and Slack token rewriter covering interaction handling, placeholder rewriting, and preload sync.
Update Hermes Slack E2E test `test/e2e/test-hermes-slack-e2e.sh`	Narrow process filtering to hermes/socat; add policy assertion for `request_body_credential_rewrite: true`; add bridge residue verification step; update success message terminology to "alias substitution".
Update onboard and installer tests `test/onboard.test.ts`, `test/install-openshell-version-check.test.ts`, `test/runner.test.ts`, `test/gateway-liveness-probe.test.ts`	Add onboard tests asserting Slack credentials are never embedded in sandbox create command; add source-structure regression tests for lifecycle support and registry cleanup; update fake openshell handling; extend installer capability flag support; add cleanup failure registry state assertions.
Update other E2E scripts, CI workflows, and miscellaneous `test/e2e/test-hermes-e2e.sh`, `test/e2e/test-rebuild-hermes.sh`, `.github/workflows/nightly-e2e.yaml`, `.github/workflows/sandbox-images-and-e2e.yaml`, `test/sandbox-provisioning.test.ts`, `test/sandbox-build-context.test.ts`, `test/cli.test.ts`, `Dockerfile`, `package.json`, `src/lib/onboard/usage-notice.ts`, `test/generate-hermes-config.test.ts`, `test/generate-openclaw-config.test.ts`, `scripts/generate-openclaw-config.py`	Narrow process diagnostics filters; update CI workflow descriptions and remove NEMOCLAW_SANDBOX_NAME env vars; update sandbox image verification; update helper permission tests; update build expectations; isolate CLI test openshell; update Dockerfile chmod; add conditional TypeScript build; remove ws-proxy-fix references; add proxy assertions; apply proxy_url only to telegram.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

NVIDIA/NemoClaw#3293: This PR directly reverses PR #3293, removing the Hermes Discord facade, decode-proxy, and related environment variables and fixtures that #3293 had introduced.

Suggested labels

Integration: Hermes, v0.0.38

Poem

🐰 Discord's local bridge is gone away,
The rabbit says, "Let WebSockets play!"
No facades, no proxies in the night,
OpenShell rewrite makes all things right.
To 0.0.38 we ascend so high,
Native L7—no more TLS goodbye! 🚀

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix/native-messaging-websocket

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (3)

test/gateway-state.test.ts (1)

150-153: ⚡ Quick win

Add one ANSI-wrapped error fixture to lock in the new strip-and-reject path.

The new behavior depends on ANSI stripping before matching error text; a dedicated ANSI refusal fixture would prevent subtle parser regressions.

Proposed test addition

+const STATUS_SERVER_STATUS_REFUSED_ANSI = `
+Server Status
+
+Gateway: nemoclaw
+Server: https://127.0.0.1:8080/
+\x1b[31mError:\x1b[0m Connection refused (os error 61)
+`;
+
 describe("isGatewayConnected", () => {
@@
   it("does not treat Server Status with connection errors as connected", () => {
     expect(isGatewayConnected(STATUS_SERVER_STATUS_REFUSED)).toBe(false);
   });
+
+  it("does not treat ANSI-wrapped connection errors as connected", () => {
+    expect(isGatewayConnected(STATUS_SERVER_STATUS_REFUSED_ANSI)).toBe(false);
+  });

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@test/gateway-state.test.ts` around lines 150 - 153, Add a new ANSI-wrapped
refusal fixture and use it in the existing test that calls isGatewayConnected so
the ANSI-stripping path is exercised; create a constant (e.g.,
STATUS_SERVER_STATUS_REFUSED_ANSI) alongside STATUS_SERVER_STATUS_REFUSED
containing the same refusal message wrapped with ANSI escape sequences, then add
a test case in test/gateway-state.test.ts that calls
isGatewayConnected(STATUS_SERVER_STATUS_REFUSED_ANSI) and expects false to lock
in the strip-and-reject behavior.

test/policies.test.ts (1)

806-848: ⚡ Quick win

Add Hermes Slack websocket assertions next to the Discord gateway check.

This PR changes wss-primary.slack.com and wss-backup.slack.com in both Hermes policy files, but the new Hermes-specific structured test only exercises gateway.discord.gg. A Slack regression in either YAML would currently pass.

Suggested test shape

-  it("Hermes Discord gateway policy enables native WebSocket credential rewrite", () => {
+  it("Hermes messaging gateways use native inspected WebSocket policies", () => {
     const policyFiles = [
       path.join(REPO_ROOT, "agents/hermes/policy-additions.yaml"),
       path.join(REPO_ROOT, "agents/hermes/policy-permissive.yaml"),
     ];
+    const cases = [
+      {
+        host: "gateway.discord.gg",
+        expected: { websocket_credential_rewrite: true },
+      },
+      {
+        host: "wss-primary.slack.com",
+        expected: {},
+      },
+      {
+        host: "wss-backup.slack.com",
+        expected: {},
+      },
+    ];

     for (const file of policyFiles) {
       const content = fs.readFileSync(file, "utf8");
       const parsed = YAML.parse(content) as {
         network_policies?: Record<
@@
         );
-        const endpoint = endpoints.find((candidate) => candidate.host === "gateway.discord.gg");
-        expect(endpoint).toBeTruthy();
-        expect(endpoint).toMatchObject({
-          protocol: "websocket",
-          enforcement: "enforce",
-          websocket_credential_rewrite: true,
-        });
-        expect(endpoint).not.toHaveProperty("access");
-        expect(endpoint).not.toHaveProperty("tls");
-        expect(endpoint?.rules).toEqual(
-          expect.arrayContaining([
-            { allow: { method: "GET", path: "/**" } },
-            { allow: { method: "WEBSOCKET_TEXT", path: "/**" } },
-          ]),
-        );
+        for (const { host, expected } of cases) {
+          const endpoint = endpoints.find((candidate) => candidate.host === host);
+          expect(endpoint).toBeTruthy();
+          expect(endpoint).toMatchObject({
+            protocol: "websocket",
+            enforcement: "enforce",
+            ...expected,
+          });
+          expect(endpoint).not.toHaveProperty("access");
+          expect(endpoint).not.toHaveProperty("tls");
+          expect(endpoint?.rules).toEqual(
+            expect.arrayContaining([
+              { allow: { method: "GET", path: "/**" } },
+              { allow: { method: "WEBSOCKET_TEXT", path: "/**" } },
+            ]),
+          );
+        }
       }
     }
   });

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@test/policies.test.ts` around lines 806 - 848, The test currently only
asserts on the Discord gateway endpoint (candidate.host ===
"gateway.discord.gg"), so add analogous assertions for Slack WebSocket hosts
"wss-primary.slack.com" and "wss-backup.slack.com": after building endpoints
from policyFiles, locate endpoints with host === "wss-primary.slack.com" and
host === "wss-backup.slack.com" and assert they exist, have protocol
"websocket", enforcement "enforce", websocket_credential_rewrite true, no
"access" or "tls" properties, and rules include the GET and WEBSOCKET_TEXT allow
entries (same expectations used for the Discord endpoint).

test/e2e/test-messaging-providers.sh (1)

945-951: 💤 Low value

Minor off-by-one in line count calculation.

Line 947's tail -n "$((capture_after_negative - capture_before_negative + 1))" adds an extra line. If before=10 and after=12, this calculates 3 lines when you likely want 2 new lines. However, since the check is for presence of DEFINITELY_NOT_REGISTERED in the capture, the extra line is harmless (makes false negatives less likely, not more).

This is low impact and the test logic is sound.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@test/e2e/test-messaging-providers.sh` around lines 945 - 951, The tail line
uses tail -n "$((capture_after_negative - capture_before_negative + 1))" which
introduces an off-by-one; update the calculation to use tail -n
"$((capture_after_negative - capture_before_negative))" (or otherwise ensure the
computed count reflects only new lines) so the slice from
FAKE_DISCORD_GATEWAY_CAPTURE_FILE between capture_before_negative and
capture_after_negative is the correct length; modify the expression near the
condition that references fake_gateway_ready, dc_ws_negative,
capture_after_negative and capture_before_negative accordingly.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@scripts/install-openshell.sh`:
- Around line 36-40: The installer is pinned to a non-existent OpenShell tag via
MIN_VERSION and MAX_VERSION set to "0.0.38"; change those to an existing release
(e.g., "0.0.37") or make the version configurable, and ensure the release tag
exists before shipping; additionally, instead of relying only on numeric gating,
add a runtime capability check before the early-success exit path (the
download/validation block that currently short-circuits success) to verify the
specific feature/commit (PR `#1286` / commit
eab184f20bb27c1db8b62deb33717590b018a24a) is present—for example, after cloning
or extracting the OpenShell artifact inspect for the expected file/flag or check
the commit hash in the repo metadata and abort with a clear error if the
capability is missing.

In `@src/lib/onboard.ts`:
- Around line 4250-4259: The recoverGatewayRuntime() function currently calls
the CLI to start the gateway unconditionally; wrap its startup logic with the
same gatewayCliSupportsLifecycleCommands() guard used earlier (check
gatewayCliSupportsLifecycleCommands() at the start of recoverGatewayRuntime()),
and if lifecycle commands are unsupported, skip attempting "openshell gateway
start" and either return gracefully or throw the same descriptive error (or
respect an exitOnFailure-like behavior if recoverGatewayRuntime accepts such a
flag). Update recoverGatewayRuntime() to log the same guidance (mentioning
GATEWAY_NAME/GATEWAY_PORT or pointing to "openshell gateway add/select") when
skipping execution so callers on package-managed OpenShell do not invoke
unsupported CLI commands.
- Around line 3254-3264: The registry.clearAll() call should only run when the
gateway was actually destroyed via lifecycle commands; update the conditional so
registry.clearAll() executes only when hasLifecycleCommands is true and
destroyResult.status === 0 (i.e., after the run initiated by
gatewayCliSupportsLifecycleCommands() succeeded). Locate the branch using
hasLifecycleCommands, runOpenshell([... "gateway", "destroy" ...]) /
runOpenshell([... "gateway", "remove" ...]) and the registry.clearAll() call and
change the guard so removal-via-"gateway remove" does not clear the sandbox
registry.

---

Nitpick comments:
In `@test/e2e/test-messaging-providers.sh`:
- Around line 945-951: The tail line uses tail -n "$((capture_after_negative -
capture_before_negative + 1))" which introduces an off-by-one; update the
calculation to use tail -n "$((capture_after_negative -
capture_before_negative))" (or otherwise ensure the computed count reflects only
new lines) so the slice from FAKE_DISCORD_GATEWAY_CAPTURE_FILE between
capture_before_negative and capture_after_negative is the correct length; modify
the expression near the condition that references fake_gateway_ready,
dc_ws_negative, capture_after_negative and capture_before_negative accordingly.

In `@test/gateway-state.test.ts`:
- Around line 150-153: Add a new ANSI-wrapped refusal fixture and use it in the
existing test that calls isGatewayConnected so the ANSI-stripping path is
exercised; create a constant (e.g., STATUS_SERVER_STATUS_REFUSED_ANSI) alongside
STATUS_SERVER_STATUS_REFUSED containing the same refusal message wrapped with
ANSI escape sequences, then add a test case in test/gateway-state.test.ts that
calls isGatewayConnected(STATUS_SERVER_STATUS_REFUSED_ANSI) and expects false to
lock in the strip-and-reject behavior.

In `@test/policies.test.ts`:
- Around line 806-848: The test currently only asserts on the Discord gateway
endpoint (candidate.host === "gateway.discord.gg"), so add analogous assertions
for Slack WebSocket hosts "wss-primary.slack.com" and "wss-backup.slack.com":
after building endpoints from policyFiles, locate endpoints with host ===
"wss-primary.slack.com" and host === "wss-backup.slack.com" and assert they
exist, have protocol "websocket", enforcement "enforce",
websocket_credential_rewrite true, no "access" or "tls" properties, and rules
include the GET and WEBSOCKET_TEXT allow entries (same expectations used for the
Discord endpoint).

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 788c24f2-f82e-42d1-9f32-8853a8dc3103

📥 Commits

Reviewing files that changed from the base of the PR and between 118541a and 90a73b1.

⛔ Files ignored due to path filters (2)

nemoclaw/package-lock.json is excluded by !**/package-lock.json
package-lock.json is excluded by !**/package-lock.json

📒 Files selected for processing (44)

Dockerfile
agents/hermes/Dockerfile
agents/hermes/config/messaging-config.ts
agents/hermes/discord-facade.py
agents/hermes/discord-preload/sitecustomize.py
agents/hermes/policy-additions.yaml
agents/hermes/policy-permissive.yaml
agents/hermes/start.sh
nemoclaw-blueprint/blueprint.yaml
nemoclaw-blueprint/policies/openclaw-sandbox-permissive.yaml
nemoclaw-blueprint/policies/presets/discord.yaml
nemoclaw-blueprint/policies/presets/slack.yaml
nemoclaw-blueprint/scripts/ws-proxy-fix.js
nemoclaw-blueprint/scripts/ws-proxy-fix.ts
package.json
schemas/policy-preset.schema.json
schemas/sandbox-policy.schema.json
scripts/generate-openclaw-config.py
scripts/install-openshell.sh
scripts/nemoclaw-start.sh
src/lib/agent/runtime.test.ts
src/lib/agent/runtime.ts
src/lib/onboard.ts
src/lib/onboard/usage-notice.ts
src/lib/state/gateway.ts
test/e2e/lib/discord-gateway-proof.sh
test/e2e/lib/fake-discord-gateway.cjs
test/e2e/lib/fake-slack-api.cjs
test/e2e/lib/slack-api-proof.sh
test/e2e/test-hermes-discord-e2e.sh
test/e2e/test-messaging-providers.sh
test/gateway-state.test.ts
test/generate-hermes-config.test.ts
test/generate-openclaw-config.test.ts
test/hermes-discord-facade.test.ts
test/nemoclaw-start.test.ts
test/policies.test.ts
test/sandbox-build-context.test.ts
test/sandbox-init.test.ts
test/sandbox-provisioning.test.ts
test/seccomp-guard.test.ts
test/service-env.test.ts
test/validate-blueprint.test.ts
test/validate-config-schemas.test.ts

💤 Files with no reviewable changes (9)

test/hermes-discord-facade.test.ts
agents/hermes/config/messaging-config.ts
nemoclaw-blueprint/scripts/ws-proxy-fix.ts
test/nemoclaw-start.test.ts
agents/hermes/discord-preload/sitecustomize.py
agents/hermes/discord-facade.py
nemoclaw-blueprint/scripts/ws-proxy-fix.js
test/seccomp-guard.test.ts
Dockerfile

coderabbitai

Actionable comments posted: 5

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

test/e2e/test-messaging-providers.sh (1)

243-280: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Keep the pre-merged Slack bootstrap policy in sync with the rewrite-aware preset.

This hardcoded block still omits the new rewrite flags. During install, Hermes/OpenClaw boots against this pre-merged policy before the real Slack preset is applied, so the bootstrap path is now behaviorally different from presets/slack.yaml: REST calls won't get request_body_credential_rewrite, and Socket Mode hosts won't get websocket_credential_rewrite. That can regress the exact placeholder/alias flow this PR is trying to validate.

Suggested diff

       - host: slack.com
         port: 443
         protocol: rest
         enforcement: enforce
+        request_body_credential_rewrite: true
         rules:
           - allow: { method: GET, path: "/**" }
           - allow: { method: POST, path: "/**" }
       - host: api.slack.com
         port: 443
         protocol: rest
         enforcement: enforce
+        request_body_credential_rewrite: true
         rules:
           - allow: { method: GET, path: "/**" }
           - allow: { method: POST, path: "/**" }
       - host: hooks.slack.com
         port: 443
         protocol: rest
         enforcement: enforce
+        request_body_credential_rewrite: true
         rules:
           - allow: { method: GET, path: "/**" }
           - allow: { method: POST, path: "/**" }
       - host: wss-primary.slack.com
         port: 443
         protocol: websocket
         enforcement: enforce
+        websocket_credential_rewrite: true
         rules:
           - allow: { method: GET, path: "/**" }
           - allow: { method: WEBSOCKET_TEXT, path: "/**" }
       - host: wss-backup.slack.com
         port: 443
         protocol: websocket
         enforcement: enforce
+        websocket_credential_rewrite: true
         rules:
           - allow: { method: GET, path: "/**" }
           - allow: { method: WEBSOCKET_TEXT, path: "/**" }

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@test/e2e/test-messaging-providers.sh` around lines 243 - 280, The pre-merged
Slack bootstrap policy block (the slack endpoints with hosts like slack.com,
api.slack.com, hooks.slack.com, wss-primary.slack.com, wss-backup.slack.com) is
missing the new rewrite flags so bootstrap behavior differs from
presets/slack.yaml; update each REST endpoint entry to include
request_body_credential_rewrite (or the exact key used in presets) and add
websocket_credential_rewrite to websocket entries (wss-primary.slack.com,
wss-backup.slack.com) so the temporary pre-merged policy matches the
rewrite-aware preset applied later.

agents/hermes/policy-permissive.yaml (1)

4-6: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Update the “no L7 filtering” header text to match current websocket rules.

Line 5 says this policy has no L7 filtering, but the websocket endpoints now explicitly filter by method/path (GET + WEBSOCKET_TEXT). Please align the header comment to avoid operator confusion.

Suggested doc-only patch

-# All known Hermes-relevant endpoints opened with access: full (no L7 filtering).
+# All known Hermes-relevant endpoints are broadly opened.
+# Note: websocket endpoints still use explicit method/path allow rules.

Also applies to: 191-196, 233-246

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@agents/hermes/policy-permissive.yaml` around lines 4 - 6, Update the header
comment that currently claims "no L7 filtering" to accurately state that
websocket endpoints are explicitly filtered by method/path (specifically GET +
WEBSOCKET_TEXT) in this permissive policy; edit the top-of-file comment in
policy-permissive.yaml and the analogous header comments covering the other
occurrences (around the blocks noted at lines 191-196 and 233-246) so they no
longer say "no L7 filtering" but instead mention that websocket routes are
constrained by method/path (GET + WEBSOCKET_TEXT) while other endpoints remain
broadly open.

🧹 Nitpick comments (1)

test/e2e/test-hermes-slack-e2e.sh (1)
454-530: 🏗️ Heavy lift

Switch this Phase 6 proof to the fake Slack API helper instead of live slack.com.

The probe still POSTs to real Slack, so this test remains internet-dependent and never observes the request-body rewrite at a capture boundary. That means the new request_body_credential_rewrite path can regress while this phase still passes or flakes based on external Slack behavior. Reusing the fake Slack API flow from test/e2e/test-messaging-providers.sh would make this proof hermetic and actually validate the new rewrite contract.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@test/e2e/test-hermes-slack-e2e.sh` around lines 454 - 530, Replace the live
slack.com calls in the Python probe (the call(...) function and slack_probe
usage) with the project’s fake Slack API helper used by the other
messaging-provider tests: change the Request target from
"https://slack.com/api/{path}" to the fake Slack helper endpoint (the same
helper invoked by the messaging-provider tests), ensure the probe uses the same
header/token shaping the helper expects (keep the existing token-prefix logic)
and reuse the same invocation pattern (sandbox_exec_stdin -> Python) so the test
becomes hermetic and will exercise the request_body_credential_rewrite path
rather than hitting the real Slack API.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/lib/onboard.ts`:
- Around line 3254-3272: The destroy flow must not clear local state or mark the
gateway as missing when the underlying destroy command failed; update the logic
in destroyGateway()/the block using hasLifecycleCommands, destroyResult,
runOpenshell, registry.clearAll(), and dockerRemoveVolumesByPrefix so that
registry.clearAll() and any code that sets gatewayReuseState = "missing" only
run when destroyResult.status === 0, and ensure the function propagates the
failure (throw or return a non-zero/failure result) when
runOpenshell(["gateway","destroy",...]) fails so callers cannot continue
assuming the gateway was removed; apply the same guard/failure propagation to
the other similar blocks referenced (the other occurrences using
runOpenshell/destroyResult at the listed spots).
- Around line 1424-1438: The current probe in
gatewayCliSupportsLifecycleCommands() treats an empty or failed
`runCaptureOpenshell(["gateway","--help"])` (normalized.trim() === "") as
indicating lifecycle support; change the boolean assignment so a missing/empty
help output does NOT imply support. Specifically, in
gatewayCliSupportsLifecycleCommands(), use a guard that requires non-empty help
before checking for /\bstart\b/ and /\bdestroy\b/ (e.g.,
Boolean(normalized.trim()) && /\bstart\b/.test(normalized) &&
/\bdestroy\b/.test(normalized)) instead of the current !normalized.trim() ||
(...) logic; keep the runCaptureOpenshell/ANSI_RE usage but ensure
gatewayLifecycleCommandsSupported and its evaluation only become true when the
help output proves both commands exist.
- Around line 4252-4256: The error text that tells users to run "openshell
gateway add http://127.0.0.1:${GATEWAY_PORT} ..." is out of sync with the
runtime which uses getGatewayLocalEndpoint(); update the two places that print
those recovery hints (the block guarded by gatewayCliSupportsLifecycleCommands()
and the later similar block) to call getGatewayLocalEndpoint() instead of
hardcoding "http://127.0.0.1:${GATEWAY_PORT}", leaving the rest of the message
(flags and ${GATEWAY_NAME}) unchanged so the printed command matches the actual
local endpoint used by the implementation.

In `@test/e2e/lib/fake-slack-api.cjs`:
- Around line 43-56: The test helper is currently persisting raw secrets
(Authorization header and request body) into the capture log via record;
instead, change the record payload so it does not include raw Authorization or
full body—replace them with derived fields only: keep tokenMatchesExpected and
tokenLooksPlaceholder (booleans), add a hashed or redacted tokenSuffix (e.g.,
last 4 chars) and a bodyRedacted boolean or hash, and remove direct
authorization/body values before writing to captureFile; update any test
consumers to assert against tokenMatchesExpected, tokenLooksPlaceholder,
tokenSuffix or bodyRedacted/hash rather than the raw Authorization/body.

In `@test/onboard.test.ts`:
- Around line 5054-5061: Add symmetric placeholder-leak checks for Discord and
Telegram to the existing assertions on createCommand.command: ensure you call
assert.doesNotMatch(createCommand.command, /DISCORD_BOT_TOKEN=/) and
assert.doesNotMatch(createCommand.command, /TELEGRAM_BOT_TOKEN=/) alongside the
existing Slack checks so the sandbox create command never contains those
placeholder environment variable keys; update the test in onboard.test.ts near
the other doesNotMatch assertions referencing createCommand.command.

---

Outside diff comments:
In `@agents/hermes/policy-permissive.yaml`:
- Around line 4-6: Update the header comment that currently claims "no L7
filtering" to accurately state that websocket endpoints are explicitly filtered
by method/path (specifically GET + WEBSOCKET_TEXT) in this permissive policy;
edit the top-of-file comment in policy-permissive.yaml and the analogous header
comments covering the other occurrences (around the blocks noted at lines
191-196 and 233-246) so they no longer say "no L7 filtering" but instead mention
that websocket routes are constrained by method/path (GET + WEBSOCKET_TEXT)
while other endpoints remain broadly open.

In `@test/e2e/test-messaging-providers.sh`:
- Around line 243-280: The pre-merged Slack bootstrap policy block (the slack
endpoints with hosts like slack.com, api.slack.com, hooks.slack.com,
wss-primary.slack.com, wss-backup.slack.com) is missing the new rewrite flags so
bootstrap behavior differs from presets/slack.yaml; update each REST endpoint
entry to include request_body_credential_rewrite (or the exact key used in
presets) and add websocket_credential_rewrite to websocket entries
(wss-primary.slack.com, wss-backup.slack.com) so the temporary pre-merged policy
matches the rewrite-aware preset applied later.

---

Nitpick comments:
In `@test/e2e/test-hermes-slack-e2e.sh`:
- Around line 454-530: Replace the live slack.com calls in the Python probe (the
call(...) function and slack_probe usage) with the project’s fake Slack API
helper used by the other messaging-provider tests: change the Request target
from "https://slack.com/api/{path}" to the fake Slack helper endpoint (the same
helper invoked by the messaging-provider tests), ensure the probe uses the same
header/token shaping the helper expects (keep the existing token-prefix logic)
and reuse the same invocation pattern (sandbox_exec_stdin -> Python) so the test
becomes hermetic and will exercise the request_body_credential_rewrite path
rather than hitting the real Slack API.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 3374bf15-e017-4b69-a961-1d8c2a9828c2

📥 Commits

Reviewing files that changed from the base of the PR and between 90a73b1 and 84c189f.

📒 Files selected for processing (44)

.github/workflows/nightly-e2e.yaml
.github/workflows/sandbox-images-and-e2e.yaml
agents/hermes/Dockerfile
agents/hermes/config/messaging-config.ts
agents/hermes/decode-proxy.py
agents/hermes/discord-preload/sitecustomize.py
agents/hermes/policy-additions.yaml
agents/hermes/policy-permissive.yaml
agents/hermes/start.sh
agents/openclaw/policy-permissive.yaml
nemoclaw-blueprint/policies/openclaw-sandbox-permissive.yaml
nemoclaw-blueprint/policies/presets/slack.yaml
nemoclaw-blueprint/scripts/slack-token-rewriter.js
schemas/policy-preset.schema.json
schemas/sandbox-policy.schema.json
scripts/generate-openclaw-config.py
scripts/install-openshell.sh
scripts/nemoclaw-start.sh
src/lib/agent/runtime.test.ts
src/lib/agent/runtime.ts
src/lib/onboard.ts
test/cli.test.ts
test/e2e/lib/fake-slack-api.cjs
test/e2e/lib/slack-api-proof.sh
test/e2e/test-hermes-discord-e2e.sh
test/e2e/test-hermes-e2e.sh
test/e2e/test-hermes-slack-e2e.sh
test/e2e/test-messaging-providers.sh
test/e2e/test-rebuild-hermes.sh
test/gateway-state.test.ts
test/generate-hermes-config.test.ts
test/generate-openclaw-config.test.ts
test/hermes-decode-proxy.test.ts
test/install-openshell-version-check.test.ts
test/nemoclaw-start.test.ts
test/onboard.test.ts
test/policies.test.ts
test/runner.test.ts
test/sandbox-init.test.ts
test/sandbox-provisioning.test.ts
test/slack-token-rewriter-sync.test.ts
test/slack-token-rewriter.test.ts
test/validate-blueprint.test.ts
test/validate-config-schemas.test.ts

💤 Files with no reviewable changes (7)

nemoclaw-blueprint/scripts/slack-token-rewriter.js
test/slack-token-rewriter.test.ts
test/hermes-decode-proxy.test.ts
test/slack-token-rewriter-sync.test.ts
agents/hermes/config/messaging-config.ts
agents/hermes/discord-preload/sitecustomize.py
agents/hermes/decode-proxy.py

✅ Files skipped from review due to trivial changes (3)

.github/workflows/nightly-e2e.yaml
test/e2e/test-hermes-e2e.sh
test/generate-openclaw-config.test.ts

🚧 Files skipped from review as they are similar to previous changes (10)

test/gateway-state.test.ts
test/generate-hermes-config.test.ts
schemas/policy-preset.schema.json
nemoclaw-blueprint/policies/openclaw-sandbox-permissive.yaml
schemas/sandbox-policy.schema.json
scripts/generate-openclaw-config.py
test/e2e/lib/slack-api-proof.sh
nemoclaw-blueprint/policies/presets/slack.yaml
test/policies.test.ts
test/nemoclaw-start.test.ts

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/lib/onboard.ts`:
- Around line 4007-4017: When verifying gateway health (in the block using
verifyGatewayContainerRunning() and in the image-drift branch using
getGatewayClusterImageDrift()), if the check indicates the gateway is stale or
drifted and destroyGateway() fails, downgrade gatewayReuseState from "healthy"
to a non-reuse value (e.g., "missing") so later canReuseHealthyGateway logic
cannot reuse it; update the branches around verifyGatewayContainerRunning() and
getGatewayClusterImageDrift() to set gatewayReuseState = "missing" whenever
cleanup/destroyGateway() returns false (and keep the existing logs/warnings).

In `@test/e2e/lib/fake-slack-api.cjs`:
- Around line 64-73: The auth response currently uses only tokenMatchesExpected
to decide success, so a correct header but wrong request-body token can be
accepted; update the auth decision to require both the header match
(tokenMatchesExpected) and that the parsed body token (e.g., the variable that
holds form field "token") also equals the expected token before returning 200.
Locate the tokenMatchesExpected check in fake-slack-api.cjs and combine it with
the body token validation (logical AND) when setting the status and error string
so incorrect form tokens produce the 401/"invalid_auth" outcome.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 21ea2b06-a0f5-4816-912d-c80014218a01

📥 Commits

Reviewing files that changed from the base of the PR and between 84c189f and b77051d.

📒 Files selected for processing (4)

src/lib/onboard.ts
test/e2e/lib/fake-slack-api.cjs
test/e2e/test-messaging-providers.sh
test/onboard.test.ts

🚧 Files skipped from review as they are similar to previous changes (1)

test/e2e/test-messaging-providers.sh

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@test/e2e/lib/fake-slack-api.cjs`:
- Line 11: Validate the FAKE_SLACK_API_PORT value before using it to start the
server: replace the blind coercion in the port variable with an explicit parse
and check (e.g., parseInt/Number and Number.isInteger) and ensure it falls in
the allowed range (0 or 1–65535); if validation fails, log a clear error and
exit (or throw) before calling server.listen(); update the code paths around the
port constant and the server.listen(...) call so invalid env values fail fast
with a descriptive message.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 1e61c8f3-0db3-4fd0-8cfb-17991481fa25

📥 Commits

Reviewing files that changed from the base of the PR and between b77051d and b09e6a9.

📒 Files selected for processing (3)

src/lib/onboard.ts
test/e2e/lib/fake-slack-api.cjs
test/gateway-liveness-probe.test.ts

🚧 Files skipped from review as they are similar to previous changes (1)

src/lib/onboard.ts

…websocket # Conflicts: # .github/workflows/nightly-e2e.yaml # nemoclaw-blueprint/blueprint.yaml # nemoclaw-blueprint/scripts/slack-token-rewriter.js # scripts/install-openshell.sh # scripts/nemoclaw-start.sh # src/lib/onboard.ts # test/e2e/test-hermes-discord-e2e.sh # test/e2e/test-messaging-providers.sh # test/install-openshell-version-check.test.ts # test/nemoclaw-start.test.ts # test/sandbox-init.test.ts # test/slack-token-rewriter.test.ts

…websocket # Conflicts: # agents/hermes/start.sh # test/sandbox-init.test.ts

…websocket

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (1)

src/lib/onboard.ts (1)
3865-3879: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Keep the registry intact on registration-only gateway remove paths.

When gatewayCliSupportsLifecycleCommands() is false, this falls back to openshell gateway remove, which only unregisters a package-managed gateway. Clearing registry in that case forgets sandboxes that still exist behind the host-managed gateway, so reconnect/resume breaks after the gateway is re-added. Please gate registry.clearAll() behind the destructive paths only (dockerDriver or hasLifecycleCommands).
Suggested fix
-  if (gatewayRemoved) {
+  if (gatewayRemoved && (dockerDriver || hasLifecycleCommands)) {
     registry.clearAll();
   }
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/lib/onboard.ts` around lines 3865 - 3879, The code currently clears
registry regardless of whether the gateway removal was destructive; change the
logic so registry.clearAll() runs only for destructive removals (when
dockerDriver is true or gatewayCliSupportsLifecycleCommands() is true) by gating
the registry.clearAll() call behind those conditions (use the existing
dockerDriver and hasLifecycleCommands variables or the results of
removeDockerDriverGatewayRegistration()/runOpenshell(...) that indicate
destructive removal); update the block around gatewayRemoved and
registry.clearAll to only call registry.clearAll() for the dockerDriver path or
when hasLifecycleCommands is true (leave the non-destructive
runOpenshell(["gateway","remove",...]) path from forgetting local sandboxes).

🧹 Nitpick comments (2)

test/nemoclaw-start.test.ts (1)

1403-1443: ⚡ Quick win

Add a revision-scoped placeholder case to this tripwire test.

This suite only locks in the legacy uppercase placeholder and the canonical openshell:resolve:env:SLACK_BOT_TOKEN form. The new runtime flow also uses revision-scoped placeholders like openshell:resolve:env:v11_SLACK_BOT_TOKEN, so a regression on that shape would currently slip through.

🧪 Suggested assertion

       expect(run('{"botToken":"xoxb-OPENSHELL-RESOLVE-ENV-SLACK_BOT_TOKEN"}\n').status).toBe(0);
       expect(run('{"token":"openshell:resolve:env:SLACK_BOT_TOKEN"}\n').status).toBe(0);
+      expect(run('{"token":"openshell:resolve:env:v11_SLACK_BOT_TOKEN"}\n').status).toBe(0);

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@test/nemoclaw-start.test.ts` around lines 1403 - 1443, Add a new assertion to
the "Slack secrets-on-disk tripwire" test that verifies revision-scoped
placeholders pass the tripwire: when calling run(...) (the helper that writes
openclaw.json and executes verify_no_slack_secrets_on_disk), assert that a
config containing a revision-scoped placeholder like
"openshell:resolve:env:v11_SLACK_BOT_TOKEN" (or similar
"v<revision>_SLACK_BOT_TOKEN") returns status 0; update the test in
test/nemoclaw-start.test.ts near the existing run(...) expectations so
verify_no_slack_secrets_on_disk accepts the new placeholder shape in addition to
the uppercase legacy and canonical forms.

scripts/nemoclaw-start.sh (1)

1800-1800: ⚡ Quick win

Factor the tmp trust-boundary allowlist into one helper.

This validate_tmp_permissions argument list is duplicated in both startup paths. For a security-sensitive allowlist like this, keeping two copies makes it easy to drift the root and non-root validation sets the next time a preload is added or removed.

♻️ Suggested shape

+_TMP_TRUST_BOUNDARY_FILES=(
+  "$_SANDBOX_SAFETY_NET"
+  "$_PROXY_FIX_SCRIPT"
+  "$_NEMOTRON_FIX_SCRIPT"
+  "$_WS_FIX_SCRIPT"
+  "$_SECCOMP_GUARD_SCRIPT"
+  "$_CIAO_GUARD_SCRIPT"
+  "$_TELEGRAM_DIAGNOSTICS_SCRIPT"
+  "$_SLACK_GUARD_SCRIPT"
+)
+
+validate_runtime_preloads() {
+  validate_tmp_permissions "${_TMP_TRUST_BOUNDARY_FILES[@]}"
+}
...
-  validate_tmp_permissions "$_SANDBOX_SAFETY_NET" "$_PROXY_FIX_SCRIPT" "$_NEMOTRON_FIX_SCRIPT" "$_WS_FIX_SCRIPT" "$_SECCOMP_GUARD_SCRIPT" "$_CIAO_GUARD_SCRIPT" "$_TELEGRAM_DIAGNOSTICS_SCRIPT" "$_SLACK_GUARD_SCRIPT"
+  validate_runtime_preloads
...
-validate_tmp_permissions "$_SANDBOX_SAFETY_NET" "$_PROXY_FIX_SCRIPT" "$_NEMOTRON_FIX_SCRIPT" "$_WS_FIX_SCRIPT" "$_SECCOMP_GUARD_SCRIPT" "$_CIAO_GUARD_SCRIPT" "$_TELEGRAM_DIAGNOSTICS_SCRIPT" "$_SLACK_GUARD_SCRIPT"
+validate_runtime_preloads

As per coding guidelines, scripts/nemoclaw-start.sh changes affect every sandbox boot and are invisible to unit tests.

Also applies to: 1965-1965

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@scripts/nemoclaw-start.sh` at line 1800, The duplicated tmp trust-boundary
argument list passed to validate_tmp_permissions should be factored into a
single helper so both startup paths use the same allowlist; create a canonical
helper (for example a shell array or function named TMP_TRUST_ALLOWLIST or
get_tmp_allowlist) that contains the entries "_SANDBOX_SAFETY_NET",
"_PROXY_FIX_SCRIPT", "_NEMOTRON_FIX_SCRIPT", "_WS_FIX_SCRIPT",
"_SECCOMP_GUARD_SCRIPT", "_CIAO_GUARD_SCRIPT", "_TELEGRAM_DIAGNOSTICS_SCRIPT",
and "_SLACK_GUARD_SCRIPT", then replace both direct calls to
validate_tmp_permissions(...) with a call that expands or passes that helper
(e.g., validate_tmp_permissions "${TMP_TRUST_ALLOWLIST[@]}" or
validate_tmp_permissions $(get_tmp_allowlist)), ensuring you preserve the exact
identifiers and ordering and update both startup paths that previously
duplicated the list.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@test/e2e/lib/fake-slack-api.cjs`:
- Around line 39-42: The request body is currently buffered unbounded via the
chunks array in the req.on("data", ...) handler; add a max size guard (e.g.,
MAX_BODY_BYTES) and in the req.on("data", chunk) handler track accumulated
bytes, and if the limit is exceeded, stop further processing by destroying or
pausing the socket and respond with an appropriate error (413 Payload Too Large)
instead of continuing to Buffer.concat; update the code around the
chunks/req.on("data") and req.on("end") logic to enforce this cap and clean up
resources when triggered.

---

Duplicate comments:
In `@src/lib/onboard.ts`:
- Around line 3865-3879: The code currently clears registry regardless of
whether the gateway removal was destructive; change the logic so
registry.clearAll() runs only for destructive removals (when dockerDriver is
true or gatewayCliSupportsLifecycleCommands() is true) by gating the
registry.clearAll() call behind those conditions (use the existing dockerDriver
and hasLifecycleCommands variables or the results of
removeDockerDriverGatewayRegistration()/runOpenshell(...) that indicate
destructive removal); update the block around gatewayRemoved and
registry.clearAll to only call registry.clearAll() for the dockerDriver path or
when hasLifecycleCommands is true (leave the non-destructive
runOpenshell(["gateway","remove",...]) path from forgetting local sandboxes).

---

Nitpick comments:
In `@scripts/nemoclaw-start.sh`:
- Line 1800: The duplicated tmp trust-boundary argument list passed to
validate_tmp_permissions should be factored into a single helper so both startup
paths use the same allowlist; create a canonical helper (for example a shell
array or function named TMP_TRUST_ALLOWLIST or get_tmp_allowlist) that contains
the entries "_SANDBOX_SAFETY_NET", "_PROXY_FIX_SCRIPT", "_NEMOTRON_FIX_SCRIPT",
"_WS_FIX_SCRIPT", "_SECCOMP_GUARD_SCRIPT", "_CIAO_GUARD_SCRIPT",
"_TELEGRAM_DIAGNOSTICS_SCRIPT", and "_SLACK_GUARD_SCRIPT", then replace both
direct calls to validate_tmp_permissions(...) with a call that expands or passes
that helper (e.g., validate_tmp_permissions "${TMP_TRUST_ALLOWLIST[@]}" or
validate_tmp_permissions $(get_tmp_allowlist)), ensuring you preserve the exact
identifiers and ordering and update both startup paths that previously
duplicated the list.

In `@test/nemoclaw-start.test.ts`:
- Around line 1403-1443: Add a new assertion to the "Slack secrets-on-disk
tripwire" test that verifies revision-scoped placeholders pass the tripwire:
when calling run(...) (the helper that writes openclaw.json and executes
verify_no_slack_secrets_on_disk), assert that a config containing a
revision-scoped placeholder like "openshell:resolve:env:v11_SLACK_BOT_TOKEN" (or
similar "v<revision>_SLACK_BOT_TOKEN") returns status 0; update the test in
test/nemoclaw-start.test.ts near the existing run(...) expectations so
verify_no_slack_secrets_on_disk accepts the new placeholder shape in addition to
the uppercase legacy and canonical forms.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 1ffdca16-4307-49a1-be5b-2f6cdad92b16

📥 Commits

Reviewing files that changed from the base of the PR and between b09e6a9 and c05172c.

📒 Files selected for processing (22)

.github/workflows/nightly-e2e.yaml
Dockerfile
agents/hermes/config/messaging-config.ts
agents/hermes/policy-additions.yaml
agents/hermes/start.sh
package.json
scripts/install-openshell.sh
scripts/nemoclaw-start.sh
src/lib/agent/runtime.test.ts
src/lib/onboard.ts
src/lib/state/gateway.ts
test/cli.test.ts
test/e2e/lib/fake-slack-api.cjs
test/e2e/test-hermes-slack-e2e.sh
test/e2e/test-messaging-providers.sh
test/generate-hermes-config.test.ts
test/install-openshell-version-check.test.ts
test/nemoclaw-start.test.ts
test/onboard.test.ts
test/runner.test.ts
test/sandbox-init.test.ts
test/sandbox-provisioning.test.ts

💤 Files with no reviewable changes (2)

agents/hermes/config/messaging-config.ts
Dockerfile

✅ Files skipped from review due to trivial changes (1)

test/generate-hermes-config.test.ts

🚧 Files skipped from review as they are similar to previous changes (11)

test/runner.test.ts
test/sandbox-provisioning.test.ts
scripts/install-openshell.sh
src/lib/state/gateway.ts
agents/hermes/policy-additions.yaml
test/onboard.test.ts
test/e2e/test-messaging-providers.sh
test/e2e/test-hermes-slack-e2e.sh
agents/hermes/start.sh
src/lib/agent/runtime.test.ts
test/cli.test.ts

…websocket # Conflicts: # test/gateway-liveness-probe.test.ts

github-actions · 2026-05-12T01:21:45Z

E2E Advisor Recommendation

Required E2E: hermes-discord-e2e, hermes-slack-e2e, messaging-providers-e2e, hermes-e2e, rebuild-hermes-e2e, token-rotation-e2e, credential-migration-e2e, launchable-smoke-e2e, openshell-gateway-upgrade-e2e, gateway-health-honest-e2e
Optional E2E: cloud-e2e, cloud-onboard-e2e, sandbox-survival-e2e, messaging-compatible-endpoint-e2e, double-onboard-e2e, sandbox-operations-e2e

Dispatch hint: hermes-discord-e2e,hermes-slack-e2e,messaging-providers-e2e,hermes-e2e,rebuild-hermes-e2e,token-rotation-e2e,credential-migration-e2e,launchable-smoke-e2e,openshell-gateway-upgrade-e2e,gateway-health-honest-e2e

Workflow run

Full advisor summary

Pi Semantic E2E Advisor

Base: origin/main
Head: HEAD
Confidence: high

Required E2E

hermes-discord-e2e: Primary regression target. Validates Hermes Discord onboarding and the native OpenShell WebSocket Gateway credential-rewrite path that replaces the deleted discord-facade.py. Test comments explicitly cite nemoclaw onboard --agent hermes generates invalid discord configuration #3032 and the hermetic fake Discord Gateway added in test/e2e/lib/fake-discord-gateway.cjs.
hermes-slack-e2e: Primary regression target. Slack preset adopts websocket_credential_rewrite (Socket Mode) and request_body_credential_rewrite (REST). Test has been updated to exercise the OpenShell credential rewrite path using test/e2e/lib/fake-slack-api.cjs and slack-api-proof.sh.
messaging-providers-e2e: Full credential/provider/L7 proxy chain for Telegram+Discord+Slack, including slack-token-rewriter Bolt-shape → canonical placeholder translation. Nightly comment was updated specifically for this PR (Slack Socket Mode crashes on boot with invalid_auth — appToken missing from baked openclaw.json; placeholder resolution also broken #2085 follow-up). Script itself is modified here.
hermes-e2e: agents/hermes/Dockerfile and start.sh change materially (removed decode-proxy/discord-facade artifacts and env wiring). Baseline Hermes onboarding + health + live inference must still work with the new image contract.
rebuild-hermes-e2e: Hermes image layout changed (files removed from /usr/local/bin). Rebuild upgrade path is the exact scenario that flushes stale images; test script itself is modified in this PR.
token-rotation-e2e: Messaging credential propagation across rotation for Telegram+Discord+Slack. With the move to native WebSocket credential rewrite and updated slack-token-rewriter, rotation semantics are the highest-risk regression surface after first-onboard.
credential-migration-e2e: test/e2e/test-credential-migration.sh is directly modified. Validates host credentials.json migration into the OpenShell gateway, zero-fill, allowlist filter, and symlink-safe deletion — a security boundary.
launchable-smoke-e2e: scripts/brev-launchable-ci-cpu.sh and test/e2e/test-launchable-smoke.sh are both modified; this is the community install path smoke. Installer/onboarding changes require this guard.
openshell-gateway-upgrade-e2e: scripts/install-openshell.sh is modified and the PR pins/builds a specific OpenShell main commit for nightly jobs. The stale-gateway-upgrade path is the regression surface for OpenShell version transitions.
gateway-health-honest-e2e: src/lib/onboard/gateway-http-readiness.ts and src/lib/state/gateway.ts changed. This is the regression guard for false-positive 'gateway healthy' logs when the gateway fails to serve — exactly the surface being touched.

Optional E2E

cloud-e2e: Root Dockerfile and generate-openclaw-config.py changed; worth validating full cloud onboard+inference with NVIDIA Endpoint against the rebuilt OpenClaw image.
cloud-onboard-e2e: Onboard control flow (src/lib/onboard.ts, usage-notice.ts, gateway-http-readiness.ts) changed — public installer + Landlock + inference.local probe covers baseline onboarding.
sandbox-survival-e2e: gateway state and readiness modules changed; gateway restart recovery is the key downstream behavior.
messaging-compatible-endpoint-e2e: Validates Telegram + OpenAI-compatible endpoint routing through inference.local. Provides extra signal for the messaging credential routing changes without touching Discord/Slack.
double-onboard-e2e: src/lib/onboard.ts changed; re-onboard lifecycle is a known regression trap when gateway state logic moves.
sandbox-operations-e2e: Gateway state/readiness changes intersect with sandbox lifecycle, cross-sandbox isolation, and docker-kill recovery paths.

New E2E recommendations

messaging-websocket-credential-rewrite (high): The PR introduces test/e2e/lib/fake-discord-gateway.cjs and fake-slack-api.cjs plus proof scripts, invoked from the hermes-discord and hermes-slack nightly jobs. Currently there is no dedicated failure-mode suite asserting that WebSocket IDENTIFY frames carrying the placeholder token (and only the placeholder) are rewritten at the L7 boundary, that frames carrying the real token from inside the sandbox are rejected, and that a bogus upgrade path is denied. A focused suite would prevent a silent regression where websocket_credential_rewrite stops rewriting but the test only asserts 'connected'.
- Suggested test: test/e2e/test-messaging-websocket-rewrite.sh — hermetic negative/positive tests on gateway.discord.gg and wss-primary.slack.com: (a) placeholder in IDENTIFY → real-token seen by fake gateway, (b) real-token leaked from sandbox → L7 denies/strips and audit log records violation, (c) upgrade to disallowed path denied, (d) WEBSOCKET_TEXT policy enforced for client frames.
policy-schema-and-presets (medium): schemas/policy-preset.schema.json and schemas/sandbox-policy.schema.json gained new fields (websocket_credential_rewrite, request_body_credential_rewrite) used by discord.yaml and slack.yaml presets. Unit tests cover schema validity; there is no E2E that asserts an unknown/misspelled credential_rewrite key is rejected by OpenShell policy load at sandbox startup, which would catch future drift between NemoClaw schema and OpenShell enforcement.
- Suggested test: test/e2e/test-policy-schema-credential-rewrite.sh — onboard a sandbox with a crafted preset mixing valid and misspelled rewrite flags and assert sandbox creation fails with a clear validation error before any traffic flows.
installer-and-launchable (low): scripts/build-openshell-main-for-nightly.sh is new and gates every nightly job to a hard-coded OpenShell main commit when the branch is fix/native-messaging-websocket. There is no E2E or dry-run guard that the script itself succeeds against current GHCR/OpenShell main state outside of nightly cron, so a silent break (token scope, commit gone) would only surface in scheduled runs.
- Suggested test: Add a lightweight PR-time 'build-openshell-main-dry-run' job that runs scripts/build-openshell-main-for-nightly.sh in a cache-only mode against ubuntu-latest so regressions in the pinned commit / GHCR auth surface on PR rather than next nightly.

Dispatch hint

Workflow: nightly-e2e.yaml
jobs input: hermes-discord-e2e,hermes-slack-e2e,messaging-providers-e2e,hermes-e2e,rebuild-hermes-e2e,token-rotation-e2e,credential-migration-e2e,launchable-smoke-e2e,openshell-gateway-upgrade-e2e,gateway-health-honest-e2e

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/lib/onboard.ts`:
- Around line 3661-3665: The volume-prune call is executed unconditionally and
can corrupt a live gateway when destroyGateway() returns false; change the logic
so dockerRemoveVolumesByPrefix(`openshell-cluster-${GATEWAY_NAME}`, {
ignoreError: true }) is only executed when the gateway removal actually
succeeded (i.e., destroyGateway() returned true or no-failure result).
Concretely, update the block that currently checks dockerDriver ||
hasLifecycleCommands to also verify the successful removal result from
destroyGateway (or an explicit success flag returned by the caller) before
calling dockerRemoveVolumesByPrefix; reference the destroyGateway return value
and the existing symbols dockerDriver, hasLifecycleCommands,
dockerRemoveVolumesByPrefix, and GATEWAY_NAME when making the guard change.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 97c054e5-f419-4e0c-b634-50dcf93944e4

📥 Commits

Reviewing files that changed from the base of the PR and between c05172c and 0d4a3f7.

📒 Files selected for processing (8)

.github/workflows/nightly-e2e.yaml
scripts/nemoclaw-start.sh
src/lib/onboard.ts
test/cli.test.ts
test/gateway-liveness-probe.test.ts
test/onboard.test.ts
test/policies.test.ts
test/sandbox-build-context.test.ts

✅ Files skipped from review due to trivial changes (1)

test/sandbox-build-context.test.ts

🚧 Files skipped from review as they are similar to previous changes (4)

test/cli.test.ts
test/gateway-liveness-probe.test.ts
test/policies.test.ts
test/onboard.test.ts

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@scripts/install-openshell.sh`:
- Line 93: The failure path in openshell_has_required_messaging_features()
conflates a missing strings binary with true lack of messaging-rewrite support;
update the function to detect if command -v strings fails and return a distinct
non-zero status while exporting or returning an error reason, and then change
the callers that currently log "OpenShell missing messaging rewrite support." to
check that reason and produce a specific error like "OpenShell missing 'strings'
(install binutils)" when strings is absent; ensure both call sites that handle
openshell_has_required_messaging_features() use this new distinction so minimal
container images without binutils get the correct diagnostic.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 565beee2-aa6d-454a-8887-ba1fcaa03fcb

📥 Commits

Reviewing files that changed from the base of the PR and between a614ab3 and 3aab8d9.

📒 Files selected for processing (2)

.github/workflows/nightly-e2e.yaml
scripts/install-openshell.sh

Signed-off-by: Aaron Erickson <aerickson@nvidia.com>

github-actions · 2026-05-12T03:25:41Z

Selective E2E Results — ❌ Some jobs failed

Run: 25711376978
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 0 passed, 38 failed, 2 skipped

Job	Result
brave-search-e2e	❌ failure
cloud-e2e	❌ failure
cloud-inference-e2e	⚠️ cancelled
cloud-onboard-e2e	❌ failure
credential-migration-e2e	❌ failure
credential-sanitization-e2e	❌ failure
deployment-services-e2e	❌ failure
device-auth-health-e2e	❌ failure
diagnostics-e2e	❌ failure
docs-validation-e2e	❌ failure
double-onboard-e2e	❌ failure
gateway-health-honest-e2e	❌ failure
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	❌ failure
hermes-e2e	❌ failure
hermes-inference-switch-e2e	❌ failure
hermes-slack-e2e	⚠️ cancelled
inference-routing-e2e	❌ failure
issue-2478-crash-loop-recovery-e2e	❌ failure
kimi-inference-compat-e2e	❌ failure
launchable-smoke-e2e	❌ failure
messaging-compatible-endpoint-e2e	❌ failure
messaging-providers-e2e	❌ failure
network-policy-e2e	❌ failure
onboard-repair-e2e	❌ failure
onboard-resume-e2e	❌ failure
openclaw-inference-switch-e2e	❌ failure
openshell-gateway-upgrade-e2e	❌ failure
overlayfs-autofix-e2e	❌ failure
rebuild-hermes-e2e	❌ failure
rebuild-hermes-stale-base-e2e	❌ failure
rebuild-openclaw-e2e	❌ failure
runtime-overrides-e2e	❌ failure
sandbox-operations-e2e	❌ failure
sandbox-survival-e2e	❌ failure
shields-config-e2e	❌ failure
skill-agent-e2e	❌ failure
snapshot-commands-e2e	❌ failure
telegram-injection-e2e	❌ failure
token-rotation-e2e	❌ failure
upgrade-stale-sandbox-e2e	❌ failure

Failed jobs: brave-search-e2e, cloud-e2e, cloud-onboard-e2e, credential-migration-e2e, credential-sanitization-e2e, deployment-services-e2e, device-auth-health-e2e, diagnostics-e2e, docs-validation-e2e, double-onboard-e2e, gateway-health-honest-e2e, hermes-discord-e2e, hermes-e2e, hermes-inference-switch-e2e, inference-routing-e2e, issue-2478-crash-loop-recovery-e2e, kimi-inference-compat-e2e, launchable-smoke-e2e, messaging-compatible-endpoint-e2e, messaging-providers-e2e, network-policy-e2e, onboard-repair-e2e, onboard-resume-e2e, openclaw-inference-switch-e2e, openshell-gateway-upgrade-e2e, overlayfs-autofix-e2e, rebuild-hermes-e2e, rebuild-hermes-stale-base-e2e, rebuild-openclaw-e2e, runtime-overrides-e2e, sandbox-operations-e2e, sandbox-survival-e2e, shields-config-e2e, skill-agent-e2e, snapshot-commands-e2e, telegram-injection-e2e, token-rotation-e2e, upgrade-stale-sandbox-e2e. Check run artifacts for logs.

github-actions · 2026-05-12T04:08:26Z

Selective E2E Results — ❌ Some jobs failed

Run: 25711440456
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 17 passed, 2 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	✅ success
cloud-inference-e2e	✅ success
cloud-onboard-e2e	✅ success
credential-migration-e2e	❌ failure
credential-sanitization-e2e	✅ success
deployment-services-e2e	⚠️ cancelled
device-auth-health-e2e	⚠️ cancelled
diagnostics-e2e	⚠️ cancelled
docs-validation-e2e	✅ success
double-onboard-e2e	⚠️ cancelled
gateway-health-honest-e2e	❌ failure
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	✅ success
hermes-e2e	✅ success
hermes-inference-switch-e2e	✅ success
hermes-slack-e2e	✅ success
inference-routing-e2e	⚠️ cancelled
issue-2478-crash-loop-recovery-e2e	⚠️ cancelled
kimi-inference-compat-e2e	✅ success
launchable-smoke-e2e	⚠️ cancelled
messaging-compatible-endpoint-e2e	⚠️ cancelled
messaging-providers-e2e	⚠️ cancelled
network-policy-e2e	⚠️ cancelled
onboard-repair-e2e	⚠️ cancelled
onboard-resume-e2e	✅ success
openclaw-inference-switch-e2e	⚠️ cancelled
openshell-gateway-upgrade-e2e	✅ success
overlayfs-autofix-e2e	✅ success
rebuild-hermes-e2e	⚠️ cancelled
rebuild-hermes-stale-base-e2e	✅ success
rebuild-openclaw-e2e	⚠️ cancelled
runtime-overrides-e2e	⚠️ cancelled
sandbox-operations-e2e	⚠️ cancelled
sandbox-survival-e2e	✅ success
shields-config-e2e	⚠️ cancelled
skill-agent-e2e	✅ success
snapshot-commands-e2e	⚠️ cancelled
telegram-injection-e2e	⚠️ cancelled
token-rotation-e2e	⚠️ cancelled
upgrade-stale-sandbox-e2e	⚠️ cancelled

Failed jobs: credential-migration-e2e, gateway-health-honest-e2e. Check run artifacts for logs.

github-actions · 2026-05-12T04:23:11Z

Selective E2E Results — ❌ Some jobs failed

Run: 25713038306
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 0 passed, 2 failed, 2 skipped

Job	Result
brave-search-e2e	⚠️ cancelled
cloud-e2e	⚠️ cancelled
cloud-inference-e2e	⚠️ cancelled
cloud-onboard-e2e	⚠️ cancelled
credential-migration-e2e	⚠️ cancelled
credential-sanitization-e2e	⚠️ cancelled
deployment-services-e2e	⚠️ cancelled
device-auth-health-e2e	⚠️ cancelled
diagnostics-e2e	⚠️ cancelled
docs-validation-e2e	⚠️ cancelled
double-onboard-e2e	❌ failure
gateway-health-honest-e2e	⚠️ cancelled
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	❌ failure
hermes-e2e	⚠️ cancelled
hermes-inference-switch-e2e	⚠️ cancelled
hermes-slack-e2e	⚠️ cancelled
inference-routing-e2e	⚠️ cancelled
issue-2478-crash-loop-recovery-e2e	⚠️ cancelled
kimi-inference-compat-e2e	⚠️ cancelled
launchable-smoke-e2e	⚠️ cancelled
messaging-compatible-endpoint-e2e	⚠️ cancelled
messaging-providers-e2e	⚠️ cancelled
network-policy-e2e	⚠️ cancelled
onboard-repair-e2e	⚠️ cancelled
onboard-resume-e2e	⚠️ cancelled
openclaw-inference-switch-e2e	⚠️ cancelled
openshell-gateway-upgrade-e2e	⚠️ cancelled
overlayfs-autofix-e2e	⚠️ cancelled
rebuild-hermes-e2e	⚠️ cancelled
rebuild-hermes-stale-base-e2e	⚠️ cancelled
rebuild-openclaw-e2e	⚠️ cancelled
runtime-overrides-e2e	⚠️ cancelled
sandbox-operations-e2e	⚠️ cancelled
sandbox-survival-e2e	⚠️ cancelled
shields-config-e2e	⚠️ cancelled
skill-agent-e2e	⚠️ cancelled
snapshot-commands-e2e	⚠️ cancelled
telegram-injection-e2e	⚠️ cancelled
token-rotation-e2e	⚠️ cancelled
upgrade-stale-sandbox-e2e	⚠️ cancelled

Failed jobs: double-onboard-e2e, hermes-discord-e2e. Check run artifacts for logs.

github-actions · 2026-05-12T05:10:55Z

Selective E2E Results — ❌ Some jobs failed

Run: 25713229387
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 6 passed, 34 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	❌ failure
cloud-inference-e2e	❌ failure
cloud-onboard-e2e	❌ failure
credential-migration-e2e	❌ failure
credential-sanitization-e2e	❌ failure
deployment-services-e2e	❌ failure
device-auth-health-e2e	❌ failure
diagnostics-e2e	❌ failure
docs-validation-e2e	❌ failure
double-onboard-e2e	❌ failure
gateway-health-honest-e2e	✅ success
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	❌ failure
hermes-e2e	❌ failure
hermes-inference-switch-e2e	❌ failure
hermes-slack-e2e	❌ failure
inference-routing-e2e	✅ success
issue-2478-crash-loop-recovery-e2e	❌ failure
kimi-inference-compat-e2e	✅ success
launchable-smoke-e2e	❌ failure
messaging-compatible-endpoint-e2e	❌ failure
messaging-providers-e2e	❌ failure
network-policy-e2e	❌ failure
onboard-repair-e2e	❌ failure
onboard-resume-e2e	❌ failure
openclaw-inference-switch-e2e	❌ failure
openshell-gateway-upgrade-e2e	❌ failure
overlayfs-autofix-e2e	✅ success
rebuild-hermes-e2e	❌ failure
rebuild-hermes-stale-base-e2e	❌ failure
rebuild-openclaw-e2e	✅ success
runtime-overrides-e2e	❌ failure
sandbox-operations-e2e	❌ failure
sandbox-survival-e2e	❌ failure
shields-config-e2e	❌ failure
skill-agent-e2e	❌ failure
snapshot-commands-e2e	❌ failure
telegram-injection-e2e	❌ failure
token-rotation-e2e	❌ failure
upgrade-stale-sandbox-e2e	❌ failure

Failed jobs: cloud-e2e, cloud-inference-e2e, cloud-onboard-e2e, credential-migration-e2e, credential-sanitization-e2e, deployment-services-e2e, device-auth-health-e2e, diagnostics-e2e, docs-validation-e2e, double-onboard-e2e, hermes-discord-e2e, hermes-e2e, hermes-inference-switch-e2e, hermes-slack-e2e, issue-2478-crash-loop-recovery-e2e, launchable-smoke-e2e, messaging-compatible-endpoint-e2e, messaging-providers-e2e, network-policy-e2e, onboard-repair-e2e, onboard-resume-e2e, openclaw-inference-switch-e2e, openshell-gateway-upgrade-e2e, rebuild-hermes-e2e, rebuild-hermes-stale-base-e2e, runtime-overrides-e2e, sandbox-operations-e2e, sandbox-survival-e2e, shields-config-e2e, skill-agent-e2e, snapshot-commands-e2e, telegram-injection-e2e, token-rotation-e2e, upgrade-stale-sandbox-e2e. Check run artifacts for logs.

github-actions · 2026-05-12T06:31:47Z

Selective E2E Results — ❌ Some jobs failed

Run: 25716272137
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 4 passed, 20 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	❌ failure
cloud-inference-e2e	⚠️ cancelled
cloud-onboard-e2e	⚠️ cancelled
credential-migration-e2e	❌ failure
credential-sanitization-e2e	⚠️ cancelled
deployment-services-e2e	⚠️ cancelled
device-auth-health-e2e	❌ failure
diagnostics-e2e	❌ failure
docs-validation-e2e	❌ failure
double-onboard-e2e	❌ failure
gateway-health-honest-e2e	✅ success
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	⚠️ cancelled
hermes-e2e	❌ failure
hermes-inference-switch-e2e	❌ failure
hermes-slack-e2e	❌ failure
inference-routing-e2e	⚠️ cancelled
issue-2478-crash-loop-recovery-e2e	❌ failure
kimi-inference-compat-e2e	✅ success
launchable-smoke-e2e	❌ failure
messaging-compatible-endpoint-e2e	❌ failure
messaging-providers-e2e	⚠️ cancelled
network-policy-e2e	⚠️ cancelled
onboard-repair-e2e	❌ failure
onboard-resume-e2e	⚠️ cancelled
openclaw-inference-switch-e2e	⚠️ cancelled
openshell-gateway-upgrade-e2e	❌ failure
overlayfs-autofix-e2e	✅ success
rebuild-hermes-e2e	⚠️ cancelled
rebuild-hermes-stale-base-e2e	❌ failure
rebuild-openclaw-e2e	⚠️ cancelled
runtime-overrides-e2e	❌ failure
sandbox-operations-e2e	⚠️ cancelled
sandbox-survival-e2e	⚠️ cancelled
shields-config-e2e	⚠️ cancelled
skill-agent-e2e	⚠️ cancelled
snapshot-commands-e2e	❌ failure
telegram-injection-e2e	❌ failure
token-rotation-e2e	❌ failure
upgrade-stale-sandbox-e2e	❌ failure

Failed jobs: cloud-e2e, credential-migration-e2e, device-auth-health-e2e, diagnostics-e2e, docs-validation-e2e, double-onboard-e2e, hermes-e2e, hermes-inference-switch-e2e, hermes-slack-e2e, issue-2478-crash-loop-recovery-e2e, launchable-smoke-e2e, messaging-compatible-endpoint-e2e, onboard-repair-e2e, openshell-gateway-upgrade-e2e, rebuild-hermes-stale-base-e2e, runtime-overrides-e2e, snapshot-commands-e2e, telegram-injection-e2e, token-rotation-e2e, upgrade-stale-sandbox-e2e. Check run artifacts for logs.

github-actions · 2026-05-12T07:43:11Z

Selective E2E Results — ✅ All requested jobs passed

Run: 25717826183
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 40 passed, 0 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	✅ success
cloud-inference-e2e	✅ success
cloud-onboard-e2e	✅ success
credential-migration-e2e	✅ success
credential-sanitization-e2e	✅ success
deployment-services-e2e	✅ success
device-auth-health-e2e	✅ success
diagnostics-e2e	✅ success
docs-validation-e2e	✅ success
double-onboard-e2e	✅ success
gateway-health-honest-e2e	✅ success
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	✅ success
hermes-e2e	✅ success
hermes-inference-switch-e2e	✅ success
hermes-slack-e2e	✅ success
inference-routing-e2e	✅ success
issue-2478-crash-loop-recovery-e2e	✅ success
kimi-inference-compat-e2e	✅ success
launchable-smoke-e2e	✅ success
messaging-compatible-endpoint-e2e	✅ success
messaging-providers-e2e	✅ success
network-policy-e2e	✅ success
onboard-repair-e2e	✅ success
onboard-resume-e2e	✅ success
openclaw-inference-switch-e2e	✅ success
openshell-gateway-upgrade-e2e	✅ success
overlayfs-autofix-e2e	✅ success
rebuild-hermes-e2e	✅ success
rebuild-hermes-stale-base-e2e	✅ success
rebuild-openclaw-e2e	✅ success
runtime-overrides-e2e	✅ success
sandbox-operations-e2e	✅ success
sandbox-survival-e2e	✅ success
shields-config-e2e	✅ success
skill-agent-e2e	✅ success
snapshot-commands-e2e	✅ success
telegram-injection-e2e	✅ success
token-rotation-e2e	✅ success
upgrade-stale-sandbox-e2e	✅ success

github-actions · 2026-05-12T14:52:34Z

Selective E2E Results — ❌ Some jobs failed

Run: 25740269924
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 2 passed, 20 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	❌ failure
cloud-inference-e2e	❌ failure
cloud-onboard-e2e	⚠️ cancelled
credential-migration-e2e	⚠️ cancelled
credential-sanitization-e2e	⚠️ cancelled
deployment-services-e2e	⚠️ cancelled
device-auth-health-e2e	❌ failure
diagnostics-e2e	❌ failure
docs-validation-e2e	❌ failure
double-onboard-e2e	⚠️ cancelled
gateway-health-honest-e2e	✅ success
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	❌ failure
hermes-e2e	❌ failure
hermes-inference-switch-e2e	⚠️ cancelled
hermes-slack-e2e	❌ failure
inference-routing-e2e	❌ failure
issue-2478-crash-loop-recovery-e2e	⚠️ cancelled
kimi-inference-compat-e2e	❌ failure
launchable-smoke-e2e	❌ failure
messaging-compatible-endpoint-e2e	⚠️ cancelled
messaging-providers-e2e	❌ failure
network-policy-e2e	⚠️ cancelled
onboard-repair-e2e	❌ failure
onboard-resume-e2e	❌ failure
openclaw-inference-switch-e2e	❌ failure
openshell-gateway-upgrade-e2e	⚠️ cancelled
overlayfs-autofix-e2e	⚠️ cancelled
rebuild-hermes-e2e	⚠️ cancelled
rebuild-hermes-stale-base-e2e	⚠️ cancelled
rebuild-openclaw-e2e	⚠️ cancelled
runtime-overrides-e2e	⚠️ cancelled
sandbox-operations-e2e	❌ failure
sandbox-survival-e2e	❌ failure
shields-config-e2e	❌ failure
skill-agent-e2e	❌ failure
snapshot-commands-e2e	⚠️ cancelled
telegram-injection-e2e	⚠️ cancelled
token-rotation-e2e	❌ failure
upgrade-stale-sandbox-e2e	⚠️ cancelled

Failed jobs: cloud-e2e, cloud-inference-e2e, device-auth-health-e2e, diagnostics-e2e, docs-validation-e2e, hermes-discord-e2e, hermes-e2e, hermes-slack-e2e, inference-routing-e2e, kimi-inference-compat-e2e, launchable-smoke-e2e, messaging-providers-e2e, onboard-repair-e2e, onboard-resume-e2e, openclaw-inference-switch-e2e, sandbox-operations-e2e, sandbox-survival-e2e, shields-config-e2e, skill-agent-e2e, token-rotation-e2e. Check run artifacts for logs.

…websocket # Conflicts: # test/install-openshell-version-check.test.ts

github-actions · 2026-05-12T15:27:58Z

Selective E2E Results — ✅ All requested jobs passed

Run: 25742530325
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 4 passed, 0 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	⚠️ cancelled
cloud-inference-e2e	⚠️ cancelled
cloud-onboard-e2e	⚠️ cancelled
credential-migration-e2e	⚠️ cancelled
credential-sanitization-e2e	⚠️ cancelled
deployment-services-e2e	⚠️ cancelled
device-auth-health-e2e	⚠️ cancelled
diagnostics-e2e	⚠️ cancelled
docs-validation-e2e	⚠️ cancelled
double-onboard-e2e	⚠️ cancelled
gateway-health-honest-e2e	⚠️ cancelled
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	⚠️ cancelled
hermes-e2e	⚠️ cancelled
hermes-inference-switch-e2e	✅ success
hermes-slack-e2e	✅ success
inference-routing-e2e	⚠️ cancelled
issue-2478-crash-loop-recovery-e2e	⚠️ cancelled
kimi-inference-compat-e2e	⚠️ cancelled
launchable-smoke-e2e	⚠️ cancelled
messaging-compatible-endpoint-e2e	⚠️ cancelled
messaging-providers-e2e	⚠️ cancelled
network-policy-e2e	⚠️ cancelled
onboard-repair-e2e	⚠️ cancelled
onboard-resume-e2e	⚠️ cancelled
openclaw-inference-switch-e2e	⚠️ cancelled
openshell-gateway-upgrade-e2e	⚠️ cancelled
overlayfs-autofix-e2e	✅ success
rebuild-hermes-e2e	⚠️ cancelled
rebuild-hermes-stale-base-e2e	⚠️ cancelled
rebuild-openclaw-e2e	⚠️ cancelled
runtime-overrides-e2e	⚠️ cancelled
sandbox-operations-e2e	⚠️ cancelled
sandbox-survival-e2e	⚠️ cancelled
shields-config-e2e	⚠️ cancelled
skill-agent-e2e	⚠️ cancelled
snapshot-commands-e2e	⚠️ cancelled
telegram-injection-e2e	⚠️ cancelled
token-rotation-e2e	⚠️ cancelled
upgrade-stale-sandbox-e2e	⚠️ cancelled

github-actions · 2026-05-12T16:33:29Z

Selective E2E Results — ❌ Some jobs failed

Run: 25744597156
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 39 passed, 1 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	✅ success
cloud-inference-e2e	✅ success
cloud-onboard-e2e	✅ success
credential-migration-e2e	✅ success
credential-sanitization-e2e	✅ success
deployment-services-e2e	✅ success
device-auth-health-e2e	✅ success
diagnostics-e2e	✅ success
docs-validation-e2e	✅ success
double-onboard-e2e	✅ success
gateway-health-honest-e2e	✅ success
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	✅ success
hermes-e2e	✅ success
hermes-inference-switch-e2e	✅ success
hermes-slack-e2e	✅ success
inference-routing-e2e	✅ success
issue-2478-crash-loop-recovery-e2e	✅ success
kimi-inference-compat-e2e	✅ success
launchable-smoke-e2e	✅ success
messaging-compatible-endpoint-e2e	✅ success
messaging-providers-e2e	✅ success
network-policy-e2e	✅ success
onboard-repair-e2e	✅ success
onboard-resume-e2e	✅ success
openclaw-inference-switch-e2e	✅ success
openshell-gateway-upgrade-e2e	✅ success
overlayfs-autofix-e2e	✅ success
rebuild-hermes-e2e	✅ success
rebuild-hermes-stale-base-e2e	✅ success
rebuild-openclaw-e2e	✅ success
runtime-overrides-e2e	✅ success
sandbox-operations-e2e	✅ success
sandbox-survival-e2e	✅ success
shields-config-e2e	✅ success
skill-agent-e2e	✅ success
snapshot-commands-e2e	✅ success
telegram-injection-e2e	✅ success
token-rotation-e2e	❌ failure
upgrade-stale-sandbox-e2e	✅ success

Failed jobs: token-rotation-e2e. Check run artifacts for logs.

github-actions · 2026-05-12T17:43:00Z

Selective E2E Results — ✅ All requested jobs passed

Run: 25744597156
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 40 passed, 0 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	✅ success
cloud-inference-e2e	✅ success
cloud-onboard-e2e	✅ success
credential-migration-e2e	✅ success
credential-sanitization-e2e	✅ success
deployment-services-e2e	✅ success
device-auth-health-e2e	✅ success
diagnostics-e2e	✅ success
docs-validation-e2e	✅ success
double-onboard-e2e	✅ success
gateway-health-honest-e2e	✅ success
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	✅ success
hermes-e2e	✅ success
hermes-inference-switch-e2e	✅ success
hermes-slack-e2e	✅ success
inference-routing-e2e	✅ success
issue-2478-crash-loop-recovery-e2e	✅ success
kimi-inference-compat-e2e	✅ success
launchable-smoke-e2e	✅ success
messaging-compatible-endpoint-e2e	✅ success
messaging-providers-e2e	✅ success
network-policy-e2e	✅ success
onboard-repair-e2e	✅ success
onboard-resume-e2e	✅ success
openclaw-inference-switch-e2e	✅ success
openshell-gateway-upgrade-e2e	✅ success
overlayfs-autofix-e2e	✅ success
rebuild-hermes-e2e	✅ success
rebuild-hermes-stale-base-e2e	✅ success
rebuild-openclaw-e2e	✅ success
runtime-overrides-e2e	✅ success
sandbox-operations-e2e	✅ success
sandbox-survival-e2e	✅ success
shields-config-e2e	✅ success
skill-agent-e2e	✅ success
snapshot-commands-e2e	✅ success
telegram-injection-e2e	✅ success
token-rotation-e2e	✅ success
upgrade-stale-sandbox-e2e	✅ success

…websocket

github-actions · 2026-05-12T22:13:33Z

Selective E2E Results — ⚠️ No requested jobs ran

Run: 25764363965
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 0 passed, 0 failed, 2 skipped

Job	Result
brave-search-e2e	⚠️ cancelled
cloud-e2e	⚠️ cancelled
cloud-inference-e2e	⚠️ cancelled
cloud-onboard-e2e	⚠️ cancelled
credential-migration-e2e	⚠️ cancelled
credential-sanitization-e2e	⚠️ cancelled
deployment-services-e2e	⚠️ cancelled
device-auth-health-e2e	⚠️ cancelled
diagnostics-e2e	⚠️ cancelled
docs-validation-e2e	⚠️ cancelled
double-onboard-e2e	⚠️ cancelled
gateway-health-honest-e2e	⚠️ cancelled
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	⚠️ cancelled
hermes-e2e	⚠️ cancelled
hermes-inference-switch-e2e	⚠️ cancelled
hermes-slack-e2e	⚠️ cancelled
inference-routing-e2e	⚠️ cancelled
issue-2478-crash-loop-recovery-e2e	⚠️ cancelled
kimi-inference-compat-e2e	⚠️ cancelled
launchable-smoke-e2e	⚠️ cancelled
messaging-compatible-endpoint-e2e	⚠️ cancelled
messaging-providers-e2e	⚠️ cancelled
network-policy-e2e	⚠️ cancelled
onboard-repair-e2e	⚠️ cancelled
onboard-resume-e2e	⚠️ cancelled
openclaw-inference-switch-e2e	⚠️ cancelled
openshell-gateway-upgrade-e2e	⚠️ cancelled
overlayfs-autofix-e2e	⚠️ cancelled
rebuild-hermes-e2e	⚠️ cancelled
rebuild-hermes-stale-base-e2e	⚠️ cancelled
rebuild-openclaw-e2e	⚠️ cancelled
runtime-overrides-e2e	⚠️ cancelled
sandbox-operations-e2e	⚠️ cancelled
sandbox-survival-e2e	⚠️ cancelled
shields-config-e2e	⚠️ cancelled
skill-agent-e2e	⚠️ cancelled
snapshot-commands-e2e	⚠️ cancelled
telegram-injection-e2e	⚠️ cancelled
token-rotation-e2e	⚠️ cancelled
upgrade-stale-sandbox-e2e	⚠️ cancelled

github-actions · 2026-05-12T22:17:21Z

Selective E2E Results — ❌ Some jobs failed

Run: 25765294451
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 4 passed, 2 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	⚠️ cancelled
cloud-inference-e2e	⚠️ cancelled
cloud-onboard-e2e	⚠️ cancelled
credential-migration-e2e	⚠️ cancelled
credential-sanitization-e2e	⚠️ cancelled
deployment-services-e2e	⚠️ cancelled
device-auth-health-e2e	⚠️ cancelled
diagnostics-e2e	⚠️ cancelled
docs-validation-e2e	⚠️ cancelled
double-onboard-e2e	⚠️ cancelled
gateway-health-honest-e2e	✅ success
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	❌ failure
hermes-e2e	⚠️ cancelled
hermes-inference-switch-e2e	⚠️ cancelled
hermes-slack-e2e	⚠️ cancelled
inference-routing-e2e	⚠️ cancelled
issue-2478-crash-loop-recovery-e2e	⚠️ cancelled
kimi-inference-compat-e2e	⚠️ cancelled
launchable-smoke-e2e	⚠️ cancelled
messaging-compatible-endpoint-e2e	⚠️ cancelled
messaging-providers-e2e	❌ failure
network-policy-e2e	⚠️ cancelled
onboard-repair-e2e	⚠️ cancelled
onboard-resume-e2e	⚠️ cancelled
openclaw-inference-switch-e2e	⚠️ cancelled
openshell-gateway-upgrade-e2e	✅ success
overlayfs-autofix-e2e	✅ success
rebuild-hermes-e2e	⚠️ cancelled
rebuild-hermes-stale-base-e2e	⚠️ cancelled
rebuild-openclaw-e2e	⚠️ cancelled
runtime-overrides-e2e	⚠️ cancelled
sandbox-operations-e2e	⚠️ cancelled
sandbox-survival-e2e	⚠️ cancelled
shields-config-e2e	⚠️ cancelled
skill-agent-e2e	⚠️ cancelled
snapshot-commands-e2e	⚠️ cancelled
telegram-injection-e2e	⚠️ cancelled
token-rotation-e2e	⚠️ cancelled
upgrade-stale-sandbox-e2e	⚠️ cancelled

Failed jobs: hermes-discord-e2e, messaging-providers-e2e. Check run artifacts for logs.

github-actions · 2026-05-12T22:47:08Z

Selective E2E Results — ✅ All requested jobs passed

Run: 25765443073
Branch: fix/native-messaging-websocket
Requested jobs: all (no filter)
Summary: 40 passed, 0 failed, 2 skipped

Job	Result
brave-search-e2e	✅ success
cloud-e2e	✅ success
cloud-inference-e2e	✅ success
cloud-onboard-e2e	✅ success
credential-migration-e2e	✅ success
credential-sanitization-e2e	✅ success
deployment-services-e2e	✅ success
device-auth-health-e2e	✅ success
diagnostics-e2e	✅ success
docs-validation-e2e	✅ success
double-onboard-e2e	✅ success
gateway-health-honest-e2e	✅ success
gpu-double-onboard-e2e	⏭️ skipped
gpu-e2e	⏭️ skipped
hermes-discord-e2e	✅ success
hermes-e2e	✅ success
hermes-inference-switch-e2e	✅ success
hermes-slack-e2e	✅ success
inference-routing-e2e	✅ success
issue-2478-crash-loop-recovery-e2e	✅ success
kimi-inference-compat-e2e	✅ success
launchable-smoke-e2e	✅ success
messaging-compatible-endpoint-e2e	✅ success
messaging-providers-e2e	✅ success
network-policy-e2e	✅ success
onboard-repair-e2e	✅ success
onboard-resume-e2e	✅ success
openclaw-inference-switch-e2e	✅ success
openshell-gateway-upgrade-e2e	✅ success
overlayfs-autofix-e2e	✅ success
rebuild-hermes-e2e	✅ success
rebuild-hermes-stale-base-e2e	✅ success
rebuild-openclaw-e2e	✅ success
runtime-overrides-e2e	✅ success
sandbox-operations-e2e	✅ success
sandbox-survival-e2e	✅ success
shields-config-e2e	✅ success
skill-agent-e2e	✅ success
snapshot-commands-e2e	✅ success
telegram-injection-e2e	✅ success
token-rotation-e2e	✅ success
upgrade-stale-sandbox-e2e	✅ success

fix(messaging): use native websocket credential rewrite

90a73b1

coderabbitai Bot reviewed May 10, 2026

View reviewed changes

Comment thread scripts/install-openshell.sh Outdated

Comment thread src/lib/onboard.ts Outdated

Comment thread src/lib/onboard.ts Outdated

fix(messaging): remove credential rewrite bridges

84c189f

coderabbitai Bot reviewed May 11, 2026

View reviewed changes

Comment thread src/lib/onboard.ts Outdated

Comment thread src/lib/onboard.ts Outdated

Comment thread src/lib/onboard.ts Outdated

Comment thread test/e2e/lib/fake-slack-api.cjs

Comment thread test/onboard.test.ts

fix(onboard): fail closed gateway lifecycle cleanup

b77051d

coderabbitai Bot reviewed May 11, 2026

View reviewed changes

Comment thread src/lib/onboard.ts Outdated

Comment thread test/e2e/lib/fake-slack-api.cjs Outdated

fix(onboard): keep failed gateway cleanup non-reusable

b09e6a9

coderabbitai Bot reviewed May 11, 2026

View reviewed changes

Comment thread test/e2e/lib/fake-slack-api.cjs Outdated

test(e2e): validate fake slack api port

c9ba631

ericksoa added 2 commits May 11, 2026 12:55

Merge remote-tracking branch 'origin/main' into fix/native-messaging-…

316baae

…websocket # Conflicts: # agents/hermes/start.sh # test/sandbox-init.test.ts

github-advanced-security AI found potential problems May 11, 2026

View reviewed changes

Comment thread test/policies.test.ts Fixed

Merge remote-tracking branch 'origin/main' into fix/native-messaging-…

c05172c

…websocket

cv closed this May 12, 2026

ericksoa reopened this May 12, 2026

coderabbitai Bot reviewed May 12, 2026

View reviewed changes

Comment thread test/e2e/lib/fake-slack-api.cjs

Merge remote-tracking branch 'origin/main' into fix/native-messaging-…

0d4a3f7

…websocket # Conflicts: # test/gateway-liveness-probe.test.ts

coderabbitai Bot reviewed May 12, 2026

View reviewed changes

Comment thread src/lib/onboard.ts Outdated

ericksoa added 4 commits May 11, 2026 18:40

ci: pin openshell pr for messaging nightly

6d82117

ci: route openshell pr pin through installer

bd0e346

ci: preinstall openshell pin for messaging e2e

3205ab4

ci: avoid removing openshell install dir

752ae55

coderabbitai Bot reviewed May 12, 2026

View reviewed changes

Comment thread scripts/install-openshell.sh Outdated

ericksoa added 2 commits May 11, 2026 20:16

ci: build OpenShell PR for messaging nightly

4ad426b

Signed-off-by: Aaron Erickson <aerickson@nvidia.com>

ci: trust OpenShell mise config in nightly build

645870c

Signed-off-by: Aaron Erickson <aerickson@nvidia.com>

ci: fix nightly after OpenShell PR pin

f8df884

ci: prefetch Z3 for OpenShell nightly build

d18ed4f

ericksoa added 2 commits May 11, 2026 22:52

fix: probe Docker-driver gateway health endpoint

483f165

ci: refresh OpenShell PR nightly pin

cc05227

fix: probe Docker-driver gRPC health

f0b8e01

ci: build OpenShell main for messaging nightly

8e04c7b

ericksoa added 2 commits May 12, 2026 07:53

ci: mark OpenShell main nightly build as dev channel

799839d

Merge remote-tracking branch 'origin/main' into fix/native-messaging-…

816a6c0

…websocket # Conflicts: # test/install-openshell-version-check.test.ts

fix: address CodeRabbit feedback

2e1d57b

ericksoa added 4 commits May 12, 2026 14:12

test: stabilize macos debug timeout

afb4896

Merge remote-tracking branch 'origin/main' into fix/native-messaging-…

e7baa6d

…websocket

Merge remote-tracking branch 'origin/main' into fix/native-messaging-…

7e83675

…websocket

chore: require openshell 0.0.39

aa3c8fd

fix: avoid openshell e2e wrapper recursion

36ab7f8

Conversation

ericksoa commented May 10, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Dependency

Validation

Notes

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

E2E Advisor Recommendation

Pi Semantic E2E Advisor

Required E2E

Optional E2E

New E2E recommendations

Dispatch hint

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented May 12, 2026

Selective E2E Results — ❌ Some jobs failed

Uh oh!

github-actions Bot commented May 12, 2026

Selective E2E Results — ❌ Some jobs failed

Uh oh!

github-actions Bot commented May 12, 2026

Selective E2E Results — ❌ Some jobs failed

Uh oh!

github-actions Bot commented May 12, 2026

Selective E2E Results — ❌ Some jobs failed

Uh oh!

github-actions Bot commented May 12, 2026

Selective E2E Results — ❌ Some jobs failed

Uh oh!

github-actions Bot commented May 12, 2026

ericksoa commented May 10, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 10, 2026 •

edited

Loading

github-actions Bot commented May 12, 2026 •

edited

Loading