From 336534ab9f5e41564d0f29540dcd9a2f56eef0e1 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 09:50:57 +0800
Subject: [PATCH 01/17] =?UTF-8?q?feat:=20#216=20in-sandbox=20authorized=20?=
 =?UTF-8?q?cred-fetch=20harness=20=E2=80=94=20the=20sandbox=20agent=20pull?=
 =?UTF-8?q?s=20the=20vaulted=20LLM=20key=20via=20its=20granted=20scope?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The #216 gap: cred-fetch-demo/cred-wire-demo prove the chain master-self
(operator==actor skips the scope check, #195) and nothing fetched from
INSIDE the sandbox as the agent. This closes it across three layers:

- sandbox-agent-isolation.sh: + the #216 cred half — positive
  (agentkeys cred fetch of the P.3-granted service, in-sandbox identity
  from ~/.agentkeys/harness-env) and the scope-denial negative (an
  un-granted probe MUST fail service_not_in_scope; any other outcome
  fails loud).
- phase1-wire-demo.sh: 1.4b stages ~/.agentkeys/harness-env (0600) in
  the sandbox — also fixes the bare-shell env contract the isolation
  script silently lacked (it pointed at the stale :8088 MCP default);
  1.4c uploads the proof script (absolute path + verified — the bare
  filename upload was rejected by the aiosandbox file API and curl
  exited 0 anyway); Phase 4.0 now fetches + plants the LLM key
  IN-SANDBOX as the agent (plaintext never leaves the sandbox; host-CLI
  fetch is the compat fallback, operator env stays the labelled
  DEV-only fallback).
- v2-stage3-demo.sh step 18: the #216 cred-side scope triad on the
  granted agent — cred-fetch cap for the granted service (200),
  un-granted probe (ServiceNotInScope), and the CI/mock-only live
  REVOKE transition (setScope drops the service → the same mint is
  denied → restore), enforcing the '#216 revoke cuts the agent off'
  acceptance in CI. Also fixes upload_sandbox_isolation_test, which
  silently no-op'd (relative upload path + unchecked API body).

Verified live: prod broker /v1/cap/cred-fetch layered errors; the 1.4b
staging command (0600, 11 keys), the new script + the exact Phase 4.0
one-liner against a real aiosandbox; the fixed stage-3 upload. The
chain-gated positives run in CI (software master + mock agent) and on
the operator's next v2-demo.sh run (Touch ID register + pairing).

Docs synced in the same change: harness/CLAUDE.md (inventory rows,
sandbox role) + docs/operator-runbook-harness.md (On Sandbox, phase
3/5 proofs, two new Q&A entries).
---
 docs/operator-runbook-harness.md           | 33 +++++++-
 harness/CLAUDE.md                          | 10 ++-
 harness/phase1-wire-demo.sh                | 86 ++++++++++++++------
 harness/scripts/sandbox-agent-isolation.sh | 88 ++++++++++++++++++++-
 harness/v2-stage3-demo.sh                  | 92 +++++++++++++++++++++-
 5 files changed, 274 insertions(+), 35 deletions(-)
diff --git a/docs/operator-runbook-harness.md b/docs/operator-runbook-harness.md
index e059597f..ecc4a883 100644
--- a/docs/operator-runbook-harness.md
+++ b/docs/operator-runbook-harness.md
@@ -53,6 +53,14 @@ and run the real-agent proof (it signs with the agent's **sandbox-held** key):
 bash "$HOME/sandbox-agent-isolation.sh"   # the REAL agent: the deferred roundtrip (steps 11-12 / 14-15), sandbox-held key
 ```
 
+The proof covers BOTH halves of the real agent: the **memory roundtrip** (cap-mint → STS
+signed as the agent → worker → S3) AND the **#216 cred fetch** — the agent pulls its
+authorized LLM key from the master's vault (`agentkeys cred fetch`, gated by the
+`cred:<service>` scope granted at pairing) and an **un-granted probe service is denied
+with `service_not_in_scope`** (the permission gate, tested from both sides). It reads its
+coordinates from `~/.agentkeys/harness-env`, which the wire phase stages (step 1.4b) —
+no hand-exported env needed.
+
 (If `v2-demo.sh` reported the wire phase **skipped — no aiosandbox**, the agent wasn't paired:
 set the sandbox up and re-pair with `bash harness/v2-demo.sh --from 5`. The wire demo is
 `--real`-only now — the in-memory `--light` path was removed, #207.)
@@ -104,7 +112,10 @@ is the only one CI can't do — no aiosandbox).
   isolation layers, and the **scope triad** — step 16 master-self cap (operator==actor, no
   scope → 200), step 17 cross-actor un-granted → ServiceNotInScope, step 18 granted agent
   (operator≠actor, master granted the scope → 200; the positive delegation proof) — the
-  #195/#196 gate. **Steps 19–21 (#201)** prove the **Config** data-class isolation (master-only
+  #195/#196 gate. **Step 18 also runs the #216 cred-side triad**: a **cred-fetch cap** for
+  the granted service → 200, an un-granted probe → ServiceNotInScope, and (CI/mock only)
+  the **live revoke transition** — setScope drops the service, the same mint is denied,
+  then the scope is restored ("revoking the cred scope cuts the agent off", enforced in CI). **Steps 19–21 (#201)** prove the **Config** data-class isolation (master-only
   taxonomy): step 19 config creds write own `config/` prefix (200) but AccessDenied at the
   memory/vault buckets (+ memory creds → config bucket AccessDenied); steps 20–21 the
   cap data-class-mismatch (config cap ↔ memory/cred workers). These are **master-self → run on
@@ -123,9 +134,12 @@ is the only one CI can't do — no aiosandbox).
 - **phase 5 — wire** (`phase1-wire-demo.sh --real --webauthn`): the agent inside the sandbox reads +
   writes its real memory through `agentkeys wire` — cap-mint → STS relay (`X-Aws-*`) → `memory.litentry.org`
   → S3 `bots/<actor>/memory/`, passively injected each turn by the `pre_llm_call` hook. **Pairs the
-  §10.2 agent AND the master grants its `memory:<ns>` scope via Touch ID** (`--webauthn`; the agent's
-  cap service is `memory:<ns>`, so without the grant `memory.get` → `service_not_in_scope`). The
-  **only** real-memory proof (real-only — the in-memory `--light` path was removed, #207).
+  §10.2 agent AND the master grants its `memory:<ns>` + `cred:<service>` scopes via Touch ID**
+  (`--webauthn`; without the grant `memory.get` / `cred fetch` → `service_not_in_scope`). **Phase 4.0
+  (#216): the agent fetches its LLM key from the master's vault IN-SANDBOX** (`agentkeys cred fetch`
+  via the granted cred scope, coordinates from the 1.4b-staged `~/.agentkeys/harness-env`) and plants
+  it into Hermes without the plaintext leaving the sandbox — the operator env key is only a labelled
+  DEV fallback. The **only** real-memory proof (real-only — the in-memory `--light` path was removed, #207).
 - **phase 6 — web↔agent parity** (`web-parity-demo.sh`): boots `agentkeys-daemon --ui-bridge` (seeded
   with the master's J1 + device via the `--ui-bridge-seed-*` seam, so it skips re-onboarding) and
   plants a dedicated `webparity` probe namespace through the **web** endpoint
@@ -297,6 +311,17 @@ Touch-ID phases 1-2; use `--from 3.16` to jump straight to step 16 if the sessio
 **Q. `DeviceNotActive` / cap-mint device mismatch?**
 The master isn't registered. Re-run stage 1, or `bash harness/scripts/erc4337-register-master.sh`.
 
+**Q. `sandbox-agent-isolation.sh` says `SKIP: cred coordinates not staged`?**
+The sandbox's `~/.agentkeys/harness-env` predates the #216 staging (or the wire phase didn't
+finish). Re-run the wire on the operator host — `bash harness/v2-demo.sh --from 5` (or
+`bash harness/phase1-wire-demo.sh --real --webauthn`) — step 1.4b rewrites the env file with
+the cred coordinates + a fresh session bearer, then re-run the proof in the sandbox.
+
+**Q. The in-sandbox cred fetch fails with `service_not_in_scope` for the GRANTED service?**
+The pairing ran without `--webauthn`, so P.3 never granted the agent's scopes (the grant is a
+Touch ID ceremony). Re-run `bash harness/phase1-wire-demo.sh --real --webauthn` and approve the
+prompt — the fresh pairing grants `memory:<ns>` + the cred service together.
+
 **Q. `MalformedPolicyDocument` / empty AWS results?**
 Wrong profile/region: `awsp <profile>`; always pass `--region "$REGION"` (per [`../CLAUDE.md`](../CLAUDE.md)).
 
diff --git a/harness/CLAUDE.md b/harness/CLAUDE.md
index 1f285b75..dac5773f 100644
--- a/harness/CLAUDE.md
+++ b/harness/CLAUDE.md
@@ -223,8 +223,10 @@ Every orchestrator + the operator runbook MUST keep this split exact:
   **only the operator/master-side tests**; the **agent-side** steps (stage 3 11-12, signed
   AS the agent) **`defer`** to the sandbox — never a failure, never mocked on the operator.
 - **SANDBOX — the real §10.2 agent.** The agent's K10 lives in the sandbox, so the agent-side
-  roundtrip runs THERE (`phase1-wire-demo.sh --real` pairs it; `sandbox-agent-isolation.sh`
-  runs the deferred roundtrip with the sandbox-held key via `sbx_exec`). The master never
+  roundtrip runs THERE (`phase1-wire-demo.sh --real` pairs it + stages `~/.agentkeys/harness-env`;
+  `sandbox-agent-isolation.sh` runs the deferred roundtrip with the sandbox-held key via
+  `sbx_exec`: the memory roundtrip PLUS the #216 cred half — `agentkeys cred fetch` of the
+  authorized service AND the scope-denial negative for an un-granted probe). The master never
   signs for the agent. This is the real agent-side coverage.
 - **CI (`--ci`) — headless, no biometric, no sandbox.** Software register (no Touch ID), stub
   K11 (`WEBAUTHN_MODE=0`), the **mock agent** for the agent-side steps (the sole
@@ -251,8 +253,8 @@ sandbox) is **GREEN**, never fail/incomplete.
 | **`v2-demo.sh`** | **THE single entry point — no flags = phases 1→2→3→4 (memory plant)→5 (wire)→6 (web↔agent parity); wire auto-runs when the aiosandbox is up, else reports INCOMPLETE + exits non-zero (an unexecuted proof is never green — pass `--wire none` to intentionally skip); fail-fast. `PHASE.STEP` addressing (`--from 4.1`, `--only 3.11`). Flags are CI/scoping only.** | (no flags) / `--ci` / `--stage N` / `--from P.S` / `--only P.S` / `--wire real\|light\|none` |
 | `v2-stage1-demo.sh` | M1 foundation demo | `--only-step N` |
 | `v2-stage2-demo.sh` | hardening demo | `--only-step N` |
-| `v2-stage3-demo.sh` | OIDC + per-actor/data-class isolation proof (23 steps; 16–17 = #196 master-self + cross-actor scope; **19–21 = #201 Config data-class isolation** — master-self layer-3/4 + cap data-class-mismatch, run on the operator, `skip` until config infra is provisioned/deployed; **22 = #207 classifier-worker isolation** — master-self `cap_op_mismatch` (storage cap → classify worker) + `cap_data_class_mismatch` (cross-data-class Classify cap), compute-gate so NO STS, `skip` until the worker is deployed; **23 = cleanup + summary**). **Steps 11-12 / 14-15 sign STS creds AS the agent: on the operator they `defer` to the sandbox (the §10.2 agent key lives in the sandbox) — GREEN, never fail. `--mock-agent` (CI-only, auto-on under `--ci`) provisions a master-held DEV agent so headless CI can prove the roundtrip; a real §10.2 agent proves it in-sandbox via `phase1-wire-demo.sh --real`. When 11-12 run they ALSO assert the **#229 durable-audit receipt**: fetch response `audit_envelope_hash` → envelope fetchable from `AGENTKEYS_WORKER_AUDIT_URL`, hash = keccak256(cbor) (the appendV2/appendRootV2 anchor commitment), no plaintext — skip reasons `audit-receipt-missing` / `audit-url-unset`.** | `--from/--to/--only-step` / `--mock-agent` |
-| `phase1-wire-demo.sh` | agent-side `agentkeys wire` demo (real memory only — the in-memory `--light` path was removed, #207); **phase 5 of `v2-demo.sh`** — pairs the §10.2 agent in the sandbox so the On-Sandbox proof (`sandbox-agent-isolation.sh`) can run. **v2-demo runs it `--real --webauthn`** so the master grants the agent's `memory:<ns>` scope (Touch ID); the agent's cap service is `memory:<ns>`, so without the grant `memory.get` → `service_not_in_scope`. | `--real` (default) / `--webauthn` |
+| `v2-stage3-demo.sh` | OIDC + per-actor/data-class isolation proof (23 steps; 16–17 = #196 master-self + cross-actor scope; **18 = granted-agent positives — the memory cap AND the #216 cred-fetch cap for the granted service (200), an un-granted cred probe → ServiceNotInScope, and (CI/mock only) the live #216 REVOKE transition: setScope drops the service → the same cred-fetch mint is denied → restore**; **19–21 = #201 Config data-class isolation** — master-self layer-3/4 + cap data-class-mismatch, run on the operator, `skip` until config infra is provisioned/deployed; **22 = #207 classifier-worker isolation** — master-self `cap_op_mismatch` (storage cap → classify worker) + `cap_data_class_mismatch` (cross-data-class Classify cap), compute-gate so NO STS, `skip` until the worker is deployed; **23 = cleanup + summary**). **Steps 11-12 / 14-15 sign STS creds AS the agent: on the operator they `defer` to the sandbox (the §10.2 agent key lives in the sandbox) — GREEN, never fail. `--mock-agent` (CI-only, auto-on under `--ci`) provisions a master-held DEV agent so headless CI can prove the roundtrip; a real §10.2 agent proves it in-sandbox via `phase1-wire-demo.sh --real`. When 11-12 run they ALSO assert the **#229 durable-audit receipt**: fetch response `audit_envelope_hash` → envelope fetchable from `AGENTKEYS_WORKER_AUDIT_URL`, hash = keccak256(cbor) (the appendV2/appendRootV2 anchor commitment), no plaintext — skip reasons `audit-receipt-missing` / `audit-url-unset`.** | `--from/--to/--only-step` / `--mock-agent` |
+| `phase1-wire-demo.sh` | agent-side `agentkeys wire` demo (real memory only — the in-memory `--light` path was removed, #207); **phase 5 of `v2-demo.sh`** — pairs the §10.2 agent in the sandbox so the On-Sandbox proof (`sandbox-agent-isolation.sh`) can run. **v2-demo runs it `--real --webauthn`** so the master grants the agent's `memory:<ns>` + `cred:$SERVICE` scopes (Touch ID); without the grant `memory.get` / `cred fetch` → `service_not_in_scope`. **Step 1.4b stages `~/.agentkeys/harness-env` (0600) in the sandbox** (MCP/broker/cred coordinates + the operator session bearer) so the in-sandbox proofs run from a bare shell, and **1.4c uploads `sandbox-agent-isolation.sh`**. **Phase 4.0 (#216) fetches the LLM key IN-SANDBOX, as the agent** (`agentkeys cred fetch` via its granted cred scope) and plants it into `~/.hermes/.env` without the plaintext leaving the sandbox; host-CLI fetch is the compat fallback, operator env the labelled DEV-only fallback. | `--real` (default) / `--webauthn` |
 | `web-memory-bootstrap.sh` | issue #196 web-memory pre-flight + proof; runbook [`../docs/operator-runbook-web-memory.md`](../docs/operator-runbook-web-memory.md) | `--from/--to/--only-step` |
 | `memory-plant-demo.sh` | plant a proof memory archive through the REAL chain + read-back (the CLI/CI proof of the plant flow the web "⊕ plant prepared memory" button drives); **phase 4 of `v2-demo.sh`**. Plants into **dedicated `demo-*` namespaces** (never the real travel/personal/family) and **always deletes them on exit** (success OR failure, EXIT trap; `KEEP_DEMO_MEMORY=1` keeps), so test memory never leaks into the master's real store — the real prepared archive is planted ONLY by the user (the button), never by a demo or onboarding. Re-testable; idempotent (`--from 4.1`). | `--from-step/--only-step N` / `--ci` |
 | `web-parity-demo.sh` | **phase 6 of `v2-demo.sh`** (NOT a standalone front door) — boots `agentkeys-daemon --ui-bridge` SEEDED with the master's J1 + device via the `--ui-bridge-seed-*` daemon seam (skips re-onboarding) + plants a **dedicated `webparity` probe ns** through the **web** endpoint `POST /v1/master/memory/plant`, **deleted on exit** (success or failure). A 200 proves the daemon's chain (cap-mint → STS → worker → S3) == the agent/harness chain — the web↔harness drift gate. **Step 4 (#214)** additionally polls `GET /v1/agent/pairing/pending` and asserts a well-formed `{requests:[…]}` — the master-side web-pairing route reaches the real broker rendezvous (the full claim→register e2e needs a live §10.2 agent request, exercised agent-side). Reuses phases 1-2's build/chain/broker/master (one daemon boot, no re-bootstrap); real-only. | `--from-step/--only-step N` / `--ci` |
diff --git a/harness/phase1-wire-demo.sh b/harness/phase1-wire-demo.sh
index 85f735e3..fa886bf2 100755
--- a/harness/phase1-wire-demo.sh
+++ b/harness/phase1-wire-demo.sh
@@ -767,6 +767,33 @@ phase1_sandbox() {
       || fail "1.4 mcp server" "did not come up — see /tmp/agentkeys-mcp.log in the sandbox"
   fi
 
+  # 1.4b stage the in-sandbox harness coordinates (#216). ~/.agentkeys/harness-env
+  # (0600) carries the MCP/broker/cred coordinates + the OPERATOR session bearer so
+  # a bare in-sandbox shell can run the deferred proofs (sandbox-agent-isolation.sh:
+  # memory via the MCP, the #216 cred fetch via the broker) and Phase 4.0 can run
+  # `agentkeys cred fetch` AS THE AGENT inside the sandbox. Rewritten each run (the
+  # session bearer is freshly minted at 0.7); the bearer already rides every
+  # env_pfx sbx_exec command, so a 0600 file is the stricter staging, not a new
+  # exposure. All values are single-quote-safe (JWT/URL/ARN/hex/lowercase names).
+  local henv="$SBX_HOME/.agentkeys/harness-env"
+  sbx_exec "umask 077; mkdir -p ~/.agentkeys; printf '%s\n' 'AGENTKEYS_MCP_URL=$MCP_URL_IN_SANDBOX' 'AGENTKEYS_MCP_VENDOR_TOKEN=$VENDOR_TOKEN' 'AGENTKEYS_ACTOR_OMNI=$ACTOR_OMNI' 'AGENTKEYS_OPERATOR_OMNI=$OPERATOR_OMNI' 'AGENTKEYS_DEVICE_KEY_HASH=$DEVICE_KEY_HASH' 'AGENTKEYS_SESSION_BEARER=$SESSION_BEARER' 'AGENTKEYS_BROKER_URL=${BROKER_URL%/}' 'AGENTKEYS_WORKER_CRED_URL=${AGENTKEYS_WORKER_CRED_URL:-}' 'VAULT_ROLE_ARN=${VAULT_ROLE_ARN:-}' 'REGION=${REGION:-us-east-1}' 'CRED_SERVICE=$SERVICE' > '$henv'" >/dev/null
+  if [[ "$(sbx_rc "grep -q '^AGENTKEYS_SESSION_BEARER=.' '$henv'")" == "0" ]]; then
+    ok "1.4b harness env" "staged $henv (0600) — MCP/broker/cred coordinates for the in-sandbox proofs"
+  else
+    fail "1.4b harness env" "could not stage $henv in the sandbox — the in-sandbox cred fetch (4.0) + bare sandbox-agent-isolation.sh runs need it"
+  fi
+  # 1.4c upload the deferred-proof script so a wire-only run (no stage 3) still
+  # leaves the sandbox self-testable: bash $HOME/sandbox-agent-isolation.sh
+  # The upload `path` MUST be absolute — the aiosandbox file API rejects a bare
+  # filename with `[Errno 2] ... ''` (verified live on v1.0.0.152) — and sbx_put's
+  # returned file_path is the only success signal (curl exits 0 on API failure).
+  local iso_dst="$SBX_HOME/sandbox-agent-isolation.sh"
+  if [[ "$(sbx_put "$REPO_ROOT/harness/scripts/sandbox-agent-isolation.sh" "$iso_dst" 2>/dev/null)" == "$iso_dst" ]]; then
+    ok "1.4c isolation test" "uploaded sandbox-agent-isolation.sh — run IN the sandbox: bash \$HOME/sandbox-agent-isolation.sh"
+  else
+    skip "1.4c isolation test" "upload failed (non-fatal) — stage 3 also uploads it"
+  fi
+
   # 1.5 seed the real memory worker. The
   # agent reads this back in Act 1. Idempotent + scope-aware + --webauthn-gated:
   #   a. namespace already has content → skip everything (no Touch ID);
@@ -1058,32 +1085,43 @@ phase4_surprise() {
   log "Phase 4 — the surprise (real Hermes session in the sandbox)"
 
   # 4.0 #216: the agent's LLM key comes from the MASTER'S VAULT (cred-fetch via its
-  # authorized cred scope), NOT an ambient operator env. Resolve VAULT-FIRST; the
-  # $OPENROUTER_API_KEY/$LLM_API_KEY env is a DEV-ONLY fallback (clearly labelled).
-  # The full vault chain is proven headless (master-self) by harness/cred-wire-demo.sh;
-  # the agent-identity fetch here additionally needs (a) the cred scope granted at
-  # pairing (P.3 SEED_SCOPE_SERVICES, --webauthn) and (b) the key already vaulted.
-  local WIRE_KEY="" WIRE_KEY_SRC="" _host_cli=""
-  if [[ -x "$REPO_ROOT/target/release/agentkeys" ]]; then _host_cli="$REPO_ROOT/target/release/agentkeys"
-  elif [[ -x "$REPO_ROOT/target/debug/agentkeys" ]]; then _host_cli="$REPO_ROOT/target/debug/agentkeys"
-  else _host_cli="$(command -v agentkeys 2>/dev/null || true)"; fi
-  if [[ -n "$_host_cli" && -n "${AGENTKEYS_WORKER_CRED_URL:-}" && -n "${VAULT_ROLE_ARN:-}" \
-        && -n "$SESSION_BEARER" && -n "$ACTOR_OMNI" && -n "$OPERATOR_OMNI" && -n "$DEVICE_KEY_HASH" ]]; then
-    local _fetched
-    if _fetched="$("$_host_cli" cred fetch "$SERVICE" \
-          --operator-omni "$OPERATOR_OMNI" --actor-omni "$ACTOR_OMNI" \
-          --device-key-hash "$DEVICE_KEY_HASH" --session-bearer "$SESSION_BEARER" \
-          --broker-url "${BROKER_URL%/}" --cred-url "${AGENTKEYS_WORKER_CRED_URL}" \
-          --vault-role-arn "${VAULT_ROLE_ARN}" --region "${REGION:-us-east-1}" 2>/dev/null)" \
-        && [[ -n "$_fetched" ]]; then
-      WIRE_KEY="$_fetched"; WIRE_KEY_SRC="the master's VAULT (cred:$SERVICE — #216, the agent's authorized key)"
+  # authorized cred scope), NOT an ambient operator env — and the fetch runs IN THE
+  # SANDBOX, by the agent itself: `agentkeys cred fetch` with the 1.4b-staged
+  # coordinates (the broker checks the cred:$SERVICE scope the master granted at
+  # P.3), planted straight into ~/.hermes/.env so the plaintext never leaves the
+  # sandbox (only its sha returns for the log). Fallbacks, clearly labelled:
+  # (b) host-CLI agent-identity fetch (compat — a stale sandbox binary without the
+  # `cred` subcommand), (c) $OPENROUTER_API_KEY/$LLM_API_KEY env (DEV-ONLY). The
+  # headless complement is harness/cred-wire-demo.sh; the standalone in-sandbox
+  # proof (incl. the scope-denial negative) is sandbox-agent-isolation.sh.
+  local WIRE_KEY="" WIRE_KEY_SRC="" WIRE_PLANTED=false _host_cli="" _insbx
+  _insbx="$(sbx_exec "set -a; . ~/.agentkeys/harness-env 2>/dev/null; set +a; export PATH=\$HOME/.local/bin:\$PATH; k=\$(agentkeys cred fetch '$SERVICE' 2>/tmp/cred-fetch.err); if [ -n \"\$k\" ]; then mkdir -p \$HOME/.hermes; ENV=\$HOME/.hermes/.env; touch \"\$ENV\"; grep -v '^OPENROUTER_API_KEY=' \"\$ENV\" > \"\$ENV.tmp\" 2>/dev/null; printf 'OPENROUTER_API_KEY=%s\n' \"\$k\" >> \"\$ENV.tmp\"; mv \"\$ENV.tmp\" \"\$ENV\"; printf 'PLANTED sha=%s len=%s' \"\$(printf %s \"\$k\" | sha256sum 2>/dev/null | cut -c1-12)\" \"\${#k}\"; else tail -c 200 /tmp/cred-fetch.err 2>/dev/null; fi")"
+  if [[ "$_insbx" == PLANTED* ]]; then
+    WIRE_PLANTED=true
+    WIRE_KEY_SRC="the master's VAULT — fetched + planted IN-SANDBOX by the agent (cred:$SERVICE, #216; ${_insbx#PLANTED })"
+  else
+    [[ -n "$_insbx" ]] && log "  4.0 in-sandbox vault fetch unavailable ($(echo "$_insbx" | tr '\n' ' ' | cut -c1-160)) — trying the host-CLI fetch"
+    if [[ -x "$REPO_ROOT/target/release/agentkeys" ]]; then _host_cli="$REPO_ROOT/target/release/agentkeys"
+    elif [[ -x "$REPO_ROOT/target/debug/agentkeys" ]]; then _host_cli="$REPO_ROOT/target/debug/agentkeys"
+    else _host_cli="$(command -v agentkeys 2>/dev/null || true)"; fi
+    if [[ -n "$_host_cli" && -n "${AGENTKEYS_WORKER_CRED_URL:-}" && -n "${VAULT_ROLE_ARN:-}" \
+          && -n "$SESSION_BEARER" && -n "$ACTOR_OMNI" && -n "$OPERATOR_OMNI" && -n "$DEVICE_KEY_HASH" ]]; then
+      local _fetched
+      if _fetched="$("$_host_cli" cred fetch "$SERVICE" \
+            --operator-omni "$OPERATOR_OMNI" --actor-omni "$ACTOR_OMNI" \
+            --device-key-hash "$DEVICE_KEY_HASH" --session-bearer "$SESSION_BEARER" \
+            --broker-url "${BROKER_URL%/}" --cred-url "${AGENTKEYS_WORKER_CRED_URL}" \
+            --vault-role-arn "${VAULT_ROLE_ARN}" --region "${REGION:-us-east-1}" 2>/dev/null)" \
+          && [[ -n "$_fetched" ]]; then
+        WIRE_KEY="$_fetched"; WIRE_KEY_SRC="the master's VAULT (cred:$SERVICE — #216, the agent's authorized key; host-CLI fetch — update the sandbox binary for the in-sandbox path)"
+      fi
     fi
   fi
-  if [[ -z "$WIRE_KEY" && -n "$LLM_API_KEY" ]]; then
+  if [[ -z "$WIRE_KEY" && "$WIRE_PLANTED" != true && -n "$LLM_API_KEY" ]]; then
     WIRE_KEY="$LLM_API_KEY"
     WIRE_KEY_SRC="operator env \$OPENROUTER_API_KEY (DEV fallback — vault cred:$SERVICE unavailable; #216 wants the vault: grant the cred scope + vault the key, see harness/cred-wire-demo.sh)"
   fi
-  if [[ -z "$WIRE_KEY" ]]; then
+  if [[ -z "$WIRE_KEY" && "$WIRE_PLANTED" != true ]]; then
     skip "4.0 hermes llm" "no LLM key — neither a vaulted cred:$SERVICE (the #216 path; proven by harness/cred-wire-demo.sh) nor \$OPENROUTER_API_KEY (dev fallback). Skipping the surprise."
     return
   fi
@@ -1106,7 +1144,11 @@ phase4_surprise() {
   # be single-line: the sandbox /v1/shell/exec rejects multi-line payloads with
   # a silent ErrorObservation. Verified (not masked with || true).
   local env_path='$HOME/.hermes/.env'
-  sbx_exec "ENV=$env_path; grep -v '^OPENROUTER_API_KEY=' \"\$ENV\" > \"\$ENV.tmp\" 2>/dev/null; printf 'OPENROUTER_API_KEY=%s\n' $(printf '%q' "$WIRE_KEY") >> \"\$ENV.tmp\"; mv \"\$ENV.tmp\" \"\$ENV\"" >/dev/null
+  # In-sandbox path (4.0a) already planted the key without it leaving the sandbox;
+  # only the host-fetched / dev-fallback key needs writing from here.
+  if [[ "$WIRE_PLANTED" != true ]]; then
+    sbx_exec "ENV=$env_path; grep -v '^OPENROUTER_API_KEY=' \"\$ENV\" > \"\$ENV.tmp\" 2>/dev/null; printf 'OPENROUTER_API_KEY=%s\n' $(printf '%q' "$WIRE_KEY") >> \"\$ENV.tmp\"; mv \"\$ENV.tmp\" \"\$ENV\"" >/dev/null
+  fi
   if [[ "$(sbx_rc "grep -q '^OPENROUTER_API_KEY=' $env_path")" != "0" ]]; then
     fail "4.0 hermes llm" "could not write OPENROUTER_API_KEY to ~/.hermes/.env"; return
   fi
diff --git a/harness/scripts/sandbox-agent-isolation.sh b/harness/scripts/sandbox-agent-isolation.sh
index cb2bed45..4344d4f6 100755
--- a/harness/scripts/sandbox-agent-isolation.sh
+++ b/harness/scripts/sandbox-agent-isolation.sh
@@ -7,18 +7,37 @@
 # → memory worker → S3 bots/<agent_omni>/memory/. The stage-3 MOCK uses a master-held
 # key (worker plumbing only); THIS uses the genuine sandbox-held key (the real agent).
 #
+# PLUS the #216 cred half: the agent fetches its AUTHORIZED LLM credential from the
+# master's vault (`agentkeys cred fetch` → cap-mint cred-fetch → STS → cred worker →
+# decrypt), gated by the `cred:<service>` scope the master granted at pairing (P.3).
+#   • POSITIVE — the granted service round-trips (non-empty secret; only its
+#     length + sha prefix are logged, never the value).
+#   • NEGATIVE — an UN-granted probe service MUST be denied with
+#     service_not_in_scope: the broker's isServiceInScope gate stands between the
+#     sandbox agent and the vault. Any other outcome (success, or a different
+#     error) is a FAIL — the permission gate is the thing under test.
+#
 # Prereqs — set up by `bash harness/phase1-wire-demo.sh --real` (run on the operator
 # host first): the `agentkeys` binary + the §10.2-paired agent device-session + the
-# MCP server, all inside the sandbox. The stage-3 script UPLOADS this file to the
-# sandbox automatically (to $HOME/sandbox-agent-isolation.sh); you just run it here.
+# MCP server + the 0600 env file `~/.agentkeys/harness-env` (step 1.4b stages the
+# MCP/broker/cred coordinates + the session bearer), all inside the sandbox. Both
+# phase1 and the stage-3 script UPLOAD this file to the sandbox automatically (to
+# $HOME/sandbox-agent-isolation.sh); you just run it here.
 #
 #   bash "$HOME/sandbox-agent-isolation.sh" [namespace]
 set -uo pipefail
 
+# Coordinates staged by phase1-wire-demo.sh step 1.4b (0600). Without it the
+# CLI falls back to whatever AGENTKEYS_* the shell already exports.
+HARNESS_ENV="${HARNESS_ENV:-$HOME/.agentkeys/harness-env}"
+if [ -f "$HARNESS_ENV" ]; then set -a; . "$HARNESS_ENV"; set +a; fi
+
 NS="${1:-${MEMORY_NS:-travel}}"
 AGENT_BIN="${AGENT_BIN:-$(command -v agentkeys 2>/dev/null || echo "$HOME/.local/bin/agentkeys")}"
 [ -x "$AGENT_BIN" ] || { echo "FAIL: no agentkeys binary in the sandbox ($AGENT_BIN) — run 'phase1-wire-demo.sh --real' on the operator host first." >&2; exit 1; }
 
+sha_hex() { { command -v sha256sum >/dev/null 2>&1 && printf '%s' "$1" | sha256sum || printf '%s' "$1" | shasum -a 256; } | awk '{print $1}'; }
+
 content="sandbox-isolation-proof-$$-$(date +%s 2>/dev/null || echo n)"
 echo "== §10.2 agent isolation — the agent signs with its SANDBOX-held key (not the master) ==" >&2
 
@@ -42,4 +61,67 @@ fi
 # enforced at the IAM layer and is already proven by stage-3 steps 4-9. The agent CLI
 # has no way to target another actor's prefix, so this script proves the POSITIVE path
 # (the real agent works) — the negative path is the master/mock IAM test in stage 3.
-echo "== PASS: tested against the sandbox (the real agent), not the master-held mock. ==" >&2
+
+# ─── #216 cred half — the sandbox agent fetches its AUTHORIZED LLM key ───────
+CRED_SERVICE="${CRED_SERVICE:-openrouter}"
+CRED_NEGATIVE_SERVICE="${CRED_NEGATIVE_SERVICE:-cred-ungranted-probe}"
+echo "== #216 cred fetch — the sandbox agent pulls cred:$CRED_SERVICE from the master's vault ==" >&2
+
+if [ -z "${AGENTKEYS_WORKER_CRED_URL:-}" ] || [ -z "${VAULT_ROLE_ARN:-}" ] || [ -z "${AGENTKEYS_SESSION_BEARER:-}" ]; then
+  echo "SKIP: cred coordinates not staged (need AGENTKEYS_WORKER_CRED_URL + VAULT_ROLE_ARN + AGENTKEYS_SESSION_BEARER in $HARNESS_ENV)." >&2
+  echo "      Re-run 'bash harness/phase1-wire-demo.sh --real --webauthn' on the operator host — step 1.4b stages them." >&2
+  echo "== PARTIAL PASS: memory proven; #216 cred half SKIPPED (stale wire staging — re-stage and re-run). ==" >&2
+  exit 0
+fi
+
+# POSITIVE: fetch the service the master authorized at pairing (P.3 grants the bare
+# cred service alongside memory:<ns>). Identity/session come from harness-env via
+# the CLI's clap env fallbacks; the secret never hits argv or the log.
+cred_err="$(mktemp 2>/dev/null || echo "/tmp/cred-fetch.$$.err")"
+if fetched="$("$AGENT_BIN" cred fetch "$CRED_SERVICE" 2>"$cred_err")" && [ -n "$fetched" ]; then
+  echo "OK: agent fetched cred:$CRED_SERVICE from the vault IN-SANDBOX (len=${#fetched}, sha $(sha_hex "$fetched" | cut -c1-12)…) — authorized scope honoured." >&2
+  if [ -n "${EXPECTED_CRED_SHA256:-}" ]; then
+    if [ "$(sha_hex "$fetched")" = "$EXPECTED_CRED_SHA256" ]; then
+      echo "OK: fetched secret sha == EXPECTED_CRED_SHA256 — the exact master-vaulted value round-tripped." >&2
+    else
+      echo "FAIL: fetched secret sha ($(sha_hex "$fetched")) != EXPECTED_CRED_SHA256 — vault round-trip mismatch." >&2
+      rm -f "$cred_err"; exit 1
+    fi
+  fi
+  # Informational: compare against the key the wire planted into Hermes. Match ⇒
+  # Hermes runs on exactly what this agent can independently fetch (vault-wired);
+  # mismatch is NOT a failure (the wire may have used its labelled dev fallback).
+  planted="$(grep '^OPENROUTER_API_KEY=' "$HOME/.hermes/.env" 2>/dev/null | head -1 | sed 's/^OPENROUTER_API_KEY=//')"
+  if [ -n "$planted" ]; then
+    [ "$(sha_hex "$planted")" = "$(sha_hex "$fetched")" ] \
+      && echo "OK: the key wired into Hermes == this fetch — Hermes runs on the vault key." >&2 \
+      || echo "note: Hermes' planted key differs from this fetch — the wire likely used the dev-fallback env key (see phase1-wire-demo Phase 4.0)." >&2
+  fi
+else
+  err="$(tr '\n' ' ' <"$cred_err" | cut -c1-300)"
+  rm -f "$cred_err"
+  if printf '%s' "$err" | grep -qiE 'not.*in.*scope|NotInScope|service_not_in_scope'; then
+    echo "FAIL: cred:$CRED_SERVICE is NOT granted to this agent — the master must authorize it at pairing. Re-run 'bash harness/phase1-wire-demo.sh --real --webauthn' (P.3 grants it, Touch ID). broker: $err" >&2
+  else
+    echo "FAIL: agent cred fetch errored (not a scope denial): $err" >&2
+  fi
+  exit 1
+fi
+
+# NEGATIVE: an un-granted probe service MUST be DENIED by the broker scope gate
+# (isServiceInScope(operator, agent, probe) == false → service_not_in_scope at
+# cap-mint). A success here means the permission gate is broken — fail loud.
+if neg_out="$("$AGENT_BIN" cred fetch "$CRED_NEGATIVE_SERVICE" 2>"$cred_err")" && [ -n "$neg_out" ]; then
+  echo "FAIL: agent fetched UN-granted cred:$CRED_NEGATIVE_SERVICE — the scope gate did not deny an unauthorized service!" >&2
+  rm -f "$cred_err"; exit 1
+fi
+neg_err="$(tr '\n' ' ' <"$cred_err" | cut -c1-300)"
+rm -f "$cred_err"
+if printf '%s' "$neg_err" | grep -qiE 'not.*in.*scope|NotInScope|service_not_in_scope'; then
+  echo "OK: un-granted cred:$CRED_NEGATIVE_SERVICE denied with service_not_in_scope — the authorization gate stands between the agent and the vault." >&2
+else
+  echo "FAIL: un-granted fetch was rejected for the WRONG reason (want service_not_in_scope): $neg_err" >&2
+  exit 1
+fi
+
+echo "== PASS: tested against the sandbox (the real agent), not the master-held mock — memory roundtrip + #216 authorized cred fetch + scope-denial negative. ==" >&2
diff --git a/harness/v2-stage3-demo.sh b/harness/v2-stage3-demo.sh
index 81fae580..fed21840 100755
--- a/harness/v2-stage3-demo.sh
+++ b/harness/v2-stage3-demo.sh
@@ -604,9 +604,21 @@ upload_sandbox_isolation_test() {
   local script="$REPO_ROOT/harness/scripts/sandbox-agent-isolation.sh"
   [ -f "$script" ] || return 0
   curl -fsS --max-time 8 "$sbx/healthz" >/dev/null 2>&1 || curl -fsS --max-time 8 "$sbx/v1/sandbox" >/dev/null 2>&1 || return 0
-  if curl -sS --max-time 30 -X POST "$sbx/v1/file/upload" -F "file=@$script" -F "path=sandbox-agent-isolation.sh" >/dev/null 2>&1; then
+  # The upload `path` MUST be absolute (a bare filename gets `[Errno 2] ... ''`
+  # from the aiosandbox file API — verified live on v1.0.0.152, where this
+  # function silently no-op'd for months because curl exits 0 on an API-level
+  # failure). Resolve the sandbox $HOME via the shell API and CHECK the body.
+  local sbx_home resp
+  sbx_home=$(curl -sS --max-time 15 -X POST "$sbx/v1/shell/exec" -H 'content-type: application/json' \
+    -d '{"command":"printf %s \"$HOME\""}' 2>/dev/null | jq -r '.data.output // empty') || sbx_home=""
+  [ -n "$sbx_home" ] || return 0
+  resp=$(curl -sS --max-time 30 -X POST "$sbx/v1/file/upload" -F "file=@$script" \
+    -F "path=$sbx_home/sandbox-agent-isolation.sh" 2>/dev/null) || resp=""
+  if echo "$resp" | jq -e '(.success // .data.success) == true' >/dev/null 2>&1; then
     SANDBOX_TEST_UPLOADED=1
     info "uploaded sandbox-agent-isolation.sh → the sandbox ($sbx). REAL agent test (sandbox-held key) runs THERE: bash \$HOME/sandbox-agent-isolation.sh"
+  else
+    info "sandbox-agent-isolation.sh upload to $sbx did NOT confirm ($(echo "$resp" | tr '\n' ' ' | cut -c1-120)) — run the wire phase (its 1.4c uploads it) or copy it in manually"
   fi
 }
 
@@ -1250,8 +1262,12 @@ fi
 # is honoured (delegation works). Stands ALONE (no STS/worker roundtrip): the cap-mint
 # is operator-authenticated (mint_cap sends session.jwt), so it needs NO agent key —
 # only the agent's on-chain device + the grant.
+# PLUS the #216 cred-side triad on the same identity: a CRED-FETCH cap for the
+# granted service → 200, an un-granted probe → ServiceNotInScope, and (CI/mock
+# only) the live REVOKE transition (setScope drops the service → the same mint is
+# denied → restore) — "revoking the cred scope cuts the agent off", enforced.
 if should_run_step 18; then
-  step "POSITIVE: granted agent (operator!=actor) mints memory cap for the GRANTED service → 200"
+  step "POSITIVE: granted agent (operator!=actor) mints memory + #216 cred-fetch caps for the GRANTED service → 200 (+ un-granted/revoked denials)"
   [ -f "$STATE_DIR/session.jwt" ] || die "no session.jwt — re-run step 1"
   # CI mocks the §10.2 agent with a master-held, scope-granted dev agent; the operator's
   # real agent carries its device + grant on chain (stage-1 / sandbox pairing).
@@ -1280,6 +1296,78 @@ if should_run_step 18; then
       if [ "$rc" = "200" ]; then
         ok "granted agent (actor $pg_actor != operator 0x$OWN_ACTOR_OMNI) minted a memory cap for delegated service '$SMOKE_SERVICE' — isServiceInScope honoured"
         record_ok "granted-agent positive: memory cap minted for delegated service '$SMOKE_SERVICE' (operator!=actor, HTTP 200)"
+        # ── #216 cred-side scope triad — same agent identity, the CRED-FETCH cap
+        # route (the cap `agentkeys cred fetch` mints so the agent can pull its
+        # vaulted LLM key):
+        #   (i)   granted service  → 200 (the broker authorizes the delegated fetch)
+        #   (ii)  un-granted probe → ServiceNotInScope (an agent can't reach vault
+        #         entries the master never authorized)
+        #   (iii) CI-only REVOKE transition — setScope WITHOUT the service → the
+        #         same mint now denied (no stale scope caching), then restore.
+        # The operator path runs (i)+(ii) only: mutating scope is a Touch ID
+        # ceremony, and the live denial predicate is covered in-sandbox by
+        # sandbox-agent-isolation.sh's scope-denial negative.
+        rc=$(mint_cap cred-fetch "$pg_body")
+        body=$(cat /tmp/cap.$$.json 2>/dev/null || true); rm -f /tmp/cap.$$.json
+        if [ "$rc" = "200" ]; then
+          ok "#216 cred-fetch cap minted for GRANTED service '$SMOKE_SERVICE' (operator!=actor) — the agent may fetch its authorized cred"
+          record_ok "#216 cred-fetch cap positive (granted service, HTTP 200)"
+        else
+          die "#216 cred-fetch cap for the GRANTED service '$SMOKE_SERVICE' returned HTTP $rc — body: $body"
+        fi
+        neg_svc="${CRED_NEGATIVE_SERVICE:-cred-ungranted-probe}"
+        neg_body=$(jq -n --arg op "0x$OWN_ACTOR_OMNI" --arg actor "$pg_actor" \
+                         --arg svc "$neg_svc" --arg dkh "$pg_dkh" \
+           '{operator_omni:$op, actor_omni:$actor, service:$svc, device_key_hash:$dkh}')
+        rc=$(mint_cap cred-fetch "$neg_body")
+        body=$(cat /tmp/cap.$$.json 2>/dev/null || true); rm -f /tmp/cap.$$.json
+        if [ "$rc" = "200" ]; then
+          die "#216 REGRESSION: cred-fetch cap minted for UN-granted service '$neg_svc' — the scope gate did not deny an unauthorized cred"
+        elif echo "$body" | grep -qiE "not.*scope|NotInScope|service_not_in_scope"; then
+          ok "#216 cred-fetch cap for un-granted '$neg_svc' denied with ServiceNotInScope"
+          record_ok "#216 cred-fetch cap negative (un-granted service rejected)"
+        else
+          die "#216 cred-fetch negative returned unexpected HTTP $rc (want ServiceNotInScope) — body: $body"
+        fi
+        if [ "$MOCK_AGENT" = 1 ]; then
+          # (iii) #216 acceptance: "revoke the cred scope → the agent loses the
+          # key". setScope REPLACES the services list (set-replace), so revoke =
+          # re-set without the service; restore = re-set with it. heima-scope-set
+          # is idempotent + master-model-aware (#250: account masters route via
+          # erc4337-master-exec — the CI software passkey signs headlessly).
+          # Self-healing: if the restore is interrupted, the next run's
+          # ensure_mock_agent re-grants (its getScope pre-check sees the drift).
+          pg_label=$(jq -r '.label // "demo-agent-dev"' "$pg_file")
+          profile_uc=$(printf '%s' "${AGENTKEYS_CHAIN:-heima}" | tr 'a-z-' 'A-Z_')
+          scope_addr=$(eval "echo \${SCOPE_CONTRACT_ADDRESS_${profile_uc}:-}")
+          if [ -z "$scope_addr" ] || [ "$scope_addr" = 0x0 ]; then
+            prereq_missing scope-not-set "no AgentKeysScope address in env — cannot run the #216 revoke transition" || true
+          else
+            revoke_json=$(bash "$REPO_ROOT/scripts/heima-scope-set.sh" --agent "$pg_label" \
+              --services "stage3-cred-revoked-placeholder" --scope-address "$scope_addr" | tail -1) || revoke_json=""
+            if ! echo "$revoke_json" | jq -e '.ok==true and ((.skipped // "") == "" or .skipped=="already-set")' >/dev/null 2>&1; then
+              prereq_missing scope-not-set "#216 revoke transition: setScope (revoke) did not land — $(echo "$revoke_json" | tr '\n' ' ' | cut -c1-160)" || true
+            else
+              rc=$(mint_cap cred-fetch "$pg_body")
+              body=$(cat /tmp/cap.$$.json 2>/dev/null || true); rm -f /tmp/cap.$$.json
+              if [ "$rc" = "200" ]; then
+                die "#216 REGRESSION: cred-fetch cap STILL minted after the scope was revoked — stale scope state at the broker"
+              elif echo "$body" | grep -qiE "not.*scope|NotInScope|service_not_in_scope"; then
+                ok "#216 revoke transition: after setScope dropped '$SMOKE_SERVICE', the same cred-fetch mint is denied (ServiceNotInScope) — revoke cuts the agent off"
+                record_ok "#216 revoke transition (revoked service rejected live)"
+              else
+                die "#216 post-revoke cred-fetch returned unexpected HTTP $rc — body: $body"
+              fi
+              restore_json=$(bash "$REPO_ROOT/scripts/heima-scope-set.sh" --agent "$pg_label" \
+                --services "$SMOKE_SERVICE" --scope-address "$scope_addr" | tail -1) || restore_json=""
+              echo "$restore_json" | jq -e '.ok==true' >/dev/null 2>&1 \
+                && info "#216 revoke transition: scope restored to '$SMOKE_SERVICE' for '$pg_label'" \
+                || info "#216 revoke transition: restore did not confirm (next run's ensure_mock_agent self-heals) — $(echo "$restore_json" | tr '\n' ' ' | cut -c1-120)"
+            fi
+          fi
+        else
+          info "#216 revoke transition: operator run — skipped (scope mutation is a Touch ID ceremony); the denial predicate is covered by (ii) here and by the in-sandbox negative"
+        fi
       elif echo "$body" | grep -qiE "not.*scope|NotInScope|service_not_in_scope"; then
         prereq_missing scope-not-set "agent scope for '$SMOKE_SERVICE' not granted on chain — run \`bash harness/v2-stage1-demo.sh --webauthn\` (step 13 setScope) first. body: $body" || true
       elif echo "$body" | grep -qiE "DeviceNotActive|device.*not.*active|DeviceBindingMismatch|binding.*mismatch|DeviceRoleMissing|role_missing"; then

From 130fc725fd7872c75c3476ee2cb500a03fd56312 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 12:10:39 +0800
Subject: [PATCH 02/17] =?UTF-8?q?feat:=20#216=20v2-demo=20phase=205=20runs?=
 =?UTF-8?q?=20in=20CI=20=E2=80=94=20mock-wire-demo.sh=20emulates=20the=20a?=
 =?UTF-8?q?iosandbox=20side=20on=20the=20runner?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

CI used to set --wire none, so the entire post-wire agent runtime (the
MCP server + agentkeys cred fetch + the hook path) ran nowhere headless
— stage-3 steps 11-12 cover raw worker curls, not the runtime the
sandbox agent actually uses.

- NEW harness/mock-wire-demo.sh (phase 5 under --ci / --wire mock):
  ensure the sanctioned mock agent + the canonical scope grant → mint
  operator + agent sessions headless (wallet_sig SIWE) → boot the REAL
  agentkeys-mcp-server on localhost (http backend + per-actor STS
  relay, the phase1 1.4 shape) → master-self vault a probe cred under
  the DEDICATED mock-wire-llm service (never openrouter — can't clobber
  a real vault entry) → run THE SAME sandbox-agent-isolation.sh the
  sandbox runs, with EXPECTED_CRED_SHA256: memory roundtrip through the
  MCP + the #216 authorized cred fetch (sha-exact) + the un-granted
  scope denial. Key custody stays operator-only by design.
- v2-demo.sh: --wire gains 'mock' (CI default; 'none' stays the
  explicit off), WIRE_RESULT=mocked in the summary.
- v2-stage3-demo.sh: ONE canonical mock-agent grant MOCK_SCOPE_SERVICES
  (openrouter + memory:ci-wire-proof + mock-wire-llm), used by
  ensure_mock_agent AND the step-18 revoke restore — phases 3 and 5 no
  longer flip-flop setScope every CI run (set-replace semantics).
- _lib.sh: + wallet_sig_mint_jwt (the shared headless SIWE session
  primitive; temp-file based — jq chokes on multi-line SIWE in vars).
- harness-ci.yml: comments/step name now say phases 1-6 with phase 5
  mocked (the workflow already passes --ci; behavior switches with the
  new default). The build step already builds agentkeys-mcp-server.

Verified locally: bash -n all, YAML parse, fixture gate green, the MCP
server boots with the exact arg shape (healthz ok), and
'v2-demo --stage 5 --wire mock' dispatches preflight → mock-wire-demo
→ fails LOUD at the no-master chain gate (this laptop's registry has
no master — correct; CI's software master is registered).

Docs synced: harness/CLAUDE.md (rule 5, CI role, inventory rows) +
operator-runbook-harness.md (On CI, flag table, phase-5 bullet, role
mapping).
---
 .github/workflows/harness-ci.yml |  23 ++--
 docs/operator-runbook-harness.md |  26 ++--
 harness/CLAUDE.md                |  21 ++-
 harness/mock-wire-demo.sh        | 216 +++++++++++++++++++++++++++++++
 harness/scripts/_lib.sh          |  37 ++++++
 harness/v2-demo.sh               |  31 +++--
 harness/v2-stage3-demo.sh        |  14 +-
 7 files changed, 332 insertions(+), 36 deletions(-)
 create mode 100755 harness/mock-wire-demo.sh

diff --git a/.github/workflows/harness-ci.yml b/.github/workflows/harness-ci.yml
index 08cf4ec2..5e21465a 100644
--- a/.github/workflows/harness-ci.yml
+++ b/.github/workflows/harness-ci.yml
@@ -1,9 +1,11 @@
 name: harness CI (no LLM)
 
 # Issue #66: deterministic, no-LLM, no-WebAuthn CI that runs the SAME
-# production harness orchestrator (harness/v2-demo.sh --ci → phases 1-4 + 6;
-# phase 5/wire is the only phase CI can't run — no aiosandbox — so --ci sets
-# --wire none) against a parallel TEST instance of the production environment.
+# production harness orchestrator (harness/v2-demo.sh --ci → phases 1-6;
+# phase 5/wire has no aiosandbox here, so --ci sets --wire mock —
+# mock-wire-demo.sh emulates the sandbox side ON the runner: the real MCP
+# server + the #216 cred fetch via the master-held mock agent) against a
+# parallel TEST instance of the production environment.
 # (Was three separate v2-stage{1,2,3}-demo.sh steps; switched to the whole
 # orchestrator so CI also covers phase 4 (memory-plant) + phase 6 (web-parity —
 # the daemon web-chain runtime proof the #200 restructure added but never wired
@@ -920,21 +922,26 @@ jobs:
           AGENTKEYS_HEALTH_OPTIONAL="config classify" \
             bash scripts/wait-stack-healthy.sh
 
-      - name: v2-demo on Heima mainnet — phases 1-4 + 6 (wire/phase-5 skipped, no sandbox)
+      - name: v2-demo on Heima mainnet — phases 1-6 (phase 5 = mock-sandbox wire on the runner)
         # Run the WHOLE orchestrator (harness/v2-demo.sh) rather than the three
         # v2-stage{1,2,3}-demo.sh in isolation, so CI also covers the phases the
         # #200 v2-demo restructure added but never wired into CI:
         #   - phase 4 (memory-plant-demo.sh): the master plants its own memory
         #     through the real chain + read-back.
+        #   - phase 5 (mock-wire-demo.sh via `--wire mock`, auto under --ci): the
+        #     aiosandbox side EMULATED on the runner — boots the real
+        #     agentkeys-mcp-server with the master-held mock agent's identity,
+        #     master-self vaults a probe cred, then runs the SAME
+        #     sandbox-agent-isolation.sh the sandbox runs (memory via MCP + the
+        #     #216 authorized cred fetch + the un-granted scope denial). Proves
+        #     the post-wire agent RUNTIME headless; the real §10.2 sandbox key
+        #     custody remains operator-only.
         #   - phase 6 (web-parity-demo.sh): the daemon's WEB endpoint
         #     POST /v1/master/memory/plant → real chain (cap-mint → STS → worker
         #     → S3). This is the ONLY runtime proof of the parent-control app's
         #     path — stage 3 exercises the CLI/curl path, not the daemon ui-bridge.
         #     (The #203 check-web-api-drift.sh gate in rust-checks covers its
         #     SHAPE at compile/fixture time; THIS step covers runtime reachability.)
-        # Phase 5 (wire) is the §10.2 agent inside the aiosandbox — CI is headless
-        # with no sandbox, so `--ci` auto-sets `--wire none` (the one phase that
-        # genuinely can't run here).
         #
         # `--ci` threads to every phase (v2-demo run_phase): stage 1 auto-skips
         # deploy/email/provision (PRE-PROVISIONED infra — contracts pinned in
@@ -966,7 +973,7 @@ jobs:
           ARGS=(--ci --allow-skip=scope-not-set,config-role-missing,config-worker-unreachable,classify-not-configured,classify-worker-unavailable)
           case "${STAGE:-}" in
             1|2|3) ARGS+=(--stage "$STAGE") ;;
-            *)     ;;   # all / empty → full phases 1-4 + 6 (phase 5/wire auto-skipped)
+            *)     ;;   # all / empty → full phases 1-6 (phase 5 = the mock-sandbox wire)
           esac
           AGENTKEYS_CHAIN=heima bash harness/v2-demo.sh "${ARGS[@]}"
 
diff --git a/docs/operator-runbook-harness.md b/docs/operator-runbook-harness.md
index ecc4a883..db64608f 100644
--- a/docs/operator-runbook-harness.md
+++ b/docs/operator-runbook-harness.md
@@ -71,16 +71,21 @@ CI has **no Touch ID and no sandbox**, so one flag switches to the software regi
 a **mock** agent + tolerate-skips:
 
 ```bash
-bash harness/v2-demo.sh --ci   # software register, mock agent, tolerate prereq skips; wire OFF (no sandbox in CI)
+bash harness/v2-demo.sh --ci   # software register, mock agent, tolerate prereq skips; wire MOCKED on the runner
 ```
 
 `--ci` (or the runner's `$CI`) ⇒ `--signer software` + `--mock-agent` + `--allow-skip`
 semantics + **stage-1 auto-skips deploy/email/provision** (CI runs against
 pre-provisioned infra — contracts pinned in secrets, identity via wallet_sig, the
-vault/memory buckets+roles an operator one-shot). The mock agent tests the worker
-**plumbing only** — not the real §10.2 agent (that's the sandbox run above). This is
-exactly what `harness-ci.yml` runs: `v2-demo.sh --ci` → phases 1–4 + 6 (phase 5/wire
-is the only one CI can't do — no aiosandbox).
+vault/memory buckets+roles an operator one-shot). This is exactly what
+`harness-ci.yml` runs: `v2-demo.sh --ci` → **phases 1–6, with phase 5 MOCKED**:
+`mock-wire-demo.sh` emulates the aiosandbox side ON the runner — it boots the real
+`agentkeys-mcp-server` with the mock agent's identity, master-self vaults a probe
+cred, and runs the **same `sandbox-agent-isolation.sh`** the sandbox runs (memory
+roundtrip through the MCP + the #216 authorized cred fetch, sha-exact, + the
+un-granted scope denial). The mock proves the post-wire agent **runtime**, not the
+real §10.2 **key custody** — the agent key is master-held; the sandbox run above
+remains the only proof that the key never leaves the sandbox.
 
 ---
 
@@ -140,6 +145,9 @@ is the only one CI can't do — no aiosandbox).
   via the granted cred scope, coordinates from the 1.4b-staged `~/.agentkeys/harness-env`) and plants
   it into Hermes without the plaintext leaving the sandbox — the operator env key is only a labelled
   DEV fallback. The **only** real-memory proof (real-only — the in-memory `--light` path was removed, #207).
+  **On CI** phase 5 runs as `mock-wire-demo.sh` instead (`--wire mock`, auto under `--ci`): the runner
+  emulates the sandbox side with the master-held mock agent and runs the same proof script — the
+  post-wire runtime headless, every PR.
 - **phase 6 — web↔agent parity** (`web-parity-demo.sh`): boots `agentkeys-daemon --ui-bridge` (seeded
   with the master's J1 + device via the `--ui-bridge-seed-*` seam, so it skips re-onboarding) and
   plants a dedicated `webparity` probe namespace through the **web** endpoint
@@ -228,7 +236,8 @@ on-chain anchor itself is exercised by `scripts/heima-worker-smoke.sh` (stage-2
 
 | Flag | Effect |
 |---|---|
-| `--ci` | software register + auto `--mock-agent` + tolerate prereq `skip`s. Sets `AGENTKEYS_CI=1` (the runner's `$CI` also triggers it). |
+| `--ci` | software register + auto `--mock-agent` + tolerate prereq `skip`s + **phase 5 → `--wire mock`** (the runner-local mock-sandbox proof). Sets `AGENTKEYS_CI=1` (the runner's `$CI` also triggers it). |
+| `--wire real\|mock\|none` | (v2-demo) force the wire phase: `real` = the sandbox wire (`phase1-wire-demo.sh`), `mock` = the CI mock-sandbox proof (`mock-wire-demo.sh`, auto under `--ci`), `none` = intentionally off. |
 | `--mock-agent` | (stage 3) mock the sandbox agent with a master-held DEV agent. Auto-applied by `v2-demo.sh`; only needed when running `v2-stage3-demo.sh` directly. |
 | `--allow-skip=<reason>` | (stage 3) opt a prereq into `skip` not `fail`. NOT a release gate. |
 | `--signer software` | force the file-key register signer directly. |
@@ -338,8 +347,9 @@ Rules for any agent (human or AI) working **on** the harness:
   real sandbox). Flags are CI/dev only. Don't add operator-facing flags — prefer auto-detect
   (Cargo's own incremental build, sandbox auto-detect, idempotent skips).
 - **Run-mode mapping (the three roles above):** operator = no flag; `--ci` = software register
-  + mock agent + tolerate skips; **sandbox** = the agent-side tests run *in* the sandbox
-  (the master never signs for a sandbox-held key). The mock is plumbing-only, never the real agent.
+  + mock agent + tolerate skips + the phase-5 **mock-sandbox wire** (`mock-wire-demo.sh` on the
+  runner); **sandbox** = the agent-side tests run *in* the sandbox (the master never signs for a
+  sandbox-held key). The mock proves runtime plumbing, never the real agent's key custody.
 - **Keep the docs in sync — every time a harness script changes** (new flag, new step,
   renamed script, changed default), update **this runbook AND [`../harness/CLAUDE.md`](../harness/CLAUDE.md)
   in the same change.** A script change without the doc update is incomplete.
diff --git a/harness/CLAUDE.md b/harness/CLAUDE.md
index dac5773f..dcf3cf94 100644
--- a/harness/CLAUDE.md
+++ b/harness/CLAUDE.md
@@ -57,8 +57,12 @@ CLAUDE.md runbook-fix-fold-back policy, applied to every harness edit, not just
    from env / flags / `operator-workstation.env` / `agentkeys chain show`, never
    baked in. Temporary exceptions go in [`../hardcoded.md`](../hardcoded.md). **The one
    sanctioned synthetic agent:** CI may provision a **mock agent** (a master-held DEV
-   agent, `demo-agent-dev`) for the agent-side wiring steps (stage 3 11-12) — CI has no
-   sandbox, so the real §10.2 agent can't sign. That mock is **CI-only** (`--ci` /
+   agent, `demo-agent-dev`) for the agent-side wiring steps (stage 3 11-12) AND the
+   phase-5 CI mock-sandbox wire proof (`mock-wire-demo.sh`) — CI has no sandbox, so
+   the real §10.2 agent can't sign. Its grant is ONE canonical list,
+   `MOCK_SCOPE_SERVICES` (= `openrouter,memory:ci-wire-proof,mock-wire-llm` by
+   default), defined identically in `v2-stage3-demo.sh` + `mock-wire-demo.sh` so the
+   two phases never flip-flop setScope. That mock is **CI-only** (`--ci` /
    `--mock-agent`); operators **never** mock — they `defer` those steps to the sandbox.
 6. **Deployer key via `_lib.sh`.** Source `harness/scripts/_lib.sh` and use
    `resolve_master_key` (raw-hex / mnemonic / `~/.agentkeys/heima-deployer.key`)
@@ -233,9 +237,13 @@ Every orchestrator + the operator runbook MUST keep this split exact:
   sanctioned synthetic agent, contract rule 5), and **stage-1 auto-skips
   deploy/email/provision** (CI runs against pre-provisioned infra — contracts pinned,
   wallet_sig identity, buckets/roles an operator one-shot). Tolerates prereq skips.
-  `harness-ci.yml` runs the WHOLE orchestrator — **`v2-demo.sh --ci` → phases 1–4 + 6**
-  (phase 5/wire auto-skips: no aiosandbox). So phase 6 (the daemon web-chain runtime
-  proof) IS exercised in CI; the only phase CI can't run is the sandbox-bound wire.
+  `harness-ci.yml` runs the WHOLE orchestrator — **`v2-demo.sh --ci` → phases 1–6,
+  with phase 5 MOCKED**: `mock-wire-demo.sh` emulates the aiosandbox side ON the
+  runner (boots the real `agentkeys-mcp-server` with the mock agent's identity +
+  per-actor STS relay, master-self vaults a probe cred, then runs the SAME
+  `sandbox-agent-isolation.sh` the sandbox runs — memory-via-MCP + the #216 cred
+  fetch + the scope-denial negative). What the mock cannot prove is sandbox key
+  custody — that stays the operator's On-Sandbox run.
 
 **Fresh-ceremony / re-testable rule:** an operator run must EXERCISE the ceremony (Touch ID),
 not silently skip it — never let a re-run look "tested" while the biometric never fired.
@@ -250,7 +258,7 @@ sandbox) is **GREEN**, never fail/incomplete.
 
 | Script | Goal | Entry |
 |---|---|---|
-| **`v2-demo.sh`** | **THE single entry point — no flags = phases 1→2→3→4 (memory plant)→5 (wire)→6 (web↔agent parity); wire auto-runs when the aiosandbox is up, else reports INCOMPLETE + exits non-zero (an unexecuted proof is never green — pass `--wire none` to intentionally skip); fail-fast. `PHASE.STEP` addressing (`--from 4.1`, `--only 3.11`). Flags are CI/scoping only.** | (no flags) / `--ci` / `--stage N` / `--from P.S` / `--only P.S` / `--wire real\|light\|none` |
+| **`v2-demo.sh`** | **THE single entry point — no flags = phases 1→2→3→4 (memory plant)→5 (wire)→6 (web↔agent parity); wire auto-runs when the aiosandbox is up, else reports INCOMPLETE + exits non-zero (an unexecuted proof is never green — pass `--wire none` to intentionally skip); fail-fast. Under `--ci` phase 5 is MOCKED, not skipped: `mock-wire-demo.sh` runs the post-wire agent runtime on the runner. `PHASE.STEP` addressing (`--from 4.1`, `--only 3.11`). Flags are CI/scoping only.** | (no flags) / `--ci` / `--stage N` / `--from P.S` / `--only P.S` / `--wire real\|mock\|none` |
 | `v2-stage1-demo.sh` | M1 foundation demo | `--only-step N` |
 | `v2-stage2-demo.sh` | hardening demo | `--only-step N` |
 | `v2-stage3-demo.sh` | OIDC + per-actor/data-class isolation proof (23 steps; 16–17 = #196 master-self + cross-actor scope; **18 = granted-agent positives — the memory cap AND the #216 cred-fetch cap for the granted service (200), an un-granted cred probe → ServiceNotInScope, and (CI/mock only) the live #216 REVOKE transition: setScope drops the service → the same cred-fetch mint is denied → restore**; **19–21 = #201 Config data-class isolation** — master-self layer-3/4 + cap data-class-mismatch, run on the operator, `skip` until config infra is provisioned/deployed; **22 = #207 classifier-worker isolation** — master-self `cap_op_mismatch` (storage cap → classify worker) + `cap_data_class_mismatch` (cross-data-class Classify cap), compute-gate so NO STS, `skip` until the worker is deployed; **23 = cleanup + summary**). **Steps 11-12 / 14-15 sign STS creds AS the agent: on the operator they `defer` to the sandbox (the §10.2 agent key lives in the sandbox) — GREEN, never fail. `--mock-agent` (CI-only, auto-on under `--ci`) provisions a master-held DEV agent so headless CI can prove the roundtrip; a real §10.2 agent proves it in-sandbox via `phase1-wire-demo.sh --real`. When 11-12 run they ALSO assert the **#229 durable-audit receipt**: fetch response `audit_envelope_hash` → envelope fetchable from `AGENTKEYS_WORKER_AUDIT_URL`, hash = keccak256(cbor) (the appendV2/appendRootV2 anchor commitment), no plaintext — skip reasons `audit-receipt-missing` / `audit-url-unset`.** | `--from/--to/--only-step` / `--mock-agent` |
@@ -258,6 +266,7 @@ sandbox) is **GREEN**, never fail/incomplete.
 | `web-memory-bootstrap.sh` | issue #196 web-memory pre-flight + proof; runbook [`../docs/operator-runbook-web-memory.md`](../docs/operator-runbook-web-memory.md) | `--from/--to/--only-step` |
 | `memory-plant-demo.sh` | plant a proof memory archive through the REAL chain + read-back (the CLI/CI proof of the plant flow the web "⊕ plant prepared memory" button drives); **phase 4 of `v2-demo.sh`**. Plants into **dedicated `demo-*` namespaces** (never the real travel/personal/family) and **always deletes them on exit** (success OR failure, EXIT trap; `KEEP_DEMO_MEMORY=1` keeps), so test memory never leaks into the master's real store — the real prepared archive is planted ONLY by the user (the button), never by a demo or onboarding. Re-testable; idempotent (`--from 4.1`). | `--from-step/--only-step N` / `--ci` |
 | `web-parity-demo.sh` | **phase 6 of `v2-demo.sh`** (NOT a standalone front door) — boots `agentkeys-daemon --ui-bridge` SEEDED with the master's J1 + device via the `--ui-bridge-seed-*` daemon seam (skips re-onboarding) + plants a **dedicated `webparity` probe ns** through the **web** endpoint `POST /v1/master/memory/plant`, **deleted on exit** (success or failure). A 200 proves the daemon's chain (cap-mint → STS → worker → S3) == the agent/harness chain — the web↔harness drift gate. **Step 4 (#214)** additionally polls `GET /v1/agent/pairing/pending` and asserts a well-formed `{requests:[…]}` — the master-side web-pairing route reaches the real broker rendezvous (the full claim→register e2e needs a live §10.2 agent request, exercised agent-side). Reuses phases 1-2's build/chain/broker/master (one daemon boot, no re-bootstrap); real-only. | `--from-step/--only-step N` / `--ci` |
+| `mock-wire-demo.sh` | **CI mock-sandbox wire proof (#216) — CI-ONLY.** Emulates the aiosandbox side ON the runner with the sanctioned mock agent: ensure agent + the canonical `MOCK_SCOPE_SERVICES` grant → mint operator + agent sessions (wallet_sig SIWE via `_lib.sh::wallet_sig_mint_jwt`) → boot the REAL `agentkeys-mcp-server` on `127.0.0.1:$MOCK_MCP_PORT` (http backend + per-actor STS relay, the phase1 1.4 shape) → master-self vault a probe cred under the DEDICATED `mock-wire-llm` service (never `openrouter` — can't clobber a real vault entry) → run the SAME `sandbox-agent-isolation.sh` with a staged harness-env + `EXPECTED_CRED_SHA256`. Proves the post-wire agent RUNTIME (MCP server + CLI) headless; sandbox key custody stays operator-only. **Phase 5 of `v2-demo.sh` under `--ci`** (`--wire mock`). Idempotent; MCP + temp files torn down on EXIT. | `--from-step/--only-step N` |
 | `cred-fetch-demo.sh` | **#216 agent-side vaulted-key fetch, real e2e** (standalone). A master **vaults** a probe credential via the daemon (web path: cap-mint cred-store → STS → cred worker → S3), then the **agent** fetches it back with `agentkeys cred fetch` (CLI path: cap-mint cred-fetch → STS → cred worker → **decrypt**), asserting the EXACT secret round-trips. Proves the cred half of "the agent uses the key the master authorized it to use" (the Hermes wire is phase1-wire #216 Phase 4.0). Routes through the shared `agentkeys-backend-client` (no re-typed shapes, #204). Idempotent (a FIXED `cred-e2e-probe` service is overwritten each run — never accumulates); daemon killed on exit; real-only. | `--from-step/--only-step N` / `--ci` |
 | `cred-wire-demo.sh` | **#216 agent-side wire, the FULL e2e** (standalone, headless). Extends `cred-fetch-demo.sh` through the Hermes wire: master vaults the LLM key → **agent cred-fetches it** → **plant into the sandbox Hermes** (`~/.hermes/.env` + `hermes config set model.*`) → **Hermes runs on the vault key** (real LLM smoke), asserting the planted key == the vaulted key (sha) with **no `OPENROUTER_API_KEY` in the agent env**. The durable, no-Touch-ID complement to `phase1-wire-demo.sh` Phase 4.0b. Needs a reachable aiosandbox (`SANDBOX_URL`, default `:8080`) with Hermes installed. Idempotent (FIXED `openrouter` service; `.env` key-line rewritten not appended); daemon killed on exit; real-only. | `--from-step/--only-step N` / `--ci` |
 | `sandbox-build-push.sh` | **Path-A binary provisioner (utility, not a stage demo).** Cross-builds the agent binaries (`agentkeys` + `agentkeys-mcp-server` + `agentkeys-daemon`) for the sandbox's aarch64-Linux arch in the cached arm64 builder image (sharing phase1-wire-demo.sh's exact `agentkeys-sandbox-builder` image + `agentkeys-sandbox-*` cargo/target volumes → a warm tree re-pushes in seconds) and uploads them to the sandbox's `~/.local/bin` via the file API. **Build + push ONLY** — it never pairs or wires (that's the master's job in the parent-control web UI). Re-run after any local code change so the in-sandbox agent runs current source. | `SANDBOX_URL` / `RUST_BUILD_IMAGE` / `CROSS_RUST_TOOLCHAIN` |
diff --git a/harness/mock-wire-demo.sh b/harness/mock-wire-demo.sh
new file mode 100755
index 00000000..765bf1b5
--- /dev/null
+++ b/harness/mock-wire-demo.sh
@@ -0,0 +1,216 @@
+#!/usr/bin/env bash
+# harness/mock-wire-demo.sh — CI mock-sandbox wire proof (#216). CI-ONLY.
+#
+# CI has no aiosandbox, so phase 5 (the wire) used to be OFF there — the entire
+# post-wire AGENT RUNTIME (the in-sandbox MCP server + `agentkeys cred fetch` +
+# the hook path) ran nowhere headless. This script emulates the aiosandbox side
+# ON THE RUNNER with the ONE sanctioned synthetic agent (the master-held mock,
+# contract rule 5) and then runs THE SAME proof script the real sandbox runs:
+#
+#   ensure mock agent (registered + canonical scope grant)
+#     → mint agent + operator sessions (wallet_sig SIWE, headless)
+#     → boot the REAL agentkeys-mcp-server on localhost (http backend, per-actor
+#       STS relay — the exact phase1 1.4 shape, minus the sandbox)
+#     → master-self vault a probe LLM cred (agentkeys cred store)
+#     → stage a harness-env + run harness/scripts/sandbox-agent-isolation.sh:
+#       memory roundtrip THROUGH the MCP + #216 cred fetch (positive, sha-exact)
+#       + the un-granted scope-denial negative
+#
+# What this proves that stage-3 steps 11-12 (raw curls) don't: the MCP-server +
+# CLI runtime layer — the same binaries/wire the sandbox agent uses. What it
+# CANNOT prove: sandbox key custody (the mock key is master-held) — that's the
+# operator's On-Sandbox run (phase1-wire-demo.sh --real → sandbox-agent-
+# isolation.sh in the sandbox).
+#
+# The canonical mock-agent grant (KEEP IDENTICAL to v2-stage3-demo.sh):
+#   $SMOKE_SERVICE , memory:$MOCK_WIRE_NS , $MOCK_CRED_SERVICE
+# The cred probe uses the DEDICATED $MOCK_CRED_SERVICE (never `openrouter`), so
+# a run can never overwrite a real vaulted LLM key.
+#
+# Idempotent: agent create + scope grant short-circuit on chain state; the vault
+# probe service is FIXED (overwritten per run); the memory ns blob is replaced
+# per put; the MCP server + temp files are torn down on EXIT.
+#
+#   bash harness/mock-wire-demo.sh                # full (CI invokes via v2-demo --ci)
+#   bash harness/mock-wire-demo.sh --only-step 4  # one step
+set -uo pipefail
+set +m
+
+REPO_ROOT="$(cd "$(dirname "$0")/.." && pwd)"
+ENV_FILE="${ENV_FILE:-$REPO_ROOT/scripts/operator-workstation.env}"
+[ -f "$ENV_FILE" ] && { set -a; . "$ENV_FILE"; set +a; }
+# shellcheck source=/dev/null
+. "$REPO_ROOT/harness/scripts/_lib.sh"
+
+FROM=1; TO=99; STEP_TOTAL=6
+for a in "$@"; do case "$a" in
+  --ci) : ;;   # accepted for orchestrator symmetry — this script IS the CI path
+  --from-step) shift; FROM="${1:-1}" ;; --from-step=*) FROM="${a#*=}" ;;
+  --to-step) shift; TO="${1:-99}" ;;   --to-step=*) TO="${a#*=}" ;;
+  --only-step) shift; FROM="${1:-1}"; TO="$FROM" ;; --only-step=*) FROM="${a#*=}"; TO="$FROM" ;;
+  --help|-h) sed -n '2,36p' "$0" | sed 's/^# \{0,1\}//'; exit 0 ;;
+esac; done
+should_run() { [ "$1" -ge "$FROM" ] && [ "$1" -le "$TO" ]; }
+c() { [ -t 2 ] && printf '\033[%sm%s\033[0m' "$1" "$2" || printf '%s' "$2"; }
+step() { printf '\n%s %s\n' "$(c '1;36' "▸ step $1/$STEP_TOTAL")" "$2" >&2; }
+ok()   { printf '  %s %s\n' "$(c '1;32' ok)" "$1" >&2; }
+skip() { printf '  %s %s\n' "$(c '1;33' skip)" "$1" >&2; }
+die()  { printf '  %s %s\n' "$(c '1;31' fail)" "$1" >&2; exit 1; }
+sha_hex() { { command -v sha256sum >/dev/null 2>&1 && printf '%s' "$1" | sha256sum || printf '%s' "$1" | shasum -a 256; } | awk '{print $1}'; }
+
+BROKER="${OIDC_ISSUER:-${AGENTKEYS_BROKER_URL:-}}"
+REGION="${REGION:-us-east-1}"
+SMOKE_SERVICE="${SMOKE_TEST_SERVICE:-openrouter}"
+MOCK_WIRE_NS="${MOCK_WIRE_NS:-ci-wire-proof}"
+MOCK_CRED_SERVICE="${MOCK_CRED_SERVICE:-mock-wire-llm}"
+MOCK_SCOPE_SERVICES="${MOCK_SCOPE_SERVICES:-$SMOKE_SERVICE,memory:$MOCK_WIRE_NS,$MOCK_CRED_SERVICE}"
+MOCK_AGENT_LABEL="${AGENTKEYS_MOCK_AGENT_LABEL:-demo-agent-dev}"
+MOCK_MCP_PORT="${MOCK_MCP_PORT:-18090}"
+VENDOR_TOKEN="${AGENTKEYS_MCP_VENDOR_TOKEN:-harness-tok}"
+
+# Binaries: the v2-demo preflight prepends target/release to PATH; standalone
+# runs resolve release → debug → PATH and build on miss (contract rule 7).
+resolve_bin() {
+  local name="$1"
+  if [ -x "$REPO_ROOT/target/release/$name" ]; then printf '%s' "$REPO_ROOT/target/release/$name"
+  elif [ -x "$REPO_ROOT/target/debug/$name" ]; then printf '%s' "$REPO_ROOT/target/debug/$name"
+  else command -v "$name" 2>/dev/null || true; fi
+}
+
+MPID=""; MLOG=""; ASB_FILE=""; HENV_FILE=""
+cleanup() {
+  [ -n "$MPID" ] && kill "$MPID" 2>/dev/null
+  rm -f "$MLOG" "$ASB_FILE" "$HENV_FILE"
+}
+trap cleanup EXIT
+
+# ─── Step 1: prereqs + master identity ──────────────────────────────────────
+if should_run 1; then
+  step 1 "Prereqs: tools + binaries + broker + the REGISTERED master (signs the mock's chain state)"
+  for t in cast jq curl; do command -v "$t" >/dev/null 2>&1 || die "missing $t"; done
+  [ -n "$BROKER" ] || die "no broker URL (OIDC_ISSUER) — the mock wire is real-broker-only"
+  [ -n "${AGENTKEYS_WORKER_CRED_URL:-}" ]   || die "AGENTKEYS_WORKER_CRED_URL unset"
+  [ -n "${AGENTKEYS_WORKER_MEMORY_URL:-}" ] || die "AGENTKEYS_WORKER_MEMORY_URL unset"
+  [ -n "${VAULT_ROLE_ARN:-}" ]  || die "VAULT_ROLE_ARN unset"
+  [ -n "${MEMORY_ROLE_ARN:-}" ] || die "MEMORY_ROLE_ARN unset"
+  CLI_BIN="$(resolve_bin agentkeys)"; MCP_BIN="$(resolve_bin agentkeys-mcp-server)"
+  if [ -z "$CLI_BIN" ] || [ -z "$MCP_BIN" ]; then
+    [ -n "${AGENTKEYS_SKIP_CLI_BUILD:-}" ] && die "binaries missing but AGENTKEYS_SKIP_CLI_BUILD set — the preflight should have built them"
+    ( cd "$REPO_ROOT" && cargo build --release -p agentkeys-cli -p agentkeys-mcp-server ) || die "cargo build failed"
+    CLI_BIN="$(resolve_bin agentkeys)"; MCP_BIN="$(resolve_bin agentkeys-mcp-server)"
+  fi
+  { [ -n "$CLI_BIN" ] && [ -n "$MCP_BIN" ]; } || die "could not resolve agentkeys / agentkeys-mcp-server binaries"
+  KEY=$(resolve_master_key) || die "no master deployer key"
+  MASTER_ADDR_LC=$(cast wallet address --private-key "$KEY" | tr 'A-F' 'a-f')
+  OMNI_RAW=$(printf 'agentkeysevm%s' "$MASTER_ADDR_LC" | shasum -a 256 | awk '{print $1}')
+  OPERATOR_OMNI="0x$OMNI_RAW"
+  MASTER_DKH=$(resolve_active_master_dkh "$OMNI_RAW" "$MASTER_ADDR_LC" || true)
+  [ -n "$MASTER_DKH" ] || die "no ACTIVE master device for operator $OPERATOR_OMNI — run stages 1-2 first (the #164 register); the mock agent's create/grant and the master-self vault all need it"
+  ok "binaries + env ready; operator ${OPERATOR_OMNI:0:14}…, master device ${MASTER_DKH:0:14}…"
+fi
+
+# ─── Step 2: mock agent — register + the CANONICAL scope grant ──────────────
+if should_run 2; then
+  step 2 "Mock agent '$MOCK_AGENT_LABEL': register (idempotent) + grant [$MOCK_SCOPE_SERVICES]"
+  profile_uc=$(printf '%s' "${AGENTKEYS_CHAIN:-heima}" | tr 'a-z-' 'A-Z_')
+  registry_addr=$(eval "echo \${SIDECAR_REGISTRY_ADDRESS_${profile_uc}:-}")
+  scope_addr=$(eval "echo \${SCOPE_CONTRACT_ADDRESS_${profile_uc}:-}")
+  { [ -n "$registry_addr" ] && [ "$registry_addr" != 0x0 ]; } || die "no SidecarRegistry address in env"
+  { [ -n "$scope_addr" ] && [ "$scope_addr" != 0x0 ]; }       || die "no AgentKeysScope address in env"
+  bash "$REPO_ROOT/scripts/heima-agent-create.sh" --label "$MOCK_AGENT_LABEL" --registry-address "$registry_addr" >&2 \
+    || die "heima-agent-create.sh failed for '$MOCK_AGENT_LABEL'"
+  grant_json=$(bash "$REPO_ROOT/scripts/heima-scope-set.sh" --agent "$MOCK_AGENT_LABEL" \
+    --services "$MOCK_SCOPE_SERVICES" --scope-address "$scope_addr" | tail -1) || grant_json=""
+  echo "$grant_json" | jq -e '.ok==true' >/dev/null 2>&1 \
+    || die "scope grant did not land: $(echo "$grant_json" | tr '\n' ' ' | cut -c1-200)"
+  AGENT_FILE="$HOME/.agentkeys/agents/${MOCK_AGENT_LABEL}.json"
+  MOCK_ACTOR=$(jq -r '.actor_omni // empty' "$AGENT_FILE"); MOCK_ACTOR="0x${MOCK_ACTOR#0x}"
+  MOCK_DKH=$(jq -r '.device_key_hash // empty' "$AGENT_FILE")
+  [ -n "$MOCK_DKH" ] || MOCK_DKH=$(cast keccak "$(jq -r '.agent_address // .wallet_address' "$AGENT_FILE" | tr '[:upper:]' '[:lower:]')")
+  [ -n "${MOCK_ACTOR#0x}" ] || die "agent file $AGENT_FILE missing actor_omni"
+  ok "mock agent on chain — actor ${MOCK_ACTOR:0:14}…, device ${MOCK_DKH:0:14}…, scope [$MOCK_SCOPE_SERVICES]"
+fi
+
+# ─── Step 3: sessions — operator (cap-mint authz) + agent (per-actor STS) ───
+if should_run 3; then
+  step 3 "Sessions: operator (deployer SIWE) + agent ('$MOCK_AGENT_LABEL' key SIWE → 0600 file)"
+  [ -n "${KEY:-}" ] || die "no master key — run step 1"
+  okey="$(mktemp)"; ( umask 077; printf '%s' "$KEY" > "$okey" )
+  OP_JWT=$(wallet_sig_mint_jwt "$okey" "$BROKER"); rc=$?; rm -f "$okey"
+  [ "$rc" = 0 ] && [ -n "$OP_JWT" ] || die "operator session mint failed (broker wallet_sig)"
+  agent_pk=$(jq -r '.agent_private_key // empty' "${AGENT_FILE:-$HOME/.agentkeys/agents/$MOCK_AGENT_LABEL.json}")
+  { [ -n "$agent_pk" ] && [ "$agent_pk" != null ]; } || die "mock agent has no master-held key — heima-agent-create ran in §10.2 mode?"
+  akey="$(mktemp)"; ( umask 077; printf '%s' "$agent_pk" > "$akey" )
+  AGENT_JWT=$(wallet_sig_mint_jwt "$akey" "$BROKER"); rc=$?; rm -f "$akey"
+  [ "$rc" = 0 ] && [ -n "$AGENT_JWT" ] || die "agent session mint failed (broker wallet_sig)"
+  ASB_FILE="$(mktemp)"; ( umask 077; printf '%s' "$AGENT_JWT" > "$ASB_FILE" )
+  ok "operator session (${#OP_JWT} chars) + agent session (${#AGENT_JWT} chars, omni == actor) ready"
+fi
+
+# ─── Step 4: boot the REAL MCP server on localhost (the mock 'sandbox') ─────
+if should_run 4; then
+  step 4 "agentkeys-mcp-server on 127.0.0.1:$MOCK_MCP_PORT (http backend + per-actor STS relay)"
+  { [ -n "${MOCK_ACTOR:-}" ] && [ -n "${ASB_FILE:-}" ]; } || die "need steps 2-3 first"
+  MLOG="$(mktemp -t mock-wire-mcp.XXXX)"
+  mcp_args=(--backend http --transport http --listen "127.0.0.1:$MOCK_MCP_PORT"
+    --vendor-tokens "harness:$VENDOR_TOKEN" --broker-url "${BROKER%/}"
+    --memory-url "${AGENTKEYS_WORKER_MEMORY_URL}"
+    --default-actor "$MOCK_ACTOR" --default-operator-omni "$OPERATOR_OMNI"
+    --default-device-key-hash "$MOCK_DKH"
+    --agent-session-bearer-file "$ASB_FILE"
+    --memory-role-arn "${MEMORY_ROLE_ARN}" --vault-role-arn "${VAULT_ROLE_ARN}"
+    --aws-region "$REGION")
+  [ -n "${AGENTKEYS_WORKER_AUDIT_URL:-}" ] && mcp_args+=(--audit-url "$AGENTKEYS_WORKER_AUDIT_URL")
+  "$MCP_BIN" "${mcp_args[@]}" > "$MLOG" 2>&1 &
+  MPID=$!
+  ready=0; for _ in $(seq 1 20); do
+    curl -fsS "http://127.0.0.1:$MOCK_MCP_PORT/healthz" >/dev/null 2>&1 && { ready=1; break; }
+    kill -0 "$MPID" 2>/dev/null || break; sleep 0.5
+  done
+  [ "$ready" = 1 ] || die "MCP server not ready: $(tail -3 "$MLOG" | tr '\n' ' ' | cut -c1-200)"
+  ok "MCP up (pid $MPID) — same runtime layer the sandbox agent talks to"
+fi
+
+# ─── Step 5: master-self vault the probe LLM cred ───────────────────────────
+if should_run 5; then
+  step 5 "Master vaults '$MOCK_CRED_SERVICE' (probe secret) — what the mock agent is authorized to fetch"
+  { [ -n "${OP_JWT:-}" ] && [ -n "${MASTER_DKH:-}" ]; } || die "need steps 1+3 first"
+  PROBE_SECRET="sk-mock-wire-$$-$(date +%s)"
+  export PROBE_SECRET
+  s3key=$("$CLI_BIN" cred store "$MOCK_CRED_SERVICE" --secret-env PROBE_SECRET \
+    --operator-omni "$OPERATOR_OMNI" --actor-omni "$OPERATOR_OMNI" \
+    --device-key-hash "$MASTER_DKH" --session-bearer "$OP_JWT" \
+    --broker-url "${BROKER%/}" --cred-url "${AGENTKEYS_WORKER_CRED_URL}" \
+    --vault-role-arn "${VAULT_ROLE_ARN}" --region "$REGION" 2>&1) \
+    || die "master-self cred store failed: $(echo "$s3key" | tr '\n' ' ' | cut -c1-200)"
+  PROBE_SHA=$(sha_hex "$PROBE_SECRET")
+  ok "vaulted (s3: $(echo "$s3key" | tail -1 | cut -c1-60)…) — expected sha ${PROBE_SHA:0:12}…"
+fi
+
+# ─── Step 6: THE PROOF — the sandbox script, on the runner ──────────────────
+if should_run 6; then
+  step 6 "sandbox-agent-isolation.sh on the runner: memory via MCP + #216 cred fetch + scope negative"
+  { [ -n "${OP_JWT:-}" ] && [ -n "${MOCK_ACTOR:-}" ] && [ -n "${PROBE_SHA:-}" ]; } || die "need steps 1-5 first"
+  HENV_FILE="$(mktemp -t mock-wire-henv.XXXX)"
+  ( umask 077; {
+    printf 'AGENTKEYS_MCP_URL=http://127.0.0.1:%s/mcp\n' "$MOCK_MCP_PORT"
+    printf 'AGENTKEYS_MCP_VENDOR_TOKEN=%s\n' "$VENDOR_TOKEN"
+    printf 'AGENTKEYS_ACTOR_OMNI=%s\n' "$MOCK_ACTOR"
+    printf 'AGENTKEYS_OPERATOR_OMNI=%s\n' "$OPERATOR_OMNI"
+    printf 'AGENTKEYS_DEVICE_KEY_HASH=%s\n' "$MOCK_DKH"
+    printf 'AGENTKEYS_SESSION_BEARER=%s\n' "$OP_JWT"
+    printf 'AGENTKEYS_BROKER_URL=%s\n' "${BROKER%/}"
+    printf 'AGENTKEYS_WORKER_CRED_URL=%s\n' "${AGENTKEYS_WORKER_CRED_URL}"
+    printf 'VAULT_ROLE_ARN=%s\n' "${VAULT_ROLE_ARN}"
+    printf 'REGION=%s\n' "$REGION"
+    printf 'CRED_SERVICE=%s\n' "$MOCK_CRED_SERVICE"
+  } > "$HENV_FILE" )
+  if HARNESS_ENV="$HENV_FILE" EXPECTED_CRED_SHA256="$PROBE_SHA" AGENT_BIN="$CLI_BIN" \
+       bash "$REPO_ROOT/harness/scripts/sandbox-agent-isolation.sh" "$MOCK_WIRE_NS"; then
+    ok "the SAME proof the sandbox runs passed on the runner (mock agent): MCP memory roundtrip + authorized cred fetch (sha-exact) + un-granted denial"
+  else
+    die "mock-sandbox proof failed — see sandbox-agent-isolation.sh output above"
+  fi
+fi
+
+printf '\n%s the CI mock sandbox ran the full post-wire agent runtime — MCP server + agentkeys CLI against the live broker/workers. Sandbox key custody is the operator run (phase1-wire-demo.sh --real).\n' "$(c '1;32' 'DONE ·')" >&2
diff --git a/harness/scripts/_lib.sh b/harness/scripts/_lib.sh
index 28d98e9e..d0abcdd1 100644
--- a/harness/scripts/_lib.sh
+++ b/harness/scripts/_lib.sh
@@ -280,3 +280,40 @@ resolve_active_master_dkh() {
   fi
   return 1
 }
+
+# wallet_sig_mint_jwt <key_file> <broker_url>
+# Mint a broker session JWT non-interactively via the wallet_sig (SIWE) auth
+# plugin: /v1/auth/wallet/start → cast wallet sign → /v1/auth/wallet/verify.
+# Echoes the JWT on stdout (logs to stderr); rc 1 on any failure. The JWT's
+# agentkeys.omni_account == the signing key's broker omni: pass the DEPLOYER
+# key for an OPERATOR session (cap-mint authz) or an AGENT's key for the
+# per-actor STS-relay session (omni == actor_omni). Uses temp files, not shell
+# pipelines — jq chokes on the multi-line SIWE message in a var round-trip.
+wallet_sig_mint_jwt() {
+  local key_file="$1" issuer="${2%/}"
+  local key addr t1 t2 rid msg sig jwt
+  key="$(tr -d '[:space:]' < "$key_file")" || return 1
+  case "$key" in 0x*) ;; *) key="0x$key" ;; esac
+  addr="$(cast wallet address --private-key "$key" 2>/dev/null)" \
+    || { echo "wallet_sig_mint_jwt: cast wallet address failed (key valid? cast on PATH?)" >&2; return 1; }
+  t1="$(mktemp)"; t2="$(mktemp)"
+  if ! curl -sS --max-time 15 -X POST "$issuer/v1/auth/wallet/start" -H 'content-type: application/json' \
+      -d "$(jq -n --arg a "$addr" '{address:$a, chain_id:1}')" -o "$t1"; then
+    echo "wallet_sig_mint_jwt: POST $issuer/v1/auth/wallet/start failed" >&2; rm -f "$t1" "$t2"; return 1
+  fi
+  rid="$(jq -r '.request_id // empty' "$t1")"; msg="$(jq -r '.siwe_message // empty' "$t1")"
+  if [ -z "$rid" ] || [ -z "$msg" ]; then
+    echo "wallet_sig_mint_jwt: wallet/start missing request_id/siwe_message: $(head -c 160 "$t1")" >&2
+    rm -f "$t1" "$t2"; return 1
+  fi
+  sig="$(cast wallet sign --private-key "$key" "$msg" 2>/dev/null)" \
+    || { echo "wallet_sig_mint_jwt: cast wallet sign failed" >&2; rm -f "$t1" "$t2"; return 1; }
+  if ! curl -sS --max-time 15 -X POST "$issuer/v1/auth/wallet/verify" -H 'content-type: application/json' \
+      -d "$(jq -n --arg r "$rid" --arg s "$sig" '{request_id:$r, signature:$s}')" -o "$t2"; then
+    echo "wallet_sig_mint_jwt: POST $issuer/v1/auth/wallet/verify failed" >&2; rm -f "$t1" "$t2"; return 1
+  fi
+  jwt="$(jq -r '.session_jwt // .jwt // empty' "$t2")"
+  rm -f "$t1" "$t2"
+  [ -n "$jwt" ] || { echo "wallet_sig_mint_jwt: no session JWT in wallet/verify (broker wallet_sig plugin enabled?)" >&2; return 1; }
+  printf '%s' "$jwt"
+}
diff --git a/harness/v2-demo.sh b/harness/v2-demo.sh
index 78a5fdb6..f45299b3 100755
--- a/harness/v2-demo.sh
+++ b/harness/v2-demo.sh
@@ -26,9 +26,9 @@
 #   bash harness/v2-demo.sh --from 5             # just the wire phase (phase 5 has no sub-steps)
 #
 # Other flags are for CI / scoping only — an operator should not need them:
-#   bash harness/v2-demo.sh --ci                 # CI: software register + mock agent + tolerate skips; wire OFF (no sandbox)
+#   bash harness/v2-demo.sh --ci                 # CI: software register + mock agent + tolerate skips; wire MOCKED on the runner (no sandbox)
 #   bash harness/v2-demo.sh --stage 3            # one phase
-#   bash harness/v2-demo.sh --wire real|none           # force the wire phase on/off (real-data-only; in-memory 'light' removed)
+#   bash harness/v2-demo.sh --wire real|mock|none      # force the wire phase: real sandbox / CI mock-sandbox / off (in-memory 'light' removed)
 #   bash harness/v2-demo.sh --allow-skip=agent-file-invalid   # passthrough to stage 3
 #
 # Fail-fast: stops at the first failing phase (a red phase cascades); the wire phase
@@ -77,7 +77,10 @@ fi
 { [ -n "${AGENTKEYS_CI:-}" ] || [ -n "${CI:-}" ] && [ "$CI" != 0 ]; } && CI=1
 
 # Phase 5 (wire) runs only when it's in STAGES; WIRE_MODE controls HOW. Operator → auto
-# (real if the sandbox is up, else skip-with-note); CI (no sandbox) → none. --wire wins.
+# (real if the sandbox is up, else skip-with-note); CI (no sandbox) → MOCK: the runner
+# emulates the aiosandbox side (mock-wire-demo.sh boots the real MCP server + runs the
+# SAME sandbox-agent-isolation.sh with the master-held mock agent — #216 cred fetch +
+# memory-via-MCP, headless). --wire wins (`none` = intentionally off).
 # "is the aiosandbox (agent container) up?" — probe its HTTP API (the SAME gate
 # phase1-wire-demo.sh:1.1 uses), NOT a local openviking install. Respects $SANDBOX_URL.
 sandbox_present() {
@@ -85,7 +88,7 @@ sandbox_present() {
   curl -fsS --max-time 3 "$u/healthz" >/dev/null 2>&1 || curl -fsS --max-time 3 "$u/v1/sandbox" >/dev/null 2>&1
 }
 if [ -z "$WIRE_MODE" ]; then
-  if [ "$CI" = 1 ]; then WIRE_MODE=none; else WIRE_MODE=auto; fi
+  if [ "$CI" = 1 ]; then WIRE_MODE=mock; else WIRE_MODE=auto; fi
 fi
 
 c() { [ -t 2 ] && printf '\033[%sm%s\033[0m' "$1" "$2" || printf '%s' "$2"; }
@@ -131,15 +134,20 @@ preflight() {
 # Runs --webauthn so the MASTER grants the agent's memory:<ns> scope (one Touch ID, like
 # phases 1-2); WITHOUT the grant the agent pairs but memory.get → service_not_in_scope.
 # Sets WIRE_RESULT so the final summary can tell a real PASS apart from an auto-skip:
-#   wired    — the wire actually ran (proof executed)
-#   disabled — intentionally off (--wire none / CI has no sandbox) → clean
+#   wired    — the REAL wire ran (sandbox proof executed)
+#   mocked   — the CI mock-sandbox proof ran (mock-wire-demo.sh: the runner boots the
+#              real MCP server + runs sandbox-agent-isolation.sh with the master-held
+#              mock agent — the post-wire runtime, NOT sandbox key custody)
+#   disabled — intentionally off (--wire none) → clean
 #   skipped  — auto mode but NO aiosandbox → the proof did NOT run (NOT a pass)
-# Returns 0 on wired/disabled/auto-skip, non-zero only on a real wire failure. The
-# auto-skip is surfaced as DEMO INCOMPLETE (non-zero exit) by the summary, so an
+# Returns 0 on wired/mocked/disabled/auto-skip, non-zero only on a real wire failure.
+# The auto-skip is surfaced as DEMO INCOMPLETE (non-zero exit) by the summary, so an
 # unexecuted proof can never read as green.
 run_wire_phase() {
   case "$WIRE_MODE" in
-    none)  phase "5 — wire (disabled: --wire none / CI has no sandbox)"; WIRE_RESULT=disabled; return 0 ;;
+    none)  phase "5 — wire (disabled: --wire none)"; WIRE_RESULT=disabled; return 0 ;;
+    mock)  phase "5 — mock-wire-demo.sh (CI mock sandbox: master-held dev agent on the runner)"
+           WIRE_RESULT=mocked; bash "$REPO_ROOT/mock-wire-demo.sh" ;;
     auto)
       if sandbox_present; then
         phase "5 — phase1-wire-demo.sh --real --webauthn"; WIRE_RESULT=wired; bash "$REPO_ROOT/phase1-wire-demo.sh" --real --webauthn
@@ -150,8 +158,8 @@ run_wire_phase() {
         return 0
       fi ;;
     real)  phase "5 — phase1-wire-demo.sh --real --webauthn";  WIRE_RESULT=wired; bash "$REPO_ROOT/phase1-wire-demo.sh" --real --webauthn ;;
-    light) echo "v2-demo: --wire light (in-memory) was removed — real-data-only. Use --wire real|none." >&2; return 1 ;;
-    *) echo "v2-demo: --wire wants real|none (got '$WIRE_MODE')" >&2; return 1 ;;
+    light) echo "v2-demo: --wire light (in-memory) was removed — real-data-only. Use --wire real|mock|none." >&2; return 1 ;;
+    *) echo "v2-demo: --wire wants real|mock|none (got '$WIRE_MODE')" >&2; return 1 ;;
   esac
 }
 
@@ -225,6 +233,7 @@ printf '\n%s phases %s — all green.\n' "$(c '1;32' 'v2-demo DONE ·')" "$STAGE
 if [ "$wire_requested" = 1 ]; then
   case "${WIRE_RESULT:-}" in
     wired)    say "agent paired in the sandbox → run the agent-side proof THERE: bash \$HOME/sandbox-agent-isolation.sh" ;;
+    mocked)   say "wire proof ran on the CI mock sandbox (master-held dev agent — post-wire runtime proven). The REAL §10.2 sandbox-key proof is the operator run." ;;
     disabled) say "wire intentionally disabled (--wire none) — no §10.2 agent was paired this run." ;;
   esac
 fi
diff --git a/harness/v2-stage3-demo.sh b/harness/v2-stage3-demo.sh
index fed21840..db207e27 100755
--- a/harness/v2-stage3-demo.sh
+++ b/harness/v2-stage3-demo.sh
@@ -570,6 +570,14 @@ fi
 # never landed stage-1 step 13's setScopeWithWebauthn).
 SMOKE_SERVICE="${SMOKE_TEST_SERVICE:-openrouter}"
 SMOKE_PLAINTEXT="${SMOKE_TEST_SECRET:-stage3-roundtrip-secret-$(date +%s)}"
+# THE canonical mock-agent grant — keep IDENTICAL to harness/mock-wire-demo.sh
+# (which re-grants the same list for the phase-5 CI mock-sandbox proof). One list,
+# two call sites here: ensure_mock_agent + the step-18 revoke-transition restore.
+# Granting only $SMOKE_SERVICE here would make phases 3 and 5 flip-flop setScope
+# every CI run (set-replace semantics) — the same list lets both idempotently skip.
+MOCK_WIRE_NS="${MOCK_WIRE_NS:-ci-wire-proof}"
+MOCK_CRED_SERVICE="${MOCK_CRED_SERVICE:-mock-wire-llm}"
+MOCK_SCOPE_SERVICES="${MOCK_SCOPE_SERVICES:-$SMOKE_SERVICE,memory:$MOCK_WIRE_NS,$MOCK_CRED_SERVICE}"
 
 # Resolve the demo agent's actor_omni + device_key_hash. Prefer the
 # agent file (created by stage-1 step 12) so the cap binds to a real
@@ -633,7 +641,7 @@ ensure_mock_agent() {
   bash "$rr/scripts/heima-agent-create.sh" --label "$label" --registry-address "$registry_addr" >&2 \
     || { echo "ensure_mock_agent: heima-agent-create.sh failed" >&2; return 1; }
   if [ -n "$scope_addr" ] && [ "$scope_addr" != 0x0 ]; then
-    local args=(--agent "$label" --services "$SMOKE_SERVICE" --scope-address "$scope_addr")
+    local args=(--agent "$label" --services "$MOCK_SCOPE_SERVICES" --scope-address "$scope_addr")
     [ "${WEBAUTHN_MODE:-0}" = 1 ] && args+=(--webauthn)
     bash "$rr/scripts/heima-scope-set.sh" "${args[@]}" >&2 \
       || { echo "ensure_mock_agent: heima-scope-set.sh failed" >&2; return 1; }
@@ -1359,9 +1367,9 @@ if should_run_step 18; then
                 die "#216 post-revoke cred-fetch returned unexpected HTTP $rc — body: $body"
               fi
               restore_json=$(bash "$REPO_ROOT/scripts/heima-scope-set.sh" --agent "$pg_label" \
-                --services "$SMOKE_SERVICE" --scope-address "$scope_addr" | tail -1) || restore_json=""
+                --services "$MOCK_SCOPE_SERVICES" --scope-address "$scope_addr" | tail -1) || restore_json=""
               echo "$restore_json" | jq -e '.ok==true' >/dev/null 2>&1 \
-                && info "#216 revoke transition: scope restored to '$SMOKE_SERVICE' for '$pg_label'" \
+                && info "#216 revoke transition: scope restored to the canonical [$MOCK_SCOPE_SERVICES] for '$pg_label'" \
                 || info "#216 revoke transition: restore did not confirm (next run's ensure_mock_agent self-heals) — $(echo "$restore_json" | tr '\n' ' ' | cut -c1-120)"
             fi
           fi

From 26dd17236fb9c14e74088a19b6ecdb4cb36de6e9 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 12:57:06 +0800
Subject: [PATCH 03/17] =?UTF-8?q?fix:=20#216=20delegated=20cred-fetch=20re?=
 =?UTF-8?q?ads=20the=20master's=20vault=20=E2=80=94=20the=20agent-identity?=
 =?UTF-8?q?=20fetch=20never=20could?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The mock-wire CI proof (first headless run of the post-wire agent
runtime) caught a REAL #216 gap, not a harness bug: the cred worker
keys S3 strictly by cap.payload.actor_omni, so an agent-identity fetch
(actor=agent) read bots/<agent>/credentials/ — but the #216 flow vaults
the key MASTER-SELF into bots/<operator>/credentials/. 502 s3_get.
Every prior #216 proof was master-self (operator==actor → same prefix),
and phase1's old host fetch swallowed the failure (2>/dev/null → env
fallback), so 'the agent fetches from the master's vault' had never
actually worked end-to-end.

Fix (worker, fetch only): try the actor's OWN vault first (#228
agent-owned creds — self-stored entries shadow delegated ones), then,
for a DELEGATED cap (actor != operator), fall back to the OPERATOR's
vault. The envelope AAD is keyed by the vault OWNER (each vault's
objects were encrypted with aad(operator, owner, service, epoch) at
store time), so decrypt matches either source. No IAM change: the S3
read still runs under the caller-relayed STS — reading the operator
prefix requires the operator session the wire context already holds;
the (device-bound, isServiceInScope-verified) cap narrows WHICH service
that session releases. Store/teardown/list stay strictly actor-keyed.

Unit tests: fetch_vault_owners (master-self = one vault, byte-identical
prior behavior; delegated = actor-then-operator). arch.md synced
(credential_envelope row + the cred-fetch sequence) per the
architecture-as-source-of-truth policy.

CI self-verifies: crates/agentkeys-worker-*/** trips the paths-filter →
the test EC2 auto-redeploys → the harness (incl. phase-5 mock wire
step 6) runs against the fixed worker.
---
 crates/agentkeys-worker-creds/src/handlers.rs | 100 ++++++++++++++----
 crates/agentkeys-worker-creds/src/lib.rs      |   7 +-
 docs/arch.md                                  |   4 +-
 3 files changed, 86 insertions(+), 25 deletions(-)

diff --git a/crates/agentkeys-worker-creds/src/handlers.rs b/crates/agentkeys-worker-creds/src/handlers.rs
index f469482e..57532267 100644
--- a/crates/agentkeys-worker-creds/src/handlers.rs
+++ b/crates/agentkeys-worker-creds/src/handlers.rs
@@ -269,30 +269,68 @@ async fn cred_fetch_inner(
     creds: Option<&crate::aws_creds::StsCreds>,
     req: &FetchRequest,
 ) -> Result<Vec<u8>, ApiError> {
-    let key = s3_key(&req.cap.payload.actor_omni, &req.cap.payload.service);
+    // A fetch may read TWO vaults, tried in order (see fetch_vault_owners):
+    // the actor's OWN (#228 agent-owned creds), then — for a DELEGATED cap
+    // (actor != operator) — the OPERATOR's vault (#216: the master vaulted the
+    // key master-self; the cap's already-verified on-chain cred:<service> scope
+    // grant IS the agent's authorization to fetch it). The S3 read still runs
+    // under the CALLER-relayed STS creds, so layer-3 IAM is untouched: reading
+    // the operator's prefix requires operator-tagged STS (the wire context's
+    // operator session) — the cap only narrows WHICH service that session
+    // releases. The envelope AAD is keyed by the vault OWNER (each vault's
+    // objects were encrypted with aad(operator, owner, service, epoch) at
+    // store time), so the owner drives both the key and the decrypt.
     let s3 = s3_for_request(&state.s3, &state.config.region, creds).await;
-    let resp = s3
-        .get_object()
-        .bucket(&state.config.vault_bucket)
-        .key(&key)
-        .send()
-        .await
-        .map_err(|e| err_502(e.to_string(), "s3_get"))?;
-    let body = resp
-        .body
-        .collect()
-        .await
-        .map_err(|e| err_502(e.to_string(), "s3_body"))?
-        .into_bytes();
+    let owners = fetch_vault_owners(&req.cap.payload.actor_omni, &req.cap.payload.operator_omni);
+    let mut last_err: Option<ApiError> = None;
+    for owner in &owners {
+        let key = s3_key(owner, &req.cap.payload.service);
+        let resp = match s3
+            .get_object()
+            .bucket(&state.config.vault_bucket)
+            .key(&key)
+            .send()
+            .await
+        {
+            Ok(resp) => resp,
+            Err(e) => {
+                // NoSuchKey (not stored in this vault) or AccessDenied (the
+                // relayed STS isn't tagged for this prefix) — try the next
+                // vault; surface the LAST attempt's error if all miss.
+                last_err = Some(err_502(e.to_string(), "s3_get"));
+                continue;
+            }
+        };
+        let body = resp
+            .body
+            .collect()
+            .await
+            .map_err(|e| err_502(e.to_string(), "s3_body"))?
+            .into_bytes();
+
+        let aad = envelope::aad(
+            &req.cap.payload.operator_omni,
+            owner,
+            &req.cap.payload.service,
+            req.cap.payload.k3_epoch,
+        );
+        return envelope::decrypt(&state.config.kek_hex_stage1, &body, &aad)
+            .map_err(|e| err_500(e.to_string(), "envelope_decrypt"));
+    }
+    Err(last_err.unwrap_or_else(|| err_502("no vault candidates", "s3_get")))
+}
 
-    let aad = envelope::aad(
-        &req.cap.payload.operator_omni,
-        &req.cap.payload.actor_omni,
-        &req.cap.payload.service,
-        req.cap.payload.k3_epoch,
-    );
-    envelope::decrypt(&state.config.kek_hex_stage1, &body, &aad)
-        .map_err(|e| err_500(e.to_string(), "envelope_decrypt"))
+/// The vaults a fetch may read, in order: the actor's OWN vault first (#228
+/// agent-owned creds — an agent's self-stored entry shadows a same-named
+/// delegated one, never the reverse), then the operator's vault when the cap
+/// is delegated (actor != operator, the #216 master-provisioned LLM key).
+/// Master-self caps (actor == operator) read exactly one vault — unchanged.
+fn fetch_vault_owners(actor_omni: &str, operator_omni: &str) -> Vec<String> {
+    if actor_omni.eq_ignore_ascii_case(operator_omni) {
+        vec![actor_omni.to_string()]
+    } else {
+        vec![actor_omni.to_string(), operator_omni.to_string()]
+    }
 }
 
 async fn cred_teardown(
@@ -443,6 +481,24 @@ fn s3_prefix(actor_omni: &str) -> String {
 mod tests {
     use super::*;
 
+    #[test]
+    fn fetch_owners_master_self_reads_one_vault() {
+        // operator == actor (case-insensitive) → exactly the actor's own vault,
+        // byte-identical to the pre-#216 behavior.
+        assert_eq!(fetch_vault_owners("0xAB", "0xab"), vec!["0xAB".to_string()]);
+    }
+
+    #[test]
+    fn fetch_owners_delegated_falls_back_to_operator_vault() {
+        // actor != operator (#216 delegated fetch) → the agent's own vault
+        // first (#228 agent-owned shadows), then the operator's (the master-
+        // vaulted key the cap's scope grant authorizes).
+        assert_eq!(
+            fetch_vault_owners("0xagent", "0xmaster"),
+            vec!["0xagent".to_string(), "0xmaster".to_string()]
+        );
+    }
+
     #[test]
     fn s3_key_format_matches_arch_md_15_1() {
         // arch.md §15.1: s3://$VAULT_BUCKET/bots/<actor_omni_hex>/credentials/<service>.enc
diff --git a/crates/agentkeys-worker-creds/src/lib.rs b/crates/agentkeys-worker-creds/src/lib.rs
index be2343ab..28a6007a 100644
--- a/crates/agentkeys-worker-creds/src/lib.rs
+++ b/crates/agentkeys-worker-creds/src/lib.rs
@@ -13,7 +13,12 @@
 //!   5. AES-256-GCM encrypt/decrypt with `aad = sha256(operator_omni ||
 //!      actor_omni || service || k3_epoch)`.
 //!   6. S3 PUT/GET at `s3://$VAULT_BUCKET/bots/<actor_omni>/credentials/
-//!      <service>.enc` via the worker's IAM identity.
+//!      <service>.enc` via the worker's IAM identity. A FETCH with a
+//!      delegated cap (actor != operator, #216) additionally falls back to
+//!      the OPERATOR's vault when the actor's own has no entry — the master
+//!      vaulted the key master-self and the cap's verified on-chain
+//!      `cred:<service>` scope grant is the agent's authorization to fetch
+//!      it (store/teardown/list stay strictly actor-keyed).
 //!
 //! Stage-1 simplification: KEK is injected via env. Stage 2 (#90)
 //! replaces with mTLS-derived KEK from the signer enclave.
diff --git a/docs/arch.md b/docs/arch.md
index c2f4deaf..393043fc 100644
--- a/docs/arch.md
+++ b/docs/arch.md
@@ -284,7 +284,7 @@ Pinned to disambiguate the same value showing up under different labels across c
 | `OIDC JWT` (= K7) | Per-mint short-lived JWT signed by K2; consumed by `AssumeRoleWithWebIdentity`. Carries `agentkeys_actor_omni` claim → AWS session tag. | `oidc_jwt`, `JWT_A` / `JWT_B` (demo shell vars). |
 | `cap-token` | The bearer issued by broker authorizing one specific operation (cred-fetch / cred-store / memory-read / audit-append / payment / etc.). Carries K10 sig + K11 assertion (for master mutations) + broker's K1 co-signature. | `cap`, `capability_token`, `op_cap`. |
 | `credential_kek` | 32-byte AES-256 key for one operator's credentials. Derived as `HKDF-SHA256(salt="agentkeys.kek-salt.v2", ikm=K3_v[epoch], info="agentkeys.user.v1" \|\| actor_omni)`. | `KEK`, `cred_kek`. |
-| `credential_envelope` | Wire format of one stored credential: `1B version (0x04) \|\| 1B k3_epoch \|\| 12B nonce \|\| ciphertext \|\| 16B tag`. Stored at `s3://$VAULT_BUCKET/bots/<actor_omni_hex>/credentials/<service>.enc`. AAD binds `(actor_omni, service)`. | `envelope`, `AEAD blob`, `<service>.enc` (S3 key suffix). |
+| `credential_envelope` | Wire format of one stored credential: `1B version (0x04) \|\| 1B k3_epoch \|\| 12B nonce \|\| ciphertext \|\| 16B tag`. Stored at `s3://$VAULT_BUCKET/bots/<vault_owner_omni_hex>/credentials/<service>.enc`; AAD binds `(operator_omni, vault_owner_omni, service, k3_epoch)`. Store/teardown/list are strictly actor-keyed (vault_owner = the cap's actor). A **fetch with a delegated cap** (actor ≠ operator, #216) tries the actor's own vault first (agent-owned, #228), then **falls back to the operator's vault** — the master vaulted the key master-self and the cap's verified on-chain `cred:<service>` grant is the agent's authorization; the S3 read still runs under the caller-relayed STS, so layer-3 IAM is unchanged (reading the operator prefix needs the operator session the wire context holds). | `envelope`, `AEAD blob`, `<service>.enc` (S3 key suffix). |
 | `vault_bucket` / `memory_bucket` / `config_bucket` / `audit_bucket` / `email_bucket` / `payment_audit_bucket` | One S3 bucket per data class per §17. Per-actor prefix at `bots/<actor_omni_hex>/` (config is per-operator + master-only, #201). | `$VAULT_BUCKET`, `$MEMORY_BUCKET`, `$CONFIG_BUCKET`, `$AUDIT_BUCKET`, `$EMAIL_BUCKET`, `$PAYMENT_AUDIT_BUCKET`. |
 | `policy` / `scope` / `namespace` / `category` / `service` (the authorization vocabulary) | **Distinct pipeline stages, NOT synonyms:** **policy** (human intent, off-chain, `DataClass::Config`) → COMPILE → **scope** (on-chain `(operator, actor, serviceHash)` grant, `AgentKeysScope` §19) over **categories/attributes** (the classifier's tag) → **service** (the signed cap string; for memory `service = memory:<ns>`, where **namespace** = the memory category). The unifying unit is the **policy attribute (category)** ([`research/universal-gate-pattern.md`](research/universal-gate-pattern.md) four primitives). Full table + pipeline: [`wiki/policy-scope-namespace.md`](wiki/policy-scope-namespace.md). | Confusions this resolves: "scope" used to mean "namespace" or "policy"; **"tag" = classifier *category*** (≠ the AWS **PrincipalTag** of §17 / [`wiki/tag-based-access.md`](wiki/tag-based-access.md)). |
 
@@ -839,7 +839,7 @@ sequenceDiagram
     Worker->>Chain: re-verify scope + binding + epoch (defense in depth)
     Worker->>Sig: derive_cred_kek(operator_omni, k3_epoch) [mTLS]
     Sig-->>Worker: KEK (32 bytes)
-    Worker->>Worker: GetObject s3://vault_bucket/bots/<actor_omni>/credentials/<service>.enc
+    Worker->>Worker: GetObject s3://vault_bucket/bots/<actor_omni>/credentials/<service>.enc (delegated cap: falls back to bots/<operator_omni>/… — the master-vaulted key, #216)
     Worker->>Worker: AES-256-GCM decrypt under KEK
     Worker-->>Dmn: plaintext credential
     Dmn->>Dmn: Cache plaintext (TTL 5 min)

From 4ae62c4da255dcfecfea8cc575631f54826ff6fd Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 14:32:25 +0800
Subject: [PATCH 04/17] =?UTF-8?q?fix:=20stage-1=20agent-create=20self-heal?=
 =?UTF-8?q?s=20a=20=C2=A710.2=20sandbox-paired=20(keyless)=20agent=20file?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

heima-agent-create.sh read .agent_private_key and fed it straight to
`cast wallet sign`. When the file is a §10.2 sandbox-paired record
(key_custody=sandbox-only, agent_private_key:null — written by the wire
phase's in-sandbox device-session), AGENT_KEY became the literal 'null'
→ 'Error: Failed to decode private key' at stage-1 step 12. The wire
phase and stage-1 share the 'demo-agent' label, so any operator who runs
the wire then re-runs stage 1 hits this.

Now: validate the key shape (0x+64hex or bare 64hex); if unusable
(sandbox custody / corrupt / legacy), back up the file to
<label>.json.bak.<ts> and regenerate a fresh master-held wallet — i.e.
behave as a clean machine would. Non-destructive to §10.2: the sandbox
holds the authoritative key and phase-5 pairing rebuilds the master-side
record. Shape guard tested standalone under set -e (null/empty/short/
non-hex → regenerate; valid → reused, bare key 0x-normalized).

Runbook fold-back: two new Q&A entries (the decode error + the earlier
QR-picker-instead-of-Touch-ID passkey case).
---
 docs/operator-runbook-harness.md |  6 ++++++
 scripts/heima-agent-create.sh    | 35 ++++++++++++++++++++++++++++----
 2 files changed, 37 insertions(+), 4 deletions(-)

diff --git a/docs/operator-runbook-harness.md b/docs/operator-runbook-harness.md
index db64608f..6475323a 100644
--- a/docs/operator-runbook-harness.md
+++ b/docs/operator-runbook-harness.md
@@ -320,6 +320,12 @@ Touch-ID phases 1-2; use `--from 3.16` to jump straight to step 16 if the sessio
 **Q. `DeviceNotActive` / cap-mint device mismatch?**
 The master isn't registered. Re-run stage 1, or `bash harness/scripts/erc4337-register-master.sh`.
 
+**Q. Stage-1 step 12 fails `Error: Failed to decode private key` (heima-agent-create.sh)?**
+A leftover **§10.2 sandbox-paired** `~/.agentkeys/agents/<label>.json` (`key_custody: "sandbox-only …"`, `agent_private_key: null`) is in the way: the wire phase (phase 5) writes that keyless record — the agent's key lives in the sandbox — and it shares the `demo-agent` label with stage-1's master-held agent, so a later stage-1 run reads the `null` key and feeds it to `cast`. **Self-healing now**: `heima-agent-create.sh` detects an unusable key, backs the file up to `<label>.json.bak.<ts>`, and regenerates a fresh master-held wallet — just re-run `bash harness/v2-demo.sh --from 1.12`. (The §10.2 sandbox agent is re-paired by phase 5; nothing is lost.)
+
+**Q. The register (step 10/11) shows a QR "Passkeys & Security Keys" picker instead of Touch ID?**
+The stored K11 enrollment points to a platform passkey that isn't in the browser opening the ceremony (enrolled in a different browser, deleted from Passwords, or only on another device via iCloud) — macOS falls back to the cross-device picker and times out. The step-10 idempotency check trusts `mode=webauthn` and won't re-create it. Fix: `mv ~/.agentkeys/k11/<operator_omni>.json /tmp/` then `bash harness/v2-demo.sh --from 1.10` — step 10 runs a fresh `k11 enroll --webauthn` (platform attachment → Touch ID **create**) in your current default browser, and the register then asserts that credential. Use one browser for both; a fresh passkey derives a new P256Account (small deployer gas, old account harmless).
+
 **Q. `sandbox-agent-isolation.sh` says `SKIP: cred coordinates not staged`?**
 The sandbox's `~/.agentkeys/harness-env` predates the #216 staging (or the wire phase didn't
 finish). Re-run the wire on the operator host — `bash harness/v2-demo.sh --from 5` (or
diff --git a/scripts/heima-agent-create.sh b/scripts/heima-agent-create.sh
index b2ef04a0..ff50b542 100755
--- a/scripts/heima-agent-create.sh
+++ b/scripts/heima-agent-create.sh
@@ -153,11 +153,38 @@ else
   mkdir -p "$AGENT_DIR"
   chmod 700 "$AGENT_DIR" 2>/dev/null || true
 
+  AGENT_ADDR=""
+  AGENT_KEY=""
   if [ -f "$AGENT_FILE" ]; then
-    AGENT_ADDR=$(jq -r .agent_address "$AGENT_FILE")
-    AGENT_KEY=$(jq -r .agent_private_key "$AGENT_FILE")
-    ok "reusing existing agent wallet from $AGENT_FILE → $AGENT_ADDR"
-  else
+    AGENT_ADDR=$(jq -r '.agent_address // empty' "$AGENT_FILE")
+    AGENT_KEY=$(jq -r '.agent_private_key // empty' "$AGENT_FILE")
+    # A §10.2 sandbox-paired file (key_custody="sandbox-only …", written by the
+    # in-sandbox `agentkeys agent device-session` / phase1-wire-demo.sh P.1b)
+    # carries NO master-held key (agent_private_key:null) — the key lives in the
+    # sandbox by design. This stage-1 master-held create path cannot sign the
+    # pop-sig with it; feeding the literal "null"/empty to `cast wallet sign`
+    # below dies with the opaque "Failed to decode private key". Validate the key
+    # SHAPE (0x + 64 hex, or bare 64 hex) and, if it's unusable (sandbox custody /
+    # corrupt / legacy), back the file up and regenerate a fresh master-held
+    # wallet — i.e. behave exactly as a clean machine would. A §10.2 agent's
+    # authoritative key stays in the sandbox and its master-side record is rebuilt
+    # by phase-5 pairing, so this is non-destructive to that flow.
+    case "$AGENT_KEY" in
+      0x[0-9a-fA-F]*) [ "${#AGENT_KEY}" -eq 66 ] || AGENT_KEY="" ;;
+      [0-9a-fA-F]*)   { [ "${#AGENT_KEY}" -eq 64 ] && AGENT_KEY="0x$AGENT_KEY"; } || AGENT_KEY="" ;;
+      *)              AGENT_KEY="" ;;
+    esac
+    if [ -n "$AGENT_KEY" ]; then
+      ok "reusing existing master-held agent wallet from $AGENT_FILE → $AGENT_ADDR"
+    else
+      CUSTODY=$(jq -r '.key_custody // "unknown"' "$AGENT_FILE")
+      BAK="$AGENT_FILE.bak.$(date +%s)"
+      cp "$AGENT_FILE" "$BAK" 2>/dev/null || true
+      log "existing $AGENT_FILE has NO master-held key (key_custody=$CUSTODY) — backed up → $BAK, regenerating a stage-1 master-held wallet (a §10.2 sandbox agent is re-paired by phase 5)"
+      AGENT_ADDR=""   # fall through to generate
+    fi
+  fi
+  if [ -z "$AGENT_KEY" ]; then
     log "Generating fresh agent wallet for label '$LABEL' …"
     WALLET_JSON=$(cast wallet new --json | jq -r '.[0]')
     AGENT_ADDR=$(echo "$WALLET_JSON" | jq -r .address)

From 7c15dea006f4d7fd17a41a8f28c8ef61c6188844 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 14:54:12 +0800
Subject: [PATCH 05/17] fix: chain helpers take chain_id from the pinned
 profile, not a live eth_chainId curl (transient-RPC hardening)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

A transient RPC blip (LibreSSL SSL_ERROR_SYSCALL to rpc.heima-parachain)
made the inline `LIVE_CHAIN_ID=$(printf '%d' "$(curl … eth_chainId …)")`
resolve to 0, and `cast send --chain-id 0` was then rejected with
'error code -32603: invalid chain id' (hit at v2-stage1 step 14,
heima-credential-audit.sh). 13 heima-*.sh scripts shared this fragile
pattern — any of them could fail the same way on a blip.

All 13 now resolve the chain id from the OFFLINE pinned profile
(`echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'` — PROFILE_JSON is
already loaded in each) and fail loud if it isn't a positive integer, so
a network blip can never corrupt the --chain-id passed to cast. The live
eth_chainId is still cross-checked once at flow start (v2-stage1 step 5 /
heima-bring-up.sh), so wrong-RPC detection is unchanged.

Inline (not a _lib.sh helper) on purpose: the chain_id line runs early in
each script while _lib.sh is sourced later for resolve_master_key — a
helper call would be a call-before-definition bug in 9 of 13. Inline
needs only PROFILE_JSON + die, both present before the line in all 13
(verified). Functional-tested against the real heima profile (→ 212013)
+ the null/0/missing fail-loud paths; bash -n all 13. Runbook Q&A added.
---
 docs/operator-runbook-harness.md                | 3 +++
 harness/scripts/heima-device-add.sh             | 3 +--
 harness/scripts/heima-register-first-master.sh  | 3 +--
 harness/scripts/heima-register-spare-master.sh  | 3 +--
 harness/scripts/heima-set-recovery-threshold.sh | 3 +--
 scripts/heima-agent-create.sh                   | 2 +-
 scripts/heima-credential-audit.sh               | 2 +-
 scripts/heima-device-revoke.sh                  | 2 +-
 scripts/heima-fund-account.sh                   | 2 +-
 scripts/heima-k3-rotate.sh                      | 3 +--
 scripts/heima-reset-master.sh                   | 2 +-
 scripts/heima-scope-revoke.sh                   | 2 +-
 scripts/heima-scope-set.sh                      | 2 +-
 scripts/heima-worker-smoke.sh                   | 4 +---
 14 files changed, 16 insertions(+), 20 deletions(-)

diff --git a/docs/operator-runbook-harness.md b/docs/operator-runbook-harness.md
index 6475323a..00872b80 100644
--- a/docs/operator-runbook-harness.md
+++ b/docs/operator-runbook-harness.md
@@ -326,6 +326,9 @@ A leftover **§10.2 sandbox-paired** `~/.agentkeys/agents/<label>.json` (`key_cu
 **Q. The register (step 10/11) shows a QR "Passkeys & Security Keys" picker instead of Touch ID?**
 The stored K11 enrollment points to a platform passkey that isn't in the browser opening the ceremony (enrolled in a different browser, deleted from Passwords, or only on another device via iCloud) — macOS falls back to the cross-device picker and times out. The step-10 idempotency check trusts `mode=webauthn` and won't re-create it. Fix: `mv ~/.agentkeys/k11/<operator_omni>.json /tmp/` then `bash harness/v2-demo.sh --from 1.10` — step 10 runs a fresh `k11 enroll --webauthn` (platform attachment → Touch ID **create**) in your current default browser, and the register then asserts that credential. Use one browser for both; a fresh passkey derives a new P256Account (small deployer gas, old account harmless).
 
+**Q. A chain step dies `error code -32603: invalid chain id` or `SSL_ERROR_SYSCALL` to the RPC?**
+A **transient RPC blip** (the Heima RPC briefly dropped the TLS connection). The chain helpers used to derive the chain id from a live `eth_chainId` curl, so a blip resolved it to `0` and `cast send --chain-id 0` was rejected as "invalid chain id." Fixed: every `heima-*.sh` now takes the chain id from the **pinned chain profile** (`agentkeys chain show <chain>` → `.chain_id`, offline) and **fails loud** if it isn't a positive integer — so a blip can no longer corrupt it. If the blip hit the actual `cast send`/`cast call` instead, just **re-run the step** (`bash harness/v2-demo.sh --from <P.S>`) — every step is idempotent.
+
 **Q. `sandbox-agent-isolation.sh` says `SKIP: cred coordinates not staged`?**
 The sandbox's `~/.agentkeys/harness-env` predates the #216 staging (or the wire phase didn't
 finish). Re-run the wire on the operator host — `bash harness/v2-demo.sh --from 5` (or
diff --git a/harness/scripts/heima-device-add.sh b/harness/scripts/heima-device-add.sh
index 700af149..f2915382 100755
--- a/harness/scripts/heima-device-add.sh
+++ b/harness/scripts/heima-device-add.sh
@@ -73,8 +73,7 @@ fi
 AGENTKEYS_CHAIN="${AGENTKEYS_CHAIN:-heima}"
 PROFILE_JSON=$($AGENTKEYS_BIN chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' \
-  -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$REGISTRY" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/harness/scripts/heima-register-first-master.sh b/harness/scripts/heima-register-first-master.sh
index d62f36ba..de0937c5 100755
--- a/harness/scripts/heima-register-first-master.sh
+++ b/harness/scripts/heima-register-first-master.sh
@@ -132,8 +132,7 @@ fi
 AGENTKEYS_CHAIN="${AGENTKEYS_CHAIN:-heima}"
 PROFILE_JSON=$("$AGENTKEYS_BIN" chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' \
-  -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$REGISTRY" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/harness/scripts/heima-register-spare-master.sh b/harness/scripts/heima-register-spare-master.sh
index d466a132..c13e45a1 100755
--- a/harness/scripts/heima-register-spare-master.sh
+++ b/harness/scripts/heima-register-spare-master.sh
@@ -69,8 +69,7 @@ fi
 AGENTKEYS_CHAIN="${AGENTKEYS_CHAIN:-heima}"
 PROFILE_JSON=$("$AGENTKEYS_BIN" chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' \
-  -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$REGISTRY" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/harness/scripts/heima-set-recovery-threshold.sh b/harness/scripts/heima-set-recovery-threshold.sh
index 001b03b6..3b17dcf8 100755
--- a/harness/scripts/heima-set-recovery-threshold.sh
+++ b/harness/scripts/heima-set-recovery-threshold.sh
@@ -53,8 +53,7 @@ fi
 AGENTKEYS_CHAIN="${AGENTKEYS_CHAIN:-heima}"
 PROFILE_JSON=$($AGENTKEYS_BIN chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' \
-  -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$REGISTRY" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/scripts/heima-agent-create.sh b/scripts/heima-agent-create.sh
index ff50b542..bbc8b83d 100755
--- a/scripts/heima-agent-create.sh
+++ b/scripts/heima-agent-create.sh
@@ -94,7 +94,7 @@ case "$AGENTKEYS_CHAIN" in
 esac
 PROFILE_JSON=$(agentkeys chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 # Resolve registry address: --registry-address > $SIDECAR_REGISTRY_ADDRESS_<CHAIN_UC>.
 if [ -z "$REGISTRY" ]; then
diff --git a/scripts/heima-credential-audit.sh b/scripts/heima-credential-audit.sh
index 5b54bc8b..e035747d 100755
--- a/scripts/heima-credential-audit.sh
+++ b/scripts/heima-credential-audit.sh
@@ -66,7 +66,7 @@ set -a; . "$ENV_FILE"; set +a
 AGENTKEYS_CHAIN="${AGENTKEYS_CHAIN:-heima}"
 PROFILE_JSON=$(agentkeys chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$AUDIT_CONTRACT" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/scripts/heima-device-revoke.sh b/scripts/heima-device-revoke.sh
index 0184d852..d277bd22 100755
--- a/scripts/heima-device-revoke.sh
+++ b/scripts/heima-device-revoke.sh
@@ -88,7 +88,7 @@ fi
 AGENTKEYS_CHAIN="${AGENTKEYS_CHAIN:-heima}"
 PROFILE_JSON=$(agentkeys chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$REGISTRY" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/scripts/heima-fund-account.sh b/scripts/heima-fund-account.sh
index 1c6d3484..22884551 100755
--- a/scripts/heima-fund-account.sh
+++ b/scripts/heima-fund-account.sh
@@ -66,7 +66,7 @@ case "$AGENTKEYS_CHAIN" in
 esac
 PROFILE_JSON=$(agentkeys chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 # Deployer key resolution — same 3-way order as heima-bring-up.sh step 3.
 # (No fresh-key fallback here; refusing to mint funds from a brand-new wallet.)
diff --git a/scripts/heima-k3-rotate.sh b/scripts/heima-k3-rotate.sh
index 9c7802d9..91f8ac7a 100755
--- a/scripts/heima-k3-rotate.sh
+++ b/scripts/heima-k3-rotate.sh
@@ -68,8 +68,7 @@ fi
 AGENTKEYS_CHAIN="${AGENTKEYS_CHAIN:-heima}"
 PROFILE_JSON=$("$AGENTKEYS_BIN" chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' \
-  -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$COUNTER_ADDR" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/scripts/heima-reset-master.sh b/scripts/heima-reset-master.sh
index 1df81f4d..97bfeea0 100755
--- a/scripts/heima-reset-master.sh
+++ b/scripts/heima-reset-master.sh
@@ -71,7 +71,7 @@ fi
 AGENTKEYS_CHAIN="${AGENTKEYS_CHAIN:-heima}"
 PROFILE_JSON=$("$AGENTKEYS_BIN" chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$REGISTRY" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/scripts/heima-scope-revoke.sh b/scripts/heima-scope-revoke.sh
index 1fe934cc..e08ad762 100755
--- a/scripts/heima-scope-revoke.sh
+++ b/scripts/heima-scope-revoke.sh
@@ -62,7 +62,7 @@ fi
 AGENTKEYS_CHAIN="${AGENTKEYS_CHAIN:-heima}"
 PROFILE_JSON=$(agentkeys chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$SCOPE_CONTRACT" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/scripts/heima-scope-set.sh b/scripts/heima-scope-set.sh
index 7ee61e1f..949354ab 100755
--- a/scripts/heima-scope-set.sh
+++ b/scripts/heima-scope-set.sh
@@ -97,7 +97,7 @@ case "$AGENTKEYS_CHAIN" in
 esac
 PROFILE_JSON=$(agentkeys chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 if [ -z "$SCOPE_CONTRACT" ]; then
   PROFILE_NAME_UC=$(printf '%s' "$AGENTKEYS_CHAIN" | tr 'a-z-' 'A-Z_')
diff --git a/scripts/heima-worker-smoke.sh b/scripts/heima-worker-smoke.sh
index 431c38e4..82213411 100755
--- a/scripts/heima-worker-smoke.sh
+++ b/scripts/heima-worker-smoke.sh
@@ -89,9 +89,7 @@ esac
 
 PROFILE_JSON=$(agentkeys chain show "$AGENTKEYS_CHAIN")
 RPC_HTTP=$(echo "$PROFILE_JSON" | jq -r .rpc.http)
-LIVE_CHAIN_ID=$(printf '%d' "$(curl -sS -H 'Content-Type: application/json' \
-  -d '{"jsonrpc":"2.0","method":"eth_chainId","params":[],"id":1}' \
-  "$RPC_HTTP" | jq -r .result)")
+LIVE_CHAIN_ID=$(echo "$PROFILE_JSON" | jq -r '(.chain_id // 0)'); [ "$LIVE_CHAIN_ID" -gt 0 ] 2>/dev/null || die "could not resolve a positive chain_id from chain profile '$AGENTKEYS_CHAIN' (transient RPC blip?) — check: agentkeys chain show $AGENTKEYS_CHAIN"
 
 # Master key — shared resolve_master_key (HEIMA_DEPLOYER_KEY_FILE for CI,
 # falls back to ./test-hei mnemonic). Replaces mnemonic-only inline block.

From ade5a0f40183aed662cb7d873b1a632d359b4f5f Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 16:40:49 +0800
Subject: [PATCH 06/17] =?UTF-8?q?feat:=20#216=20default-key=20selection=20?=
 =?UTF-8?q?=E2=80=94=20off-chain=20cred=20manifest=20(no=20contract=20chan?=
 =?UTF-8?q?ge)?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The on-chain AgentKeysScope stores only keccak(service) HASHES, so an
agent can verify a known service name (isServiceInScope) but cannot
enumerate its authorized NAMES or learn which is its default LLM key
from chain. Putting names/a-default on-chain (a contract change) is
unnecessary: the master KNOWS the plaintext names + default at grant
time. Record them OFF-CHAIN; the on-chain hash gate still runs on every
fetch, so authorization is unchanged — this is discovery only.

- agentkeys-types: CredManifest { services, default_service } + resolve
  precedence (explicit > --select N (1-based) > master default) +
  load/save. 11 unit tests.
- agentkeys cred manifest --services a,b,c --default a  → the master
  records the authorized names + designated default (public names only).
- agentkeys cred list  → the agent's off-chain discovery (the chain
  can't enumerate); marks the default.
- agentkeys cred fetch  → service is now OPTIONAL: no arg uses the
  master-designated default (the #216 no-UI path), --select N picks from
  the list, an explicit service is used as-is (still on-chain verified).

Smoke-tested: manifest write → list (default marked) → no-arg fetch
resolves the default before the network call → --select 1 picks the
first → --select 9 clean range error. fmt + clippy clean; types 47 /
cli 19 tests green; backend fixture gate unaffected (not a wire shape).

Harness e2e proof + docs follow in the next commit.
---
 crates/agentkeys-cli/src/cred_admin.rs      |  91 ++++++++
 crates/agentkeys-cli/src/main.rs            |  94 ++++++--
 crates/agentkeys-types/src/cred_manifest.rs | 238 ++++++++++++++++++++
 crates/agentkeys-types/src/lib.rs           |   2 +
 4 files changed, 411 insertions(+), 14 deletions(-)
 create mode 100644 crates/agentkeys-types/src/cred_manifest.rs

diff --git a/crates/agentkeys-cli/src/cred_admin.rs b/crates/agentkeys-cli/src/cred_admin.rs
index 540da6f4..93fc6780 100644
--- a/crates/agentkeys-cli/src/cred_admin.rs
+++ b/crates/agentkeys-cli/src/cred_admin.rs
@@ -12,9 +12,12 @@
 use anyhow::{Context, Result};
 use base64::{engine::general_purpose::STANDARD, Engine as _};
 
+use std::path::{Path, PathBuf};
+
 use agentkeys_backend_client::{
     normalize_omni_0x, BackendClient, CapMintOp, CapMintRequest, CredFetchInput, CredStoreInput,
 };
+use agentkeys_types::CredManifest;
 
 /// Fetch + decrypt the credential `service` the actor is authorized for, returning
 /// the plaintext secret. `operator_omni` == `actor_omni` for a master-self fetch;
@@ -118,3 +121,91 @@ pub async fn cred_store(
         .with_context(|| format!("cred worker store for service `{service}`"))?;
     Ok(result.s3_key)
 }
+
+// ─── #216 default-key selection — the OFF-CHAIN manifest (discovery only) ────
+// The on-chain AgentKeysScope stores only keccak(service) hashes, so the agent
+// can't enumerate its authorized service NAMES or learn its default LLM key from
+// chain. The master records both here, off-chain; every fetch still re-verifies
+// on-chain (isServiceInScope), so this never widens authorization.
+
+/// Resolve the cred-manifest path: an explicit `--manifest` /
+/// `$AGENTKEYS_CRED_MANIFEST` (clap merges both into `explicit`), else
+/// `~/.agentkeys/cred-manifest.json`.
+pub fn cred_manifest_path(explicit: Option<&str>) -> PathBuf {
+    if let Some(p) = explicit {
+        return PathBuf::from(p);
+    }
+    if let Ok(home) = std::env::var("HOME") {
+        return PathBuf::from(home)
+            .join(".agentkeys")
+            .join("cred-manifest.json");
+    }
+    PathBuf::from("cred-manifest.json")
+}
+
+/// Render the authorized-services listing (the chain can't enumerate names —
+/// this is the off-chain discovery layer). Marks the master-designated default.
+pub fn cred_list(path: &Path) -> Result<String> {
+    let man = CredManifest::load(path)
+        .with_context(|| format!("read cred manifest {}", path.display()))?;
+    if man.services.is_empty() {
+        return Ok(format!(
+            "no authorized credential services recorded ({} absent or empty).\n\
+             The master records them at grant time:\n  \
+             agentkeys cred manifest --services <a,b,c> --default <a>",
+            path.display()
+        ));
+    }
+    let default = man.default_name();
+    let mut out = format!("authorized credential services ({}):\n", path.display());
+    for (i, s) in man.services.iter().enumerate() {
+        let mark = if Some(s.as_str()) == default {
+            "  ← default"
+        } else {
+            ""
+        };
+        out.push_str(&format!("  {}. {}{}\n", i + 1, s, mark));
+    }
+    Ok(out.trim_end().to_string())
+}
+
+/// Write the off-chain cred manifest: authorized service NAMES + the
+/// master-designated default (public names only, never secrets). The master /
+/// operator runs this at grant time so the agent's no-arg fetch picks the default.
+pub fn cred_manifest_write(
+    path: &Path,
+    services_csv: &str,
+    default: Option<String>,
+) -> Result<String> {
+    let services: Vec<String> = services_csv
+        .split(',')
+        .map(|s| s.trim().to_string())
+        .filter(|s| !s.is_empty())
+        .collect();
+    if services.is_empty() {
+        anyhow::bail!("--services must list at least one service name");
+    }
+    if let Some(d) = default.as_deref() {
+        if !services.iter().any(|s| s == d) {
+            anyhow::bail!(
+                "--default '{d}' is not in --services [{}]",
+                services.join(", ")
+            );
+        }
+    }
+    if let Some(parent) = path.parent() {
+        if !parent.as_os_str().is_empty() {
+            std::fs::create_dir_all(parent)
+                .with_context(|| format!("create {}", parent.display()))?;
+        }
+    }
+    let man = CredManifest::new(services, default);
+    man.save(path)
+        .with_context(|| format!("write cred manifest {}", path.display()))?;
+    Ok(format!(
+        "recorded cred manifest {} — {} service(s), default `{}`",
+        path.display(),
+        man.services.len(),
+        man.default_name().unwrap_or("(none)")
+    ))
+}
diff --git a/crates/agentkeys-cli/src/main.rs b/crates/agentkeys-cli/src/main.rs
index cd761f79..35502774 100644
--- a/crates/agentkeys-cli/src/main.rs
+++ b/crates/agentkeys-cli/src/main.rs
@@ -406,8 +406,21 @@ enum CredAction {
     /// `cred:<service>` scope; prints the plaintext to stdout. The agent's
     /// identity/session come from the wire context (flags or env).
     Fetch {
-        /// The credential service id (e.g. `openrouter`).
-        service: String,
+        /// The credential service id (e.g. `openrouter`). OPTIONAL — when omitted,
+        /// it is resolved from the off-chain cred manifest (#216): `--select N`
+        /// (1-based) picks from the authorized list, else the master-designated
+        /// default (the no-UI path). An explicit service is used as-is (still
+        /// on-chain-verified at fetch).
+        service: Option<String>,
+        /// Pick the Nth authorized service (1-based) from the cred manifest
+        /// instead of the master-designated default. Ignored when a service is
+        /// given explicitly.
+        #[arg(long)]
+        select: Option<usize>,
+        /// Cred-manifest path (authorized service names + default). Default:
+        /// $AGENTKEYS_CRED_MANIFEST or ~/.agentkeys/cred-manifest.json.
+        #[arg(long, env = "AGENTKEYS_CRED_MANIFEST")]
+        manifest: Option<String>,
         #[arg(long, env = "AGENTKEYS_OPERATOR_OMNI")]
         operator_omni: String,
         #[arg(long, env = "AGENTKEYS_ACTOR_OMNI")]
@@ -425,6 +438,30 @@ enum CredAction {
         #[arg(long, env = "REGION", default_value = "us-east-1")]
         region: String,
     },
+    /// List the agent's authorized credential services from the off-chain
+    /// manifest (#216). The chain stores only keccak(service) hashes — it can
+    /// verify a known name but not enumerate names — so the manifest is the
+    /// discovery layer. Marks the master-designated default.
+    List {
+        #[arg(long, env = "AGENTKEYS_CRED_MANIFEST")]
+        manifest: Option<String>,
+    },
+    /// Record the off-chain cred manifest (#216): the authorized service names +
+    /// the master-designated default (public NAMES only — never secrets). The
+    /// master/operator runs this at grant time so the agent's no-arg `cred fetch`
+    /// picks the designated default LLM key.
+    Manifest {
+        /// Comma-separated authorized service names in order (e.g.
+        /// `openrouter,anthropic`).
+        #[arg(long)]
+        services: String,
+        /// The default service (the no-UI LLM key). Defaults to the first in
+        /// `--services`.
+        #[arg(long)]
+        default: Option<String>,
+        #[arg(long, env = "AGENTKEYS_CRED_MANIFEST")]
+        manifest: Option<String>,
+    },
     /// Vault a credential (#216, the store half of `fetch`). Master-self by
     /// default (operator == actor); seeds the agent's authorized key (e.g. the
     /// LLM key the agent later cred-fetches). Prints the worker S3 key.
@@ -1468,6 +1505,8 @@ async fn main() {
         Commands::Cred { action } => match action {
             CredAction::Fetch {
                 service,
+                select,
+                manifest,
                 operator_omni,
                 actor_omni,
                 device_key_hash,
@@ -1477,18 +1516,33 @@ async fn main() {
                 vault_role_arn,
                 region,
             } => {
-                agentkeys_cli::cred_admin::cred_fetch(
-                    service,
-                    operator_omni,
-                    actor_omni,
-                    device_key_hash,
-                    session_bearer,
-                    broker_url,
-                    cred_url,
-                    vault_role_arn,
-                    region,
-                )
-                .await
+                // #216 default-key selection (off-chain). Resolve which service to
+                // fetch — explicit > --select N (1-based) > master-designated
+                // default — from the cred manifest, then fetch it (still on-chain
+                // verified). An explicit service needs no manifest.
+                let mpath = agentkeys_cli::cred_admin::cred_manifest_path(manifest.as_deref());
+                match agentkeys_types::CredManifest::load(&mpath)
+                    .map_err(|e| anyhow::anyhow!("load cred manifest {}: {e}", mpath.display()))
+                    .and_then(|m| {
+                        m.resolve(service.as_deref(), *select)
+                            .map_err(|e| anyhow::anyhow!("{e}"))
+                    }) {
+                    Ok(resolved) => {
+                        agentkeys_cli::cred_admin::cred_fetch(
+                            &resolved,
+                            operator_omni,
+                            actor_omni,
+                            device_key_hash,
+                            session_bearer,
+                            broker_url,
+                            cred_url,
+                            vault_role_arn,
+                            region,
+                        )
+                        .await
+                    }
+                    Err(e) => Err(e),
+                }
             }
             CredAction::Store {
                 service,
@@ -1530,6 +1584,18 @@ async fn main() {
                     Err(e) => Err(e),
                 }
             }
+            CredAction::List { manifest } => {
+                let mpath = agentkeys_cli::cred_admin::cred_manifest_path(manifest.as_deref());
+                agentkeys_cli::cred_admin::cred_list(&mpath)
+            }
+            CredAction::Manifest {
+                services,
+                default,
+                manifest,
+            } => {
+                let mpath = agentkeys_cli::cred_admin::cred_manifest_path(manifest.as_deref());
+                agentkeys_cli::cred_admin::cred_manifest_write(&mpath, services, default.clone())
+            }
         },
     };
 
diff --git a/crates/agentkeys-types/src/cred_manifest.rs b/crates/agentkeys-types/src/cred_manifest.rs
new file mode 100644
index 00000000..4bf57ed9
--- /dev/null
+++ b/crates/agentkeys-types/src/cred_manifest.rs
@@ -0,0 +1,238 @@
+//! Off-chain credential manifest (#216 default-key selection).
+//!
+//! The on-chain `AgentKeysScope` stores only `keccak(service)` HASHES, so an
+//! agent cannot enumerate its authorized service NAMES, nor learn which one is
+//! its default LLM key, from chain alone — keccak is one-way and there is no
+//! "default" field. The master KNOWS the plaintext names + the designated
+//! default at grant time (they are the input to `setScope` before hashing) and
+//! records them HERE, off-chain, where the agent reads them at wire time.
+//!
+//! This is **discovery-only** and never widens authorization: every fetch still
+//! re-verifies on-chain via `isServiceInScope` (broker cap-mint + worker), so a
+//! service name that appears in the manifest but NOT in the on-chain scope is
+//! rejected regardless. The manifest answers "which of my authorized creds is
+//! the default LLM key?", a question the hash-only chain layer cannot.
+
+use std::path::Path;
+
+use serde::{Deserialize, Serialize};
+
+/// The agent's authorized credential services + master-designated default.
+#[derive(Debug, Clone, Serialize, Deserialize, PartialEq, Eq, Default)]
+pub struct CredManifest {
+    /// Authorized credential service names (plaintext), in the master's order —
+    /// the same names the master granted as `cred:<service>` scopes on-chain.
+    #[serde(default)]
+    pub services: Vec<String>,
+    /// The master-designated default service: the no-UI LLM key the agent uses
+    /// when the developer makes no selection. When `None`, or absent from
+    /// `services`, the FIRST service is treated as the default.
+    #[serde(default)]
+    pub default_service: Option<String>,
+}
+
+/// Why a service could not be resolved from the manifest.
+#[derive(Debug, Clone, PartialEq, Eq)]
+pub enum CredManifestError {
+    /// No explicit service, no `--select`, and the manifest lists no services.
+    Empty,
+    /// A 1-based `--select N` outside `1..=services.len()`.
+    SelectOutOfRange { select: usize, len: usize },
+}
+
+impl std::fmt::Display for CredManifestError {
+    fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
+        match self {
+            CredManifestError::Empty => write!(
+                f,
+                "no credential service to fetch: pass an explicit service, or record a \
+                 manifest (the master designates the default authorized cred)"
+            ),
+            CredManifestError::SelectOutOfRange { select, len } => write!(
+                f,
+                "--select {select} is out of range — the manifest lists {len} authorized \
+                 service(s) (use 1..={len})"
+            ),
+        }
+    }
+}
+
+impl std::error::Error for CredManifestError {}
+
+impl CredManifest {
+    pub fn new(services: Vec<String>, default_service: Option<String>) -> Self {
+        Self {
+            services,
+            default_service,
+        }
+    }
+
+    /// The default service name: the master-designated default when it is set
+    /// AND present in `services`, otherwise the first service. `None` only when
+    /// the manifest is empty.
+    pub fn default_name(&self) -> Option<&str> {
+        if let Some(d) = self.default_service.as_deref() {
+            if self.services.iter().any(|s| s == d) {
+                return Some(d);
+            }
+        }
+        self.services.first().map(String::as_str)
+    }
+
+    /// Resolve which credential service to fetch, by precedence:
+    ///   1. `explicit` — an operator/developer-typed service name, used as-is
+    ///      (it is still on-chain-verified at fetch; the manifest never gates it);
+    ///   2. `select` — a **1-based** index into `services` (matches the #216
+    ///      `--select 1` notation, where `1` = the first authorized service);
+    ///   3. the master-designated default ([`default_name`](Self::default_name)).
+    ///
+    /// Errors only when nothing resolves (empty manifest, or out-of-range select).
+    pub fn resolve(
+        &self,
+        explicit: Option<&str>,
+        select: Option<usize>,
+    ) -> Result<String, CredManifestError> {
+        if let Some(s) = explicit {
+            return Ok(s.to_string());
+        }
+        if let Some(n) = select {
+            if n == 0 || n > self.services.len() {
+                return Err(CredManifestError::SelectOutOfRange {
+                    select: n,
+                    len: self.services.len(),
+                });
+            }
+            return Ok(self.services[n - 1].clone());
+        }
+        self.default_name()
+            .map(str::to_string)
+            .ok_or(CredManifestError::Empty)
+    }
+
+    /// Load from a JSON file. A MISSING file yields an empty manifest (so a
+    /// no-manifest environment degrades to "explicit service required", never a
+    /// hard error). A present-but-malformed file IS an error (don't mask it).
+    pub fn load(path: &Path) -> std::io::Result<Self> {
+        match std::fs::read(path) {
+            Ok(bytes) => serde_json::from_slice(&bytes)
+                .map_err(|e| std::io::Error::new(std::io::ErrorKind::InvalidData, e)),
+            Err(e) if e.kind() == std::io::ErrorKind::NotFound => Ok(Self::default()),
+            Err(e) => Err(e),
+        }
+    }
+
+    /// Write the manifest as pretty JSON (0600 is the caller's responsibility —
+    /// the manifest holds only public service NAMES, never secrets).
+    pub fn save(&self, path: &Path) -> std::io::Result<()> {
+        let json = serde_json::to_vec_pretty(self)
+            .map_err(|e| std::io::Error::new(std::io::ErrorKind::InvalidData, e))?;
+        std::fs::write(path, json)
+    }
+}
+
+#[cfg(test)]
+mod tests {
+    use super::*;
+
+    fn m(services: &[&str], default: Option<&str>) -> CredManifest {
+        CredManifest::new(
+            services.iter().map(|s| s.to_string()).collect(),
+            default.map(str::to_string),
+        )
+    }
+
+    #[test]
+    fn default_name_prefers_designated_when_present() {
+        assert_eq!(
+            m(&["openrouter", "anthropic"], Some("anthropic")).default_name(),
+            Some("anthropic")
+        );
+    }
+
+    #[test]
+    fn default_name_falls_back_to_first_when_designated_absent_or_none() {
+        // designated default not in the list → first
+        assert_eq!(
+            m(&["openrouter", "anthropic"], Some("ghost")).default_name(),
+            Some("openrouter")
+        );
+        // no designated default → first
+        assert_eq!(m(&["openrouter"], None).default_name(), Some("openrouter"));
+    }
+
+    #[test]
+    fn default_name_none_when_empty() {
+        assert_eq!(m(&[], None).default_name(), None);
+    }
+
+    #[test]
+    fn resolve_precedence_explicit_beats_select_and_default() {
+        let man = m(&["openrouter", "anthropic"], Some("anthropic"));
+        assert_eq!(man.resolve(Some("typed"), Some(1)).unwrap(), "typed");
+    }
+
+    #[test]
+    fn resolve_select_is_one_based() {
+        let man = m(&["openrouter", "anthropic"], Some("anthropic"));
+        assert_eq!(man.resolve(None, Some(1)).unwrap(), "openrouter");
+        assert_eq!(man.resolve(None, Some(2)).unwrap(), "anthropic");
+    }
+
+    #[test]
+    fn resolve_no_args_uses_master_default() {
+        let man = m(&["openrouter", "anthropic"], Some("anthropic"));
+        assert_eq!(man.resolve(None, None).unwrap(), "anthropic");
+    }
+
+    #[test]
+    fn resolve_select_out_of_range_errors() {
+        let man = m(&["openrouter"], None);
+        assert_eq!(
+            man.resolve(None, Some(0)),
+            Err(CredManifestError::SelectOutOfRange { select: 0, len: 1 })
+        );
+        assert_eq!(
+            man.resolve(None, Some(2)),
+            Err(CredManifestError::SelectOutOfRange { select: 2, len: 1 })
+        );
+    }
+
+    #[test]
+    fn resolve_empty_manifest_errors_without_explicit() {
+        assert_eq!(
+            m(&[], None).resolve(None, None),
+            Err(CredManifestError::Empty)
+        );
+        // …but an explicit service always resolves, even with an empty manifest.
+        assert_eq!(
+            m(&[], None).resolve(Some("openrouter"), None).unwrap(),
+            "openrouter"
+        );
+    }
+
+    #[test]
+    fn load_missing_file_is_empty_manifest() {
+        let p = std::env::temp_dir().join("agentkeys-cred-manifest-does-not-exist-xyz.json");
+        let _ = std::fs::remove_file(&p);
+        assert_eq!(CredManifest::load(&p).unwrap(), CredManifest::default());
+    }
+
+    #[test]
+    fn save_then_load_roundtrips() {
+        let man = m(&["openrouter", "anthropic"], Some("anthropic"));
+        let p = std::env::temp_dir().join(format!(
+            "agentkeys-cred-manifest-rt-{}.json",
+            std::process::id()
+        ));
+        man.save(&p).unwrap();
+        assert_eq!(CredManifest::load(&p).unwrap(), man);
+        let _ = std::fs::remove_file(&p);
+    }
+
+    #[test]
+    fn deserializes_with_absent_default_field() {
+        let man: CredManifest = serde_json::from_str(r#"{"services":["openrouter"]}"#).unwrap();
+        assert_eq!(man.default_service, None);
+        assert_eq!(man.default_name(), Some("openrouter"));
+    }
+}
diff --git a/crates/agentkeys-types/src/lib.rs b/crates/agentkeys-types/src/lib.rs
index 74a134ea..c01c343a 100644
--- a/crates/agentkeys-types/src/lib.rs
+++ b/crates/agentkeys-types/src/lib.rs
@@ -2,8 +2,10 @@ use std::fmt;
 
 use serde::{Deserialize, Serialize};
 
+pub mod cred_manifest;
 pub mod provision;
 
+pub use cred_manifest::{CredManifest, CredManifestError};
 pub use provision::{ProvisionErrorCode, ProvisionEvent, TripwireKind};
 
 #[derive(Debug, Clone, Serialize, Deserialize, PartialEq, Eq, Hash)]

From e24f41463741b3f014dd5e84211be16d1bc1f01b Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 16:46:51 +0800
Subject: [PATCH 07/17] =?UTF-8?q?feat:=20#216=20default-key=20selection=20?=
 =?UTF-8?q?=E2=80=94=20harness=20e2e=20proof=20+=20docs=20(the=20no-UI=20p?=
 =?UTF-8?q?ath)?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Wires the off-chain cred manifest through the agent runtime so the
no-UI default-key path is proven end-to-end:

- phase1-wire-demo.sh 1.4b': the master records the sandbox cred
  manifest (agentkeys cred manifest --services <granted creds>
  --default $SERVICE → ~/.agentkeys/cred-manifest.json) so the
  in-sandbox no-arg fetch resolves the designated default. Skips
  gracefully on an older sandbox binary.
- sandbox-agent-isolation.sh: + the #216 default-key proof — cred list
  (off-chain discovery the chain can't do) + a bare 'cred fetch' (no
  service) asserting it resolves the master default to the SAME secret
  as the explicit fetch. Conditional on a manifest (the explicit cred
  half still stands alone).
- mock-wire-demo.sh: writes a temp manifest + points the isolation run
  at it via AGENTKEYS_CRED_MANIFEST (never the runner's real
  ~/.agentkeys), so CI proves the default-key path headless too.

Docs (same change, keep-in-sync): arch.md (off-chain default-key =
discovery, on-chain = verification), user-manual.md (cred
manifest/list/default verbs), harness/CLAUDE.md (sandbox role + phase1
+ mock-wire rows), operator-runbook-harness.md (On Sandbox proof).

Detection greps verified against real CLI output; bash -n all three
scripts. The CLI core (types + verbs) is the prior commit.
---
 docs/arch.md                               |  1 +
 docs/operator-runbook-harness.md           | 11 +++++---
 docs/user-manual.md                        | 12 ++++++++
 harness/CLAUDE.md                          | 10 ++++---
 harness/mock-wire-demo.sh                  | 15 ++++++++--
 harness/phase1-wire-demo.sh                | 21 ++++++++++++++
 harness/scripts/sandbox-agent-isolation.sh | 32 +++++++++++++++++++++-
 7 files changed, 90 insertions(+), 12 deletions(-)

diff --git a/docs/arch.md b/docs/arch.md
index 393043fc..b6685840 100644
--- a/docs/arch.md
+++ b/docs/arch.md
@@ -1004,6 +1004,7 @@ Each data class gets its own worker — independent IAM, independent deploy life
 - **Operations:** R/W agent state at high frequency. **STS session policies enable direct S3 access** from the agent process for the duration of the session — the worker is NOT in the LLM-call hot path. The worker mints a TTL-bounded STS session at session start; the agent's localhost SDK uses STS creds for many ops within the TTL.
 - **OIDC federation (issue #90):** Same `X-Aws-*` header passthrough as creds. Each data-class has its own IAM role (`agentkeys-memory-role`); memory-role STS creds are rejected at the vault bucket and vice versa. See §17.5.
 - **Namespace = signed service (issue #147):** the memory `service` carries the namespace as **`memory:<namespace>`** (e.g. `memory:travel`). Because `service` is a signed cap field, the namespace is tamper-proof and is authorized by the existing on-chain `isServiceInScope(operator, actor, keccak("memory:<ns>"))` gate. The worker keys storage (`bots/<actor_omni_hex>/memory/memory:<ns>.enc`), the envelope AAD, and the scope check all off that one signed field — so two namespaces are physically segregated with no new mechanism. Minted in `crates/agentkeys-mcp-server/src/tools/memory.rs`; enforced in `crates/agentkeys-worker-memory/src/handlers.rs`.
+- **Default-key selection is OFF-chain (issue #216):** the scope stores only `keccak(service)` HASHES, so the chain can *verify* a known service name (`isServiceInScope`) but cannot *enumerate* names nor designate a default. The agent's authorized service NAMES + the master-designated default LLM key therefore live in an off-chain **cred manifest** (`agentkeys cred manifest|list|fetch`, `agentkeys_types::CredManifest` — public names only, never a secret); a bare `cred fetch` (no service) resolves the master default (the no-UI path a screenless device relies on), `--select N` overrides. This is **discovery only** — every fetch still re-verifies `cred:<service>` on-chain, so the manifest never widens authorization. Putting names/a-default on-chain (a contract change) is unnecessary: the master already holds the plaintext at grant time. On-chain = the verification gate; off-chain = discovery.
 - **Memory engine — pluggable, not built in v0 (Position C):** the worker is **store + gate only** (deterministic, no ranking, no LLM). Ranking / extraction / consolidation is delegated to an external engine via an adapter trait (`extract` / `rank` / `synthesize`); canonical reference engine **OpenViking**; delivery via the `pre_llm_call` hook (#141), never a runtime `memory.provider`. Full design + Hermes-provider compatibility strategy: [`plan/agentkeys-memory-design.md`](plan/agentkeys-memory-design.md) (§6a engine seam; §22 pluggable-axis row). Background: [`research/ai-memory-systems-survey.md`](research/ai-memory-systems-survey.md), decision record [`research/memory-build-vs-gate-decision.md`](research/memory-build-vs-gate-decision.md), [`research/universal-gate-pattern.md`](research/universal-gate-pattern.md). Operator test guide (OpenViking behind the gate): [`operator-runbook-openviking.md`](operator-runbook-openviking.md).
 - **Classifier-service — write-side dual of the engine (§15.6; substrate landed #207 items 2-3-6):** the engine ranks at *read*; the **classifier-service** compiles natural-language intent → the structured policy attribute the gate enforces (memory→namespace, creds→service-category, IoT→device-tier), and tags novel requests — *NL-programmable, deterministically-enforced* authorization fleet-wide, with no model on the gate's hot path. **Landed:** the `agentkeys-worker-classify` crate (a **COMPUTE** gate — same cap + chain-verify chain as the storage workers via `agentkeys_worker_creds::verify`, but NO S3 bucket/KEK; port 9097, `classify.<zone>`), `CapOp::Classify` + `/v1/cap/classify` (data-class-bound), and the bundled **category catalog** (`catalog.rs` — entity→category + sensitivity floor + signed vendor overlays). The classify engine is the deterministic **tier-0** (catalog lookup + a small keyword-rules COMPILE); the LLM-backed engine is the deferred enhancement (no model on the gate, ever). **Daemon bridge + auto-distribution (#207 items 5/7/8, landed):** the catalog is a shared `agentkeys-catalog` crate the daemon also uses; the daemon's `--classify-url` bridge (mint a master-self `Classify` cap → worker TAG, with the local catalog tier-0 as fallback) backs `POST /v1/master/classify/tag` (cred auto-categorize) + `POST /v1/master/classify/propose` (classify an agent's surface → scopes tiered `auto`/`k11` by the catalog sensitivity floor). `propose` writes no scope — only the K11-gated grant path does, so an unconfirmed sensitive category never becomes a grant (§3 invariant). Design + three-phase plan + caching/efficiency model: [`plan/classifier-service.md`](plan/classifier-service.md).
 
diff --git a/docs/operator-runbook-harness.md b/docs/operator-runbook-harness.md
index 00872b80..d08dbb82 100644
--- a/docs/operator-runbook-harness.md
+++ b/docs/operator-runbook-harness.md
@@ -56,10 +56,13 @@ bash "$HOME/sandbox-agent-isolation.sh"   # the REAL agent: the deferred roundtr
 The proof covers BOTH halves of the real agent: the **memory roundtrip** (cap-mint → STS
 signed as the agent → worker → S3) AND the **#216 cred fetch** — the agent pulls its
 authorized LLM key from the master's vault (`agentkeys cred fetch`, gated by the
-`cred:<service>` scope granted at pairing) and an **un-granted probe service is denied
-with `service_not_in_scope`** (the permission gate, tested from both sides). It reads its
-coordinates from `~/.agentkeys/harness-env`, which the wire phase stages (step 1.4b) —
-no hand-exported env needed.
+`cred:<service>` scope granted at pairing), an **un-granted probe service is denied
+with `service_not_in_scope`** (the permission gate, tested from both sides), and the
+**#216 default-key (no-UI) path** — a bare `agentkeys cred fetch` (no service) resolves
+the **master-designated default** from the off-chain manifest that step 1.4b recorded
+(`agentkeys cred list` shows the authorized set). It reads its coordinates from
+`~/.agentkeys/harness-env`, which the wire phase stages (step 1.4b) — no hand-exported
+env needed.
 
 (If `v2-demo.sh` reported the wire phase **skipped — no aiosandbox**, the agent wasn't paired:
 set the sandbox up and re-pair with `bash harness/v2-demo.sh --from 5`. The wire demo is
diff --git a/docs/user-manual.md b/docs/user-manual.md
index 06aa22fc..cb2067ea 100644
--- a/docs/user-manual.md
+++ b/docs/user-manual.md
@@ -213,6 +213,18 @@ agent can fetch a credential only with a granted `cred:<service>` scope. **Vault
 credential** with the form on that page (service id + secret). Listing is
 **master-only** — an agent's single-service cap can't enumerate your vault.
 
+**Default-key selection (#216).** The on-chain scope stores only a
+`keccak(service)` hash, so it can *verify* a service name but can't *enumerate*
+names or mark a default. So an agent's authorized service NAMES + your designated
+default LLM key live in an **off-chain manifest** (`agentkeys cred manifest
+--services openrouter,anthropic --default openrouter` — public names only, never a
+secret). The agent then reads them: `agentkeys cred list` shows its authorized
+services, and a bare `agentkeys cred fetch` (no service argument) pulls the
+**master-designated default** — the no-UI path a screenless device relies on
+(`--select N` overrides to the Nth authorized service). Every fetch still
+re-verifies the `cred:<service>` scope on-chain, so the manifest is discovery only
+and never widens what the agent can reach.
+
 ## Audit receipts (parent-control)
 
 Every Touch-ID chain action — **accepting an agent**, **committing a scope
diff --git a/harness/CLAUDE.md b/harness/CLAUDE.md
index dcf3cf94..1fd6a25a 100644
--- a/harness/CLAUDE.md
+++ b/harness/CLAUDE.md
@@ -230,8 +230,10 @@ Every orchestrator + the operator runbook MUST keep this split exact:
   roundtrip runs THERE (`phase1-wire-demo.sh --real` pairs it + stages `~/.agentkeys/harness-env`;
   `sandbox-agent-isolation.sh` runs the deferred roundtrip with the sandbox-held key via
   `sbx_exec`: the memory roundtrip PLUS the #216 cred half — `agentkeys cred fetch` of the
-  authorized service AND the scope-denial negative for an un-granted probe). The master never
-  signs for the agent. This is the real agent-side coverage.
+  authorized service, the scope-denial negative for an un-granted probe, AND the #216
+  default-key (no-UI) path — a bare `agentkeys cred fetch` resolving the master-designated
+  default from the off-chain manifest). The master never signs for the agent. This is the
+  real agent-side coverage.
 - **CI (`--ci`) — headless, no biometric, no sandbox.** Software register (no Touch ID), stub
   K11 (`WEBAUTHN_MODE=0`), the **mock agent** for the agent-side steps (the sole
   sanctioned synthetic agent, contract rule 5), and **stage-1 auto-skips
@@ -262,11 +264,11 @@ sandbox) is **GREEN**, never fail/incomplete.
 | `v2-stage1-demo.sh` | M1 foundation demo | `--only-step N` |
 | `v2-stage2-demo.sh` | hardening demo | `--only-step N` |
 | `v2-stage3-demo.sh` | OIDC + per-actor/data-class isolation proof (23 steps; 16–17 = #196 master-self + cross-actor scope; **18 = granted-agent positives — the memory cap AND the #216 cred-fetch cap for the granted service (200), an un-granted cred probe → ServiceNotInScope, and (CI/mock only) the live #216 REVOKE transition: setScope drops the service → the same cred-fetch mint is denied → restore**; **19–21 = #201 Config data-class isolation** — master-self layer-3/4 + cap data-class-mismatch, run on the operator, `skip` until config infra is provisioned/deployed; **22 = #207 classifier-worker isolation** — master-self `cap_op_mismatch` (storage cap → classify worker) + `cap_data_class_mismatch` (cross-data-class Classify cap), compute-gate so NO STS, `skip` until the worker is deployed; **23 = cleanup + summary**). **Steps 11-12 / 14-15 sign STS creds AS the agent: on the operator they `defer` to the sandbox (the §10.2 agent key lives in the sandbox) — GREEN, never fail. `--mock-agent` (CI-only, auto-on under `--ci`) provisions a master-held DEV agent so headless CI can prove the roundtrip; a real §10.2 agent proves it in-sandbox via `phase1-wire-demo.sh --real`. When 11-12 run they ALSO assert the **#229 durable-audit receipt**: fetch response `audit_envelope_hash` → envelope fetchable from `AGENTKEYS_WORKER_AUDIT_URL`, hash = keccak256(cbor) (the appendV2/appendRootV2 anchor commitment), no plaintext — skip reasons `audit-receipt-missing` / `audit-url-unset`.** | `--from/--to/--only-step` / `--mock-agent` |
-| `phase1-wire-demo.sh` | agent-side `agentkeys wire` demo (real memory only — the in-memory `--light` path was removed, #207); **phase 5 of `v2-demo.sh`** — pairs the §10.2 agent in the sandbox so the On-Sandbox proof (`sandbox-agent-isolation.sh`) can run. **v2-demo runs it `--real --webauthn`** so the master grants the agent's `memory:<ns>` + `cred:$SERVICE` scopes (Touch ID); without the grant `memory.get` / `cred fetch` → `service_not_in_scope`. **Step 1.4b stages `~/.agentkeys/harness-env` (0600) in the sandbox** (MCP/broker/cred coordinates + the operator session bearer) so the in-sandbox proofs run from a bare shell, and **1.4c uploads `sandbox-agent-isolation.sh`**. **Phase 4.0 (#216) fetches the LLM key IN-SANDBOX, as the agent** (`agentkeys cred fetch` via its granted cred scope) and plants it into `~/.hermes/.env` without the plaintext leaving the sandbox; host-CLI fetch is the compat fallback, operator env the labelled DEV-only fallback. | `--real` (default) / `--webauthn` |
+| `phase1-wire-demo.sh` | agent-side `agentkeys wire` demo (real memory only — the in-memory `--light` path was removed, #207); **phase 5 of `v2-demo.sh`** — pairs the §10.2 agent in the sandbox so the On-Sandbox proof (`sandbox-agent-isolation.sh`) can run. **v2-demo runs it `--real --webauthn`** so the master grants the agent's `memory:<ns>` + `cred:$SERVICE` scopes (Touch ID); without the grant `memory.get` / `cred fetch` → `service_not_in_scope`. **Step 1.4b stages `~/.agentkeys/harness-env` (0600) in the sandbox** (MCP/broker/cred coordinates + the operator session bearer) so the in-sandbox proofs run from a bare shell, and **1.4c uploads `sandbox-agent-isolation.sh`**. **Phase 4.0 (#216) fetches the LLM key IN-SANDBOX, as the agent** (`agentkeys cred fetch` via its granted cred scope) and plants it into `~/.hermes/.env` without the plaintext leaving the sandbox; host-CLI fetch is the compat fallback, operator env the labelled DEV-only fallback. **Step 1.4b also records the off-chain cred manifest** (`agentkeys cred manifest --services <granted creds> --default $SERVICE`) so the in-sandbox no-arg `cred fetch` resolves the master-designated default (#216 default-key selection, no-UI path). | `--real` (default) / `--webauthn` |
 | `web-memory-bootstrap.sh` | issue #196 web-memory pre-flight + proof; runbook [`../docs/operator-runbook-web-memory.md`](../docs/operator-runbook-web-memory.md) | `--from/--to/--only-step` |
 | `memory-plant-demo.sh` | plant a proof memory archive through the REAL chain + read-back (the CLI/CI proof of the plant flow the web "⊕ plant prepared memory" button drives); **phase 4 of `v2-demo.sh`**. Plants into **dedicated `demo-*` namespaces** (never the real travel/personal/family) and **always deletes them on exit** (success OR failure, EXIT trap; `KEEP_DEMO_MEMORY=1` keeps), so test memory never leaks into the master's real store — the real prepared archive is planted ONLY by the user (the button), never by a demo or onboarding. Re-testable; idempotent (`--from 4.1`). | `--from-step/--only-step N` / `--ci` |
 | `web-parity-demo.sh` | **phase 6 of `v2-demo.sh`** (NOT a standalone front door) — boots `agentkeys-daemon --ui-bridge` SEEDED with the master's J1 + device via the `--ui-bridge-seed-*` daemon seam (skips re-onboarding) + plants a **dedicated `webparity` probe ns** through the **web** endpoint `POST /v1/master/memory/plant`, **deleted on exit** (success or failure). A 200 proves the daemon's chain (cap-mint → STS → worker → S3) == the agent/harness chain — the web↔harness drift gate. **Step 4 (#214)** additionally polls `GET /v1/agent/pairing/pending` and asserts a well-formed `{requests:[…]}` — the master-side web-pairing route reaches the real broker rendezvous (the full claim→register e2e needs a live §10.2 agent request, exercised agent-side). Reuses phases 1-2's build/chain/broker/master (one daemon boot, no re-bootstrap); real-only. | `--from-step/--only-step N` / `--ci` |
-| `mock-wire-demo.sh` | **CI mock-sandbox wire proof (#216) — CI-ONLY.** Emulates the aiosandbox side ON the runner with the sanctioned mock agent: ensure agent + the canonical `MOCK_SCOPE_SERVICES` grant → mint operator + agent sessions (wallet_sig SIWE via `_lib.sh::wallet_sig_mint_jwt`) → boot the REAL `agentkeys-mcp-server` on `127.0.0.1:$MOCK_MCP_PORT` (http backend + per-actor STS relay, the phase1 1.4 shape) → master-self vault a probe cred under the DEDICATED `mock-wire-llm` service (never `openrouter` — can't clobber a real vault entry) → run the SAME `sandbox-agent-isolation.sh` with a staged harness-env + `EXPECTED_CRED_SHA256`. Proves the post-wire agent RUNTIME (MCP server + CLI) headless; sandbox key custody stays operator-only. **Phase 5 of `v2-demo.sh` under `--ci`** (`--wire mock`). Idempotent; MCP + temp files torn down on EXIT. | `--from-step/--only-step N` |
+| `mock-wire-demo.sh` | **CI mock-sandbox wire proof (#216) — CI-ONLY.** Emulates the aiosandbox side ON the runner with the sanctioned mock agent: ensure agent + the canonical `MOCK_SCOPE_SERVICES` grant → mint operator + agent sessions (wallet_sig SIWE via `_lib.sh::wallet_sig_mint_jwt`) → boot the REAL `agentkeys-mcp-server` on `127.0.0.1:$MOCK_MCP_PORT` (http backend + per-actor STS relay, the phase1 1.4 shape) → master-self vault a probe cred under the DEDICATED `mock-wire-llm` service (never `openrouter` — can't clobber a real vault entry) → run the SAME `sandbox-agent-isolation.sh` with a staged harness-env + `EXPECTED_CRED_SHA256` + a temp `AGENTKEYS_CRED_MANIFEST` (so the #216 default-key no-arg fetch runs headless too). Proves the post-wire agent RUNTIME (MCP server + CLI) headless; sandbox key custody stays operator-only. **Phase 5 of `v2-demo.sh` under `--ci`** (`--wire mock`). Idempotent; MCP + temp files torn down on EXIT. | `--from-step/--only-step N` |
 | `cred-fetch-demo.sh` | **#216 agent-side vaulted-key fetch, real e2e** (standalone). A master **vaults** a probe credential via the daemon (web path: cap-mint cred-store → STS → cred worker → S3), then the **agent** fetches it back with `agentkeys cred fetch` (CLI path: cap-mint cred-fetch → STS → cred worker → **decrypt**), asserting the EXACT secret round-trips. Proves the cred half of "the agent uses the key the master authorized it to use" (the Hermes wire is phase1-wire #216 Phase 4.0). Routes through the shared `agentkeys-backend-client` (no re-typed shapes, #204). Idempotent (a FIXED `cred-e2e-probe` service is overwritten each run — never accumulates); daemon killed on exit; real-only. | `--from-step/--only-step N` / `--ci` |
 | `cred-wire-demo.sh` | **#216 agent-side wire, the FULL e2e** (standalone, headless). Extends `cred-fetch-demo.sh` through the Hermes wire: master vaults the LLM key → **agent cred-fetches it** → **plant into the sandbox Hermes** (`~/.hermes/.env` + `hermes config set model.*`) → **Hermes runs on the vault key** (real LLM smoke), asserting the planted key == the vaulted key (sha) with **no `OPENROUTER_API_KEY` in the agent env**. The durable, no-Touch-ID complement to `phase1-wire-demo.sh` Phase 4.0b. Needs a reachable aiosandbox (`SANDBOX_URL`, default `:8080`) with Hermes installed. Idempotent (FIXED `openrouter` service; `.env` key-line rewritten not appended); daemon killed on exit; real-only. | `--from-step/--only-step N` / `--ci` |
 | `sandbox-build-push.sh` | **Path-A binary provisioner (utility, not a stage demo).** Cross-builds the agent binaries (`agentkeys` + `agentkeys-mcp-server` + `agentkeys-daemon`) for the sandbox's aarch64-Linux arch in the cached arm64 builder image (sharing phase1-wire-demo.sh's exact `agentkeys-sandbox-builder` image + `agentkeys-sandbox-*` cargo/target volumes → a warm tree re-pushes in seconds) and uploads them to the sandbox's `~/.local/bin` via the file API. **Build + push ONLY** — it never pairs or wires (that's the master's job in the parent-control web UI). Re-run after any local code change so the in-sandbox agent runs current source. | `SANDBOX_URL` / `RUST_BUILD_IMAGE` / `CROSS_RUST_TOOLCHAIN` |
diff --git a/harness/mock-wire-demo.sh b/harness/mock-wire-demo.sh
index 765bf1b5..c890b936 100755
--- a/harness/mock-wire-demo.sh
+++ b/harness/mock-wire-demo.sh
@@ -77,10 +77,10 @@ resolve_bin() {
   else command -v "$name" 2>/dev/null || true; fi
 }
 
-MPID=""; MLOG=""; ASB_FILE=""; HENV_FILE=""
+MPID=""; MLOG=""; ASB_FILE=""; HENV_FILE=""; MANIFEST_FILE=""
 cleanup() {
   [ -n "$MPID" ] && kill "$MPID" 2>/dev/null
-  rm -f "$MLOG" "$ASB_FILE" "$HENV_FILE"
+  rm -f "$MLOG" "$ASB_FILE" "$HENV_FILE" "$MANIFEST_FILE"
 }
 trap cleanup EXIT
 
@@ -191,6 +191,14 @@ fi
 if should_run 6; then
   step 6 "sandbox-agent-isolation.sh on the runner: memory via MCP + #216 cred fetch + scope negative"
   { [ -n "${OP_JWT:-}" ] && [ -n "${MOCK_ACTOR:-}" ] && [ -n "${PROBE_SHA:-}" ]; } || die "need steps 1-5 first"
+  # #216 off-chain cred manifest (default-key selection) — to a TEMP path (never
+  # the runner's real ~/.agentkeys), pointed at via AGENTKEYS_CRED_MANIFEST below
+  # so the isolation script's no-arg `cred fetch` resolves the master-designated
+  # default. Single authorized service = the probe LLM cred, so the default fetch
+  # == the explicit fetch (sha-exact in the isolation proof).
+  MANIFEST_FILE="$(mktemp -t mock-wire-cm.XXXX)"
+  "$CLI_BIN" cred manifest --services "$MOCK_CRED_SERVICE" --default "$MOCK_CRED_SERVICE" \
+    --manifest "$MANIFEST_FILE" >/dev/null || die "cred manifest write failed (#216 default-key)"
   HENV_FILE="$(mktemp -t mock-wire-henv.XXXX)"
   ( umask 077; {
     printf 'AGENTKEYS_MCP_URL=http://127.0.0.1:%s/mcp\n' "$MOCK_MCP_PORT"
@@ -204,10 +212,11 @@ if should_run 6; then
     printf 'VAULT_ROLE_ARN=%s\n' "${VAULT_ROLE_ARN}"
     printf 'REGION=%s\n' "$REGION"
     printf 'CRED_SERVICE=%s\n' "$MOCK_CRED_SERVICE"
+    printf 'AGENTKEYS_CRED_MANIFEST=%s\n' "$MANIFEST_FILE"
   } > "$HENV_FILE" )
   if HARNESS_ENV="$HENV_FILE" EXPECTED_CRED_SHA256="$PROBE_SHA" AGENT_BIN="$CLI_BIN" \
        bash "$REPO_ROOT/harness/scripts/sandbox-agent-isolation.sh" "$MOCK_WIRE_NS"; then
-    ok "the SAME proof the sandbox runs passed on the runner (mock agent): MCP memory roundtrip + authorized cred fetch (sha-exact) + un-granted denial"
+    ok "the SAME proof the sandbox runs passed on the runner (mock agent): MCP memory roundtrip + authorized cred fetch (sha-exact) + #216 default-key + un-granted denial"
   else
     die "mock-sandbox proof failed — see sandbox-agent-isolation.sh output above"
   fi
diff --git a/harness/phase1-wire-demo.sh b/harness/phase1-wire-demo.sh
index fa886bf2..a26f3c19 100755
--- a/harness/phase1-wire-demo.sh
+++ b/harness/phase1-wire-demo.sh
@@ -782,6 +782,27 @@ phase1_sandbox() {
   else
     fail "1.4b harness env" "could not stage $henv in the sandbox — the in-sandbox cred fetch (4.0) + bare sandbox-agent-isolation.sh runs need it"
   fi
+  # 1.4b' record the OFF-CHAIN cred manifest in the sandbox (#216 default-key
+  # selection). The master designates the agent's default LLM key: the granted
+  # cred services (the non-memory entries of SEED_SCOPE_SERVICES) with default
+  # $SERVICE. Written via the in-sandbox agentkeys binary to the CLI default path
+  # (~/.agentkeys/cred-manifest.json) so sandbox-agent-isolation.sh's no-arg
+  # `cred fetch` resolves the master default — the no-UI path. Public NAMES only,
+  # never a secret. The on-chain hash gate still runs on every fetch (discovery,
+  # not authorization). Gracefully skips on an older sandbox binary.
+  local cred_svcs="" _s
+  IFS=',' read -ra _scope_parts <<<"$SEED_SCOPE_SERVICES"
+  for _s in "${_scope_parts[@]}"; do
+    case "$_s" in memory:*) ;; *) cred_svcs+="${cred_svcs:+,}$_s" ;; esac
+  done
+  if [[ -n "$cred_svcs" ]]; then
+    sbx_exec "$AGENT_BIN_DST cred manifest --services '$cred_svcs' --default '$SERVICE' >/dev/null 2>&1" >/dev/null
+    if [[ "$(sbx_rc "$AGENT_BIN_DST cred list 2>/dev/null | grep -q -- '$SERVICE'")" == "0" ]]; then
+      ok "1.4b cred manifest" "recorded [$cred_svcs] default '$SERVICE' in the sandbox (#216 no-UI default-key)"
+    else
+      skip "1.4b cred manifest" "manifest write unconfirmed (older sandbox binary without 'cred manifest'?) — the explicit-service fetch still proves the cred half"
+    fi
+  fi
   # 1.4c upload the deferred-proof script so a wire-only run (no stage 3) still
   # leaves the sandbox self-testable: bash $HOME/sandbox-agent-isolation.sh
   # The upload `path` MUST be absolute — the aiosandbox file API rejects a bare
diff --git a/harness/scripts/sandbox-agent-isolation.sh b/harness/scripts/sandbox-agent-isolation.sh
index 4344d4f6..c533e503 100755
--- a/harness/scripts/sandbox-agent-isolation.sh
+++ b/harness/scripts/sandbox-agent-isolation.sh
@@ -124,4 +124,34 @@ else
   exit 1
 fi
 
-echo "== PASS: tested against the sandbox (the real agent), not the master-held mock — memory roundtrip + #216 authorized cred fetch + scope-denial negative. ==" >&2
+# #216 default-key selection (off-chain manifest). The master records the
+# authorized cred service NAMES + a designated default in an off-chain manifest
+# (the chain stores only keccak hashes — it can verify a known name but cannot
+# enumerate names). Prove the agent's NO-UI path: a no-argument `cred fetch`
+# resolves the master-designated default and returns the SAME secret as the
+# explicit fetch above. Conditional on a manifest being staged (the explicit cred
+# half already stands alone); the agent reads the manifest path from the CLI's
+# env default (~/.agentkeys/cred-manifest.json) or $AGENTKEYS_CRED_MANIFEST.
+list_out="$(mktemp 2>/dev/null || echo "/tmp/cred-list.$$")"
+if "$AGENT_BIN" cred list >"$list_out" 2>/dev/null && grep -qE '^[[:space:]]+[0-9]+\.' "$list_out"; then
+  echo "== #216 default-key selection — off-chain manifest discovery (the chain can't enumerate names) ==" >&2
+  sed 's/^/    /' "$list_out" >&2
+  if def_fetched="$("$AGENT_BIN" cred fetch 2>"$cred_err")" && [ -n "$def_fetched" ]; then
+    if [ "$(sha_hex "$def_fetched")" = "$(sha_hex "$fetched")" ]; then
+      echo "OK: no-arg 'cred fetch' resolved the master-designated default → SAME secret as the explicit fetch — the #216 no-UI default-key path works." >&2
+    else
+      echo "OK: no-arg 'cred fetch' returned the master-designated default (sha $(sha_hex "$def_fetched" | cut -c1-12)…, differs from CRED_SERVICE=$CRED_SERVICE — the master designated a different default LLM key; both authorized)." >&2
+    fi
+  else
+    derr="$(tr '\n' ' ' <"$cred_err" | cut -c1-200)"
+    rm -f "$cred_err" "$list_out"
+    echo "FAIL: no-arg 'cred fetch' (master-designated default) errored: $derr" >&2
+    exit 1
+  fi
+  rm -f "$cred_err"
+else
+  echo "note: no cred manifest staged — skipping the #216 default-key (no-UI) proof; the explicit fetch above is the authorized-cred proof." >&2
+fi
+rm -f "$list_out"
+
+echo "== PASS: tested against the sandbox (the real agent), not the master-held mock — memory roundtrip + #216 authorized cred fetch + default-key (no-UI) + scope-denial negative. ==" >&2

From 7b6cbbafca02113045fb7761497eb0c5606ed299 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 17:23:43 +0800
Subject: [PATCH 08/17] =?UTF-8?q?docs(wiki):=20master=20recovery=20+=20gua?=
 =?UTF-8?q?rdians=20=E2=80=94=20on-chain=20M-of-N=20execution,=20control-v?=
 =?UTF-8?q?s-secrets,=20off-chain=20spend=20caps?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

New docs/wiki/master-recovery-and-guardians.md (indexed in Home under
Foundations). Came out of the architecture Q&A: the recovery process
wasn't documented in the wiki.

Covers:
- M-of-N guardian social recovery is EXECUTED on-chain by
  P256Account.recover() (in-contract WebAuthn verify, threshold +
  dedup + replay-bound challenge, atomic signer rotation, permissionless
  submit) — the chain is the executor here, not just an audit log.
- Setup (addGuardian + recoveryThreshold) + the K11-gated ceremonies.
- The lost-device timeline (revoke/rotate → broker SSE cap-drop →
  daemon cache zero).
- Control vs. secrets boundary: recover() restores CONTROL on-chain
  instantly; decrypting EXISTING vaulted secrets needs the TEE to
  re-wrap the K3 KEK (on-chain-coordinated, TEE-executed).
- The dev escape hatch (resetMaster) vs. real guardian recovery.
- Why spend caps are enforced OFF-chain (per the user's point): spend
  is a HIGH-FREQUENCY hot path, so the limits are on-chain (policy) but
  the usage accumulator is off-chain (meter) — the inverse of recovery
  (rare + high-stakes → on-chain execution worth the gas). States the
  principle + the future on-chain-accumulator option gated on frequency.

All 15 cross-links verified to resolve; wiki lint clean (no frontmatter,
no H1). Defers to arch.md §11 for the canonical spec.
---
 docs/wiki/Home.md                          |  1 +
 docs/wiki/master-recovery-and-guardians.md | 82 ++++++++++++++++++++++
 2 files changed, 83 insertions(+)
 create mode 100644 docs/wiki/master-recovery-and-guardians.md

diff --git a/docs/wiki/Home.md b/docs/wiki/Home.md
index f10364fa..70322e16 100644
--- a/docs/wiki/Home.md
+++ b/docs/wiki/Home.md
@@ -22,6 +22,7 @@ Every spec and every service on top of AgentKeys preserves these four invariants
 - **[Blockchain TEE Architecture](blockchain-tee-architecture)** — chain + TEE + clients; the four rules in §6
 - **[Session Token](session-token)** — 30-day JWT bearer; issuance, storage, revocation
 - **[Key Security](key-security)** — TEE keys, master session key, storage tiers, threat model
+- **[Master Recovery and Guardians](master-recovery-and-guardians)** — M-of-N guardian social recovery executed on-chain (`P256Account.recover`); control vs. secrets; why spend caps stay off-chain
 - **[Open-Source Frontend Security](open-source-frontend-security)** — why the keyless web frontend is safe to open-source; malicious-clone + magic-link-phishing analysis (keys never touch the browser)
 - **[Data Classification](data-classification)** — data classes, where each lives, retention policy
 - **[Threat Model: Key Custody](https://github.com/litentry/agentKeys/blob/main/docs/spec/threat-model-key-custody.md)** *(spec)* — why nothing sensitive lives on chain or persistently in TEE; off-chain ciphertext + forward-secret epoch rotation (Stage 8)
diff --git a/docs/wiki/master-recovery-and-guardians.md b/docs/wiki/master-recovery-and-guardians.md
new file mode 100644
index 00000000..3f893b64
--- /dev/null
+++ b/docs/wiki/master-recovery-and-guardians.md
@@ -0,0 +1,82 @@
+How an operator recovers control of their AgentKeys account after losing a master
+device — without a seed phrase, an anchor wallet, or any third party. Recovery is
+**M-of-N guardian social recovery**, and the part most people get wrong about it
+here: the recovery is **executed on-chain by a smart contract**, deterministically,
+not merely *audited* on-chain. The chain is the executor, not the bookkeeper.
+
+> **Scope:** the operator-facing recovery model + its on-chain enforcement, what it
+> does and does NOT restore (control vs. secrets), the dev escape hatch, and why a
+> related hot-path control — spend caps — is deliberately enforced OFF-chain. Deep
+> spec: [`arch.md` §11](../arch.md) and the ERC-4337 master plan
+> [`docs/plan/chain/erc4337-master-account.md`](../plan/chain/erc4337-master-account.md).
+
+## What recovery is (and is not)
+
+Your master is an ERC-4337 smart-contract account ([`P256Account`](../../crates/agentkeys-chain/src/P256Account.sol)) controlled by one or more **passkeys** (P-256, Secure-Enclave / StrongBox-sealed). Recovery replaces a lost passkey with a fresh one, authorized by a quorum of **guardians** — passkeys you registered on surviving devices (your phone, a tablet, a partner's device, an offline recovery-only key).
+
+- **No seed phrase, no anchor wallet.** The devices themselves are the quorum. There is nothing to write down or lose.
+- **No third-party recovery.** No friends-as-custodians service, no email reset, no recovery code. The only thing that proves "I am this operator" is **biometric presence (K11) on a surviving guardian** that is still registered on chain.
+- **K11 is the gate.** A stolen device key (K10) alone cannot trigger recovery — that would let a single compromised machine lock you out (DoS). Recovery requires a real WebAuthn user-presence assertion from each guardian.
+
+## The chain executes recovery — it does not just record it
+
+`P256Account.recover(newCredIdHash, newPubX, newPubY, newRpIdHash, assertions[])` ([P256Account.sol](../../crates/agentkeys-chain/src/P256Account.sol)) is the whole authority. It runs **in the contract**, with no broker, relayer, or off-chain party deciding anything:
+
+- **Each guardian's WebAuthn assertion is P-256 verified in-contract** (`IK11Verifier.verifyAssertion`, the pure-Solidity [`P256Verifier`](../../crates/agentkeys-chain/src/P256Verifier.sol) — no precompile, no trusted oracle).
+- **The contract enforces every rule:** `recoveryThreshold` (0 = recovery disabled, the safe default); a replay-bound challenge `keccak(OP_RECOVER, newSigner, recoveryNonce, chainId, address(this))`; guardian de-duplication (the same credId **or** the same physical pubkey is rejected, so one guardian can't satisfy an M≥2 quorum); and `validSignatures ≥ threshold`.
+- **It is atomic and final:** `signerGeneration += 1` invalidates *every* prior signer instantly (the lost passkey is dead the moment recovery lands), then installs the new passkey as the sole active signer.
+- **It is permissionless to submit.** A relayer can land the transaction for a locked-out operator; authority comes purely from the guardian signatures, never from who pays the gas.
+
+This is the distinction that matters: scope grants, agent binding, master registration, K3 epoch advance, and recovery are all **enforced by contracts** here, not decided off-chain and mirrored. A compromised broker cannot grant scope, mint a master, or recover an account. The only purely-audit contract is [`CredentialAudit`](../../crates/agentkeys-chain/src/CredentialAudit.sol) (a Merkle-root anchor) — which is correctly audit-only, since you do not "execute" an audit.
+
+## Setting it up (operator)
+
+1. **Register guardians** — add a passkey from each surviving device as a guardian (`addGuardian`, gated to the account itself / EntryPoint, so only you can add one).
+2. **Set the threshold** — `recoveryThreshold` is per-operator (default 1; the onboarding flow prompts you to bump to 2 when you add a third device). Threshold M with N guardians = an **M-of-N** quorum.
+
+Operator ceremonies (K11-gated):
+[`heima-set-recovery-threshold.sh`](../../harness/scripts/heima-set-recovery-threshold.sh) sets the quorum;
+[`heima-recovery.sh`](../../harness/scripts/heima-recovery.sh) drives the M-of-N master-device revoke + rotate;
+[`heima-register-spare-master.sh`](../../harness/scripts/heima-register-spare-master.sh) registers a third device to exercise the quorum end-to-end.
+
+## The recovery timeline (you lose your laptop)
+
+1. **t=0** — you notice the laptop (master device A) is lost/stolen, and pick up a surviving device B (phone) that holds its own K10 + a guardian passkey K11.
+2. **t≈60s** — in the app you choose *"Lost device — revoke & rotate"*; it builds the revoke + new-signer payload and asks for the K11 biometric on device B.
+3. **t≈90s** — if the threshold is ≥ 2, the app collects the additional guardian assertion(s) (a desktop at home, a tablet, a co-approver) until signatures ≥ threshold.
+4. **t≈2m** — the quorum-signed `recover` / `revoke_device` lands on chain; the contract verifies the assertions, swaps the signer, and bumps the generation. The chain emits the event.
+5. **t≈2m+1s** — the broker receives the chain event over SSE, drops every cap tied to the revoked key, and rejects new cap-mints from it; daemons under your `operator_omni` zero their credential cache. Within ~60s more (the 5-minute `cred_cache_ttl` ceiling, but typically immediate), an attacker holding the old device can no longer perform **any** authorized operation.
+
+## Control vs. secrets — what recovery does and does NOT restore
+
+This is the one boundary to internalize:
+
+- **Recovery restores CONTROL — on-chain, instantly.** After `recover()`, your new passkey can sign UserOps, mint caps, set scope, bind agents — everything the master can do.
+- **Recovery does NOT, by itself, decrypt your EXISTING vaulted secrets.** Vaulted credentials are AES-256-GCM under a per-operator KEK (K3) **derived inside the signer / TEE** — a decryption key can never live on chain (it would be public). So reading secrets that were vaulted *before* recovery requires the TEE to re-derive and re-wrap the KEK under the new master: a **K3-epoch rotation**, *coordinated* on chain ([`K3EpochCounter`](../../crates/agentkeys-chain/src/K3EpochCounter.sol)'s M-of-N) but *executed* in the enclave. Control is on-chain and immediate; the secret re-wrap is a separate TEE ceremony. (See [`./key-security.md`](./key-security.md) and [`./blockchain-tee-architecture.md`](./blockchain-tee-architecture.md).)
+
+**If you lose ALL devices/guardians at once** (whole-household theft, fire) you have lost your actor tree — the deliberate trade-off for having no third-party recovery surface to attack. Mitigate by diversifying: a phone in your pocket, a laptop at home, and a biometric-locked **offline recovery-only guardian** kept in a safe. High-stakes operators can additionally pre-position a TEE-attested emergency override that publishes on chain (designed, not enabled by default).
+
+## The dev escape hatch (NOT recovery): reset master
+
+[`heima-reset-master.sh`](../../scripts/heima-reset-master.sh) (`SidecarRegistry.resetMaster`, behind the daemon's *reset master* button) is a **deployer-gated dev escape** — not Touch ID, not a guardian quorum. It exists because first-master registration makes `operatorMasterWallet` immutable, so a lost passkey with **no guardians configured** is otherwise unrecoverable. It also tears down the whole fleet (declines pending pairings, revokes every agent device, clears local state). In production this would be gated on guardian recovery (`P256Account.recover`) instead of a deployer key. See the *reset master* note in [`../user-manual.md`](../user-manual.md).
+
+## Why spend caps are enforced OFF-chain (the design contrast)
+
+Recovery is the textbook case for **on-chain execution**: it is **rare** and **high-stakes**, so paying a transaction's gas + latency to make it deterministic and trustless is obviously worth it. **Spend caps are the opposite shape, and the architecture treats them differently on purpose.**
+
+- The **limits** are on-chain, in the scope itself ([`AgentKeysScope`](../../crates/agentkeys-chain/src/AgentKeysScope.sol): `maxPerCall`, `maxPerPeriod`, `maxTotal`, `periodSeconds`). The policy — *what an agent is allowed to spend* — is tamper-proof and master-set, exactly like its service scope.
+- The **accounting** — *how much has it spent this period* — is enforced **off-chain** (broker / worker), reading the on-chain limit. The scope contract intentionally has **no `recordSpend` / usage accumulator**.
+
+The reason is **frequency**. Spend enforcement sits on the **hot path**: it is consulted on effectively every cap-mint / operation, potentially many times per second across a fleet. An on-chain per-spend accumulator would charge a transaction's gas and add block-time latency to **every** agent action — prohibitive for a high-frequency meter. So the system follows a clean principle:
+
+> **Put EXECUTION on-chain for the RARE, high-stakes events (recovery, scope grant, master register, agent binding, K3 epoch). Keep the HOT-PATH meter (spend accounting) off-chain, reading the on-chain policy.** The chain is the authority on *what is allowed*; the fast off-chain layer is the authority on *how much has been used*, and it can only ever be **more** restrictive than the on-chain cap, never less.
+
+A future enhancement could add an on-chain spend ledger (a `recordSpend` + rolling-window accumulator that reverts over-cap) to make spend deterministic the way recovery already is — but only where the per-operation gas + latency is acceptable: high-value, low-frequency settlements, or a periodic batch reconciliation, rather than the per-call hot path. The default stays off-chain precisely because spending is high-frequency. (Related: [`./policy-scope-namespace.md`](./policy-scope-namespace.md) for how scope + limits are authored and verified.)
+
+## See also
+
+- [`arch.md` §11 — Recovery: M-of-N device quorum](../arch.md) (the canonical spec) and §4 (the K-key inventory: K10 device key, K11 passkey, K3 KEK).
+- [`./key-security.md`](./key-security.md) — the two-tier secret-storage model and why the KEK is TEE-held.
+- [`./blockchain-tee-architecture.md`](./blockchain-tee-architecture.md) — how the chain and the TEE divide responsibility.
+- [`./policy-scope-namespace.md`](./policy-scope-namespace.md) — scope, services, and spend limits.
+- Plan: [`docs/plan/chain/erc4337-master-account.md`](../plan/chain/erc4337-master-account.md) (the P256Account + guardian-recovery design, #164 E5).

From c93661f1ee4ed86fee91d1cda3ea2964ae85949e Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 17:37:14 +0800
Subject: [PATCH 09/17] =?UTF-8?q?fix:=20email-init=20demo=20is=20idempoten?=
 =?UTF-8?q?t=20=E2=80=94=20'Already=20initialized'=20is=20success,=20not?=
 =?UTF-8?q?=20'init=20died=20early'?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

agentkeys-init-email-demo.sh fires `agentkeys init --email` in the
background and its poll loop treats ANY early exit as failure. But
cmd_init_with_force returns Ok(existing) IMMEDIATELY (exit 0, no email
sent) when a usable session already exists, printing 'Already
initialized as 0x… (run --force)'. So a valid existing session was
mis-reported as 'init died early (likely broker rejection)' and the
script polled S3 forever for an email that was never sent (v2-stage1
step 6).

The fast-fail block now reaps the exit code and, when init exited 0 with
'Already initialized' in its log, reuses the on-disk session and exits 0
— the demo's goal (a live session for this --session-id) is already met.
A real non-zero exit / broker rejection still dies as before.

Trigger: a session aged past do_step_6's 1-hour reuse window but still
within the broker TTL → do_step_6 re-inits → init short-circuits. Re-run
'bash harness/v2-demo.sh --from 1.6' (now green). Runbook Q&A added.
bash -n clean; grep verified against the exact CLI message.
---
 docs/operator-runbook-harness.md     |  3 +++
 scripts/agentkeys-init-email-demo.sh | 16 ++++++++++++++++
 2 files changed, 19 insertions(+)

diff --git a/docs/operator-runbook-harness.md b/docs/operator-runbook-harness.md
index d08dbb82..62e00b16 100644
--- a/docs/operator-runbook-harness.md
+++ b/docs/operator-runbook-harness.md
@@ -332,6 +332,9 @@ The stored K11 enrollment points to a platform passkey that isn't in the browser
 **Q. A chain step dies `error code -32603: invalid chain id` or `SSL_ERROR_SYSCALL` to the RPC?**
 A **transient RPC blip** (the Heima RPC briefly dropped the TLS connection). The chain helpers used to derive the chain id from a live `eth_chainId` curl, so a blip resolved it to `0` and `cast send --chain-id 0` was rejected as "invalid chain id." Fixed: every `heima-*.sh` now takes the chain id from the **pinned chain profile** (`agentkeys chain show <chain>` → `.chain_id`, offline) and **fails loud** if it isn't a positive integer — so a blip can no longer corrupt it. If the blip hit the actual `cast send`/`cast call` instead, just **re-run the step** (`bash harness/v2-demo.sh --from <P.S>`) — every step is idempotent.
 
+**Q. Step 6 (session init) fails `init died early` with `Already initialized as 0x…`?**
+A re-run with an existing, still-usable session. `agentkeys init` returns success **immediately** (no magic-link email) when a usable session already exists — but the email demo launched it in the background expecting it to block until the email arrives, so the instant exit looked like a failure. **Fixed: the email demo now treats `Already initialized` (exit 0) as success and reuses the session** — just re-run `bash harness/v2-demo.sh --from 1.6`. (The trigger was a session aged past step 6's 1-hour reuse window but still within the broker TTL. To force a *fresh* email session instead of reusing, run `agentkeys init --force --email …` by hand.)
+
 **Q. `sandbox-agent-isolation.sh` says `SKIP: cred coordinates not staged`?**
 The sandbox's `~/.agentkeys/harness-env` predates the #216 staging (or the wire phase didn't
 finish). Re-run the wire on the operator host — `bash harness/v2-demo.sh --from 5` (or
diff --git a/scripts/agentkeys-init-email-demo.sh b/scripts/agentkeys-init-email-demo.sh
index 0823bb05..cb87dda1 100755
--- a/scripts/agentkeys-init-email-demo.sh
+++ b/scripts/agentkeys-init-email-demo.sh
@@ -261,6 +261,22 @@ for attempt in $(seq 1 "$POLL_MAX_ATTEMPTS"); do
   # dump the init log and die immediately instead of waiting the full
   # 2-min poll budget for an email that will never come.
   if ! kill -0 "$init_pid" 2>/dev/null; then
+    # init exited before any email arrived. Distinguish the IDEMPOTENT re-run
+    # from a real failure: an already-initialized, still-usable session makes
+    # `agentkeys init` return IMMEDIATELY with success (exit 0, NO email sent),
+    # printing "Already initialized as 0x… (run --force)" — cmd_init_with_force
+    # returns Ok(existing) when load_session() yields a usable session. The
+    # session for this --session-id is already on disk + usable, so the demo's
+    # goal (a live session) is already met. Succeed instead of mis-reporting
+    # "init died early" (the failure the operator hit when a session aged past
+    # do_step_6's 1h reuse window but was still within the broker TTL).
+    init_rc=0; wait "$init_pid" || init_rc=$?
+    if [ "$init_rc" = 0 ] && grep -q "Already initialized" "$init_log"; then
+      log "Session already initialized + usable for '$SESSION_ID' — reusing it (no email round-trip needed):"
+      cat "$init_log"
+      log "DONE — idempotent re-run; existing session reused for $recipient"
+      exit 0
+    fi
     warn "agentkeys init exited before magic link arrived in S3 — dumping log:"
     cat "$init_log" >&2 || true
     die "init died early (likely broker rejection); see log above"

From ba65f9ec1001d982194c4ac4dd95d5cd480526b9 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 21:46:45 +0800
Subject: [PATCH 10/17] =?UTF-8?q?feat:=20#109=20two-tier=20audit=20?=
 =?UTF-8?q?=E2=80=94=20tier-A=20appendV2=20anchor=20relay=20+=20SSE=20feed?=
 =?UTF-8?q?=20+=20ring=20buffer=20+=20S3=20archive=20+=20daemon=20bridge?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Tier 2 (default-on, AGENTKEYS_AUDIT_BATCH_SECONDS=120): the audit worker
anchors each per-operator V2 batch autonomously — AuditRootAnchor (90)
envelope, hash committed via ungated CredentialAudit.appendV2 signed by
the relay EOA (appendRootV2's master gate is unreachable for a hosted
relay: MasterMustBeAccount + Touch-ID masters can't sign on a timer).
Retry x3 exp backoff; persistent failure re-queues the batch + emits
AuditBatchFailed (91). Zero contract change (open-enum §15.3b).

Tier 1: per-actor ring buffers (1000), GET /v1/audit/stream SSE with
backfill, GET /v1/audit/anchors/:op (per-entry Merkle proofs — the
tamper check), GET /v1/audit/relay-info; AuditFeedEvent shape owned by
agentkeys-types (#203 one-owner). S3 cold archive (env-gated) restores
rings on boot + backs get_envelope across restarts.

Daemon: worker-feed SSE bridge folds worker-side events into the
existing ApiAuditEvent web feed (dedup by envelope hash) and flips
/v1/anchor/status to REAL on anchor events.

Deploy: setup-broker-host.sh generates the relay key (0600, preserved)
+ rewrites worker-audit.env (cadence/chain/bucket) + SSE-safe nginx
location; provision-audit-archive.sh (bucket + instance-role grant)
wired into setup-cloud.sh step 13; heima-fund-audit-relay.sh wired into
setup-heima.sh step 14; AUDIT_BUCKET in both env files + CI
materializer. legacy_tx moved bundler→core (shared EOA signer).
---
 .github/workflows/harness-ci.yml              |   1 +
 Cargo.lock                                    |  20 +
 crates/agentkeys-bundler/src/lib.rs           |   5 +-
 .../examples/export_audit_vectors.rs          |  31 +-
 crates/agentkeys-core/src/audit/bodies.rs     |  95 ++++
 crates/agentkeys-core/src/audit/mod.rs        |  18 +-
 crates/agentkeys-core/src/audit/op_kind.rs    |  15 +-
 .../src/legacy_tx.rs                          |  10 +-
 crates/agentkeys-core/src/lib.rs              |   1 +
 crates/agentkeys-daemon/Cargo.toml            |   2 +-
 crates/agentkeys-daemon/src/main.rs           |   4 +
 crates/agentkeys-daemon/src/ui_bridge.rs      | 420 +++++++++++++-
 crates/agentkeys-types/src/audit_feed.rs      |  42 ++
 crates/agentkeys-types/src/lib.rs             |   1 +
 crates/agentkeys-worker-audit/Cargo.toml      |   7 +
 crates/agentkeys-worker-audit/src/anchor.rs   | 536 ++++++++++++++++++
 crates/agentkeys-worker-audit/src/archive.rs  | 260 +++++++++
 crates/agentkeys-worker-audit/src/handlers.rs | 213 ++++++-
 crates/agentkeys-worker-audit/src/lib.rs      |  19 +-
 crates/agentkeys-worker-audit/src/main.rs     | 136 +++--
 crates/agentkeys-worker-audit/src/merkle.rs   |  12 +
 crates/agentkeys-worker-audit/src/service.rs  | 265 +++++++++
 crates/agentkeys-worker-audit/src/state.rs    | 341 ++++++++++-
 docs/arch.md                                  |  55 +-
 docs/plan/issue-109-two-tier-audit.md         |  71 +++
 docs/user-manual.md                           |  35 ++
 scripts/heima-fund-audit-relay.sh             |  87 +++
 scripts/operator-workstation.env              |   5 +
 scripts/operator-workstation.test.env         |   1 +
 scripts/provision-audit-archive.sh            | 183 ++++++
 scripts/setup-broker-host.sh                  |  63 +-
 scripts/setup-cloud.sh                        |   3 +-
 scripts/setup-heima.sh                        |   4 +
 33 files changed, 2858 insertions(+), 103 deletions(-)
 rename crates/{agentkeys-bundler => agentkeys-core}/src/legacy_tx.rs (92%)
 create mode 100644 crates/agentkeys-types/src/audit_feed.rs
 create mode 100644 crates/agentkeys-worker-audit/src/anchor.rs
 create mode 100644 crates/agentkeys-worker-audit/src/archive.rs
 create mode 100644 crates/agentkeys-worker-audit/src/service.rs
 create mode 100644 docs/plan/issue-109-two-tier-audit.md
 create mode 100755 scripts/heima-fund-audit-relay.sh
 create mode 100755 scripts/provision-audit-archive.sh

diff --git a/.github/workflows/harness-ci.yml b/.github/workflows/harness-ci.yml
index 5e21465a..6ce9b719 100644
--- a/.github/workflows/harness-ci.yml
+++ b/.github/workflows/harness-ci.yml
@@ -850,6 +850,7 @@ jobs:
           # config bucket/role aren't provisioned yet (setup-cloud.sh --ci).
           CONFIG_BUCKET=agentkeys-config-test-$ACCOUNT_ID
           CONFIG_ROLE_ARN=arn:aws:iam::$ACCOUNT_ID:role/agentkeys-config-role-test
+          AUDIT_BUCKET=agentkeys-audit-test-$ACCOUNT_ID
           AGENTKEYS_SIGNER_URL=https://signer-test.$TEST_BROKER_ZONE
           # Worker URLs derived from TEST_BROKER_ZONE → byte-for-byte match
           # setup-broker-host.sh --test's derive_companion() output.
diff --git a/Cargo.lock b/Cargo.lock
index bfd3dbe9..c0ea731b 100644
--- a/Cargo.lock
+++ b/Cargo.lock
@@ -389,18 +389,23 @@ name = "agentkeys-worker-audit"
 version = "0.1.0"
 dependencies = [
  "agentkeys-core",
+ "agentkeys-types",
  "anyhow",
+ "aws-config",
+ "aws-sdk-s3",
  "axum",
  "ciborium",
  "clap",
  "hex",
  "http-body-util",
+ "k256",
  "reqwest",
  "serde",
  "serde_json",
  "sha3",
  "thiserror 2.0.18",
  "tokio",
+ "tokio-stream",
  "tower 0.4.13",
  "tracing",
  "tracing-subscriber",
@@ -3909,12 +3914,14 @@ dependencies = [
  "tokio",
  "tokio-native-tls",
  "tokio-rustls 0.26.4",
+ "tokio-util",
  "tower 0.5.3",
  "tower-http 0.6.8",
  "tower-service",
  "url",
  "wasm-bindgen",
  "wasm-bindgen-futures",
+ "wasm-streams",
  "web-sys",
  "webpki-roots 1.0.7",
 ]
@@ -5244,6 +5251,19 @@ dependencies = [
  "wasmparser",
 ]
 
+[[package]]
+name = "wasm-streams"
+version = "0.4.2"
+source = "registry+https://github.com/rust-lang/crates.io-index"
+checksum = "15053d8d85c7eccdbefef60f06769760a563c7f0a9d6902a13d35c7800b0ad65"
+dependencies = [
+ "futures-util",
+ "js-sys",
+ "wasm-bindgen",
+ "wasm-bindgen-futures",
+ "web-sys",
+]
+
 [[package]]
 name = "wasmparser"
 version = "0.244.0"
diff --git a/crates/agentkeys-bundler/src/lib.rs b/crates/agentkeys-bundler/src/lib.rs
index b47f64e2..c7600eb7 100644
--- a/crates/agentkeys-bundler/src/lib.rs
+++ b/crates/agentkeys-bundler/src/lib.rs
@@ -16,5 +16,8 @@
 //! all reads here are raw JSON). This bundler is PRIVATE: bound to loopback,
 //! fed only by the broker — not a public alt-mempool.
 
-pub mod legacy_tx;
+// `legacy_tx` moved to core (#109) — both the bundler and the audit worker's
+// tier-A anchor relay sign legacy EOA txs; one implementation, re-exported
+// here so existing `agentkeys_bundler::legacy_tx` paths keep working.
+pub use agentkeys_core::legacy_tx;
 pub mod server;
diff --git a/crates/agentkeys-core/examples/export_audit_vectors.rs b/crates/agentkeys-core/examples/export_audit_vectors.rs
index 7eb25be5..1b94ecf6 100644
--- a/crates/agentkeys-core/examples/export_audit_vectors.rs
+++ b/crates/agentkeys-core/examples/export_audit_vectors.rs
@@ -32,9 +32,10 @@
 //! ```
 
 use agentkeys_core::audit::{
-    AuditEnvelope, AuditOpKind, AuditResult, CredFetchBody, CredStoreBody, DeviceAddBody,
-    K3EpochAdvanceBody, MemoryPutBody, PaymentDirectBody, PaymentEscrowRedeemBody, ScopeGrantBody,
-    SignEip191Body, SignEip712Body, ENVELOPE_VERSION,
+    AuditBatchFailedBody, AuditEnvelope, AuditOpKind, AuditResult, AuditRootAnchorBody,
+    CredFetchBody, CredStoreBody, DeviceAddBody, K3EpochAdvanceBody, MemoryPutBody,
+    PaymentDirectBody, PaymentEscrowRedeemBody, ScopeGrantBody, SignEip191Body, SignEip712Body,
+    ENVELOPE_VERSION,
 };
 use serde::Serialize;
 use serde_json::{json, Value};
@@ -226,6 +227,30 @@ fn main() {
             },
             None,
         ),
+        vector(
+            AuditOpKind::AuditRootAnchor,
+            AuditRootAnchorBody {
+                merkle_root: hex0x(&[0x90; 32]),
+                op_kind_bitmap: hex0x(&{
+                    let mut bm = [0u8; 32];
+                    bm[31] = 0x03; // op_kinds 0+1 present
+                    bm
+                }),
+                entry_count: 7,
+                relay_address: "0x4444444444444444444444444444444444444444".into(),
+            },
+            None,
+        ),
+        vector(
+            AuditOpKind::AuditBatchFailed,
+            AuditBatchFailedBody {
+                merkle_root: hex0x(&[0x91; 32]),
+                entry_count: 3,
+                attempts: 3,
+                last_error: "eth_sendRawTransaction HTTP 503".into(),
+            },
+            None,
+        ),
         unknown_vector(250),
     ];
 
diff --git a/crates/agentkeys-core/src/audit/bodies.rs b/crates/agentkeys-core/src/audit/bodies.rs
index b86463a1..87ca96fb 100644
--- a/crates/agentkeys-core/src/audit/bodies.rs
+++ b/crates/agentkeys-core/src/audit/bodies.rs
@@ -236,6 +236,45 @@ pub struct ConfigTeardownBody {
     pub actor_target: String,
 }
 
+// ── 90..99 — audit-service meta family (#109 tier-A hosted anchor) ─────
+//
+// The hosted relay anchors each per-operator Merkle batch by emitting an
+// `AuditRootAnchor` envelope and committing ITS hash on-chain via the
+// ungated `CredentialAudit.appendV2(operatorOmni, relayActorOmni, 90,
+// envelopeHash)` — one tx per batch, real operator omni in the indexed
+// topic, no contract change. Genuine anchors are distinguished from
+// third-party spam by `tx.from == relay_address` (published at the
+// worker's `GET /v1/audit/relay-info`). The master-gated `appendRootV2`
+// remains the sovereign tier-B/C route.
+
+#[derive(Debug, Clone, Serialize, Deserialize, PartialEq, Eq)]
+pub struct AuditRootAnchorBody {
+    /// 32-byte hex — Merkle root over the batch's envelope-hash leaves
+    /// (domain-separated scheme per `CredentialAudit.verifyEntryInRoot`).
+    pub merkle_root: String,
+    /// 32-byte hex — bit N set when the batch contains op_kind N (the
+    /// `appendRootV2` bitmap convention, carried in-body here).
+    pub op_kind_bitmap: String,
+    pub entry_count: u64,
+    /// 20-byte hex — the tier-A relay EOA that signed the anchor tx.
+    /// Verifiers match it against the anchor tx's `from`.
+    pub relay_address: String,
+}
+
+#[derive(Debug, Clone, Serialize, Deserialize, PartialEq, Eq)]
+pub struct AuditBatchFailedBody {
+    /// 32-byte hex — the root of the batch that failed to anchor. Its
+    /// entries are re-queued, so a later `AuditRootAnchor` (with a fresh
+    /// root superset) eventually covers them.
+    pub merkle_root: String,
+    pub entry_count: u64,
+    /// How many submission attempts were made before giving up.
+    pub attempts: u8,
+    /// Last submission error, truncated — diagnostic only, not consumed
+    /// programmatically.
+    pub last_error: String,
+}
+
 #[cfg(test)]
 mod tests {
     use super::*;
@@ -448,6 +487,62 @@ mod tests {
         }
     }
 
+    /// §15.3b step-5 worker test for the audit-service meta family (#109):
+    /// canonical CBOR roundtrip + typed decode for the tier-A anchor and
+    /// the batch-failed alert shapes.
+    #[test]
+    fn audit_meta_family_cbor_roundtrip_and_typed_decode() {
+        use crate::audit::{envelope_for, AuditEnvelope, AuditOpKind, AuditResult, TypedAuditBody};
+
+        let anchor = AuditRootAnchorBody {
+            merkle_root: format!("0x{}", "aa".repeat(32)),
+            op_kind_bitmap: format!("0x{}", "00".repeat(31)) + "03",
+            entry_count: 7,
+            relay_address: format!("0x{}", "ee".repeat(20)),
+        };
+        let env = envelope_for(
+            [0x44; 32], // relay's derived actor omni
+            [0x22; 32], // the REAL operator whose batch was anchored
+            AuditOpKind::AuditRootAnchor,
+            anchor.clone(),
+            AuditResult::Success,
+            None,
+            None,
+        )
+        .unwrap();
+        let decoded =
+            AuditEnvelope::from_canonical_cbor(&env.to_canonical_cbor().unwrap()).unwrap();
+        assert_eq!(AuditOpKind::AuditRootAnchor.label(), "audit.root_anchor");
+        match decoded.typed_body().unwrap() {
+            TypedAuditBody::AuditRootAnchor(b) => assert_eq!(b, anchor),
+            other => panic!("unexpected typed body: {other:?}"),
+        }
+
+        let failed = AuditBatchFailedBody {
+            merkle_root: format!("0x{}", "bb".repeat(32)),
+            entry_count: 3,
+            attempts: 3,
+            last_error: "eth_sendRawTransaction HTTP 503".into(),
+        };
+        let env = envelope_for(
+            [0x44; 32],
+            [0x22; 32],
+            AuditOpKind::AuditBatchFailed,
+            failed.clone(),
+            AuditResult::Failure,
+            None,
+            None,
+        )
+        .unwrap();
+        let decoded =
+            AuditEnvelope::from_canonical_cbor(&env.to_canonical_cbor().unwrap()).unwrap();
+        assert_eq!(AuditOpKind::AuditBatchFailed.label(), "audit.batch_failed");
+        match decoded.typed_body().unwrap() {
+            TypedAuditBody::AuditBatchFailed(b) => assert_eq!(b, failed),
+            other => panic!("unexpected typed body: {other:?}"),
+        }
+    }
+
     #[test]
     fn payment_direct_body_uses_ref_as_field_name() {
         // Sanity check: `ref` is a Rust reserved word, so the field is
diff --git a/crates/agentkeys-core/src/audit/mod.rs b/crates/agentkeys-core/src/audit/mod.rs
index 31df0f26..7240321c 100644
--- a/crates/agentkeys-core/src/audit/mod.rs
+++ b/crates/agentkeys-core/src/audit/mod.rs
@@ -54,11 +54,11 @@ use sha3::{Digest, Keccak256};
 use thiserror::Error;
 
 pub use bodies::{
-    ConfigGetBody, ConfigPutBody, ConfigTeardownBody, CredFetchBody, CredStoreBody,
-    CredTeardownBody, DeviceAddBody, DeviceRevokeBody, EmailReceiveBody, EmailSendBody,
-    K10RotateBody, K3EpochAdvanceBody, MemoryGetBody, MemoryPutBody, MemoryTeardownBody,
-    PaymentDirectBody, PaymentEscrowRedeemBody, ScopeGrantBody, ScopeRevokeBody, SignEip191Body,
-    SignEip712Body,
+    AuditBatchFailedBody, AuditRootAnchorBody, ConfigGetBody, ConfigPutBody, ConfigTeardownBody,
+    CredFetchBody, CredStoreBody, CredTeardownBody, DeviceAddBody, DeviceRevokeBody,
+    EmailReceiveBody, EmailSendBody, K10RotateBody, K3EpochAdvanceBody, MemoryGetBody,
+    MemoryPutBody, MemoryTeardownBody, PaymentDirectBody, PaymentEscrowRedeemBody, ScopeGrantBody,
+    ScopeRevokeBody, SignEip191Body, SignEip712Body,
 };
 pub use op_kind::AuditOpKind;
 
@@ -238,6 +238,8 @@ pub enum TypedAuditBody {
     ConfigPut(ConfigPutBody),
     ConfigGet(ConfigGetBody),
     ConfigTeardown(ConfigTeardownBody),
+    AuditRootAnchor(AuditRootAnchorBody),
+    AuditBatchFailed(AuditBatchFailedBody),
 }
 
 impl TypedAuditBody {
@@ -277,6 +279,12 @@ impl TypedAuditBody {
             AuditOpKind::ConfigTeardown => {
                 Self::ConfigTeardown(serde_json::from_value(value).ok()?)
             }
+            AuditOpKind::AuditRootAnchor => {
+                Self::AuditRootAnchor(serde_json::from_value(value).ok()?)
+            }
+            AuditOpKind::AuditBatchFailed => {
+                Self::AuditBatchFailed(serde_json::from_value(value).ok()?)
+            }
         })
     }
 }
diff --git a/crates/agentkeys-core/src/audit/op_kind.rs b/crates/agentkeys-core/src/audit/op_kind.rs
index 259b2c60..9e04eb4a 100644
--- a/crates/agentkeys-core/src/audit/op_kind.rs
+++ b/crates/agentkeys-core/src/audit/op_kind.rs
@@ -15,7 +15,8 @@
 //! - 60-69 email family (EmailSend=60, EmailReceive=61; 62-69 reserved)
 //! - 70-79 K3 family (K3EpochAdvance=70; 71-79 reserved)
 //! - 80-89 config family (ConfigPut=80, ConfigGet=81, ConfigTeardown=82; 83-89 reserved)
-//! - 90-255 reserved for future families
+//! - 90-99 audit-service meta family (AuditRootAnchor=90, AuditBatchFailed=91; 92-99 reserved) — issue #109
+//! - 100-255 reserved for future families
 
 /// Canonical op_kind enum. The byte value MUST match the row in arch.md
 /// §15.3a. The enum is `repr(u8)` so `as u8` gives the canonical byte.
@@ -47,6 +48,8 @@ pub enum AuditOpKind {
     ConfigPut = 80,
     ConfigGet = 81,
     ConfigTeardown = 82,
+    AuditRootAnchor = 90,
+    AuditBatchFailed = 91,
 }
 
 impl AuditOpKind {
@@ -75,6 +78,8 @@ impl AuditOpKind {
             80 => Self::ConfigPut,
             81 => Self::ConfigGet,
             82 => Self::ConfigTeardown,
+            90 => Self::AuditRootAnchor,
+            91 => Self::AuditBatchFailed,
             _ => return None,
         })
     }
@@ -105,6 +110,8 @@ impl AuditOpKind {
             Self::ConfigPut => "config.put",
             Self::ConfigGet => "config.get",
             Self::ConfigTeardown => "config.teardown",
+            Self::AuditRootAnchor => "audit.root_anchor",
+            Self::AuditBatchFailed => "audit.batch_failed",
         }
     }
 }
@@ -140,6 +147,8 @@ mod tests {
             AuditOpKind::ConfigPut,
             AuditOpKind::ConfigGet,
             AuditOpKind::ConfigTeardown,
+            AuditOpKind::AuditRootAnchor,
+            AuditOpKind::AuditBatchFailed,
         ];
         for k in all {
             let byte = k as u8;
@@ -156,7 +165,7 @@ mod tests {
     #[test]
     fn unknown_bytes_return_none() {
         for byte in [
-            3u8, 9, 13, 19, 22, 32, 42, 53, 62, 71, 83, 89, 90, 200, 250, 255,
+            3u8, 9, 13, 19, 22, 32, 42, 53, 62, 71, 83, 89, 92, 99, 200, 250, 255,
         ] {
             assert_eq!(
                 AuditOpKind::from_u8(byte),
@@ -193,6 +202,8 @@ mod tests {
             AuditOpKind::ConfigPut as u8,
             AuditOpKind::ConfigGet as u8,
             AuditOpKind::ConfigTeardown as u8,
+            AuditOpKind::AuditRootAnchor as u8,
+            AuditOpKind::AuditBatchFailed as u8,
         ];
         let s: HashSet<_> = all.iter().copied().collect();
         assert_eq!(s.len(), all.len(), "duplicate byte assignment");
diff --git a/crates/agentkeys-bundler/src/legacy_tx.rs b/crates/agentkeys-core/src/legacy_tx.rs
similarity index 92%
rename from crates/agentkeys-bundler/src/legacy_tx.rs
rename to crates/agentkeys-core/src/legacy_tx.rs
index 0e20bfdb..eef01b38 100644
--- a/crates/agentkeys-bundler/src/legacy_tx.rs
+++ b/crates/agentkeys-core/src/legacy_tx.rs
@@ -1,12 +1,16 @@
 //! Minimal legacy (pre-EIP-1559) transaction RLP encoding + EIP-155 signing.
 //!
 //! Heima accepts legacy txs and its `eth_estimateGas` reverts on `handleOps`
-//! (see `docs/spec/heima-eth-gap.md`), so the bundler signs a fixed-gas-limit
-//! legacy tx and submits it via `eth_sendRawTransaction` — no alloy/ethers
+//! (see `docs/spec/heima-eth-gap.md`), so callers sign a fixed-gas-limit
+//! legacy tx and submit it via `eth_sendRawTransaction` — no alloy/ethers
 //! (their receipt/header parsers crash on Heima's mixHash-less responses).
 //! Hand-rolled RLP, golden-tested against the EIP-155 reference vector.
+//!
+//! Lives in core (moved from `agentkeys-bundler`, #109) so both EOA tx
+//! emitters — the bundler's `handleOps` carrier and the audit worker's
+//! tier-A anchor relay — share one implementation.
 
-use agentkeys_core::device_crypto::keccak256;
+use crate::device_crypto::keccak256;
 use anyhow::{anyhow, Result};
 use k256::ecdsa::SigningKey;
 
diff --git a/crates/agentkeys-core/src/lib.rs b/crates/agentkeys-core/src/lib.rs
index b3e544a7..3fb06a14 100644
--- a/crates/agentkeys-core/src/lib.rs
+++ b/crates/agentkeys-core/src/lib.rs
@@ -7,6 +7,7 @@ pub mod clear_signing;
 pub mod device_crypto;
 pub mod erc4337;
 pub mod init_flow;
+pub mod legacy_tx;
 pub mod mock_client;
 pub mod otp;
 pub mod payment;
diff --git a/crates/agentkeys-daemon/Cargo.toml b/crates/agentkeys-daemon/Cargo.toml
index df852c3e..c82908f2 100644
--- a/crates/agentkeys-daemon/Cargo.toml
+++ b/crates/agentkeys-daemon/Cargo.toml
@@ -36,7 +36,7 @@ tracing-subscriber = { version = "0.3", features = ["env-filter"] }
 ed25519-dalek = { version = "2", features = ["rand_core"] }
 rand = "0.8"
 base64 = "0.22"
-reqwest = { version = "0.12", features = ["json"] }
+reqwest = { version = "0.12", features = ["json", "stream"] }
 # Parse `Retry-After` HTTP-date form (RFC 7231) for the §10.2 pairing poll
 # backoff; tiny, zero-dep, the same parser hyper uses internally.
 httpdate = "1"
diff --git a/crates/agentkeys-daemon/src/main.rs b/crates/agentkeys-daemon/src/main.rs
index 9330bba0..d0594b1c 100644
--- a/crates/agentkeys-daemon/src/main.rs
+++ b/crates/agentkeys-daemon/src/main.rs
@@ -1217,6 +1217,10 @@ async fn run_ui_bridge_mode(args: Args) -> anyhow::Result<()> {
         // exactly one passkey re-auth.
         ui_bridge::rehydrate_master_session(&state).await;
     }
+    // #109: fold the audit worker's Tier-1 SSE feed into the web app's
+    // audit stream (worker-side ops + on-chain anchor status, live). No-op
+    // when --audit-worker-url is empty.
+    ui_bridge::spawn_audit_feed_bridge(state.clone());
     let app = ui_bridge::build_router(state, &args.ui_bridge_origin);
 
     let listener = tokio::net::TcpListener::bind(&args.ui_bridge_bind)
diff --git a/crates/agentkeys-daemon/src/ui_bridge.rs b/crates/agentkeys-daemon/src/ui_bridge.rs
index 33d6ece6..993ba16c 100644
--- a/crates/agentkeys-daemon/src/ui_bridge.rs
+++ b/crates/agentkeys-daemon/src/ui_bridge.rs
@@ -77,6 +77,11 @@ pub struct UiBridgeState {
     pub caps: RwLock<HashMap<String, Vec<ApiCapToken>>>,
     pub audit: RwLock<VecDeque<ApiAuditEvent>>,
     pub audit_tx: broadcast::Sender<ApiAuditEvent>,
+    /// #109: envelope hashes already surfaced in the feed — dedups the two
+    /// delivery paths for the same op (the local submit-flow push vs the
+    /// audit-worker SSE bridge), whichever lands first. (VecDeque = FIFO
+    /// eviction order, HashSet = O(1) membership.)
+    pub seen_envelope_hashes: RwLock<(VecDeque<String>, std::collections::HashSet<String>)>,
     pub workers: RwLock<HashMap<String, ApiWorker>>,
     pub anchor: RwLock<ApiAnchorStatus>,
     /// Master-actor memory entries, keyed by content_hash for idempotent
@@ -841,6 +846,7 @@ pub fn build_state(
         caps: RwLock::new(HashMap::new()),
         audit: RwLock::new(VecDeque::with_capacity(AUDIT_BUFFER_CAP)),
         audit_tx,
+        seen_envelope_hashes: RwLock::new((VecDeque::new(), std::collections::HashSet::new())),
         workers: RwLock::new(HashMap::new()),
         anchor: RwLock::new(ApiAnchorStatus::default()),
         master_memory: RwLock::new(HashMap::new()),
@@ -2370,11 +2376,7 @@ fn now_unix() -> u64 {
 fn now_ts_hms() -> String {
     // HH:MM:SS in UTC for audit event timestamps. Operator-facing only —
     // chain timestamps are independent.
-    let now = now_unix();
-    let h = (now / 3600) % 24;
-    let m = (now / 60) % 60;
-    let s = now % 60;
-    format!("{:02}:{:02}:{:02}", h, m, s)
+    ts_hms_from_unix(now_unix())
 }
 
 // ─── Read endpoints ────────────────────────────────────────────────────
@@ -6168,6 +6170,34 @@ async fn store_master_credential_inner(
 }
 
 async fn push_audit(state: &SharedUiBridgeState, evt: ApiAuditEvent) {
+    // #109 dedup: the same op can reach the feed twice — once from the
+    // local submit flow (which carries its envelope-hash receipts) and once
+    // from the audit-worker SSE bridge (one event per envelope). Whichever
+    // path delivers an envelope hash first wins; a later event whose hashes
+    // were ALL seen already is dropped.
+    if let Some(hashes) = &evt.audit_envelope_hashes {
+        if !hashes.is_empty() {
+            let mut seen = state.seen_envelope_hashes.write().await;
+            let any_new = hashes
+                .iter()
+                .any(|h| !seen.1.contains(&h.to_lowercase()));
+            if !any_new {
+                return;
+            }
+            const SEEN_CAP: usize = 4096;
+            for h in hashes {
+                let h = h.to_lowercase();
+                if seen.1.insert(h.clone()) {
+                    seen.0.push_back(h);
+                    if seen.0.len() > SEEN_CAP {
+                        if let Some(evicted) = seen.0.pop_front() {
+                            seen.1.remove(&evicted);
+                        }
+                    }
+                }
+            }
+        }
+    }
     let mut buf = state.audit.write().await;
     if buf.len() == AUDIT_BUFFER_CAP {
         buf.pop_front();
@@ -6179,6 +6209,240 @@ async fn push_audit(state: &SharedUiBridgeState, evt: ApiAuditEvent) {
     let _ = state.audit_tx.send(evt);
 }
 
+// ─── #109: audit-worker feed bridge ─────────────────────────────────────
+//
+// The daemon subscribes to the audit worker's Tier-1 SSE
+// (`GET /v1/audit/stream?operator=<session omni>`) and folds every event
+// into the EXISTING ApiAuditEvent feed the web app already streams — so
+// worker-side ops the daemon never sees locally (agent cred fetches,
+// memory reads, denials, the relay's on-chain anchors) appear live in the
+// parent UI. Anchor events additionally flip `state.anchor` to REAL (the
+// "Anchored ✓" badge was a synthesized placeholder before this).
+
+/// The operator omni the bridge filters on — same 3-source resolution the
+/// fleet reconstruction uses (registered master, live session, onboarding).
+async fn current_operator_omni(state: &SharedUiBridgeState) -> Option<String> {
+    let from_registered = state
+        .registered_master
+        .read()
+        .await
+        .as_ref()
+        .map(|rm| rm.operator_omni.clone());
+    let from_session = state
+        .master_session
+        .read()
+        .await
+        .as_ref()
+        .map(|ms| ms.operator_omni.clone());
+    let from_onboarding = state
+        .onboarding_session
+        .read()
+        .await
+        .as_ref()
+        .map(|s| s.omni.clone());
+    [from_registered, from_session, from_onboarding]
+        .into_iter()
+        .flatten()
+        .find(|o| !o.is_empty())
+        .map(|o| agentkeys_backend_client::normalize_omni_0x(&o))
+}
+
+fn short_hex(s: &str) -> String {
+    let h = s.trim_start_matches("0x");
+    if h.len() <= 10 {
+        format!("0x{h}")
+    } else {
+        format!("0x{}…{}", &h[..6], &h[h.len() - 4..])
+    }
+}
+
+fn ts_hms_from_unix(ts: u64) -> String {
+    let h = (ts / 3600) % 24;
+    let m = (ts / 60) % 60;
+    let s = ts % 60;
+    format!("{:02}:{:02}:{:02}", h, m, s)
+}
+
+/// Map one worker feed event into the web app's audit-event shape. Pure —
+/// unit-tested below.
+fn worker_feed_event_to_api(evt: &agentkeys_types::audit_feed::AuditFeedEvent) -> ApiAuditEvent {
+    let result_str = match evt.result {
+        0 => "ok",
+        1 => "failure",
+        2 => "NOT PERMITTED",
+        _ => "unknown",
+    };
+    let (actor, chip, detail) = match evt.kind.as_str() {
+        "anchor" => (
+            "audit-relay".to_string(),
+            "anchor".to_string(),
+            format!(
+                "anchored {} event(s) on-chain · root {} · tx {}",
+                evt.entry_count.unwrap_or(0),
+                short_hex(evt.merkle_root.as_deref().unwrap_or("")),
+                short_hex(evt.tx_hash.as_deref().unwrap_or("")),
+            ),
+        ),
+        "batch_failed" => (
+            "audit-relay".to_string(),
+            "anchor".to_string(),
+            format!(
+                "batch anchor FAILED after retries · {} event(s) re-queued · root {}",
+                evt.entry_count.unwrap_or(0),
+                short_hex(evt.merkle_root.as_deref().unwrap_or("")),
+            ),
+        ),
+        _ => {
+            let mut d = format!(
+                "{} · actor {} · {}",
+                evt.op_kind_label,
+                short_hex(&evt.actor_omni),
+                result_str
+            );
+            if let Some(intent) = &evt.intent_text {
+                d.push_str(" · ");
+                d.push_str(intent);
+            }
+            (short_hex(&evt.actor_omni), "worker".to_string(), d)
+        }
+    };
+    let sev = if evt.result == 0 { "ok" } else { "bad" };
+    let hash_tail = evt.envelope_hash.trim_start_matches("0x");
+    ApiAuditEvent {
+        id: format!("e-worker-{}", &hash_tail[..hash_tail.len().min(12)]),
+        ts: ts_hms_from_unix(evt.ts_unix),
+        actor_id: actor.clone(),
+        actor,
+        kind: evt.op_kind_label.clone(),
+        detail,
+        chip,
+        sev: sev.into(),
+        tx_hash: evt.tx_hash.clone(),
+        audit_envelope_hashes: Some(vec![evt.envelope_hash.clone()]),
+    }
+}
+
+/// Drain complete SSE frames (`event:`/`data:` blocks terminated by a blank
+/// line) out of `buf`, leaving any partial frame in place. Comments and
+/// keep-alives are dropped. Returns `(event_name, data)` pairs.
+fn drain_sse_frames(buf: &mut String) -> Vec<(String, String)> {
+    let mut frames = Vec::new();
+    loop {
+        let Some(pos) = buf.find("\n\n") else { break };
+        let frame: String = buf[..pos].to_string();
+        buf.drain(..pos + 2);
+        let mut event_name = "message".to_string();
+        let mut data_lines: Vec<&str> = Vec::new();
+        for line in frame.lines() {
+            if let Some(rest) = line.strip_prefix("event:") {
+                event_name = rest.trim().to_string();
+            } else if let Some(rest) = line.strip_prefix("data:") {
+                data_lines.push(rest.trim_start());
+            }
+            // ":" comments (keep-alives) and other fields are ignored.
+        }
+        if !data_lines.is_empty() {
+            frames.push((event_name, data_lines.join("\n")));
+        }
+    }
+    frames
+}
+
+/// Spawn the long-running bridge task. No-op (returns false) when the
+/// daemon was started without an audit-worker URL — hermetic tests and
+/// no-infra dev runs stay silent.
+pub fn spawn_audit_feed_bridge(state: SharedUiBridgeState) -> bool {
+    let Some(base) = state
+        .audit_worker_url
+        .clone()
+        .filter(|u| !u.trim().is_empty())
+    else {
+        tracing::info!("audit feed bridge disabled (no audit worker url)");
+        return false;
+    };
+    tokio::spawn(async move {
+        let client = reqwest::Client::new();
+        let mut backoff_secs = 2u64;
+        loop {
+            let Some(omni) = current_operator_omni(&state).await else {
+                // No master session yet — poll cheaply until one exists.
+                tokio::time::sleep(std::time::Duration::from_secs(5)).await;
+                continue;
+            };
+            match pump_worker_feed(&state, &client, &base, &omni).await {
+                Ok(delivered) => {
+                    if delivered > 0 {
+                        backoff_secs = 2; // healthy run — reset backoff
+                    }
+                    tracing::info!(delivered, "audit feed bridge stream ended; reconnecting");
+                }
+                Err(e) => {
+                    tracing::warn!(error = %e, backoff_secs, "audit feed bridge connect/pump failed");
+                }
+            }
+            tokio::time::sleep(std::time::Duration::from_secs(backoff_secs)).await;
+            backoff_secs = (backoff_secs * 2).min(60);
+        }
+    });
+    true
+}
+
+/// One SSE connection lifetime: backfill + live events until the stream
+/// closes or the session's operator changes. Returns how many events were
+/// folded into the feed.
+async fn pump_worker_feed(
+    state: &SharedUiBridgeState,
+    client: &reqwest::Client,
+    base: &str,
+    operator_omni: &str,
+) -> anyhow::Result<usize> {
+    use tokio_stream::StreamExt as _;
+    let url = format!(
+        "{}/v1/audit/stream?operator={}&backfill=200",
+        base.trim_end_matches('/'),
+        operator_omni
+    );
+    let resp = client
+        .get(&url)
+        .header("accept", "text/event-stream")
+        .send()
+        .await?;
+    if !resp.status().is_success() {
+        anyhow::bail!("audit worker stream HTTP {}", resp.status());
+    }
+    tracing::info!(operator = %operator_omni, "audit feed bridge connected");
+    let mut stream = resp.bytes_stream();
+    let mut buf = String::new();
+    let mut delivered = 0usize;
+    while let Some(chunk) = stream.next().await {
+        let chunk = chunk?;
+        buf.push_str(&String::from_utf8_lossy(&chunk));
+        for (event_name, data) in drain_sse_frames(&mut buf) {
+            let Ok(evt) =
+                serde_json::from_str::<agentkeys_types::audit_feed::AuditFeedEvent>(&data)
+            else {
+                tracing::warn!(event_name, "audit feed bridge: undecodable event skipped");
+                continue;
+            };
+            if evt.kind == "anchor" {
+                let mut anchor = state.anchor.write().await;
+                anchor.last_anchor_at = evt.ts_unix;
+            }
+            delivered += 1;
+            push_audit(state, worker_feed_event_to_api(&evt)).await;
+        }
+        // Session switched (logout / reset / different master)? Reconnect
+        // with the new filter. Keep-alive comments tick this check even
+        // when no events flow.
+        let current = current_operator_omni(state).await;
+        if current.as_deref() != Some(operator_omni) {
+            tracing::info!("audit feed bridge: operator changed — reconnecting");
+            break;
+        }
+    }
+    Ok(delivered)
+}
+
 // ─── Tests ─────────────────────────────────────────────────────────────
 //
 // These tests exercise the begin/finish state machine without a real
@@ -8838,6 +9102,152 @@ mod tests {
         assert_eq!(received.id, "e-stream-1");
     }
 
+    // ─── #109: audit-worker feed bridge units ───────────────────────────
+
+    fn worker_evt(kind: &str, hash: u8, result: u8) -> agentkeys_types::audit_feed::AuditFeedEvent {
+        agentkeys_types::audit_feed::AuditFeedEvent {
+            kind: kind.into(),
+            envelope_hash: format!("0x{}", hex::encode([hash; 32])),
+            ts_unix: 45_296, // 12:34:56 UTC
+            actor_omni: format!("0x{}", "aa".repeat(32)),
+            operator_omni: format!("0x{}", "bb".repeat(32)),
+            op_kind: 1,
+            op_kind_label: "cred.fetch".into(),
+            result,
+            intent_text: None,
+            tx_hash: if kind == "anchor" {
+                Some("0xfeedtx".into())
+            } else {
+                None
+            },
+            merkle_root: Some(format!("0x{}", "cc".repeat(32))),
+            entry_count: Some(3),
+        }
+    }
+
+    #[test]
+    fn drain_sse_frames_handles_partials_events_and_comments() {
+        let mut buf = String::new();
+        buf.push_str(": keep-alive\n\nevent: audit\ndata: {\"a\":1}\n\nevent: anch");
+        let frames = drain_sse_frames(&mut buf);
+        assert_eq!(frames, vec![("audit".to_string(), "{\"a\":1}".to_string())]);
+        assert_eq!(buf, "event: anch", "partial frame stays buffered");
+        buf.push_str("or\ndata: {\"b\":2}\n\n");
+        let frames = drain_sse_frames(&mut buf);
+        assert_eq!(frames, vec![("anchor".to_string(), "{\"b\":2}".to_string())]);
+        assert!(buf.is_empty());
+    }
+
+    #[test]
+    fn worker_feed_event_maps_to_api_shape() {
+        let api = worker_feed_event_to_api(&worker_evt("event", 0x11, 2));
+        assert_eq!(api.kind, "cred.fetch");
+        assert_eq!(api.sev, "bad", "NotPermitted renders as bad");
+        assert_eq!(api.ts, "12:34:56");
+        assert_eq!(api.chip, "worker");
+        assert!(api.detail.contains("NOT PERMITTED"));
+        assert_eq!(
+            api.audit_envelope_hashes,
+            Some(vec![format!("0x{}", hex::encode([0x11; 32]))]),
+            "decode page can fetch the real envelope"
+        );
+
+        let anchor = worker_feed_event_to_api(&worker_evt("anchor", 0x22, 0));
+        assert_eq!(anchor.chip, "anchor");
+        assert_eq!(anchor.actor, "audit-relay");
+        assert_eq!(anchor.tx_hash.as_deref(), Some("0xfeedtx"));
+        assert!(anchor.detail.contains("anchored 3 event(s)"));
+    }
+
+    #[tokio::test]
+    async fn push_audit_dedups_by_envelope_hash_either_order() {
+        let state = make_state();
+        // Bridge delivers the worker envelope first…
+        push_audit(&state, worker_feed_event_to_api(&worker_evt("event", 0x33, 0))).await;
+        assert_eq!(state.audit.read().await.len(), 1);
+        // …then the local submit flow pushes its own event carrying the SAME
+        // receipt hash → dropped as a duplicate.
+        let local = ApiAuditEvent {
+            id: "e-scope-1".into(),
+            ts: "00:00:01".into(),
+            actor_id: "master".into(),
+            actor: "master".into(),
+            kind: "scope.granted".into(),
+            detail: "dup of the worker event".into(),
+            chip: "broker".into(),
+            sev: "ok".into(),
+            tx_hash: Some("0xabc".into()),
+            audit_envelope_hashes: Some(vec![format!("0x{}", hex::encode([0x33; 32]))]),
+        };
+        push_audit(&state, local.clone()).await;
+        assert_eq!(state.audit.read().await.len(), 1, "duplicate dropped");
+        // An event with a FRESH hash still lands.
+        let mut fresh = local;
+        fresh.audit_envelope_hashes = Some(vec![format!("0x{}", hex::encode([0x44; 32]))]);
+        push_audit(&state, fresh).await;
+        assert_eq!(state.audit.read().await.len(), 2);
+        // Hash-less events are never deduped.
+        push_audit(
+            &state,
+            ApiAuditEvent {
+                id: "e-local-2".into(),
+                ts: "00:00:02".into(),
+                actor_id: "master".into(),
+                actor: "master".into(),
+                kind: "memory.updated".into(),
+                detail: "no receipt".into(),
+                chip: "broker".into(),
+                sev: "ok".into(),
+                tx_hash: None,
+                audit_envelope_hashes: None,
+            },
+        )
+        .await;
+        assert_eq!(state.audit.read().await.len(), 3);
+    }
+
+    #[tokio::test]
+    async fn bridge_anchor_event_flips_anchor_status_real() {
+        // End-to-end through a real SSE socket: a fake worker serves one
+        // anchor event; the pump folds it into the feed AND updates
+        // /v1/anchor/status's last_anchor_at.
+        use axum::response::sse::{Event as SseEvent, Sse};
+        let evt = worker_evt("anchor", 0x55, 0);
+        let payload = serde_json::to_string(&evt).unwrap();
+        let app = axum::Router::new().route(
+            "/v1/audit/stream",
+            axum::routing::get(move || {
+                let payload = payload.clone();
+                async move {
+                    let stream = tokio_stream::once(Ok::<_, std::convert::Infallible>(
+                        SseEvent::default().event("anchor").data(payload),
+                    ));
+                    Sse::new(stream)
+                }
+            }),
+        );
+        let listener = tokio::net::TcpListener::bind("127.0.0.1:0").await.unwrap();
+        let addr = listener.local_addr().unwrap();
+        tokio::spawn(async move {
+            axum::serve(listener, app).await.ok();
+        });
+
+        let state = make_state();
+        let delivered = pump_worker_feed(
+            &state,
+            &reqwest::Client::new(),
+            &format!("http://{addr}"),
+            &format!("0x{}", "bb".repeat(32)),
+        )
+        .await
+        .expect("pump");
+        assert_eq!(delivered, 1);
+        assert_eq!(state.anchor.read().await.last_anchor_at, 45_296);
+        let feed = state.audit.read().await;
+        assert_eq!(feed.len(), 1);
+        assert_eq!(feed[0].chip, "anchor");
+    }
+
     #[tokio::test]
     async fn decode_audit_event_returns_real_calldata_and_envelope() {
         // #153: GET /v1/audit/:id/decode wires the real decoder. Assert the
diff --git a/crates/agentkeys-types/src/audit_feed.rs b/crates/agentkeys-types/src/audit_feed.rs
new file mode 100644
index 00000000..4c917856
--- /dev/null
+++ b/crates/agentkeys-types/src/audit_feed.rs
@@ -0,0 +1,42 @@
+//! Tier-1 audit feed wire shape (issue #109).
+//!
+//! ONE owner for the event JSON the audit worker's `GET /v1/audit/stream`
+//! SSE emits and its ring buffers / S3 archive persist — consumed by the
+//! worker (`agentkeys-worker-audit`), the daemon's feed bridge
+//! (`ui_bridge`), and any future explorer poller. Re-typing this shape in
+//! a consumer is the #200/#203 drift bug class; depend on this struct
+//! instead.
+
+use serde::{Deserialize, Serialize};
+
+/// One feed entry. `kind` distinguishes ordinary envelope events from the
+/// tier-A relay's meta events (which carry the trailing optional fields).
+#[derive(Clone, Debug, Serialize, Deserialize)]
+pub struct AuditFeedEvent {
+    /// "event" | "anchor" | "batch_failed".
+    pub kind: String,
+    /// 0x-hex `keccak256(canonical_cbor(envelope))`.
+    pub envelope_hash: String,
+    pub ts_unix: u64,
+    /// 0x-hex 32-byte omni of the acting identity (for anchor /
+    /// batch_failed: the relay's derived omni).
+    pub actor_omni: String,
+    /// 0x-hex 32-byte omni of the operator whose boundary the op touched.
+    pub operator_omni: String,
+    pub op_kind: u8,
+    /// `AuditOpKind::label()` or `"unknown(<byte>)"` — render directly so
+    /// new op_kinds degrade gracefully (non-break invariant #4).
+    pub op_kind_label: String,
+    /// 0=Success, 1=Failure, 2=NotPermitted.
+    pub result: u8,
+    #[serde(default, skip_serializing_if = "Option::is_none")]
+    pub intent_text: Option<String>,
+    /// Anchor events: the confirmed `appendV2` tx.
+    #[serde(default, skip_serializing_if = "Option::is_none")]
+    pub tx_hash: Option<String>,
+    /// Anchor / batch_failed events: the batch's Merkle root.
+    #[serde(default, skip_serializing_if = "Option::is_none")]
+    pub merkle_root: Option<String>,
+    #[serde(default, skip_serializing_if = "Option::is_none")]
+    pub entry_count: Option<u64>,
+}
diff --git a/crates/agentkeys-types/src/lib.rs b/crates/agentkeys-types/src/lib.rs
index c01c343a..92061f36 100644
--- a/crates/agentkeys-types/src/lib.rs
+++ b/crates/agentkeys-types/src/lib.rs
@@ -2,6 +2,7 @@ use std::fmt;
 
 use serde::{Deserialize, Serialize};
 
+pub mod audit_feed;
 pub mod cred_manifest;
 pub mod provision;
 
diff --git a/crates/agentkeys-worker-audit/Cargo.toml b/crates/agentkeys-worker-audit/Cargo.toml
index ff576d12..0c3adc47 100644
--- a/crates/agentkeys-worker-audit/Cargo.toml
+++ b/crates/agentkeys-worker-audit/Cargo.toml
@@ -27,6 +27,13 @@ sha3 = "0.10"
 hex = "0.4"
 ciborium = "0.2"
 clap = { version = "4", features = ["derive", "env"] }
+# #109 tier-A anchor relay: legacy-tx signing key + SSE stream plumbing.
+agentkeys-types = { workspace = true }
+k256 = { version = "0.13", features = ["ecdsa", "sha2"] }
+tokio-stream = { version = "0.1", features = ["sync"] }
+# #109 S3 cold archive (instance-profile creds, same as the email worker).
+aws-config = { version = "1", features = ["behavior-version-latest"] }
+aws-sdk-s3 = "1"
 
 [dev-dependencies]
 tokio = { workspace = true, features = ["full", "test-util"] }
diff --git a/crates/agentkeys-worker-audit/src/anchor.rs b/crates/agentkeys-worker-audit/src/anchor.rs
new file mode 100644
index 00000000..099801a7
--- /dev/null
+++ b/crates/agentkeys-worker-audit/src/anchor.rs
@@ -0,0 +1,536 @@
+//! Tier-A on-chain anchor relay (issue #109) — the audit worker submits each
+//! flushed V2 batch to `CredentialAudit` autonomously, on the
+//! `AGENTKEYS_AUDIT_BATCH_SECONDS` cadence.
+//!
+//! ## Why `appendV2` + op_kind 90, not `appendRootV2`
+//!
+//! `appendRootV2` is gated `msg.sender == registry.operatorMasterWallet(omni)`
+//! and the registry rejects EOA masters (`MasterMustBeAccount`), so a hosted
+//! relay EOA can never pass it — and a prod master is a Touch-ID passkey that
+//! cannot sign on a 2-minute cadence. Instead the relay wraps the batch root
+//! in an honest `AuditRootAnchor` envelope (op_kind 90, body carries
+//! `{merkle_root, op_kind_bitmap, entry_count, relay_address}`) and commits
+//! THAT envelope's hash via the ungated
+//! `appendV2(operatorOmni, relayActorOmni, 90, envelopeHash)` — one legacy tx
+//! per batch, the REAL operator omni stays an indexed topic, zero contract
+//! change (§15.3b invariant #6). Genuine anchors are distinguished from
+//! third-party spam by `tx.from == relay_address` (published at
+//! `GET /v1/audit/relay-info`). The master-gated `appendRootV2` remains the
+//! sovereign tier-B/C route (`heima-worker-smoke.sh`).
+//!
+//! All chain reads are raw JSON-RPC (no alloy/ethers — Heima's mixHash-less
+//! responses crash their parsers; see `docs/spec/heima-eth-gap.md`), and the
+//! tx is a fixed-gas-limit legacy tx via `agentkeys_core::legacy_tx`.
+
+use std::time::Duration;
+
+use agentkeys_core::device_crypto::keccak256;
+use agentkeys_core::legacy_tx::LegacyTx;
+use anyhow::{anyhow, bail, Context, Result};
+use k256::ecdsa::SigningKey;
+use serde_json::{json, Value};
+use tracing::{info, warn};
+
+/// Everything the relay needs to sign + submit an anchor tx. Built once at
+/// boot via `from_env` (workers inject a hand-built config in tests — never
+/// mutate process env, per the #258/#259 rule).
+pub struct RelayConfig {
+    pub rpc_url: String,
+    pub chain_id: u64,
+    /// `CredentialAudit` contract address (20 bytes).
+    pub credential_audit: [u8; 20],
+    pub signing_key: SigningKey,
+    /// 20-byte EVM address derived from `signing_key`.
+    pub relay_address: [u8; 20],
+    /// `actor_omni_from_wallet(relay_address)` — the anchor envelopes'
+    /// actor identity. Deterministic; no on-chain registration needed.
+    pub relay_omni: [u8; 32],
+    /// Pinned gas limit (Heima `eth_estimateGas` is unreliable on some
+    /// shapes; `appendV2` is emit-only so 200k is generous headroom).
+    pub gas_limit: u128,
+    /// Submission attempts per batch before declaring the batch failed.
+    pub attempts: u32,
+    /// Base for exponential backoff between attempts (`base * 4^(n-1)`).
+    /// Tests set 0.
+    pub backoff_base: Duration,
+    /// How long to poll for the tx receipt before treating the attempt as
+    /// failed. A timed-out tx may still land later — re-anchoring the same
+    /// envelope hashes is benign (duplicate `AuditAppendedV2` events;
+    /// explorers dedup by envelope hash).
+    pub receipt_timeout: Duration,
+}
+
+impl RelayConfig {
+    /// Resolve from env + the pinned chain profile. Returns `Ok(None)` when
+    /// no relay key is configured — the worker then runs in the pre-#109
+    /// degraded mode (flush logs `appendRootV2` inputs, anchors nothing).
+    ///
+    /// Env surface (all optional except the key):
+    /// - `AGENTKEYS_AUDIT_RELAY_KEY_FILE` — path to a hex private key file
+    ///   (preferred; generated 0600 by `setup-broker-host.sh`), or
+    ///   `AGENTKEYS_AUDIT_RELAY_KEY` — inline hex (CI).
+    /// - `AGENTKEYS_CHAIN` / `AGENTKEYS_CHAIN_PROFILE_FILE` — profile pick.
+    /// - `AGENTKEYS_AUDIT_RPC_URL` — override the profile's `rpc.http`.
+    /// - `AGENTKEYS_AUDIT_CREDENTIAL_AUDIT_ADDRESS` — override the
+    ///   profile's `CredentialAudit` address (isolated test stacks).
+    /// - `AGENTKEYS_AUDIT_ANCHOR_GAS_LIMIT` (default 200000),
+    ///   `AGENTKEYS_AUDIT_ANCHOR_ATTEMPTS` (default 3),
+    ///   `AGENTKEYS_AUDIT_ANCHOR_RECEIPT_TIMEOUT_SECS` (default 60).
+    pub fn from_env() -> Result<Option<Self>> {
+        let key_hex = match std::env::var("AGENTKEYS_AUDIT_RELAY_KEY_FILE") {
+            Ok(path) if !path.is_empty() => std::fs::read_to_string(&path)
+                .with_context(|| format!("read AGENTKEYS_AUDIT_RELAY_KEY_FILE={path}"))?
+                .trim()
+                .to_string(),
+            _ => match std::env::var("AGENTKEYS_AUDIT_RELAY_KEY") {
+                Ok(k) if !k.is_empty() => k.trim().to_string(),
+                _ => return Ok(None),
+            },
+        };
+
+        let (profile, picked) = agentkeys_core::chain_profile::ChainProfile::resolve(
+            None,
+            std::env::var("AGENTKEYS_CHAIN").ok().as_deref(),
+            std::env::var("AGENTKEYS_CHAIN_PROFILE_FILE").ok().as_deref(),
+        )?;
+        info!(profile = %profile.name, %picked, "anchor relay chain profile");
+
+        let rpc_url = match std::env::var("AGENTKEYS_AUDIT_RPC_URL") {
+            Ok(u) if !u.is_empty() => u,
+            _ => profile.rpc.http.clone(),
+        };
+        let audit_addr_hex = match std::env::var("AGENTKEYS_AUDIT_CREDENTIAL_AUDIT_ADDRESS") {
+            Ok(a) if !a.is_empty() => a,
+            _ => profile
+                .contract("CredentialAudit")
+                .ok_or_else(|| anyhow!("chain profile {} has no CredentialAudit", profile.name))?
+                .address
+                .clone(),
+        };
+
+        let gas_limit = env_u128("AGENTKEYS_AUDIT_ANCHOR_GAS_LIMIT", 200_000)?;
+        let attempts = env_u128("AGENTKEYS_AUDIT_ANCHOR_ATTEMPTS", 3)? as u32;
+        let receipt_secs = env_u128("AGENTKEYS_AUDIT_ANCHOR_RECEIPT_TIMEOUT_SECS", 60)? as u64;
+
+        Ok(Some(Self::build(
+            rpc_url,
+            profile.chain_id,
+            &audit_addr_hex,
+            &key_hex,
+            gas_limit,
+            attempts,
+            Duration::from_secs(2),
+            Duration::from_secs(receipt_secs),
+        )?))
+    }
+
+    /// Assemble a config from explicit values (the test path — no env reads).
+    #[allow(clippy::too_many_arguments)]
+    pub fn build(
+        rpc_url: String,
+        chain_id: u64,
+        credential_audit_hex: &str,
+        relay_key_hex: &str,
+        gas_limit: u128,
+        attempts: u32,
+        backoff_base: Duration,
+        receipt_timeout: Duration,
+    ) -> Result<Self> {
+        let credential_audit = decode20(credential_audit_hex)
+            .ok_or_else(|| anyhow!("CredentialAudit address must be 20-byte hex"))?;
+        let key_bytes =
+            hex::decode(relay_key_hex.trim().trim_start_matches("0x")).context("relay key hex")?;
+        let signing_key = SigningKey::from_slice(&key_bytes).context("relay key")?;
+        let relay_address = eth_address(&signing_key);
+        let relay_wallet = agentkeys_types::WalletAddress(format!(
+            "0x{}",
+            hex::encode(relay_address)
+        ));
+        let relay_omni = agentkeys_core::actor_omni::actor_omni_from_wallet(&relay_wallet);
+        Ok(Self {
+            rpc_url,
+            chain_id,
+            credential_audit,
+            signing_key,
+            relay_address,
+            relay_omni,
+            gas_limit,
+            attempts,
+            backoff_base,
+            receipt_timeout,
+        })
+    }
+
+    pub fn relay_address_hex(&self) -> String {
+        format!("0x{}", hex::encode(self.relay_address))
+    }
+
+    pub fn relay_omni_hex(&self) -> String {
+        format!("0x{}", hex::encode(self.relay_omni))
+    }
+}
+
+/// `appendV2(bytes32 operatorOmni, bytes32 actorOmni, uint8 opKind, bytes32
+/// envelopeHash)` calldata. Selector is computed from the signature at call
+/// time (golden-tested below) — no hardcoded magic bytes.
+pub fn encode_append_v2_calldata(
+    operator_omni: [u8; 32],
+    actor_omni: [u8; 32],
+    op_kind: u8,
+    envelope_hash: [u8; 32],
+) -> Vec<u8> {
+    let selector = &keccak256(b"appendV2(bytes32,bytes32,uint8,bytes32)")[..4];
+    let mut data = Vec::with_capacity(4 + 32 * 4);
+    data.extend_from_slice(selector);
+    data.extend_from_slice(&operator_omni);
+    data.extend_from_slice(&actor_omni);
+    let mut kind_padded = [0u8; 32];
+    kind_padded[31] = op_kind;
+    data.extend_from_slice(&kind_padded);
+    data.extend_from_slice(&envelope_hash);
+    data
+}
+
+/// Outcome of one anchored batch.
+#[derive(Debug, Clone)]
+pub struct AnchorReceipt {
+    pub tx_hash: String,
+    pub attempts_used: u32,
+}
+
+/// Submit one anchor tx (calldata already encoded) with the configured
+/// retry/backoff policy. Returns the tx hash on the first confirmed
+/// attempt; aggregates the last error when every attempt fails.
+pub async fn submit_anchor_with_retries(
+    cfg: &RelayConfig,
+    http: &reqwest::Client,
+    calldata: Vec<u8>,
+) -> std::result::Result<AnchorReceipt, AnchorFailure> {
+    let mut last_error = String::new();
+    for attempt in 1..=cfg.attempts {
+        if attempt > 1 {
+            let backoff = cfg.backoff_base * 4u32.pow(attempt - 2);
+            tokio::time::sleep(backoff).await;
+        }
+        match submit_once(cfg, http, &calldata).await {
+            Ok(tx_hash) => {
+                return Ok(AnchorReceipt {
+                    tx_hash,
+                    attempts_used: attempt,
+                })
+            }
+            Err(e) => {
+                last_error = format!("{e:#}");
+                warn!(attempt, error = %last_error, "anchor submission attempt failed");
+            }
+        }
+    }
+    Err(AnchorFailure {
+        attempts: cfg.attempts,
+        last_error,
+    })
+}
+
+/// All attempts exhausted — the caller re-queues the batch entries and
+/// emits the `AuditBatchFailed` (op_kind 91) envelope.
+#[derive(Debug, Clone)]
+pub struct AnchorFailure {
+    pub attempts: u32,
+    pub last_error: String,
+}
+
+async fn submit_once(cfg: &RelayConfig, http: &reqwest::Client, calldata: &[u8]) -> Result<String> {
+    let submitter = cfg.relay_address_hex();
+    let nonce = parse_qty(
+        &rpc_call(
+            http,
+            &cfg.rpc_url,
+            "eth_getTransactionCount",
+            json!([submitter, "pending"]),
+        )
+        .await?,
+        "eth_getTransactionCount",
+    )?;
+    // +25% headroom over the node's quote so a base-fee tick doesn't strand
+    // the tx (same policy as the bundler's handleOps carrier).
+    let gas_price = parse_qty(
+        &rpc_call(http, &cfg.rpc_url, "eth_gasPrice", json!([])).await?,
+        "eth_gasPrice",
+    )? * 125
+        / 100;
+
+    let tx = LegacyTx {
+        nonce,
+        gas_price,
+        gas_limit: cfg.gas_limit,
+        to: cfg.credential_audit,
+        value: 0,
+        data: calldata.to_vec(),
+        chain_id: cfg.chain_id,
+    };
+    let (raw, _) = tx.sign(&cfg.signing_key)?;
+    let tx_hash = rpc_call(
+        http,
+        &cfg.rpc_url,
+        "eth_sendRawTransaction",
+        json!([format!("0x{}", hex::encode(raw))]),
+    )
+    .await?
+    .as_str()
+    .ok_or_else(|| anyhow!("eth_sendRawTransaction returned non-string"))?
+    .to_string();
+
+    // Poll the raw receipt (NEVER a typed parser — Heima receipts carry no
+    // mixHash). status 0x1 = success; 0x0 = reverted (appendV2 is ungated +
+    // emit-only, so a revert means out-of-gas or a wrong address — both
+    // operator errors worth failing loudly on).
+    let deadline = tokio::time::Instant::now() + cfg.receipt_timeout;
+    loop {
+        let receipt = rpc_call(
+            http,
+            &cfg.rpc_url,
+            "eth_getTransactionReceipt",
+            json!([tx_hash]),
+        )
+        .await?;
+        if !receipt.is_null() {
+            let status = receipt
+                .get("status")
+                .and_then(|s| s.as_str())
+                .unwrap_or("0x0");
+            if status == "0x1" {
+                info!(%tx_hash, "anchor tx confirmed");
+                return Ok(tx_hash);
+            }
+            bail!("anchor tx {tx_hash} reverted (status {status})");
+        }
+        if tokio::time::Instant::now() >= deadline {
+            bail!(
+                "anchor tx {tx_hash} unconfirmed after {:?} (it may still land; re-anchoring is benign)",
+                cfg.receipt_timeout
+            );
+        }
+        tokio::time::sleep(Duration::from_secs(2)).await;
+    }
+}
+
+/// Raw JSON-RPC POST with transient-failure retries (the Heima public RPC
+/// intermittently 500s — same posture as the workers' `eth_call` helper).
+async fn rpc_call(
+    http: &reqwest::Client,
+    rpc_url: &str,
+    method: &str,
+    params: Value,
+) -> Result<Value> {
+    let body = json!({"jsonrpc": "2.0", "id": 1, "method": method, "params": params});
+    let mut last = String::new();
+    for _ in 0..3 {
+        match http.post(rpc_url).json(&body).send().await {
+            Ok(resp) if resp.status().is_success() => {
+                let v: Value = resp.json().await.map_err(|e| anyhow!("{method} json: {e}"))?;
+                if let Some(err) = v.get("error") {
+                    // RPC-level errors are NOT transient (bad tx, low funds) —
+                    // surface immediately.
+                    bail!("{method} rpc error: {err}");
+                }
+                return Ok(v.get("result").cloned().unwrap_or(Value::Null));
+            }
+            Ok(resp) => last = format!("{method} HTTP {}", resp.status()),
+            Err(e) => last = format!("{method} POST: {e}"),
+        }
+        tokio::time::sleep(Duration::from_millis(300)).await;
+    }
+    bail!("{last} (after 3 tries)")
+}
+
+/// Current relay balance in wei (`eth_getBalance`), `None` on any RPC
+/// trouble — diagnostics only, never load-bearing.
+pub async fn relay_balance_wei(cfg: &RelayConfig, http: &reqwest::Client) -> Option<u128> {
+    let v = rpc_call(
+        http,
+        &cfg.rpc_url,
+        "eth_getBalance",
+        json!([cfg.relay_address_hex(), "latest"]),
+    )
+    .await
+    .ok()?;
+    parse_qty(&v, "eth_getBalance").ok()
+}
+
+fn parse_qty(v: &Value, what: &str) -> Result<u128> {
+    let s = v
+        .as_str()
+        .ok_or_else(|| anyhow!("{what} returned non-string"))?;
+    u128::from_str_radix(s.trim_start_matches("0x"), 16).with_context(|| format!("{what}: {s}"))
+}
+
+fn env_u128(key: &str, default: u128) -> Result<u128> {
+    match std::env::var(key) {
+        Ok(v) if !v.is_empty() => v.parse::<u128>().with_context(|| format!("{key}={v}")),
+        _ => Ok(default),
+    }
+}
+
+fn decode20(s: &str) -> Option<[u8; 20]> {
+    let v = hex::decode(s.trim_start_matches("0x")).ok()?;
+    if v.len() != 20 {
+        return None;
+    }
+    let mut out = [0u8; 20];
+    out.copy_from_slice(&v);
+    Some(out)
+}
+
+/// Keccak-derived EVM address of a secp256k1 signing key.
+fn eth_address(sk: &SigningKey) -> [u8; 20] {
+    let pubkey = sk.verifying_key().to_encoded_point(false);
+    let digest = keccak256(&pubkey.as_bytes()[1..]);
+    let mut out = [0u8; 20];
+    out.copy_from_slice(&digest[12..]);
+    out
+}
+
+#[cfg(test)]
+mod tests {
+    use super::*;
+    use axum::{routing::post, Json, Router};
+    use std::sync::atomic::{AtomicU32, Ordering};
+    use std::sync::Arc;
+
+    #[test]
+    fn append_v2_calldata_layout() {
+        let data = encode_append_v2_calldata([0x22; 32], [0x44; 32], 90, [0x90; 32]);
+        assert_eq!(data.len(), 4 + 32 * 4);
+        // Selector matches `cast sig "appendV2(bytes32,bytes32,uint8,bytes32)"`.
+        assert_eq!(
+            &data[..4],
+            &keccak256(b"appendV2(bytes32,bytes32,uint8,bytes32)")[..4]
+        );
+        assert_eq!(&data[4..36], &[0x22; 32]);
+        assert_eq!(&data[36..68], &[0x44; 32]);
+        let mut kind = [0u8; 32];
+        kind[31] = 90;
+        assert_eq!(&data[68..100], &kind);
+        assert_eq!(&data[100..132], &[0x90; 32]);
+    }
+
+    #[test]
+    fn relay_identity_is_deterministic() {
+        let cfg = RelayConfig::build(
+            "http://127.0.0.1:1".into(),
+            212013,
+            &format!("0x{}", "11".repeat(20)),
+            &format!("0x{}", "46".repeat(32)),
+            200_000,
+            3,
+            Duration::ZERO,
+            Duration::from_secs(1),
+        )
+        .unwrap();
+        // The EIP-155 reference key (0x46×32) → its well-known address.
+        assert_eq!(
+            cfg.relay_address_hex(),
+            "0x9d8a62f656a8d1615c1294fd71e9cfb3e4855a4f"
+        );
+        // relay_omni = actor_omni(relay address) — the agentkeysevm digest.
+        let expected = agentkeys_core::actor_omni::actor_omni_from_wallet(
+            &agentkeys_types::WalletAddress(cfg.relay_address_hex()),
+        );
+        assert_eq!(cfg.relay_omni, expected);
+    }
+
+    /// Fake JSON-RPC node: counts calls; `fail_first` requests 503 before
+    /// recovering. Drives the full submit path over real HTTP.
+    async fn spawn_fake_rpc(fail_first: u32) -> (String, Arc<AtomicU32>) {
+        let calls = Arc::new(AtomicU32::new(0));
+        let calls_in = calls.clone();
+        let app = Router::new().route(
+            "/",
+            post(move |Json(req): Json<Value>| {
+                let calls = calls_in.clone();
+                async move {
+                    let n = calls.fetch_add(1, Ordering::SeqCst);
+                    if n < fail_first {
+                        return Err(axum::http::StatusCode::SERVICE_UNAVAILABLE);
+                    }
+                    let method = req.get("method").and_then(|m| m.as_str()).unwrap_or("");
+                    let result = match method {
+                        "eth_getTransactionCount" => json!("0x0"),
+                        "eth_gasPrice" => json!("0x3b9aca00"),
+                        "eth_sendRawTransaction" => json!(format!("0x{}", "ab".repeat(32))),
+                        "eth_getTransactionReceipt" => json!({"status": "0x1"}),
+                        other => panic!("unexpected method {other}"),
+                    };
+                    Ok(Json(json!({"jsonrpc": "2.0", "id": 1, "result": result})))
+                }
+            }),
+        );
+        let listener = tokio::net::TcpListener::bind("127.0.0.1:0").await.unwrap();
+        let addr = listener.local_addr().unwrap();
+        tokio::spawn(async move {
+            axum::serve(listener, app).await.ok();
+        });
+        (format!("http://{addr}/"), calls)
+    }
+
+    fn test_cfg(rpc_url: String) -> RelayConfig {
+        RelayConfig::build(
+            rpc_url,
+            212013,
+            &format!("0x{}", "11".repeat(20)),
+            &format!("0x{}", "46".repeat(32)),
+            200_000,
+            3,
+            Duration::ZERO,
+            Duration::from_secs(5),
+        )
+        .unwrap()
+    }
+
+    #[tokio::test]
+    async fn submits_anchor_end_to_end_against_fake_rpc() {
+        let (url, _) = spawn_fake_rpc(0).await;
+        let cfg = test_cfg(url);
+        let http = reqwest::Client::new();
+        let calldata = encode_append_v2_calldata([0x22; 32], cfg.relay_omni, 90, [0x90; 32]);
+        let receipt = submit_anchor_with_retries(&cfg, &http, calldata)
+            .await
+            .expect("anchored");
+        assert_eq!(receipt.tx_hash, format!("0x{}", "ab".repeat(32)));
+        assert_eq!(receipt.attempts_used, 1);
+    }
+
+    #[tokio::test]
+    async fn transient_rpc_failures_recover_within_one_attempt() {
+        // 2 leading 503s are absorbed by rpc_call's own 3-try transport
+        // retry — still attempt #1 from the batch-retry perspective.
+        let (url, _) = spawn_fake_rpc(2).await;
+        let cfg = test_cfg(url);
+        let http = reqwest::Client::new();
+        let calldata = encode_append_v2_calldata([0x22; 32], cfg.relay_omni, 90, [0x90; 32]);
+        let receipt = submit_anchor_with_retries(&cfg, &http, calldata)
+            .await
+            .expect("anchored");
+        assert_eq!(receipt.attempts_used, 1);
+    }
+
+    #[tokio::test]
+    async fn exhausted_attempts_return_failure_with_last_error() {
+        // Every request 503s: 3 batch attempts × 3 transport tries all fail.
+        let (url, calls) = spawn_fake_rpc(u32::MAX).await;
+        let cfg = test_cfg(url);
+        let http = reqwest::Client::new();
+        let calldata = encode_append_v2_calldata([0x22; 32], cfg.relay_omni, 90, [0x90; 32]);
+        let failure = submit_anchor_with_retries(&cfg, &http, calldata)
+            .await
+            .expect_err("must fail");
+        assert_eq!(failure.attempts, 3);
+        assert!(
+            failure.last_error.contains("503"),
+            "got: {}",
+            failure.last_error
+        );
+        // 3 attempts × 3 transport tries on the FIRST rpc (nonce fetch).
+        assert_eq!(calls.load(Ordering::SeqCst), 9);
+    }
+}
diff --git a/crates/agentkeys-worker-audit/src/archive.rs b/crates/agentkeys-worker-audit/src/archive.rs
new file mode 100644
index 00000000..89619182
--- /dev/null
+++ b/crates/agentkeys-worker-audit/src/archive.rs
@@ -0,0 +1,260 @@
+//! S3 cold archive for the Tier-1 feed (issue #109).
+//!
+//! Layout under `s3://$AGENTKEYS_AUDIT_S3_BUCKET/$AGENTKEYS_AUDIT_S3_PREFIX`:
+//!
+//! - `feed/<actor_omni_hex>/<ts:012>-<envelope_hash>.json` — one `FeedEvent`
+//!   per object. Zero-padded unix seconds make lexical key order
+//!   chronological, so "the last N events" is the listing tail.
+//! - `envelopes/<envelope_hash>.cbor` — canonical envelope bytes (the
+//!   durable layer behind `GET /v1/audit/envelope/:hash`; pre-#109 the
+//!   by-hash store was in-memory only and restarts lost it).
+//!
+//! Every operation is best-effort with a loud WARN: the chain commitment is
+//! the tamper-evidence layer; S3 is retention. Writes are spawned off the
+//! append path so a slow S3 never blocks an emit. On boot,
+//! `restore_rings` rebuilds the per-actor ring buffers ("last 1000 events
+//! per actor survive a restart" — the #109 acceptance criterion).
+//!
+//! Credentials come from the default AWS provider chain (the broker host's
+//! EC2 instance profile — same posture as the email worker's inbox client).
+
+use aws_sdk_s3::Client as S3Client;
+use tracing::{info, warn};
+
+use crate::state::{FeedEvent, State};
+
+#[derive(Clone)]
+pub struct Archive {
+    s3: S3Client,
+    bucket: String,
+    /// Normalized to either "" or "…/" so key joins are always clean.
+    prefix: String,
+}
+
+impl Archive {
+    /// `None` when `AGENTKEYS_AUDIT_S3_BUCKET` is unset/empty — the worker
+    /// then runs in-memory only (rings still work, restarts lose them).
+    pub async fn from_env() -> Option<Self> {
+        let bucket = std::env::var("AGENTKEYS_AUDIT_S3_BUCKET").ok()?;
+        if bucket.is_empty() {
+            return None;
+        }
+        let prefix = std::env::var("AGENTKEYS_AUDIT_S3_PREFIX").unwrap_or_else(|_| "audit/".into());
+        let cfg = aws_config::defaults(aws_config::BehaviorVersion::latest())
+            .load()
+            .await;
+        Some(Self::new(S3Client::new(&cfg), bucket, prefix))
+    }
+
+    pub fn new(s3: S3Client, bucket: String, prefix: String) -> Self {
+        let prefix = match prefix.trim_matches('/') {
+            "" => String::new(),
+            p => format!("{p}/"),
+        };
+        Self { s3, bucket, prefix }
+    }
+
+    fn feed_key(&self, evt: &FeedEvent) -> String {
+        format!(
+            "{}feed/{}/{:012}-{}.json",
+            self.prefix,
+            evt.actor_omni.trim_start_matches("0x").to_lowercase(),
+            evt.ts_unix,
+            evt.envelope_hash.trim_start_matches("0x").to_lowercase(),
+        )
+    }
+
+    fn envelope_key(&self, envelope_hash: &str) -> String {
+        format!(
+            "{}envelopes/{}.cbor",
+            self.prefix,
+            envelope_hash.trim_start_matches("0x").to_lowercase()
+        )
+    }
+
+    /// Fire-and-forget archive of one feed event (spawned — never blocks
+    /// the append path).
+    pub fn archive_feed_event(&self, evt: FeedEvent) {
+        let this = self.clone();
+        tokio::spawn(async move {
+            let key = this.feed_key(&evt);
+            let body = match serde_json::to_vec(&evt) {
+                Ok(b) => b,
+                Err(e) => {
+                    warn!(error = %e, "archive: serialize feed event");
+                    return;
+                }
+            };
+            if let Err(e) = this
+                .s3
+                .put_object()
+                .bucket(&this.bucket)
+                .key(&key)
+                .body(body.into())
+                .content_type("application/json")
+                .send()
+                .await
+            {
+                warn!(key, error = %e, "archive: feed event PUT failed");
+            }
+        });
+    }
+
+    /// Fire-and-forget archive of one canonical envelope.
+    pub fn archive_envelope(&self, envelope_hash: String, cbor: Vec<u8>) {
+        let this = self.clone();
+        tokio::spawn(async move {
+            let key = this.envelope_key(&envelope_hash);
+            if let Err(e) = this
+                .s3
+                .put_object()
+                .bucket(&this.bucket)
+                .key(&key)
+                .body(cbor.into())
+                .content_type("application/cbor")
+                .send()
+                .await
+            {
+                warn!(key, error = %e, "archive: envelope PUT failed");
+            }
+        });
+    }
+
+    /// Cold-read an envelope by hash (the `GET /v1/audit/envelope/:hash`
+    /// fallback when the in-memory map misses, e.g. after a restart).
+    pub async fn fetch_envelope(&self, envelope_hash: &str) -> Option<Vec<u8>> {
+        let key = self.envelope_key(envelope_hash);
+        match self
+            .s3
+            .get_object()
+            .bucket(&self.bucket)
+            .key(&key)
+            .send()
+            .await
+        {
+            Ok(out) => match out.body.collect().await {
+                Ok(bytes) => Some(bytes.into_bytes().to_vec()),
+                Err(e) => {
+                    warn!(key, error = %e, "archive: envelope body read failed");
+                    None
+                }
+            },
+            Err(_) => None,
+        }
+    }
+
+    /// Boot-time recovery: rebuild every actor's ring buffer from the
+    /// archive tail (last `ring_cap` events per actor).
+    pub async fn restore_rings(&self, state: &State, ring_cap: usize) {
+        let feed_root = format!("{}feed/", self.prefix);
+        let actors = match self.list_actor_prefixes(&feed_root).await {
+            Ok(a) => a,
+            Err(e) => {
+                warn!(error = %e, "archive: restore skipped (list actors failed)");
+                return;
+            }
+        };
+        let mut restored_actors = 0usize;
+        let mut restored_events = 0usize;
+        for actor_prefix in actors {
+            let keys = match self.list_keys_tail(&actor_prefix, ring_cap).await {
+                Ok(k) => k,
+                Err(e) => {
+                    warn!(actor_prefix, error = %e, "archive: actor listing failed");
+                    continue;
+                }
+            };
+            let mut events = Vec::with_capacity(keys.len());
+            for key in keys {
+                match self.get_feed_event(&key).await {
+                    Some(evt) => events.push(evt),
+                    None => warn!(key, "archive: feed event GET/parse failed during restore"),
+                }
+            }
+            if events.is_empty() {
+                continue;
+            }
+            events.sort_by_key(|e| e.ts_unix);
+            let actor = events[0].actor_omni.clone();
+            restored_events += events.len();
+            restored_actors += 1;
+            state.restore_ring(actor, events).await;
+        }
+        info!(
+            restored_actors,
+            restored_events, "archive: ring buffers restored"
+        );
+    }
+
+    async fn get_feed_event(&self, key: &str) -> Option<FeedEvent> {
+        let out = self
+            .s3
+            .get_object()
+            .bucket(&self.bucket)
+            .key(key)
+            .send()
+            .await
+            .ok()?;
+        let bytes = out.body.collect().await.ok()?.into_bytes();
+        serde_json::from_slice(&bytes).ok()
+    }
+
+    /// One level of common prefixes under `feed/` — the per-actor folders.
+    async fn list_actor_prefixes(&self, feed_root: &str) -> anyhow::Result<Vec<String>> {
+        let mut prefixes = Vec::new();
+        let mut token: Option<String> = None;
+        loop {
+            let resp = self
+                .s3
+                .list_objects_v2()
+                .bucket(&self.bucket)
+                .prefix(feed_root)
+                .delimiter("/")
+                .set_continuation_token(token.take())
+                .send()
+                .await?;
+            for p in resp.common_prefixes() {
+                if let Some(pfx) = p.prefix() {
+                    prefixes.push(pfx.to_string());
+                }
+            }
+            match resp.next_continuation_token() {
+                Some(t) => token = Some(t.to_string()),
+                None => break,
+            }
+        }
+        Ok(prefixes)
+    }
+
+    /// The lexical tail (= chronological tail, keys are ts-prefixed) of one
+    /// actor's feed listing, capped to `n`.
+    async fn list_keys_tail(&self, actor_prefix: &str, n: usize) -> anyhow::Result<Vec<String>> {
+        let mut tail: Vec<String> = Vec::new();
+        let mut token: Option<String> = None;
+        loop {
+            let resp = self
+                .s3
+                .list_objects_v2()
+                .bucket(&self.bucket)
+                .prefix(actor_prefix)
+                .set_continuation_token(token.take())
+                .send()
+                .await?;
+            for o in resp.contents() {
+                if let Some(k) = o.key() {
+                    tail.push(k.to_string());
+                }
+            }
+            // Keep only the newest `n` as we page — bounds memory on large
+            // archives (listing is ascending, so retain the tail).
+            if tail.len() > n {
+                tail.drain(..tail.len() - n);
+            }
+            match resp.next_continuation_token() {
+                Some(t) => token = Some(t.to_string()),
+                None => break,
+            }
+        }
+        Ok(tail)
+    }
+}
diff --git a/crates/agentkeys-worker-audit/src/handlers.rs b/crates/agentkeys-worker-audit/src/handlers.rs
index ff996ad5..4ad40c2e 100644
--- a/crates/agentkeys-worker-audit/src/handlers.rs
+++ b/crates/agentkeys-worker-audit/src/handlers.rs
@@ -10,19 +10,31 @@
 //!   POST /v1/audit/append/v2           — store an envelope + return its `envelope_hash`
 //!   GET  /v1/audit/envelope/:hash      — fetch the canonical CBOR for an envelope hash
 //!
+//! Endpoints (Tier-1 feed + tier-A anchor, issue #109):
+//!   GET  /v1/audit/stream              — SSE live feed (?operator=&actor=&backfill=N)
+//!   GET  /v1/audit/anchors/:operator   — recent anchored batches w/ Merkle proofs
+//!   GET  /v1/audit/relay-info          — relay address/omni + anchor config
+//!
 //! Per arch.md §15.3a, V1 + V2 coexist for one migration cycle.
 
+use std::convert::Infallible;
+
 use axum::{
     body::Body,
-    extract::{Path, State},
+    extract::{Path, Query, State},
     http::{header, HeaderValue, StatusCode},
+    response::sse::{Event, KeepAlive, Sse},
     response::{IntoResponse, Response},
     Json,
 };
 use serde::{Deserialize, Serialize};
 use serde_json::json;
+use tokio_stream::wrappers::BroadcastStream;
+use tokio_stream::{Stream, StreamExt};
 
-use crate::state::{AuditEvent, FlushResult, FlushV2Result, SharedState, V2QueueEntry};
+use crate::state::{
+    AnchorRecord, AuditEvent, FeedEvent, FlushResult, FlushV2Result, SharedState, V2QueueEntry,
+};
 
 #[derive(Deserialize)]
 pub struct AppendRequest {
@@ -56,6 +68,9 @@ pub struct FlushResponse {
     /// the `appendRootV2(operatorOmni, merkleRoot, opKindBitmap, entryCount)`
     /// inputs for the on-chain anchor.
     pub flushed_v2: Vec<FlushV2Result>,
+    /// #109: batches the tier-A relay anchored on-chain during this flush
+    /// (empty when the relay is unconfigured or a batch failed + re-queued).
+    pub anchored: Vec<AnchorRecord>,
 }
 
 pub async fn flush_one(
@@ -66,14 +81,15 @@ pub async fn flush_one(
         .flush(&operator_omni)
         .await
         .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?;
-    let r2 = state
-        .flush_v2(&operator_omni.to_lowercase())
-        .await
-        .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?;
+    let (r2, anchored) =
+        crate::service::flush_v2_and_anchor(&state, Some(&operator_omni.to_lowercase()))
+            .await
+            .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?;
     Ok(Json(FlushResponse {
         ok: true,
         flushed: r.into_iter().collect(),
-        flushed_v2: r2.into_iter().collect(),
+        flushed_v2: r2,
+        anchored,
     }))
 }
 
@@ -84,14 +100,14 @@ pub async fn flush_all(
         .flush_all()
         .await
         .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?;
-    let r2 = state
-        .flush_v2_all()
+    let (r2, anchored) = crate::service::flush_v2_and_anchor(&state, None)
         .await
         .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?;
     Ok(Json(FlushResponse {
         ok: true,
         flushed: r,
         flushed_v2: r2,
+        anchored,
     }))
 }
 
@@ -210,9 +226,9 @@ pub async fn append_v2(
         .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, format!("hash: {e}")))?;
     let hash_hex = format!("0x{}", hex::encode(envelope_hash));
 
-    state.store_envelope(hash_hex.clone(), cbor).await;
+    state.store_envelope(hash_hex.clone(), cbor.clone()).await;
     // Tier-A anchor feed (#229): queue the envelope hash for the next
-    // `appendRootV2` Merkle batch alongside the by-hash store above.
+    // anchor batch alongside the by-hash store above.
     state
         .queue_v2(
             format!("0x{}", hex::encode(operator_omni)),
@@ -225,12 +241,175 @@ pub async fn append_v2(
         )
         .await;
 
+    // Tier-1 real-time feed (#109): ring buffer + SSE fan-out + cold
+    // archive. ~100ms event-to-UI comes from this in-process push (no
+    // polling anywhere on the path).
+    let op_kind_label = agentkeys_core::audit::AuditOpKind::from_u8(req.op_kind)
+        .map(|k| k.label().to_string())
+        .unwrap_or_else(|| format!("unknown({})", req.op_kind));
+    let evt = state
+        .push_feed(FeedEvent {
+            kind: "event".into(),
+            envelope_hash: hash_hex.clone(),
+            ts_unix: envelope.ts_unix,
+            actor_omni: format!("0x{}", hex::encode(actor_omni)),
+            operator_omni: format!("0x{}", hex::encode(operator_omni)),
+            op_kind: req.op_kind,
+            op_kind_label,
+            result: req.result,
+            intent_text: envelope.intent_text.clone(),
+            tx_hash: None,
+            merkle_root: None,
+            entry_count: None,
+        })
+        .await;
+    if let Some(archive) = &state.archive {
+        archive.archive_feed_event(evt);
+        archive.archive_envelope(hash_hex.clone(), cbor);
+    }
+
     Ok(Json(AppendV2Response {
         ok: true,
         envelope_hash: hash_hex,
     }))
 }
 
+// ─── Tier-1 feed + tier-A anchor endpoints (issue #109) ──────────────────
+
+#[derive(Deserialize)]
+pub struct StreamQuery {
+    /// Filter to one operator's events (0x-hex omni). Strongly recommended
+    /// — the worker is multi-tenant.
+    pub operator: Option<String>,
+    /// Further filter to one actor.
+    pub actor: Option<String>,
+    /// Ring-buffer events replayed on connect before going live. Default
+    /// 0; capped at the ring size.
+    #[serde(default)]
+    pub backfill: usize,
+}
+
+/// `GET /v1/audit/stream` — Server-Sent Events: `backfill` ring events,
+/// then every matching feed event as it happens. Event types: `audit`
+/// (ordinary envelope), `anchor`, `batch_failed`. Heartbeats via SSE
+/// keep-alive comments.
+pub async fn stream(
+    State(state): State<SharedState>,
+    Query(q): Query<StreamQuery>,
+) -> Sse<impl Stream<Item = Result<Event, Infallible>>> {
+    // Subscribe BEFORE snapshotting the backfill so no event falls between
+    // the two (duplicates are possible at the seam; consumers dedup by
+    // envelope_hash).
+    let live = state.subscribe_feed();
+    let backfill = if q.backfill > 0 {
+        state
+            .backfill(q.operator.as_deref(), q.actor.as_deref(), q.backfill)
+            .await
+    } else {
+        Vec::new()
+    };
+    let operator = q.operator.clone();
+    let actor = q.actor.clone();
+
+    let matches = move |e: &FeedEvent| -> bool {
+        if let Some(op) = &operator {
+            if !op.eq_ignore_ascii_case(&e.operator_omni) {
+                return false;
+            }
+        }
+        if let Some(a) = &actor {
+            // Anchor/batch_failed meta events carry the relay's actor omni;
+            // still deliver them to actor-filtered streams for the operator
+            // they belong to (the anchor badge needs them).
+            if e.kind == "event" && !a.eq_ignore_ascii_case(&e.actor_omni) {
+                return false;
+            }
+        }
+        true
+    };
+
+    let backfill_stream = tokio_stream::iter(backfill.into_iter().map(Ok::<_, Infallible>));
+    let live_stream = BroadcastStream::new(live).filter_map(move |msg| match msg {
+        Ok(evt) if matches(&evt) => Some(Ok::<_, Infallible>(evt)),
+        _ => None, // lagged receivers skip; clients re-sync via backfill
+    });
+
+    let stream = backfill_stream
+        .chain(live_stream)
+        .map(|evt: Result<FeedEvent, Infallible>| {
+            let evt = evt.expect("infallible");
+            let name = match evt.kind.as_str() {
+                "anchor" => "anchor",
+                "batch_failed" => "batch_failed",
+                _ => "audit",
+            };
+            Ok(Event::default()
+                .event(name)
+                .data(serde_json::to_string(&evt).unwrap_or_else(|_| "{}".into())))
+        });
+    Sse::new(stream).keep_alive(KeepAlive::default())
+}
+
+#[derive(Serialize)]
+pub struct AnchorsResponse {
+    pub operator_omni: String,
+    pub anchors: Vec<AnchorRecord>,
+}
+
+/// `GET /v1/audit/anchors/:operator` — recent anchored batches (newest
+/// last) with per-entry Merkle proofs. The tamper test recomputes each
+/// leaf from the fetched envelope and verifies it against
+/// `merkle_root_hex` via the proof; any modified event no longer matches.
+pub async fn anchors_for(
+    State(state): State<SharedState>,
+    Path(operator_omni): Path<String>,
+) -> Json<AnchorsResponse> {
+    let anchors = state.anchors_for(&operator_omni).await;
+    Json(AnchorsResponse {
+        operator_omni,
+        anchors,
+    })
+}
+
+#[derive(Serialize)]
+pub struct RelayInfoResponse {
+    pub anchor_enabled: bool,
+    /// 0x-hex EVM address of the tier-A relay — verifiers match anchor
+    /// txs' `from` against this. `null` in degraded mode.
+    #[serde(skip_serializing_if = "Option::is_none")]
+    pub relay_address: Option<String>,
+    /// The relay's derived actor omni (`actor_omni(relay_address)`).
+    #[serde(skip_serializing_if = "Option::is_none")]
+    pub relay_omni: Option<String>,
+    /// Current relay balance in wei (string — u128 range), when the RPC
+    /// answered. The funding helper reads this.
+    #[serde(skip_serializing_if = "Option::is_none")]
+    pub balance_wei: Option<String>,
+}
+
+/// `GET /v1/audit/relay-info` — the relay's public identity + funding
+/// state. `heima-fund-audit-relay.sh` reads `relay_address` + `balance_wei`
+/// to idempotently top up from the deploy wallet.
+pub async fn relay_info(State(state): State<SharedState>) -> Json<RelayInfoResponse> {
+    let Some(relay) = state.relay.as_ref() else {
+        return Json(RelayInfoResponse {
+            anchor_enabled: false,
+            relay_address: None,
+            relay_omni: None,
+            balance_wei: None,
+        });
+    };
+    let balance_wei = crate::anchor::relay_balance_wei(relay, &state.http)
+        .await
+        .map(|b| b.to_string());
+    Json(RelayInfoResponse {
+        anchor_enabled: true,
+        relay_address: Some(relay.relay_address_hex()),
+        relay_omni: Some(relay.relay_omni_hex()),
+        balance_wei,
+    })
+}
+
 /// `GET /v1/audit/envelope/:hash` — return the canonical CBOR for the
 /// envelope identified by `envelope_hash` (a 0x-prefixed 64-hex string).
 /// Returns 404 if unknown.
@@ -239,7 +418,17 @@ pub async fn append_v2(
 /// matches by re-running `keccak256(body)`.
 pub async fn get_envelope(State(state): State<SharedState>, Path(hash): Path<String>) -> Response {
     let key = hash.to_lowercase();
-    match state.get_envelope(&key).await {
+    let mut found = state.get_envelope(&key).await;
+    if found.is_none() {
+        // #109: cold-archive fallback — survives worker restarts.
+        if let Some(archive) = &state.archive {
+            found = archive.fetch_envelope(&key).await;
+            if let Some(cbor) = &found {
+                state.store_envelope(key.clone(), cbor.clone()).await;
+            }
+        }
+    }
+    match found {
         Some(cbor) => Response::builder()
             .status(StatusCode::OK)
             .header(
diff --git a/crates/agentkeys-worker-audit/src/lib.rs b/crates/agentkeys-worker-audit/src/lib.rs
index 4cc8b442..4541e7ff 100644
--- a/crates/agentkeys-worker-audit/src/lib.rs
+++ b/crates/agentkeys-worker-audit/src/lib.rs
@@ -1,15 +1,21 @@
 //! Audit-service worker — tier-A Merkle relay per arch.md §15.3.
 //!
 //! Accepts per-event audit appends over HTTP, batches them in memory per
-//! operator, computes a Merkle tree on flush, and writes the root to the
-//! on-chain CredentialAudit contract (one tx per batch — `appendRoot`).
+//! operator, computes a Merkle tree on flush, and (#109) anchors each batch
+//! on-chain autonomously via the relay EOA + the `AuditRootAnchor`
+//! envelope (`CredentialAudit.appendV2`, op_kind 90). Also serves the
+//! Tier-1 real-time feed: per-actor ring buffers + SSE + S3 cold archive.
 //!
 //! Tier-A vs tier-C (direct `append` per event): tier-A trades latency for
 //! gas — each batch is one tx regardless of size, but events aren't visible
-//! on chain until the next flush.
+//! on chain until the next flush (`AGENTKEYS_AUDIT_BATCH_SECONDS`, default
+//! 120).
 
+pub mod anchor;
+pub mod archive;
 pub mod handlers;
 pub mod merkle;
+pub mod service;
 pub mod state;
 
 use axum::{
@@ -32,5 +38,12 @@ pub fn create_router(state: state::SharedState) -> Router {
         )
         .route("/v1/audit/append/v2", post(handlers::append_v2))
         .route("/v1/audit/envelope/:hash", get(handlers::get_envelope))
+        // Tier-1 feed + tier-A anchor surfaces (issue #109).
+        .route("/v1/audit/stream", get(handlers::stream))
+        .route(
+            "/v1/audit/anchors/:operator_omni",
+            get(handlers::anchors_for),
+        )
+        .route("/v1/audit/relay-info", get(handlers::relay_info))
         .with_state(state)
 }
diff --git a/crates/agentkeys-worker-audit/src/main.rs b/crates/agentkeys-worker-audit/src/main.rs
index e1df03ee..e226ae39 100644
--- a/crates/agentkeys-worker-audit/src/main.rs
+++ b/crates/agentkeys-worker-audit/src/main.rs
@@ -1,14 +1,12 @@
 use std::sync::Arc;
 
-use axum::routing::{get, post};
-use axum::Router;
 use clap::Parser;
-use tracing::info;
+use tracing::{info, warn};
 
-use agentkeys_worker_audit::handlers;
 use agentkeys_worker_audit::state::State;
+use agentkeys_worker_audit::{anchor, archive, create_router, service};
 
-/// Audit-service worker — tier-A Merkle relay (arch.md §15.3).
+/// Audit-service worker — tier-A Merkle relay (arch.md §15.3, issue #109).
 #[derive(Parser)]
 #[command(name = "agentkeys-worker-audit", version)]
 struct Args {
@@ -28,14 +26,16 @@ struct Args {
     )]
     leaves_dir: String,
 
-    /// Periodic flush interval, in seconds. Default 300 (5 min). Set to 0 to
-    /// disable the timer (manual flush via /v1/audit/flush-all only).
-    #[arg(
-        long,
-        env = "AGENTKEYS_WORKER_AUDIT_FLUSH_INTERVAL_SECS",
-        default_value_t = 300
-    )]
-    flush_interval_secs: u64,
+    /// Tier-2 batch cadence in seconds (#109; the 2-minute anchor is a
+    /// PRODUCT decision — see the issue before relaxing it). 0 disables
+    /// the timer (manual flush via /v1/audit/flush-all only).
+    #[arg(long, env = "AGENTKEYS_AUDIT_BATCH_SECONDS")]
+    batch_seconds: Option<u64>,
+
+    /// DEPRECATED alias for --batch-seconds (pre-#109 name). Honored only
+    /// when AGENTKEYS_AUDIT_BATCH_SECONDS is unset.
+    #[arg(long, env = "AGENTKEYS_WORKER_AUDIT_FLUSH_INTERVAL_SECS")]
+    flush_interval_secs: Option<u64>,
 }
 
 #[tokio::main]
@@ -49,68 +49,106 @@ async fn main() -> anyhow::Result<()> {
         .init();
 
     let args = Args::parse();
-    let state = Arc::new(State::new(args.leaves_dir.clone()));
+    let batch_seconds = match (args.batch_seconds, args.flush_interval_secs) {
+        (Some(s), _) => s,
+        (None, Some(legacy)) => {
+            warn!(
+                legacy,
+                "AGENTKEYS_WORKER_AUDIT_FLUSH_INTERVAL_SECS is deprecated — set AGENTKEYS_AUDIT_BATCH_SECONDS"
+            );
+            legacy
+        }
+        (None, None) => 120,
+    };
+
+    let relay = match anchor::RelayConfig::from_env() {
+        Ok(Some(r)) => {
+            info!(
+                relay_address = %r.relay_address_hex(),
+                relay_omni = %r.relay_omni_hex(),
+                rpc = %r.rpc_url,
+                "tier-A anchor relay ENABLED"
+            );
+            Some(r)
+        }
+        Ok(None) => {
+            warn!(
+                "tier-A anchor relay UNCONFIGURED (no AGENTKEYS_AUDIT_RELAY_KEY[_FILE]) — \
+                 flushes will log appendRootV2 inputs only (degraded mode)"
+            );
+            None
+        }
+        Err(e) => {
+            // Boot anyway (the feed + store still work) but say why loudly —
+            // same degraded-boot posture as the #241 bundler.
+            warn!(error = %e, "tier-A anchor relay config INVALID — degraded mode");
+            None
+        }
+    };
 
-    // Spawn the periodic flusher if configured.
-    if args.flush_interval_secs > 0 {
+    let s3_archive = archive::Archive::from_env().await;
+    if s3_archive.is_none() {
+        warn!("S3 cold archive UNCONFIGURED (no AGENTKEYS_AUDIT_S3_BUCKET) — rings are in-memory only");
+    }
+
+    let state = Arc::new(
+        State::new(args.leaves_dir.clone())
+            .with_relay(relay)
+            .with_archive(s3_archive),
+    );
+
+    // Boot-time ring recovery (#109): last 1000 events per actor.
+    if let Some(archive) = state.archive.clone() {
+        archive
+            .restore_rings(&state, agentkeys_worker_audit::state::DEFAULT_RING_CAP)
+            .await;
+    }
+
+    // The Tier-2 anchor timer (#109): flush + anchor every batch_seconds.
+    if batch_seconds > 0 {
         let state = state.clone();
-        let interval = args.flush_interval_secs;
         tokio::spawn(async move {
-            let mut t = tokio::time::interval(std::time::Duration::from_secs(interval));
+            let mut t = tokio::time::interval(std::time::Duration::from_secs(batch_seconds));
             t.tick().await; // skip immediate fire
             loop {
                 t.tick().await;
+                // V1 queues — legacy log-only flush (operator master commits
+                // appendRoot itself; see heima-worker-smoke.sh).
                 match state.flush_all().await {
-                    Ok(rs) if !rs.is_empty() => {
+                    Ok(rs) => {
                         for r in rs {
                             info!(
                                 operator_omni = %r.operator_omni,
                                 entries = r.entry_count,
                                 root = %r.merkle_root_hex,
                                 leaves = %r.leaves_path,
-                                "auto-flush: Merkle root ready for on-chain appendRoot"
+                                "auto-flush: V1 Merkle root ready for on-chain appendRoot"
                             );
                         }
                     }
-                    Ok(_) => {}
-                    Err(e) => tracing::error!(error=%e, "flush failed"),
+                    Err(e) => tracing::error!(error = %e, "v1 flush failed"),
                 }
-                // V2 envelope batches (#229) — anchor inputs for appendRootV2.
-                match state.flush_v2_all().await {
-                    Ok(rs) if !rs.is_empty() => {
-                        for r in rs {
+                // V2 queues — flush AND anchor (the #109 tier-A default-on).
+                match service::flush_v2_and_anchor(&state, None).await {
+                    Ok((flushed, anchored)) => {
+                        if !flushed.is_empty() {
                             info!(
-                                operator_omni = %r.operator_omni,
-                                entries = r.entry_count,
-                                root = %r.merkle_root_hex,
-                                op_kind_bitmap = %r.op_kind_bitmap_hex,
-                                leaves = %r.leaves_path,
-                                "auto-flush: V2 Merkle root ready for on-chain appendRootV2"
+                                batches = flushed.len(),
+                                anchored = anchored.len(),
+                                "auto-flush: V2 batches processed"
                             );
                         }
                     }
-                    Ok(_) => {}
-                    Err(e) => tracing::error!(error=%e, "v2 flush failed"),
+                    Err(e) => tracing::error!(error = %e, "v2 flush failed"),
                 }
             }
         });
+        info!(batch_seconds, "tier-2 anchor timer running");
+    } else {
+        warn!("batch timer DISABLED (batch_seconds=0) — manual flush only");
     }
 
-    let app = Router::new()
-        .route("/healthz", get(|| async { "ok" }))
-        .route("/v1/audit/append", post(handlers::append))
-        .route("/v1/audit/flush/:operator_omni", post(handlers::flush_one))
-        .route("/v1/audit/flush-all", post(handlers::flush_all))
-        .route(
-            "/v1/audit/queue-size/:operator_omni",
-            get(handlers::queue_size),
-        )
-        // V2 endpoints (arch.md §15.3a, issue #97 phase B). V1 stays so
-        // existing callers keep working during the migration cycle.
-        .route("/v1/audit/append/v2", post(handlers::append_v2))
-        .route("/v1/audit/envelope/:hash", get(handlers::get_envelope))
-        .with_state(state);
-
+    let app = create_router(state);
     let listener = tokio::net::TcpListener::bind(&args.bind).await?;
     info!(bind = %args.bind, "agentkeys-worker-audit listening");
     axum::serve(listener, app).await?;
diff --git a/crates/agentkeys-worker-audit/src/merkle.rs b/crates/agentkeys-worker-audit/src/merkle.rs
index 4c758959..70471a4d 100644
--- a/crates/agentkeys-worker-audit/src/merkle.rs
+++ b/crates/agentkeys-worker-audit/src/merkle.rs
@@ -118,6 +118,18 @@ pub fn merkle_proof(raw_leaves: &[Bytes32], index: usize) -> Vec<Bytes32> {
     proof
 }
 
+/// Verify a sorted-pairs proof for a RAW (unprefixed) leaf against a root —
+/// the off-chain mirror of `CredentialAudit.verifyEntryInRoot`. Used by the
+/// anchor-record tests and any consumer of `GET /v1/audit/anchors` proofs
+/// (#109 tamper-evidence check).
+pub fn verify_proof(raw_leaf: Bytes32, proof: &[Bytes32], root: Bytes32) -> bool {
+    let mut computed = leaf_prefix(raw_leaf);
+    for sibling in proof {
+        computed = hash_pair(computed, *sibling);
+    }
+    computed == root
+}
+
 #[cfg(test)]
 mod tests {
     use super::*;
diff --git a/crates/agentkeys-worker-audit/src/service.rs b/crates/agentkeys-worker-audit/src/service.rs
new file mode 100644
index 00000000..5d2a3a15
--- /dev/null
+++ b/crates/agentkeys-worker-audit/src/service.rs
@@ -0,0 +1,265 @@
+//! Flush-and-anchor orchestration (issue #109) — the ONE V2 flush path,
+//! shared by the periodic timer and the HTTP flush endpoints so a manual
+//! flush can never silently drain a batch past the anchor.
+//!
+//! Per flushed batch:
+//! - relay configured → build the `AuditRootAnchor` (op_kind 90) envelope,
+//!   commit its hash on-chain via `appendV2`, record the anchor (with
+//!   per-entry Merkle proofs) and surface it in the feed;
+//! - submission exhausted its retries → re-queue the batch entries (the
+//!   next flush re-batches them under a fresh root), emit the
+//!   `AuditBatchFailed` (op_kind 91) envelope into the store + queue +
+//!   feed, and ERROR-log (journald is the operator alert path);
+//! - no relay → pre-#109 degraded mode: log the `appendRootV2` inputs.
+
+use std::time::{SystemTime, UNIX_EPOCH};
+
+use agentkeys_core::audit::{
+    envelope_for, AuditBatchFailedBody, AuditOpKind, AuditResult, AuditRootAnchorBody,
+};
+use tracing::{error, info, warn};
+
+use crate::anchor::{encode_append_v2_calldata, submit_anchor_with_retries};
+use crate::state::{AnchorRecord, FeedEvent, FlushV2Result, SharedState, V2QueueEntry};
+
+fn now_unix() -> u64 {
+    SystemTime::now()
+        .duration_since(UNIX_EPOCH)
+        .map(|d| d.as_secs())
+        .unwrap_or(0)
+}
+
+fn decode32_hex(s: &str) -> [u8; 32] {
+    let v = hex::decode(s.trim_start_matches("0x")).unwrap_or_default();
+    let mut out = [0u8; 32];
+    let n = v.len().min(32);
+    out[..n].copy_from_slice(&v[..n]);
+    out
+}
+
+/// Drain V2 queues (one operator, or all when `None`) and anchor each
+/// batch. Returns the flush results plus the anchors that landed.
+pub async fn flush_v2_and_anchor(
+    state: &SharedState,
+    operator_omni: Option<&str>,
+) -> anyhow::Result<(Vec<FlushV2Result>, Vec<AnchorRecord>)> {
+    let flushed = match operator_omni {
+        Some(op) => state.flush_v2(op).await?.into_iter().collect(),
+        None => state.flush_v2_all().await?,
+    };
+    let mut anchors = Vec::new();
+    for flush in &flushed {
+        if let Some(record) = anchor_one_batch(state, flush).await {
+            anchors.push(record);
+        }
+    }
+    Ok((flushed, anchors))
+}
+
+/// Anchor a single flushed batch. `None` when the relay is unconfigured or
+/// the batch failed (entries re-queued — nothing is lost either way).
+async fn anchor_one_batch(state: &SharedState, flush: &FlushV2Result) -> Option<AnchorRecord> {
+    let Some(relay) = state.relay.as_ref() else {
+        info!(
+            operator_omni = %flush.operator_omni,
+            entries = flush.entry_count,
+            root = %flush.merkle_root_hex,
+            op_kind_bitmap = %flush.op_kind_bitmap_hex,
+            leaves = %flush.leaves_path,
+            "anchor relay unconfigured — appendRootV2 inputs logged only (degraded mode)"
+        );
+        return None;
+    };
+
+    let operator32 = decode32_hex(&flush.operator_omni);
+    let ts = now_unix();
+
+    // The anchor envelope — an honest AuditEnvelope whose hash goes on
+    // chain. actor = the relay's derived omni; operator = the REAL
+    // operator whose batch this is (stays an indexed topic on-chain).
+    let body = AuditRootAnchorBody {
+        merkle_root: flush.merkle_root_hex.clone(),
+        op_kind_bitmap: flush.op_kind_bitmap_hex.clone(),
+        entry_count: flush.entry_count,
+        relay_address: relay.relay_address_hex(),
+    };
+    let envelope = match envelope_for(
+        relay.relay_omni,
+        operator32,
+        AuditOpKind::AuditRootAnchor,
+        body,
+        AuditResult::Success,
+        None,
+        None,
+    ) {
+        Ok(mut e) => {
+            e.ts_unix = ts;
+            e
+        }
+        Err(e) => {
+            error!(error = %e, "anchor envelope build failed — re-queueing batch");
+            state
+                .requeue_v2(&flush.operator_omni, flush.entries.clone())
+                .await;
+            return None;
+        }
+    };
+    let (cbor, env_hash) = match (envelope.to_canonical_cbor(), envelope.envelope_hash()) {
+        (Ok(c), Ok(h)) => (c, h),
+        (c, h) => {
+            error!(?c, ?h, "anchor envelope encode failed — re-queueing batch");
+            state
+                .requeue_v2(&flush.operator_omni, flush.entries.clone())
+                .await;
+            return None;
+        }
+    };
+    let env_hash_hex = format!("0x{}", hex::encode(env_hash));
+
+    let calldata = encode_append_v2_calldata(
+        operator32,
+        relay.relay_omni,
+        AuditOpKind::AuditRootAnchor as u8,
+        env_hash,
+    );
+
+    match submit_anchor_with_retries(relay, &state.http, calldata).await {
+        Ok(receipt) => {
+            state.store_envelope(env_hash_hex.clone(), cbor.clone()).await;
+            if let Some(archive) = &state.archive {
+                archive.archive_envelope(env_hash_hex.clone(), cbor);
+            }
+            let record = state
+                .record_anchor(flush, env_hash_hex.clone(), receipt.tx_hash.clone(), ts)
+                .await;
+            let evt = state
+                .push_feed(FeedEvent {
+                    kind: "anchor".into(),
+                    envelope_hash: env_hash_hex,
+                    ts_unix: ts,
+                    actor_omni: relay.relay_omni_hex(),
+                    operator_omni: flush.operator_omni.clone(),
+                    op_kind: AuditOpKind::AuditRootAnchor as u8,
+                    op_kind_label: AuditOpKind::AuditRootAnchor.label().into(),
+                    result: AuditResult::Success as u8,
+                    intent_text: None,
+                    tx_hash: Some(receipt.tx_hash.clone()),
+                    merkle_root: Some(flush.merkle_root_hex.clone()),
+                    entry_count: Some(flush.entry_count),
+                })
+                .await;
+            if let Some(archive) = &state.archive {
+                archive.archive_feed_event(evt);
+            }
+            info!(
+                operator_omni = %flush.operator_omni,
+                entries = flush.entry_count,
+                root = %flush.merkle_root_hex,
+                tx_hash = %receipt.tx_hash,
+                attempts = receipt.attempts_used,
+                "tier-A batch anchored on-chain"
+            );
+            Some(record)
+        }
+        Err(failure) => {
+            // Durability first: the entries go back on the queue so the
+            // next tick re-batches them under a fresh root.
+            state
+                .requeue_v2(&flush.operator_omni, flush.entries.clone())
+                .await;
+            emit_batch_failed(state, flush, failure.attempts, &failure.last_error).await;
+            error!(
+                operator_omni = %flush.operator_omni,
+                entries = flush.entry_count,
+                root = %flush.merkle_root_hex,
+                attempts = failure.attempts,
+                last_error = %failure.last_error,
+                "tier-A anchor FAILED after retries — entries re-queued, audit.batch_failed emitted"
+            );
+            None
+        }
+    }
+}
+
+/// Emit the `AuditBatchFailed` envelope: stored by hash, queued for a
+/// future anchor (so the failure itself lands on-chain once the chain
+/// recovers), and pushed to the live feed.
+async fn emit_batch_failed(
+    state: &SharedState,
+    flush: &FlushV2Result,
+    attempts: u32,
+    last_error: &str,
+) {
+    let Some(relay) = state.relay.as_ref() else {
+        return;
+    };
+    let ts = now_unix();
+    let mut truncated = last_error.to_string();
+    truncated.truncate(512);
+    let body = AuditBatchFailedBody {
+        merkle_root: flush.merkle_root_hex.clone(),
+        entry_count: flush.entry_count,
+        attempts: attempts.min(u8::MAX as u32) as u8,
+        last_error: truncated,
+    };
+    let envelope = match envelope_for(
+        relay.relay_omni,
+        decode32_hex(&flush.operator_omni),
+        AuditOpKind::AuditBatchFailed,
+        body,
+        AuditResult::Failure,
+        None,
+        None,
+    ) {
+        Ok(mut e) => {
+            e.ts_unix = ts;
+            e
+        }
+        Err(e) => {
+            warn!(error = %e, "batch_failed envelope build failed");
+            return;
+        }
+    };
+    let (cbor, env_hash) = match (envelope.to_canonical_cbor(), envelope.envelope_hash()) {
+        (Ok(c), Ok(h)) => (c, h),
+        _ => {
+            warn!("batch_failed envelope encode failed");
+            return;
+        }
+    };
+    let env_hash_hex = format!("0x{}", hex::encode(env_hash));
+    state.store_envelope(env_hash_hex.clone(), cbor.clone()).await;
+    if let Some(archive) = &state.archive {
+        archive.archive_envelope(env_hash_hex.clone(), cbor);
+    }
+    state
+        .queue_v2(
+            flush.operator_omni.clone(),
+            V2QueueEntry {
+                envelope_hash: env_hash_hex.clone(),
+                op_kind: AuditOpKind::AuditBatchFailed as u8,
+                actor_omni: relay.relay_omni_hex(),
+                ts_unix: ts,
+            },
+        )
+        .await;
+    let evt = state
+        .push_feed(FeedEvent {
+            kind: "batch_failed".into(),
+            envelope_hash: env_hash_hex,
+            ts_unix: ts,
+            actor_omni: relay.relay_omni_hex(),
+            operator_omni: flush.operator_omni.clone(),
+            op_kind: AuditOpKind::AuditBatchFailed as u8,
+            op_kind_label: AuditOpKind::AuditBatchFailed.label().into(),
+            result: AuditResult::Failure as u8,
+            intent_text: None,
+            tx_hash: None,
+            merkle_root: Some(flush.merkle_root_hex.clone()),
+            entry_count: Some(flush.entry_count),
+        })
+        .await;
+    if let Some(archive) = &state.archive {
+        archive.archive_feed_event(evt);
+    }
+}
diff --git a/crates/agentkeys-worker-audit/src/state.rs b/crates/agentkeys-worker-audit/src/state.rs
index 06a4448a..fb66ea1d 100644
--- a/crates/agentkeys-worker-audit/src/state.rs
+++ b/crates/agentkeys-worker-audit/src/state.rs
@@ -1,14 +1,25 @@
-//! Per-operator in-memory event queue + flush logic.
+//! Per-operator in-memory event queue + flush logic, plus the #109 Tier-1
+//! real-time feed surfaces: per-actor ring buffers, the SSE broadcast
+//! channel, and the anchored-batch history (with Merkle proofs).
 
-use std::collections::HashMap;
+use std::collections::{HashMap, VecDeque};
 use std::sync::Arc;
 use std::time::{SystemTime, UNIX_EPOCH};
 
 use serde::{Deserialize, Serialize};
-use tokio::sync::Mutex;
+use tokio::sync::{broadcast, Mutex};
 
 use crate::merkle::{keccak256, merkle_proof, merkle_root, Bytes32};
 
+/// Per-actor ring-buffer capacity (issue #109: "last 1000 events per
+/// actor"). Tests shrink it via `State::with_caps`.
+pub const DEFAULT_RING_CAP: usize = 1000;
+/// Anchored-batch records retained per operator.
+pub const DEFAULT_ANCHOR_CAP: usize = 50;
+/// Broadcast fan-out capacity — slow SSE subscribers that lag past this
+/// many events miss them in the live stream (they re-sync via backfill).
+const FEED_CHANNEL_CAP: usize = 1024;
+
 #[derive(Clone, Debug, Serialize, Deserialize)]
 pub struct AuditEvent {
     /// 0x-prefixed 32-byte hex.
@@ -62,7 +73,36 @@ pub struct FlushV2Result {
     pub entries: Vec<V2QueueEntry>,
 }
 
-#[derive(Default)]
+/// One entry in the Tier-1 real-time feed (issue #109): the JSON shape the
+/// SSE stream emits, the ring buffers hold, and the S3 archive persists.
+/// The shape has ONE owner — `agentkeys_types::audit_feed::AuditFeedEvent`
+/// — shared with the daemon's feed bridge (the #203 one-owner rule).
+pub use agentkeys_types::audit_feed::AuditFeedEvent as FeedEvent;
+
+/// Merkle membership proof for one batch entry — what the tamper test and
+/// external verifiers consume from `GET /v1/audit/anchors`.
+#[derive(Clone, Debug, Serialize, Deserialize)]
+pub struct AnchorEntryProof {
+    pub envelope_hash: String,
+    pub leaf_index: usize,
+    pub proof: Vec<String>,
+}
+
+/// One anchored batch: the root + the on-chain commitment + per-entry
+/// proofs. Retained per operator (last `DEFAULT_ANCHOR_CAP`).
+#[derive(Clone, Debug, Serialize, Deserialize)]
+pub struct AnchorRecord {
+    pub operator_omni: String,
+    pub merkle_root_hex: String,
+    pub op_kind_bitmap_hex: String,
+    pub entry_count: u64,
+    /// Hash of the `AuditRootAnchor` envelope committed on-chain.
+    pub anchor_envelope_hash: String,
+    pub tx_hash: String,
+    pub anchored_ts_unix: u64,
+    pub entries: Vec<AnchorEntryProof>,
+}
+
 pub struct State {
     /// operator_omni (0x...) → queue of pending events.
     queues: Mutex<HashMap<String, Vec<AuditEvent>>>,
@@ -71,27 +111,196 @@ pub struct State {
     /// `envelope_hash` (lowercased 0x-hex) → canonical CBOR bytes.
     /// Populated by `POST /v1/audit/append/v2`; read by `GET
     /// /v1/audit/envelope/<hash>`. Per arch.md §15.3a issue #97 phase B.
-    ///
-    /// In-memory for v0 — the chain commitment is the durability
-    /// mechanism; if the worker restarts before a chain `appendV2` lands,
-    /// callers re-emit. Persistent storage (e.g., S3
-    /// `s3://<vault>/audit/envelopes/<hash>.cbor`) is tracked as a
-    /// follow-up alongside the contract redeploy.
+    /// The S3 cold archive (#109, `archive.rs`) is the durable layer; this
+    /// map is the hot cache.
     envelopes: Mutex<HashMap<String, Vec<u8>>>,
     /// operator_omni (0x...) → V2 envelopes awaiting the tier-A on-chain
-    /// anchor (`appendRootV2`). Fed by `POST /v1/audit/append/v2` (#229);
-    /// drained by the same flush endpoints/timer as the V1 queues.
+    /// anchor. Fed by `POST /v1/audit/append/v2` (#229); drained by the
+    /// flush endpoints/timer (#109: anchored on-chain when the relay is
+    /// configured).
     v2_queues: Mutex<HashMap<String, Vec<V2QueueEntry>>>,
+    /// actor_omni (0x...) → last `ring_cap` feed events (#109 Tier 1).
+    rings: Mutex<HashMap<String, VecDeque<FeedEvent>>>,
+    /// operator_omni (0x...) → recent anchored batches with proofs.
+    anchors: Mutex<HashMap<String, Vec<AnchorRecord>>>,
+    /// Live fan-out to SSE subscribers.
+    feed_tx: broadcast::Sender<FeedEvent>,
+    ring_cap: usize,
+    anchor_cap: usize,
+    /// Tier-A anchor relay (#109). `None` = degraded mode: flushes log the
+    /// `appendRootV2` inputs (pre-#109 behavior) and anchor nothing.
+    pub relay: Option<crate::anchor::RelayConfig>,
+    /// Shared HTTP client for chain RPC.
+    pub http: reqwest::Client,
+    /// S3 cold archive (#109). `None` = in-memory only.
+    pub archive: Option<crate::archive::Archive>,
 }
 
 impl State {
     pub fn new(leaves_dir: String) -> Self {
+        Self::with_caps(leaves_dir, DEFAULT_RING_CAP, DEFAULT_ANCHOR_CAP)
+    }
+
+    pub fn with_caps(leaves_dir: String, ring_cap: usize, anchor_cap: usize) -> Self {
+        let (feed_tx, _) = broadcast::channel(FEED_CHANNEL_CAP);
         Self {
             queues: Mutex::new(HashMap::new()),
             leaves_dir,
             envelopes: Mutex::new(HashMap::new()),
             v2_queues: Mutex::new(HashMap::new()),
+            rings: Mutex::new(HashMap::new()),
+            anchors: Mutex::new(HashMap::new()),
+            feed_tx,
+            ring_cap,
+            anchor_cap,
+            relay: None,
+            http: reqwest::Client::new(),
+            archive: None,
+        }
+    }
+
+    /// Attach the tier-A anchor relay (builder style — boot path only).
+    pub fn with_relay(mut self, relay: Option<crate::anchor::RelayConfig>) -> Self {
+        self.relay = relay;
+        self
+    }
+
+    /// Attach the S3 cold archive (builder style — boot path only).
+    pub fn with_archive(mut self, archive: Option<crate::archive::Archive>) -> Self {
+        self.archive = archive;
+        self
+    }
+
+    /// Subscribe to the live feed (SSE handler + the daemon bridge).
+    pub fn subscribe_feed(&self) -> broadcast::Receiver<FeedEvent> {
+        self.feed_tx.subscribe()
+    }
+
+    /// Push a feed event: append to the actor's ring buffer (evicting past
+    /// `ring_cap`) and fan out to live subscribers. Returns the event back
+    /// so callers can archive it.
+    pub async fn push_feed(&self, evt: FeedEvent) -> FeedEvent {
+        {
+            let mut rings = self.rings.lock().await;
+            let ring = rings.entry(evt.actor_omni.clone()).or_default();
+            ring.push_back(evt.clone());
+            while ring.len() > self.ring_cap {
+                ring.pop_front();
+            }
+        }
+        // Send fails only when no subscriber exists — that's fine.
+        let _ = self.feed_tx.send(evt.clone());
+        evt
+    }
+
+    /// Restore one actor's ring from the cold archive (boot-time, #109).
+    /// Events are assumed chronologically sorted; caps at `ring_cap`.
+    pub async fn restore_ring(&self, actor_omni: String, events: Vec<FeedEvent>) {
+        let mut ring: VecDeque<FeedEvent> = events.into();
+        while ring.len() > self.ring_cap {
+            ring.pop_front();
+        }
+        let mut rings = self.rings.lock().await;
+        rings.insert(actor_omni, ring);
+    }
+
+    /// Recent feed events, optionally filtered by operator and/or actor,
+    /// chronologically sorted, capped to the most recent `limit`. Powers
+    /// SSE backfill on connect.
+    pub async fn backfill(
+        &self,
+        operator_omni: Option<&str>,
+        actor_omni: Option<&str>,
+        limit: usize,
+    ) -> Vec<FeedEvent> {
+        let rings = self.rings.lock().await;
+        let mut out: Vec<FeedEvent> = Vec::new();
+        for (actor, ring) in rings.iter() {
+            if let Some(a) = actor_omni {
+                if !a.eq_ignore_ascii_case(actor) {
+                    continue;
+                }
+            }
+            for e in ring.iter() {
+                if let Some(op) = operator_omni {
+                    if !op.eq_ignore_ascii_case(&e.operator_omni) {
+                        continue;
+                    }
+                }
+                out.push(e.clone());
+            }
+        }
+        out.sort_by_key(|e| e.ts_unix);
+        if out.len() > limit {
+            out.drain(..out.len() - limit);
+        }
+        out
+    }
+
+    /// Record an anchored batch (computing per-entry Merkle proofs) and
+    /// surface it in the feed. Returns the stored record.
+    pub async fn record_anchor(
+        &self,
+        flush: &FlushV2Result,
+        anchor_envelope_hash: String,
+        tx_hash: String,
+        anchored_ts_unix: u64,
+    ) -> AnchorRecord {
+        let leaves: Vec<Bytes32> = flush
+            .entries
+            .iter()
+            .map(|e| decode32(&e.envelope_hash))
+            .collect();
+        let entries = flush
+            .entries
+            .iter()
+            .enumerate()
+            .map(|(i, e)| AnchorEntryProof {
+                envelope_hash: e.envelope_hash.clone(),
+                leaf_index: i,
+                proof: merkle_proof(&leaves, i)
+                    .iter()
+                    .map(|p| format!("0x{}", hex::encode(p)))
+                    .collect(),
+            })
+            .collect();
+        let record = AnchorRecord {
+            operator_omni: flush.operator_omni.clone(),
+            merkle_root_hex: flush.merkle_root_hex.clone(),
+            op_kind_bitmap_hex: flush.op_kind_bitmap_hex.clone(),
+            entry_count: flush.entry_count,
+            anchor_envelope_hash,
+            tx_hash,
+            anchored_ts_unix,
+            entries,
+        };
+        let mut anchors = self.anchors.lock().await;
+        let v = anchors.entry(flush.operator_omni.clone()).or_default();
+        v.push(record.clone());
+        while v.len() > self.anchor_cap {
+            v.remove(0);
         }
+        record
+    }
+
+    /// Recent anchored batches for one operator (newest last).
+    pub async fn anchors_for(&self, operator_omni: &str) -> Vec<AnchorRecord> {
+        let anchors = self.anchors.lock().await;
+        anchors
+            .iter()
+            .find(|(op, _)| op.eq_ignore_ascii_case(operator_omni))
+            .map(|(_, v)| v.clone())
+            .unwrap_or_default()
+    }
+
+    /// Put entries BACK at the head of an operator's V2 queue after a
+    /// failed anchor — the next flush re-batches them (fresh root).
+    pub async fn requeue_v2(&self, operator_omni: &str, entries: Vec<V2QueueEntry>) {
+        let mut q = self.v2_queues.lock().await;
+        let v = q.entry(operator_omni.to_string()).or_default();
+        let mut merged = entries;
+        merged.append(v);
+        *v = merged;
     }
 
     /// Store a canonical-CBOR-encoded `AuditEnvelope` keyed by its
@@ -377,6 +586,114 @@ mod tests {
         std::fs::remove_file(&r.leaves_path).ok();
     }
 
+    fn feed(actor: u8, operator: u8, ts: u64, hash: u8) -> FeedEvent {
+        FeedEvent {
+            kind: "event".into(),
+            envelope_hash: format!("0x{}", hex::encode([hash; 32])),
+            ts_unix: ts,
+            actor_omni: format!("0x{}", hex::encode([actor; 32])),
+            operator_omni: format!("0x{}", hex::encode([operator; 32])),
+            op_kind: 1,
+            op_kind_label: "cred.fetch".into(),
+            result: 0,
+            intent_text: None,
+            tx_hash: None,
+            merkle_root: None,
+            entry_count: None,
+        }
+    }
+
+    #[tokio::test]
+    async fn ring_caps_per_actor_and_backfill_filters() {
+        let s = State::with_caps("/tmp".into(), 3, 2);
+        for i in 0..5u8 {
+            s.push_feed(feed(0xA1, 0xB1, 100 + i as u64, i)).await;
+        }
+        s.push_feed(feed(0xA2, 0xB2, 200, 9)).await;
+        // Actor 0xA1's ring evicted down to the cap (last 3 of 5).
+        let all = s.backfill(None, None, 100).await;
+        assert_eq!(all.len(), 4, "3 capped + 1 other actor");
+        let a1 = s
+            .backfill(None, Some(&format!("0x{}", hex::encode([0xA1; 32]))), 100)
+            .await;
+        assert_eq!(a1.len(), 3);
+        assert_eq!(a1[0].ts_unix, 102, "oldest two evicted");
+        // Operator filter.
+        let b2 = s
+            .backfill(Some(&format!("0x{}", hex::encode([0xB2; 32]))), None, 100)
+            .await;
+        assert_eq!(b2.len(), 1);
+        // Limit takes the most recent.
+        let limited = s.backfill(None, None, 2).await;
+        assert_eq!(limited.len(), 2);
+        assert_eq!(limited[1].ts_unix, 200);
+    }
+
+    #[tokio::test]
+    async fn push_feed_fans_out_to_subscribers() {
+        let s = State::with_caps("/tmp".into(), 10, 2);
+        let mut rx = s.subscribe_feed();
+        s.push_feed(feed(0xA1, 0xB1, 100, 1)).await;
+        let got = rx.recv().await.expect("live event");
+        assert_eq!(got.ts_unix, 100);
+    }
+
+    #[tokio::test]
+    async fn requeue_v2_puts_entries_back_at_the_head() {
+        let s = State::new("/tmp".to_string());
+        s.queue_v2("0xop".into(), v2(0x01, 1)).await;
+        let r = s.flush_v2("0xop").await.unwrap().expect("non-empty");
+        std::fs::remove_file(&r.leaves_path).ok();
+        // Anchor failed → entries go back; a new event arrives meanwhile.
+        s.queue_v2("0xop".into(), v2(0x02, 11)).await;
+        s.requeue_v2("0xop", r.entries.clone()).await;
+        let r2 = s.flush_v2("0xop").await.unwrap().expect("non-empty");
+        std::fs::remove_file(&r2.leaves_path).ok();
+        assert_eq!(r2.entry_count, 2);
+        assert_eq!(
+            r2.entries[0].envelope_hash,
+            format!("0x{}", hex::encode([0x01; 32])),
+            "re-queued entry batches FIRST (oldest preserved)"
+        );
+    }
+
+    #[tokio::test]
+    async fn record_anchor_proofs_verify_and_tampered_leaf_fails() {
+        let s = State::new("/tmp".to_string());
+        s.queue_v2("0xop".into(), v2(0x01, 1)).await;
+        s.queue_v2("0xop".into(), v2(0x02, 11)).await;
+        s.queue_v2("0xop".into(), v2(0x03, 81)).await;
+        let flush = s.flush_v2("0xop").await.unwrap().expect("non-empty");
+        std::fs::remove_file(&flush.leaves_path).ok();
+        let record = s
+            .record_anchor(&flush, "0xanchorhash".into(), "0xtx".into(), 1_700_000_100)
+            .await;
+        assert_eq!(record.entries.len(), 3);
+        let root = decode32(&record.merkle_root_hex);
+        for entry in &record.entries {
+            let leaf = decode32(&entry.envelope_hash);
+            let proof: Vec<Bytes32> = entry.proof.iter().map(|p| decode32(p)).collect();
+            assert!(
+                crate::merkle::verify_proof(leaf, &proof, root),
+                "genuine leaf {} verifies",
+                entry.leaf_index
+            );
+            // The #109 tamper test: flip one byte of the event → the
+            // recomputed leaf no longer matches the anchored root.
+            let mut tampered = leaf;
+            tampered[0] ^= 0xFF;
+            assert!(
+                !crate::merkle::verify_proof(tampered, &proof, root),
+                "tampered leaf {} must fail",
+                entry.leaf_index
+            );
+        }
+        // Anchors are retrievable per operator (case-insensitive).
+        let got = s.anchors_for("0xOP").await;
+        assert_eq!(got.len(), 1);
+        assert_eq!(got[0].tx_hash, "0xtx");
+    }
+
     #[test]
     fn op_kind_bitmap_lsb_is_op_kind_zero() {
         let hexmap = op_kind_bitmap_hex([0u8].into_iter());
diff --git a/docs/arch.md b/docs/arch.md
index b6685840..0456c4be 100644
--- a/docs/arch.md
+++ b/docs/arch.md
@@ -1142,8 +1142,10 @@ and never reordered**. Grouped by 10s leaves room for related ops.
 | `ConfigPut` | 80 | `{key: string, payload_hash: [u8;32]}` | config-service (#201, #229) |
 | `ConfigGet` | 81 | `{key: string, cap_hash: [u8;32]}` | config-service (#201, #229) |
 | `ConfigTeardown` | 82 | `{actor_target: [u8;32]}` | config-service (#201, #229) |
+| `AuditRootAnchor` | 90 | `{merkle_root: [u8;32], op_kind_bitmap: [u8;32], entry_count: u64, relay_address: [u8;20]}` | audit-service tier-A relay (#109) |
+| `AuditBatchFailed` | 91 | `{merkle_root: [u8;32], entry_count: u64, attempts: u8, last_error: string}` | audit-service tier-A relay (#109) |
 
-Byte ranges `3-9`, `13-19`, `22-29`, `32-39`, `42-49`, `53-59`, `62-69`, `71-79`, `83-89`, `90-255` are reserved for future extensions in the same family (config claimed `80-89` per #229).
+Byte ranges `3-9`, `13-19`, `22-29`, `32-39`, `42-49`, `53-59`, `62-69`, `71-79`, `83-89`, `92-99`, `100-255` are reserved for future extensions in the same family (config claimed `80-89` per #229; audit-service meta claimed `90-99` per #109).
 
 **Data-plane emit sites are LIVE (#229).** The cred / memory / config workers
 emit one envelope per store / fetch / teardown — after cap-verify, before the
@@ -1214,6 +1216,57 @@ per-service draft shape, so this is a pre-first-emit schema fix, not a
 break (invariant #7 forbids reusing/reordering *numbers*; it does not
 freeze a never-emitted body draft).
 
+**Two-tier audit is LIVE (#109): the real-time feed + the autonomous tier-A
+anchor.** The audit worker is the aggregation point for every emit site
+above, and serves both tiers from the same `AuditEnvelope` store:
+
+- **Tier 1 — off-chain real-time feed.** Every `append/v2` fans out
+  in-process to `GET /v1/audit/stream` (SSE; `?operator=`/`?actor=` filters
+  + `?backfill=N` ring-buffer replay) and lands in a per-actor ring buffer
+  (last 1000 events/actor). The shape has ONE owner —
+  [`agentkeys_types::audit_feed::AuditFeedEvent`](../crates/agentkeys-types/src/audit_feed.rs)
+  (#203 one-owner rule). The **daemon bridges** the stream (filtered to the
+  session operator) into its existing `ApiAuditEvent` web feed, deduping by
+  envelope hash against the locally-pushed submit events — so worker-side
+  ops (agent cred fetches, memory reads, denials) and anchor events appear
+  live in the parent UI with no new web socket. An `AGENTKEYS_AUDIT_S3_BUCKET`
+  cold archive (metadata + envelope CBOR only, never plaintext; bucket +
+  EC2-instance-role grant provisioned by
+  [`scripts/provision-audit-archive.sh`](../scripts/provision-audit-archive.sh))
+  restores the rings on worker restart and backs `GET /v1/audit/envelope/:hash`
+  across restarts.
+- **Tier 2 — autonomous on-chain anchor (default-on, 2-min cadence).**
+  Every `AGENTKEYS_AUDIT_BATCH_SECONDS` (default 120 — a PRODUCT decision,
+  don't relax without checking the demo storyboard) the worker drains each
+  operator's V2 queue, Merkle-roots the envelope hashes, wraps the root in
+  an `AuditRootAnchor` (90) envelope and commits THAT envelope's hash via
+  the ungated `CredentialAudit.appendV2(operatorOmni, relayActorOmni, 90,
+  envelopeHash)` — signed by the **tier-A relay EOA** (key generated on the
+  broker host by `setup-broker-host.sh`, 0600, never leaves the host;
+  funded idempotently by
+  [`scripts/heima-fund-audit-relay.sh`](../scripts/heima-fund-audit-relay.sh)
+  via `setup-heima.sh` step 14 reading `GET /v1/audit/relay-info`).
+  **Why not `appendRootV2`:** that gate requires the operator master, the
+  registry rejects EOA masters (`MasterMustBeAccount`), and a prod master
+  is a Touch-ID passkey that can't sign on a timer — while `appendV2` is
+  open-by-design with the REAL operator omni as an indexed topic, and the
+  honest anchor envelope (committed by hash) is exactly what the open-enum
+  §15.3b design exists for (zero contract change). Genuine anchors are
+  distinguished from third-party spam by `tx.from == relay_address`
+  (published at `relay-info`) — matching the tier-A trust row above ("only
+  shared service-relay-wallet" appears on chain). The master-gated
+  `appendRoot`/`appendRootV2` path REMAINS the sovereign tier-B/C route
+  (`heima-worker-smoke.sh` exercises it). Failed submissions retry ×3 with
+  exponential backoff; a persistently-failed batch is **re-queued** (the
+  next tick re-batches it under a fresh root) and an `AuditBatchFailed`
+  (91) envelope is emitted into the store + queue + feed with an ERROR log
+  (journald = the operator alert path). `GET /v1/audit/anchors/:operator`
+  returns recent anchors WITH per-entry Merkle proofs — the tamper check
+  (modify any served event → its recomputed leaf fails the proof against
+  the anchored root) and the parent UI's "Anchored ✓" badge both consume
+  it. With no relay key configured the worker boots in the pre-#109
+  degraded mode: flushes log the `appendRootV2` inputs and anchor nothing.
+
 #### Forward-compat / non-break design
 
 The trade-off when a new op_kind lands is **"uglier UI temporarily for old
diff --git a/docs/plan/issue-109-two-tier-audit.md b/docs/plan/issue-109-two-tier-audit.md
new file mode 100644
index 00000000..757e17fd
--- /dev/null
+++ b/docs/plan/issue-109-two-tier-audit.md
@@ -0,0 +1,71 @@
+# Issue #109 — Two-tier audit wiring (real-time off-chain feed + 2-min on-chain anchor)
+
+**Status:** in progress. Builds on #229 (data-plane emits + V2 queues) and #97/#270
+(control-plane emits + daemon/web audit receipts). Closes the #229-deferred open
+design item "audit-worker-initiated `appendRootV2` chain submission (tier-A relay
+wallet)".
+
+## Design decisions
+
+### Tier 2 anchor — `appendV2` + `AuditRootAnchor` op_kind, NOT `appendRootV2`
+
+`CredentialAudit.appendRootV2` is gated `msg.sender == registry.operatorMasterWallet(operatorOmni)`,
+and `SidecarRegistry.registerFirstMasterDevice` rejects EOA masters
+(`MasterMustBeAccount`) — so a hosted relay EOA can never pass that gate, and a
+prod master is a Touch-ID passkey that cannot sign on a 2-minute cadence. Three
+options considered:
+
+| Option | Verdict |
+|---|---|
+| Contract change (relay allowlist on `appendRootV2`) | Rejected — mainnet redeploy ceremony for something the open-enum envelope design already solves (§15.3b invariant #6: "new op_kinds need ZERO contract redeploys") |
+| Relay as software-P256Account master (CI-style) registered per-operator | Rejected — heavy ceremony, couples audit anchoring to the bundler's liveness (#241 worked to REMOVE that coupling) |
+| **Anchor via ungated `appendV2` with a new `AuditRootAnchor` op_kind (90)** | **Chosen** — zero contract change, real operator omni stays an indexed topic, plain funded EOA relay, one tx per operator-batch |
+
+The anchor is itself an honest `AuditEnvelope`: op_kind 90, body
+`{merkle_root, op_kind_bitmap, entry_count, relay_address}` over the batch's
+envelope-hash leaves. The chain event commits the anchor envelope's hash; the
+envelope commits the root; the leaves verify against the root (existing
+domain-separated Merkle). Genuine anchors are distinguished from third-party
+spam by `tx.from == relay_address` (published at `GET /v1/audit/relay-info`),
+matching arch.md §15.3 tier A verbatim: "only shared service-relay-wallet"
+appears on chain. The master-gated `appendRootV2` path REMAINS the sovereign
+tier-B/C route (`heima-worker-smoke.sh` unchanged).
+
+### Tier 1 feed — worker SSE, daemon bridges into the EXISTING UI feed
+
+The parent-control UI already consumes the daemon's `/v1/audit/stream` SSE +
+`/v1/anchor/status` (synthetic today). Rather than a second UI socket to the
+hosted worker, the daemon subscribes to the worker's new SSE (filtered to the
+session operator omni), maps envelopes into the existing `ApiAuditEvent` feed
+(dedup by envelope hash — the broker submit-relay events would otherwise appear
+twice), and flips `state.anchor` to REAL on anchor feed events.
+
+## Implementation order
+
+| # | Step | Files |
+|---|---|---|
+| 1 | core: op_kinds 90 `AuditRootAnchor` + 91 `AuditBatchFailed` (ritual §15.3b: variants, bodies, typed arms, roundtrip tests, vectors) | `crates/agentkeys-core/src/audit/{op_kind,bodies,mod}.rs` |
+| 2 | core: move `legacy_tx.rs` bundler → core (bundler re-imports) | `crates/agentkeys-core/src/legacy_tx.rs`, `crates/agentkeys-bundler/` |
+| 3 | worker-audit: anchor module — relay key load, raw JSON-RPC (Heima-safe), `appendV2` calldata, retry ×3 exp backoff, re-queue + `AuditBatchFailed` on persistent failure; `AGENTKEYS_AUDIT_BATCH_SECONDS` (default 120, legacy var honored); degraded log-only mode when unconfigured; `GET /v1/audit/relay-info` | `crates/agentkeys-worker-audit/src/{anchor,main,state}.rs` |
+| 4 | worker-audit: Tier-1 feed — per-actor ring buffer (1000), broadcast, `GET /v1/audit/stream` SSE (operator/actor filter + backfill), anchors history `GET /v1/audit/anchors` with per-entry Merkle proofs | `crates/agentkeys-worker-audit/src/{state,handlers,lib,main}.rs` |
+| 5 | worker-audit: S3 cold archive (env-gated bucket+prefix) — async PUT of feed events + envelope CBOR, boot-time ring restore, `get_envelope` S3 fallback | `crates/agentkeys-worker-audit/src/archive.rs` |
+| 6 | daemon: worker-feed bridge → `ApiAuditEvent` push (dedup) + real `state.anchor`; reconnect w/ backoff; hermetic-test seam | `crates/agentkeys-daemon/src/ui_bridge.rs` |
+| 7 | deploy: setup-broker-host.sh — relay-key gen (skip-if-exists), worker-audit env block (batch seconds, chain RPC/profile, relay key, S3 bucket/prefix), nginx SSE-friendly location for `/v1/audit/stream` | `scripts/setup-broker-host.sh` |
+| 8 | chain: `scripts/heima-fund-audit-relay.sh` (idempotent fund from deploy wallet via relay-info) wired into `setup-heima.sh` | `scripts/` |
+| 9 | harness: stage-3 audit-feed + anchor + tamper-evidence assertions; runbook + harness/CLAUDE.md sync | `harness/v2-stage3-demo.sh`, `docs/operator-runbook-harness.md`, `harness/CLAUDE.md` |
+| 10 | docs: arch.md §15.3/§15.3a (tier-A hosted anchor semantics, new rows, Tier-1 feed surface), user-manual.md (live feed + anchored badge) | `docs/` |
+
+## Acceptance criteria mapping (issue #109)
+
+- Denial/revocation in parent UI ≤200ms → steps 4+6 (worker broadcast → daemon SSE; in-process fan-out, no polling)
+- On-chain anchor ≤2min → step 3 (default 120s cadence)
+- `AGENTKEYS_AUDIT_BATCH_SECONDS` default 120 → step 3
+- Restart recovery of last 1000/actor from S3 → step 5
+- Tamper test → step 9 (modify fetched event → recomputed leaf fails Merkle proof from `/v1/audit/anchors`)
+- Retry + `audit.batch_failed` + alert → step 3 (op_kind 91 envelope + ERROR log; the envelope itself is queued so it anchors once the chain recovers)
+
+## Out of scope (unchanged from issue)
+
+Real-time on-chain audit; audit replay UX (M4); cross-actor regulator views;
+per-vendor retention. Also out: explorer renderers for op_kinds 90/91
+(subscan-essentials#12 — `Unknown(byte)` fallback per invariant #4).
diff --git a/docs/user-manual.md b/docs/user-manual.md
index cb2067ea..d9812537 100644
--- a/docs/user-manual.md
+++ b/docs/user-manual.md
@@ -246,3 +246,38 @@ banner instead of failing.
 Scope grants are **set-replace**: the envelope's `service_ids` list is the
 FULL replacement grant (an empty set is the revoke-all), so compare two
 consecutive grant envelopes to see what changed.
+
+## Live audit feed + on-chain anchor badge (#109)
+
+The **audit** page is now a **live feed**, not just a log of your own
+clicks: the daemon subscribes to the hosted audit worker's event stream
+(filtered to your operator identity) and folds in **worker-side events you
+never triggered from this app** — an agent fetching a credential in its
+sandbox, memory reads/writes, and **denials** (`NOT PERMITTED` rows, shown
+red). Events typically appear within a fraction of a second of the
+operation. The same event never shows twice: rows you triggered locally and
+the worker's copy are deduplicated by their envelope-hash receipt.
+
+Two feed-only row types come from the audit service itself (chip `anchor`):
+
+- **`audit.root_anchor`** — every ~2 minutes the audit service batches your
+  recent events into a Merkle tree and commits the batch on-chain. The row
+  carries the batch's root and the **transaction hash** (click through to
+  the explorer). This is what the **"Anchored ✓ HH:MM"** badge reflects —
+  it is REAL chain state, updated from these events. Expect up to a
+  2-minute lag between an event appearing in the feed (real-time) and its
+  batch anchoring (the deliberate batching cadence); the feed row is your
+  instant view, the anchor is the tamper-evidence.
+- **`audit.batch_failed`** — the service could not land an anchor after
+  retries (chain outage, relay out of gas). **No events are lost**: the
+  batch re-queues and anchors with the next successful tick; the failure
+  itself is recorded (and later anchored too). If these persist, the
+  operator funds the relay: `bash scripts/heima-fund-audit-relay.sh`.
+
+To verify tamper-evidence yourself:
+`curl https://audit.litentry.org/v1/audit/anchors/<operator_omni>` returns
+recent anchored batches with a Merkle proof per event — recompute any
+served envelope's hash and check it against the anchored root; a modified
+event fails its proof. If the daemon was started without
+`--audit-worker-url` (no-infra dev), the feed shows only local events and
+the anchor badge stays at its placeholder — nothing errors.
diff --git a/scripts/heima-fund-audit-relay.sh b/scripts/heima-fund-audit-relay.sh
new file mode 100755
index 00000000..22bffef5
--- /dev/null
+++ b/scripts/heima-fund-audit-relay.sh
@@ -0,0 +1,87 @@
+#!/usr/bin/env bash
+# scripts/heima-fund-audit-relay.sh — fund the audit worker's tier-A anchor
+# relay EOA from the deploy wallet (issue #109).
+#
+# The relay key is generated ON the broker host by setup-broker-host.sh
+# (/etc/agentkeys/audit-relay.key, 0600, never leaves the host). This
+# laptop-side helper discovers its PUBLIC address via the worker's
+# `GET /v1/audit/relay-info` and tops it up so the 2-minute `appendV2`
+# anchor txs have gas. Anchor txs are emit-only (~50-100k gas), so the
+# default 0.5 HEI covers thousands of batches.
+#
+# DELIBERATELY operator-run + idempotent — funding delegates to
+# scripts/heima-fund-account.sh (skips when balance >= --amount-hei).
+# Folded into setup-heima.sh step 14 (the tier-A smoke step); callable
+# directly for surgical re-runs.
+#
+# Tolerated prereq gaps (exit 0 + "skipped" JSON, so setup-heima.sh stays
+# green before the broker host is deployed):
+#   - audit worker unreachable                      → relay-worker-unreachable
+#   - worker runs in degraded mode (no relay key)   → relay-not-configured
+#
+# Usage:
+#   bash scripts/heima-fund-audit-relay.sh [--amount-hei 0.5] [--relay-addr 0x..] [--dry-run]
+#
+# Env:
+#   AGENTKEYS_WORKER_AUDIT_URL   (from operator-workstation.env; the relay-info source)
+#   AUDIT_RELAY_FUND_HEI         (default 0.5 — override the top-up threshold)
+#   + the deployer-key resolution heima-fund-account.sh documents.
+
+set -euo pipefail
+
+AMOUNT_HEI="${AUDIT_RELAY_FUND_HEI:-0.5}"
+RELAY_ADDR=""
+DRY_RUN=0
+
+while [ $# -gt 0 ]; do
+  case "$1" in
+    --amount-hei)   [ $# -lt 2 ] && { echo "--amount-hei requires a value" >&2; exit 1; }; AMOUNT_HEI="$2"; shift 2 ;;
+    --amount-hei=*) AMOUNT_HEI="${1#*=}"; shift ;;
+    --relay-addr)   [ $# -lt 2 ] && { echo "--relay-addr requires a value" >&2; exit 1; }; RELAY_ADDR="$2"; shift 2 ;;
+    --relay-addr=*) RELAY_ADDR="${1#*=}"; shift ;;
+    --dry-run)      DRY_RUN=1; shift ;;
+    --help|-h)
+      sed -n '2,/^set -euo/p' "$0" | sed 's/^# \{0,1\}//' | sed '$d'; exit 0 ;;
+    *) echo "unknown flag: $1 (try --help)" >&2; exit 1 ;;
+  esac
+done
+
+if [ -t 2 ]; then
+  C_HEAD='\033[1;36m'; C_OK='\033[1;32m'; C_SKIP='\033[1;33m'; C_ERR='\033[1;31m'; C_RESET='\033[0m'
+else
+  C_HEAD=''; C_OK=''; C_SKIP=''; C_ERR=''; C_RESET=''
+fi
+log()  { printf "${C_HEAD}==>${C_RESET} %s\n" "$*" >&2; }
+ok()   { printf "    ${C_OK}ok${C_RESET}   %s\n" "$*" >&2; }
+skip() { printf "    ${C_SKIP}skip${C_RESET} %s\n" "$*" >&2; }
+die()  { printf "    ${C_ERR}fail${C_RESET} %s\n" "$*" >&2; exit 1; }
+
+REPO_ROOT="$(cd "$(dirname "$0")/.." && pwd)"
+ENV_FILE="${ENV_FILE:-$REPO_ROOT/scripts/operator-workstation.env}"
+[ -f "$ENV_FILE" ] || die "missing $ENV_FILE"
+set -a; . "$ENV_FILE"; set +a
+
+if [ -z "$RELAY_ADDR" ]; then
+  AUDIT_URL="${AGENTKEYS_WORKER_AUDIT_URL:?AGENTKEYS_WORKER_AUDIT_URL required (or pass --relay-addr)}"
+  log "Discovering relay address: $AUDIT_URL/v1/audit/relay-info"
+  info_json=$(curl -fsS --max-time 10 "$AUDIT_URL/v1/audit/relay-info" 2>/dev/null) || {
+    skip "audit worker unreachable at $AUDIT_URL — deploy the broker host first (setup-broker-host.sh), then re-run"
+    printf '{"ok":true,"skipped":"relay-worker-unreachable"}\n'
+    exit 0
+  }
+  enabled=$(printf '%s' "$info_json" | jq -r '.anchor_enabled // false')
+  if [ "$enabled" != "true" ]; then
+    skip "worker reports anchor_enabled=false (no relay key on host) — re-run setup-broker-host.sh, then re-run"
+    printf '{"ok":true,"skipped":"relay-not-configured"}\n'
+    exit 0
+  fi
+  RELAY_ADDR=$(printf '%s' "$info_json" | jq -r '.relay_address // empty')
+  [ -n "$RELAY_ADDR" ] || die "relay-info carried no relay_address: $info_json"
+  balance=$(printf '%s' "$info_json" | jq -r '.balance_wei // "unknown"')
+  ok "relay address $RELAY_ADDR (balance ${balance} wei)"
+fi
+
+log "Funding relay $RELAY_ADDR to >= $AMOUNT_HEI HEI (idempotent)"
+fund_args=(--to "$RELAY_ADDR" --amount-hei "$AMOUNT_HEI")
+[ "$DRY_RUN" = "1" ] && fund_args+=(--dry-run)
+bash "$REPO_ROOT/scripts/heima-fund-account.sh" "${fund_args[@]}"
diff --git a/scripts/operator-workstation.env b/scripts/operator-workstation.env
index 971cd6df..dcf83dec 100644
--- a/scripts/operator-workstation.env
+++ b/scripts/operator-workstation.env
@@ -105,6 +105,11 @@ MEMORY_BUCKET=agentkeys-memory-${ACCOUNT_ID}
 # Provisioned by scripts/provision-config-bucket.sh.
 CONFIG_BUCKET=agentkeys-config-${ACCOUNT_ID}
 
+# #109 audit cold archive — Tier-1 feed events + envelope CBOR (metadata only,
+# never plaintext). Written by the audit worker via the broker host's EC2
+# instance role; provisioned by scripts/provision-audit-archive.sh.
+AUDIT_BUCKET=agentkeys-audit-${ACCOUNT_ID}
+
 # ─── Signer (dev_key_service, issue #74 step 1b) ─────────────────────────────
 # The dedicated signer listener (`agentkeys-signer.service`, :8092 loopback)
 # is fronted publicly by nginx at a separate hostname under the same parent
diff --git a/scripts/operator-workstation.test.env b/scripts/operator-workstation.test.env
index 4674d431..a1d13815 100644
--- a/scripts/operator-workstation.test.env
+++ b/scripts/operator-workstation.test.env
@@ -55,6 +55,7 @@ CONFIG_ROLE_ARN=arn:aws:iam::${ACCOUNT_ID}:role/agentkeys-config-role-test
 # Test per-data-class buckets.
 VAULT_BUCKET=agentkeys-vault-test-${ACCOUNT_ID}
 MEMORY_BUCKET=agentkeys-memory-test-${ACCOUNT_ID}
+AUDIT_BUCKET=agentkeys-audit-test-${ACCOUNT_ID}
 # Test config bucket (#201) — distinct bucket so a config-worker compromise on
 # test can't read prod memory/cred blobs (and vice-versa).
 CONFIG_BUCKET=agentkeys-config-test-${ACCOUNT_ID}
diff --git a/scripts/provision-audit-archive.sh b/scripts/provision-audit-archive.sh
new file mode 100755
index 00000000..1c2a6f53
--- /dev/null
+++ b/scripts/provision-audit-archive.sh
@@ -0,0 +1,183 @@
+#!/usr/bin/env bash
+# scripts/provision-audit-archive.sh — idempotent creation of the audit
+# cold-archive bucket ($AUDIT_BUCKET) + the broker EC2 instance-role grant
+# that lets the co-located audit worker write/read it (issue #109).
+#
+# Mirror of scripts/provision-config-bucket.sh for the bucket half. The
+# archive holds Tier-1 feed events + canonical envelope CBOR — envelope
+# HASHES and op metadata only, never plaintext payloads (#229 rule) — but
+# per arch.md §17.2 it still gets its OWN bucket: folding it into vault/
+# memory/config would collapse the per-data-class blast radius.
+#
+# Unlike the OIDC-assumed per-actor roles (provision-*-role.sh), the
+# archive's writer is the audit WORKER itself via the broker host's EC2
+# instance profile (same credential path the email worker uses) — so the
+# grant is an inline policy on the instance role, not a federated role.
+# The instance role is discovered from the env-keyed broker EIP tag
+# (agentkeys-broker-eip[-test]) per the CLAUDE.md prod-vs-test rule.
+#
+# What it does (each step idempotent via "check first, then act"):
+#   1. head-bucket — if 200, skip create.
+#   2. create-bucket if missing (LocationConstraint only for non-us-east-1).
+#   3. put-public-access-block (idempotent overwrite).
+#   4. put-bucket-encryption with SSE-S3 AES-256 default.
+#   5. resolve broker EIP (by Name tag) → instance → instance-profile role.
+#   6. put-role-policy AuditArchiveS3 (skip when the doc already matches).
+#
+# Required env (sourced from scripts/operator-workstation.env):
+#   ACCOUNT_ID, REGION, AUDIT_BUCKET
+#
+# Required AWS profile: agentkeys-admin
+#
+# Usage:
+#   bash scripts/provision-audit-archive.sh
+#   bash scripts/provision-audit-archive.sh --dry-run
+#   ENV_FILE=scripts/operator-workstation.test.env bash scripts/provision-audit-archive.sh
+
+set -euo pipefail
+
+DRY_RUN=0
+while [ $# -gt 0 ]; do
+  case "$1" in
+    --dry-run) DRY_RUN=1; shift ;;
+    --help|-h)
+      sed -n '2,/^set -euo/p' "$0" | sed 's/^# \{0,1\}//' | sed '$d'; exit 0 ;;
+    *) echo "unknown flag: $1 (try --help)" >&2; exit 1 ;;
+  esac
+done
+
+REPO_ROOT="$(cd "$(dirname "$0")/.." && pwd)"
+ENV_FILE="${ENV_FILE:-$REPO_ROOT/scripts/operator-workstation.env}"
+
+if [ -t 2 ]; then
+  C_HEAD='\033[1;36m'; C_OK='\033[1;32m'; C_SKIP='\033[1;33m'
+  C_WARN='\033[1;33m'; C_ERR='\033[1;31m'; C_RESET='\033[0m'
+else
+  C_HEAD=''; C_OK=''; C_SKIP=''; C_WARN=''; C_ERR=''; C_RESET=''
+fi
+log()  { printf "${C_HEAD}==>${C_RESET} %s\n" "$*" >&2; }
+ok()   { printf "    ${C_OK}ok${C_RESET}   %s\n" "$*" >&2; }
+skip() { printf "    ${C_SKIP}skip${C_RESET} %s\n" "$*" >&2; }
+warn() { printf "    ${C_WARN}warn${C_RESET} %s\n" "$*" >&2; }
+die()  { printf "    ${C_ERR}fail${C_RESET} %s\n" "$*" >&2; exit 1; }
+
+[ -f "$ENV_FILE" ] || die "missing $ENV_FILE"
+set -a; . "$ENV_FILE"; set +a
+
+ACCOUNT_ID="${ACCOUNT_ID:?ACCOUNT_ID required}"
+REGION="${REGION:?REGION required}"
+AUDIT_BUCKET="${AUDIT_BUCKET:?AUDIT_BUCKET required — add it to operator-workstation.env}"
+
+# prod vs CI/test broker EIP selection (CLAUDE.md: keyed on the env file).
+EIP_TAG="agentkeys-broker-eip"
+case "$(basename "$ENV_FILE")" in
+  *test*) EIP_TAG="agentkeys-broker-eip-test" ;;
+esac
+
+log "Preflight: AWS caller identity"
+caller_arn=$(aws sts get-caller-identity --query Arn --output text 2>&1) \
+  || die "aws sts get-caller-identity failed: $caller_arn"
+arn_lc=$(printf '%s' "$caller_arn" | tr '[:upper:]' '[:lower:]')
+case "$arn_lc" in
+  *":user/agentkeys-admin"*) ok "caller is admin: $caller_arn" ;;
+  *) die "caller is $caller_arn — needs agentkeys-admin. Run: awsp agentkeys-admin" ;;
+esac
+
+# Step 1+2: bucket existence
+log "Bucket existence: s3://$AUDIT_BUCKET"
+if aws s3api head-bucket --bucket "$AUDIT_BUCKET" --region "$REGION" >/dev/null 2>&1; then
+  skip "bucket already exists"
+else
+  if [ "$DRY_RUN" = "1" ]; then
+    log "DRY RUN — would create-bucket $AUDIT_BUCKET in $REGION"
+  else
+    log "Creating bucket"
+    if [ "$REGION" = "us-east-1" ]; then
+      aws s3api create-bucket --bucket "$AUDIT_BUCKET" --region "$REGION" \
+        || die "create-bucket failed"
+    else
+      aws s3api create-bucket --bucket "$AUDIT_BUCKET" --region "$REGION" \
+        --create-bucket-configuration "LocationConstraint=$REGION" \
+        || die "create-bucket failed"
+    fi
+    ok "bucket created"
+  fi
+fi
+
+# Step 3: block public access
+log "Public access block"
+pab_target=$(jq -n '{
+  BlockPublicAcls: true, IgnorePublicAcls: true,
+  BlockPublicPolicy: true, RestrictPublicBuckets: true
+}')
+if [ "$DRY_RUN" = "1" ]; then
+  log "DRY RUN — would put-public-access-block: $pab_target"
+else
+  aws s3api put-public-access-block --bucket "$AUDIT_BUCKET" --region "$REGION" \
+    --public-access-block-configuration "$pab_target" \
+    || die "put-public-access-block failed"
+  ok "block-public-access applied (all four flags = true)"
+fi
+
+# Step 4: default encryption SSE-S3
+log "Default encryption (SSE-S3 AES-256)"
+enc_target=$(jq -n '{
+  Rules: [ { ApplyServerSideEncryptionByDefault: { SSEAlgorithm: "AES256" } } ]
+}')
+if [ "$DRY_RUN" = "1" ]; then
+  log "DRY RUN — would put-bucket-encryption: $enc_target"
+else
+  aws s3api put-bucket-encryption --bucket "$AUDIT_BUCKET" --region "$REGION" \
+    --server-side-encryption-configuration "$enc_target" \
+    || die "put-bucket-encryption failed"
+  ok "default SSE-S3 applied (feed events are metadata-only; this is belt-and-braces)"
+fi
+
+# Step 5: resolve the broker host's instance role (env-keyed EIP tag — never
+# a first-match describe-addresses, per the #201 incident rule).
+log "Resolving broker instance role via EIP tag $EIP_TAG"
+EIP=$(aws ec2 describe-addresses --region "$REGION" \
+  --filters "Name=tag:Name,Values=$EIP_TAG" \
+  --query 'Addresses[0].PublicIp' --output text 2>/dev/null || true)
+[ -n "$EIP" ] && [ "$EIP" != "None" ] || die "no EIP with tag $EIP_TAG in $REGION"
+INSTANCE_ID=$(aws ec2 describe-instances --region "$REGION" \
+  --filters "Name=ip-address,Values=$EIP" \
+  --query 'Reservations[0].Instances[0].InstanceId' --output text 2>/dev/null || true)
+[ -n "$INSTANCE_ID" ] && [ "$INSTANCE_ID" != "None" ] || die "no instance behind EIP $EIP"
+PROFILE_ARN=$(aws ec2 describe-instances --region "$REGION" --instance-ids "$INSTANCE_ID" \
+  --query 'Reservations[0].Instances[0].IamInstanceProfile.Arn' --output text 2>/dev/null || true)
+[ -n "$PROFILE_ARN" ] && [ "$PROFILE_ARN" != "None" ] \
+  || die "instance $INSTANCE_ID has no instance profile (docs/cloud-bootstrap.md §6)"
+ROLE_NAME=$(aws iam get-instance-profile --instance-profile-name "${PROFILE_ARN##*/}" \
+  --query 'InstanceProfile.Roles[0].RoleName' --output text 2>/dev/null || true)
+[ -n "$ROLE_NAME" ] && [ "$ROLE_NAME" != "None" ] || die "instance profile ${PROFILE_ARN##*/} has no role"
+ok "instance role: $ROLE_NAME (instance $INSTANCE_ID @ $EIP)"
+
+# Step 6: inline role policy (skip when the live doc already matches).
+POLICY_NAME="AuditArchiveS3"
+policy_target=$(jq -n --arg bucket "$AUDIT_BUCKET" '{
+  Version: "2012-10-17",
+  Statement: [
+    {Sid: "AuditArchiveObjects", Effect: "Allow",
+     Action: ["s3:PutObject", "s3:GetObject"],
+     Resource: "arn:aws:s3:::\($bucket)/*"},
+    {Sid: "AuditArchiveList", Effect: "Allow",
+     Action: ["s3:ListBucket"],
+     Resource: "arn:aws:s3:::\($bucket)"}
+  ]
+}')
+log "Instance-role inline policy $POLICY_NAME"
+current=$(aws iam get-role-policy --role-name "$ROLE_NAME" --policy-name "$POLICY_NAME" \
+  --query 'PolicyDocument' --output json 2>/dev/null || echo "{}")
+if [ "$(echo "$current" | jq -S .)" = "$(echo "$policy_target" | jq -S .)" ]; then
+  skip "policy already matches"
+elif [ "$DRY_RUN" = "1" ]; then
+  log "DRY RUN — would put-role-policy $POLICY_NAME on $ROLE_NAME: $policy_target"
+else
+  aws iam put-role-policy --role-name "$ROLE_NAME" --policy-name "$POLICY_NAME" \
+    --policy-document "$policy_target" \
+    || die "put-role-policy failed"
+  ok "policy applied"
+fi
+
+ok "audit archive provisioning complete: s3://$AUDIT_BUCKET (writer: $ROLE_NAME)"
diff --git a/scripts/setup-broker-host.sh b/scripts/setup-broker-host.sh
index e4c0efd6..fedb8999 100755
--- a/scripts/setup-broker-host.sh
+++ b/scripts/setup-broker-host.sh
@@ -63,6 +63,7 @@ CHAIN_RPC=""
 VAULT_BUCKET=""
 MEMORY_BUCKET=""
 CONFIG_BUCKET=""
+AUDIT_BUCKET=""              # #109 audit cold-archive bucket (provision-audit-archive.sh)
 SCOPE_ADDR=""
 REGISTRY_ADDR=""
 K3_COUNTER_ADDR=""
@@ -113,6 +114,7 @@ while (( $# > 0 )); do
     --vault-bucket)       VAULT_BUCKET="$2"; shift 2 ;;
     --memory-bucket)      MEMORY_BUCKET="$2"; shift 2 ;;
     --config-bucket)      CONFIG_BUCKET="$2"; shift 2 ;;
+    --audit-bucket)       AUDIT_BUCKET="$2"; shift 2 ;;
     --scope-addr)         SCOPE_ADDR="$2"; shift 2 ;;
     --registry-addr)      REGISTRY_ADDR="$2"; shift 2 ;;
     --k3-counter-addr)    K3_COUNTER_ADDR="$2"; shift 2 ;;
@@ -302,6 +304,9 @@ fi
 if [[ -z "$CONFIG_BUCKET" ]]; then
   CONFIG_BUCKET="$(read_envfile_var /etc/agentkeys/worker-config.env CONFIG_BUCKET)"
 fi
+if [[ -z "$AUDIT_BUCKET" ]]; then
+  AUDIT_BUCKET="$(read_envfile_var /etc/agentkeys/worker-audit.env AGENTKEYS_AUDIT_S3_BUCKET)"
+fi
 # Contract addresses (SCOPE/REGISTRY/K3) are NOT read from the host worker env
 # here — unlike buckets/RPC (operator overrides that should stick across re-runs),
 # contract addresses are DEPLOY OUTPUTS that change on every redeploy. Reading the
@@ -484,6 +489,7 @@ if [[ -z "$CLASSIFY_HOST" ]]; then CLASSIFY_HOST="$(derive_companion classify)";
 [[ -z "$VAULT_BUCKET" ]]    && VAULT_BUCKET="agentkeys-vault${SUFFIX}-${ACCOUNT_ID}"
 [[ -z "$MEMORY_BUCKET" ]]   && MEMORY_BUCKET="agentkeys-memory${SUFFIX}-${ACCOUNT_ID}"
 [[ -z "$CONFIG_BUCKET" ]]   && CONFIG_BUCKET="agentkeys-config${SUFFIX}-${ACCOUNT_ID}"
+[[ -z "$AUDIT_BUCKET" ]]    && AUDIT_BUCKET="agentkeys-audit${SUFFIX}-${ACCOUNT_ID}"
 # Test mode flips the email-from default to the -test subdomain too
 # (operator can still override via --email-from).
 if [[ "$TEST_MODE" == "true" ]] && [[ "$BROKER_EMAIL_FROM_ADDRESS" == "noreply-test@bots.litentry.org" ]]; then
@@ -514,6 +520,7 @@ unset _env_file_to_source
 [[ -z "$SCOPE_ADDR" ]]      && SCOPE_ADDR="${SCOPE_CONTRACT_ADDRESS_HEIMA:-}"
 [[ -z "$REGISTRY_ADDR" ]]   && REGISTRY_ADDR="${SIDECAR_REGISTRY_ADDRESS_HEIMA:-}"
 [[ -z "$K3_COUNTER_ADDR" ]] && K3_COUNTER_ADDR="${K3_EPOCH_COUNTER_ADDRESS_HEIMA:-}"
+AUDIT_CONTRACT_ADDR="${CREDENTIAL_AUDIT_ADDRESS_HEIMA:-}"   # #109 anchor target (env-aware: test stack has its own)
 # Last-resort fallback to the host's worker env — ONLY when neither a CLI flag nor
 # operator-workstation.env supplied the address (e.g. a host without a sourced
 # env file). A redeploy's fresh operator-workstation.env addresses always win over
@@ -1153,18 +1160,45 @@ WORKER_CONFIG_ENV_FILE=$DEV_KEY_SERVICE_ENV_DIR/worker-config.env
 WORKER_CLASSIFY_ENV_FILE=$DEV_KEY_SERVICE_ENV_DIR/worker-classify.env
 
 if [[ "$WITH_WORKERS" == "yes" ]]; then
-  # audit + email: no secrets. Mode 0644 is fine; the values are public
-  # config (bucket name, leaves dir). Rewrite on every run so bucket /
-  # region overrides via --vault-bucket / --region take effect.
+  # audit + email env FILES: no secrets (the audit relay key is a separate
+  # 0600 file). Mode 0644 is fine; the values are public config. Rewrite on
+  # every run so bucket / region overrides take effect.
+  # #109 two-tier audit: the relay PRIVATE key lives in a separate 0600 file
+  # (generated below, preserved across re-runs); the env file itself stays
+  # secret-free. Empty RPC/contract values fall back to the compiled-in chain
+  # profile inside the worker.
   log "Writing $WORKER_AUDIT_ENV_FILE"
   sudo tee "$WORKER_AUDIT_ENV_FILE" >/dev/null <<EOF
 AGENTKEYS_WORKER_AUDIT_BIND=127.0.0.1:9092
 AGENTKEYS_WORKER_AUDIT_LEAVES_DIR=/var/lib/agentkeys/audit-leaves
-AGENTKEYS_WORKER_AUDIT_FLUSH_INTERVAL_SECS=300
+# 2-min anchor cadence is a PRODUCT decision (issue #109) — don't relax it
+# without checking the demo storyboard.
+AGENTKEYS_AUDIT_BATCH_SECONDS=120
+AGENTKEYS_CHAIN=heima
+AGENTKEYS_AUDIT_RPC_URL=$CHAIN_RPC
+AGENTKEYS_AUDIT_CREDENTIAL_AUDIT_ADDRESS=$AUDIT_CONTRACT_ADDR
+AGENTKEYS_AUDIT_RELAY_KEY_FILE=/etc/agentkeys/audit-relay.key
+# Tier-1 cold archive (#109): bucket + instance-role grant provisioned by
+# scripts/provision-audit-archive.sh (setup-cloud.sh step 13).
+AGENTKEYS_AUDIT_S3_BUCKET=$AUDIT_BUCKET
+AGENTKEYS_AUDIT_S3_PREFIX=audit/
+AWS_REGION=$REGION
 EOF
   sudo chmod 0644 "$WORKER_AUDIT_ENV_FILE"
   sudo install -d -m 0750 -o agentkeys -g agentkeys /var/lib/agentkeys/audit-leaves
 
+  # Tier-A anchor relay key (#109) — generate once, NEVER overwrite (the
+  # funded on-chain relay account is derived from it; rotating means
+  # re-funding via scripts/heima-fund-audit-relay.sh).
+  if sudo test -f /etc/agentkeys/audit-relay.key; then
+    log "Preserving existing audit relay key (rotation would orphan the funded relay account)"
+  else
+    log "Generating tier-A audit relay key (first-time)"
+    openssl rand -hex 32 | sudo tee /etc/agentkeys/audit-relay.key >/dev/null
+  fi
+  sudo chown agentkeys:agentkeys /etc/agentkeys/audit-relay.key
+  sudo chmod 0600 /etc/agentkeys/audit-relay.key
+
   log "Writing $WORKER_EMAIL_ENV_FILE"
   sudo tee "$WORKER_EMAIL_ENV_FILE" >/dev/null <<EOF
 AGENTKEYS_WORKER_EMAIL_BIND=127.0.0.1:9093
@@ -1839,6 +1873,25 @@ write_worker_nginx_site() {
   local slug="$1" host="$2" port="$3"
   local cert_path="/etc/letsencrypt/live/$host/fullchain.pem"
   local sitefile="/etc/nginx/sites-available/agentkeys-worker-$slug"
+  # #109: the audit worker serves a long-lived SSE stream — nginx must not
+  # buffer it (proxy_buffering would hold events until the buffer fills,
+  # destroying the ~100ms event-to-UI latency) and must not cut it at the
+  # default read timeout.
+  local sse_locations=""
+  if [[ "$slug" == "audit" ]]; then
+    sse_locations="
+    location /v1/audit/stream {
+        proxy_pass http://127.0.0.1:$port;
+        proxy_http_version 1.1;
+        proxy_set_header Host              \$host;
+        proxy_set_header X-Forwarded-Proto \$scheme;
+        proxy_set_header X-Forwarded-For   \$remote_addr;
+        proxy_buffering off;
+        proxy_cache off;
+        proxy_read_timeout 1h;
+    }
+"
+  fi
   if sudo test -f "$cert_path"; then
     log "Writing nginx site for $host (HTTPS — LE cert detected) → :$port"
     sudo tee "$sitefile" >/dev/null <<EOF
@@ -1856,7 +1909,7 @@ server {
     ssl_certificate     /etc/letsencrypt/live/$host/fullchain.pem;
     ssl_certificate_key /etc/letsencrypt/live/$host/privkey.pem;
     ssl_protocols TLSv1.2 TLSv1.3;
-
+$sse_locations
     location / {
         proxy_pass http://127.0.0.1:$port;
         proxy_http_version 1.1;
diff --git a/scripts/setup-cloud.sh b/scripts/setup-cloud.sh
index 8fd6ed31..137cbcf9 100755
--- a/scripts/setup-cloud.sh
+++ b/scripts/setup-cloud.sh
@@ -659,7 +659,7 @@ do_step_12() {
 do_step_13() {
   CUR_STEP=13; step "Per-data-class buckets + roles (delegates to provision-*.sh)"
   if [ "$DRY_RUN" = "1" ]; then
-    warn "DRY: would run provision-{vault,memory,config}-{bucket,role}.sh + apply-{vault,memory,config}-bucket-policy.sh"
+    warn "DRY: would run provision-{vault,memory,config}-{bucket,role}.sh + provision-audit-archive.sh + apply-{vault,memory,config}-bucket-policy.sh"
     return
   fi
   # Pass ENV_FILE through so --ci/--test provisions the -test buckets/roles. Each
@@ -671,6 +671,7 @@ do_step_13() {
   for provisioner in provision-vault-bucket provision-vault-role \
                      provision-memory-bucket provision-memory-role \
                      provision-config-bucket provision-config-role \
+                     provision-audit-archive \
                      apply-vault-bucket-policy apply-memory-bucket-policy apply-config-bucket-policy; do
     ENV_FILE="$ENV_FILE" bash "$SCRIPT_DIR/$provisioner.sh"
   done
diff --git a/scripts/setup-heima.sh b/scripts/setup-heima.sh
index e8cde16e..89869938 100755
--- a/scripts/setup-heima.sh
+++ b/scripts/setup-heima.sh
@@ -376,6 +376,10 @@ do_step_13() {
 
 do_step_14() {
   CUR_STEP=14; step "Tier-A audit relay + worker /healthz smoke (intentionally append-only)"
+  # #109: top up the audit worker's anchor-relay EOA from the deploy wallet
+  # (idempotent skip-if-funded; tolerated skip while the broker host / relay
+  # key isn't deployed yet — the helper exits 0 with a "skipped" JSON).
+  bash "$SCRIPT_DIR/heima-fund-audit-relay.sh" || warn "audit-relay funding failed (anchors will batch_failed until funded)"
   local smoke_args=()
   [ -f "$HOME/.agentkeys/agents/${AGENT_LABEL}.json" ] && smoke_args+=(--actor "$AGENT_LABEL")
   bash "$SCRIPT_DIR/heima-worker-smoke.sh" "${smoke_args[@]}"

From b842e2e818e943703c05cd9cc290cc8ad71740c5 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 21:59:03 +0800
Subject: [PATCH 11/17] feat: #109 anchor anti-spam gate + async HTTP-flush
 anchoring + smoke feed/anchor/tamper legs + UI chips
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

- Anti-spam gate: anchors submit only for operators with a registered
  on-chain master (eth_call operatorMasterWallet, TTL-cached 10min/60s);
  unregistered batches drop with a WARN (envelopes stay fetchable by
  hash), transient RPC failures re-queue. Without this the open append/v2
  endpoint lets fake operator omnis each burn one relay tx per tick.
- HTTP flush handlers SPAWN anchoring (response never waits out a chain
  confirmation — existing --max-time 10 callers keep working); response
  carries anchor_scheduled; consumers poll /v1/audit/anchors. The timer
  path still awaits anchors inline.
- heima-worker-smoke.sh: #109 legs — SSE backfill must carry the appended
  envelope; idempotent relay top-up before flush; poll the anchor record
  (<=90s), cast-receipt confirm the appendV2 tx, walk the Merkle proof in
  bash (genuine verifies, tampered leaf FAILS — the #109 tamper check).
  Tolerated skips: relay-not-configured / anchor-not-recorded.
- parent-control: 'worker' + 'anchor' ChipKinds (styles + filter row).
- operator-runbook-harness.md: smoke two-tier wiring documented.
- service/anchor tests: registered-operator end-to-end anchor (fake RPC),
  spam-omni drop, RPC-outage re-queue.
---
 .../app/_components/dashboard.tsx             |   2 +-
 apps/parent-control/app/_components/types.ts  |   2 +
 apps/parent-control/lib/constants.ts          |   2 +
 crates/agentkeys-daemon/src/ui_bridge.rs      |  15 +-
 crates/agentkeys-worker-audit/src/anchor.rs   |  90 ++++++-
 crates/agentkeys-worker-audit/src/handlers.rs |  42 +++-
 crates/agentkeys-worker-audit/src/service.rs  | 232 +++++++++++++++++-
 crates/agentkeys-worker-audit/src/state.rs    |   5 +
 docs/operator-runbook-harness.md              |  13 +-
 scripts/heima-worker-smoke.sh                 |  79 ++++++
 10 files changed, 454 insertions(+), 28 deletions(-)

diff --git a/apps/parent-control/app/_components/dashboard.tsx b/apps/parent-control/app/_components/dashboard.tsx
index c8a99ff7..ec867745 100644
--- a/apps/parent-control/app/_components/dashboard.tsx
+++ b/apps/parent-control/app/_components/dashboard.tsx
@@ -297,7 +297,7 @@ export function AuditFeed({
 }) {
   const [filter, setFilter] = useState<string>('all');
   const filtered = filter === 'all' ? events : events.filter((e) => e.chip === filter);
-  const filters: (ChipKind | 'all')[] = ['all', 'memory', 'creds', 'payment', 'audit', 'chain', 'broker'];
+  const filters: (ChipKind | 'all')[] = ['all', 'memory', 'creds', 'payment', 'audit', 'chain', 'broker', 'worker', 'anchor'];
 
   if (events.length === 0) {
     return (
diff --git a/apps/parent-control/app/_components/types.ts b/apps/parent-control/app/_components/types.ts
index 7f7ed515..3f48dd4e 100644
--- a/apps/parent-control/app/_components/types.ts
+++ b/apps/parent-control/app/_components/types.ts
@@ -50,6 +50,8 @@ export type ChipKind =
   | 'audit'
   | 'broker'
   | 'chain'
+  | 'worker'
+  | 'anchor'
   | 'payment'
   | 'revoke'
   | 'scope'
diff --git a/apps/parent-control/lib/constants.ts b/apps/parent-control/lib/constants.ts
index 21756e46..3d18447d 100644
--- a/apps/parent-control/lib/constants.ts
+++ b/apps/parent-control/lib/constants.ts
@@ -23,6 +23,8 @@ export const CHIP_STYLES: Record<ChipKind, string> = {
   audit: 'chip',
   broker: 'chip',
   chain: 'chip ok',
+  worker: 'chip',
+  anchor: 'chip ok',
   payment: 'chip warn',
   revoke: 'chip bad',
   scope: 'chip',
diff --git a/crates/agentkeys-daemon/src/ui_bridge.rs b/crates/agentkeys-daemon/src/ui_bridge.rs
index 993ba16c..64de4ff6 100644
--- a/crates/agentkeys-daemon/src/ui_bridge.rs
+++ b/crates/agentkeys-daemon/src/ui_bridge.rs
@@ -6178,9 +6178,7 @@ async fn push_audit(state: &SharedUiBridgeState, evt: ApiAuditEvent) {
     if let Some(hashes) = &evt.audit_envelope_hashes {
         if !hashes.is_empty() {
             let mut seen = state.seen_envelope_hashes.write().await;
-            let any_new = hashes
-                .iter()
-                .any(|h| !seen.1.contains(&h.to_lowercase()));
+            let any_new = hashes.iter().any(|h| !seen.1.contains(&h.to_lowercase()));
             if !any_new {
                 return;
             }
@@ -9134,7 +9132,10 @@ mod tests {
         assert_eq!(buf, "event: anch", "partial frame stays buffered");
         buf.push_str("or\ndata: {\"b\":2}\n\n");
         let frames = drain_sse_frames(&mut buf);
-        assert_eq!(frames, vec![("anchor".to_string(), "{\"b\":2}".to_string())]);
+        assert_eq!(
+            frames,
+            vec![("anchor".to_string(), "{\"b\":2}".to_string())]
+        );
         assert!(buf.is_empty());
     }
 
@@ -9163,7 +9164,11 @@ mod tests {
     async fn push_audit_dedups_by_envelope_hash_either_order() {
         let state = make_state();
         // Bridge delivers the worker envelope first…
-        push_audit(&state, worker_feed_event_to_api(&worker_evt("event", 0x33, 0))).await;
+        push_audit(
+            &state,
+            worker_feed_event_to_api(&worker_evt("event", 0x33, 0)),
+        )
+        .await;
         assert_eq!(state.audit.read().await.len(), 1);
         // …then the local submit flow pushes its own event carrying the SAME
         // receipt hash → dropped as a duplicate.
diff --git a/crates/agentkeys-worker-audit/src/anchor.rs b/crates/agentkeys-worker-audit/src/anchor.rs
index 099801a7..e1d62ae8 100644
--- a/crates/agentkeys-worker-audit/src/anchor.rs
+++ b/crates/agentkeys-worker-audit/src/anchor.rs
@@ -39,6 +39,12 @@ pub struct RelayConfig {
     pub chain_id: u64,
     /// `CredentialAudit` contract address (20 bytes).
     pub credential_audit: [u8; 20],
+    /// `SidecarRegistry` address — the anti-spam gate: anchors are only
+    /// submitted for operators with a registered on-chain master
+    /// (`operatorMasterWallet(omni) != 0`). Without this, the open
+    /// `append/v2` endpoint would let a spammer mint arbitrary operator
+    /// omnis and drain the relay one anchor tx per fake operator per tick.
+    pub sidecar_registry: [u8; 20],
     pub signing_key: SigningKey,
     /// 20-byte EVM address derived from `signing_key`.
     pub relay_address: [u8; 20],
@@ -91,7 +97,9 @@ impl RelayConfig {
         let (profile, picked) = agentkeys_core::chain_profile::ChainProfile::resolve(
             None,
             std::env::var("AGENTKEYS_CHAIN").ok().as_deref(),
-            std::env::var("AGENTKEYS_CHAIN_PROFILE_FILE").ok().as_deref(),
+            std::env::var("AGENTKEYS_CHAIN_PROFILE_FILE")
+                .ok()
+                .as_deref(),
         )?;
         info!(profile = %profile.name, %picked, "anchor relay chain profile");
 
@@ -107,6 +115,14 @@ impl RelayConfig {
                 .address
                 .clone(),
         };
+        let registry_addr_hex = match std::env::var("AGENTKEYS_AUDIT_REGISTRY_ADDRESS") {
+            Ok(a) if !a.is_empty() => a,
+            _ => profile
+                .contract("SidecarRegistry")
+                .ok_or_else(|| anyhow!("chain profile {} has no SidecarRegistry", profile.name))?
+                .address
+                .clone(),
+        };
 
         let gas_limit = env_u128("AGENTKEYS_AUDIT_ANCHOR_GAS_LIMIT", 200_000)?;
         let attempts = env_u128("AGENTKEYS_AUDIT_ANCHOR_ATTEMPTS", 3)? as u32;
@@ -116,6 +132,7 @@ impl RelayConfig {
             rpc_url,
             profile.chain_id,
             &audit_addr_hex,
+            &registry_addr_hex,
             &key_hex,
             gas_limit,
             attempts,
@@ -130,6 +147,7 @@ impl RelayConfig {
         rpc_url: String,
         chain_id: u64,
         credential_audit_hex: &str,
+        sidecar_registry_hex: &str,
         relay_key_hex: &str,
         gas_limit: u128,
         attempts: u32,
@@ -138,19 +156,20 @@ impl RelayConfig {
     ) -> Result<Self> {
         let credential_audit = decode20(credential_audit_hex)
             .ok_or_else(|| anyhow!("CredentialAudit address must be 20-byte hex"))?;
+        let sidecar_registry = decode20(sidecar_registry_hex)
+            .ok_or_else(|| anyhow!("SidecarRegistry address must be 20-byte hex"))?;
         let key_bytes =
             hex::decode(relay_key_hex.trim().trim_start_matches("0x")).context("relay key hex")?;
         let signing_key = SigningKey::from_slice(&key_bytes).context("relay key")?;
         let relay_address = eth_address(&signing_key);
-        let relay_wallet = agentkeys_types::WalletAddress(format!(
-            "0x{}",
-            hex::encode(relay_address)
-        ));
+        let relay_wallet =
+            agentkeys_types::WalletAddress(format!("0x{}", hex::encode(relay_address)));
         let relay_omni = agentkeys_core::actor_omni::actor_omni_from_wallet(&relay_wallet);
         Ok(Self {
             rpc_url,
             chain_id,
             credential_audit,
+            sidecar_registry,
             signing_key,
             relay_address,
             relay_omni,
@@ -327,7 +346,10 @@ async fn rpc_call(
     for _ in 0..3 {
         match http.post(rpc_url).json(&body).send().await {
             Ok(resp) if resp.status().is_success() => {
-                let v: Value = resp.json().await.map_err(|e| anyhow!("{method} json: {e}"))?;
+                let v: Value = resp
+                    .json()
+                    .await
+                    .map_err(|e| anyhow!("{method} json: {e}"))?;
                 if let Some(err) = v.get("error") {
                     // RPC-level errors are NOT transient (bad tx, low funds) —
                     // surface immediately.
@@ -343,6 +365,38 @@ async fn rpc_call(
     bail!("{last} (after 3 tries)")
 }
 
+/// Anti-spam anchor gate (#109): does this operator have a registered
+/// on-chain master? `eth_call SidecarRegistry.operatorMasterWallet(omni)`
+/// → non-zero address. Spammers minting fake operator omnis through the
+/// open `append/v2` endpoint fail this gate, so they can fill queues but
+/// never burn relay gas. RPC errors return `Err` (the caller re-queues —
+/// never drop real events on a transient flake, never burn gas blind).
+pub async fn operator_has_master(
+    cfg: &RelayConfig,
+    http: &reqwest::Client,
+    operator_omni: [u8; 32],
+) -> Result<bool> {
+    let selector = &keccak256(b"operatorMasterWallet(bytes32)")[..4];
+    let mut data = Vec::with_capacity(4 + 32);
+    data.extend_from_slice(selector);
+    data.extend_from_slice(&operator_omni);
+    let result = rpc_call(
+        http,
+        &cfg.rpc_url,
+        "eth_call",
+        json!([{
+            "to": format!("0x{}", hex::encode(cfg.sidecar_registry)),
+            "data": format!("0x{}", hex::encode(data)),
+        }, "latest"]),
+    )
+    .await?;
+    let raw = result
+        .as_str()
+        .ok_or_else(|| anyhow!("eth_call returned non-string"))?;
+    let bytes = hex::decode(raw.trim_start_matches("0x")).unwrap_or_default();
+    Ok(bytes.iter().any(|b| *b != 0))
+}
+
 /// Current relay balance in wei (`eth_getBalance`), `None` on any RPC
 /// trouble — diagnostics only, never load-bearing.
 pub async fn relay_balance_wei(cfg: &RelayConfig, http: &reqwest::Client) -> Option<u128> {
@@ -420,6 +474,7 @@ mod tests {
             "http://127.0.0.1:1".into(),
             212013,
             &format!("0x{}", "11".repeat(20)),
+            &format!("0x{}", "22".repeat(20)),
             &format!("0x{}", "46".repeat(32)),
             200_000,
             3,
@@ -459,6 +514,19 @@ mod tests {
                         "eth_gasPrice" => json!("0x3b9aca00"),
                         "eth_sendRawTransaction" => json!(format!("0x{}", "ab".repeat(32))),
                         "eth_getTransactionReceipt" => json!({"status": "0x1"}),
+                        // Anti-spam gate: operator omni 0x22… is "registered"
+                        // (non-zero master), everything else isn't.
+                        "eth_call" => {
+                            let data = req["params"][0]["data"].as_str().unwrap_or("");
+                            if data.ends_with(&"22".repeat(32)) {
+                                json!(format!(
+                                    "0x{:0>64}",
+                                    "9d8a62f656a8d1615c1294fd71e9cfb3e4855a4f"
+                                ))
+                            } else {
+                                json!(format!("0x{}", "00".repeat(32)))
+                            }
+                        }
                         other => panic!("unexpected method {other}"),
                     };
                     Ok(Json(json!({"jsonrpc": "2.0", "id": 1, "result": result})))
@@ -478,6 +546,7 @@ mod tests {
             rpc_url,
             212013,
             &format!("0x{}", "11".repeat(20)),
+            &format!("0x{}", "22".repeat(20)),
             &format!("0x{}", "46".repeat(32)),
             200_000,
             3,
@@ -514,6 +583,15 @@ mod tests {
         assert_eq!(receipt.attempts_used, 1);
     }
 
+    #[tokio::test]
+    async fn operator_gate_distinguishes_registered_from_spam() {
+        let (url, _) = spawn_fake_rpc(0).await;
+        let cfg = test_cfg(url);
+        let http = reqwest::Client::new();
+        assert!(operator_has_master(&cfg, &http, [0x22; 32]).await.unwrap());
+        assert!(!operator_has_master(&cfg, &http, [0x99; 32]).await.unwrap());
+    }
+
     #[tokio::test]
     async fn exhausted_attempts_return_failure_with_last_error() {
         // Every request 503s: 3 batch attempts × 3 transport tries all fail.
diff --git a/crates/agentkeys-worker-audit/src/handlers.rs b/crates/agentkeys-worker-audit/src/handlers.rs
index 4ad40c2e..9596f0f1 100644
--- a/crates/agentkeys-worker-audit/src/handlers.rs
+++ b/crates/agentkeys-worker-audit/src/handlers.rs
@@ -68,9 +68,11 @@ pub struct FlushResponse {
     /// the `appendRootV2(operatorOmni, merkleRoot, opKindBitmap, entryCount)`
     /// inputs for the on-chain anchor.
     pub flushed_v2: Vec<FlushV2Result>,
-    /// #109: batches the tier-A relay anchored on-chain during this flush
-    /// (empty when the relay is unconfigured or a batch failed + re-queued).
-    pub anchored: Vec<AnchorRecord>,
+    /// #109: whether the flushed V2 batches were handed to the tier-A
+    /// anchor task (poll `GET /v1/audit/anchors/:operator` for the
+    /// confirmed records — a flush response never waits out a chain
+    /// confirmation).
+    pub anchor_scheduled: bool,
 }
 
 pub async fn flush_one(
@@ -81,15 +83,18 @@ pub async fn flush_one(
         .flush(&operator_omni)
         .await
         .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?;
-    let (r2, anchored) =
-        crate::service::flush_v2_and_anchor(&state, Some(&operator_omni.to_lowercase()))
-            .await
-            .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?;
+    let r2: Vec<FlushV2Result> = state
+        .flush_v2(&operator_omni.to_lowercase())
+        .await
+        .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?
+        .into_iter()
+        .collect();
+    let anchor_scheduled = spawn_anchor(&state, &r2);
     Ok(Json(FlushResponse {
         ok: true,
         flushed: r.into_iter().collect(),
         flushed_v2: r2,
-        anchored,
+        anchor_scheduled,
     }))
 }
 
@@ -100,17 +105,34 @@ pub async fn flush_all(
         .flush_all()
         .await
         .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?;
-    let (r2, anchored) = crate::service::flush_v2_and_anchor(&state, None)
+    let r2 = state
+        .flush_v2_all()
         .await
         .map_err(|e| (StatusCode::INTERNAL_SERVER_ERROR, e.to_string()))?;
+    let anchor_scheduled = spawn_anchor(&state, &r2);
     Ok(Json(FlushResponse {
         ok: true,
         flushed: r,
         flushed_v2: r2,
-        anchored,
+        anchor_scheduled,
     }))
 }
 
+/// Hand flushed V2 batches to the background anchor task (#109). Returns
+/// whether anchoring was actually scheduled (relay configured + batches
+/// non-empty).
+fn spawn_anchor(state: &SharedState, flushed: &[FlushV2Result]) -> bool {
+    if flushed.is_empty() || state.relay.is_none() {
+        return false;
+    }
+    let state = state.clone();
+    let flushed = flushed.to_vec();
+    tokio::spawn(async move {
+        crate::service::anchor_flushed(&state, &flushed).await;
+    });
+    true
+}
+
 #[derive(Serialize)]
 pub struct QueueSizeResponse {
     pub operator_omni: String,
diff --git a/crates/agentkeys-worker-audit/src/service.rs b/crates/agentkeys-worker-audit/src/service.rs
index 5d2a3a15..a989eda4 100644
--- a/crates/agentkeys-worker-audit/src/service.rs
+++ b/crates/agentkeys-worker-audit/src/service.rs
@@ -38,7 +38,9 @@ fn decode32_hex(s: &str) -> [u8; 32] {
 }
 
 /// Drain V2 queues (one operator, or all when `None`) and anchor each
-/// batch. Returns the flush results plus the anchors that landed.
+/// batch inline. The TIMER path — anchoring (chain receipt wait included)
+/// happens before the next tick. Returns the flush results plus the
+/// anchors that landed.
 pub async fn flush_v2_and_anchor(
     state: &SharedState,
     operator_omni: Option<&str>,
@@ -47,13 +49,22 @@ pub async fn flush_v2_and_anchor(
         Some(op) => state.flush_v2(op).await?.into_iter().collect(),
         None => state.flush_v2_all().await?,
     };
+    let anchors = anchor_flushed(state, &flushed).await;
+    Ok((flushed, anchors))
+}
+
+/// Anchor a set of already-flushed batches. The HTTP flush handlers SPAWN
+/// this (a flush response must not wait out a chain confirmation — callers
+/// poll `GET /v1/audit/anchors/:operator` for the outcome, mirroring the
+/// "anchored within 2 min" product contract); the timer awaits it inline.
+pub async fn anchor_flushed(state: &SharedState, flushed: &[FlushV2Result]) -> Vec<AnchorRecord> {
     let mut anchors = Vec::new();
-    for flush in &flushed {
+    for flush in flushed {
         if let Some(record) = anchor_one_batch(state, flush).await {
             anchors.push(record);
         }
     }
-    Ok((flushed, anchors))
+    anchors
 }
 
 /// Anchor a single flushed batch. `None` when the relay is unconfigured or
@@ -74,6 +85,36 @@ async fn anchor_one_batch(state: &SharedState, flush: &FlushV2Result) -> Option<
     let operator32 = decode32_hex(&flush.operator_omni);
     let ts = now_unix();
 
+    // Anti-spam gate (#109): anchor only operators with a registered
+    // on-chain master. The open append/v2 endpoint otherwise lets a
+    // spammer mint fake operator omnis that each cost the relay one tx
+    // per tick. Unregistered → DROP (the envelopes stay fetchable by
+    // hash; re-queueing spam forever would grow the queue unboundedly).
+    // Transient RPC failure → re-queue and retry next tick (never drop
+    // real events on a flake, never burn gas blind).
+    match operator_anchor_allowed(state, &flush.operator_omni, operator32).await {
+        Ok(true) => {}
+        Ok(false) => {
+            warn!(
+                operator_omni = %flush.operator_omni,
+                entries = flush.entry_count,
+                "anchor gate: operator has NO registered master — batch dropped (spam posture)"
+            );
+            return None;
+        }
+        Err(e) => {
+            warn!(
+                operator_omni = %flush.operator_omni,
+                error = %e,
+                "anchor gate: registry check failed — batch re-queued for next tick"
+            );
+            state
+                .requeue_v2(&flush.operator_omni, flush.entries.clone())
+                .await;
+            return None;
+        }
+    }
+
     // The anchor envelope — an honest AuditEnvelope whose hash goes on
     // chain. actor = the relay's derived omni; operator = the REAL
     // operator whose batch this is (stays an indexed topic on-chain).
@@ -125,7 +166,9 @@ async fn anchor_one_batch(state: &SharedState, flush: &FlushV2Result) -> Option<
 
     match submit_anchor_with_retries(relay, &state.http, calldata).await {
         Ok(receipt) => {
-            state.store_envelope(env_hash_hex.clone(), cbor.clone()).await;
+            state
+                .store_envelope(env_hash_hex.clone(), cbor.clone())
+                .await;
             if let Some(archive) = &state.archive {
                 archive.archive_envelope(env_hash_hex.clone(), cbor);
             }
@@ -181,6 +224,38 @@ async fn anchor_one_batch(state: &SharedState, flush: &FlushV2Result) -> Option<
     }
 }
 
+/// TTL-cached `operatorMasterWallet(omni) != 0` check (the #109 anti-spam
+/// anchor gate). Positive answers cache 10 min; negative 60 s.
+async fn operator_anchor_allowed(
+    state: &SharedState,
+    operator_hex: &str,
+    operator32: [u8; 32],
+) -> anyhow::Result<bool> {
+    const POSITIVE_TTL: std::time::Duration = std::time::Duration::from_secs(600);
+    const NEGATIVE_TTL: std::time::Duration = std::time::Duration::from_secs(60);
+    let key = operator_hex.to_lowercase();
+    {
+        let cache = state.anchor_gate_cache.lock().await;
+        if let Some((allowed, at)) = cache.get(&key) {
+            let ttl = if *allowed { POSITIVE_TTL } else { NEGATIVE_TTL };
+            if at.elapsed() < ttl {
+                return Ok(*allowed);
+            }
+        }
+    }
+    let relay = state
+        .relay
+        .as_ref()
+        .ok_or_else(|| anyhow::anyhow!("gate called without relay"))?;
+    let allowed = crate::anchor::operator_has_master(relay, &state.http, operator32).await?;
+    state
+        .anchor_gate_cache
+        .lock()
+        .await
+        .insert(key, (allowed, std::time::Instant::now()));
+    Ok(allowed)
+}
+
 /// Emit the `AuditBatchFailed` envelope: stored by hash, queued for a
 /// future anchor (so the failure itself lands on-chain once the chain
 /// recovers), and pushed to the live feed.
@@ -228,7 +303,9 @@ async fn emit_batch_failed(
         }
     };
     let env_hash_hex = format!("0x{}", hex::encode(env_hash));
-    state.store_envelope(env_hash_hex.clone(), cbor.clone()).await;
+    state
+        .store_envelope(env_hash_hex.clone(), cbor.clone())
+        .await;
     if let Some(archive) = &state.archive {
         archive.archive_envelope(env_hash_hex.clone(), cbor);
     }
@@ -263,3 +340,148 @@ async fn emit_batch_failed(
         archive.archive_feed_event(evt);
     }
 }
+
+#[cfg(test)]
+mod tests {
+    use super::*;
+    use crate::anchor::RelayConfig;
+    use crate::state::{State, V2QueueEntry};
+    use axum::{routing::post, Json, Router};
+    use serde_json::{json, Value};
+    use std::sync::Arc;
+    use std::time::Duration;
+
+    /// Fake JSON-RPC node for the full flush→gate→anchor path. Operator
+    /// omni 0x22… is registered (non-zero master); anything else isn't.
+    async fn spawn_fake_rpc() -> String {
+        let app = Router::new().route(
+            "/",
+            post(move |Json(req): Json<Value>| async move {
+                let method = req.get("method").and_then(|m| m.as_str()).unwrap_or("");
+                let result = match method {
+                    "eth_getTransactionCount" => json!("0x0"),
+                    "eth_gasPrice" => json!("0x3b9aca00"),
+                    "eth_sendRawTransaction" => json!(format!("0x{}", "cd".repeat(32))),
+                    "eth_getTransactionReceipt" => json!({"status": "0x1"}),
+                    "eth_call" => {
+                        let data = req["params"][0]["data"].as_str().unwrap_or("");
+                        if data.ends_with(&"22".repeat(32)) {
+                            json!(format!(
+                                "0x{:0>64}",
+                                "9d8a62f656a8d1615c1294fd71e9cfb3e4855a4f"
+                            ))
+                        } else {
+                            json!(format!("0x{}", "00".repeat(32)))
+                        }
+                    }
+                    other => panic!("unexpected method {other}"),
+                };
+                Json(json!({"jsonrpc": "2.0", "id": 1, "result": result}))
+            }),
+        );
+        let listener = tokio::net::TcpListener::bind("127.0.0.1:0").await.unwrap();
+        let addr = listener.local_addr().unwrap();
+        tokio::spawn(async move {
+            axum::serve(listener, app).await.ok();
+        });
+        format!("http://{addr}/")
+    }
+
+    fn relay_cfg(rpc_url: String) -> RelayConfig {
+        RelayConfig::build(
+            rpc_url,
+            212013,
+            &format!("0x{}", "11".repeat(20)),
+            &format!("0x{}", "33".repeat(20)),
+            &format!("0x{}", "46".repeat(32)),
+            200_000,
+            3,
+            Duration::ZERO,
+            Duration::from_secs(5),
+        )
+        .unwrap()
+    }
+
+    fn entry(hash_byte: u8, op_kind: u8) -> V2QueueEntry {
+        V2QueueEntry {
+            envelope_hash: format!("0x{}", hex::encode([hash_byte; 32])),
+            op_kind,
+            actor_omni: format!("0x{}", "aa".repeat(32)),
+            ts_unix: 1_700_000_000,
+        }
+    }
+
+    #[tokio::test]
+    async fn registered_operator_batch_anchors_end_to_end() {
+        let rpc = spawn_fake_rpc().await;
+        let state = Arc::new(State::new("/tmp".into()).with_relay(Some(relay_cfg(rpc))));
+        let registered_op = format!("0x{}", "22".repeat(32));
+        state.queue_v2(registered_op.clone(), entry(0x01, 1)).await;
+        state.queue_v2(registered_op.clone(), entry(0x02, 11)).await;
+
+        let mut feed = state.subscribe_feed();
+        let (flushed, anchored) = flush_v2_and_anchor(&state, None).await.unwrap();
+        std::fs::remove_file(&flushed[0].leaves_path).ok();
+        assert_eq!(flushed.len(), 1);
+        assert_eq!(anchored.len(), 1);
+        assert_eq!(anchored[0].tx_hash, format!("0x{}", "cd".repeat(32)));
+        assert_eq!(anchored[0].entries.len(), 2);
+
+        // The anchor surfaced in the live feed + the anchors endpoint state
+        // + the by-hash envelope store (typed back as AuditRootAnchor).
+        let evt = feed.try_recv().expect("anchor feed event");
+        assert_eq!(evt.kind, "anchor");
+        assert_eq!(evt.op_kind, 90);
+        assert_eq!(evt.tx_hash.as_deref(), Some(anchored[0].tx_hash.as_str()));
+        let records = state.anchors_for(&registered_op).await;
+        assert_eq!(records.len(), 1);
+        let cbor = state
+            .get_envelope(&anchored[0].anchor_envelope_hash)
+            .await
+            .expect("anchor envelope stored");
+        let env = agentkeys_core::audit::AuditEnvelope::from_canonical_cbor(&cbor).unwrap();
+        match env.typed_body().unwrap() {
+            agentkeys_core::audit::TypedAuditBody::AuditRootAnchor(b) => {
+                assert_eq!(b.entry_count, 2);
+                assert_eq!(b.merkle_root, anchored[0].merkle_root_hex);
+            }
+            other => panic!("unexpected body {other:?}"),
+        }
+        // Queue is empty — nothing re-queued on success.
+        assert!(state.flush_v2(&registered_op).await.unwrap().is_none());
+    }
+
+    #[tokio::test]
+    async fn unregistered_operator_batch_is_dropped_not_anchored() {
+        let rpc = spawn_fake_rpc().await;
+        let state = Arc::new(State::new("/tmp".into()).with_relay(Some(relay_cfg(rpc))));
+        let spam_op = format!("0x{}", "99".repeat(32));
+        state.queue_v2(spam_op.clone(), entry(0x05, 1)).await;
+
+        let (flushed, anchored) = flush_v2_and_anchor(&state, None).await.unwrap();
+        std::fs::remove_file(&flushed[0].leaves_path).ok();
+        assert_eq!(flushed.len(), 1, "flush still drains");
+        assert!(anchored.is_empty(), "no gas burned for spam omnis");
+        // Dropped, not re-queued — the queue stays empty.
+        assert!(state.flush_v2(&spam_op).await.unwrap().is_none());
+    }
+
+    #[tokio::test]
+    async fn rpc_outage_requeues_and_emits_nothing() {
+        // Relay points at a closed port: the GATE check itself fails →
+        // conservative re-queue (no drop, no batch_failed — transient).
+        let listener = tokio::net::TcpListener::bind("127.0.0.1:0").await.unwrap();
+        let dead = format!("http://{}/", listener.local_addr().unwrap());
+        drop(listener);
+        let state = Arc::new(State::new("/tmp".into()).with_relay(Some(relay_cfg(dead))));
+        let op = format!("0x{}", "22".repeat(32));
+        state.queue_v2(op.clone(), entry(0x07, 1)).await;
+
+        let (flushed, anchored) = flush_v2_and_anchor(&state, None).await.unwrap();
+        std::fs::remove_file(&flushed[0].leaves_path).ok();
+        assert!(anchored.is_empty());
+        let r = state.flush_v2(&op).await.unwrap().expect("re-queued");
+        std::fs::remove_file(&r.leaves_path).ok();
+        assert_eq!(r.entry_count, 1, "entries preserved for the next tick");
+    }
+}
diff --git a/crates/agentkeys-worker-audit/src/state.rs b/crates/agentkeys-worker-audit/src/state.rs
index fb66ea1d..64f351b6 100644
--- a/crates/agentkeys-worker-audit/src/state.rs
+++ b/crates/agentkeys-worker-audit/src/state.rs
@@ -134,6 +134,10 @@ pub struct State {
     pub http: reqwest::Client,
     /// S3 cold archive (#109). `None` = in-memory only.
     pub archive: Option<crate::archive::Archive>,
+    /// Anti-spam anchor-gate cache (#109): operator_omni → (has_master,
+    /// checked_at). Positive entries valid 10 min, negative 60 s (so a
+    /// freshly-registered operator isn't held back long).
+    pub anchor_gate_cache: Mutex<HashMap<String, (bool, std::time::Instant)>>,
 }
 
 impl State {
@@ -156,6 +160,7 @@ impl State {
             relay: None,
             http: reqwest::Client::new(),
             archive: None,
+            anchor_gate_cache: Mutex::new(HashMap::new()),
         }
     }
 
diff --git a/docs/operator-runbook-harness.md b/docs/operator-runbook-harness.md
index 62e00b16..ec01d65e 100644
--- a/docs/operator-runbook-harness.md
+++ b/docs/operator-runbook-harness.md
@@ -233,7 +233,18 @@ commitment), and the envelope must NOT contain the roundtrip plaintext. Skip rea
 `audit-receipt-missing` (worker predates #229 — redeploy the broker host — or the
 emit dropped in best-effort mode) and `audit-url-unset` (stale env file). The tier-A
 on-chain anchor itself is exercised by `scripts/heima-worker-smoke.sh` (stage-2 step
-10), which now also flushes the V2 envelope queue and submits `appendRootV2`.
+10), which flushes the V2 envelope queue, submits the master-gated `appendRoot`
+(tier-B/C sovereign path), and — since #109 — additionally asserts the **two-tier
+wiring**: the appended envelope must appear on the worker's Tier-1 SSE backfill
+(`GET /v1/audit/stream?backfill=N`), and the worker's autonomous tier-A relay must
+anchor the batch on-chain (poll `GET /v1/audit/anchors/:operator` ≤90 s for the
+record, `cast receipt` confirms the `appendV2` tx, then a local Merkle-proof walk
+verifies the genuine envelope AND proves a tampered one fails — the #109
+tamper-evidence check). Tolerated smoke skips: `relay-not-configured` (host has no
+`/etc/agentkeys/audit-relay.key` — re-run `setup-broker-host.sh`) and
+`anchor-not-recorded` (operator unregistered on the anti-spam gate, or relay
+unfunded — `bash scripts/heima-fund-audit-relay.sh`, auto-run by `setup-heima.sh`
+step 14).
 
 ### CI flag reference
 
diff --git a/scripts/heima-worker-smoke.sh b/scripts/heima-worker-smoke.sh
index 82213411..2b258f25 100755
--- a/scripts/heima-worker-smoke.sh
+++ b/scripts/heima-worker-smoke.sh
@@ -198,12 +198,30 @@ else
   [ -z "$V2_ENVELOPE_HASH" ] && die "append/v2 returned no envelope_hash — body: $V2_OUT"
   ok "queued V2 envelope (op_kind=1 cred.fetch) — envelope_hash=$V2_ENVELOPE_HASH"
 
+  # ─── #109 Tier-1 feed: the appended envelope must be visible on the SSE
+  # stream's ring-buffer backfill (read-only; deterministic — no live-race).
+  log "Tier-1 SSE feed (#109) — envelope visible via backfill"
+  SSE_OUT=$(curl -sN --max-time 5 \
+    "$AUDIT_URL/v1/audit/stream?operator=0x$OPERATOR_OMNI&backfill=50" \
+    2>/dev/null | head -c 65536 || true)
+  if printf '%s' "$SSE_OUT" | grep -q "$V2_ENVELOPE_HASH"; then
+    ok "SSE backfill carries the appended envelope"
+  else
+    die "appended envelope $V2_ENVELOPE_HASH missing from the SSE backfill (is the worker pre-#109?)"
+  fi
+
   if [ "$DRY_RUN" = "1" ]; then
     log "DRY RUN — would flush + appendRoot + appendRootV2 now"
     echo "{\"ok\":true,\"dry_run\":true,\"audit_queued\":2,\"audit_v2_queued\":1}"
     exit 0
   fi
 
+  # #109: top up the tier-A anchor relay BEFORE flushing (idempotent
+  # skip-if-funded; tolerated skip when the relay/worker isn't deployed) so
+  # the background anchor task this flush schedules has gas on first runs.
+  bash "$REPO_ROOT/scripts/heima-fund-audit-relay.sh" >/dev/null \
+    || info "audit-relay funding failed — the anchor leg below may skip"
+
   log "Flushing queue → Merkle root"
   FLUSH_OUT=$(curl -sf --max-time 10 -X POST "$AUDIT_URL/v1/audit/flush/0x$OPERATOR_OMNI" 2>&1) \
     || die "flush failed: $FLUSH_OUT"
@@ -274,6 +292,67 @@ else
   [ "$STORED_ROOT_LC" = "$ROOT_LC" ] || die "stored root $STORED_ROOT != flushed root $ROOT"
   ok "on-chain root matches flushed root (idx $LAST_IDX)"
 
+  # ─── #109 Tier-A relay anchor + tamper-evidence ────────────────────────────
+  # The flush above handed the V2 batch to the worker's background anchor
+  # task (appendV2 + AuditRootAnchor envelope, signed by the relay EOA).
+  # Poll /v1/audit/anchors for the confirmed record, then verify the Merkle
+  # proof walk locally — and that a TAMPERED leaf fails it (the #109
+  # acceptance check). Tolerated skips: relay unconfigured (degraded host)
+  # or the operator unregistered on the anti-spam gate.
+  log "Tier-A relay anchor (#109) — poll for the confirmed anchor record"
+  RELAY_INFO=$(curl -sf --max-time 10 "$AUDIT_URL/v1/audit/relay-info" 2>/dev/null || echo '{}')
+  if [ "$(echo "$RELAY_INFO" | jq -r '.anchor_enabled // false')" != "true" ]; then
+    info "skip relay-not-configured — worker in degraded mode (no AGENTKEYS_AUDIT_RELAY_KEY_FILE on the host)"
+  else
+    ANCHOR_RECORD=""
+    for _i in $(seq 1 30); do
+      ANCHORS=$(curl -sf --max-time 10 "$AUDIT_URL/v1/audit/anchors/0x$OPERATOR_OMNI" 2>/dev/null || echo '{}')
+      ANCHOR_RECORD=$(echo "$ANCHORS" | jq -c --arg h "$V2_ENVELOPE_HASH" \
+        '[.anchors[]? | select([.entries[].envelope_hash] | index($h))] | last // empty')
+      [ -n "$ANCHOR_RECORD" ] && break
+      sleep 3
+    done
+    if [ -z "$ANCHOR_RECORD" ]; then
+      info "skip anchor-not-recorded after 90s — operator likely unregistered on the anti-spam gate (registered master required), or the relay is unfunded (run scripts/heima-fund-audit-relay.sh); check worker logs"
+    else
+      ANCHOR_TX=$(echo "$ANCHOR_RECORD" | jq -r '.tx_hash')
+      ANCHOR_ROOT=$(echo "$ANCHOR_RECORD" | jq -r '.merkle_root_hex')
+      TX_STATUS=$(cast receipt "$ANCHOR_TX" status --rpc-url "$RPC_HTTP" 2>/dev/null || echo "")
+      case "$TX_STATUS" in
+        *1*) ok "anchor tx confirmed on-chain: $ANCHOR_TX" ;;
+        *)   die "anchor tx $ANCHOR_TX not confirmed (status: ${TX_STATUS:-unreadable})" ;;
+      esac
+      # Merkle-proof walk (mirrors CredentialAudit.verifyEntryInRoot: leaf
+      # prefixed 0x00, internal nodes 0x01 over the sorted pair).
+      PROOF_LINES=$(echo "$ANCHOR_RECORD" | jq -r --arg h "$V2_ENVELOPE_HASH" \
+        '.entries[] | select(.envelope_hash == $h) | .proof[]')
+      walk_proof() {
+        local computed sibling lo hi
+        computed=$(cast keccak "0x00${1#0x}")
+        for sibling in $PROOF_LINES; do
+          if [ "$(printf '%s\n%s\n' "${computed#0x}" "${sibling#0x}" | LC_ALL=C sort | head -1)" = "${computed#0x}" ]; then
+            lo=$computed; hi=$sibling
+          else
+            lo=$sibling; hi=$computed
+          fi
+          computed=$(cast keccak "0x01${lo#0x}${hi#0x}")
+        done
+        printf '%s' "$computed"
+      }
+      GENUINE_ROOT=$(walk_proof "$V2_ENVELOPE_HASH")
+      [ "$GENUINE_ROOT" = "$ANCHOR_ROOT" ] \
+        || die "genuine envelope failed its Merkle proof (walked $GENUINE_ROOT, anchored $ANCHOR_ROOT)"
+      ok "genuine envelope verifies against the anchored root"
+      FIRST_BYTE=$(printf '%s' "${V2_ENVELOPE_HASH#0x}" | cut -c1-2)
+      FLIP=ff; [ "$FIRST_BYTE" = "ff" ] && FLIP=00
+      TAMPERED_HASH="0x${FLIP}$(printf '%s' "${V2_ENVELOPE_HASH#0x}" | cut -c3-)"
+      TAMPERED_ROOT=$(walk_proof "$TAMPERED_HASH")
+      [ "$TAMPERED_ROOT" != "$ANCHOR_ROOT" ] \
+        || die "TAMPERED envelope still verified — tamper-evidence broken"
+      ok "tamper-evidence: a modified event fails its proof against the anchored root"
+    fi
+  fi
+
   # ─── V2 envelope batch → appendRootV2 (#229 two-tier anchor) ──────────────
   # The same flush above also drained the V2 envelope queue (the durable
   # audit feed the data-plane workers emit into). Commit its Merkle root +

From f789458bd3b8599ea71c2d7201bf2c25696e2ce6 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 22:01:50 +0800
Subject: [PATCH 12/17] =?UTF-8?q?docs:=20#109=20plan=20status=20=E2=80=94?=
 =?UTF-8?q?=20shipped=20in=20PR=20#281,=20deviations=20noted?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 docs/plan/issue-109-two-tier-audit.md | 5 ++++-
 1 file changed, 4 insertions(+), 1 deletion(-)

diff --git a/docs/plan/issue-109-two-tier-audit.md b/docs/plan/issue-109-two-tier-audit.md
index 757e17fd..a60081dd 100644
--- a/docs/plan/issue-109-two-tier-audit.md
+++ b/docs/plan/issue-109-two-tier-audit.md
@@ -1,6 +1,9 @@
 # Issue #109 — Two-tier audit wiring (real-time off-chain feed + 2-min on-chain anchor)
 
-**Status:** in progress. Builds on #229 (data-plane emits + V2 queues) and #97/#270
+**Status:** shipped in PR #281 (all 10 steps; step 9's assertions live in
+`heima-worker-smoke.sh` rather than a new stage-3 step — no renumbering of the
+23-step demo). Deferred: live anchor-loop verify on a deployed host;
+subscan renderers for op_kinds 90/91. Builds on #229 (data-plane emits + V2 queues) and #97/#270
 (control-plane emits + daemon/web audit receipts). Closes the #229-deferred open
 design item "audit-worker-initiated `appendRootV2` chain submission (tier-A relay
 wallet)".

From 376f23fce9ea275ed4a88d5af8051ea2ba036b50 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 22:06:48 +0800
Subject: [PATCH 13/17] fix: clippy while-let in drain_sse_frames (CI runs -D
 warnings --all-targets)

---
 crates/agentkeys-daemon/src/ui_bridge.rs | 3 +--
 1 file changed, 1 insertion(+), 2 deletions(-)

diff --git a/crates/agentkeys-daemon/src/ui_bridge.rs b/crates/agentkeys-daemon/src/ui_bridge.rs
index 64de4ff6..dcc9dcd6 100644
--- a/crates/agentkeys-daemon/src/ui_bridge.rs
+++ b/crates/agentkeys-daemon/src/ui_bridge.rs
@@ -6325,8 +6325,7 @@ fn worker_feed_event_to_api(evt: &agentkeys_types::audit_feed::AuditFeedEvent) -
 /// keep-alives are dropped. Returns `(event_name, data)` pairs.
 fn drain_sse_frames(buf: &mut String) -> Vec<(String, String)> {
     let mut frames = Vec::new();
-    loop {
-        let Some(pos) = buf.find("\n\n") else { break };
+    while let Some(pos) = buf.find("\n\n") {
         let frame: String = buf[..pos].to_string();
         buf.drain(..pos + 2);
         let mut event_name = "message".to_string();

From dba8581c8d4d5e261fdef86791f819d93a2f67da Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 22:39:08 +0800
Subject: [PATCH 14/17] =?UTF-8?q?ci:=20#209=20=E2=80=94=20config=20quarant?=
 =?UTF-8?q?ine=20self-dissolves:=20steps=2019+21=20run=20as=20live=20gates?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The #209 tripwire fired on this branch's run exactly as designed: the test
config worker became reachable (this PR's broker redeploy converged it) and
stage-3 steps 19 (config bucket/role write + cross-bucket denials) and 21
(cross-data-class cap -> 403 cap_data_class_mismatch) both ran LIVE and
passed. Per the guard's own instructions: drop config-role-missing +
config-worker-unreachable from the v2-demo --allow-skip and delete the
now-inert self-dissolving Guard step. The demo's skip reasons remain for
operator-local runs; CI now fails closed on config-worker drift.
---
 .github/workflows/harness-ci.yml | 52 +++-----------------------------
 1 file changed, 5 insertions(+), 47 deletions(-)

diff --git a/.github/workflows/harness-ci.yml b/.github/workflows/harness-ci.yml
index 6ce9b719..2b3cd919 100644
--- a/.github/workflows/harness-ci.yml
+++ b/.github/workflows/harness-ci.yml
@@ -955,15 +955,10 @@ jobs:
         # every OTHER prereq still fails closed):
         #   - scope-not-set: setScopeWithWebauthn needs a real WebAuthn assertion
         #     (heima-scope-set.sh L172) a no-Touch-ID runner cannot produce.
-        #   - config-role-missing / config-worker-unreachable (#201): the test
-        #     config bucket+role AND the config worker DNS+cert (config-test.<zone>)
-        #     are ONE operator one-shot (setup-cloud.sh --ci + the broker deploy's
-        #     certbot). Until then step 19 (config write) + step 21 (config-worker
-        #     reachability) skip cleanly; step 20 (config cap rejected by memory +
-        #     cred workers) still runs. The "Guard" step below keeps this honest —
-        #     warns every run while config-test is unprovisioned (#209) + FAILS once
-        #     it becomes reachable, so the allowance can't silently persist. Drop
-        #     both (allowance + guard) once the test config infra exists.
+        #   - (#209 RESOLVED 2026-06-11: the config-role-missing /
+        #     config-worker-unreachable allowances are GONE — the test config
+        #     bucket+role+worker are provisioned and steps 19+21 run as LIVE
+        #     gates; the self-dissolving Guard step was removed with them.)
         #
         # The workflow_dispatch `stage` input still selects a single phase
         # (`--stage N`); push/PR (stage='' / 'all') runs the full 1-4 + 6 sequence.
@@ -971,50 +966,13 @@ jobs:
           STAGE: ${{ inputs.stage }}
         run: |
           set -euo pipefail
-          ARGS=(--ci --allow-skip=scope-not-set,config-role-missing,config-worker-unreachable,classify-not-configured,classify-worker-unavailable)
+          ARGS=(--ci --allow-skip=scope-not-set,classify-not-configured,classify-worker-unavailable)
           case "${STAGE:-}" in
             1|2|3) ARGS+=(--stage "$STAGE") ;;
             *)     ;;   # all / empty → full phases 1-6 (phase 5 = the mock-sandbox wire)
           esac
           AGENTKEYS_CHAIN=heima bash harness/v2-demo.sh "${ARGS[@]}"
 
-      - name: Guard — stage-3 step-21 config-worker quarantine must not become permanent (#209)
-        if: ${{ inputs.stage == 'all' || inputs.stage == '3' || inputs.stage == '' }}
-        # stage-3 step 21 (run by v2-demo → phase 3) is the ONLY live proof that the
-        # deployed config worker rejects a cross-data-class cap (POST a memory/cred
-        # cap to config-test.<zone>/v1/config/put → expect 403 cap_data_class_mismatch).
-        # It is allow-skipped via config-worker-unreachable while config-test is
-        # unprovisioned (#209). Per the codex adversarial review (2026-06), that
-        # tolerance must NOT silently persist — unit tests + step 20 cannot catch
-        # deployed route/nginx/TLS/BROKER_CAP_PUBKEY/middleware drift. This guard:
-        #   - config-test UNREACHABLE → ::warning:: every run (visible, links #209),
-        #     CI stays green (the skip is legitimate — the host genuinely isn't up).
-        #   - config-test REACHABLE → ::error:: + exit 1: the infra now exists, so the
-        #     allowance must be dropped (remove config-worker-unreachable from the
-        #     v2-demo --allow-skip) and step 21 must run as a live gate. Fail-loud so
-        #     the quarantine self-dissolves the moment #209 lands.
-        run: |
-          set -uo pipefail
-          # config-test.<zone> is the TEST config worker (distinct from prod
-          # config.<zone>). TEST_BROKER_ZONE is set by the "Compute zone" step via
-          # $GITHUB_ENV; use it directly — do NOT source operator-workstation.env
-          # (whose AGENTKEYS_WORKER_CONFIG_URL could point elsewhere).
-          url="https://config-test.${TEST_BROKER_ZONE:-litentry.org}/healthz"
-          code=$(curl -sS -o /dev/null -w '%{http_code}' --max-time 10 "$url" 2>/dev/null) || true
-          code="${code:-000}"
-          echo "config worker healthz probe: $url → HTTP $code"
-          # Self-dissolving: once the allowance is removed, the guard is inert
-          # (so removing just the allow-skip token, without this step, is safe).
-          if ! grep -q 'allow-skip=.*config-worker-unreachable' .github/workflows/harness-ci.yml; then
-            echo "::notice::config-worker-unreachable allowance already removed — stage-3 step 21 runs as a live gate. This Guard step is now a no-op (safe to delete)."
-            exit 0
-          fi
-          if [ "$code" = "200" ]; then
-            echo "::error::config-test is now REACHABLE ($url → 200), but stage-3 step 21 is still allow-skipped via 'config-worker-unreachable'. Remove it from the v2-demo --allow-skip so step 21 runs as a LIVE config-worker isolation gate, then close #209."
-            exit 1
-          fi
-          echo "::warning::stage-3 step 21 (config-worker cap-data-class isolation, #201) is QUARANTINED — config-test unreachable ($url → HTTP $code). The LIVE config-worker rejection proof is NOT running on CI. Provision config-test via 'setup-cloud.sh --ci' to restore it (#209)."
-
       - name: Clean up harness test data (bots/<actor_omni>/ prefix)
         if: always()
         # Codex M3 mitigation: the harness writes to s3://<bucket>/bots/<actor_omni>/<class>/...

From fd9df6c1c141e0832b44335ef86937d17b10e225 Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 23:01:41 +0800
Subject: [PATCH 15/17] fix: #109 anchor gate must use the env-aware
 SidecarRegistry on the test stack
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The worker-audit env block passed the env-aware CredentialAudit address but
not the registry, so the anti-spam gate fell back to the compiled-in profile
(PROD registry) — on the test stack operatorMasterWallet(test-omni) is zero
there and every batch dropped as 'unregistered' (the anchor-not-recorded
skip in the first green harness run; relay was funded, anchor_enabled=true).
Pass AGENTKEYS_AUDIT_REGISTRY_ADDRESS=$REGISTRY_ADDR like the other workers'
SIDECAR_REGISTRY_ADDRESS_HEIMA.
---
 scripts/setup-broker-host.sh | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/scripts/setup-broker-host.sh b/scripts/setup-broker-host.sh
index fedb8999..1e512454 100755
--- a/scripts/setup-broker-host.sh
+++ b/scripts/setup-broker-host.sh
@@ -1177,6 +1177,10 @@ AGENTKEYS_AUDIT_BATCH_SECONDS=120
 AGENTKEYS_CHAIN=heima
 AGENTKEYS_AUDIT_RPC_URL=$CHAIN_RPC
 AGENTKEYS_AUDIT_CREDENTIAL_AUDIT_ADDRESS=$AUDIT_CONTRACT_ADDR
+# Anti-spam anchor gate checks operatorMasterWallet on THIS registry — must be
+# the env-aware one (the test stack has its own; the compiled-in profile
+# carries prod, which would drop every test batch as "unregistered").
+AGENTKEYS_AUDIT_REGISTRY_ADDRESS=$REGISTRY_ADDR
 AGENTKEYS_AUDIT_RELAY_KEY_FILE=/etc/agentkeys/audit-relay.key
 # Tier-1 cold archive (#109): bucket + instance-role grant provisioned by
 # scripts/provision-audit-archive.sh (setup-cloud.sh step 13).

From eae393daddb2c9e66b78bed345ea2472f1a2c73c Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Thu, 11 Jun 2026 23:28:55 +0800
Subject: [PATCH 16/17] =?UTF-8?q?fix:=20smoke=20anchor=20confirm=20?=
 =?UTF-8?q?=E2=80=94=20cast=20receipt=20status=20prints=20'true'=20on=20th?=
 =?UTF-8?q?is=20cast=20version?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The previous CI run PROVED the #109 anchor loop live (registry-gate fix
worked: the relay anchored within seconds and the poll found the record
first try) — the leg then died on its own assertion: 'status: true' did
not match the *1*-only pattern. Accept true|1|0x1; false/0x0 still dies.
---
 scripts/heima-worker-smoke.sh | 6 ++++--
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/scripts/heima-worker-smoke.sh b/scripts/heima-worker-smoke.sh
index 2b258f25..c072a848 100755
--- a/scripts/heima-worker-smoke.sh
+++ b/scripts/heima-worker-smoke.sh
@@ -318,9 +318,11 @@ else
       ANCHOR_TX=$(echo "$ANCHOR_RECORD" | jq -r '.tx_hash')
       ANCHOR_ROOT=$(echo "$ANCHOR_RECORD" | jq -r '.merkle_root_hex')
       TX_STATUS=$(cast receipt "$ANCHOR_TX" status --rpc-url "$RPC_HTTP" 2>/dev/null || echo "")
+      # cast prints the success status as "true" (newer) / "1"/"0x1" (older) —
+      # match all three; failure ("false"/"0x0") falls through to die.
       case "$TX_STATUS" in
-        *1*) ok "anchor tx confirmed on-chain: $ANCHOR_TX" ;;
-        *)   die "anchor tx $ANCHOR_TX not confirmed (status: ${TX_STATUS:-unreadable})" ;;
+        *true*|*1*) ok "anchor tx confirmed on-chain: $ANCHOR_TX" ;;
+        *)          die "anchor tx $ANCHOR_TX not confirmed (status: ${TX_STATUS:-unreadable})" ;;
       esac
       # Merkle-proof walk (mirrors CredentialAudit.verifyEntryInRoot: leaf
       # prefixed 0x00, internal nodes 0x01 over the sorted pair).

From 8f8bb5d923d51f17775eb66b7cd6098c37e0d82a Mon Sep 17 00:00:00 2001
From: Hanwen Cheng <heawen.cheng@gmail.com>
Date: Fri, 12 Jun 2026 00:08:23 +0800
Subject: [PATCH 17/17] =?UTF-8?q?docs:=20#109=20plan=20=E2=80=94=20anchor?=
 =?UTF-8?q?=20loop=20verified=20live=20in=20CI=20(registry-gate=20+=20cast?=
 =?UTF-8?q?-status=20lessons=20folded)?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 docs/plan/issue-109-two-tier-audit.md | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/docs/plan/issue-109-two-tier-audit.md b/docs/plan/issue-109-two-tier-audit.md
index a60081dd..373c7bc8 100644
--- a/docs/plan/issue-109-two-tier-audit.md
+++ b/docs/plan/issue-109-two-tier-audit.md
@@ -2,8 +2,9 @@
 
 **Status:** shipped in PR #281 (all 10 steps; step 9's assertions live in
 `heima-worker-smoke.sh` rather than a new stage-3 step — no renumbering of the
-23-step demo). Deferred: live anchor-loop verify on a deployed host;
-subscan renderers for op_kinds 90/91. Builds on #229 (data-plane emits + V2 queues) and #97/#270
+23-step demo). Live anchor-loop verify: DONE in PR #281's CI (Heima mainnet test stack —
+SSE backfill + on-chain anchor tx + Merkle proof + tamper-fail all green).
+Deferred: prod-host redeploy (operator); subscan renderers for op_kinds 90/91. Builds on #229 (data-plane emits + V2 queues) and #97/#270
 (control-plane emits + daemon/web audit receipts). Closes the #229-deferred open
 design item "audit-worker-initiated `appendRootV2` chain submission (tier-A relay
 wallet)".