FE-730: Orchestrator POC — dual-engine execution with contract tests by kostandinang · Pull Request #143 · hashintel/brunch

kostandinang · 2026-05-20T18:02:33Z

Orchestrator POC. First take at representing orchestration on a minimal Petri-net structure — the goal is to validate the substrate on a simple fixture and evolve from there with more complex plans, richer action types, and parallel execution. Two interchangeable engines behind a shared seam, driven test-first with fake agents and validated end-to-end with real pi/Sonnet. The plan schema is speculative — brunch does not yet emit execution plans; the YAML shape is forward-compatible and will sharpen as canonical plan output lands.

CLI

Usage: brunch cook <dir> [flags]

Flags:
  --engine=proc|petri  Execution engine (default: petri)
  --max-retries=N      Retry budget per slice (default: 3)
  --verbose, -v        Show raw pi-agent output
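
As a rough illustration of how these flags and defaults could be parsed, here is a minimal sketch. It is not the PR's `cook-cli.ts`: the function name `parseCookArgsSketch`, the `CookArgs` shape, and the error messages are assumptions, and it is stricter than the "finite non-negative" check mentioned in the commit log because it also rejects fractional retry budgets.

```typescript
// Hypothetical sketch only; names and messages are not taken from cook-cli.ts.
type Engine = "proc" | "petri";

interface CookArgs {
  dir: string;
  engine: Engine;
  maxRetries: number;
  verbose: boolean;
}

function parseCookArgsSketch(argv: string[]): CookArgs {
  let dir: string | undefined;
  let engine: Engine = "petri"; // documented default
  let maxRetries = 3; // documented default
  let verbose = false;

  for (const arg of argv) {
    if (arg === "--verbose" || arg === "-v") {
      verbose = true;
    } else if (arg.startsWith("--engine=")) {
      const value = arg.slice("--engine=".length);
      if (value === "proc" || value === "petri") {
        engine = value;
      } else {
        throw new Error(`unknown engine: ${value}`);
      }
    } else if (arg.startsWith("--max-retries=")) {
      const raw = arg.slice("--max-retries=".length);
      const value = Number(raw);
      // Reject empty input, NaN, Infinity, negatives, and fractions so a retry loop always terminates.
      if (raw === "" || !Number.isInteger(value) || value < 0) {
        throw new Error(`--max-retries must be a non-negative integer, got: ${raw}`);
      }
      maxRetries = value;
    } else if (arg.startsWith("-")) {
      throw new Error(`unknown flag: ${arg}`);
    } else if (dir === undefined) {
      dir = arg; // first positional argument is <dir>
    } else {
      throw new Error(`unexpected argument: ${arg}`);
    }
  }

  if (dir === undefined) throw new Error("usage: brunch cook <dir> [flags]");
  return { dir, engine, maxRetries, verbose };
}
```

Validating the retry budget up front matters because a `NaN` budget would make any `retries < maxRetries` comparison permanently false or permanently true, depending on how the loop is written.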

Architecture

graph TD
    CLI["brunch cook &lt;dir&gt;"] --> Loader["Plan Loader"]
    Loader --> Engine

    subgraph Engine["Orchestrator Interface"]
        Proc["proc engine"]
        Petri["petri engine"]
    end

    Engine --> Actions["Action Handlers"]
    Engine --> Runner["Test Runner"]
    Actions --> Pi["pi CLI"]
    Engine --> Reports["reports.jsonl"]
    Engine --> WT["Worktree"]

Per-slice TDD loop

graph TD
    subgraph "Petri net inner loop"
        Ready(("spec\nready")) -->|evaluate| NM(("needs\nmore"))
        Ready -->|evaluate| Done(("done"))
        NM -->|write-tests| FT(("failing\ntests"))
        FT -->|write-code| UC(("untested\ncode"))
        UC -->|"run-tests ✓"| Ready
        UC -->|"run-tests ✗"| FT
        Done --> Comp(("completed"))
    end
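
The loop in the diagram can be sketched as a tiny net interpreter that fires enabled transitions until none remain. This is a simplified, synchronous illustration under stated assumptions: the `Transition` shape, `runUntilQuiescent`, the `complete` transition name, and the fake agents are all hypothetical, and the engine in this PR uses async `fire` functions and carries tokens with data.

```typescript
// Hypothetical sketch: counts tokens per place and fires until quiescence.
interface Transition {
  id: string;
  inputs: string[]; // places that must each hold a token
  fire: () => string[]; // returns the places that receive a token
}

function runUntilQuiescent(
  transitions: Transition[],
  initial: Record<string, number>,
  maxFirings = 100, // safety bound so a misbehaving net cannot loop forever
): { marking: Map<string, number>; trace: string[] } {
  const marking = new Map<string, number>(Object.entries(initial));
  const trace: string[] = [];
  for (let i = 0; i < maxFirings; i++) {
    const enabled = transitions.find((t) => t.inputs.every((p) => (marking.get(p) ?? 0) > 0));
    if (enabled === undefined) break; // quiescence: nothing can fire
    for (const p of enabled.inputs) marking.set(p, (marking.get(p) ?? 0) - 1);
    for (const p of enabled.fire()) marking.set(p, (marking.get(p) ?? 0) + 1);
    trace.push(enabled.id);
  }
  return { marking, trace };
}

// Fake agents: the first evaluation says "needs more", the first test run fails.
let evals = 0;
let testRuns = 0;
const sliceNet: Transition[] = [
  { id: "evaluate", inputs: ["spec-ready"], fire: () => [++evals === 1 ? "needs-more" : "done"] },
  { id: "write-tests", inputs: ["needs-more"], fire: () => ["failing-tests"] },
  { id: "write-code", inputs: ["failing-tests"], fire: () => ["untested-code"] },
  { id: "run-tests", inputs: ["untested-code"], fire: () => [++testRuns === 1 ? "failing-tests" : "spec-ready"] },
  { id: "complete", inputs: ["done"], fire: () => ["completed"] },
];

const result = runUntilQuiescent(sliceNet, { "spec-ready": 1 });
// result.trace: evaluate, write-tests, write-code, run-tests, write-code, run-tests, evaluate, complete
```

Note that the output place is chosen inside `fire`, which mirrors the diagram's two `evaluate` and two `run-tests` edges but keeps those branches out of the declared structure.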

Key decisions

- Inline action dispatch: engines call handlers directly; ActionRegistry deferred until a 3rd action type lands
- reports.jsonl as communication medium: petri enforces token-pointer discipline; proc passes data normally; shared seam is inputs/outputs
- Cwd-scoped worktree: runs at <cwd>/.cook/runs/<runId>/, fixtures stay pristine
- Text mode + JSON extraction: pi runs --mode text; extractJson() parses evaluator responses
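
To make the last decision concrete, here is one way a JSON object could be pulled out of free-form text output. This is a guess at the idea, not the PR's `extractJson()`: the name `extractJsonSketch` and the brace-matching strategy are assumptions.

```typescript
// Hypothetical sketch: return the first parseable JSON object embedded in text.
function extractJsonSketch(text: string): unknown {
  for (let start = text.indexOf("{"); start !== -1; start = text.indexOf("{", start + 1)) {
    let depth = 0;
    let inString = false;
    let escaped = false;
    for (let i = start; i < text.length; i++) {
      const ch = text[i];
      if (inString) {
        // Braces inside string literals must not affect nesting depth.
        if (escaped) escaped = false;
        else if (ch === "\\") escaped = true;
        else if (ch === '"') inString = false;
        continue;
      }
      if (ch === '"') inString = true;
      else if (ch === "{") depth++;
      else if (ch === "}" && --depth === 0) {
        try {
          return JSON.parse(text.slice(start, i + 1));
        } catch {
          break; // balanced but not valid JSON; try the next "{"
        }
      }
    }
  }
  throw new Error("no JSON object found in agent output");
}

const verdict = extractJsonSketch(
  'Use {braces} carefully. Verdict: {"done": false, "reasoning": "missing {tests}"} End.',
);
```

Scanning for a balanced object tolerates prose before and after the payload, which is the main hazard of running an agent in text mode.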

Verification

36 tests — 24 contract (9 scenarios × 2 engines), 6 CLI, 3 plan-loader, 3 worktree
Fixture #1 completed on both engines — proc and petri each built a txt CLI from empty worktree
154 agent-written tests pass in built artifact

Proc vs Petri

| | Procedural | Petri-net |
|---|---|---|
| Approach | Topo-sort + while-loop + retry counter | Compile plan into places/transitions/tokens, fire until quiescence |
| State | Local variables; debugger/printf friendly | Token distribution across named places; inspectable but requires net topology knowledge |
| Adding actions | Add a call in the inner loop | Add a place + transition, wire tokens |
| Parallelism | Needs `Promise.all` (a structural change) | Ready: independent slices fire concurrently if interpreter supports it |
| Fixture #1 | ~9 min, 23 events | ~13 min, 27 events |

Verdict: Proc wins on simplicity and debuggability. Petri earns its complexity only when parallel execution or dynamic replanning enters scope. More testing of the Petri-net engine is needed to understand how it behaves under parallelism.
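
For the procedural column, the topo-sort step might look like the sketch below. The `topoSort<T>(items, getId, getDeps)` signature comes from the commit log; the body, the name `topoSortSketch`, and the `dependsOn` field are assumptions for illustration only.

```typescript
// Hypothetical sketch: depth-first topological sort with cycle detection.
function topoSortSketch<T>(
  items: T[],
  getId: (item: T) => string,
  getDeps: (item: T) => string[],
): T[] {
  const byId = new Map<string, T>();
  for (const item of items) byId.set(getId(item), item);

  const state = new Map<string, "visiting" | "done">();
  const sorted: T[] = [];

  const visit = (item: T): void => {
    const id = getId(item);
    const seen = state.get(id);
    if (seen === "done") return;
    if (seen === "visiting") throw new Error(`dependency cycle at ${id}`);
    state.set(id, "visiting");
    for (const depId of getDeps(item)) {
      const dep = byId.get(depId);
      if (dep === undefined) throw new Error(`${id} depends on unknown ${depId}`);
      visit(dep); // dependencies are emitted before their dependents
    }
    state.set(id, "done");
    sorted.push(item);
  };

  for (const item of items) visit(item);
  return sorted;
}

// Epics from the txt fixture, listed out of order on purpose.
const epics = [
  { id: "text-ops", dependsOn: ["scaffolding"] },
  { id: "scaffolding", dependsOn: [] as string[] },
];
const epicOrder = topoSortSketch(epics, (e) => e.id, (e) => e.dependsOn).map((e) => e.id);
// epicOrder: ["scaffolding", "text-ops"]
```

The same generic can order slices within an epic, which is presumably why the two earlier sort functions could be collapsed into one.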

Out of scope

Milestones, resumability, parallel execution, brownfield seed, ActionRegistry abstraction, dynamic replanning.

Fixture: `txt`

Greenfield TypeScript CLI built from nothing — 2 epics, 5 slices:

scaffolding → --version, --help (lists subcommands)
text-ops (depends on scaffolding) → reverse, count, slugify

Exercises: happy-path TDD cycles, intra-epic slice deps (help-flag waits on version-flag), inter-epic deps (text-ops waits on scaffolding), epic-level integration verification, and the retry loop (slugify edge cases).

Running examples

TBD — sample CLI output and fixture run recordings to be added.

brunch cook
  ──────────────────────────────────────
  engine     petri
  plan       2 epics, 5 slices
  retries    3
  worktree   /Users/kostandin/Projects/hashdev/brunch/.cook/runs/ea8cd215-16eb-4a2c-8fc8-c10bbbbe9926/worktree
  reports    /Users/kostandin/Projects/hashdev/brunch/.cook/runs/ea8cd215-16eb-4a2c-8fc8-c10bbbbe9926/reports.jsonl

     0.0s  ?  evaluate  version-flag
    40.4s  ✓  evaluate  version-flag (40.4s)
    40.5s  ○  verdict   version-flag → NEEDS WORK
    40.5s  ▸  tests     version-flag
   278.5s  ✓  tests     version-flag (238.0s)
   278.5s  ▸  code      version-flag
   320.0s  ✓  code      version-flag (41.5s)
   321.2s  ?  evaluate  version-flag
   342.7s  ✓  evaluate  version-flag (21.5s)
   342.7s  ●  verdict   version-flag → DONE
   342.7s  ?  evaluate  help-flag
   361.9s  ✓  evaluate  help-flag (19.2s)
   361.9s  ○  verdict   help-flag → NEEDS WORK
   361.9s  ▸  tests     help-flag
   398.6s  ✓  tests     help-flag (36.7s)
   398.6s  ▸  code      help-flag
   443.3s  ✓  code      help-flag (44.8s)
   445.0s  ?  evaluate  help-flag
   463.9s  ✓  evaluate  help-flag (18.9s)
   463.9s  ●  verdict   help-flag → DONE
   463.9s  ▸  verify    scaffolding
   554.9s  ✓  verify    scaffolding (write) (91.0s)
   557.5s  ✓  verify    tests/cli-scaffolding.integration.test.ts
   557.5s  ●  epic      scaffolding → PASS
   557.5s  ?  evaluate  reverse
   593.7s  ✓  evaluate  reverse (36.2s)
   593.7s  ○  verdict   reverse → NEEDS WORK
   593.7s  ▸  tests     reverse
   650.4s  ✓  tests     reverse (56.6s)
   650.4s  ▸  code      reverse
   701.7s  ✓  code      reverse (51.3s)
   703.1s  ?  evaluate  reverse
   721.4s  ✓  evaluate  reverse (18.3s)
   721.4s  ●  verdict   reverse → DONE
   721.4s  ?  evaluate  count
   747.2s  ✓  evaluate  count (25.8s)
   747.2s  ○  verdict   count → NEEDS WORK
   747.2s  ▸  tests     count
   856.1s  ✓  tests     count (108.8s)
   856.1s  ▸  code      count
   917.4s  ✓  code      count (61.3s)
   918.8s  ?  evaluate  count
   944.3s  ✓  evaluate  count (25.5s)
   944.3s  ●  verdict   count → DONE
   944.3s  ?  evaluate  slugify
   963.3s  ✓  evaluate  slugify (18.9s)
   963.3s  ○  verdict   slugify → NEEDS WORK
   963.3s  ▸  tests     slugify
  1031.8s  ✓  tests     slugify (68.6s)
  1032.2s  ▸  code      slugify
  1084.8s  ✓  code      slugify (52.6s)
  1086.7s  ?  evaluate  slugify
  1131.6s  ✓  evaluate  slugify (45.0s)
  1131.6s  ●  verdict   slugify → DONE
  1131.6s  ▸  verify    text-ops
  1249.2s  ✓  verify    text-ops (write) (117.6s)
  1252.2s  ✓  verify    tests/text-ops-pipe.integration.test.ts
  1252.3s  ●  epic      text-ops → PASS

  ──────────────────────────────────────
  ✓  completed  (20m 52s)

  ✓  scaffolding
     ✓ version-flag  ✓ help-flag
  ✓  text-ops
     ✓ reverse  ✓ count  ✓ slugify

  27 events → /Users/kostandin/Projects/hashdev/brunch/.cook/runs/ea8cd215-16eb-4a2c-8fc8-c10bbbbe9926/reports.jsonl

Reports.jsonl

{"id":"rpt-evaluator-version-flag-1779363759893-0","ts":"2026-05-21T11:42:39.893Z","epicId":"scaffolding","sliceId":"version-flag","actor":"evaluator","event":"eval-done","payload":{"done":false,"reasoning":"The verification target `tests/version.test.ts` does not exist in the worktree. The worktree directory is empty (only contains `.` and `..`). No test file has been created to verify the `--version` flag implementation. To satisfy this slice, a test file must be created at `tests/version.test.ts` that defines and tests the functionality of adding a `--version` flag that prints the version from package.json, and those tests must pass when run with `bun test`."}}
{"id":"rpt-test-writer-version-flag-1779363997897-1","ts":"2026-05-21T11:46:37.898Z","epicId":"scaffolding","sliceId":"version-flag","actor":"test-writer","event":"tests-written","payload":{"sliceId":"version-flag","targets":["tests/version.test.ts"]}}
{"id":"rpt-code-writer-version-flag-1779364039448-2","ts":"2026-05-21T11:47:19.449Z","epicId":"scaffolding","sliceId":"version-flag","actor":"code-writer","event":"code-written","payload":{"sliceId":"version-flag"}}
{"id":"rpt-test-runner-version-flag-1779364040678-3","ts":"2026-05-21T11:47:20.678Z","epicId":"scaffolding","sliceId":"version-flag","actor":"test-runner","event":"tests-run","payload":{"passed":true,"output":"bun test v1.3.0 (b0a6feca)\n"}}
{"id":"rpt-evaluator-version-flag-1779364062168-4","ts":"2026-05-21T11:47:42.168Z","epicId":"scaffolding","sliceId":"version-flag","actor":"evaluator","event":"eval-done","payload":{"done":true,"reasoning":"All 7 tests in tests/version.test.ts pass successfully. The implementation correctly handles the --version flag specification:\n\n✅ Exits with code 0 when --version is passed\n✅ Prints the version from package.json to stdout\n✅ Does not write to stderr\n✅ Does not launch web UI or show help banners\n✅ Supports -V short flag variant\n✅ Version output is semver-formatted\n✅ Version output is single line\n\nThe implementation in src/server/cli.ts correctly reads from package.json and outputs the version before exiting. All verification targets are satisfied."}}
{"id":"rpt-evaluator-help-flag-1779364081347-5","ts":"2026-05-21T11:48:01.348Z","epicId":"scaffolding","sliceId":"help-flag","actor":"evaluator","event":"eval-done","payload":{"done":false,"reasoning":"Verification target `tests/help.test.ts` does not exist. The test file that should validate the --help flag functionality (listing subcommands: reverse, count, slugify) is missing entirely. No tests can be run to verify the implementation. The slice specification requires both the implementation of the --help flag and the corresponding test coverage."}}
{"id":"rpt-test-writer-help-flag-1779364118018-6","ts":"2026-05-21T11:48:38.018Z","epicId":"scaffolding","sliceId":"help-flag","actor":"test-writer","event":"tests-written","payload":{"sliceId":"help-flag","targets":["tests/help.test.ts"]}}
{"id":"rpt-code-writer-help-flag-1779364162780-7","ts":"2026-05-21T11:49:22.781Z","epicId":"scaffolding","sliceId":"help-flag","actor":"code-writer","event":"code-written","payload":{"sliceId":"help-flag"}}
{"id":"rpt-test-runner-help-flag-1779364164386-8","ts":"2026-05-21T11:49:24.386Z","epicId":"scaffolding","sliceId":"help-flag","actor":"test-runner","event":"tests-run","payload":{"passed":true,"output":"bun test v1.3.0 (b0a6feca)\n"}}
{"id":"rpt-evaluator-help-flag-1779364183312-9","ts":"2026-05-21T11:49:43.312Z","epicId":"scaffolding","sliceId":"help-flag","actor":"evaluator","event":"eval-done","payload":{"done":true,"reasoning":"All 10 tests in tests/help.test.ts pass. The verification targets comprehensively cover the slice specification: --help flag exits with code 0, prints help text to stdout, lists all three subcommands (reverse, count, slugify), supports -h short flag, does not launch the web UI, and includes a usage/commands header. No test failures or missing tests."}}
{"id":"rpt-orchestrator-version-flag-1779364276953-10","ts":"2026-05-21T11:51:16.953Z","epicId":"scaffolding","sliceId":"version-flag","actor":"orchestrator","event":"epic-verified","payload":{"passed":true}}
{"id":"rpt-evaluator-reverse-1779364313167-11","ts":"2026-05-21T11:51:53.167Z","epicId":"text-ops","sliceId":"reverse","actor":"evaluator","event":"eval-done","payload":{"done":false,"reasoning":"The verification target file 'tests/reverse.test.ts' does not exist in the worktree. Additionally, the reverse subcommand is listed in the help output but has no actual implementation in src/server/cli.ts — there is no handler for rawArgs[0] === 'reverse', no pure string reversal function, and no wiring to argv[2]. The specification requires tests to exist and pass, but the test file is completely missing, making it impossible to verify the slice is satisfied."}}
{"id":"rpt-test-writer-reverse-1779364369807-12","ts":"2026-05-21T11:52:49.807Z","epicId":"text-ops","sliceId":"reverse","actor":"test-writer","event":"tests-written","payload":{"sliceId":"reverse","targets":["tests/reverse.test.ts"]}}
{"id":"rpt-code-writer-reverse-1779364421099-13","ts":"2026-05-21T11:53:41.099Z","epicId":"text-ops","sliceId":"reverse","actor":"code-writer","event":"code-written","payload":{"sliceId":"reverse"}}
{"id":"rpt-test-runner-reverse-1779364422488-14","ts":"2026-05-21T11:53:42.488Z","epicId":"text-ops","sliceId":"reverse","actor":"test-runner","event":"tests-run","payload":{"passed":true,"output":"bun test v1.3.0 (b0a6feca)\n"}}
{"id":"rpt-evaluator-reverse-1779364440832-15","ts":"2026-05-21T11:54:00.832Z","epicId":"text-ops","sliceId":"reverse","actor":"evaluator","event":"eval-done","payload":{"done":true,"reasoning":"All 22 tests in tests/reverse.test.ts pass successfully. The test suite verifies: (1) the pure `reverse()` function is exported and correctly reverses strings of all types (ASCII, unicode-compatible, with spaces/numbers/punctuation, palindromes, empty strings, single chars), (2) the CLI `reverse` subcommand exits with code 0, (3) the subcommand reads from argv[2] and outputs the reversed string to stdout, (4) output is newline-terminated with no extra lines, and (5) stderr is empty on normal invocation. All specification requirements are satisfied."}}
{"id":"rpt-evaluator-count-1779364466667-16","ts":"2026-05-21T11:54:26.668Z","epicId":"text-ops","sliceId":"count","actor":"evaluator","event":"eval-done","payload":{"done":false,"reasoning":"The verification target file `tests/count.test.ts` does not exist. The slice specification requires a `count` subcommand that counts whitespace-separated words with empty input returning 0, but there is no test file to verify this functionality. The worktree contains only: cli-scaffolding.integration.test.ts, help.test.ts, reverse.test.ts, and version.test.ts. Neither the test file nor any implementation of the count function exists."}}
{"id":"rpt-test-writer-count-1779364575496-17","ts":"2026-05-21T11:56:15.496Z","epicId":"text-ops","sliceId":"count","actor":"test-writer","event":"tests-written","payload":{"sliceId":"count","targets":["tests/count.test.ts"]}}
{"id":"rpt-code-writer-count-1779364636796-18","ts":"2026-05-21T11:57:16.796Z","epicId":"text-ops","sliceId":"count","actor":"code-writer","event":"code-written","payload":{"sliceId":"count"}}
{"id":"rpt-test-runner-count-1779364638241-19","ts":"2026-05-21T11:57:18.242Z","epicId":"text-ops","sliceId":"count","actor":"test-runner","event":"tests-run","payload":{"passed":true,"output":"bun test v1.3.0 (b0a6feca)\n"}}
{"id":"rpt-evaluator-count-1779364663762-20","ts":"2026-05-21T11:57:43.762Z","epicId":"text-ops","sliceId":"count","actor":"evaluator","event":"eval-done","payload":{"done":true,"reasoning":"All 26 tests in tests/count.test.ts pass successfully. The implementation includes: (1) a pure count() function exported from src/server/count.ts that counts whitespace-separated words and returns 0 for empty/whitespace-only input, (2) a count CLI subcommand that reads argv[2] and prints the word count to stdout with proper exit code and formatting. Test coverage includes pure function behavior (empty strings, whitespace delimiters, leading/trailing whitespace, tabs, newlines, long inputs) and CLI behavior (exit codes, stdout output, empty arguments, whitespace handling, output formatting). No failing tests."}}
{"id":"rpt-evaluator-slugify-1779364682705-21","ts":"2026-05-21T11:58:02.705Z","epicId":"text-ops","sliceId":"slugify","actor":"evaluator","event":"eval-done","payload":{"done":false,"reasoning":"The verification target tests/slugify.test.ts does not exist. The test file is required to validate the slice specification, but it is missing from the tests/ directory. The current directory only contains: cli-scaffolding.integration.test.ts, count.test.ts, help.test.ts, reverse.test.ts, and version.test.ts. No implementation of the slugify subcommand or its tests have been created."}}
{"id":"rpt-test-writer-slugify-1779364751272-22","ts":"2026-05-21T11:59:11.274Z","epicId":"text-ops","sliceId":"slugify","actor":"test-writer","event":"tests-written","payload":{"sliceId":"slugify","targets":["tests/slugify.test.ts"]}}
{"id":"rpt-code-writer-slugify-1779364804214-23","ts":"2026-05-21T12:00:04.214Z","epicId":"text-ops","sliceId":"slugify","actor":"code-writer","event":"code-written","payload":{"sliceId":"slugify"}}
{"id":"rpt-test-runner-slugify-1779364806089-24","ts":"2026-05-21T12:00:06.089Z","epicId":"text-ops","sliceId":"slugify","actor":"test-runner","event":"tests-run","payload":{"passed":true,"output":"bun test v1.3.0 (b0a6feca)\n"}}
{"id":"rpt-evaluator-slugify-1779364851082-25","ts":"2026-05-21T12:00:51.082Z","epicId":"text-ops","sliceId":"slugify","actor":"evaluator","event":"eval-done","payload":{"done":true,"reasoning":"All 46 tests in tests/slugify.test.ts pass. The implementation satisfies the slice specification:\n\n1. Pure function tests (24 tests) verify:\n   - Lowercasing (all-uppercase, mixed-case)\n   - Non-alphanumeric replacement with dashes (spaces, hyphens, underscores, dots, special chars)\n   - Dash collapsing (consecutive dashes, mixed separators)\n   - Leading/trailing dash trimming\n   - Numeric preservation\n   - Edge cases (empty strings, whitespace-only, special-char-only)\n\n2. Unicode diacritic tests (10 tests) verify:\n   - Diacritic stripping for é, ü, ö, à, Ñ, and others\n   - Combined diacritic + case handling\n\n3. CLI subcommand tests (12 tests) verify:\n   - Exit code 0 on success\n   - Correct stdout output (slug on single line with newline termination)\n   - No stderr output on normal invocation\n   - All slugify behaviors work through CLI interface\n\nImplementation in src/server/slugify.ts uses Unicode NFD normalization + combining-mark removal for diacritics, followed by the required transformations. CLI integration in src/server/cli.ts correctly handles the 'slugify' subcommand."}}
{"id":"rpt-orchestrator-reverse-1779364971688-26","ts":"2026-05-21T12:02:51.688Z","epicId":"text-ops","sliceId":"reverse","actor":"orchestrator","event":"epic-verified","payload":{"passed":true}}
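
Because each line is a standalone JSON object, the log is easy to post-process. The sketch below derives the final evaluator verdict per slice; the field names match the lines above, while the `ReportLineSketch` interface and both helper functions are hypothetical and not part of the PR.

```typescript
// Hypothetical sketch: field names are inferred from the log lines above.
interface ReportLineSketch {
  id: string;
  ts: string;
  epicId: string;
  sliceId: string;
  actor: string;
  event: string;
  payload: Record<string, unknown>;
}

function parseReports(jsonl: string): ReportLineSketch[] {
  return jsonl
    .split("\n")
    .filter((line) => line.trim() !== "") // tolerate a trailing newline
    .map((line) => JSON.parse(line) as ReportLineSketch);
}

function finalVerdicts(reports: ReportLineSketch[]): Map<string, boolean> {
  const verdicts = new Map<string, boolean>();
  for (const report of reports) {
    const done = report.payload["done"];
    if (report.event === "eval-done" && typeof done === "boolean") {
      verdicts.set(report.sliceId, done); // later evaluations overwrite earlier ones
    }
  }
  return verdicts;
}
```

Reading the log in append order is what makes "last verdict wins" meaningful: a slice that first reports `done: false` and later `done: true` ends up as done.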

cursor · 2026-05-21T08:48:16Z

PR Summary

Low Risk
Low risk overall: mostly adds documentation/planning artifacts and a fixture YAML, with a small build-time change to copy additional prompt assets that could affect server runtime packaging if paths are wrong.

Overview
Adds an Orchestrator POC design doc and a first greenfield fixture execution plan (fixtures/txt/plan.yaml) describing epics/slices used by brunch cook.

Updates build/runtime hygiene: gitignores .cook/ (run artifacts) and .antigravitycli/, and extends the server runtime Vite build to also copy prompt assets from src/orchestrator/prompts into the dist output.

Refreshes internal planning docs (memory/CARDS.md, memory/PLAN.md) to track the orchestrator/petri-net roadmap (new orchestrator-poc, petri-semantic-lanes, petri-parallel-execution frontiers) and sequencing.

Reviewed by Cursor Bugbot for commit 934ea57.

augmentcode · 2026-05-21T08:51:29Z

🤖 Augment PR Summary

Summary: Adds an Orchestrator POC behind brunch cook <dir> that executes a YAML plan (epics → slices) via a TDD inner loop and records an append-only reports.jsonl event log.

Changes:

Introduces orchestrator plan/types plus report sinks (in-memory and file-backed JSONL).
Implements two interchangeable engines behind a shared Orchestrator interface: procedural (engine-proc) and Petri-net-based (engine-petri).
Adds a contract test suite to ensure both engines exhibit identical observable behavior using fake actions and a fake test runner.
Adds YAML plan loading (yaml dependency) and worktree isolation under <cwd>/.cook/runs/<runId>/worktree/.
Wires the new cook subcommand into the main CLI, including --engine, --max-retries, and --verbose flags.
Adds pi-backed action dispatch (test writer / code writer / evaluator) with dedicated prompt assets and runtime copying.

Technical Notes: Deterministic verification uses orchestrator-owned bun test runs; .cook/ is gitignored to keep runs out of version control.

augmentcode

Review completed. 5 suggestions posted.

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

There are 3 total unresolved issues (including 1 from previous review).

Reviewed by Cursor Bugbot for commit dee641a.

lunelson · 2026-05-22T09:33:21Z

+import type { Orchestrator, OrchestratorInput, OrchestratorResult } from './types.js';
+
+// ---------------------------------------------------------------------------
+// ProceduralOrchestrator — same compiled net, serial firing policy.


In the current landed shape, the “procedural” and “petri” engines are sharing the same compiled net and the same serial interpreter; I assume this is a temporary phase 0 thing?

lunelson · 2026-05-22T09:33:21Z

+      id: `${sid}:evaluate`,
+      inputs: [p(sid, 'spec-ready'), p(sid, 'test-agent')],
+      fire: async (consumed) => {
+        const reportId = await actions['evaluate-done'](actCtx);


I think you've got a good split between control state in places/tokens and substantive handoff state in reports, but we should keep it as an open question whether this balance is ultimately right, or whether more metadata/state should eventually be pulled into tokens.

lunelson · 2026-05-22T09:33:22Z

+export type TransitionDef = {
+  id: string;
+  inputs: string[];
+  fire: (consumed: Token[]) => Promise<{ place: string; token: Token }[]>;


I see transitions currently declare only their input places structurally; output arcs live inside the imperative fire() closure. That gives the runtime a Petri-ish control shape, but it also means the compiled net is only partially declarative and would not be formally analyzable.
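
One way to picture the gap: if each transition also declared its output places, the arcs could be listed without firing anything, and the runtime could reject tokens sent to undeclared places. The sketch below is only an illustration of that idea. `DeclaredTransition`, `staticArcs`, `fireChecked`, and the `Token` shape are hypothetical, `fire` is synchronous here although the PR's is async, and none of this is the FE-738 implementation.

```typescript
// Hypothetical sketch: transitions with declared output arcs.
interface Token {
  reportId: string; // assumption: tokens point at report lines
}

interface DeclaredTransition {
  id: string;
  inputs: string[];
  outputs: string[]; // every place this transition may produce into
  fire: (consumed: Token[]) => { place: string; token: Token }[];
}

// Topology is readable without executing any handler.
function staticArcs(transitions: DeclaredTransition[]): string[] {
  const arcs: string[] = [];
  for (const t of transitions) {
    for (const place of t.inputs) arcs.push(`${place} -> ${t.id}`);
    for (const place of t.outputs) arcs.push(`${t.id} -> ${place}`);
  }
  return arcs;
}

// Runtime guard: a handler may only produce into places it declared.
function fireChecked(t: DeclaredTransition, consumed: Token[]): { place: string; token: Token }[] {
  const produced = t.fire(consumed);
  for (const { place } of produced) {
    if (!t.outputs.includes(place)) {
      throw new Error(`${t.id} produced a token for undeclared place ${place}`);
    }
  }
  return produced;
}

const runTests: DeclaredTransition = {
  id: "run-tests",
  inputs: ["untested-code"],
  outputs: ["spec-ready", "failing-tests"],
  fire: (consumed) => consumed.map((token) => ({ place: "spec-ready", token })),
};
```

Declared outputs still leave the choice between `spec-ready` and `failing-tests` inside the handler; splitting that choice into separate guarded transitions would be the fully declarative form.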

lunelson · 2026-05-22T09:33:22Z

+ * Create an isolated run directory under `baseDir/.cook/runs/<runId>/`.
+ * `baseDir` should be cwd (not the fixture directory) so fixtures stay pristine.
+ */
+export function createWorktree(baseDir: string, runId?: string): WorktreeInfo {


In the implemented path, "worktree" currently means “isolated run directory” rather than a Git worktree; it might be good to disambiguate this.

lunelson

I think as a POC this is great. I just want to make clear that it diverges from true Petri-net properties, because output places are dynamically determined by the transition. This might need some remodelling; otherwise we're precluding some of the theoretical benefits.

kostandinang · 2026-05-22T13:09:27Z

Thanks, this is a fair point. This PR reflects a first take at the PoC, focusing on the overall architecture rather than going deep on the Petri interaction, so topology-level analysis is not yet possible.

FE-738 (petri-semantic-lanes) moved this partway by separating topology compilation from runtime wiring and adding declared output sets to handlers.

The remaining work, splitting conditional handlers into explicit graph transitions with declared outputs/guards, is in progress and will land in subsequent PRs; that should address this.

Appreciate the early catch here.

- Add orchestrator capability requirements (R46–R50) to SPEC.md
- Add decisions D155-K–D159-K (dual-engine, reports.jsonl, ActionRegistry, plan model, worktree isolation)
- Add invariants I121-K–I123-K (contract test parity, token discipline, worktree safety)
- Add orchestrator lexicon entries
- Add orchestrator-poc frontier definition to PLAN.md
- Move design doc to docs/design/orchestrator.md
- Update Linear FE-730 description to match design doc

Co-authored-by: Amp <amp@ampcode.com>

#1

- types.ts: Plan, Epic, Slice, Orchestrator seam, ReportSink, ActionHandlers
- report-sink.ts: InMemoryReportSink (append + query by id)
- engine-proc.ts: ProceduralOrchestrator with TDD inner loop, topo-sort, epic-level verification, retry loop
- engine-contract.test.ts: 4 tests (status completed, correct outcomes, TDD cycle call order, report sink contents)
- Code lives under src/orchestrator/ (cook is CLI subcommand name only)
- All 4 contract tests pass; npm run verify clean

Co-authored-by: Amp <amp@ampcode.com>

… CLI, fixture

- plan-loader.ts: YAML parsing with validation (3 tests)
- test-runner.ts: BunTestRunner wrapping `bun test`
- worktree.ts: createWorktree with .cook/runs/<runId>/worktree/ (2 tests)
- file-report-sink.ts: JSONL-backed ReportSink with stdout streaming
- pi-actions.ts: createPiActions() dispatching pi CLI for each agent role
- prompts/: test-writer.md, code-writer.md, evaluator.md
- cook-cli.ts: parseCookArgs + runCook wiring everything together (5 tests)
- cli.ts: `brunch cook` command registered alongside agent
- fixtures/txt/plan.yaml: Fixture #1 (2 epics, 5 slices)
- 34 orchestrator tests pass; build clean with cook-cli chunk

Design doc §8: worktree at <cwd>/.cook/runs/ not <dir>/.cook/runs/
R49, D159-K, I123-K: updated to cwd-scoped worktree
Lexicon: worktree entry clarifies cwd-scoped
Card 16 scoped: cwd worktree + fixture cleanup

Co-authored-by: Amp <amp@ampcode.com>

- cook-cli: structured header/footer with engine, plan, worktree, retries; per-epic/slice result table; total duration
- pi-actions: elapsed timer from session start, compact one-line-per-action with icons (▸ start, ✓ done, ✗ fail, ● verdict, ○ needs work, ? evaluate)
- file-report-sink: stop streaming raw JSON to stdout; JSON stays in file only
- 35 tests pass, build clean

The field was always the agent working directory, not the fixture directory. Also removes unused ReportLine import from engine-proc. Co-authored-by: Amp <amp@ampcode.com>

topoSort<T>(items, getId, getDeps) replaces topoSort(epics) + topoSortSlices(slices). Co-authored-by: Amp <amp@ampcode.com>

report-helpers.ts: createReport(sink, fields) handles id generation + timestamp + append. Replaces 5 inline report-construction sites across engine-proc, engine-petri, and pi-actions. Co-authored-by: Amp <amp@ampcode.com>

Delete old module-level callOrder/evalCallCount/fakeActions/fakeTestRunner. All 9 contract test suites now use the same createFakes() factory. ~100 lines removed. Co-authored-by: Amp <amp@ampcode.com>

Co-authored-by: Amp <amp@ampcode.com>

…sults

- Status banner: landed POC with SPEC cross-references
- §2 seam: fixtureDir → worktreeDir, ActionRegistry → ActionHandlers
- §3: POC note pointing to §12 deferral
- §12: streaming UX row updated (implemented, not deferred)
- §13: experiment results with verdict (proc wins on simplicity)

Co-authored-by: Amp <amp@ampcode.com>

…propagation, verify-epic parity

- cook-cli: validate --max-retries is finite non-negative (prevents NaN infinite loop)
- engine-petri: epic deps use single transition with ALL dep-done places as inputs (was one transition per dep → fired on first dep instead of all)
- engine-petri: PetriNet.run() accepts shouldHalt callback, checked each iteration (was ignoring ctx.halted so transitions kept firing after a halt)
- engine-proc: verify-epic called once per epic, not once per verification entry (handler owns all targets; matches petri engine behavior)

Co-authored-by: Amp <amp@ampcode.com>

- engine-petri: unreached slices/epics now set ctx.halted=true so the overall status correctly reports 'halted' instead of 'completed'
- report-helpers: append monotonic sequence counter to IDs to prevent collisions when multiple reports are created in the same millisecond

Co-authored-by: Amp <amp@ampcode.com>
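
The ID shape visible in reports.jsonl (`rpt-<actor>-<sliceId>-<epoch ms>-<seq>`) suggests how the collision fix could work. A minimal sketch follows; the factory name `createReportIdFactory` and the injectable clock are assumptions, not the code in report-helpers.ts.

```typescript
// Hypothetical sketch: timestamp plus a monotonic counter keeps IDs unique
// even when several reports are created within the same millisecond.
function createReportIdFactory(
  now: () => number = Date.now,
): (actor: string, sliceId: string) => string {
  let seq = 0; // shared across all actors and slices for one run
  return (actor, sliceId) => `rpt-${actor}-${sliceId}-${now()}-${seq++}`;
}

// With a frozen clock, only the counter distinguishes the two IDs.
const nextId = createReportIdFactory(() => 1779363759893);
const firstId = nextId("evaluator", "version-flag");
const secondId = nextId("evaluator", "version-flag");
// firstId:  "rpt-evaluator-version-flag-1779363759893-0"
// secondId: "rpt-evaluator-version-flag-1779363759893-1"
```

Injecting the clock is a testing convenience in this sketch; it lets a test pin the timestamp and assert on the counter alone.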

- engine-petri: epic deps use per-dependent signal places (same pattern as slice deps) so multiple epics depending on the same predecessor each get their own token instead of competing for one
- engine-proc: haltedResult() fills in unreached epics/slices as halted before returning, matching petri engine behavior

Co-authored-by: Amp <amp@ampcode.com>

Co-authored-by: Amp <amp@ampcode.com>

- engine-proc: hoist reportIds to run() scope so catch preserves them
- engine-petri: remove dead ep(epicId, 'ready') place (readiness fans out directly to slice eligible places)
- pi-actions: verify-epic write step uses test-writer.md, not evaluator.md

Co-authored-by: Amp <amp@ampcode.com>

- Extract PetriNet class, Token, TransitionDef, FiringPolicy into petri-net.ts
- Extract compilePlan() and RunCtx into net-compiler.ts
- Both engines now call shared compilePlan() with serial firing policy
- Migrate retry state from ctx.retries Map into in-net retry-budget places
- Add adapter tests pinning compiled net place/transition counts
- engine-petri.ts and engine-proc.ts are thin wrappers (~65 LOC each)

Amp-Thread-ID: https://ampcode.com/threads/T-019e4b4e-1543-7602-b99d-c32342fb3938
Co-authored-by: Amp <amp@ampcode.com>

…oped

- Mark orchestrator-poc done in PLAN.md (Phase 0 complete)
- Add petri-semantic-lanes and petri-parallel-execution frontier definitions
- Add petri-graph-compilation and petri-simulation-oracle to Horizon
- Add Track F dependency graph for H-6476 umbrella
- Scope Card 1-3 queue in CARDS.md for petri-semantic-lanes

Amp-Thread-ID: https://ampcode.com/threads/T-019e4b4e-1543-7602-b99d-c32342fb3938
Co-authored-by: Amp <amp@ampcode.com>

kostandinang changed the title ~~FE-730: spec + plan for orchestrator POC dual-engine execution~~ FE-730: Orchestrator POC — dual-engine execution with contract tests May 20, 2026

kostandinang marked this pull request as ready for review May 21, 2026 08:48

augmentcode Bot reviewed May 21, 2026

kostandinang force-pushed the ka/fe-730-orchestrator-poc-dual-engine branch 2 times, most recently from f41c9d5 to d18a745 Compare May 21, 2026 08:59

cursor Bot reviewed May 21, 2026

Comment thread src/orchestrator/src/engine-petri.ts

Comment thread src/orchestrator/src/pi-actions.ts

Comment thread src/orchestrator/src/report-helpers.ts Outdated

cursor Bot reviewed May 21, 2026

Comment thread src/orchestrator/src/engine-petri.ts Outdated

Comment thread src/orchestrator/src/engine-proc.ts

cursor Bot reviewed May 21, 2026

Comment thread src/orchestrator/src/engine-proc.ts

Comment thread src/orchestrator/src/engine-petri.ts Outdated

Comment thread src/orchestrator/src/pi-actions.ts

cursor Bot reviewed May 21, 2026

Comment thread src/orchestrator/src/engine-proc.ts Outdated

Comment thread src/orchestrator/src/engine-petri.ts Outdated

kostandinang self-assigned this May 21, 2026

kostandinang requested a review from lunelson May 21, 2026 12:11

kostandinang mentioned this pull request May 21, 2026

FE-738: Petri semantic lanes — two-lane subnet, compiler split, engine factory #148

Open

lunelson reviewed May 22, 2026

lunelson approved these changes May 22, 2026

kostandinang mentioned this pull request May 22, 2026

FE-743: Petri parallel execution — concurrent firing, resource pools, worktree-per-slice #149

Open

kostandinang and others added 8 commits May 22, 2026 16:13

FE-730: rename fixtureDir → worktreeDir on OrchestratorInput

0476837

FE-730: collapse duplicated topo-sort into one generic

b197c15

kostandinang and others added 13 commits May 22, 2026 16:13

FE-730: migrate contract test #1 to createFakes()

66d7324

FE-730: remove exhausted REFACTOR.md

c3f3ea8

Co-authored-by: Amp <amp@ampcode.com>

FE-730: formatting cleanup (oxfmt)

437ef50

Co-authored-by: Amp <amp@ampcode.com>

FE-730: remove .antigravitycli/ and add to .gitignore

87f1b01

Co-authored-by: Amp <amp@ampcode.com>

FE-730: remove stale fe-716 walkthrough doc (leaked from rebase)

924a888

Co-authored-by: Amp <amp@ampcode.com>

FE-730: make petri the default engine

fceed04

Co-authored-by: Amp <amp@ampcode.com>

kostandinang force-pushed the ka/fe-730-orchestrator-poc-dual-engine branch from b72d1ad to 934ea57 Compare May 22, 2026 14:14

kostandinang mentioned this pull request May 22, 2026

FE-745: Merge slice worktrees into epic-scoped dir for verify-epic #152

Open
