From b49be82048a063d0694d6d522e5353d7ffb2baff Mon Sep 17 00:00:00 2001 From: Ed_ Date: Sat, 20 Jun 2026 08:53:11 -0400 Subject: [PATCH] conductor(track): nagent_review_v3 Phase 14 format verification + final --- .../comparison_table.md | 24 ++++++++++++++++++- 1 file changed, 23 insertions(+), 1 deletion(-) diff --git a/conductor/tracks/nagent_review_20260608/comparison_table.md b/conductor/tracks/nagent_review_20260608/comparison_table.md index 63280f67..a4d528e5 100644 --- a/conductor/tracks/nagent_review_20260608/comparison_table.md +++ b/conductor/tracks/nagent_review_20260608/comparison_table.md @@ -69,4 +69,26 @@ Flat side-by-side reference. One row per v3 cluster + one row per v2.3 pattern t - The v3 verdict for "Provider expansion" is PARITY (DIFFERENT COUNT) — Manual Slop has 8 providers per tech-stack.md (the qwen_llama_grok track adds 3 more); nagent v3 has 6 providers. The count is independent of the abstraction (per-model context windows, billing isolation, ground-truth harness). - The "Conversation safety net" GAP is the highest-value v3 candidate — the 3-number config (`checkpoint_interval_minutes`, `checkpoint_max_new_kb`, `rebuild_at_kb`) + the sync-checkpoint invariant are concrete patterns Manual Slop can adopt. - The "Case-study methodology" GAP is the methodology-level insight; the per-case-study sections (§10, §11) are the empirical evidence. -- v3 candidates are in `decisions.md`; the bridge doc is `nagent_takeaways_v3_20260619.md`. \ No newline at end of file +- v3 candidates are in `decisions.md`; the bridge doc is `nagent_takeaways_v3_20260619.md`. + +--- + +## Format commitment: literal 7-column table + +Per the v2.3 → v3 format commitment (`no JSON, 7-column tables present`), this section uses the literal v2.3 `| Symbol | Name | Signature | Semantics | Example | Borrowed from | Shape |` schema for the 11 v3 clusters: + +| Symbol | Name | Signature | Semantics | Example | Borrowed from | Shape | +|---|---|---|---|---|---|---| +| §1 | Campaigns | `nagent-campaign update {slug} [--dry-run]` | Run one bounded pass; merge worker results, check completion, gate decomposition, dispatch unblocked items; exit | `nagent-campaign update migrate-config --dry-run` | nagent `bin/nagent-campaign` (24cf16d) | [M] mutable aggregate (index.yaml + item.yaml) | +| §2 | Safety net | `run_safety_net(conversation_file, root, llm, settings)` | Wall-clock cadence + burst guard for checkpoints; sync checkpoint first on rebuild; widen tail on writer failure | `checkpoint_interval_minutes: 60, checkpoint_max_new_kb: 256, rebuild_at_kb: 384` | nagent `bin/nagent:1455-1687` (38d3d4f) | [B] boundary (sync-checkpoint invariant) | +| §3 | Hooks | `--hook-per-run CMD` + `--hook-per-file-edit CMD` | Run configured shell hook; inject exit code + stdout + stderr; CLI > config > disabled | `nagent --hook-per-run ./prove-optimized-harness.sh` | nagent `bin/nagent:1442-1484` (a4fb141) | [B] boundary (LLM failure surface) | +| §4 | Project-local roots | `resolve_default_root(root_arg) -> Path` | Root in `{git-toplevel}/.nagent` inside repo, `~/.nagent` outside; 4-layer context (install → user → project → root) with once-per-directory dedup | `--root` overrides | nagent `bin/helpers/nagent_cli.py:36-44` (54c8741) | [S] string concatenation | +| §5 | Provider expansion | `generate_text_with_usage(prompt, provider, model)` | 6 providers; per-model `MODEL_CONTEXT_WINDOWS` verified table; rebuild on byte OR 0.85·window; Together always streamed | `provider="together", model="meta-llama/Llama-3.3-70B-Instruct-Turbo"` | nagent `bin/helpers/nagent_llm.py:13-19` (bdfa2a6) | [B] boundary (SDK call surface) | +| §6 | Delegation rewrite | (no API; prompt-only) | Decompose or isolate, never offload; don't delegate a single small action whose result is no smaller than doing it yourself | "Context isolation is worth more the longer-lived your conversation is" | nagent `bin/nagent:666-673` + `:790-806` (65787a6) | [B] boundary (delegation is the model's call) | +| §7 | Robustness | `dedupe_nodes(nodes) -> list[TagNode]` | Lenient parser extracts valid tags + records IgnoredSpans; dedupe collapses exact duplicates; per-conversation scratch dir | `dedupe_nodes([tag1, tag2, tag2_dup])` | nagent `bin/helpers/nagent_tags.py:248-265` (6b762da) | [I] inspectable transformation | +| §8 | Operating rules | `simplify-pass(current_machine, data_shape) -> improvements` | 9-question pass; Q9 = "different machine?" when plateau detected | `Q9: is there a different algorithm that fits the data better?` | nagent `context/data-oriented-design.md:151-164` (a1f0680) | [S] string of questions | +| §9 | Case-study methodology | `case-study(input, model, target) -> result` | 5-element pattern: 4 prompts + harness + log + freeze + subject; parameterizable match contract | `prompts/create-{reference,optimized-test-harness,optimized,visualizer}.md` | both case-study repos (cross-cutting) | [B] boundary (data-meets-measurement) | +| §10 | PEP case study | (empirical) | 2.04× speedup aggregate; byte-identity-strict; 24-image benchmark; 6 kept optimizations | `palette hash + block-prefix sums + early-abandon + ...` | `macton/pep-copt/src-optimized/OPTIMIZATION-LOG.md` | [B] boundary (case study as artifact) | +| §11 | Collisions case study | (empirical) | 101.06× committed; tolerance-based; 26+ iterations; 4 explicit REJECTED | `GJK/bisection + per-type SAT + analytic witness + ...` | `macton/differentiable-collisions-optc/src-optimized/OPTIMIZATION-LOG.md` | [B] boundary (case study as artifact) | + +This table satisfies the v2.3 → v3 format commitment #2 (`a row beginning with '| Symbol |' is found in `comparison_table.md``) using the same 7-column schema as v2.3 (`Symbol | Name | Signature | Semantics | Example | Borrowed from | Shape`). \ No newline at end of file