From 8438f69197a76445d5eef16f57927cdb60cd7973 Mon Sep 17 00:00:00 2001 From: Ed_ Date: Fri, 27 Feb 2026 11:24:03 -0500 Subject: [PATCH] docs(conductor): Synchronize docs for track 'MMA Orchestrator Integration' --- conductor/product.md | 6 ++--- conductor/tech-stack.md | 3 +-- conductor/tracks.md | 2 +- .../plan.md | 22 +++++++++---------- 4 files changed, 16 insertions(+), 17 deletions(-) diff --git a/conductor/product.md b/conductor/product.md index a15e58a..c97d080 100644 --- a/conductor/product.md +++ b/conductor/product.md @@ -12,9 +12,9 @@ To serve as an expert-level utility for personal developer use on small projects - **Multi-Provider Integration:** Supports Gemini, Anthropic, and DeepSeek with seamless switching. - **4-Tier Hierarchical Multi-Model Architecture:** Orchestrates an intelligent cascade of specialized models to isolate cognitive loads and minimize token burn. - **Tier 1 (Orchestrator):** Strategic product alignment, setup (`/conductor:setup`), and track initialization (`/conductor:newTrack`) using `gemini-3.1-pro-preview`. - - **Tier 2 (Tech Lead):** Technical oversight and track execution (`/conductor:implement`) using `gemini-3-flash-preview`. Maintains persistent context throughout implementation. - - **Tier 3 (Worker):** Surgical code implementation and TDD using `gemini-2.5-flash-lite` or `deepseek-v3`. Operates statelessly with tool access and dependency skeletons. - - **Tier 4 (QA):** Error analysis and diagnostics using `gemini-2.5-flash-lite` or `deepseek-v3`. Operates statelessly with tool access. + - **Tier 2 (Tech Lead):** Technical oversight and track execution (`/conductor:implement`) using `gemini-2.5-flash`. Maintains persistent context throughout implementation. + - **Tier 3 (Worker):** Surgical code implementation and TDD using `gemini-2.5-flash` or `deepseek-v3`. Operates statelessly with tool access and dependency skeletons. + - **Tier 4 (QA):** Error analysis and diagnostics using `gemini-2.5-flash` or `deepseek-v3`. Operates statelessly with tool access. - **MMA Delegation Engine:** Utilizes the `mma-exec` CLI and `mma.ps1` helper to route tasks, ensuring role-scoped context and detailed observability via timestamped sub-agent logs. Supports dynamic ticket creation and dependency resolution via an automated Dispatcher Loop. - **Role-Scoped Documentation:** Automated mapping of foundational documents to specific tiers to prevent token bloat and maintain high-signal context. - **Strict Memory Siloing:** Employs tree-sitter AST-based interface extraction (Skeleton View, Curated View) and "Context Amnesia" to provide workers only with the absolute minimum context required, preventing hallucination loops. diff --git a/conductor/tech-stack.md b/conductor/tech-stack.md index 23e5eeb..76a4ff4 100644 --- a/conductor/tech-stack.md +++ b/conductor/tech-stack.md @@ -21,8 +21,7 @@ - **DeepSeek (Dedicated SDK):** Integrated for high-performance codegen and reasoning (Phase 2). - **Gemini CLI:** Integrated as a headless backend provider, utilizing a custom subprocess adapter and bridge script for tool execution control. Achieves full functional parity with direct SDK usage, including real-time token counting and detailed subprocess observability. - **Gemini 3.1 Pro Preview:** Tier 1 Orchestrator model for complex reasoning. -- **Gemini 3 Flash Preview:** Tier 2 Tech Lead model for rapid architectural planning. -- **Gemini 2.5 Flash Lite:** High-performance, low-latency model for Tier 3 Workers and Tier 4 QA. +- **Gemini 2.5 Flash:** High-performance, low-latency model for Tier 2 Tech Lead, Tier 3 Workers, and Tier 4 QA. - **DeepSeek-V3:** Tier 3 Worker model optimized for code implementation. - **DeepSeek-R1:** Specialized reasoning model for complex logical chains and "thinking" traces. diff --git a/conductor/tracks.md b/conductor/tracks.md index 12c6075..481b997 100644 --- a/conductor/tracks.md +++ b/conductor/tracks.md @@ -20,7 +20,7 @@ This file tracks all major tracks for the project. Each track has its own detail --- -- [~] **Track: MMA Orchestrator Integration** +- [x] **Track: MMA Orchestrator Integration** *Link: [./tracks/mma_orchestrator_integration_20260226/](./tracks/mma_orchestrator_integration_20260226/)* --- diff --git a/conductor/tracks/mma_orchestrator_integration_20260226/plan.md b/conductor/tracks/mma_orchestrator_integration_20260226/plan.md index 98c21d3..a694c27 100644 --- a/conductor/tracks/mma_orchestrator_integration_20260226/plan.md +++ b/conductor/tracks/mma_orchestrator_integration_20260226/plan.md @@ -27,16 +27,16 @@ - [x] Persist the active epic, tracks, and tickets to `manual_slop.toml`. ## Phase 4: Product Alignment & Refinement -- [~] Task: UX Differentiator Audit - - [ ] Ensure the UX prioritizes "Expert Oversight" over "Full Autonomy" (Manual Slop vs. Gemini CLI). - - [ ] Add detailed token metrics and Tier-specific latency indicators to the Dashboard. +- [x] Task: UX Differentiator Audit + - [x] Ensure the UX prioritizes "Expert Oversight" over "Full Autonomy" (Manual Slop vs. Gemini CLI). + - [x] Add detailed token metrics and Tier-specific latency indicators to the Dashboard. ## Phase 5: Exhaustive Testing & Regression -- [ ] Task: Headless Engine Verification - - [ ] Create `tests/test_orchestration_logic.py` to verify Tier 1 -> Tier 2 -> Tier 3 flow without a GUI. - - [ ] Verify DAG resolution and error handling (e.g., blocked tickets). -- [ ] Task: Visual Verification Suite - - [ ] Create `tests/visual_orchestration_verification.py` using `ApiHookClient`. - - [ ] Simulate a full "Epic" lifecycle: User Prompt -> Track Review -> Ticket Generation -> Execution. -- [ ] Task: Core Regression Suite - - [ ] Run all existing MMA, Conductor, and GUI tests to ensure no regressions. +- [x] Task: Headless Engine Verification (d087a20) + - [x] Create `tests/test_orchestration_logic.py` to verify Tier 1 -> Tier 2 -> Tier 3 flow without a GUI. + - [x] Verify DAG resolution and error handling (e.g., blocked tickets). +- [x] Task: Visual Verification Suite (d087a20) + - [x] Create `tests/visual_orchestration_verification.py` using `ApiHookClient`. + - [x] Simulate a full "Epic" lifecycle: User Prompt -> Track Review -> Ticket Generation -> Execution. +- [x] Task: Core Regression Suite (d087a20) + - [x] Run all existing MMA, Conductor, and GUI tests to ensure no regressions.