Commit Graph

16 Commits

Author SHA1 Message Date
Ed_
90fc38f671 fix(sim): wire btn_approve_script and expose pending_script_approval in hook API
_handle_approve_script existed but was not registered in the click handler dict.
_pending_dialog (PowerShell confirmation) was invisible to the hook API —
only _pending_ask_dialog (MCP tool ask) was exposed.

- gui_2.py: register btn_approve_script -> _handle_approve_script
- api_hooks.py: add pending_script_approval field to mma_status response
- visual_sim_mma_v2.py: _drain_approvals handles pending_script_approval

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-01 14:31:32 -05:00
Ed_
63fa181192 feat(sim): add pytest timeout(300) and tier_usage Stage 9 check
Task 2.3: prevent infinite CI hangs with 300s hard timeout
Task 3.2: non-blocking Stage 9 logs mma_tier_usage after Tier 3 completes

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-01 14:24:05 -05:00
Ed_
ce5b6d202b fix(tier1): disable tools in generate_tracks, add enable_tools param to ai_client.send
Tier 1 planning calls are strategic — the model should never use file tools
during epic initialization. This caused JSON parse failures when the model
tried to verify file references in the epic prompt.

- ai_client.py: add enable_tools param to send() and _send_gemini()
- orchestrator_pm.py: pass enable_tools=False in generate_tracks()
- tests/visual_sim_mma_v2.py: remove file reference from test epic

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-01 14:04:44 -05:00
Ed_
89a8d9bcc2 test(sim): Rewrite visual_sim_mma_v2 for real Gemini API with frame-sync fixes
Uses gemini-2.5-flash-lite (real API, CLI quota exhausted). Adds _poll/_drain_approvals helpers, frame-sync sleeps after all state-changing clicks, proper stage transitions, and 120s timeouts for real API latency. Addresses simulation_hardening Issues 2 & 3.
2026-03-01 13:42:34 -05:00
Ed_
6326546005 feat(taxonomy): Redirect logs and artifacts to dedicated sub-folders 2026-03-01 09:03:02 -05:00
Ed_
da21ed543d fix(mma): Unblock visual simulation - event routing, loop passing, adapter preservation
Three independent root causes fixed:
- gui_2.py: Route mma_spawn_approval/mma_step_approval events in _process_event_queue
- multi_agent_conductor.py: Pass asyncio loop from ConductorEngine.run() through to
  thread-pool workers for thread-safe event queue access; add _queue_put helper
- ai_client.py: Preserve GeminiCliAdapter in reset_session() instead of nulling it

Test: visual_sim_mma_v2::test_mma_complete_lifecycle passes in ~8s

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-01 08:32:31 -05:00
Ed_
6b0823ad6c checkpoint: this is a mess... need to define stricter DSL or system for how the AI devices sims and hookup api for tests. 2026-02-28 22:50:14 -05:00
Ed_
397b4e6001 chore(mma): Clean up mma_exec.py and robustify visual simulation mocking 2026-02-28 22:27:17 -05:00
Ed_
42c42985ee chore(mma): Verify track loading in visual simulation and fix deterministic ID logic 2026-02-28 22:12:57 -05:00
Ed_
37df4c8003 chore(mma): Deterministic track IDs, worker spawn hooks, and improved simulation reliability 2026-02-28 22:09:18 -05:00
Ed_
ed56e56a2c chore(mma): Checkpoint progress on visual simulation and UI refresh before sub-agent delegation 2026-02-28 21:41:46 -05:00
Ed_
d65fa79e26 chore(mma): Implement visual simulation for Epic planning and fix UI refresh 2026-02-28 21:07:46 -05:00
Ed_
f5e43c7987 refactor(tests): Add strict type hints to sixth batch of test files 2026-02-28 19:25:54 -05:00
Ed_
d36632c21a checkpoint: massive refactor 2026-02-28 09:06:45 -05:00
Ed_
bc261c6cbe teststests in wrong spot. 2026-02-28 00:07:45 -05:00
Ed_
2a30e62621 test(sim): Setup framework for robust live sim verification 2026-02-27 23:20:42 -05:00