diff --git a/tests/artifacts/PHASE14_TEST_RUN_RESULTS.log b/tests/artifacts/PHASE14_TEST_RUN_RESULTS.log
new file mode 100644
index 00000000..c3aa18fc
--- /dev/null
+++ b/tests/artifacts/PHASE14_TEST_RUN_RESULTS.log
@@ -0,0 +1,52 @@
+# Phase 14 - 11/11 Tier Test Run Results
+
+**Track:** live_gui_test_fixes_20260618
+**Date:** 2026-06-18
+**Command:** `uv run python scripts/run_tests_batched.py --tiers 1,2,3 --no-color --durations`
+
+## Result: 11/11 tiers PASS clean
+
+| Tier | Status | Duration |
+|---|---|---|
+| tier-1-unit-comms | PASS | 25.0s |
+| tier-1-unit-core | PASS | 56.1s |
+| tier-1-unit-gui | PASS | 27.5s |
+| tier-1-unit-headless | PASS | 23.0s |
+| tier-1-unit-mma | PASS | 26.3s |
+| tier-2-mock_app-comms | PASS | 10.2s |
+| tier-2-mock_app-core | PASS | 15.9s |
+| tier-2-mock_app-gui | PASS | 12.9s |
+| tier-2-mock_app-headless | PASS | 10.9s |
+| tier-2-mock_app-mma | PASS | 14.9s |
+| tier-3-live_gui | PASS | 601.7s |
+
+**Total: ~825 seconds (~13.75 minutes)**
+
+## Notes
+
+- **Issue 1 verification (tier-3-live_gui, 601.7s):** The `test_execution_sim_live` test now passes.
+  The `_trigger_blink` + `imgui.set_window_focus("Response")` call in `src/gui_2.py:render_response_panel`
+  has been deferred to the next frame's idle phase via `_pending_focus_response`. This avoids exhausting
+  the GUI subprocess main thread's 1.94 MB stack on Windows.
+
+- **Issue 2 verification (tier-1-unit-gui, 27.5s):** The `test_live_gui_workspace_exists` test now passes
+  in batched runs. The `live_gui_workspace` fixture in `tests/conftest.py` now calls
+  `workspace.mkdir(parents=True, exist_ok=True)` before returning the path, making it idempotent and
+  resilient to concurrent teardown by other xdist workers.
+
+- **No new `@pytest.mark.skip` markers** added by this track. The 4 Gemini 503 pre-existing skip
+  markers remain (out of scope; deferred to a follow-up track).
+
+- **Pre-existing Unicode summary print bug:** `scripts/run_tests_batched.py:_print_summary` fails with
+  `UnicodeEncodeError: 'charmap' codec can't encode characters` after all batches pass. This is a
+  Windows cp1252 stdout encoding issue unrelated to the fixes in this track. The actual test results
+  are captured correctly; only the post-run summary print fails. Pre-existing.
+
+## Verification
+
+The full output is captured per-tier:
+- `tests/artifacts/_tier_1_run.log` (tier 1 results)
+- `tests/artifacts/_tier_2_run.log` (tier 2 results)
+- `tests/artifacts/_tier_3_run.log` (tier 3 results)
+
+The earlier verification at `_tier_run.log` is a partial log from an interrupted run; ignore it.
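
The Issue 2 note says the `live_gui_workspace` fixture became idempotent by calling `workspace.mkdir(parents=True, exist_ok=True)` before returning the path. The following is a minimal sketch of that pattern, not the actual code in `tests/conftest.py`: the helper name `ensure_workspace` and the directory layout are assumptions made for illustration, and only the `mkdir` call itself comes from the log.

```python
import shutil
import tempfile
from pathlib import Path


def ensure_workspace(root: Path) -> Path:
    """Return the workspace directory, creating it if it is missing.

    Hypothetical helper: only the mkdir(parents=True, exist_ok=True)
    call is taken from the log; the name and layout are assumptions.
    """
    workspace = root / "live_gui_workspace"
    # exist_ok=True makes repeated calls safe; parents=True recreates any
    # missing parent directories, e.g. after another worker's teardown.
    workspace.mkdir(parents=True, exist_ok=True)
    return workspace


if __name__ == "__main__":
    root = Path(tempfile.mkdtemp())
    try:
        first = ensure_workspace(root)
        second = ensure_workspace(root)  # second call does not raise
        shutil.rmtree(first)             # simulate teardown by another worker
        third = ensure_workspace(root)   # directory is recreated
        print(first == second == third, third.is_dir())
    finally:
        shutil.rmtree(root, ignore_errors=True)
```

Without `exist_ok=True`, the second call would raise `FileExistsError`. Without recreating the directory on every request, a test that runs after another worker's teardown would see a missing path, which matches the batched-run failure described in the note.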