chore(audit): Phase 4.1 - 11/11 test tiers PASS clean (825s total)
All 11 test tiers pass after the 2 documented test infrastructure fixes. No regressions. The 4 Gemini 503 skip markers remain (out of scope for this track).

Result: 11/11 PASS clean.
- tier-1-unit-comms: 25.0s
- tier-1-unit-core: 56.1s
- tier-1-unit-gui: 27.5s (Issue 2 verified)
- tier-1-unit-headless: 23.0s
- tier-1-unit-mma: 26.3s
- tier-2-mock_app-comms: 10.2s
- tier-2-mock_app-core: 15.9s
- tier-2-mock_app-gui: 12.9s
- tier-2-mock_app-headless: 10.9s
- tier-2-mock_app-mma: 14.9s
- tier-3-live_gui: 601.7s (Issue 1 verified)

Total: ~825s (~13.75 min)
@@ -0,0 +1,52 @@
# Phase 4.1 - 11/11 Tier Test Run Results

**Track:** live_gui_test_fixes_20260618
**Date:** 2026-06-18
**Command:** `uv run python scripts/run_tests_batched.py --tiers 1,2,3 --no-color --durations`

## Result: 11/11 tiers PASS clean

| Tier | Status | Duration |
|---|---|---|
| tier-1-unit-comms | PASS | 25.0s |
| tier-1-unit-core | PASS | 56.1s |
| tier-1-unit-gui | PASS | 27.5s |
| tier-1-unit-headless | PASS | 23.0s |
| tier-1-unit-mma | PASS | 26.3s |
| tier-2-mock_app-comms | PASS | 10.2s |
| tier-2-mock_app-core | PASS | 15.9s |
| tier-2-mock_app-gui | PASS | 12.9s |
| tier-2-mock_app-headless | PASS | 10.9s |
| tier-2-mock_app-mma | PASS | 14.9s |
| tier-3-live_gui | PASS | 601.7s |

**Total: ~825 seconds (~13.75 minutes)**
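
As a quick arithmetic check, the per-tier durations reported for this run can be summed to confirm the stated total. The snippet below is only an illustrative sketch and is not part of the repository; the values are copied from the results table.

```python
# Durations in seconds, copied from the per-tier results of this run.
durations = {
    "tier-1-unit-comms": 25.0,
    "tier-1-unit-core": 56.1,
    "tier-1-unit-gui": 27.5,
    "tier-1-unit-headless": 23.0,
    "tier-1-unit-mma": 26.3,
    "tier-2-mock_app-comms": 10.2,
    "tier-2-mock_app-core": 15.9,
    "tier-2-mock_app-gui": 12.9,
    "tier-2-mock_app-headless": 10.9,
    "tier-2-mock_app-mma": 14.9,
    "tier-3-live_gui": 601.7,
}

total_s = sum(durations.values())
# Share of the wall time spent in the live GUI tier.
live_gui_share = durations["tier-3-live_gui"] / total_s

print(f"{total_s:.1f}s = {total_s / 60:.2f} min")  # 824.4s = 13.74 min
print(f"tier-3-live_gui share: {live_gui_share:.0%}")
```

The exact sum is 824.4s, which rounds to the reported ~825s; tier-3-live_gui accounts for roughly three quarters of the run.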

## Notes

- **Issue 1 verification (tier-3-live_gui, 601.7s):** The `test_execution_sim_live` test now passes.
  The `_trigger_blink` + `imgui.set_window_focus("Response")` call in `src/gui_2.py:render_response_panel`
  is now deferred to the next frame's idle phase via `_pending_focus_response`. This avoids exhausting
  the GUI subprocess main thread's 1.94 MB stack on Windows.

- **Issue 2 verification (tier-1-unit-gui, 27.5s):** The `test_live_gui_workspace_exists` test now passes
  in batched runs. The `live_gui_workspace` fixture in `tests/conftest.py` now calls
  `workspace.mkdir(parents=True, exist_ok=True)` before returning the path, making it idempotent and
  resilient to concurrent teardown by other xdist workers.

- **No new `@pytest.mark.skip` markers** were added by this track. The 4 pre-existing Gemini 503 skip
  markers remain (out of scope; deferred to a follow-up track).

- **Pre-existing Unicode summary print bug:** `scripts/run_tests_batched.py:_print_summary` fails with
  `UnicodeEncodeError: 'charmap' codec can't encode characters` after all batches pass. This is a
  Windows cp1252 stdout encoding issue that predates this track and is unrelated to its fixes. The
  test results themselves are captured correctly; only the post-run summary print fails.
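
The deferral described under Issue 1 can be sketched roughly as follows. This is not the code in `src/gui_2.py`: the class name, the `on_frame_idle` hook, and the injected `set_window_focus` callable are hypothetical stand-ins, used so the pattern runs without a GUI. Only the `_pending_focus_response` flag name comes from the note.

```python
class ResponsePanelSketch:
    """Hypothetical stand-in for the panel logic, for illustration only."""

    def __init__(self, set_window_focus):
        # In the real app this would be something like imgui.set_window_focus.
        self._set_window_focus = set_window_focus
        self._pending_focus_response = False

    def render_response_panel(self, new_response: bool) -> None:
        # Called deep inside the render call stack: record the request
        # instead of changing focus here.
        if new_response:
            self._pending_focus_response = True

    def on_frame_idle(self) -> None:
        # Called once per frame from a shallow point in the main loop.
        if self._pending_focus_response:
            self._pending_focus_response = False
            self._set_window_focus("Response")


calls = []
panel = ResponsePanelSketch(calls.append)
panel.render_response_panel(new_response=True)  # frame N: request only
calls_during_render = list(calls)               # still empty here
panel.on_frame_idle()                           # frame N+1: focus applied
panel.on_frame_idle()                           # flag cleared, no repeat
print(calls)  # ['Response']
```

The point of the flag is that the focus change runs from a shallow call depth, once, rather than from inside the nested render path.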
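
The idempotent directory creation described under Issue 2 can be illustrated with a minimal sketch. The helper name `ensure_workspace` and the temporary paths are assumptions made for this example; the actual fixture in `tests/conftest.py` may be structured differently. Only the `mkdir(parents=True, exist_ok=True)` call is taken from the note.

```python
import shutil
import tempfile
from pathlib import Path


def ensure_workspace(workspace: Path) -> Path:
    """Create the workspace if it is missing; do nothing if it exists."""
    workspace.mkdir(parents=True, exist_ok=True)
    return workspace


root = Path(tempfile.mkdtemp())
ws = root / "artifacts" / "live_gui_workspace"  # hypothetical location

ensure_workspace(ws)   # first use creates the directory tree
shutil.rmtree(ws)      # simulate teardown by another xdist worker
ensure_workspace(ws)   # next use recreates it
ensure_workspace(ws)   # repeated call on an existing directory is a no-op
exists_after = ws.is_dir()

shutil.rmtree(root)    # clean up the temporary tree
```

In a pytest fixture, the same call would run just before the path is returned, so a test never receives a path that another worker has already removed.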
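
The Unicode summary failure can be reproduced in isolation with a cp1252-encoded text stream. The sketch below assumes the summary contains a character outside cp1252 (a check mark is used here as a guess; the actual characters printed by `_print_summary` are not stated in this report) and shows how the `errors` setting changes the outcome.

```python
import io

# Assumed example text: U+2713 (check mark) has no cp1252 encoding.
summary = "tier-3-live_gui \u2713 PASS"


def write_summary(text: str, errors: str) -> bytes:
    """Write text through a cp1252 stream, as a Windows console might."""
    buf = io.BytesIO()
    stream = io.TextIOWrapper(buf, encoding="cp1252", errors=errors)
    stream.write(text)
    stream.flush()
    return buf.getvalue()


try:
    write_summary(summary, errors="strict")
    strict_failed = False
except UnicodeEncodeError:
    # Same failure class as the one reported for the summary print.
    strict_failed = True

# With errors="replace", unencodable characters become "?".
replaced = write_summary(summary, errors="replace")
print(strict_failed, replaced)
```

For a real run, possible mitigations include calling `sys.stdout.reconfigure(errors="replace")` before printing or setting `PYTHONIOENCODING=utf-8` in the environment; neither has been applied or verified in this track.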

## Verification

The full output is captured per tier:
- `tests/artifacts/_tier_1_run.log` (tier 1 results)
- `tests/artifacts/_tier_2_run.log` (tier 2 results)
- `tests/artifacts/_tier_3_run.log` (tier 3 results)

The earlier verification log, `_tier_run.log`, is a partial log from an interrupted run; ignore it.