Private
Public Access
0
0
Commit Graph

2870 Commits

Author SHA1 Message Date
ed 428aa18948 conductor(checkpoint): Checkpoint end of Phase 1 (4 FRs + 4 regression tests) 2026-06-10 09:56:21 -04:00
ed b96d709efb test(reset): regression for 3 pre-existing controller bugs 2026-06-10 09:16:46 -04:00
ed 4284ec6eba fix(controller): remove 'persona_manager' from _LAZY_MANAGER_DEFAULTS 2026-06-10 09:03:12 -04:00
ed bc4651d1e4 fix(controller): re-add self.context_preset_manager init (lost in 72f8f466) 2026-06-10 08:56:35 -04:00
ed 1919aa8a32 fix(controller): _flush_to_project defensive against missing 'model' key 2026-06-10 08:48:57 -04:00
ed d80c94b973 fix(controller): pre-populate mma_tier_usage on reset (restore _flush_to_project contract) 2026-06-10 08:46:54 -04:00
ed f5021360f1 wip: pre-mma-tier-usage-reset-fix (preserve inherited working tree) 2026-06-10 08:43:18 -04:00
ed d304af5d22 sigh 2026-06-10 08:34:46 -04:00
ed 72f8f466fe fix(sim+api): proper wait loops, project switch endpoint, drop stale check
Three real fixes for the sim test + the live_gui coordination layer:

1. /api/project_switch_status endpoint in src/app_controller.py.
   The wait helper had been calling this endpoint but it did not exist;
   the helper always received a 404, fell back to {in_progress: False},
   and returned immediately even when a switch was in flight. Added the
   endpoint that reads _project_switch_in_progress, active_project_path,
   and _project_switch_error from the controller.

2. simulation/sim_base.py: replace time.sleep(2.0)/time.sleep(1.5) in
   the setup() with wait_io_pool_idle and wait_for_project_switch so
   the test does not click btn_md_only while a project switch is in
   flight. Also added the wait calls to sim_context.py for the same
   reason.

3. src/app_controller.py _handle_md_only: removed the is_project_stale()
   early-return. The stale state is a transient window during which the
   previous code dropped the click on the floor with a misleading
   'stale ui' status. The MD generation worker is safe to run from any
   project state; the action handler now always proceeds.

4. tests/test_extended_sims.py: set current_model to 'gemini-cli' so
   _do_generate does not raise KeyError('model') when the test
   overrides provider to gemini_cli.

KNOWN ISSUE: test_context_sim_live still fails with status
'switching to: temp_livecontextsim' after a 60s wait. The click
appears to be re-triggering a project switch via the GUI's render
loop. Root cause investigation deferred; the sim is async and the
test path is fragile.
2026-06-10 00:31:22 -04:00
ed 33d02bb11f fix(test): drop rmtree race in live_gui workspace creation
The session-scoped live_gui fixture deleted the shared workspace
before recreating it, which raced with the per-worker lock acquisition
and produced FileNotFoundError on .live_gui_owner.lock in xdist.
The per-run timestamped name (tests/artifacts/live_gui_workspace_<ts>/)
already provides enough isolation between pytest invocations, so the
rmtree is unnecessary. Use mkdir(exist_ok=True) only.
2026-06-09 23:31:09 -04:00
ed 283bb7085b fix(test): remove live_gui skip gate — lock mechanism handles coordination 2026-06-09 22:45:36 -04:00
ed 5568b59634 fix(test): single shared workspace, remove per-worker subdirs (keep lock mechanism) 2026-06-09 22:38:28 -04:00
ed 4bb19835db fix(test): per-worker workspace subdir + file-lock for xdist live_gui coordination 2026-06-09 22:23:33 -04:00
ed 38cb0f99b4 fix(test): add PID to workspace path for xdist worker isolation 2026-06-09 21:45:02 -04:00
ed 35f4cecb9b fix(test): catch OSError in workspace rmtree retry (broader than PermissionError) 2026-06-09 21:22:00 -04:00
ed aa776224f2 test(workspace): update fixture test to assert tests/artifacts/ not tmp dir 2026-06-09 21:06:06 -04:00
ed ccc2aa0be9 test(workspace): verify per-run workspace path and gitignore status 2026-06-09 20:45:24 -04:00
ed b8c15f8d92 fix(test): per-run workspace under tests/artifacts/ (replaces tmp_path_factory) 2026-06-09 20:42:43 -04:00
ed 93ec28097c docs(styleguide): add workspace_paths.md — hard rule for test workspace paths 2026-06-09 20:36:41 -04:00
ed b95410c565 wip: pre-workspace-path-finalize 2026-06-09 20:32:43 -04:00
ed 39c97cb365 conductor(track): workspace_path_finalize_20260609 - plan with 3 phases, 4-step execution 2026-06-09 20:29:55 -04:00
ed c725270b99 conductor(track): workspace_path_finalize_20260609 - per-run workspace under tests/artifacts/ 2026-06-09 20:27:20 -04:00
ed fe240db410 fix(reset): clear mma_tier_usage and RAG state in _handle_reset_session 2026-06-09 19:44:10 -04:00
ed 9128db5e48 ci(gitea): add test-on-tag workflow for tagged commits (tier-1 + tier-2) 2026-06-09 18:47:59 -04:00
ed 34290e5d1a test(watchdog): update PYTEST_FINISHED_TIMEOUT_SECONDS to 600 to match conftest 2026-06-09 18:42:53 -04:00
ed c3af1b8a2e chore(test): double smart_watchdog timeout from 300s to 600s for tier-3 2026-06-09 18:37:34 -04:00
ed 3b0e63124a fix(mma): process global mma_state_update when no track in payload 2026-06-09 17:45:13 -04:00
ed 7a946544ff test(mma): mark test_visual_mma_components with clean_baseline 2026-06-09 17:14:23 -04:00
ed e7da7e0d6a test(rag): update test for Phase 4 coalescing state 2026-06-09 17:10:33 -04:00
ed 5656957622 conductor(plan): Phase 8 complete - docs + audit extended 2026-06-09 17:05:35 -04:00
ed 719fe9abe7 conductor(checkpoint): Checkpoint end of Phase 8 2026-06-09 17:04:17 -04:00
ed cb525519cf docs(testing): document _LiveGuiHandle + live_gui_workspace + clean_baseline marker 2026-06-09 17:03:26 -04:00
ed 749120d239 feat(audit): flag hardcoded workspace and project-root paths in tests 2026-06-09 17:01:14 -04:00
ed d2ff6ffcf9 conductor(plan): Phase 7 complete - test_bed_health report 2026-06-09 16:59:16 -04:00
ed 84edb20038 docs(report): test_bed_health_20260609 - post-track batch status 2026-06-09 16:58:33 -04:00
ed 1cd3444e4c test(rag): mark RAG tests with clean_baseline for batch isolation 2026-06-09 16:56:55 -04:00
ed 3ed52be4bf conductor(plan): Phase 6 complete - clean_baseline marker 2026-06-09 16:42:48 -04:00
ed 7b87bbf5ec feat(test): clean_baseline marker resets controller state before test 2026-06-09 16:40:18 -04:00
ed afc8600800 conductor(plan): Phase 5 complete - set_value hook verified 2026-06-09 16:35:18 -04:00
ed 33d5caceaf fix(api_hooks): verified set_value('ai_input') works in batch 2026-06-09 16:33:55 -04:00
ed 6764c9e12f conductor(plan): Phase 4 complete - coalesce _sync_rag_engine 2026-06-09 16:27:15 -04:00
ed b8fcd9d6f5 fix(rag): coalesce _sync_rag_engine calls via token + dirty flag 2026-06-09 16:25:44 -04:00
ed 45b4497a66 conductor(plan): Phase 3 complete - tmp_path_factory + live_gui_workspace fixture 2026-06-09 16:15:50 -04:00
ed 006bb11488 refactor(test): 5 test files use live_gui_workspace fixture instead of hardcoded path 2026-06-09 16:14:40 -04:00
ed 91313451a2 feat(test): expose live_gui_workspace as a separate fixture 2026-06-09 15:53:06 -04:00
ed c64da95ef5 refactor(test): live_gui workspace via tmp_path_factory 2026-06-09 15:51:35 -04:00
ed c32ae33817 wip: pre-Phase 3 checkpoint 2026-06-09 15:49:12 -04:00
ed c3cb3c6e44 feat(test): autouse _check_live_gui_health recovers from degraded subprocess 2026-06-09 15:47:28 -04:00
ed 05ddb45236 conductor(plan): Phase 2 complete - FR1 handle + autouse fixture 2026-06-09 15:43:38 -04:00
ed 67d0211e56 feat(test): autouse _check_live_gui_health recovers from degraded subprocess 2026-06-09 15:42:00 -04:00