Private
Public Access
0
0
Commit Graph

2901 Commits

Author SHA1 Message Date
ed 3945fe37fe conductor(tracks): archive test_infrastructure_hardening_20260609 in tracks.md
- Remove row 1 from Active Tracks table
- Update rows 2-5, 17: test_infrastructure_hardening_20260609 -> '(merged)'
- Mark test_infrastructure_hardening as [COMPLETE 2026-06-10] [archived]
- Update link to use archive/ instead of tracks/
- Add closing note: 314/314 tests green, lineage tracks also archived
2026-06-10 20:15:18 -04:00
ed 5d2624526b conductor(archive): move 4 test-hell lineage tracks to archive/
- workspace_path_finalize_20260609 -> archive/ (precursor track)
- test_infrastructure_hardening_20260609 -> archive/ (main 8-phase track)
- mma_tier_usage_reset_fix_20260610 -> archive/ (4 controller bug fixes)
- rag_phase4_sync_fix_20260610 -> archive/ (RAG dim-mismatch + rag_config reset)

The archive/ directory already existed (71+ archived tracks from
earlier phases). The 4 tracks' state.toml + metadata.json were already
closed in the prior commit. This just relocates the folders to match
the convention referenced in tracks.md.
2026-06-10 20:12:50 -04:00
ed 1ea38ad16b conductor(track): close 4 test-hell lineage tracks (state + metadata)
- test_infrastructure_hardening_20260609: status active->completed,
  last_updated 2026-06-09->2026-06-10, t7_*/t8_* tasks marked complete
  with commit SHAs (84edb200, 719fe9a, cb525519)
- mma_tier_usage_reset_fix_20260610: status spec->shipped
- rag_phase4_sync_fix_20260610: status spec->shipped
- workspace_path_finalize_20260609: status active->completed,
  current_phase 1->complete, all tasks marked complete
  (c725270b, 93ec2809), verification flags flipped to true
2026-06-10 20:09:01 -04:00
ed 237f572592 docs(app_controller): replace fictional __init__ + register_hooks with real flow
The previous doc showed:
- A fictional AppState dataclass (does not exist)
- A fictional __init__ that creates manager objects in __init__
  (managers are lazy via __getattr__, created in _load_active_project)
- A fictional register_hooks(app) method (real flow is _init_actions
  called from init_state populates _predefined_callbacks)
- A fictional enable_test_hooks parameter (real signature is
  defer_warmup: bool = False, log_to_stderr: Optional[bool] = None;
  --enable-test-hooks is parsed by sloppy.py for HookServer, not here)

The new doc describes the real init flow (timeline anchors, 12 locks,
GUI health state, io_pool, warmup manager, flags) and points to the
actual line numbers in src/app_controller.py.
2026-06-10 20:07:08 -04:00
ed 5fa8a10ebf docs(testing): critical live_gui_workspace path fix + 8 new sections
CRITICAL fix:
- live_gui_workspace path: tmp_path_factory (banned) ->
  tests/artifacts/live_gui_workspace_<timestamp> (per-run timestamp)
  (per conductor/code_styleguides/workspace_paths.md)

8 new sections under 'Per-test Subprocess Resilience':
1. _reset_clean_baseline autouse fixture (mma_tier_usage +
   rag_config=default RAGConfig(), not None)
2. Watchdog and Hang Bounding (signal-based, 900s smart + 900s
   unconditional, replaces removed 30s daemon-thread)
3. Chroma Cache Path (tests/artifacts/.slop_cache/, parent-trailing-slash
   bug, pre-cleanup pattern in test_rag_phase4_final_verify)
4. xdist Worker Coordination (O_EXCL file lock, PYTEST_XDIST_WORKER,
   owner/client roles, stale lock demotion)
5. Required Test Dependencies Gate (sentence-transformers,
   uv sync --extra local-rag fix)
6. MMA and RAG State in reset_session() (5 buckets: mma_tier_usage
   pre-populated, rag_config fresh RAGConfig() not None)
7. _LiveGuiHandle __getitem__ (handle[0] / handle[1])

Expand 'Audit Script' -> 'Audit Scripts' (4 scripts total):
- check_test_toml_paths.py (existing)
- audit_main_thread_imports.py (startup_speedup)
- audit_weak_types.py (data_structure_strengthening)
- audit_no_models_config_io.py (config_state_owner styleguide)
2026-06-10 20:05:16 -04:00
ed 2e12b266e4 docs(mcp_client+ai_client): correct tool counts (15->18, 45->46)
- Total tool count: 45 -> 46 (per src/models.py:AGENT_TOOL_NAMES)
- Python AST tools: 15 -> 18 (3 structural mutators added:
  py_remove_def, py_add_def, py_move_def, py_region_wrap)
- py_get_symbol_info is fictional; replaced with the 4 actual
  structural mutator tools
- Cross-link from guide_ai_client.md updated
2026-06-10 20:02:01 -04:00
ed 07c1ed4928 docs(ai_client+api_hooks): lazy-loading + warmup endpoints (startup_speedup)
guide_ai_client.md:
- Add 'Module-Level Imports' section explaining that the 5 provider SDKs
  are NOT imported at module level; they're obtained via
  src.module_loader._require_warmed() after the WarmupManager loads them
  in the background. (Per startup_speedup_20260606: import src.ai_client
  went from ~1800ms to ~161ms.)

guide_api_hooks.md:
- Add 4 warmup endpoints to the endpoints table:
  /api/warmup_status, /api/warmup_wait?timeout=N,
  /api/warmup_canaries, /api/startup_timeline
- Add 'Warmup API' section with client methods + external script pattern
  (use get_warmup_wait() instead of time.sleep() race)
2026-06-10 20:00:37 -04:00
ed ca48d33d16 docs(simulations): update live_gui fixture signature to _LiveGuiHandle
The live_gui fixture in tests/conftest.py:467 now yields a _LiveGuiHandle
object (not a tuple). The handle exposes:
- .process, .gui_script, .workspace (Path to per-run workspace)
- .is_alive(), .ensure_alive(), .respawn_count
- __iter__ and __getitem__ for backward-compatible tuple unpacking

Also document the xdist O_EXCL file-lock coordination pattern and the
PYTEST_XDIST_WORKER env var owner/client role split.
2026-06-10 19:53:44 -04:00
ed c501035609 docs(gui_2): __getattr__ hasattr-guard + startup architecture section
Critical fix:
- Update __getattr__ code example to show the current bcdc26d0 version
  (with hasattr guard); old example showed the silent-None bug version

New section 'Startup Architecture (Lazy Imports, Profiler, Refresh Rate)':
- _LazyModule proxies (np, filedialog, Tk, win32gui, win32con)
- _FiledialogStub for headless/tkinter-less envs
- startup_profiler + render_warmup_status_indicator (defer_warmup=True)
- Native _detect_refresh_rate_win32 (ctypes.EnumDisplaySettingsW)
- immapp.run try/except error handling (native 0xc0000005 graceful degrade)
2026-06-10 19:52:11 -04:00
ed 5aa19e59e7 docs(rag): sync with src/rag_engine.py (collection attr, chroma path, dim validation)
Critical fixes:
- Chroma path: .rag/chroma/ -> .slop_cache/chroma_<collection_name>/
- self.vector_store -> self.client (PersistentClient) + self.collection (Collection)
- vector_store_backend -> vector_store.provider (nested VectorStoreConfig)
- RAGConfig schema: removed fictional fields (ast_chunking_enabled,
  vector_store_backend, vector_store_path, auto_index_on_load,
  auto_sync_interval_seconds, top_k); added VectorStoreConfig nested

New sections:
- Dimension Mismatch Protection: documents _validate_collection_dim
  and why it exists (silent corruption from provider switches)
- Path resolution resilience: index_file() CWD fallback for batched tests
2026-06-10 19:50:35 -04:00
ed f973fb275f docs(workspace_profiles): fix WorkspaceProfile schema (ini_content, show_windows, panel_states)
The 2026-06-05 live_gui_fragility_fixes refactor replaced the old 7-field
WorkspaceProfile (docking_layout: bytes, window_visibility, theme,
theme_fx_enabled, captured_at, description) with a 4-field model:
ini_content: str, show_windows, panel_states. tomli_w rejects bytes,
so the ini_content is now a plain ImGui ini string, not base64.

- Update Data Model class example + field table
- Update Serialization section + TOML example
- Update Profile Activation + Capturing Current State steps
- Update Layout Stability note (binary blob -> raw ini string)
- Replace 'Theme FX State is Global' limitation with 'Theme is Not Captured'
2026-06-10 19:46:46 -04:00
ed 7f58f980c6 docs(readme): fix WorkspaceProfile description + gui_2 line refs
- WorkspaceProfile entry: docking_layout bytes -> 4-field model description
- guide_gui_2 entry: _capture_workspace_profile line 601-606 -> 813-841
- Add: __getattr__ ui_ attrs fix, lazy imports, warmup, refresh rate
2026-06-10 19:43:59 -04:00
ed d82153c058 docs(models): sync WorkspaceProfile dataclass to 4-field model
Match the actual src/models.py WorkspaceProfile:
- name: str
- ini_content: str
- show_windows: Dict[str, bool]
- panel_states: Dict[str, Any]

Remove fictional fields (scope, auto_switch_triggers, description).
Remove non-existent LayoutPreset class (was a 2026-06-05 casualty).
2026-06-10 19:43:58 -04:00
ed 252905546e docs(report): test infrastructure hardening - batch goes green 2026-06-10 2026-06-10 18:08:26 -04:00
ed f51bfdcd05 fix(rag): remove INVESTIGATE diagnostic logging 2026-06-10 17:37:03 -04:00
ed 5a9b8d6891 fix(test+rag): clean chroma cache pre-test + add INVESTIGATE stderr for RAG init 2026-06-10 17:20:57 -04:00
ed a3abe49ca9 fix(test): poll for mma_state_update 'simulating' to land in test_gui_ux_event_routing 2026-06-10 15:45:44 -04:00
ed 2c924fe6df test(infra): poll-for-event race fixes + watchdog timeout bump + spec update 2026-06-10 15:14:35 -04:00
ed 563e609505 fix(test): poll for push_event to land in test_visual_mma_components 2026-06-10 15:13:25 -04:00
ed 8f7de45aca fix(rag): robust test polling for entry race + stress test timing tolerance 2026-06-10 14:43:27 -04:00
ed 80697e221a conductor(checkpoint): RAG phase 4 sync fix + test assertion fix - track complete 2026-06-10 13:55:06 -04:00
ed 15ffc3a34f fix(rag): make test assertion accept either file's content (robust to chroma ordering) 2026-06-10 13:53:52 -04:00
ed 2ad0d6a3f0 conductor(plan): Update RAG sync fix track state - sync works, retrieval assertion is separate 2026-06-10 13:29:18 -04:00
ed dc90c54161 fix(rag): reset rag_config to default RAGConfig() (not None) in _handle_reset_session 2026-06-10 13:15:36 -04:00
ed 989b2e6835 conductor(plan): New track for RAG phase 4 sync fix 2026-06-10 12:45:56 -04:00
ed 1772fa8fc2 conductor(checkpoint): Final Phase 2 complete - FR1+FR2 re-applied, sim test passes in batch 2026-06-10 12:13:16 -04:00
ed d945cb7432 fix(controller): re-apply FR1+FR2 (mma_tier_usage pre-population + _flush_to_project defensive d.get) 2026-06-10 11:55:22 -04:00
ed 14a329c1a9 conductor(plan): Adjust track after catastrophic git checkout - FR1+FR2 reverted, FR3+FR4 were no-ops 2026-06-10 11:45:56 -04:00
ed 4660b8c874 fix(sim): defensive .setdefault('paths', []) in test_context_sim_live 2026-06-10 11:33:15 -04:00
ed c729f8adaf conductor(plan): Update spec/plan for Phase 2 (live_gui sim test fragility) 2026-06-10 10:12:09 -04:00
ed e788512d93 conductor(plan): Mark mma_tier_usage_reset_fix_20260610 as complete 2026-06-10 09:59:26 -04:00
ed 428aa18948 conductor(checkpoint): Checkpoint end of Phase 1 (4 FRs + 4 regression tests) 2026-06-10 09:56:21 -04:00
ed b96d709efb test(reset): regression for 3 pre-existing controller bugs 2026-06-10 09:16:46 -04:00
ed 4284ec6eba fix(controller): remove 'persona_manager' from _LAZY_MANAGER_DEFAULTS 2026-06-10 09:03:12 -04:00
ed bc4651d1e4 fix(controller): re-add self.context_preset_manager init (lost in 72f8f466) 2026-06-10 08:56:35 -04:00
ed 1919aa8a32 fix(controller): _flush_to_project defensive against missing 'model' key 2026-06-10 08:48:57 -04:00
ed d80c94b973 fix(controller): pre-populate mma_tier_usage on reset (restore _flush_to_project contract) 2026-06-10 08:46:54 -04:00
ed f5021360f1 wip: pre-mma-tier-usage-reset-fix (preserve inherited working tree) 2026-06-10 08:43:18 -04:00
ed d304af5d22 sigh 2026-06-10 08:34:46 -04:00
ed 72f8f466fe fix(sim+api): proper wait loops, project switch endpoint, drop stale check
Three real fixes for the sim test + the live_gui coordination layer:

1. /api/project_switch_status endpoint in src/app_controller.py.
   The wait helper had been calling this endpoint but it did not exist;
   the helper always received a 404, fell back to {in_progress: False},
   and returned immediately even when a switch was in flight. Added the
   endpoint that reads _project_switch_in_progress, active_project_path,
   and _project_switch_error from the controller.

2. simulation/sim_base.py: replace time.sleep(2.0)/time.sleep(1.5) in
   the setup() with wait_io_pool_idle and wait_for_project_switch so
   the test does not click btn_md_only while a project switch is in
   flight. Also added the wait calls to sim_context.py for the same
   reason.

3. src/app_controller.py _handle_md_only: removed the is_project_stale()
   early-return. The stale state is a transient window during which the
   previous code dropped the click on the floor with a misleading
   'stale ui' status. The MD generation worker is safe to run from any
   project state; the action handler now always proceeds.

4. tests/test_extended_sims.py: set current_model to 'gemini-cli' so
   _do_generate does not raise KeyError('model') when the test
   overrides provider to gemini_cli.

KNOWN ISSUE: test_context_sim_live still fails with status
'switching to: temp_livecontextsim' after a 60s wait. The click
appears to be re-triggering a project switch via the GUI's render
loop. Root cause investigation deferred; the sim is async and the
test path is fragile.
2026-06-10 00:31:22 -04:00
ed 33d02bb11f fix(test): drop rmtree race in live_gui workspace creation
The session-scoped live_gui fixture deleted the shared workspace
before recreating it, which raced with the per-worker lock acquisition
and produced FileNotFoundError on .live_gui_owner.lock in xdist.
The per-run timestamped name (tests/artifacts/live_gui_workspace_<ts>/)
already provides enough isolation between pytest invocations, so the
rmtree is unnecessary. Use mkdir(exist_ok=True) only.
2026-06-09 23:31:09 -04:00
ed 283bb7085b fix(test): remove live_gui skip gate — lock mechanism handles coordination 2026-06-09 22:45:36 -04:00
ed 5568b59634 fix(test): single shared workspace, remove per-worker subdirs (keep lock mechanism) 2026-06-09 22:38:28 -04:00
ed 4bb19835db fix(test): per-worker workspace subdir + file-lock for xdist live_gui coordination 2026-06-09 22:23:33 -04:00
ed 38cb0f99b4 fix(test): add PID to workspace path for xdist worker isolation 2026-06-09 21:45:02 -04:00
ed 35f4cecb9b fix(test): catch OSError in workspace rmtree retry (broader than PermissionError) 2026-06-09 21:22:00 -04:00
ed aa776224f2 test(workspace): update fixture test to assert tests/artifacts/ not tmp dir 2026-06-09 21:06:06 -04:00
ed ccc2aa0be9 test(workspace): verify per-run workspace path and gitignore status 2026-06-09 20:45:24 -04:00
ed b8c15f8d92 fix(test): per-run workspace under tests/artifacts/ (replaces tmp_path_factory) 2026-06-09 20:42:43 -04:00
ed 93ec28097c docs(styleguide): add workspace_paths.md — hard rule for test workspace paths 2026-06-09 20:36:41 -04:00