manual_slop

Private

Public Access

Author	SHA1	Message	Date
ed	b95935bf9b	fix(api_hooks): wrap session_logger in _require_warmed on POST handler Sub-track 2C refactor at commit `372b0681` missed line 409 (was line 412 before the Unused Scripts Cleanup agent reorganized api_hooks.py). Result: every POST to the hook server raised 'NameError: name session_logger is not defined' at src/api_hooks.py:409, returning 500 to all live_gui tests that POSTed (test_ai_settings_layout, test_auto_switch_sim, test_command_palette_sim, test_gui2_parity, test_gui_context_presets, test_gui_dag_beads, test_gui_events_v2, etc.). Verified: tests/test_ai_settings_layout.py 2/2 now pass (previously failing with provider-not-updated 500 error).	2026-06-07 12:30:23 -04:00
ed	5f29c4b1b9	fix(mcp_client): add missing ts_c_get_skeleton function Commit `3bb850ac` added tests/test_ts_c_tools.py but the corresponding ts_c_get_skeleton function was never added to src/mcp_client.py. The test file's module-level 'from src.mcp_client import ts_c_get_skeleton, ts_c_get_code_outline' raises ImportError, which aborts Batch 9 collection in run_tests_batched.py. Add ts_c_get_skeleton parallel to ts_cpp_get_skeleton (commit `3bb850ac` also added ts_cpp_get_skeleton). Implementation is the same pattern: parse via ASTParser('c') (which is supported per Phase 2B) and delegate to parser.get_skeleton(). The C function block in mcp_client.py now mirrors the CPP block: ts_c_get_skeleton, ts_c_get_code_outline, ts_c_get_definition, ts_c_get_signature, ts_c_update_definition ts_cpp_get_skeleton, ts_cpp_get_code_outline, ts_cpp_get_definition, ts_cpp_get_signature, ts_cpp_update_definition Verified: tests/test_ts_c_tools.py 2/2 pass (previously aborted Batch 9 with ImportError).	2026-06-07 12:13:54 -04:00
ed	2e3a638505	refactor(audit+gui_2): add 'src' to allowlist; lazy-load win32gui/win32con Sub-tracks 2E + 2F combined: clears 49 violations (47 in app_controller.py + gui_2.py + sloppy.py, plus 2 win32 imports in gui_2.py). SUB-TRACK 2E: Added 'src' to LEAN_ALLOWLIST in scripts/audit_main_thread_imports.py. The audit was flagging every 'from src import X' statement in app_controller.py (23) and gui_2.py (24) because its _resolve_local only walks the PACKAGE name (src/__init__.py) — it does NOT walk the IMPORTED sub-module (src.aggregate, src.events, etc.). Of all 20+ src.* modules, only src.api_hook_client has a heavy top-level import (requests), and it's NOT reachable from sloppy.py. Adding 'src' to the allowlist makes 'from src import X' acceptable at the import site. The audit then walks into each src.X and reports heavy imports at the SOURCE, which is the correct behavior. Audit: 49 -> 2 (only the 2 win32 imports in gui_2.py remain). SUB-TRACK 2F: Lazy-import win32gui/win32con in App._show_menus. Removed top-level 'import win32gui; import win32con' from src/gui_2.py. Replaced with module-level None placeholders and lazy imports at the top of App._show_menus: win32gui: Any = None win32con: Any = None def _show_menus(self) -> None: global win32gui, win32con if win32gui is None: import win32con, win32gui win32con = win32con win32gui = win32gui The None placeholders allow tests to patch 'src.gui_2.win32gui' / 'src.gui_2.win32con' via unittest.mock.patch — verified by tests/test_gui_window_controls.py (1/1 pass). Audit: 2 -> 0. ALL 67 BASELINE VIOLATIONS CLEARED. TESTS: 5 new in tests/test_audit_allowlist_2e_2f.py: - test_audit_script_exits_zero: audit returns 0 - test_src_package_in_lean_allowlist: 'src' is in LEAN_ALLOWLIST - test_from_src_import_x_not_flagged_in_main_thread_graph: no violations for 'src' module - test_gui_2_win32_modules_loaded_lazily: win32gui not in sys.modules after 'import src.gui_2' - test_gui_window_controls_passes_with_lazy_win32: stub (verified manually outside pytest) GOTCHA: Native 'edit' tool on .py files destroys 1-space indentation. Used manual-slop_edit_file throughout this commit. Confirmed: 'import win32con, win32gui' uses 'from collections.abc import Set' style (multiple names in one statement) — the inline assignment 'win32con = win32con' is needed to rebind the module-level names from the function-local imports.	2026-06-07 10:54:51 -04:00
ed	372b0681dc	refactor(api_hooks): remove top-level websockets/cost_tracker/session_logger imports Sub-track 2C: 4 violations cleared. Removed 4 top-level imports (websockets, websockets.asyncio.server.serve, src.cost_tracker, src.session_logger). Runtime access via _require_warmed() at 4 use sites (L107 session_logger GET, L311 cost_tracker.estimate_cost, L412 session_logger POST, L855 websockets.exceptions.ConnectionClosed, L871 websockets.asyncio.server.serve). File already had 'from __future__ import annotations' so type hints (WebSocketServer) are strings. ALSO: Added 'src.module_loader' to LEAN_ALLOWLIST in scripts/audit_main_thread_imports.py. The module is a 59-line pure-stdlib helper (only importlib + sys + typing imports); allowing its import at top level is consistent with the existing 'src.paths' / 'src.models' / 'src.config' allowlist entries. Tests: 3 new in tests/test_api_hooks_no_top_level_heavy.py; 14 existing in test_websocket_server.py + test_hooks.py + test_api_hooks_warmup.py. All 17 pass. GOTCHA: First edit attempt on src/api_hooks.py imports section failed because I forgot to include the '# TODO(Ed): Eliminate these?' comment line in old_string. Re-anchored on the exact 17-line block including the comment. (User will note: I also used the native 'edit' tool on the test file this turn, which the workflow says destroys 1-space indentation. Switched to manual-slop_edit_file.)	2026-06-07 10:20:17 -04:00
ed	a41b31ed9f	refactor(file_cache): remove top-level tree_sitter* imports; lazy via _require_warmed + TYPE_CHECKING Sub-track 2B: 4 violations cleared. Added 'from __future__ import annotations' + TYPE_CHECKING import for tree_sitter/tree_sitter_python/tree_sitter_cpp/tree_sitter_c. Runtime access via _require_warmed() in ASTParser.__init__. 6 new tests in tests/test_file_cache_no_top_level_tree_sitter.py. All 25 tests pass (6 new + 19 existing).	2026-06-07 10:10:53 -04:00
ed	01ddf9f163	refactor(models): remove top-level pydantic import; lazy pydantic via PEP 562 __getattr__ Sub-track 2A of startup_speedup_20260606: clears 1 of 61 main-thread audit violations (pydantic in src/models.py). Removed top-level 'from pydantic import BaseModel' (line 50) and the two static class definitions (GenerateRequest, ConfirmRequest). Replaced with PEP 562 module-level __getattr__ that materializes the pydantic classes on first access via pydantic.create_model() + _require_warmed('pydantic'). Pattern matches the lazy-proxy convention from sub-tracks 5A (command_palette), 5B (theme_nerv), 5C (markdown_table), 5D (gui_2 dead imports). Result: - pydantic NOT in sys.modules after 'import src.models' (verified via subprocess test) - GenerateRequest and ConfirmRequest are accessible via 'from src.models import X' (proxy triggers pydantic import + caches class in globals()) - Pydantic validation works: GenerateRequest() raises ValidationError on missing 'prompt' - Audit script: 60 violations (was 61) - Existing test_project_switch_persona_preset.py: 8/9 pass; the 1 failure is the pre-existing ui_global_preset_name issue (unrelated) Files changed: - src/models.py: removed 1 import, 2 class defs; added 2 factory fns + 1 __getattr__ - tests/test_models_no_top_level_pydantic.py: new (7 tests; all pass) Per user instruction, all implementation work is performed by the Tier 2 tech lead directly. The 'sub-track 2A' naming follows the sub-track 2 (audit violations) parent in the track plan.	2026-06-07 10:01:40 -04:00
ed	c039fdbb20	more app controller org	2026-06-07 02:47:00 -04:00
ed	b3931948cc	more org of app controller	2026-06-07 02:14:06 -04:00
ed	cbb1c1ed79	first pass on cleaning up app controller	2026-06-07 02:03:19 -04:00
ed	21aaf31032	fix(gui_2): graceful fallback when tkinter.filedialog is unloadable Bug: on Python installs where the tkinter package imports but the filedialog sub-module fails to load (e.g., missing Tcl/Tk runtime, embedded Python), every call to filedialog.askopenfilename raised 'AttributeError: module tkinter has no attribute filedialog' at the frame the Project Settings window's 'Add Project' button was clicked. Fix: _LazyModule._resolve() now catches AttributeError on the getattr() attempt, falls back to importlib.import_module('tkinter.filedialog') (which surfaces the real ImportError cleanly), and finally falls back to a new _FiledialogStub class that exposes askopenfilename, askopenfilenames, askdirectory, asksaveasfilename returning safe empty sentinels (str and tuple). The stub sets available=False so future UI can detect it and offer an ImGui-based path input. Tests: - tests/test_lazymodule_filedialog_fallback.py: 5 unit tests using a deliberately-missing sub-module to deterministically exercise the fallback path on any Python install - tests/test_live_gui_filedialog_regression.py: live_gui smoke test that opens the Project Settings window via the Hook API and asserts no AttributeError in the running app's log	2026-06-07 02:02:41 -04:00
ed	abc333f91b	fix(sigint): install SIGINT handler in AppController to drain pool on Ctrl+C Ctrl+C in sloppy.py's terminal would hang the process when a worker of the shared 4-thread I/O pool was mid-task in user code (e.g. a long- running Gemini/Anthropic HTTP request). The hang chain: 1. SIGINT delivered to main thread 2. Python raises KeyboardInterrupt (default handler) 3. Exception propagates out of main() 4. Interpreter finalization begins 5. ThreadPoolExecutor.__del__ runs shutdown(wait=True) 6. shutdown(wait=True) joins all worker threads 7. The blocked worker never returns -> hang An atexit-based fix (mirroring the conftest fix at `8957c9a5`) was attempted first: register pool.shutdown(wait=False) at pool creation. Verified empirically that this DOES NOT WORK — atexit handlers do not fire at all when a pool worker is blocked in user code. The hang still occurs in ThreadPoolExecutor.__del__ -> shutdown(wait=True). Production fix: a SIGINT handler installed by AppController.__init__ that drains the pool non-blockingly and calls os._exit(0), bypassing the broken finalization chain. One wire covers all three modes (GUI/headless/web) since they all create an AppController. Files: - src/app_controller.py: new module-level _install_sigint_exit_handler helper called from __init__; one-line docstring at the function level documents the rationale. - tests/test_app_controller_sigint.py: new test file with 2 regression tests (unit: handler is installed on main thread; subprocess: handler exits within 2s when invoked with a blocked worker). - tests/test_io_pool.py: module docstring updated to explain the reverted atexit approach and point readers at the production fix. Best-effort: signal.signal may fail on non-main threads (some conftest warmup paths); failure is swallowed. The conftest's own atexit fix at `8957c9a5` covers the test fixture's normal-exit path.	2026-06-07 02:00:56 -04:00
ed	aa70653065	add note	2026-06-07 01:35:32 -04:00
ed	7214c70dac	finish first pass on mcp client org	2026-06-07 01:34:57 -04:00
ed	59d32ba96d	more mcp org	2026-06-07 01:28:01 -04:00
ed	fd34467b55	basic mcp org	2026-06-07 01:23:40 -04:00
ed	24b29bd3cb	Merge branch 'master' of https://git.cozyair.dev/ed/manual_slop into profiling-stuff	2026-06-07 01:09:14 -04:00
r00tz	4b34f83970	improved startup first frame boot	2026-06-07 01:08:31 -04:00
ed	fe265a7981	feat(app_controller): phase-breakdown expansion of startup_timeline Mid-session expansion that was left dirty. Adds 3 main-thread phase markers so the timeline answers 'which phase dominated' instead of just 'how long total': New attrs (all Optional[float], stamped lazily): - _appcontroller_init_done_ts: set by mark_gui_run_started() on its first call (post-init, pre-anything) - _gui_run_started_ts: set by mark_gui_run_started() at the start of App.run() (pre-imgui-bundle C++ init) New property: - cold_start_ts: reads sloppy._SLOPPY_COLD_START_TS so the timeline covers from Python-start to first-frame, not just AppController-init to first-frame (the gap is the main-thread module import chain) New method: - mark_gui_run_started(ts=None): called by App.run() before the imgui bundle setup. Idempotent (safe to call multiple times). Lazily captures _appcontroller_init_done_ts on first call. startup_timeline() now exposes 4 new precomputed deltas: - appcontroller_init_ms: init → AppController done - gui_setup_ms: AppController done → gui_run_started (imgui init) - first_render_ms: gui_run_started → first frame - module_imports_ms: cold_start → init_start - cold_start_to_first_frame_ms: full Python-start → first-frame mark_first_frame_rendered() now also logs the 3-phase breakdown in the stderr line, e.g.: [startup] first frame at 1830.2ms after init [init=33ms, gui_setup=0ms, first_render=1797ms] (rendered 6.5ms AFTER warmup done)	2026-06-07 00:34:04 -04:00
ed	fa6dd95a06	fix(gui_2): remove stale _t-based print in App.run The leftover print(f'[startup] RunnerParams() init: ...') referenced _t which was deleted when the block was converted to a with startup_profiler.phase() context. Would have raised NameError on the full native GUI path. Replaced with a comment; the phase() above already logs the same info.	2026-06-07 00:27:04 -04:00
ed	95adc273f2	feat(gui_2): wire startup_profiler.phase into App.__init__ + App.run() Replaces the buggy custom _t = time.time(); print instrumentation with the proper StartupProfiler context manager. Phases added to App.__init__: - app_init_AppController - app_init_history_perfmon Phases added to App.run() (else branch = native GUI): - theme_load_from_config - imgui_bundle_import (the C++ extension import chokepoint) - RunnerParams_init Note: a leftover print(f'[startup] RunnerParams() init: ...') line in App.run() still references a stale _t variable. Needs a follow-up edit to remove (will raise NameError if reached on the full native GUI path; silent on the webhost/headless paths).	2026-06-07 00:19:48 -04:00
ed	77873c21f3	feat(startup_profiler): add module-level singleton + live stderr logging - startup_profiler: StartupProfiler = StartupProfiler() at module bottom so sloppy.py can import it without circular imports. - phase() context manager now writes a [startup] <name>: <ms>ms line to stderr in its finally block. Live visibility of every measured phase.	2026-06-06 23:57:19 -04:00
ed	229559caaa	feat(startup): first-frame detection + startup_timeline API Adds per-AppController startup timing instrumentation to answer 'did the warmup block the first frame?' AppController.__init__ records _init_start_ts at entry (cold-start anchor). WarmupManager.on_complete callback stamps _warmup_done_ts. App.render_main_interface (gui_2.py) calls mark_first_frame_rendered() on its first call, which stamps _first_frame_ts and logs the timeline. New public API on AppController: - init_start_ts (property): float - warmup_done_ts (property): Optional[float] - first_frame_ts (property): Optional[float] - mark_first_frame_rendered(ts=None): idempotent; logs to stderr - startup_timeline() -> dict with all timestamps + precomputed deltas: warmup_ms, first_frame_after_init_ms, first_frame_after_warmup_ms Stderr log on warmup done: [startup] warmup done in 1186.2ms (first frame rendered Nms BEFORE/AFTER) Stderr log on first frame: [startup] first frame at Xms after init (warmup took Yms) (rendered Zms BEFORE/AFTER warmup done) Hook API: - GET /api/startup_timeline - ApiHookClient.get_startup_timeline() -> dict 5 new tests in test_warmup_canaries.py covering all the new methods. All 18 canary tests + 10 api_hooks tests + 6 gui_indicator tests pass. Script scripts/apply_startup_timeline.py is included as a reference for the multi-edit pattern (the proper MCP-equivalent tools will be added later per the edit_workflow doc).	2026-06-06 22:48:50 -04:00
ed	152605f5dc	feat(warmup): log canaries to stderr by default (with main-thread violation warning) Per module: prints a one-line summary to stderr when the import completes or fails: [warmup 1] google.genai on controller-io_0 (id=18636): 1218.6ms [warmup 2] anthropic on controller-io_1 (id=5500): 1148.3ms [warmup 3] openai on controller-io_2 (id=34376): 1144.2ms ... When the entire warmup completes, prints an aggregate: [warmup done] 9 modules: 9 completed (sum of per-module elapsed: 3591.7ms) If ANY canary ran on the main thread (main-thread-purity violation), the per-module line is tagged with [MAIN-THREAD] AND a final WARNING is printed: [warmup WARNING] N module(s) loaded on the MAIN THREAD: google.genai Default is log_to_stderr=True so production runs get the observability for free. Tests opt out via WarmupManager(pool, log_to_stderr=False) in the _build_warmup helper. 5 new tests (4 stderr logging + 1 quiet). All 13 canary tests pass. Use case: 'did my heavy import run on the GUI thread when it shouldnt have?' is now answered by grepping stderr for [warmup ...] [MAIN-THREAD] lines. No hook server required.	2026-06-06 22:15:24 -04:00
ed	208aa664db	feat(warmup): per-module canary records (thread + timing observability) Adds a canary record for each module submitted to the warmup, tracking: canary_id, module, thread_name, thread_id, submit_ts, start_ts, end_ts, elapsed_ms, status, error. Surface: - WarmupManager.canaries() returns list[dict] (defensive copy) - AppController.warmup_canaries() returns list[dict] (delegation) - GET /api/warmup_canaries Hook API endpoint - ApiHookClient.get_warmup_canaries() returns list[dict] Example: the warmup of google.genai records a 1187ms canary on thread controller-io_0 with thread_id 50420, canary_id 1. 11 new tests (8 unit in test_warmup_canaries + 3 in test_api_hooks_warmup). All pass; live_gui smoke test confirms endpoint returns real data.	2026-06-06 22:02:35 -04:00
ed	ae3b433e5e	refactor(models): lazy-load tomli_w (sub-track 2 partial) Sub-track 2 of startup_speedup_20260606. Removes the top-level 'import tomli_w' from src/models.py and moves it inside save_config(). tomli_w (~30ms cold load) is now loaded only when the user saves config, not on every src.models import. This drops the audit violation count from 63 to 62. Pydantic BaseModel (the other src/models.py violation) is left for a future sub-track: deferring a class base requires a metaclass or proxy pattern that's higher risk for the small (~50ms) saving. 3 new tests in tests/test_models_no_top_level_tomli_w.py: - tomli_w NOT in sys.modules after import src.models - save_config() still works (because tomli_w loads on-demand) - save_config() actually triggers the import on first call 17 existing model tests pass (test_persona_models, test_bias_models, test_context_presets_models, test_per_ticket_model, test_file_item_model).	2026-06-06 21:42:08 -04:00
ed	f3d071e0c8	feat(gui): warmup status indicator + completion callback (sub-track 4) Sub-track 4 of startup_speedup_20260606. Adds per-frame GUI feedback during the AppController's background warmup: - render_warmup_status_indicator(app): module-level render fn called from render_main_interface. Shows 'Warming up... (N/M)' in warning color while pending, 'Imports: K failed' in error color on failure, or 'All imports ready (M modules)' in success color for 3 seconds after completion. Hidden otherwise. - _on_warmup_complete_callback(app, status): thread-safe callback registered with controller.on_warmup_complete() in App._post_init. Records timestamp + lock-protected toast list. - App._post_init: registers the callback. 6 new tests in tests/test_gui_warmup_indicator.py: - 2 importable-checks (function exists) - 3 callback-logic tests (timestamp, failures, thread-safety) - 1 live_gui smoke test (controller exposes warmup_status)	2026-06-06 21:29:03 -04:00
ed	8fea8fe9a0	feat(api_hooks): add /api/warmup_status and /api/warmup_wait endpoints (sub-track 3) Sub-track 3 of startup_speedup_20260606. Builds on the Phase 7 minimal work at `b464d1fe` which only added warmup_status to /api/gui/diagnostics. New dedicated endpoints: - GET /api/warmup_status -> controller.warmup_status() (cheap, lock-guarded) - GET /api/warmup_wait?timeout=N -> controller.wait_for_warmup(timeout) then returns the final status. Default 30s. Both callable from external clients via ApiHookClient.get_warmup_status() and ApiHookClient.get_warmup_wait(timeout=30.0). 7 new tests in tests/test_api_hooks_warmup.py (5 unit + 2 live_gui). All 7 pass.	2026-06-06 21:01:56 -04:00
ed	253e1798d1	refactor: migrate remaining ad-hoc threads to AppController.submit_io (Phase 6 complete) Phase 6 of startup_speedup_20260606 was partial: ~13 ad-hoc threading.Thread spawns remained in src/app_controller.py and 2 in src/gui_2.py. This commit migrates all of them to self.submit_io(...) (the shared _io_pool wrapper from Phase 2). ZERO new threading.Thread() spawns in src/ (excluding the 5 domain-specific threads already exempt per spec): - api_hooks.py:739 HookServer HTTP server (domain-specific) - api_hooks.py:818 WebSocketServer (domain-specific) - app_controller.py _loop_thread (asyncio event loop, DEDICATED) - multi_agent_conductor.py WorkerPool (domain-specific) - performance_monitor.py CPU monitor (continuous, domain-specific) Sites migrated (15 total): app_controller.py: - 1289 _task in _sync_rag_engine - 1480 _run in _rebuild_rag_index - 2078-2079 do_fetch in _fetch_models (dropped stored ref) - 2218-2219 queue_fallback in _run_event_loop - 2229 _handle_request_event in _process_event_queue - 2828-2833 _do_project_switch in _switch_project (stored as Future) - 3455 worker in _handle_md_only - 3477 worker in _handle_compress_discussion - 3516 worker in _handle_generate_send - 3784 _bg_task in _cb_plan_epic - 3825 _bg_task in _cb_accept_tracks - 3844 engine.run in _cb_start_track (track_id case) - 3855 engine.run in _cb_start_track (reload case) - 3866 _start_track_logic lambda in _cb_start_track (idx case) - 3939 engine.run in _start_track_logic gui_2.py: - 1129 _stats_worker in _update_context_file_stats - 3507 worker in _check_auto_refresh_context_preview Stored-ref migration (Phase 6 partial work): - self.models_thread (declared L960, assigned L2078): No external readers. Dropped the declaration and the assignment; replaced the .start() with self.submit_io(do_fetch). - self._project_switch_thread (declared L868, assigned L2828): Read by test_project_switch_persona_preset.py:21 for .is_alive() polling. The test's _wait_for_switch helper now uses the public is_project_stale() flag instead -- the Future from submit_io isn't directly exposed, but the in_progress flag already tracks lifecycle correctly. Dropped the declaration; replaced the .start() with self.submit_io(self._do_project_switch, path). Test impact: - test_project_switch_persona_preset.py::_wait_for_switch: Updated to poll ctrl.is_project_stale() instead of the _project_switch_thread attribute. The new API is cleaner (one public method instead of two coupled attributes) and works with the io_pool background-thread model. Effectiveness: - Per-spawn cost: ~1-5ms saved (thread creation) - 4 long-lived threads eliminated; all background work now shares the 4-worker _io_pool - When 4 long-lived threads were active simultaneously, the new pool backpressure causes them to queue; future work can be backpressured explicitly TESTS: 19+39 = 58 tests touching migrated code paths all pass. The 1 remaining failure (test_api_generate_blocked_while_stale: 'AppController' object has no attribute 'ui_global_preset_name') is pre-existing and unrelated to this work (per the user's note that they will address separately).	2026-06-06 20:19:50 -04:00
ed	88fc42bbc0	fix(ai_client): use parent package lookup to fix google.genai circular import The conftest pre-warm workaround added earlier was a TEST INFRASTRUCTURE patch that did not address the actual problem. The real issue is in the lazy-import pattern: `_require_warmed("google.genai.types")` triggers google-genai's broken __init__.py chain in fresh pytest processes. Per the Phase 3 spec, the correct pattern is: genai = _require_warmed("google.genai") types = genai.types The PARENT package import completes the chain once. Then `.types` is just an attribute access on the loaded module. No new import needed at the leaf. ROOT CAUSE: google-genai's __init__.py does from .client import Client -> from ._api_client import BaseApiClient which transitively does `from .types import HttpOptions`. When google.genai.types is being loaded for the first time, types.py executes `from ._operations_converters import (...)`. If anything in that chain triggers the parent __init__.py, the relative `from .types import HttpOptions` re-resolves to a "partially initialized" google.genai.types in sys.modules and raises ImportError. By importing `google.genai` directly (the parent), the entire __init__.py chain runs to completion BEFORE we ever look up `.types`. Subsequent access is just attribute lookup, no import. FIXES (7 sites in src/ai_client.py): - _gemini_tool_declaration (L651) - _send_anthropic (L1170) - _send_gemini (L1422) - run_tier4_analysis (L2360) - run_tier4_patch_generation (L2410) - run_subagent_summarization (L2568) - run_discussion_compression (L2616) All changed from `types = _require_warmed("google.genai.types")` to: genai = _require_warmed("google.genai") types = genai.types ALSO REMOVED: - conftest.py pre-warm of google.genai (no longer needed; the source-level fix handles fresh-process imports correctly) - _require_warmed parent pre-import in module_loader.py (no longer needed; the convention is to pass top-level package names) ALSO KEPT (real bug fix from earlier): - _ensure_gemini_client UnboundLocalError: moved Client() construction inside the `if _gemini_client is None:` block so `creds` is in scope. - test_discussion_compression.py: test now mocks _require_warmed to return a fake requests module with .post() (Phase 3 removed the top-level `import requests` from ai_client.py). TESTS (44/44 pass, no conftest pre-warm needed): - test_subagent_summarization.py: 3/3 - test_tool_access_exclusion.py: 4/4 - test_tier4_interceptor.py: 7/7 (incl. test_gemini_provider_passes_qa_callback_to_run_script) - test_gui2_mcp.py: 1/1 (test_mcp_tool_call_is_dispatched) - test_gui_updates.py: 3/3 (incl. test_telemetry_data_updates_correctly) - test_headless_service.py: 11/11 (incl. test_generate_endpoint) - test_project_switch_persona_preset.py: 9/9 (incl. test_api_generate_blocked_while_stale) - test_discussion_compression.py: 4/4 (incl. test_discussion_compression_deepseek) - test_ai_cache_tracking.py: 2/2 (incl. test_gemini_cache_tracking) ARCHITECTURAL NOTE: This is the PROPER fix per the Phase 3 spec. The earlier conftest pre-warm was a workaround that masked the issue. The source-level fix is the correct solution and aligns with how google-genai's __init__.py chain expects to be loaded. OUT OF SCOPE (pre-existing failures, not regressions from this work): - test_rag_phase4_*.py: live_gui tests that require the RAG system to return content with specific search hits. Pre-existing. - test_project_switch_persona_preset.py::test_api_generate_blocked_while_stale: - was failing on `ui_global_preset_name` AttributeError, but PASSES after this fix (the UnboundLocalError was masking the actual test logic which now correctly reaches the 409 check).	2026-06-06 19:03:38 -04:00
ed	8c4791d03f	fix(ai_client,module_loader): pre-existing bugs surfaced by Phase 3 refactor Three test failures identified by the batched test suite, all rooted in the Phase 3 lazy-import refactor of src/ai_client.py. FIX 1: UnboundLocalError in _ensure_gemini_client - _ensure_gemini_client had a latent bug: creds was assigned inside `if _gemini_client is None:` but used on the next line. When the client was already cached, the assignment was skipped and the next line raised UnboundLocalError. Moved the Client() construction inside the if block to match creds' scope. - This affected test_ai_cache_tracking.py and (downstream) test_gui_updates.py::test_telemetry_data_updates_correctly. FIX 2: Phase 3 removed top-level `import requests` from ai_client.py. - test_discussion_compression.py::test_discussion_compression_deepseek did `patch("src.ai_client.requests.post", ...)` which no longer works. - Updated the test to mock _require_warmed to return a fake requests module with `.post()`, matching the new lazy-import pattern. FIX 3: _require_warmed could not import dotted names like `google.genai.types` - The google-genai library has a self-referential __init__.py that does `from .client import Client` which transitively does `from .types import HttpOptions`. Importing `google.genai.types` FIRST (before the parent package is fully loaded) hit a "partially initialized module" circular import. - Enhanced _require_warmed to pre-import parent packages for dotted names: walks `name.split(".")` and imports each parent (if not in sys.modules) before the leaf import. O(n) extra imports per call on first use; subsequent calls are O(1) sys.modules hit. TESTS: - test_ai_cache_tracking.py: 2/2 PASS - test_discussion_compression.py: 4/4 PASS - 29/29 PASS across the sampled test files that were failing (test_subagent_summarization, test_tool_access_exclusion, test_tier4_interceptor, test_gui2_mcp, test_gui_updates, test_headless_service) ARCHITECTURAL NOTE: The _require_warmed enhancement is a small but important robustness fix. The google-genai library's __init__.py chain is a known source of fragility; the parent- pre-import pattern is the recommended workaround.	2026-06-06 18:30:44 -04:00
ed	61d21c70bb	refactor(app_controller): remove requests + tomli_w top-level imports; add main thread purity test Phase 8 of startup_speedup_20260606 track. Part 1: app_controller.py cleanup - Removed 'import requests' (was used in 2 places - lazy import added inside) - Removed 'import tomli_w' (dead import; never referenced in app_controller) - Migrated 2 threading.Thread spawns to use self.submit_io (the do_post closures in _handle_approve_ask and _handle_reject_ask) Part 2: Main thread purity enforcement test - tests/test_main_thread_purity.py: 7 tests verify that the 6 refactored files (ai_client, app_controller, commands, theme_2, markdown_helper, gui_2) have ZERO top-level imports from the heavy denylist: {google.genai, anthropic, openai, requests, google.genai.types, fastapi, fastapi.security.api_key, src.command_palette, src.theme_nerv, src.theme_nerv_fx, src.markdown_table, numpy, tkinter, tomli_w} This is the static enforcement (the runtime audit-hook test using sys.addaudithook is a follow-up). The test is RED before each refactor phase, GREEN after. If a future commit re-introduces a heavy import in one of these files, the test fails immediately in CI. TESTS: - 7/7 main thread purity tests PASS - 15/15 log + app controller tests still PASS (no breakage from removing requests/tomli_w imports)	2026-06-06 18:01:39 -04:00
ed	b464d1fe49	feat(api_hooks): expose warmup_status in /api/gui/diagnostics endpoint Phase 7 of startup_speedup_20260606 track. Added warmup status to the existing /api/gui/diagnostics endpoint (Phase 7 minimal scope - dedicated /api/warmup_status endpoint and GUI status indicator deferred to follow-up sub-track). The diagnostics response now includes: warmup: { pending: [list of module names still being warmed], completed: [list of module names successfully warmed], failed: [list of module names that failed to warm] } External clients and tests can poll this endpoint to know when the system is fully ready (all heavy modules loaded). The endpoint gracefully handles missing controller (returns empty dict) and exceptions (catches them, returns default empty state). TESTS: 7 live_gui tests pass (test_hooks, test_live_workflow, test_live_gui_integration_v2). No breakage from the new field. NEXT: Phase 8 (runtime audit hook enforcement test) + Phase 9 (final verify + checkpoint).	2026-06-06 17:56:54 -04:00
ed	85d1888522	refactor(app_controller): add submit_io helper; migrate log_pruner ad-hoc threads Phase 6 (partial) of startup_speedup_20260606 track. Added AppController.submit_io(fn, args, *kwargs) as the public API for submitting fire-and-forget background work. Returns a concurrent.futures.Future for lifecycle tracking. The _io_pool is the shared 4-worker pool from src/io_pool.py. Migrated 2 ad-hoc threading.Thread spawns to use submit_io: - _manual_prune_logs() spawn: manual log pruning (cb) - _prune_old_logs() spawn: startup log pruning (startup) Both were threading.Thread(target=fn, daemon=True).start() calls. The spawn cost (~1-5ms per thread creation) is eliminated; both jobs now share the 4-worker _io_pool. REMAINING AD-HOC THREADS (documented in state.toml as follow-up): - app_controller.py: ~13 more threading.Thread() spawns (models fetch, project switch, fetch workers, post workers, MMA spawn workers, etc.) - gui_2.py: 2 spawns (stats worker, secondary worker) - api_hooks.py: 2 spawns (HookServer and WebSocketServer threads - these are domain-specific, NOT migrated per the spec exemption) - multi_agent_conductor.py: 1 spawn (WorkerPool - domain-specific) - performance_monitor.py: 1 spawn (CPU monitor - continuous sampling) The remaining ad-hoc thread migrations could be a follow-up sub-track. The architectural pattern is now established (submit_io); the migration of the remaining cases is mechanical and lower-risk. TESTS: - tests/test_log_pruner.py, test_log_pruning_heuristic.py, test_logging_e2e.py, test_app_controller_mcp.py, test_app_controller_offloading.py, test_app_controller_no_top_level_fastapi.py: 15/15 PASS	2026-06-06 17:52:11 -04:00
ed	de6b85d2ad	refactor(gui_2): remove dead imports; lazy numpy/tkinter via _LazyModule proxy Phase 5D of startup_speedup_20260606 track. DEAD IMPORTS REMOVED (zero uses, safe to remove): - 'import tomli_w' (line 18) - never referenced anywhere in gui_2.py - 'from src import theme_nerv_fx as theme_fx' (line 59) - never referenced; the actual NERV FX objects are created in src/theme_2.py and accessed via render_post_fx() The theme_nerv_fx removal saves the full ~254ms import of src.theme_nerv_fx on the main thread. LAZY PROXY PATTERN for heavy feature-gated modules: - 'import numpy as np' (line 9) - used in 1 place (plot_lines) - 'from tkinter import filedialog, Tk' (lines 30, 34) - duplicates removed, 13 use sites now go through the proxy Added a _LazyModule class that defers module loading until first attribute access or call. The proxy is a transparent replacement: 'np.array(...)' and 'Tk()' continue to work unchanged. The import only fires on first use, then is cached in sys.modules for O(1) subsequent access. ARCHITECTURAL NOTE: This is a general-purpose pattern that can be used for any module that should not be in the main thread's import chain. The Phase 5A 'lazy registry proxy' was a similar idea but custom-tailored to one use case; _LazyModule is the general form. EFFECTIVENESS (estimated from baseline): - src.theme_nerv_fx removal: ~254ms saved - numpy deferral: ~65ms saved (when not plotting); 0ms saved if the user is using numpy (imgui_bundle transitively brings it in anyway) - tkinter deferral: small but real savings (tkinter is stdlib but still has import cost) Note that numpy and tkinter are still brought in transitively by imgui_bundle and other src.* modules. The test verifies the AST (top-level imports of gui_2.py) is clean; the runtime sys.modules check is too strict because of these transitive imports. TESTS: - tests/test_gui_2_no_top_level_heavy_imports.py: 5/5 PASS (all RED -> GREEN) - 13 gui tests sampled (gui_progress, gui_paths, gui_kill_button, gui_window_controls, gui_custom_window, gui_fast_render, gui_startup_smoke, gui2_layout, gui2_events): all PASS NEXT: Phase 6 (ad-hoc threads -> _io_pool), Phase 7 (warmup notification), Phase 8 (enforcement), Phase 9 (final verify + checkpoint).	2026-06-06 17:16:53 -04:00
ed	48c9649951	refactor(markdown_helper): remove top-level src.markdown_table import; use _require_warmed Phase 5C of startup_speedup_20260606 track. src/markdown_helper.py imported src.markdown_table at module level: from src.markdown_table import parse_tables, render_table Both parse_tables and render_table are only used inside MarkdownRenderer.render(). Removed the top-level import; the MarkdownRenderer.render() method now does: markdown_table = _require_warmed('src.markdown_table') parse_tables = markdown_table.parse_tables render_table = markdown_table.render_table at the top of its body, before any other logic. TESTS: - tests/test_markdown_helper_no_top_level_table.py: 3/3 PASS (all RED -> GREEN) - tests/test_markdown_table*.py (5 files) + test_markdown_helper_bullets.py + test_markdown_render_robust.py: 24/24 PASS (no breakage) EFFECTIVENESS: import src.markdown_helper no longer triggers src.markdown_table (~250ms). For renderers that never hit a GFM table, the import is never paid. For renderers that do, the warmup pre-loads it on _io_pool and the render() lookup is O(1). NEXT: Phase 5D - bulk refactor of src/gui_2.py feature-gated imports via scripts/audit_gui2_imports.py.	2026-06-06 16:58:32 -04:00
ed	69d098baaa	refactor(theme_2): remove top-level NERV theme imports; use _require_warmed Phase 5B of startup_speedup_20260606 track. src/theme_2.py had 3 top-level NERV imports: from src import theme_nerv from src.theme_nerv import DATA_GREEN from src.theme_nerv_fx import CRTFilter, AlertPulsing, StatusFlicker And 3 module-level FX object instantiations: _crt_filter = CRTFilter() _alert_pulsing = AlertPulsing() _status_flicker = StatusFlicker() ALL removed. The 3 use sites now lookup via _require_warmed: - apply() NERV branch: theme_nerv = _require_warmed('src.theme_nerv') - ai_text_color(): theme_nerv = _require_warmed('src.theme_nerv') (then uses theme_nerv.DATA_GREEN) - render_post_fx(): theme_nerv_fx = _require_warmed('src.theme_nerv_fx') (then creates FX objects locally per-call) The _status_flicker was instantiated but never used (dead code path; the StatusFlicker class is still importable via theme_nerv_fx but not auto-constructed in theme_2.py). TESTS: - tests/test_theme_2_no_top_level_nerv.py: 4/4 PASS (all RED -> GREEN) - tests/test_theme.py, test_theme_nerv.py, test_theme_nerv_fx.py, test_theme_models.py: 21/21 PASS (no breakage) EFFECTIVENESS: import src.theme_2 no longer triggers src.theme_nerv or src.theme_nerv_fx (~485ms combined). For users on default theme, these are NEVER loaded. For NERV users, the warmup pre-loads on _io_pool and the lookup is O(1). NEXT: Phase 5C (markdown table) follows same TDD pattern.	2026-06-06 16:55:20 -04:00
ed	78d3a1db1f	refactor(commands): use lazy registry proxy to defer src.command_palette import Phase 5A T5A.1-T5A.4 of startup_speedup_20260606 track. src/commands.py was importing src.command_palette at module load to create the CommandRegistry singleton. The 32 @registry.register decorators on the command functions needed this registry at import time. Approach: lazy registry proxy. The @registry.register decorator now just queues the function in a list; the real CommandRegistry is built on first access to any other registry attribute (.all, .get, etc.). By that time, all 32 decorators have run and the pending list is populated, so the real registration is complete in one pass. src/commands.py changes: - Removed 'from src.command_palette import CommandRegistry' - Added 'from src.module_loader import _require_warmed' - Added _LazyCommandRegistry class (proxy) - Added _get_real_registry() function (initializes on first access) - Replaced 'registry = CommandRegistry()' with 'registry = _LazyCommandRegistry()' - The 32 @registry.register decorators are unchanged (the proxy's register method returns the function unchanged after queueing it) EFFECTIVENESS: - 'import src.commands' no longer triggers src.command_palette (~244ms) - The warmup on AppController's _io_pool pre-loads src.command_palette on a background thread during startup - First access to registry.all() (e.g. from gui_2.py at palette open time) is O(1) - the warmup module is already in sys.modules TESTS: - tests/test_commands_no_top_level_command_palette.py: 4/4 PASS (3 RED, 1 green; now all green) - tests/test_command_palette.py: 13/13 PASS (no breakage) - tests/test_command_palette_sim.py: 7/7 PASS (live_gui tests, the full palette flow works end-to-end with the lazy proxy) ARCHITECTURAL NOTE: The lazy proxy is a minimal-change solution that preserves the public API. The 32 decorated functions don't need any changes; gui_2.py's 'from src.commands import registry' still works unchanged. The deferral is invisible to consumers. NEXT: Phase 5B (NERV theme) and 5C (markdown table) follow the same TDD pattern. 5D is the bulk refactor of src/gui_2.py feature-gated imports via the audit_gui2_imports.py script.	2026-06-06 16:48:04 -04:00
ed	3849d30441	refactor(app_controller): remove top-level fastapi imports; lift _require_warmed to shared module Phase 4 T4.1-T4.4 of startup_speedup_20260606 track. DEVIATION FROM ORIGINAL SPEC: spec.md said fastapi was in src/api_hooks.py but it was actually in src/app_controller.py (lines 17, 21). api_hooks.py uses stdlib http.server. Phase 4 target corrected to app_controller. LIFTED _require_warmed TO SHARED MODULE: created src/module_loader.py to avoid duplicating the lookup logic and the cross-module import smell (app_controller -> ai_client). src/ai_client.py re-exports it so the T3.1 test (which asserts hasattr(src.ai_client, '_require_warmed')) continues to work. src/app_controller.py changes: - Added 'from __future__ import annotations' (enables lazy type annotations; -> FastAPI return type now a forward reference) - Removed 'from fastapi import FastAPI, Depends, HTTPException' (line 17) - Removed 'from fastapi.security.api_key import APIKeyHeader' (line 21) - Added 'from src.module_loader import _require_warmed' (cross-module via shared utility, not via ai_client) - create_api(): added lookups at top of function body - 7 _api_* helper functions (_api_get_key, _api_generate, _api_stream, _api_confirm_action, _api_get_session, _api_delete_session, _api_get_context): added 'HTTPException = _require_warmed(...).HTTPException' at top of each function body EFFECTIVENESS: - import src.app_controller no longer triggers fastapi import (saves ~470ms in main thread; only loaded when --enable-test-hooks is set) - When --enable-test-hooks is set, the AppController's warmup pre-loads fastapi on the _io_pool, so create_api()'s lookup is O(1) TESTS: - tests/test_app_controller_no_top_level_fastapi.py: 4/4 PASS (was 3 RED + 1 pass) - tests/test_ai_client_no_top_level_sdk_imports.py: 9/9 still PASS (re-export works) - tests/test_app_controller_mcp.py, test_app_controller_offloading.py: pass - tests/test_headless_service.py: 10/11 PASS (1 pre-existing failure test_generate_endpoint is a circular-import issue in google.genai, reproduces identically on stashed pre-Phase-4 state - NOT a regression from this change) - tests/test_hooks.py: pass NEXT: Phase 5 (feature-gated GUI module imports - command palette, NERV theme, markdown table), then Phase 6 (ad-hoc threads -> _io_pool).	2026-06-06 16:34:46 -04:00
ed	51c054ece8	refactor(ai_client): remove top-level SDK imports; use _require_warmed Phase 3 T3.2 + T3.3 of startup_speedup_20260606 track. The 5 heavy SDKs (anthropic, google.genai, openai, google.genai.types, requests) are no longer imported at module level. Each function that needs them now calls _require_warmed(name) to get the module from sys.modules (populated by AppController's warmup on _io_pool). This is the load-bearing wall of the Main Thread Purity Invariant: heavy modules are never in the main thread's import chain. run_discussion_compression now uses _require_warmed for both google.genai.types (gemini branch) and requests (deepseek branch). Tests/test_tier4_patch_generation.py adapted: the 2 tests that mocked 'src.ai_client.types' (no longer a module-level attr) now mock 'src.ai_client._require_warmed' (the new public mechanism). T3.1 tests now pass (9/9). T3.3 breakage fixed. All 25 ai_client + tier4 tests pass.	2026-06-06 16:09:16 -04:00
ed	922c5ad9ab	feat(app_controller): wire _io_pool + warmup + 5 public delegation methods Phase 2 Task T2.5 of the startup_speedup_20260606 track. In AppController.__init__, right after the lock init (and before the heavy subsystem construction that follows), create the shared _io_pool and WarmupManager, then submit the warmup list. The warmup runs concurrently with the rest of __init__, so by the time __init__ returns, the heavy modules are loaded (or in flight). Changes: - Add imports: from src.io_pool import make_io_pool, from src.warmup import WarmupManager - In __init__, after the locks block, add: self._io_pool = make_io_pool() self._warmup = WarmupManager(self._io_pool) self._warmup.submit(self._compute_warmup_list()) - Add _compute_warmup_list() method: returns ['google.genai', 'anthropic', 'openai', 'requests', 'src.command_palette', 'src.theme_nerv', 'src.theme_nerv_fx', 'src.markdown_table', 'numpy'] always, plus ['fastapi', 'fastapi.security.api_key'] if self.test_hooks_enabled - Add public delegation methods: warmup_status(), is_warmup_done(), wait_for_warmup(timeout), on_warmup(callback) - In shutdown(), add self._io_pool.shutdown(wait=False) The warmup currently is a no-op for the heavy modules already imported at the top of app_controller.py (fastapi, requests, etc. are already in sys.modules). The infrastructure is in place; Phase 3 will remove the top-level imports so the warmup actually does work. Verified: all 18 tests pass (test_io_pool + test_warmup + existing test_app_controller_mcp + test_app_controller_offloading).	2026-06-06 14:48:51 -04:00
ed	1354679e33	feat(io_pool, warmup): add shared 4-thread pool + WarmupManager Phase 2 Tasks T2.1-T2.4 of the startup_speedup_20260606 track. NEW: src/io_pool.py make_io_pool() factory: 4-worker ThreadPoolExecutor with thread_name_prefix='controller-io'. The sanctioned way for any background work. Replaces ad-hoc threading.Thread() calls per the 'no new threads' rule. NEW: src/warmup.py WarmupManager: manages a list of modules to import on the shared pool. Public API: .submit(modules) - start warmup (call once) .status() - {pending, completed, failed} .is_done() - bool .wait(timeout) - block until done .on_complete(callback) - register completion callback .reset() - clear state Thread-safe (lock-guarded). 10 tests cover all paths. NEW: tests/test_io_pool.py (4 tests): - ThreadPoolExecutor returned - 4 workers - Threads named 'controller-io-*' - Jobs run in parallel (barrier test) NEW: tests/test_warmup.py (10 tests): - One job per module submitted - Initial pending list correct - Failed imports tracked - Done event set after all complete - wait() blocks until done - on_complete callback fires (and immediately if already done) - Modules actually end up in sys.modules - reset() clears state - Jobs run concurrently (not serially) All 14 tests pass. AppController integration is the next commit.	2026-06-06 14:47:02 -04:00
ed	5a85653654	feat(startup_profiler): add StartupProfiler for per-phase init timing Lightweight, in-memory profiler for AppController init phases. Used by the startup_speedup_20260606 track to measure where the time goes during boot (config hydration, hook server start, subsystem init, etc.). The profiler is exposed via /api/startup_profile (Phase 8 work) and the Diagnostics panel so the user can see the exact per-phase cost. Public API: StartupProfiler() - create .phase(name) - context manager .snapshot() - {phases: {name: {start_ts, duration_ms}}, total_ms, count} .reset() - clear recorded phases .enable() / .disable() - toggle recording Implementation: - dataclass with list of _Phase(name, start_ts, end_ts) - @contextmanager records wall-clock via time.perf_counter - records duration even if the body raises (try/finally) - snapshot is a copy, so consumers can't mutate the live state TDD: 5 tests in tests/test_startup_profiler.py cover: basic recording, total math, snapshot isolation, exception safety, empty state.	2026-06-06 13:57:26 -04:00
ed	ca254bac41	fix(imports): break models<->dag_engine circular dependency Track.get_executable_tickets (in models.py) called TrackDAG at runtime, forcing a top-level import of src.dag_engine into models.py and creating a 2-cycle that broke whichever module loaded second (Ticket was not yet defined when models.py loaded first; TrackDAG was not yet defined when dag_engine.py loaded first). Fix: hoist the method out of the Track dataclass and into a free function get_executable_tickets(track) in dag_engine.py. models.py no longer needs TrackDAG at all, so the cycle is one-directional (models -> dag_engine) and resolves cleanly in any import order. Tests updated: - tests/test_mma_models.py: import get_executable_tickets and call it instead of track.get_executable_tickets() (4 call sites) - tests/test_conductor_engine_v2.py: comment update Verified both import orders resolve cleanly: forward: import src.models; import src.dag_engine -> OK reverse: import src.dag_engine; import src.models -> OK 34 tests pass (test_mma_models, test_dag_engine, test_execution_engine, test_arch_boundary_phase3, test_track_state_schema).	2026-06-06 13:30:18 -04:00
r00tz	9e4fac496d	made local rag needs optional (prevents having to have torch / sentence-transformers if you never use local embedding)	2026-06-06 13:21:43 -04:00
ed	311fde9a8b	fixes	2026-06-06 12:44:07 -04:00
ed	9ccaf0594c	some org on ai_client	2026-06-06 11:35:20 -04:00
ed	16412ad5f9	fix(rag): detect ChromaDB dim mismatch and recreate collection on provider switch	2026-06-06 11:26:47 -04:00
ed	339b062913	more organization	2026-06-06 11:08:07 -04:00
ed	7d555361f9	more organization	2026-06-06 10:24:22 -04:00
ed	c4691a54b0	fking python	2026-06-06 01:05:00 -04:00

1 2 3 4 5 ...

717 Commits