manual_slop

Private

Public Access

Author	SHA1	Message	Date
ed	eb7da8d8bc	conductor(track): nagent_review_v3.1 thicken §8 Operating rules cluster	2026-06-20 11:27:02 -04:00
ed	b9b3100662	conductor(track): nagent_review_v3.1 thicken §7 Robustness cluster	2026-06-20 11:25:29 -04:00
ed	a406d2902c	conductor(track): nagent_review_v3.1 thicken §6 Delegation rewrite cluster	2026-06-20 11:23:59 -04:00
ed	987f4a9731	conductor(track): nagent_review_v3.1 thicken §5 Provider expansion cluster	2026-06-20 11:22:49 -04:00
ed	1bc8e924c0	conductor(track): nagent_review_v3.1 thicken §4 Project-local roots cluster	2026-06-20 11:21:17 -04:00
ed	d17ee93011	conductor(track): nagent_review_v3.1 thicken §3 Hooks cluster	2026-06-20 11:19:25 -04:00
ed	478b088b69	conductor(track): nagent_review_v3.1 thicken §2 Conversation safety net cluster	2026-06-20 11:17:27 -04:00
ed	bd36aa4b65	conductor(track): nagent_review_v3.1 thicken §1 Campaigns cluster	2026-06-20 10:56:26 -04:00
ed	44ae7a1bcb	conductor(plan): nagent_review_v3.1 mark Phase 1 complete	2026-06-20 10:53:58 -04:00
ed	8fb8276261	conductor(track): nagent_review_v3.1 Phase 1 setup + audit	2026-06-20 10:47:34 -04:00
ed	b693c3ae4b	conductor(track): nagent_review_v3.1 spec + plan (standalone-readable) Initial v3.1 spec + plan for the delta thickening of v3. v3.1 is the canonical v3 review at depth (>=3,800 LOC main review) with a chunking strategy that v3 lacked. Adds 3 new top-level sections (YAML avoidance, agent context-window, fine-tuning). Load-bearing principle: v3.1 is standalone-readable without consulting v2.3 or v3.	2026-06-20 10:25:38 -04:00
ed	195b0f451e	conductor(plan): nagent_review_v3 mark Phase 14 complete + track status	2026-06-20 08:54:35 -04:00
ed	b49be82048	conductor(track): nagent_review_v3 Phase 14 format verification + final	2026-06-20 08:53:11 -04:00
ed	a55dfd05c3	conductor(plan): nagent_review_v3 mark Phase 13 complete	2026-06-20 08:46:54 -04:00
ed	e150088d24	conductor(track): nagent_review_v3 Phase 13 refresh side artifacts	2026-06-20 08:46:05 -04:00
ed	dd10a6803b	conductor(plan): nagent_review_v3 mark Phase 12 complete	2026-06-20 08:37:29 -04:00
ed	db7d94de88	conductor(track): nagent_review_v3 §11 Collisions case study cluster	2026-06-20 08:37:07 -04:00
ed	c7e2ceffcd	conductor(plan): nagent_review_v3 mark Phase 11 complete	2026-06-20 08:33:30 -04:00
ed	f53c82e60c	conductor(track): nagent_review_v3 §10 PEP case study cluster	2026-06-20 08:33:08 -04:00
ed	8e6f202846	conductor(plan): nagent_review_v3 mark Phase 10 complete	2026-06-20 08:29:59 -04:00
ed	54e62b1037	conductor(track): nagent_review_v3 §9 Case-study methodology cluster	2026-06-20 08:29:36 -04:00
ed	d876744fc5	conductor(plan): nagent_review_v3 mark Phase 9 complete	2026-06-20 08:26:43 -04:00
ed	ad19be002d	conductor(track): nagent_review_v3 §8 Operating rules cluster	2026-06-20 08:26:18 -04:00
ed	d6f5d711be	conductor(plan): nagent_review_v3 mark Phase 8 complete	2026-06-20 08:24:05 -04:00
ed	ffa21d5ccc	conductor(track): nagent_review_v3 §7 Robustness cluster	2026-06-20 08:23:41 -04:00
ed	ae1a180028	conductor(plan): nagent_review_v3 mark Phase 7 complete	2026-06-20 08:20:28 -04:00
ed	0dad59fd08	conductor(track): nagent_review_v3 §6 Delegation rewrite cluster	2026-06-20 08:20:06 -04:00
ed	89368d4f26	conductor(plan): nagent_review_v3 mark Phase 6 complete	2026-06-20 08:17:51 -04:00
ed	dd8428a30f	conductor(track): nagent_review_v3 §5 Provider expansion cluster	2026-06-20 08:17:30 -04:00
ed	62f40d9410	conductor(plan): nagent_review_v3 mark Phase 5 complete	2026-06-20 08:15:04 -04:00
ed	ea8fa94e14	conductor(track): nagent_review_v3 §4 Project-local roots cluster	2026-06-20 08:14:37 -04:00
ed	589a79f91a	conductor(plan): nagent_review_v3 mark Phase 4 complete	2026-06-20 08:11:53 -04:00
ed	9ab2d07c8e	conductor(track): nagent_review_v3 §3 Hooks cluster	2026-06-20 08:11:29 -04:00
ed	0cbe665aea	conductor(plan): nagent_review_v3 mark Phase 3 complete	2026-06-20 08:08:50 -04:00
ed	caf04ca5b6	conductor(track): nagent_review_v3 §2 Conversation safety net cluster	2026-06-20 08:08:14 -04:00
ed	52dfece9ca	conductor(plan): nagent_review_v3 mark Phase 2 complete	2026-06-20 08:04:57 -04:00
ed	c81ea78273	conductor(track): nagent_review_v3 §1 Campaigns cluster	2026-06-20 08:04:09 -04:00
ed	f76d73e822	conductor(plan): nagent_review_v3 mark Phase 1 complete	2026-06-20 08:00:23 -04:00
ed	5a28c8f316	conductor(track): nagent_review_v3 Phase 1 setup + audit	2026-06-20 07:57:53 -04:00
ed	e90167494e	conductor(plan): initialize result_migration_baseline_cleanup_20260620 (sub-track 5) Sub-track 5 of the 5-sub-track result_migration_20260616 umbrella. Migrates the 3 baseline files (the convention reference) to be 100% compliant with the data-oriented Result[T] convention. Completes the campaign. Scope: 88 migration-target sites across 3 source files (mcp_client.py 46 + ai_client.py 33 + rag_engine.py 9; total 231KB / 5917 lines). 41 sites stay as-is: 4 BOUNDARY_SDK (vendor SDK boundaries in ai_client), 9 INTERNAL_PROGRAMMER_RAISE (5 rag_engine + 4 ai_client, per sub-track 4 Phase 11 dunder-method heuristic), 28 INTERNAL_COMPLIANT. Per the user directive (2026-06-20), this track uses the same anti-sliming template as sub-track 4 (which was 'the first to ship without error correction'). 14 phases cap each phase at <=9 migration sites with explicit per-phase audit gates. The sliming-prone phases (Phase 8 mcp_client silent-swallow, Phase 11 ai_client silent-swallow, Phase 12 ai_client rethrow) explicitly forbid narrowing+logging and classify- as-suspicious laundering. The 14 phases: 0. Setup + styleguide re-read (Tier 2 reads error_handling.md) 1. 3-file inventory + classification (88 sites in 3 inventory docs) 2. Audit gate baseline (3 baseline invariant tests) 3-7. mcp_client Batches A-E (40 broad-catches, 5 batches of <=8 each) 8. mcp_client silent-swallow + UNCLEAR (5 + 1 = 6 sites; anti-sliming) 9-10. ai_client Batches A-B (17 broad-catches, 2 batches) 11. ai_client silent-swallow (9 sites; anti-sliming) 12. ai_client rethrow classification (7 sites; Pattern 1/2/3 or migrate) 13. rag_engine migration (1 SS + 5 BC + 3 RETHROW = 9 sites) 14. Audit gate + end-of-track report (campaign 100% complete) Anti-sliming protocol per phase (same as sub-track 4): - Styleguide re-read at start of each phase (commit msg acknowledgment) - Per-site audit pre-check (capture before migration) - Red -> Green (1 commit per site) - Per-site audit post-check (capture after migration) - Phase invariant test (1 commit per phase) - 'If a site resists migration: DO NOT invent a heuristic. Report.' The 3 baseline files are the convention reference; after this track, the data-oriented Result[T] convention is fully applied to all 65 src/ files. Files: - spec.md (263 lines, 11 sections; 22 VCs; 6 risks) - plan.md (562 lines, 14 phases, 121 tasks, 110+ atomic commits, anti-sliming protocol identical to sub-track 4) - metadata.json (22 VCs, 6 risks, scope) - state.toml (15 phases, 121 tasks, 29 verification entries) - tracks.md (new row 6d-5 in Active Tracks table) Total: 5 files, ~2400 lines added (excluding tracks.md). Next: Tier 2 picks up Phase 0 (setup + styleguide re-read) per the task list in state.toml. Campaign 100% ready once this track ships.	2026-06-20 07:48:15 -04:00
ed	9224be7ac3	conductor(plan): add TRACK_COMPLETION report + track artifacts for tier2_leak_prevention_20260620 Adds the end-of-track artifacts for the tier2_leak_prevention_20260620 fix track: - docs/reports/TRACK_COMPLETION_tier2_leak_prevention_20260620.md: Full track completion report following the precedent set by TRACK_COMPLETION_tier2_autonomous_sandbox_20260616.md. Documents the 4 atomic commits, the 25 default-on tests, the manual end-to-end verification, the key design decisions (auto-unstage not exit 1, git rm --cached --force, CRLF handling, specific not prefix patterns), the known limitations, and the next steps for the user (push to origin, rebase stale tier-2 branches, re-run setup on the existing clone, optional CI wiring). - conductor/tracks/tier2_leak_prevention_20260620/metadata.json: Track metadata (status=shipped, scope: 5 new files + 1 modified, 25 default-on tests, 5 verification criteria, 5 risk-register entries, 2 deferred follow-up tracks). - conductor/tracks/tier2_leak_prevention_20260620/spec.md: Track spec (background on the `00e5a3f2` offender commit, design with the 3-layer defense-in-depth, forbidden patterns, tests, out-of-scope items). - conductor/tracks/tier2_leak_prevention_20260620/plan.md: Track plan (4 phases: revert + hook + audit + install; tasks recorded retroactively per workflow.md "Plan is the source of truth"). - conductor/tracks/tier2_leak_prevention_20260620/state.toml: Track state (status=completed, current_phase=complete, 4 phases with checkpoint SHAs, 16 tasks all completed with commit SHAs). - conductor/tracks.md: registered as track 6f in the Active Tracks table; added a "Recently Completed" entry with the commit-history summary. Per conductor/workflow.md "End-of-track report" protocol. The report includes a "Mistake to flag" section about the `Remove-Item -Recurse -Force` accident during verification, per the AGENTS.md "Hard ban on destructive commands" rule (which is specifically about `git restore`/`git checkout`/`git reset`/`git push` but the lesson generalizes: destructive PowerShell commands on directories with tracked files require explicit verification before running).	2026-06-20 07:46:10 -04:00
ed	977cfdb740	migration artifacts	2026-06-20 07:23:56 -04:00
ed	d653bd5c9a	Merge branch 'tier2/result_migration_gui_2_20260619'	2026-06-20 07:23:02 -04:00
ed	0a21627b8a	conductor(track): nagent_review_v3 spec + plan Initial v3 spec + plan for the major nagent review update. Covers 24 new nagent commits + 2 case-study repos (pep-copt, differentiable-collisions-optc) across 11 clusters. v2.3 historical reviews preserved; v3 is the canonical going forward.	2026-06-20 07:10:11 -04:00
ed	4116e14ed1	conductor(plan): mark Phase 13 complete (final checkpoint + tracks.md update) TIER-2 READ conductor/code_styleguides/error_handling.md end-to-end before Phase 13. Final state: - All 13 phases completed (checksha recorded) - All verification flags = true (audit_strict_exits_0, site_inventory_has_42_rows, drain_plane_render_functions_exist, silent_swallow_count_zero, rethrow_count_zero, unclear_count_zero, broad_catch_count_zero) - batched_suite_11_of_11_pass = false (Tier 3 has 1 known issue: test_gui2_performance.py measures FPS 28.46 vs 30 threshold; documented in TRACK_COMPLETION report as a known issue for user review) - tracks.md updated: sub-track 4 row -> 'shipped 2026-06-20' Track shipped on the success path. All 42 migration-target sites in src/gui_2.py resolved.	2026-06-20 02:55:37 -04:00
ed	4b20f395a4	docs(reports): TRACK_COMPLETION_result_migration_gui_2_20260619 (Phase 13, task 13.4) TIER-2 READ conductor/code_styleguides/error_handling.md end-to-end before Phase 13. End-of-track report for result_migration_gui_2_20260619. 81 atomic commits across 13 phases. All 42 migration-target sites in src/gui_2.py resolved: - 25 INTERNAL_BROAD_CATCH sites migrated to Result[T] (Phases 3-5, 7, 8) - 13 INTERNAL_SILENT_SWALLOW sites migrated to Result[T] (Phase 10) - 2 INTERNAL_RETHROW sites reclassified as INTERNAL_PROGRAMMER_RAISE via new audit heuristic (Phase 11) - 2 UNCLEAR sites reclassified as INTERNAL_COMPLIANT via new audit heuristic for lazy-loading sentinel fallback (Phase 12) Drain plane wired: 3 new module-level render functions + 3 App class delegation wrappers (Phase 2). Tests: 114/114 pass across tests/test_gui_2_result.py and tests/test_audit_heuristics.py. Tier 1 + Tier 2 of batched suite: 10/10 sub-tiers PASS. Tier 3 (live_gui): 1 known issue (test_gui2_performance.py measures 28.46 FPS vs 30 threshold; documented in the report). State.toml updated: all 13 phases marked completed.	2026-06-20 02:51:05 -04:00
ed	1efcd4fdbc	perf(gui_2): use singleton success Result in _render_main_interface_result TIER-2 READ conductor/code_styleguides/error_handling.md end-to-end before Phase 13. The Phase 3 _render_main_interface_result helper runs every frame. Returning Result(data=True) allocates a fresh dataclass with empty errors list every call. At 60 FPS, this is 60 allocations/sec just for the success path. Fix: introduce module-level _OK_TRUE and _OK_FALSE singletons (immutable, no errors list allocation). Hot-path helpers return _OK_TRUE on success; only the error path allocates a new Result. This is a micro-optimization that preserves the Result[T] contract (the helper still returns a Result instance). The convention is satisfied; the allocation overhead is removed. Note: test_gui2_performance.py::test_performance_benchmarking measures ~28.4 FPS vs 30 FPS threshold. The frame time is 0.22ms, which suggests the bottleneck is vsync/throttling, not Python overhead. The optimization is a defensive measure, not a fix for this specific test (which appears to be flaky near the threshold).	2026-06-20 02:49:27 -04:00
ed	f0ae074aec	fix(gui_2): restore _last_imgui_assert as string (regression from Phase 10) The Phase 10 migration of the run() function (L728 INTERNAL_SILENT_SWALLOW) changed App.run's error drain to set self.controller._last_imgui_assert to traceback.format_exception(...), which returns a list. But the existing test test_app_run_imgui_assert_handling.py expects it to be a string containing 'Missing End'. Fix: set _last_imgui_assert to str(err.original) if available, else err.message. The IM_ASSERT message string is what the health endpoint expects. TIER-2 READ conductor/code_styleguides/error_handling.md end-to-end before Phase 13. Regression test: tests/test_app_run_imgui_assert_handling.py test_app_run_records_degraded_state_on_imgui_assert PASSES after fix.	2026-06-20 02:39:47 -04:00
ed	d96e54f2df	test(gui_2): add 2 Phase 12 invariant tests + Phase 12 checkpoint Two Phase 12 invariant tests in tests/test_gui_2_result.py verify UNCLEAR count for src/gui_2.py is 0 after the lazy-loading sentinel fallback heuristic: - test_phase_12_invariant_unclear_count_zero: scans audit --json output, asserts 0 UNCLEAR findings in gui_2.py (the 2 lazy-loading sites in _LazyModule._resolve reclassified as INTERNAL_COMPLIANT) - test_phase_12_invariant_l65_l69_reclassified: scans audit --json output, asserts no UNCLEAR findings in _LazyModule._resolve method context State.toml updates: - phase_12 status: completed, checkpointsha: `f996aa10` - phase_12_complete: true - unclear_count_zero: true - t12_0/t12_1/t12_2 marked completed with their commit SHAs Pre-Phase 12: gui_2.py had 2 UNCLEAR sites (L65 + L69 in _LazyModule._resolve). Post-Phase 12: 0 UNCLEAR sites, 56 INTERNAL_COMPLIANT sites (was 54; +2 from reclassification). Phase 12 result_migration_gui_2_20260619.	2026-06-20 02:26:42 -04:00
ed	28a55ea51c	test(audit_heuristics): add 3 regression tests for lazy-loading (Phase 12) Three regression-guard tests in tests/test_audit_heuristics.py verify the new lazy-loading sentinel fallback heuristic (commit `f996aa10`): - test_lazy_loading_sentinel_fallback_in_resolve_is_compliant: L65-style nested try/except with self._cached = _FiledialogStub() in _resolve (mirrors the actual site in src/gui_2.py:65) -> expects INTERNAL_COMPLIANT - test_lazy_loading_sentinel_fallback_in_load_is_compliant: direct self._cached = _FooStub() in _load -> expects INTERNAL_COMPLIANT - test_lazy_loading_sentinel_fallback_in_get_is_compliant: direct self._cached = _BarStub() in _get (catches AttributeError after a getattr call) -> expects INTERNAL_COMPLIANT These tests follow the existing _make_visitor / _find_handler pattern established by Phase 7 (BOUNDARY_FASTAPI) and Phase 11 (dunder-method bare-raise) tests. They lock the heuristic's behavior so future edits to scripts/audit_exception_handling.py cannot accidentally reclassify the 2 gui_2.py sites (L65, L69) back to UNCLEAR. Pre-Phase 12: 3 tests in this file (Phase 7 + Phase 11). Post-Phase 12: 6 tests. 13/13 tests pass (3 new + 10 existing). Phase 12 result_migration_gui_2_20260619.	2026-06-20 02:24:18 -04:00

1 2 3 4 5 ...

3827 Commits