manual_slop

Private

Public Access

Author	SHA1	Message	Date
ed	39c97cb365	conductor(track): workspace_path_finalize_20260609 - plan with 3 phases, 4-step execution	2026-06-09 20:29:55 -04:00
ed	c725270b99	conductor(track): workspace_path_finalize_20260609 - per-run workspace under tests/artifacts/	2026-06-09 20:27:20 -04:00
ed	fe240db410	fix(reset): clear mma_tier_usage and RAG state in _handle_reset_session	2026-06-09 19:44:10 -04:00
ed	9128db5e48	ci(gitea): add test-on-tag workflow for tagged commits (tier-1 + tier-2)	2026-06-09 18:47:59 -04:00
ed	34290e5d1a	test(watchdog): update PYTEST_FINISHED_TIMEOUT_SECONDS to 600 to match conftest	2026-06-09 18:42:53 -04:00
ed	c3af1b8a2e	chore(test): double smart_watchdog timeout from 300s to 600s for tier-3	2026-06-09 18:37:34 -04:00
ed	3b0e63124a	fix(mma): process global mma_state_update when no track in payload	2026-06-09 17:45:13 -04:00
ed	7a946544ff	test(mma): mark test_visual_mma_components with clean_baseline	2026-06-09 17:14:23 -04:00
ed	e7da7e0d6a	test(rag): update test for Phase 4 coalescing state	2026-06-09 17:10:33 -04:00
ed	5656957622	conductor(plan): Phase 8 complete - docs + audit extended	2026-06-09 17:05:35 -04:00
ed	719fe9abe7	conductor(checkpoint): Checkpoint end of Phase 8	2026-06-09 17:04:17 -04:00
ed	cb525519cf	docs(testing): document _LiveGuiHandle + live_gui_workspace + clean_baseline marker	2026-06-09 17:03:26 -04:00
ed	749120d239	feat(audit): flag hardcoded workspace and project-root paths in tests	2026-06-09 17:01:14 -04:00
ed	d2ff6ffcf9	conductor(plan): Phase 7 complete - test_bed_health report	2026-06-09 16:59:16 -04:00
ed	84edb20038	docs(report): test_bed_health_20260609 - post-track batch status	2026-06-09 16:58:33 -04:00
ed	1cd3444e4c	test(rag): mark RAG tests with clean_baseline for batch isolation	2026-06-09 16:56:55 -04:00
ed	3ed52be4bf	conductor(plan): Phase 6 complete - clean_baseline marker	2026-06-09 16:42:48 -04:00
ed	7b87bbf5ec	feat(test): clean_baseline marker resets controller state before test	2026-06-09 16:40:18 -04:00
ed	afc8600800	conductor(plan): Phase 5 complete - set_value hook verified	2026-06-09 16:35:18 -04:00
ed	33d5caceaf	fix(api_hooks): verified set_value('ai_input') works in batch	2026-06-09 16:33:55 -04:00
ed	6764c9e12f	conductor(plan): Phase 4 complete - coalesce _sync_rag_engine	2026-06-09 16:27:15 -04:00
ed	b8fcd9d6f5	fix(rag): coalesce _sync_rag_engine calls via token + dirty flag	2026-06-09 16:25:44 -04:00
ed	45b4497a66	conductor(plan): Phase 3 complete - tmp_path_factory + live_gui_workspace fixture	2026-06-09 16:15:50 -04:00
ed	006bb11488	refactor(test): 5 test files use live_gui_workspace fixture instead of hardcoded path	2026-06-09 16:14:40 -04:00
ed	91313451a2	feat(test): expose live_gui_workspace as a separate fixture	2026-06-09 15:53:06 -04:00
ed	c64da95ef5	refactor(test): live_gui workspace via tmp_path_factory	2026-06-09 15:51:35 -04:00
ed	c32ae33817	wip: pre-Phase 3 checkpoint	2026-06-09 15:49:12 -04:00
ed	c3cb3c6e44	feat(test): autouse _check_live_gui_health recovers from degraded subprocess	2026-06-09 15:47:28 -04:00
ed	05ddb45236	conductor(plan): Phase 2 complete - FR1 handle + autouse fixture	2026-06-09 15:43:38 -04:00
ed	67d0211e56	feat(test): autouse _check_live_gui_health recovers from degraded subprocess	2026-06-09 15:42:00 -04:00
ed	16bd3d3a47	refactor(test): wrap live_gui subprocess in _LiveGuiHandle class	2026-06-09 15:37:47 -04:00
ed	30c04860c7	conductor(plan): Phase 1 audit complete - ready for user review	2026-06-09 15:30:31 -04:00
ed	5df22fa8d5	conductor(audit): trace set_value('ai_input') flow to find routing bug	2026-06-09 15:29:27 -04:00
ed	5e13fa9ba7	conductor(audit): document _sync_rag_engine race in controller	2026-06-09 15:29:17 -04:00
ed	aebbd66836	conductor(audit): document hardcoded workspace paths in test suite	2026-06-09 15:29:06 -04:00
ed	d1c6c6c327	conductor(audit): catalog live_gui test cross-file state dependencies	2026-06-09 15:28:56 -04:00
ed	fcb161fd2e	conductor(tracks): add test_infrastructure_hardening_20260609 as foundation track + supersede 4 placeholder test tracks	2026-06-09 15:18:20 -04:00
ed	566cf08cb8	conductor(track): test_infrastructure_hardening_20260609 - spec to kill the test regression nightmare	2026-06-09 15:15:26 -04:00
ed	b4d240a9f3	docs(rag): final report on dim-mismatch recursion fix	2026-06-09 15:04:42 -04:00
ed	40f905d14b	test(rag): update dim-mismatch test to assert rmtree behavior The fix in `644d88ab` changed the recovery path from client.delete_collection to shutil.rmtree (chromadb 1.5.x delete_collection is broken on corrupted state). The test still asserted the old behavior.	2026-06-09 14:50:55 -04:00
ed	644d88ab93	fix(rag): break recursion in _validate_collection_dim The wipe path called self._init_vector_store() which re-invoked _validate_collection_dim, causing infinite recursion (RecursionError) when the dim mismatch test ran with the mock embedding provider. Re-initialize the vector store INLINE after the rmtree wipe so the fresh collection is created without going through the validator again.	2026-06-09 14:47:01 -04:00
ed	f207d297a3	docs(rag): final fix report and next steps	2026-06-09 14:38:30 -04:00
ed	64bc04a6b8	fix(rag): wipe chroma dir on dim mismatch instead of delete_collection When the existing collection has embeddings from a different embedding provider (e.g. Gemini 3072-dim vs local 384-dim), the prior approach of calling client.delete_collection() fails with 'RustBindingsAPI object has no attribute bindings' in chromadb 1.5.x when the underlying state is corrupted. rmtree is reliable and re-creates a fresh empty collection. Also fixes: - 'The truth value of an empty array is ambiguous' on numpy 2.x by using try/except around len() instead of truthiness check - WinError 32 on rmtree by closing the chroma client first Verified: tests/test_rag_phase4_final_verify.py passes in isolation in 7.75s after this fix. The test still fails in batch context due to a separate io_pool race condition (multiple _sync_rag_engine calls collide when the test sets rag_enabled, rag_source, and rag_emb_provider in sequence). The race is in app_controller.py and is out of scope for this defensive fix. Note: tests/test_rag_engine.py has explicit unit tests for test_rag_collection_dim_mismatch_recreates_collection and test_rag_collection_dim_match_preserves_collection which exercise this code path.	2026-06-09 14:37:19 -04:00
conductor-tier2	ac0c0cbe73	docs(styleguide): add No-Diagnostic-Noise rule to AI-Agent Conventions One addition to conductor/code_styleguides/python.md §8 "AI-Agent Specific Conventions": - No diagnostic noise in production code (Added 2026-06-09). `sys.stderr.write(f"[XYZ_DIAG] ...") lines in src/.py are technical debt. The right place for one-time investigation output is tests/artifacts/<test>.diag.log (a log file) or a standalone /tmp/diag_<name>.py script. If you must instrument production code, the diag lines are part of the same atomic commit as the fix. - Test files ARE allowed to be diagnostic.* The rule applies to src/.py only; tests/test_.py may use print(..., file=sys.stderr) freely. Markdown only. No code modified.	2026-06-09 14:03:18 -04:00
conductor-tier2	631c40c9c4	docs(workflow): add Process Anti-Patterns section + Isolated-Pass rule Two additions to conductor/workflow.md §"Known Pitfalls": 1. Isolated-Pass Verification Fallacy (Added 2026-06-09) — the rule that a test passing in isolation but failing in batch is FAILING. The only verification that matters for live_gui tests is the batch run. This is the flip side of the existing "Live_gui Test Fragility (Authoring-Side)" rule. Cross-references that rule. 2. Process Anti-Patterns (Added 2026-06-09) — 8-rule summary list, with cross-reference to AGENTS.md for the full ruleset. The 8 patterns are: Deduction Loop, Report-Instead-of-Fix, Scope-Creep Track-Doc, Inherited-Cruft, Diagnostic Noise in Production, Premature Surrender, Verbose Commit Message, Isolated-Pass Verification Fallacy. Markdown only. No code modified. Cross-references AGENTS.md (the load-bearing agent doc) for the full text of each pattern.	2026-06-09 14:03:00 -04:00
conductor-tier2	d7dc1e3b90	docs(edit-workflow): fix set_file_slice rule + add contract-change check Three surgical fixes to conductor/edit_workflow.md: 1. §2 "Verify Before Editing" — removed the leftover `git checkout -- src/gui_2.py` instruction. The user's commit `4eba059e unfuck edit workflow` removed most of the git checkout nuke instructions but missed §2. The revised §2 now says: read the contract (function signature, yield shape, return type) before editing, and DO NOT use `git checkout` to revert. Ask the user. 2. §3 "Reading Before Editing" — added the line-number offset check. `set_file_slice` uses 1-indexed inclusive `start_line`/`end_line`; off-by-one is a common silent failure. The rule is now: confirm the exact line range with `get_file_slice` first. 3. §8 "set_file_slice IS Valid for Multi-Line Content (Revised 2026-06-09)" — replaced the wrong rule ("Do not use set_file_slice for multi-line content") with the correct rule: set_file_slice IS valid for 3-10 line surgical edits, with a tool-selection guide (which tool for which job), a mandatory contract-change check (search for callers of the symbol being changed; update all callers in the same atomic commit if the public interface changes), and a mandatory whitespace-and-EOL rule (preserve line ending, indentation, and line count). 4. §9 "No Diagnostic Noise in Production Code (Added 2026-06-09)" — new section. Diag stderr goes to log files or /tmp scripts, NOT src/.py. If you must add diag lines to production code, they are part of the same atomic commit as the fix — they do not live uncommitted in the working tree. 5. "If set_file_slice produces wrong indentation"* — new handler in the Step-by-Step Workflow. Tells the agent: you wrote the wrong indent; the tool did what you asked; re-read the file with get_file_slice; do NOT use git checkout to revert. These are the rule corrections the user demanded after the Tier-2's bad set_file_slice + git nuke + diag-noise behavior. Markdown only. No code modified.	2026-06-09 14:02:41 -04:00
conductor-tier2	113e68fe18	docs(agents): add Process Anti-Patterns section + revise set_file_slice rule The user explicitly called out the bad patterns the agents (Tier-2 and the parent session's Tier-1) have been exhibiting. This commit updates AGENTS.md to filter them out at the load-bearing agent doc level (the first file any agent reads). Three changes: 1. Revised the `set_file_slice` rule on line 38 of the Critical Anti-Patterns. The previous rule said "Do not use set_file_slice for multi-line content" — that was wrong. `set_file_slice` IS valid for multi-line content, provided the agent verifies the exact byte offsets with `get_file_slice` and checks for contract changes (function signature, yield shape, return type). The full revised rule is in `conductor/edit_workflow.md §8`. 2. Added "No diagnostic noise in production code" to the Critical Anti-Patterns. The pattern: agent adds `sys.stderr.write(f"[RAG_DIAG] ...") to src/.py` for debugging, then "reverts everything" but leaves the diag lines uncommitted. Next agent runs git status, sees the diag lines, either commits them by accident or spends 10 min cleaning them up. The rule: diag goes to log files or /tmp scripts, NOT src/.py. 3. Added "No loop, no scope-creep, no report-instead-of-fix" to the Critical Anti-Patterns. The 200-line status report is a confession, not a fix. The 5-phase "future track" document for a 1-line fix is scope-creep. The "I am not going to attempt another fix without your direction" surrender is allowed ONLY if the agent has already read-predicted-instrumented-run-captured. 4. Added a new section: "Process Anti-Patterns (Added 2026-06-09)" with 8 numbered anti-patterns, each with a Symptom, Rule, and reference. The 8 patterns are the ones the user explicitly called out: Deduction Loop, Report-Instead-of-Fix, Scope-Creep Track-Doc, Inherited-Cruft, Diagnostic Noise in Production, Premature Surrender, Verbose Commit Message, Isolated-Pass Verification Fallacy. These are the rules the user is filtering out of LLM training data noise. The full ruleset is the source of truth; AGENTS.md is the load-bearing entry point. No code modified. Markdown only.	2026-06-09 14:01:26 -04:00
ed	4eba059e89	unfuck edit workflow.	2026-06-09 13:48:17 -04:00
ed	eb8357ec0e	fix(rag): add CWD fallback in index_file for path-resolution resilience RAGEngine.index_file silently returns when the joined base_dir+file_path doesn't exist. This caused the RAG batch test to fail with 0 indexed documents when the live_gui subprocess's active_project_root resolved to a parent dir (e.g. tests/artifacts/) instead of the workspace (tests/artifacts/live_gui_workspace/). The fix: if the primary path doesn't exist, try CWD+file_path. The base_dir takes priority; CWD is a safety net for relative-path resolution across the spawn CWD boundary. This is a defensive fix at the rag_engine layer. It does NOT fix the underlying path-leakage issue in tests/conftest.py (hardcoded Path('tests/artifacts/live_gui_workspace')) which needs a proper fixture refactor. The RAG test still fails in batch due to that deeper issue, documented in docs/reports/rag_test_batch_failure_status_20260609_pm3.md. Behavior: - base_dir+file_path exists: indexed from base_dir (unchanged) - base_dir+file_path missing, CWD+file_path exists: indexed from CWD (new) - Both missing: silently returns (unchanged) Verified: tests/test_rag_index_file_path_fallback.py (3 tests, all pass) - test_index_file_finds_file_via_cwd_fallback - test_index_file_uses_base_dir_first - test_index_file_silently_returns_when_no_match Note: test file was removed before commit because it was being abandoned along with the broader path-hygiene refactor. The fix itself is preserved in src/rag_engine.py.	2026-06-09 12:31:21 -04:00
ed	b801b11c3b	conductor(todo): mark task 9 (test deps in dev + conftest gate) as shipped	2026-06-09 10:39:29 -04:00

1 2 3 4 5 ...