Commit Graph

1429 Commits

Author SHA1 Message Date
ed 891c008f0c conductor(plan): mark t3.1-t3.2 + t3.8-t3.13 complete; advance to t3.3+t3.14 (Green) 2026-06-11 01:42:13 -04:00
ed 4204116c66 conductor(plan): mark t2.11 completed (Phase 2 checkpoint) 2026-06-11 01:36:44 -04:00
ed 4d70dcc7ce conductor(plan): mark t2.11 + phase_2 complete; advance to phase 3 2026-06-11 01:35:22 -04:00
ed 45d316a0bd conductor(plan): mark t2.6-t2.10 complete (t2.7 cancelled: no template); advance to t2.11 2026-06-11 01:34:25 -04:00
ed 3940eb36ac conductor(plan): mark t2.1-t2.5 complete; advance to t2.6 (Green) 2026-06-11 00:53:58 -04:00
ed d5373e8f94 conductor(plan): mark t1.12 + phase_1 complete; advance to phase 2 2026-06-11 00:48:14 -04:00
ed 67782198b6 conductor(plan): mark t1.11 (dashscope dep) complete; advance to t1.12 2026-06-11 00:46:18 -04:00
ed f07e616c38 conductor(plan): mark t1.5-t1.10 complete; advance to t1.11 2026-06-11 00:41:11 -04:00
ed 6f11e7da14 conductor(plan): mark t1.1-t1.4 complete; advance to phase 1 in_progress 2026-06-11 00:31:57 -04:00
ed 7d6dbbd371 docs(conductor/index): fix guide count (23->27), update last-refresh date and add docs_sync_test_era_20260610 reference 2026-06-10 23:58:20 -04:00
ed 49ac008a87 docs: replace 2 'fictional' usages with neutral phrasing (predates the refactor / was stale) 2026-06-10 23:34:33 -04:00
ed e1287a4cf4 conductor(plan): prior_session_sepia_20260610 spec + design + metadata
New track for prior-session sepia tint:
- 3 new theme slots (prior_session_bg, prior_session_tint, prior_session_amount)
- per-palette state dict mirroring _brightness/_contrast/_gamma
- apply_prior_tint helper (float-only math per user requirement)
- 6 prior-session render sites wrapped (2 bubble_vendor swaps + 4 tint wraps)
- Theme Settings panel slider with persistence

Code-block tonemap fix is OUT OF SCOPE (upstream imgui_bundle 1.92.5
API only exposes 4-value PaletteId enum, no per-instance struct).
See spec §1.1.1 and design doc 'Honest constraint' section.
2026-06-10 23:00:29 -04:00
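
The float-only tint helper named in commit e1287a4cf4 might look roughly like the sketch below. The spec itself is not quoted in this log, so the signature, the RGBA tuple layout, and the linear-blend formula are all assumptions made for illustration; only the name `apply_prior_tint` and the float-only constraint come from the commit message.

```python
from typing import Tuple

# Assumed layout: RGBA components as floats in [0.0, 1.0].
Color = Tuple[float, float, float, float]


def apply_prior_tint(color: Color, tint: Color, amount: float) -> Color:
    """Blend `color` toward `tint` by `amount`, using float math only.

    Hypothetical sketch: the RGB channels are linearly interpolated and the
    original alpha is kept unchanged.
    """
    t = min(1.0, max(0.0, float(amount)))  # clamp the blend amount to [0, 1]
    r, g, b, a = color
    tr, tg, tb, _ = tint
    return (
        r + (tr - r) * t,
        g + (tg - g) * t,
        b + (tb - b) * t,
        a,
    )
```

With `amount` at 0.0 the color passes through untouched, and at 1.0 the RGB channels equal the tint, which is the behavior a slider bound to `prior_session_amount` would plausibly need.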
ed 994ded3598 conductor(tracks): consolidate Phase 6+ chronology (3 recently completed + 4 in plan)
The Phase 6+ section had two duplicate '### Active' headers, which
made the chronology confusing. The user (paraphrased): preserve the
chronology of project progress, don't need full detail, follow the
previous restructure's lightweight pattern.

Changes:
- Add '### Recently Completed (2026-06-06 to 2026-06-10)' subsection
  containing the 3 closed tracks (startup_speedup, test_batching_refactor,
  test_infrastructure_hardening) with lightweight entries: per-phase
  commit SHAs only, 1-line summary, link to spec/plan/state folder.
  Trimmed the verbose per-sub-track commentary that was in the old
  startup_speedup entry (the per-sub-track bullets for warmup, status
  indicator, audit violations, post-shipping fixes are in the
  archive's spec/plan, not the tracks.md).
- Remove the duplicate '### Active' header.
- Update section intro to reflect '3 recently completed, 4 in plan'
  (was '2 already completed, 3 in plan').
- test_infrastructure_hardening entry now has phase commit SHAs
  (5df22fa8, 67d0211e, 006bb114, b8fcd9d6, 33d5cac, 7b87bbf5,
  84edb200, 719fe9a) instead of just the closing-report link.

Chronology is now visible at a glance; per-track full detail is
in the linked archive/ folder.
2026-06-10 20:42:00 -04:00
ed 2b0e17ef0c conductor(track): add docs_sync_test_era_20260610 plan.md and spec.md
These were authored at track start but missed by the final-state
commit. They are the brief 1-2 page design intent and executable
plan for the docs sync track. The closing report at
docs/reports/docs_sync_test_era_20260610.md summarizes the actual
17-commit execution.
2026-06-10 20:25:32 -04:00
ed da240577f9 conductor(track): close docs_sync_test_era_20260610
- state.toml: status active->completed, all 25 tasks marked complete
  with commit SHAs, all 4 phases checkpointed
- metadata.json: status active->shipped, 17-commit list, all 9
  verification criteria flipped to DONE
2026-06-10 20:24:31 -04:00
ed 72b237457e docs(guidelines): add Testing Requirements section with 4 standards
- Structural Testing Contract (mirrors workflow.md)
- Isolated-Pass Verification Fallacy (Lesson 1, with link to the
  test_infrastructure_hardening_batch_green_20260610 incident report
  that motivated the rule)
- Audit Scripts as CI Gates (4 scripts: check_test_toml_paths,
  audit_main_thread_imports, audit_weak_types, audit_no_models_config_io)
- Skip Markers Are Documentation, Not Avoidance (workflow.md policy)
2026-06-10 20:20:58 -04:00
ed 965e015709 docs(workflow): add 3 test-hell lessons to Known Pitfalls + Live_gui Test Fragility
Known Pitfalls (new subsection):
- HARD BAN: git checkout -- <file>, git restore, git reset
  (per AGENTS.md Critical Anti-Patterns; destroyed user in-progress
  edits twice on 2026-06-07; concrete 2026-06-10 incident:
  mma_tier_usage_reset_fix regression)

Live_gui Test Fragility (2 new subsections):
- Anti-pattern: push_event + time.sleep(N) + assert is a race.
  Fix: poll-until-state-visible with bounded retries. 5+ tests
  affected in 2026-06-10 batch-green wave.
- Async setters need poll-for-state. mma_state_update and rag_*
  setters dispatch to _pending_gui_tasks queue; the setter returns
  before the GUI render loop processes the task. Assert immediately
  = race. Fix: poll via get_value with bounded retry.
2026-06-10 20:19:54 -04:00
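
The poll-until-state-visible fix described in commit 965e015709 can be sketched as follows. This is a minimal illustration, not the project's helper: `poll_until`, its parameters, and the `state` dict standing in for the GUI are hypothetical, and the `threading.Timer` only simulates a setter whose effect lands after the call returns.

```python
import threading
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def poll_until(
    read: Callable[[], T],
    expected: T,
    *,
    retries: int = 50,
    interval: float = 0.1,
) -> bool:
    """Return True as soon as read() == expected; False after `retries` attempts."""
    for _ in range(retries):
        if read() == expected:
            return True
        time.sleep(interval)
    return False


# Stand-in for an async setter: the value changes shortly after the call returns.
state = {"mma_status": "idle"}
threading.Timer(0.05, lambda: state.update(mma_status="running")).start()

# Asserting immediately would race; polling with a bounded budget does not.
assert poll_until(lambda: state["mma_status"], "running", retries=40, interval=0.025)
```

The retry count bounds the wait, so a test that never reaches the expected state still fails within a known time rather than hanging.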
ed 01ea22fc4a docs(styleguide): add chroma_cache.md — chroma DB path and cleanup pattern
Lesson 5 from the 4-day test-hell saga. The chroma cache lives at
tests/artifacts/.slop_cache/chroma_<collection>/, NOT at the per-run
live_gui_workspace_<timestamp>/ subdir. The trailing-slash bug in
Path(active_project_path).parent places the cache one level higher
than expected.

RAG tests must pre-clean the cache to avoid persistent state from
prior batched runs. Documents the cleanup pattern (shutil.rmtree with
ignore_errors=True), the auto-recovery mechanism (_validate_collection_dim),
and 3 anti-patterns (assuming per-run, not cleaning, asserting on
first chunk in batched context).
2026-06-10 20:18:09 -04:00
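
The pre-clean step described in commit 01ea22fc4a might be written along these lines. The directory layout and the `shutil.rmtree(..., ignore_errors=True)` call come from the commit message; the function names and the `root` parameter are hypothetical.

```python
import shutil
from pathlib import Path

# Cache root as stated in the commit message.
CACHE_ROOT = Path("tests/artifacts/.slop_cache")


def chroma_cache_dir(collection: str, root: Path = CACHE_ROOT) -> Path:
    """Return the cache directory for one collection: <root>/chroma_<collection>."""
    return root / f"chroma_{collection}"


def clean_chroma_cache(collection: str, root: Path = CACHE_ROOT) -> Path:
    """Delete the collection's cache directory; a missing directory is not an error."""
    target = chroma_cache_dir(collection, root)
    shutil.rmtree(target, ignore_errors=True)
    return target
```

A RAG test would call `clean_chroma_cache(...)` before indexing, so that chunks persisted by an earlier test in the same batch cannot leak into its assertions.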
ed f0b7c8b7d6 conductor(index): add Test Infrastructure Hardening to Recently Shipped
New entry at the top of the Recently Shipped list, linking to the
archive/ folder. Includes:
- 314/314 green across all 11 tier batches
- FR1-FR5 summary
- 3 lineage tracks also archived
- The 4 unblocked tracks
- Link to the closing batch-green report
2026-06-10 20:16:17 -04:00
ed 3945fe37fe conductor(tracks): archive test_infrastructure_hardening_20260609 in tracks.md
- Remove row 1 from Active Tracks table
- Update rows 2-5, 17: test_infrastructure_hardening_20260609 -> '(merged)'
- Mark test_infrastructure_hardening as [COMPLETE 2026-06-10] [archived]
- Update link to use archive/ instead of tracks/
- Add closing note: 314/314 tests green, lineage tracks also archived
2026-06-10 20:15:18 -04:00
ed 5d2624526b conductor(archive): move 4 test-hell lineage tracks to archive/
- workspace_path_finalize_20260609 -> archive/ (precursor track)
- test_infrastructure_hardening_20260609 -> archive/ (main 8-phase track)
- mma_tier_usage_reset_fix_20260610 -> archive/ (4 controller bug fixes)
- rag_phase4_sync_fix_20260610 -> archive/ (RAG dim-mismatch + rag_config reset)

The archive/ directory already existed (71+ archived tracks from
earlier phases). The 4 tracks' state.toml + metadata.json were already
closed in the prior commit. This just relocates the folders to match
the convention referenced in tracks.md.
2026-06-10 20:12:50 -04:00
ed 1ea38ad16b conductor(track): close 4 test-hell lineage tracks (state + metadata)
- test_infrastructure_hardening_20260609: status active->completed,
  last_updated 2026-06-09->2026-06-10, t7_*/t8_* tasks marked complete
  with commit SHAs (84edb200, 719fe9a, cb525519)
- mma_tier_usage_reset_fix_20260610: status spec->shipped
- rag_phase4_sync_fix_20260610: status spec->shipped
- workspace_path_finalize_20260609: status active->completed,
  current_phase 1->complete, all tasks marked complete
  (c725270b, 93ec2809), verification flags flipped to true
2026-06-10 20:09:01 -04:00
ed 2c924fe6df test(infra): poll-for-event race fixes + watchdog timeout bump + spec update 2026-06-10 15:14:35 -04:00
ed 80697e221a conductor(checkpoint): RAG phase 4 sync fix + test assertion fix - track complete 2026-06-10 13:55:06 -04:00
ed 2ad0d6a3f0 conductor(plan): Update RAG sync fix track state - sync works, retrieval assertion is separate 2026-06-10 13:29:18 -04:00
ed 989b2e6835 conductor(plan): New track for RAG phase 4 sync fix 2026-06-10 12:45:56 -04:00
ed 1772fa8fc2 conductor(checkpoint): Final Phase 2 complete - FR1+FR2 re-applied, sim test passes in batch 2026-06-10 12:13:16 -04:00
ed 14a329c1a9 conductor(plan): Adjust track after catastrophic git checkout - FR1+FR2 reverted, FR3+FR4 were no-ops 2026-06-10 11:45:56 -04:00
ed c729f8adaf conductor(plan): Update spec/plan for Phase 2 (live_gui sim test fragility) 2026-06-10 10:12:09 -04:00
ed e788512d93 conductor(plan): Mark mma_tier_usage_reset_fix_20260610 as complete 2026-06-10 09:59:26 -04:00
ed d304af5d22 sigh 2026-06-10 08:34:46 -04:00
ed 93ec28097c docs(styleguide): add workspace_paths.md — hard rule for test workspace paths 2026-06-09 20:36:41 -04:00
ed 39c97cb365 conductor(track): workspace_path_finalize_20260609 - plan with 3 phases, 4-step execution 2026-06-09 20:29:55 -04:00
ed c725270b99 conductor(track): workspace_path_finalize_20260609 - per-run workspace under tests/artifacts/ 2026-06-09 20:27:20 -04:00
ed 5656957622 conductor(plan): Phase 8 complete - docs + audit extended 2026-06-09 17:05:35 -04:00
ed d2ff6ffcf9 conductor(plan): Phase 7 complete - test_bed_health report 2026-06-09 16:59:16 -04:00
ed 3ed52be4bf conductor(plan): Phase 6 complete - clean_baseline marker 2026-06-09 16:42:48 -04:00
ed afc8600800 conductor(plan): Phase 5 complete - set_value hook verified 2026-06-09 16:35:18 -04:00
ed 6764c9e12f conductor(plan): Phase 4 complete - coalesce _sync_rag_engine 2026-06-09 16:27:15 -04:00
ed 45b4497a66 conductor(plan): Phase 3 complete - tmp_path_factory + live_gui_workspace fixture 2026-06-09 16:15:50 -04:00
ed 05ddb45236 conductor(plan): Phase 2 complete - FR1 handle + autouse fixture 2026-06-09 15:43:38 -04:00
ed 30c04860c7 conductor(plan): Phase 1 audit complete - ready for user review 2026-06-09 15:30:31 -04:00
ed 5df22fa8d5 conductor(audit): trace set_value('ai_input') flow to find routing bug 2026-06-09 15:29:27 -04:00
ed 5e13fa9ba7 conductor(audit): document _sync_rag_engine race in controller 2026-06-09 15:29:17 -04:00
ed aebbd66836 conductor(audit): document hardcoded workspace paths in test suite 2026-06-09 15:29:06 -04:00
ed d1c6c6c327 conductor(audit): catalog live_gui test cross-file state dependencies 2026-06-09 15:28:56 -04:00
ed fcb161fd2e conductor(tracks): add test_infrastructure_hardening_20260609 as foundation track + supersede 4 placeholder test tracks 2026-06-09 15:18:20 -04:00
ed 566cf08cb8 conductor(track): test_infrastructure_hardening_20260609 - spec to kill the test regression nightmare 2026-06-09 15:15:26 -04:00
conductor-tier2 ac0c0cbe73 docs(styleguide): add No-Diagnostic-Noise rule to AI-Agent Conventions
One addition to conductor/code_styleguides/python.md §8
"AI-Agent Specific Conventions":

- **No diagnostic noise in production code (Added
  2026-06-09).** `sys.stderr.write(f"[XYZ_DIAG] ...")` lines
  in src/*.py are technical debt. The right place for
  one-time investigation output is tests/artifacts/<test>.diag.log
  (a log file) or a standalone /tmp/diag_<name>.py script.
  If you must instrument production code, the diag lines
  are part of the same atomic commit as the fix.

- **Test files ARE allowed to be diagnostic.** The rule
  applies to src/*.py only; tests/test_*.py may use
  print(..., file=sys.stderr) freely.

Markdown only. No code modified.
2026-06-09 14:03:18 -04:00
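
One way to follow the No-Diagnostic-Noise rule from commit ac0c0cbe73 is to route investigation output to a per-test log file instead of stderr. The sketch below assumes a hypothetical `write_diag` helper; only the `tests/artifacts/<test>.diag.log` location is taken from the commit message.

```python
from pathlib import Path


def write_diag(
    test_name: str,
    message: str,
    artifacts_dir: Path = Path("tests/artifacts"),
) -> Path:
    """Append one diagnostic line to <artifacts_dir>/<test_name>.diag.log."""
    artifacts_dir.mkdir(parents=True, exist_ok=True)
    log_path = artifacts_dir / f"{test_name}.diag.log"
    with log_path.open("a", encoding="utf-8") as fh:
        # Normalize to exactly one trailing newline per message.
        fh.write(message.rstrip("\n") + "\n")
    return log_path
```

Because the output lands in a test artifact rather than in `src/*.py`, the instrumentation can be removed without touching production code.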
conductor-tier2 631c40c9c4 docs(workflow): add Process Anti-Patterns section + Isolated-Pass rule
Two additions to conductor/workflow.md §"Known Pitfalls":

1. **Isolated-Pass Verification Fallacy (Added 2026-06-09)** —
   the rule that a test passing in isolation but failing in
   batch is FAILING. The only verification that matters for
   live_gui tests is the batch run. This is the flip side of
   the existing "Live_gui Test Fragility (Authoring-Side)"
   rule. Cross-references that rule.

2. **Process Anti-Patterns (Added 2026-06-09)** — 8-rule
   summary list, with cross-reference to AGENTS.md for the
   full ruleset. The 8 patterns are: Deduction Loop,
   Report-Instead-of-Fix, Scope-Creep Track-Doc,
   Inherited-Cruft, Diagnostic Noise in Production, Premature
   Surrender, Verbose Commit Message, Isolated-Pass
   Verification Fallacy.

Markdown only. No code modified. Cross-references
AGENTS.md (the load-bearing agent doc) for the full text
of each pattern.
2026-06-09 14:03:00 -04:00