manual_slop

Author	SHA1	Message	Date
ed	1aaa2f626a	conductor(neural_dynamics_miller): Phase 4 Synthesis - report.md (1345 lines, 86KB) + summary.md (~400 words)	2026-06-22 00:50:49 -04:00
ed	4395329002	conductor(neural_dynamics_miller): Phase 3 OCR - 65 frames OCR'd via winsdk in 4.3s	2026-06-22 00:44:54 -04:00
ed	84df12a65e	conductor(neural_dynamics_miller): Phase 2 Keyframes - 65 unique frames (threshold 0.05)	2026-06-22 00:43:50 -04:00
ed	2e2b7cbc7e	conductor(neural_dynamics_miller): Phase 1 Acquire - transcript (1737 clean segments, 64KB) + 275MB mp4	2026-06-22 00:41:45 -04:00
ed	d20e1c2e78	conductor(handoff): code_path_audit_20260607 v2 - metadata + state + TIER2_STARTUP metadata.json: standard track metadata (15 fields per the live_gui_test_fixes_20260618 precedent; includes scope, depends_on, blocks, out_of_scope, tolerated_at_run_time, test_summary, verification_criteria, 10 risks). state.toml: initial state (status=active, current_phase=0; 14 phases pending; 19 verification flags all false). TIER2_STARTUP.md: the per-track readme for the Tier 2 agent. Track-specific supplement to conductor/tier2/agents/tier2-autonomous.md. Covers: what to load (plan_v2.md first, spec_v2.md second; do NOT load v1 spec/plan), hard bans (3-layer), conventions, TDD protocol, per-task commit protocol, pre-delegation checkpoint, failcount contract, 8 known gotchas, verification protocol, end-of-track handoff, out-of-scope restatement. EXPLICITLY NOTES: - any_type_componentization_20260621 + phase2_4_5_call_site_completion_20260621 are NOT on master (merged `f914b2bc`, reverted `751b94d4`). v2 audit is tolerant of their absence. - The 3 candidate aggregates (ToolSpec, ChatMessage, ProviderHistory) are forward-compat placeholders with is_candidate: True. The integration tests verify the placeholder format (synthesize_aggregate_profile() in Phase 9 Task 9.2 has the template hard-coded). - The 1-line extension to scripts/audit_optional_in_3_files.py is the audit gate; skipping Phase 12 Task 12.2 leaves the new file uncovered by the Optional[T] ban. Total v2 artifacts (committed): - spec_v2.md (460 lines) - plan_v2.md (5006 lines) - metadata.json - state.toml - TIER2_STARTUP.md	2026-06-22 00:27:03 -04:00
ed	85baea8cf0	conductor(plan): code_path_audit_20260607 v2 - 14 phases, 85+ tasks, 91 tests Worker-ready plan for the v2 implementation. 14 phases: 0. Setup (8 tasks: state.toml, empty files, fixture dirs) 1. Data model (11 tasks: 5 enums + 9 supporting dataclasses + AggregateProfile) 2. PCG (6 tasks: skeleton + P1/P2/P3 AST passes + build_pcg()) 3. MemoryDim classifier (5 tasks: 2 dicts + override loader + file heuristic + classifier) 4. APD (8 tasks: 4 thresholds + 4 pattern detectors + dominant_pattern + detect_access_pattern) 5. CFE (4 tasks: 6 caller sets + override loader + estimate_call_frequency) 6. Decomposition cost (9 tasks: 6 constants + per_call_cost + frequency_multiplier + componentize + unify + recommended + rationale + compute) 7. Cross-audit integration (7 tasks: read_input_json + 6 input contracts + 3-tier mapping + 2 coverage + aggregate + run_all) 8. v2 DSL (5 tasks: arity table + to_dsl_v2 + to_markdown + to_tree + parse_dsl_v2) 9. run_audit + CLI + MCP (7 tasks: 2 aggregate constants + synthesize + run_audit + render_rollups + CLI + MCP tool) 10. Integration tests (6 tasks: synthetic src/ + 4 function files + 6 JSON fixtures + 7 tests) 11. Live_gui E2E (2 tasks: 2 opt-in tests) 12. Meta-audit + extension + styleguide (4 tasks: 3 implementations) 13. End-of-track report (5 tasks: 1 run + 6 verifications + 1 report + 1 tracks.md update + 1 final verification) Total: 91 tests (84 unit + 7 integration; 2 live_gui opt-in). 13 per-aggregate profiles (10 real + 3 candidate). 4 top-level rollups (summary, cross_audit_summary, decomposition_matrix, candidates). 5 follow-up tracks recorded. No new pip dependencies. No modifications to existing src/*.py files (read-only on the 65 existing files). No modifications to the 5 existing audit scripts (consume their JSON). Self-review: spec coverage (all sections covered), placeholder scan (no TBDs), type consistency (no name mismatches). 5006 lines. spec_v2.md is 460 lines. Total v2 spec+plan: 5466 lines.	2026-06-22 00:18:44 -04:00
ed	7ea414e988	conductor(spec): code_path_audit_20260607 v2 - data-pipeline + decomposition-cost lens Re-scopes the audit from 'expensive operations per action' (v1) to 'data pipelines per aggregate' (v2). The v1 framing was correct 2026-06-07 (the 4 foundational tracks were future) but is now stale; v2 also cross-validates the data_structure_strengthening + data_oriented_error_handling deductions directly. 10 in-scope aggregates (Metadata, FileItem, FileItems, CommsLogEntry, CommsLog, HistoryMessage, History, ToolDefinition, ToolCall, Result[T]) + 3 candidate aggregates (ToolSpec, ChatMessage, ProviderHistory; forward-compat placeholders for any_type_componentization_20260621 which is NOT on master). 4 static analyses: PCG (3 AST passes), MemoryDim classifier, APD (5 access patterns), CFE (7 frequencies). 11 public functions, all return Result[T] per error_handling.md hard rule. Decomposition-cost heuristic per aggregate answers: 'should this data be componentized further (split) or unified further (wider fat structs)?' 4 directions: componentize, unify, hold, insufficient_data. 10-phase TDD plan, 69 tests total. Consumes JSON from 6 existing audit scripts (cross-validates data_structure_strengthening + data_oriented_error_handling). Out-of-scope: runtime profiling (deferred to pipeline_runtime_profiling_20260607), MMA worker spawn (cold). v1 spec.md + plan.md preserved unchanged.	2026-06-22 00:03:32 -04:00
ed	74e5521dca	conductor(brain_counterintuitive): Phase 5 Verification - end-of-track report + state.toml completed	2026-06-22 00:01:34 -04:00
ed	702a3b649c	conductor(brain_counterintuitive): Phase 4 Synthesis - report.md (1241 lines, 77KB) + summary.md (~400 words)	2026-06-22 00:00:10 -04:00
ed	7e61dd7d2f	conductor(brain_counterintuitive): Phase 3 OCR - 91 frames OCR'd via winsdk in 14.7s	2026-06-21 23:54:17 -04:00
ed	327fb0d06d	conductor(brain_counterintuitive): Phase 2 Keyframes - 91 unique frames (threshold 0.05)	2026-06-21 23:53:05 -04:00
ed	29dd6aa6be	conductor(brain_counterintuitive): Phase 1 Acquire - transcript (358 clean segments, 12KB) + 175MB mp4	2026-06-21 23:51:41 -04:00
ed	1e404548e0	conductor(generic_systems_fields): Phase 5 Verification - end-of-track report + state.toml completed	2026-06-21 23:31:03 -04:00
ed	92b2ec4a75	conductor(generic_systems_fields): Phase 4 Synthesis - report.md (1720 lines, 100KB) + summary.md (~410 words)	2026-06-21 23:29:35 -04:00
ed	d1d98c85ce	conductor(generic_systems_fields): Phase 3 OCR - 33 frames OCR'd via winsdk in 1.9s	2026-06-21 23:21:11 -04:00
ed	3c4dd5c20f	conductor(generic_systems_fields): Phase 2 Keyframes - 33 unique frames (threshold 0.05)	2026-06-21 23:18:21 -04:00
ed	99e955795f	conductor(generic_systems_fields): Phase 1 Acquire - transcript (885 clean segments, 30KB) + 58MB mp4	2026-06-21 23:16:13 -04:00
ed	900b68009b	conductor(free_lunches_levin): Phase 5 Verification - end-of-track report + state.toml completed	2026-06-21 23:07:20 -04:00
ed	35746d59ec	conductor(free_lunches_levin): Phase 4 Synthesis - report.md (1628 lines, 105KB) + summary.md (~400 words)	2026-06-21 23:05:51 -04:00
ed	8ff397cfd7	conductor(free_lunches_levin): Phase 3 OCR - 67 frames OCR'd via winsdk in 2.3s	2026-06-21 22:57:26 -04:00
ed	85799bdef1	conductor(free_lunches_levin): Phase 2 Keyframes - 67 unique frames (threshold 0.05)	2026-06-21 22:55:36 -04:00
ed	593da35589	conductor(free_lunches_levin): Phase 1 Acquire - transcript (1539 clean segments, 55KB) + 67MB mp4	2026-06-21 22:54:26 -04:00
ed	cbc6592938	conductor(platonic_intelligence_kumar): Phase 5 Verification - end-of-track report + state.toml completed	2026-06-21 22:41:50 -04:00
ed	8bb7bc0b03	conductor(platonic_intelligence_kumar): Phase 4 Synthesis - report.md (1564 lines, 104KB) + summary.md (384 words)	2026-06-21 22:40:27 -04:00
ed	751b94d4e8	Revert "merge: tier2/phase2_4_5_call_site_completion_20260621 (parent + follow-up + Phase 6e analysis)" This reverts commit `f914b2bcd4`, reversing changes made to `7fef95cc87`.	2026-06-21 22:39:14 -04:00
ed	f32e4fd268	conductor(platonic_intelligence_kumar): Phase 3 OCR - 62 frames OCR'd via winsdk in 3.7s	2026-06-21 22:33:09 -04:00
ed	f690b4dea4	conductor(platonic_intelligence_kumar): Phase 2 Keyframes - 62 unique frames from 133 raw (threshold 0.05)	2026-06-21 22:30:59 -04:00
ed	f914b2bcd4	merge: tier2/phase2_4_5_call_site_completion_20260621 (parent + follow-up + Phase 6e analysis) Merges 39 commits from tier2 sandbox: - any_type_componentization_20260621 parent (48/89 fat-struct sites; Phases 1,2,4,5 complete; Phase 3 deferred) - phase2_4_5_call_site_completion_20260621 follow-up (Phases 6a broadcast fix + 6b sender migration + 6e Phase 3 cost analysis; Phase 6d was a no-op) - docs/reports/PHASE3_TIER2_ANALYSIS.md (Tier 2 authoritative cost analysis; supersedes Tier 1's draft) Unblocks code_path_audit_20260607: - Phase 6a fixes the broadcast() TypeError that contaminated per-action profiling - Phase 6e provides the cost hypothesis the audit will quantify	2026-06-21 22:30:10 -04:00
ed	7fef95cc87	conductor(platonic_intelligence_kumar): Phase 1 Acquire - transcript (1659 clean segments, 61KB) + 89MB mp4	2026-06-21 22:29:25 -04:00
ed	c760b8e09d	conductor(score_dynamics_giorgini): Phase 5 Verification - end-of-track report + state.toml completed	2026-06-21 22:21:05 -04:00
ed	f1d157bf33	conductor(score_dynamics_giorgini): Phase 4 Synthesis - report.md (1325 lines, 93KB) + summary.md (354 words)	2026-06-21 22:19:42 -04:00
ed	077cdf20db	conductor(score_dynamics_giorgini): Phase 3 OCR - 31 frames OCR'd via winsdk in 2.3s	2026-06-21 22:13:03 -04:00
ed	edd2f181eb	conductor(score_dynamics_giorgini): Phase 2 Keyframes - 31 unique frames from 91 raw (threshold 0.05)	2026-06-21 21:45:49 -04:00
ed	16fbf5619f	conductor(score_dynamics_giorgini): Phase 1 Acquire - transcript (1485 clean segments, 46.5KB) + 178MB mp4	2026-06-21 20:43:50 -04:00
ed	49fb0a1a13	artifacts(track): throwaway scripts for phase2_4_5_call_site_completion_20260621 Per the Tier 2 convention, throwaway scripts are committed as archival artifacts so future agents can understand what was tried during the track. 7 scripts: - verify_test_format.py: AST + indentation check for new test file - _check_line_endings.py: CRLF vs LF diagnostic - _find_tracks_line.py: locate line 27 entry in tracks.md - _verify_line_66.py: verify new line 66 content - _update_tracks_md.py: programmatic update of line 27 - _update_state_toml.py: programmatic update of state.toml - _fix_state_toml_crlf.py: restore CRLF after edits	2026-06-21 20:00:57 -04:00
ed	7c3052c893	conductor(archive): ship phase2_4_5_call_site_completion_20260621 (4 phases + report) Updates: - conductor/tracks.md: entry #27 marked SHIPPED 2026-06-21; BLOCKER removed for code_path_audit_20260607 (broadcast() TypeError fixed) - state.toml: status=completed, current_phase=6, all 4 phases marked completed with checkpoint SHAs, all verification booleans true NOT shipped (per user instruction): - The git mv to conductor/tracks/archive/ is the USER's responsibility - Track directory stays at conductor/tracks/phase2_4_5_call_site_completion_20260621/ - tier2/any_type_componentization_20260621 branch NOT merged (reconnaissance framing)	2026-06-21 20:00:11 -04:00
ed	ae745886a7	docs(reports): TRACK_COMPLETION_phase2_4_5_call_site_completion_20260621	2026-06-21 19:54:04 -04:00
ed	e9b1138949	docs(analysis): PHASE3_TIER2_ANALYSIS - authoritative Phase 3 cost hypothesis Tier 2 produced this analysis during phase2_4_5_call_site_completion_20260621 Phase 6e. Supersedes Tier 1's draft at PHASE3_HYPOTHETICAL_PROMOTION.md (kept as the hypothesis doc; this is the refined version with in-context data from Phase 6b/6d work in src/ai_client.py). Key findings: - Measured 104 history references (Tier 1 estimated 112; 7% under) - Anthropic dominates per-turn cost (~35-65µs vs Tier 1's 8-15µs estimate) - Grok/qwen/llama are LOWER than Tier 1 estimated (~400ns vs 2-8µs) - Total per-session: ~0.5-1.0ms (Tier 1 estimated 1.1-2.4ms) - Discovered 3 hidden cross-references Tier 1 missed (_strip_private_keys, _extract_minimax_reasoning, _send_llama_native) - Recommendations for the future Phase 3 track: anthropic first; use 'with h.lock: msg_list = h.messages' for read snapshots; use 'with h.lock: h.messages = [filtered]' for in-place mutations Covers all 6 senders (anthropic, deepseek, minimax, grok, qwen, llama) with per-site cost estimates + hidden cross-references + recommendations. The audit (code_path_audit_20260607) quantifies these estimates after merge.	2026-06-21 19:52:15 -04:00
ed	06287dbb95	refactor(ai_client): migrate _send_grok/_send_minimax/_send_llama to ChatMessage API Completes the deferred t2_6 task from any_type_componentization_20260621 Phase 2. The 3 OpenAI-compatible senders now construct OpenAICompatibleRequest with messages=[ChatMessage(role=, content=)] instead of list[dict] literals. The _<provider>_history global lists are still dicts (Phase 3 deferred to a separate track); the migration converts each dict to ChatMessage at the request-build boundary via list comprehension. The backward-compat shim in openai_compatible.py:86 (m.to_dict() if hasattr(m, 'to_dict') else m) handles both ChatMessage and dict transparently. Verified: 20/20 provider tests pass; tier-1-unit (5 pre-existing sandbox-pollution failures unchanged); no new regressions.	2026-06-21 19:47:40 -04:00
ed	76b10e734d	fix(broadcast): migrate WebSocketServer.broadcast() callers to WebSocketMessage signature Phase 5 of any_type_componentization_20260621 changed WebSocketServer.broadcast(channel, payload) -> broadcast(message: WebSocketMessage) but did not update internal callers. This produced worker[queue_fallback] TypeError spam on the GUI thread. Fixed 2 sites: - src/app_controller.py:1849 _process_pending_gui_tasks (telemetry broadcast) - src/events.py:115 AsyncEventQueue.put (events broadcast) gui_2.py has no internal broadcast callers (grep verified). Both callers now construct WebSocketMessage(channel=, payload=) at the call site. test_websocket_broadcast_regression.py 4/4 pass (was 1/4 failing in red phase).	2026-06-21 19:26:14 -04:00
ed	0c7a12a3fa	test(broadcast): add regression test for WebSocketServer.broadcast() signature Phase 5 of any_type_componentization_20260621 changed WebSocketServer.broadcast(channel, payload) -> broadcast(message: WebSocketMessage) but did not update internal callers in src/app_controller.py + src/events.py. This adds 4 tests that pin the contract: - test_websocket_server_broadcast_signature: asserts (self, message) signature - test_websocket_server_broadcast_rejects_legacy_2arg_call: asserts legacy raises TypeError - test_websocket_server_broadcast_accepts_websocket_message_instance: smoke test - test_internal_callers_use_websocket_message_signature: structural grep over src/ The 4th test currently FAILS (red phase), identifying 2 legacy sites: - src/app_controller.py:1849: self.event_queue.websocket_server.broadcast('telemetry', metrics) - src/events.py:115: self.websocket_server.broadcast('events', {...}) The structural assertion is reused by code_path_audit_20260607.	2026-06-21 19:23:00 -04:00
ed	1dce32037a	un-archive data structure strengthening	2026-06-21 19:18:14 -04:00
ed	e4ec494b89	artifacts	2026-06-21 19:14:57 -04:00
ed	91775ee391	Merge branch 'master' of C:\projects\manual_slop into tier2/any_type_componentization_20260621	2026-06-21 19:08:35 -04:00
ed	6275c860bf	conductor(spec+plan): add Phase 6e to follow-up - Tier 2 authoritative Phase 3 cost deduction The follow-up track now includes Phase 6e: Tier 2 produces the authoritative Phase 3 cost analysis as part of the follow-up work. Tier 2 is in src/ai_client.py doing Phase 6b/6d anyway; they have full context to produce the refined cost hypothesis that Tier 1's draft at PHASE3_HYPOTHETICAL_PROMOTION.md could not (Tier 1 worked without the 6b/6d ground-truth context). Tier 1's draft STAYS as the hypothesis doc. Tier 2's PHASE3_TIER2_ANALYSIS.md is the refined version (per-sender cost summary + hidden call sites table + recommendations for the future Phase 3 track + cross-reference to Tier 1 explicit). Phase 6e tasks (5 total, ~2 commits): - t6e_1: Profile the 6 senders (codepath catalog + hidden cross-refs) - t6e_2: Qualitative cost estimation per sender - t6e_3: Identify hot iteration sites needing 'with h.lock:' pattern - t6e_4: Author PHASE3_TIER2_ANALYSIS.md - t6e_5: Phase 6e checkpoint commit + git note Total estimated commits: 16 -> 18 (still within Tier 2 1-4 hour budget). Files updated: - conductor/tracks/phase2_4_5_call_site_completion_20260621/spec.md (+50 lines) - conductor/tracks/phase2_4_5_call_site_completion_20260621/plan.md (+146 lines) - conductor/tracks/phase2_4_5_call_site_completion_20260621/metadata.json (+13 lines) - conductor/tracks/phase2_4_5_call_site_completion_20260621/state.toml (+9 lines) - conductor/tracks.md (track 27 entry expanded with Phase 6e details)	2026-06-21 18:55:54 -04:00
ed	1a739ecef5	conductor(spec+plan): phase2_4_5_call_site_completion_20260621 + code_path_audit pre-flight adjustments + Phase 3 analysis PHASE 2/4/5 FOLLOW-UP TRACK (Tier 1 decided SHRINK to 6a + 6b + 6d): - Phase 6a: Fix HookServer.broadcast() callers (app_controller.py + events.py + gui_2.py) Adds tests/test_websocket_broadcast_regression.py with no-TypeError assertion - Phase 6b: Complete _send_grok/_send_minimax/_send_llama OpenAICompatibleRequest migration - Phase 6d: Update those 3 senders' NormalizedResponse to use UsageStats Total: ~16 atomic commits, ~3 hours Tier 2 work. Unblocks code_path_audit_20260607. CODE_PATH_AUDIT_20260607 PRE-FLIGHT ADJUSTMENTS (per handoffs): - Add 2 new actions: provider_history_append + websocket_broadcast - Add 5 micro-benchmarks: NormalizedResponse.__init__, WebSocketMessage.__init__, UsageStats.__init__, ProviderHistory.lock, ToolSpec.__init__ - Add no-TypeError-errors-on-any-thread assertion (backs test_websocket_broadcast_regression.py) - Add 89 fat-struct sites from ANY_TYPE_AUDIT_20260621.md as instrumented targets - BLOCKER: phase2_4_5_call_site_completion_20260621 (broadcast() TypeError) PHASE 3 HYPOTHETICAL ANALYSIS (separate doc): docs/reports/PHASE3_HYPOTHETICAL_PROMOTION.md - dataclass definitions (already on tier2 branch), per-provider codepath catalog (112 sites), qualitative cost estimation (~+1-2ms per session, ~+8-15us per _send_anthropic turn). Input for the audit; the audit quantifies the cost. REGISTRATION: conductor/tracks.md updated: new row 27 (follow-up), new row 28 (parent any_type_componentization), row 17 (code_path_audit) updated with pre-flight adjustments note. Files: - conductor/tracks/phase2_4_5_call_site_completion_20260621/spec.md (NEW; 633 lines) - conductor/tracks/phase2_4_5_call_site_completion_20260621/plan.md (NEW; 7 phases, 23 tasks) - conductor/tracks/phase2_4_5_call_site_completion_20260621/metadata.json (NEW; 8.8KB) - conductor/tracks/phase2_4_5_call_site_completion_20260621/state.toml (NEW; 11.8KB) - docs/reports/PHASE3_HYPOTHETICAL_PROMOTION.md (NEW; 380 lines; qualitative cost analysis) - conductor/tracks/code_path_audit_20260607/spec.md (MODIFIED; +93 lines Pre-Flight Adjustments) - conductor/tracks.md (MODIFIED; +35 lines: 3 new entries + 1 stale row fix)	2026-06-21 18:32:02 -04:00
ed	f08394a98c	Merge branch 'master' of C:\projects\manual_slop into tier2/any_type_componentization_20260621	2026-06-21 18:13:40 -04:00
ed	95a8fae234	docs(handoff): Tier 1 prompt - follow-up track + audit sequencing Synthesizes the 2 prior handoff docs into a ready-to-use Tier 1 brief: - HANDOFF_CODE_PATH_AUDIT_FROM_any_type_componentization.md (the audit framing) - HANDOFF_FOLLOWUP_TRACK_FROM_any_type_componentization.md (the test failures + scope) Sections: 1. TL;DR (3 paragraphs): what happened, the hidden broadcast() bug, the recommendation (don't merge; use as input for follow-up track) 2. Context: 48 promoted, 41 deferred, 2 new audits, 1 styleguide 3. 4 decision points for Tier 1 (scope, sequencing, audit adjustments, scope expansion) 4. The 4 documents Tier 1 should read in order (45 min total) 5. What Tier 1 should NOT do (3 anti-patterns) 6. What Tier 1 SHOULD do (6 concrete first steps) 7. What Tier 2 is available for (conventions reminder) 8. The bigger vision (agent-debugger framing) Recommended sequencing for Tier 1: T0: Approve follow-up track scope T1: Tier 2 implements Phase 6a + 6b + 6d (~18 commits, 3 hours) T2: Tier 2 runs tier-1-unit-core FULLY (no stop-on-failure) T3: Tier 2 runs tier-3-live_gui FULLY T4: Tier 1 reviews + merges follow-up track T5: Tier 1 launches code_path_audit_20260607 T6: Tier 2 implements Phase 3 + cross-phase coupling (separate track) Tier 1's scope decision: I recommend the SHRUNK version (Phase 6a + 6b + 6d only; defer Phase 3 to its own track). This gives the code-path audit a clean instrumented target without ballooning the follow-up beyond Tier 2's 1-4 hour budget. Audit adjustments to add: - 5 micro-benchmarks (NormalizedResponse.__init__, WebSocketMessage.__init__, UsageStats.__init__, ProviderHistory.lock, ToolSpec.__init__) - 'no-TypeError-errors-on-any-thread' assertion - Instrument grok/minimax/llama providers (currently unprofiled) - Add 2 new actions: provider_history_append + websocket_broadcast	2026-06-21 17:57:38 -04:00
ed	4bbc69019e	chore(gitignore): add video_analysis artifact patterns (.mp4, .vtt) Per FR8 in conductor/tracks/video_analysis_campaign_20260621/spec.md, mp4 files are too large for git and VTT auto-sub files are regenerable from transcript.json. Note: existing tracked files in entropy_epiplexity (commit `5c5f347c`) are still in history. The gitignore prevents FUTURE commits from adding them. To remove from history requires filter-repo/filter-branch rewrite (out of scope for this commit).	2026-06-21 17:54:39 -04:00
ed	b3ed4b1508	docs(handoff): test failure report for follow-up track scoping Categorizes the 12 test failures the user observed when running scripts/run_tests_batched.py after this track: - 10 failures (mine): Phase 2 NormalizedResponse API migration incomplete (state.toml t2_6 deferred task); FIXED in commit `30c8b263` - 3 failures (sandbox): test_audit_tier2_leaks.py flags sandbox files (mcp_paths.toml, opencode.json) as modified; NOT my fault - 1 failure (pre-existing): test_gui2_custom_callback_hook_works; live_gui test not touched by this track Hidden 12th failure: - worker[queue_fallback] error: WebSocketServer.broadcast() takes 2 positional arguments but 3 were given (appeared 6+ times during tier-2-mock-app-core but tests still passed; error logged on GUI thread from app_controller._run_pending_tasks_once_result). Phase 5 refactored broadcast(channel, payload) to broadcast(WebSocketMessage); I updated test_websocket_server.py but missed app_controller.py and events.py callers. Sections: 1. Executive summary (3 categories of failure) 2. Per-failure categorization (10 + 3 + 1) 3. Hidden 12th failure: WebSocket broadcast callers in app_controller 4. Phase 2 API migration status (8 sites; 5 done, 3 unverified) 5. Recommendations for follow-up track (~5 call sites + ~41 Phase 3) 6. Code-path audit input (5 micro-benchmarks to add) Follow-up track scope: ~15-20 commits, well-scoped. Should run BEFORE code_path_audit_20260607 because the worker[queue_fallback] TypeError spam will confuse the audit's runtime instrumentation.	2026-06-21 17:53:48 -04:00
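
Several rows above (`0c7a12a3fa`, `76b10e734d`, `b3ed4b1508`) describe the same change: `WebSocketServer.broadcast(channel, payload)` became `broadcast(message: WebSocketMessage)`, and callers that kept the old two-argument shape raised `TypeError`. The sketch below illustrates that contract only. Both classes are hypothetical stand-ins written for this example: the `channel` and `payload` field names come from the commit text, while the `sent` list and the `legacy_call_fails` helper are assumptions and are not taken from the repository.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class WebSocketMessage:
    """Hypothetical stand-in; field names follow the commit messages."""
    channel: str
    payload: Any


class WebSocketServer:
    """Hypothetical stand-in that records messages instead of sending them."""

    def __init__(self) -> None:
        self.sent: list[WebSocketMessage] = []

    def broadcast(self, message: WebSocketMessage) -> None:
        # Single-argument signature, as described for the Phase 5 change.
        self.sent.append(message)


def legacy_call_fails(server: WebSocketServer) -> bool:
    """Return True if the old broadcast(channel, payload) shape raises TypeError."""
    try:
        server.broadcast("telemetry", {"fps": 60})  # old two-argument call
    except TypeError:
        return True
    return False


server = WebSocketServer()
# Migrated call site: build the message object where the call is made.
server.broadcast(WebSocketMessage(channel="telemetry", payload={"fps": 60}))
```

A regression test in the spirit of `test_websocket_server_broadcast_rejects_legacy_2arg_call` could assert that `legacy_call_fails(server)` is true, which pins the new signature without needing a live socket.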

4133 Commits