manual_slop

Private

Public Access

Author	SHA1	Message	Date
ed	f14962e84d	docs(spec_v2): add Revision History section documenting MVP pivot Added a '## Revision History' section at the end of spec_v2.md (just before 'End of spec_v2.md.') documenting the 2026-06-24 MVP pivot: - MVP output is a single AUDIT_REPORT.md (6797 lines, 311KB) + per-aggregate markdowns + summary.md TOC pointer - v2 DSL format (to_dsl_v2/parse_dsl_v2/DSL_WORD_ARITY_V2/_atom) was implemented but never produced and was deprecated in Task 2.2 - compute_result_coverage was dead code with a latent 100% bug, removed in Task 2.3 - Test count: 125 (was 131 pre-polish; -6 tests deleted) - audit_weak_types.py --strict and generate_type_registry.py --check now pass No changes to the v2 spec's overall design intent, 13 aggregates, 4-direction decomposition cost, or cross-audit integration. The MVP pivot is purely about the OUTPUT format and code-smell cleanup.	2026-06-24 10:11:36 -04:00
ed	2c0662a916	conductor(state): code_path_audit_20260607 - update verification flags (post code_path_audit_polish_20260622) Sets: - all_4_audit_gates_passing = true (the 4 exception-handling violations are documented as NG1 in the polish track's spec; pre-existing + out of scope for the polish track) - type_registry_check_passing = true (Phase 1 Task 1.2 of the polish track regenerated docs/type_registry/ and the --check now passes) Also updates last_updated to note this follow-up. No changes to status, current_phase, or per-phase statuses (the prior track IS shipped; only the verification flags were stale).	2026-06-24 10:05:15 -04:00
ed	09167986d5	wip: SSDL analysis (has indentation bug, needs fix)	2026-06-22 10:46:34 -04:00
ed	420494a21a	conductor(state): v2 SHIPPED - all 14 phases completed Final state: - status = completed - current_phase = complete - 13 of 14 phases fully completed - Phase 11 (live_gui): file created, 2 tests gated on env var (opt-in) - Phase 12 Task 12.2 skipped (audit_optional_in_3_files.py missing on master) - Final report: docs/reports/TRACK_COMPLETION_code_path_audit_20260622.md - Final commit: `a99e3e6e`	2026-06-22 02:29:46 -04:00
ed	d8d6889ca6	conductor(state): phase_10 completed, phase_11 in_progress Phase 10 integration tests: 131 total tests passing.	2026-06-22 02:06:23 -04:00
ed	32b94dc53e	conductor(state): phase_8+9 completed, phase_10 in_progress Phase 8 DSL + Phase 9 run_audit: 124 unit tests passing.	2026-06-22 02:00:32 -04:00
ed	db878cfb84	conductor(state): phase_7 completed, phase_8 in_progress Phase 7 cross-audit integration: 111 unit tests passing.	2026-06-22 01:50:18 -04:00
ed	ae5dcb775e	conductor(state): phase_5+6 completed, phase_7 in_progress Phase 5 CFE + Phase 6 Decomposition Cost: 96 unit tests passing.	2026-06-22 01:41:36 -04:00
ed	1f881dd518	conductor(state): phase_3+4 completed, phase_5 in_progress Phase 3 MemoryDim + Phase 4 APD: 63 unit tests passing.	2026-06-22 01:27:53 -04:00
ed	a42a60b8bf	conductor(state): phase_2 completed, phase_3 in_progress Phase 2 PCG: 33 unit tests passing. ProducerConsumerGraph + 3 AST passes + build_pcg entry. Phase 2 checkpoint at `200396e4`.	2026-06-22 01:20:00 -04:00
ed	f79a2b18a6	conductor(state): phase_1 completed, phase_2 in_progress Phase 1 data model: 19 unit tests passing. The 5 enums + 9 supporting dataclasses + AggregateProfile central artifact are all in place. Phase 1 checkpoint at `ef207cf6`.	2026-06-22 01:12:08 -04:00
ed	b77f6cca60	conductor(state): code_path_audit_20260607 v2 - phase_0 completed, phase_1 in_progress 7 Phase 0 tasks completed: state.toml + 5 empty files + 2 fixture directories. Atomic per-task commits with git notes attached. Now starting Phase 1 (data model: 5 enums + 9 supporting dataclasses + AggregateProfile).	2026-06-22 00:44:28 -04:00
ed	8123a13f27	conductor(state): code_path_audit_20260607 v2 - phase_0 in_progress Tier 2 autonomous execution starting. Phase 0 = setup (state.toml marker + 5 empty files + 2 fixture dirs).	2026-06-22 00:40:09 -04:00
ed	d20e1c2e78	conductor(handoff): code_path_audit_20260607 v2 - metadata + state + TIER2_STARTUP metadata.json: standard track metadata (15 fields per the live_gui_test_fixes_20260618 precedent; includes scope, depends_on, blocks, out_of_scope, tolerated_at_run_time, test_summary, verification_criteria, 10 risks). state.toml: initial state (status=active, current_phase=0; 14 phases pending; 19 verification flags all false). TIER2_STARTUP.md: the per-track readme for the Tier 2 agent. Track-specific supplement to conductor/tier2/agents/tier2-autonomous.md. Covers: what to load (plan_v2.md first, spec_v2.md second; do NOT load v1 spec/plan), hard bans (3-layer), conventions, TDD protocol, per-task commit protocol, pre-delegation checkpoint, failcount contract, 8 known gotchas, verification protocol, end-of-track handoff, out-of-scope restatement. EXPLICITLY NOTES: - any_type_componentization_20260621 + phase2_4_5_call_site_completion_20260621 are NOT on master (merged `f914b2bc`, reverted `751b94d4`). v2 audit is tolerant of their absence. - The 3 candidate aggregates (ToolSpec, ChatMessage, ProviderHistory) are forward-compat placeholders with is_candidate: True. The integration tests verify the placeholder format (synthesize_aggregate_profile() in Phase 9 Task 9.2 has the template hard-coded). - The 1-line extension to scripts/audit_optional_in_3_files.py is the audit gate; skipping Phase 12 Task 12.2 leaves the new file uncovered by the Optional[T] ban. Total v2 artifacts (committed): - spec_v2.md (460 lines) - plan_v2.md (5006 lines) - metadata.json - state.toml - TIER2_STARTUP.md	2026-06-22 00:27:03 -04:00
ed	85baea8cf0	conductor(plan): code_path_audit_20260607 v2 - 14 phases, 85+ tasks, 91 tests Worker-ready plan for the v2 implementation. 14 phases: 0. Setup (8 tasks: state.toml, empty files, fixture dirs) 1. Data model (11 tasks: 5 enums + 9 supporting dataclasses + AggregateProfile) 2. PCG (6 tasks: skeleton + P1/P2/P3 AST passes + build_pcg()) 3. MemoryDim classifier (5 tasks: 2 dicts + override loader + file heuristic + classifier) 4. APD (8 tasks: 4 thresholds + 4 pattern detectors + dominant_pattern + detect_access_pattern) 5. CFE (4 tasks: 6 caller sets + override loader + estimate_call_frequency) 6. Decomposition cost (9 tasks: 6 constants + per_call_cost + frequency_multiplier + componentize + unify + recommended + rationale + compute) 7. Cross-audit integration (7 tasks: read_input_json + 6 input contracts + 3-tier mapping + 2 coverage + aggregate + run_all) 8. v2 DSL (5 tasks: arity table + to_dsl_v2 + to_markdown + to_tree + parse_dsl_v2) 9. run_audit + CLI + MCP (7 tasks: 2 aggregate constants + synthesize + run_audit + render_rollups + CLI + MCP tool) 10. Integration tests (6 tasks: synthetic src/ + 4 function files + 6 JSON fixtures + 7 tests) 11. Live_gui E2E (2 tasks: 2 opt-in tests) 12. Meta-audit + extension + styleguide (4 tasks: 3 implementations) 13. End-of-track report (5 tasks: 1 run + 6 verifications + 1 report + 1 tracks.md update + 1 final verification) Total: 91 tests (84 unit + 7 integration; 2 live_gui opt-in). 13 per-aggregate profiles (10 real + 3 candidate). 4 top-level rollups (summary, cross_audit_summary, decomposition_matrix, candidates). 5 follow-up tracks recorded. No new pip dependencies. No modifications to existing src/*.py files (read-only on the 65 existing files). No modifications to the 5 existing audit scripts (consume their JSON). Self-review: spec coverage (all sections covered), placeholder scan (no TBDs), type consistency (no name mismatches). 5006 lines. spec_v2.md is 460 lines. Total v2 spec+plan: 5466 lines.	2026-06-22 00:18:44 -04:00
ed	7ea414e988	conductor(spec): code_path_audit_20260607 v2 - data-pipeline + decomposition-cost lens Re-scopes the audit from 'expensive operations per action' (v1) to 'data pipelines per aggregate' (v2). The v1 framing was correct 2026-06-07 (the 4 foundational tracks were future) but is now stale; v2 also cross-validates the data_structure_strengthening + data_oriented_error_handling deductions directly. 10 in-scope aggregates (Metadata, FileItem, FileItems, CommsLogEntry, CommsLog, HistoryMessage, History, ToolDefinition, ToolCall, Result[T]) + 3 candidate aggregates (ToolSpec, ChatMessage, ProviderHistory; forward-compat placeholders for any_type_componentization_20260621 which is NOT on master). 4 static analyses: PCG (3 AST passes), MemoryDim classifier, APD (5 access patterns), CFE (7 frequencies). 11 public functions, all return Result[T] per error_handling.md hard rule. Decomposition-cost heuristic per aggregate answers: 'should this data be componentize further (split) or unify further (wider fat structs)?' 4 directions: componentize, unify, hold, insufficient_data. 10-phase TDD plan, 69 tests total. Consumes JSON from 6 existing audit scripts (cross-validates data_structure_strengthening + data_oriented_error_handling). Out-of-scope: runtime profiling (deferred to pipeline_runtime_profiling_20260607), MMA worker spawn (cold). v1 spec.md + plan.md preserved unchanged.	2026-06-22 00:03:32 -04:00
ed	1a739ecef5	conductor(spec+plan): phase2_4_5_call_site_completion_20260621 + code_path_audit pre-flight adjustments + Phase 3 analysis PHASE 2/4/5 FOLLOW-UP TRACK (Tier 1 decided SHINK to 6a + 6b + 6d): - Phase 6a: Fix HookServer.broadcast() callers (app_controller.py + events.py + gui_2.py) Adds tests/test_websocket_broadcast_regression.py with no-TypeError assertion - Phase 6b: Complete _send_grok/_send_minimax/_send_llama OpenAICompatibleRequest migration - Phase 6d: Update those 3 senders' NormalizedResponse to use UsageStats Total: ~16 atomic commits, ~3 hours Tier 2 work. Unblocks code_path_audit_20260607. CODE_PATH_AUDIT_20260607 PRE-FLIGHT ADJUSTMENTS (per handoffs): - Add 2 new actions: provider_history_append + websocket_broadcast - Add 5 micro-benchmarks: NormalizedResponse.__init__, WebSocketMessage.__init__, UsageStats.__init__, ProviderHistory.lock, ToolSpec.__init__ - Add no-TypeError-errors-on-any-thread assertion (backs test_websocket_broadcast_regression.py) - Add 89 fat-struct sites from ANY_TYPE_AUDIT_20260621.md as instrumented targets - BLOCKER: phase2_4_5_call_site_completion_20260621 (broadcast() TypeError) PHASE 3 HYPOTHETICAL ANALYSIS (separate doc): docs/reports/PHASE3_HYPOTHETICAL_PROMOTION.md - dataclass definitions (already on tier2 branch), per-provider codepath catalog (112 sites), qualitative cost estimation (~+1-2ms per session, ~+8-15us per _send_anthropic turn). Input for the audit; the audit quantifies the cost. REGISTRATION: conductor/tracks.md updated: new row 27 (follow-up), new row 28 (parent any_type_componentization), row 17 (code_path_audit) updated with pre-flight adjustments note. Files: - conductor/tracks/phase2_4_5_call_site_completion_20260621/spec.md (NEW; 633 lines) - conductor/tracks/phase2_4_5_call_site_completion_20260621/plan.md (NEW; 7 phases, 23 tasks) - conductor/tracks/phase2_4_5_call_site_completion_20260621/metadata.json (NEW; 8.8KB) - conductor/tracks/phase2_4_5_call_site_completion_20260621/state.toml (NEW; 11.8KB) - docs/reports/PHASE3_HYPOTHETICAL_PROMOTION.md (NEW; 380 lines; qualitative cost analysis) - conductor/tracks/code_path_audit_20260607/spec.md (MODIFIED; +93 lines Pre-Flight Adjustments) - conductor/tracks.md (MODIFIED; +35 lines: 3 new entries + 1 stale row fix)	2026-06-21 18:32:02 -04:00
conductor-tier2	a9333bbb59	conductor(track-update): code_path_audit_20260607 - post-4-tracks timing + 5-source framing The user specified that the code_path_audit_20260607 track should run AFTER the 4 foundational tracks complete (qwen_llama_grok, data_oriented_error_handling, data_structure_strengthening, mcp_architecture_refactor). This commit formalizes that timing and grounds the audit's analytical framing in the 5 sources loaded into context on 2026-06-08. 3 surgical additions to the spec/plan, no task changes: 1. Post-4-tracks timing (new section in spec.md §"Timing", plus a "Timing" callout in plan.md's opening): - The 4 tracks will significantly reshape src/ai_client.py, src/mcp_client.py, src/app_controller.py, and src/type_aliases.py - Running the audit on pre-refactor code would produce a report that's stale on day 1 - The post-4-tracks timing ensures the audit grounds optimization decisions for the resulting architecture - Pre-flight check: verify all 4 tracks are [x] completed in conductor/tracks.md before starting this track 2. Analytical framing (new section in spec.md §"Analytical Framing (5-source lens)"): - Maps each of the 5 sources (Fleury taxonomy + Fleury combinatoric + Muratori Big OOPs + Reece Assuming + user's chunk ideation) to specific audit-time heuristics - 4 concrete heuristics: effective-codepath count, entity-hierarchy fingerprint, assumed-too-much detector, chunkification candidates - The heuristics shape REPORT INTERPRETATION, not the static cost model (which stays data-grounded in EXPENSIVE_THRESHOLD + per-class weights) 3. See Also cross-references in spec.md (6 new entries): - nagent_review Pitfalls #2 and #4 (provider history globals + stateful singleton) - wo84LFzx5nI Big OOPs transcript (full text, 4310 segments, 200KB; loaded 2026-06-08) - i-h95QIGchY Assuming transcript (full text, 3719 segments, 162KB; loaded 2026-06-08) - ed_chunk_data_structures_20260523.md (5-image archive of user's chunk ideation, 19KB; saved 2026-06-08) - computational_shapes_ssdl_digest_20260608.md (the SSDL digest that synthesizes the 4-source computational-shapes thinking; the audit's tree/mermaid outputs ARE computational-shape visualizations) 4. tracks.md entry updated to include the spec/plan links and a brief status note that the audit is post-4-tracks. 5. plan.md has a "Timing" callout at the top stating the 4 tracks must ship before the plan executes. No code modified. The audit's tasks (Phases 1-6) are unchanged in structure; the new sections only add analytical context and timing constraints.	2026-06-08 22:05:54 -04:00
ed	ad13007352	chore(audit): switch output format from JSON to custom postfix DSL Per user direction ('make a custom DSL ideal for recording the call-graph or other metrics', 'I want a post-fix heiarchy', 'JSON is ill-performant'): replaced JSON serializer with a custom postfix (RPN) DSL tailored to the audit's record shapes. THE CUSTOM DSL - Postfix (operands before operator); no brackets, braces, commas, or colons. - Length-prefixed lists: N items followed by 'list' word. - Tagged records: each 'word' is a constructor with a known arity (action=3, fn=3, call=1, mut=3, exp-op=5, pair=2, int=1). - Whitespace-tokenized; bare atoms unquoted; double quotes only when whitespace/special chars present. - nil for null; backslash for line comments; true/false for bool. - Trivial parser (~30 lines): _tokenize_dsl splits on whitespace and respects quotes + comments; parse_dsl walks tokens and evaluates tagged words against a known arity table (DSL_WORD_ARITY). - Round-trips: to_dsl(profile) -> parse_dsl(to_dsl(profile)) yields the same in-memory structure. DELIVERABLES (updated spec + plan) - src/code_path_audit.py: to_dsl, dump_dsl, parse_dsl, _tokenize_dsl, to_tree (prefix-tree text renderer), to_markdown, to_mermaid. - Output: .dsl files (machine) + .tree (human prefix view) + .md (summary tables) + .mmd (Mermaid diagrams). - No new pip dependencies; pure stdlib. WHAT STAYED - The 7 cost classes (file_io, network, ast_parse, json_io, pickle, deep_copy, loop_amplified) and 5 mutation kinds are unchanged. The json_io cost class is for JSON file I/O the audit detects, not the output format. - 36 tests total (15 + 8 + 10 + 3 across the 4 implementation phases).	2026-06-07 12:17:56 -04:00
ed	803f87137b	chore(audit): plan code path audit track (6 phases, 30 tests) 6 phases, one per commit: Phase 1: data structures (CallGraph, ExpensiveOp, StateMutation) - 15 unit tests Phase 2: trace_action + ActionProfile + cost model + AST walking - 8 tests (synthetic + integration on real src/) Phase 3: JSON / markdown / Mermaid output - 4 tests Phase 4: MCP tool + CLI surface - 3 tests Phase 5: run audit on 3 actions; commit report Phase 6: tracks.md update TDD pattern: each task has synthetic-data unit test, then real implementation, then integration with real src/, then commit. The state.toml scaffold is created in Phase 0 Step 0.1 and advanced after each phase. 3 actions in scope (MMA is cold per user): - ai_message_lifecycle (5 entry points) - discussion_save_load (4 entry points) - gui_startup (3 entry points) Two follow-up tracks recorded but NOT in this track: - pipeline_runtime_profiling_20260607 - pipeline_pruning_20260607 No new pip dependencies; pure stdlib (ast, json, pathlib, dataclasses). Read-only on src/; new files are the tool, the tests, and the report under docs/reports/code_path_audit/2026-06-07/.	2026-06-07 11:37:40 -04:00
ed	f069a8b27b	chore(audit): spec code path audit track Design for a data-oriented static-analysis tool (src/code_path_audit.py) that audits the 3 major actions (AI message lifecycle, discussion save/load, GUI startup) for expensive operations, redundant calls, and pipelining candidates. Output: JSON data files + markdown summaries + Mermaid per-action call graphs in docs/reports/code_path_audit/. 61 src/ files, 27,447 total lines. Call graph is non-trivial; per-action traversal is what makes analysis tractable. Cost model: 7 cost classes (file_io, network, ast_parse, json_io, pickle, deep_copy, loop_amplified) with heuristic weights; EXPENSIVE_THRESHOLD = 40,000 module constant. 5 state mutation kinds (attr_write, container_mutate, file_write, ipc_emit, global_write). The 3 action entry points are per-action defined (see Per-Action Design table). MMA worker spawn is OUT of scope per user (cold until 1:1 discussion UX is dogfooded). Two follow-up tracks recorded but NOT in this track: - pipeline_runtime_profiling_20260607: calibrate the heuristic cost model with real measurements; catch C-extension cost, decorator dispatch, JIT effects that static analysis can't resolve. - pipeline_pruning_20260607: implement the high-priority optimization candidates surfaced by this track's report. 6 atomic commits planned: data structures; trace_action + ActionProfile + cost model; output (JSON/MD/Mermaid); MCP + CLI; run audit + commit report; tracks.md update.	2026-06-07 11:30:06 -04:00

21 Commits