22 KiB
Comprehensive Documentation Refresh
Date: 2026-06-02
Status: Draft (pending review)
Parent Track: documentation_refresh_comprehensive_20260602
Sub-Tracks: 3 (sequential with one parallel pair)
1. Context & Motivation
Manual Slop is a local GUI orchestrator for LLM-driven coding sessions. Since the last documentation refresh track (documentation_refresh_20260224, completed February 2026), the codebase has grown substantially:
- New subsystems without dedicated guides: Beads mode (Dolt-backed issue tracking), RAG (ChromaDB + multi-source retrieval), Hot Reload (state-preserving module reloading), Discussion Metrics & Compression, Command Palette, Structural File Editor (unified AST inspector + slice editor)
- New language support via tree-sitter: C, C++ (already in docs), plus planned Lua, GDScript, C# (in backlog)
- Two new guide files that exist but are not linked from the docs index:
docs/guide_context_curation.md,docs/guide_shaders_and_window.md - Drift in agent config files (
AGENTS.md,CLAUDE.md,GEMINI.md): the three files overlap on Session Startup, Conductor System, MMA tiers, and anti-patterns; no single source of truth - CLAUDE.md is largely vestigial — Claude Code is no longer the primary toolchain, but the file still receives content updates
- AGENTS.md contains rich guidance content that should live in
mma-orchestrator/SKILL.mdorconductor/workflow.md, not as a surface-level orientation file
The user has explicitly stated two goals for this refresh:
- Human-facing docs (
./docs/, READMEs) must reach textbook / Microsoft "purple-tomb" SDK documentation fidelity. Every detail: state machines, public APIs, threading constraints, algorithms. No hand-waving, no "left to the reader." A human should be able to use the tool AND maintain/extend the codebase from the docs alone. - Agent-facing docs (
AGENTS.md,GEMINI.md, skill files in.agents/,.gemini/) must follow an explicit divergence model with no content duplication.AGENTS.mdbecomes a thin pointer;CLAUDE.mdis deprecated;GEMINI.mdstays Gemini-CLI-specific.
This refresh is documentation-only (no new code, no new tooling, no new scripts). Drift control after the refresh is manual via the track's verification step.
2. Scope Boundaries
In Scope
- Top-level
Readme.md docs/Readme.mdand alldocs/guide_*.mdfilesconductor/product.md,conductor/product-guidelines.md,conductor/tech-stack.md,conductor/workflow.md,conductor/index.mdAGENTS.md,CLAUDE.md,GEMINI.md- Light verification pass (read each, confirm role accuracy, fix obvious drift; do NOT rewrite wholesale) on
.agents/skills/*/SKILL.md,mma-orchestrator/SKILL.md,.gemini/...mirrors - Per-file atomic git commits with git notes attached as rationale trails
Out of Scope
- Source code changes (a separate agent is making code modifications)
- New lint/check scripts (drift control is manual)
- New
docs/files beyond what's required to document new subsystems - Re-architecting the conductor system itself
- Removing CLAUDE.md (kept as a stub for compatibility)
- Visual diagrams (ASCII is fine; no image generation)
- Translating the docs to other languages
3. Track Architecture
The parent track documentation_refresh_comprehensive_20260602 is a coordinator: it registers the three sub-tracks in conductor/tracks.md, sequences them, and produces a final verification report. Each sub-track is an independent conductor track with its own spec.md, plan.md, and metadata.json.
Dependency Graph
Sub-Track 1: docs_layer_refresh
│
├──> Sub-Track 2: conductor_docs_refresh
│
└──> Sub-Track 3: agent_config_refresh
│
└──> Parent verification
Sub-Tracks 2 and 3 can run in parallel after Sub-Track 1 establishes the human-facing doc layer. The parent track does the final verification and creates the checkpoint commit.
Execution Constraints (apply to all 3 sub-tracks)
- No subagents. Each sub-track is executed by a single agent (the user or the Tier 2 Tech Lead session) sequentially. No Tier 3 worker delegation, no Tier 4 QA delegation, no parallel worker execution.
- Per-file atomic commits. One file = one commit. The git note attached to each commit captures the rationale and the source files cross-referenced during the rewrite.
- Pre-edit
git add .checkpoint before each file edit (protects againstgit restoremishaps). - Style baseline:
conductor/product-guidelines.md— VEFontCache-Odin pattern (Philosophy → Architectural Boundaries → Implementation Logic → Verification), high information density, structural parity between doc symbols and source code symbols. - Symbol parity: Every class, function, method, and event named in the docs must match the source exactly. If the source uses
AsyncEventQueue, the doc must useAsyncEventQueue, not "the event queue." - Link validation: Every
[text](./relative/path)link is manually verified viamanual-slop_search_filesto confirm the target exists. Cross-doc links must point to files that exist in this or a future sub-track. - No comments added to source code. Documentation lives in
./docs/, not inline.
4. Sub-Track 1: docs_layer_refresh_20260602
Priority: HIGHEST. This is the dominant work.
File List
| File | Current Size | Notes |
|---|---|---|
Readme.md |
15.5K | Main project README |
docs/Readme.md |
~3K | Documentation index |
docs/guide_architecture.md |
34.5K | Threading, events, AI client, HITL |
docs/guide_mma.md |
unknown | 4-tier hierarchy, DAG, worker lifecycle |
docs/guide_tools.md |
22.9K | MCP tools, Hook API, shell runner |
docs/guide_simulations.md |
unknown | live_gui fixture, Puppeteer, mock provider |
docs/guide_context_curation.md |
8K | Skeleton/curated/targeted views, fuzzy anchors |
docs/guide_shaders_and_window.md |
3.4K | Shader pipeline, custom window frame |
docs/guide_meta_boundary.md |
4.6K | Application vs Meta-Tooling domain |
docs/MMA_Support/* |
varies | Legacy MMA reference docs — review for archival or merge |
Tasks (Phases)
Phase 1.1: Inventory and Gap Analysis
- Read each existing guide, summarize what subsystems it covers
- Identify subsystems in
conductor/product.mdthat have NO corresponding guide - Identify subsystems that DO have a guide but the guide is stale (e.g., does not cover features added since Feb 2026)
- Produce a gap table: subsystem → current guide → needed changes (rewrite / add / link from index / remove)
Phase 1.2: Update the Documentation Index
- Update
docs/Readme.mdso everydocs/guide_*.mdappears in the Guides table - Currently unlinked:
docs/guide_context_curation.md,docs/guide_shaders_and_window.md - Decide fate of
docs/MMA_Support/*(legacy archive or merge intodocs/guide_mma.md) - Commit:
docs(index): link guide_context_curation and guide_shaders_and_window
Phase 1.3: Rewrite Each Guide (one file at a time, per-file atomic commits)
- For each guide, follow the per-file workflow:
- Read current source code (use
manual-slop_get_file_summaryandmanual-slop_py_get_skeleton; do NOT read full files >50 lines) - Read current
docs/MMA_Support/*andconductor/tracks/*/plan.mdfor context on subsystems that have evolved - Cross-reference
conductor/product.mdandconductor/tech-stack.mdfor the canonical feature/tech statement - Identify what changed since the guide was last meaningfully updated
- Rewrite affected sections, preserving the VEFontCache-Odin style (Philosophy → Architectural Boundaries → Implementation Logic → Verification)
- Validate every internal
[link](./relative/path)resolves - Commit per-file with
docs(<scope>): <description>message - Attach git note with: source files cross-referenced, subsystems updated, any decisions made
- Read current source code (use
Phase 1.4: Update Readme.md
- Add a "Module by Domain" reference table that links to each guide for deep dives
- Add a "Subsystem Index" section cross-referencing each major feature to its dedicated guide
- Update the screenshot/gallery references if needed
- Update setup instructions to reflect any new prerequisites (e.g., Beads CLI, Dolt)
- Commit per-section:
docs(readme): add module-by-domain reference table,docs(readme): add subsystem index,docs(readme): update setup prerequisites, etc.
Phase 1.5: Subsystems Without Guides (write new guides as needed)
For each subsystem identified in 1.1 that lacks a guide, write a new docs/guide_<name>.md using the same per-file workflow. Likely candidates based on the gap analysis:
- RAG (
docs/guide_rag.md) — vector store, chunking, multi-provider search - Beads (
docs/guide_beads.md) — Dolt integration, toolset, context compaction - Hot Reload (
docs/guide_hot_reload.md) —HotReloaderlifecycle, state preservation, UI delegation pattern - Discussion Metrics & Compression (
docs/guide_discussion_metrics.md) — per-response token tracking, history compression strategy - Command Palette (
docs/guide_command_palette.md) — async context preview, fuzzy command resolution - Structural File Editor (
docs/guide_structural_editor.md) — unified AST inspector + slice editor - NERV Theme (
docs/guide_nerv_theme.md) — black void palette, CRT-style effects, status flickering
Each new guide must follow the same VEFontCache-Odin style and link from docs/Readme.md (which means the index update in 1.2 must accommodate them).
Phase 1.6: Verification
grepfor every internal link in every rewritten guide to confirm targets existgrepfor stale feature references (e.g., "MMA 4-tier" without the recent persona, RAG, Beads integrations mentioned inconductor/product.md)- Spot-check 3 random sections per guide against the corresponding source code
- Manual user review (per workflow.md Phase Completion Verification protocol)
Acceptance Criteria
- Every
docs/guide_*.mdis referenced fromdocs/Readme.md - Every internal cross-doc link resolves
- Every subsystem in
conductor/product.mdhas a corresponding guide (either existing or newly written) - Every public class, function, and event named in any guide matches the source code symbol exactly
- Every guide follows the VEFontCache-Odin pattern (Philosophy → Architectural Boundaries → Implementation Logic → Verification) with all four sections present where applicable
- "Textbook / purple-tomb fidelity" interpreted concretely: for every public class/function/event mentioned, the doc states its signature, threading constraints, side effects, and at least one usage example. Internal algorithms are explained step-by-step, not summarized. State machines include the full transition table
- Per-file commits exist for every change, with git notes attached
5. Sub-Track 2: conductor_docs_refresh_20260602
Priority: Medium. Sync the source-of-truth conductor docs to current state.
File List
| File | Current Size | Notes |
|---|---|---|
conductor/product.md |
23.1K | Product vision, feature list, use cases |
conductor/product-guidelines.md |
6.6K | Style and process guidelines |
conductor/tech-stack.md |
unknown | Technology stack constraints |
conductor/workflow.md |
25.2K | Task lifecycle, TDD protocol |
conductor/index.md |
334 bytes | Conductor directory index |
Tasks (Phases)
Phase 2.1: Sync conductor/product.md
- Cross-reference every feature claim in
product.mdagainst the actual codebase - Add features that exist in code but are missing from
product.md(Beads, RAG, Hot Reload, Discussion Metrics/Compression, Command Palette, Structural File Editor, NERV theme, persona editor, workspace profiles, etc.) - Remove features that have been culled or replaced
- Update the "Primary Use Cases" section to reflect current usage patterns
- Commit per-section:
docs(product): sync MMA dashboard section,docs(product): add RAG feature, etc.
Phase 2.2: Sync conductor/tech-stack.md
- Verify every entry corresponds to an actual Python dep (check
pyproject.toml/requirements.txt) or an actual src/ module - Add new dependencies: chromadb (RAG), dolt (Beads), any new ones
- Add new src/ modules: rag_engine.py, beads_client.py, hot_reloader.py, history.py, workspace_manager.py, etc.
- Commit per-section.
Phase 2.3: Sync conductor/workflow.md
- Verify the TDD protocol is followed by recent tracks (spot-check 3 recent tracks)
- Verify the Phase Completion Verification protocol is followed
- Verify the checkpointing protocol is followed
- Update any sections that have drifted from actual practice
- Commit per-section.
Phase 2.4: Sync conductor/product-guidelines.md
- Verify the AI-Optimized Python Style (1-space indent, no comments, type hints) is actually followed by recent code
- Verify the "Modular Controller Pattern" and "UI Delegation Pattern" are followed in
src/app_controller.pyandsrc/gui_2.py - Verify the "Mandatory ImGui Verification" linter (
scripts/check_imgui_scopes.py) is still the source of truth - Update any sections that have drifted
- Commit per-section.
Phase 2.5: Update conductor/index.md
- If any new top-level conductor docs were added (e.g.,
conductor/code_styleguides/is referenced but not verified), link them - Commit.
Phase 2.6: Verification
grepfor references to features that no longer existgrepfor missing dependencies (every dep inpyproject.tomlshould appear intech-stack.md)- Spot-check 3 random claims per file
- Manual user review
Acceptance Criteria
- Every feature in
conductor/product.mdcorresponds to a real subsystem insrc/ortests/ - Every entry in
conductor/tech-stack.mdis an actual dep or module conductor/workflow.mdmatches the actual workflow used by recent completed tracksconductor/product-guidelines.mdmatches the actual coding style used insrc/conductor/index.mdlinks to all relevant conductor files- Per-file commits with git notes
6. Sub-Track 3: agent_config_refresh_20260602
Priority: Lower. Apply the explicit divergence model.
File List
| File | Current Size | Notes |
|---|---|---|
AGENTS.md |
5.4K | Universal agent orientation — target: thin pointer |
CLAUDE.md |
6.7K | DEPRECATE to stub |
GEMINI.md |
3.8K | Keep Gemini-CLI-specific content |
.agents/skills/mma-tier1-orchestrator/SKILL.md |
2.4K | Light verification only |
.agents/skills/mma-tier2-tech-lead/SKILL.md |
3.8K | Light verification only |
.agents/skills/mma-tier3-worker/SKILL.md |
unknown | Light verification only |
.agents/skills/mma-tier4-qa/SKILL.md |
813 bytes | Light verification only |
.agents/skills/mma-orchestrator/SKILL.md |
9.7K | Light verification only |
mma-orchestrator/SKILL.md (root copy) |
9.3K | Light verification only |
.gemini/skills/*/SKILL.md |
varies | Light verification only (mirrors) |
.gemini/agents/*.md |
varies | Light verification only (mirrors) |
.claude/commands/*.md |
9 files | Light verification only (legacy) |
Tasks (Phases)
Phase 3.1: Rewrite AGENTS.md to a thin pointer
Target structure (estimated ~1K):
# AGENTS.md
## What This Is
Manual Slop is a local GUI orchestrator for LLM-driven coding sessions. It bridges high-latency AI reasoning with a low-latency ImGui render loop via a thread-safe async pipeline; every AI-generated payload passes through a human-auditable gate before execution.
## Guidance for AI Agents
Detailed agent guidance lives in the following locations — read these directly, do not duplicate content here:
- **Operational workflow:** `conductor/workflow.md`
- **Code style and process:** `conductor/product-guidelines.md`
- **Tech stack and constraints:** `conductor/tech-stack.md`
- **Product context:** `conductor/product.md`
- **MMA orchestrator role:** `mma-orchestrator/SKILL.md`
- **Tier 1 (Orchestrator):** `.agents/skills/mma-tier1-orchestrator/SKILL.md`
- **Tier 2 (Tech Lead):** `.agents/skills/mma-tier2-tech-lead/SKILL.md`
- **Tier 3 (Worker):** `.agents/skills/mma-tier3-worker/SKILL.md`
- **Tier 4 (QA):** `.agents/skills/mma-tier4-qa/SKILL.md`
## Human-Facing Documentation
For understanding, using, and maintaining the tool, see `docs/Readme.md` and the guides it indexes.
## Critical Anti-Patterns
- Do not read full files >50 lines without first using `py_get_skeleton` or `get_file_summary`
- Do not modify the tech stack without updating `conductor/tech-stack.md` first
- Do not implement code directly as Tier 2 — delegate to Tier 3 workers
- Do not skip TDD — write failing tests before implementation
- Do not batch commits — commit per-task for atomic rollback
Phase 3.2: Replace CLAUDE.md with a 1-line stub
Target content:
# CLAUDE.md
This project is no longer actively used with Claude Code. For project context, see `AGENTS.md`. The conductor system in `./conductor/` is the cross-tool abstraction and works with any agent toolchain.
(Keep the file as a stub to avoid breaking any external tooling that may still reference it.)
Phase 3.3: Lightly refresh GEMINI.md
- Verify it covers only Gemini-CLI-specific content (no shared content that should live in
AGENTS.md) - Update any stale references to features, models, or paths
- If any content was previously shared with
AGENTS.mdand has been relocated, updateGEMINI.mdto point toAGENTS.mdinstead - Commit:
docs(gemini): refresh Gemini CLI orientation
Phase 3.4: Verify .agents/skills/*/SKILL.md and .gemini/... mirrors for coherence
For each skill file:
- Read it; confirm it covers the role accurately
- Compare with the canonical role description in
mma-orchestrator/SKILL.md - If drift is found, fix it (small surgical edits)
- If the file is outdated, flag for a separate track — do not rewrite wholesale in this track
- Commit per file:
docs(skills): fix drift in tier1 orchestrator skill
Phase 3.5: Verify .claude/commands/*.md (legacy)
- These exist for backward compatibility
- Lightly verify they still work and don't reference removed features
- Do not rewrite — they are legacy
- If a file is broken, flag for separate track
Phase 3.6: Verification
grepfor the section headers that should have been relocated (Session Startup, Conductor System, MMA 4-Tier, Anti-Patterns) acrossAGENTS.md,CLAUDE.md,GEMINI.md- Confirm each appears in at most one file (or in multiple files only as short pointers)
- Spot-check 2 random skills files against their canonical description
- Manual user review
Acceptance Criteria
AGENTS.mdis ≤ 1.5K and contains only project orientation + pointersCLAUDE.mdis ≤ 200 bytes and contains only the deprecation stubGEMINI.mdcontains only Gemini-CLI-specific content- No section header appears in 2+ of these files unless it's a short pointer (≤3 lines)
.agents/skills/*/SKILL.mdand.gemini/...mirrors are verified for coherence.claude/commands/*.mdlegacy files are verified to not reference removed features- Per-file commits with git notes
7. Parent Track: Final Verification
After all 3 sub-tracks complete, the parent track produces a final verification report:
- Cross-track link audit: For every markdown file modified in any sub-track, run
grepfor[text](./relative/path)patterns and verify each target exists - Symbol parity audit: Spot-check 5 random symbols named in any doc against the source — confirm exact match
- Drift audit:
grepfor the relocated section headers (Session Startup,Conductor System,MMA 4-Tier,Anti-Patterns, etc.) acrossAGENTS.md,CLAUDE.md,GEMINI.md— confirm explicit divergence holds - Subsystem coverage audit: For every feature in
conductor/product.md, confirm a corresponding guide exists indocs/ - User review gate: Present the final verification report and request user sign-off per workflow.md Phase Completion Verification protocol
The parent track then creates a single checkpoint commit: conductor(checkpoint): Comprehensive documentation refresh complete and attaches the verification report as a git note.
8. Risks & Mitigations
| Risk | Mitigation |
|---|---|
| A source code change by the other agent makes a doc claim stale mid-refresh | Per-file atomic commits + pre-edit git add . checkpoint; if drift is found during a later file's rewrite, fix the earlier file in a follow-up commit |
| Scope creep — adding new guides balloons the track | The gap analysis in 1.1 is the gating step; new guides are written only for subsystems explicitly listed in conductor/product.md |
| The 3 sub-tracks drift in style consistency | Style baseline is conductor/product-guidelines.md (single source); reviewed by user per sub-track |
| Manual link validation misses broken links | Per-file commit immediately after each rewrite; a broken link is caught in the next file's cross-reference work |
| User reviews take long enough that the codebase evolves underneath | Sub-tracks are independent; each can resume from its last commit if interrupted |
The legacy CLAUDE.md deprecation breaks external tooling |
The file is kept as a 1-line stub, not removed; any external tooling that reads it will see a graceful pointer to AGENTS.md |
9. Out of Scope (Reminder)
- Source code changes
- New lint scripts (drift control is manual)
- Image/diagram generation
- Translations
- Removing CLAUDE.md
- Visual layout of the docs site (Manual Slop is a GUI app, not a docs site)
10. Success Criteria
The track is complete when:
- All 3 sub-tracks have completed all their phases with per-file atomic commits and git notes
- The parent track's verification report shows zero broken cross-doc links, zero symbol mismatches in spot-checks, and confirmed explicit divergence in agent config files
- Every subsystem in
conductor/product.mdhas a corresponding guide Readme.md,docs/Readme.md, and everydocs/guide_*.mdis at textbook / purple-tomb fidelity (verified by user review)AGENTS.mdis a thin pointer document;CLAUDE.mdis a deprecation stub- The user has signed off on the final verification report per workflow.md