Per user request 'use your remaining context to update agent workflow
docs and then regular docs based on what was discussed in this report',
this commit creates/updates 15 files derived from the v2.3 nagent
review (the 12 new nagent additions + the 4 memory dimensions
reframing + the cache strategy + the RAG discipline + the knowledge
harvest pattern).
Agent workflow docs (4 files):
- AGENTS.md (UPDATE): add @import line to canonical DOD + 'Code
Styleguides' section pointing to the 6 new styleguides + new
'Human-Facing Documentation' section pointing to ./docs/AGENTS.md
- conductor/workflow.md (UPDATE): new section 'Additions (2026-06-12)
- the 12 patterns from the latest nagent corpus' with TDD
protocols for knowledge harvest, cache ordering, compaction, RAG
discipline
- conductor/product-guidelines.md (UPDATE): new sections 'Memory
Dimensions (added 2026-06-12)' + 'See Also - Updated' with the
6-styleguide catalog
- docs/AGENTS.md (NEW): the agent-facing mirror of docs/Readme.md
(per the nagent CLAUDE.md pattern). 10 sections + the per-tier
reading path + the 4 memory dimensions + the caching strategy +
the knowledge harvest + the RAG discipline + the feature flags
Regular docs (11 files):
- 6 new styleguides (the convention catalog):
* data_oriented_design.md: the canonical DOD reference (Tier
0/1/2; 3 defaults to reject; 8 core defaults; 7-question
simplification pass; 10-question self-check; 4 memory
dimensions in Manual Slop context)
* agent_memory_dimensions.md: the 4 memory dims (curation /
discussion / RAG / knowledge) + when to use each + the
boundaries
* rag_integration_discipline.md: the conservative-RAG rule
(opt-in, complement, provenance, no mutation, feature-gated,
graceful failure)
* cache_friendly_context.md: stable-to-volatile context
ordering + the cache TTL GUI contract + the byte-comparison
test
* knowledge_artifacts.md: the knowledge harvest pattern
(category files, provenance, sha256 ledger, digest
regeneration, 'delete to turn off')
* feature_flags.md: file presence vs config flags vs CLI flags
- 3 new project docs (the cross-cutting guides):
* guide_agent_memory_dimensions.md: the cross-cutting guide on
the 4 dims + the decision tree
* guide_caching_strategy.md: caching across providers +
stable-to-volatile ordering + cache TTL GUI + the byte-
comparison test + the 5th provider (claude-code)
* guide_knowledge_curation.md: the knowledge memory guide (4th
dim) + the 5 category files + per-file notes + the digest +
the ledger + the harvest workflow
- 2 existing doc updates:
* guide_mma.md: new sections 'Delegation as context management'
+ 'The 4 memory dimensions (the MMA scope)'
* guide_ai_client.md: new section 'Cache strategy and the 12-
layer model' + the 5th provider (claude-code)
All files use the same style as the v2.3 review (the user's preferred
format): 7-column tables, no JSON, SSDL shape tags, forth/array
notation, file:line citations, ASCII sketches where useful. The
human Readme files (Readme.md, docs/Readme.md) are NOT modified
(per repeated user instruction).
The 5th provider (claude-code) is documented in guide_ai_client.md
+ the data_oriented_design.md references the nagent pattern as the
source of the canonical rules.
The cross-references are bidirectional: the 6 styleguides reference
the 3 project docs; the 3 project docs reference the 6 styleguides;
the 2 doc updates reference both; AGENTS.md + ./docs/AGENTS.md
provide the entry points.
15 KiB
./docs/AGENTS.md (the agent-facing mirror)
Status: Agent-facing mirror of docs/Readme.md (the human-facing docs index, which is preserved as-is). For agents (any tier), this is the recommended first read for understanding the project's docs structure.
Date: 2026-06-12
Cross-refs: docs/Readme.md (human-facing); AGENTS.md (project root); the 6 styleguides in conductor/code_styleguides/.
What this is.
docs/Readme.mdis the human-facing docs index. This file is the agent-facing equivalent: it organizes the 14 deep-dive guides underdocs/by MMA tier, and it cross-references the canonical styleguides. The 2 files cover the same docs but with different audiences and different reading paths.The reading path. If you're an agent scoping a feature, read this file first; then read the 1-2
guide_*.mdfiles for the layers your feature touches; then read the 1-2 styleguides for the patterns the feature uses. The expected reading time for a typical feature: 10-15 minutes.
0. The 4 memory dimensions (the cross-cutting lens)
The conversation data has 4 distinct memory dimensions. Most features touch 1-2; some touch 3. Use this lens to identify which dimension(s) your feature needs.
| # | Dim | Where it lives | Use when | Styleguide |
|---|---|---|---|---|
| 1 | Curation | FileItem + ContextPreset + Fuzzy Anchors |
"How to render a file" | (the curation is per docs/guide_context_curation.md) |
| 2 | Discussion | app.disc_entries + branching + UISnapshot |
"What was said in this chat" | (the discussion is per docs/guide_architecture.md §"Threading model") |
| 3 | RAG | src/rag_engine.py (ChromaDB) |
"What similar content exists" (opt-in) | conductor/code_styleguides/rag_integration_discipline.md |
| 4 | Knowledge | ~/.manual_slop/knowledge/*.md + per-file + digest |
"What we learned from past sessions" | conductor/code_styleguides/knowledge_artifacts.md |
See docs/guide_agent_memory_dimensions.md for the full cross-cutting guide.
1. The 14 deep-dive guides (organized by MMA tier)
| Tier | Guide | What it covers | When to read |
|---|---|---|---|
| T1 | docs/guide_architecture.md |
Threading model; cross-thread state sync | When scoping any cross-cutting feature |
| T1 | docs/guide_meta_boundary.md |
The Application vs Meta-Tooling split | When scoping a Meta-Tooling-side feature |
| T2 | docs/guide_app_controller.md |
The headless controller; AppState dataclass |
When implementing controller-side logic |
| T2 | docs/guide_ai_client.md |
The multi-provider LLM client | When implementing LLM-side logic |
| T2 | docs/guide_mma.md |
The 4-tier MMA orchestration | When implementing MMA-side logic |
| T2 | docs/guide_tools.md |
The MCP tool inventory + Hook API | When implementing MCP tools or Hook endpoints |
| T2 | docs/guide_mcp_client.md |
The 45 tools + 3-layer security | When implementing new MCP tools or sub-MCPs |
| T3 | docs/guide_context_curation.md |
Granular AST Control + Fuzzy Anchors + Structural File Editor | When implementing curation-side features |
| T3 | docs/guide_personas.md |
The unified agent profile model | When implementing persona-side features |
| T3 | docs/guide_rag.md |
The RAG subsystem | When implementing RAG-side features (rare; opt-in) |
| T3 | docs/guide_gui_2.md |
The ImGui application | When implementing GUI-side features |
| All | docs/guide_testing.md |
The test suite architecture (251 test files; 7 conftest fixtures) | When writing any test |
| All | docs/guide_command_palette.md |
The 33 commands + "Everything" mode | When implementing command-palette features |
| NEW | docs/guide_knowledge_curation.md |
The knowledge memory guide (4th dim) | When implementing knowledge-side features |
| NEW | docs/guide_caching_strategy.md |
Caching across providers; stable-to-volatile ordering; cache TTL GUI | When implementing cache-side features |
| NEW | docs/guide_agent_memory_dimensions.md |
Cross-cutting: the 4 memory dimensions | When scoping any feature that touches memory |
2. The 6 canonical styleguides (the convention catalog)
| Styleguide | What it codifies | When to read |
|---|---|---|
conductor/code_styleguides/data_oriented_design.md |
The canonical DOD reference (Tier 0/1/2; 3 defaults to reject; 7-question simplification pass; 10-question self-check) | Before any non-trivial work |
conductor/code_styleguides/agent_memory_dimensions.md |
The 4 memory dimensions and when to use each | When the feature touches memory |
conductor/code_styleguides/rag_integration_discipline.md |
The conservative-RAG rule (opt-in; complements; provenance; no mutation; feature-gated; graceful failure) | When the feature uses RAG |
conductor/code_styleguides/cache_friendly_context.md |
Stable-to-volatile context ordering; the cache TTL GUI contract; the byte-comparison test | When the feature builds context or caches |
conductor/code_styleguides/knowledge_artifacts.md |
The knowledge harvest pattern (category files, provenance, sha256 ledger, digest regeneration) | When the feature uses the knowledge dim |
conductor/code_styleguides/feature_flags.md |
File presence ("delete to turn off") vs config flags vs CLI flags; when to use each | When adding a new feature toggle |
3. The per-tier reading path
Tier 1 (Orchestrator) — what to read
For scoping a feature, understanding the architecture, and planning:
| Read | Why |
|---|---|
docs/guide_architecture.md |
The threading model; the cross-thread data flow |
docs/guide_meta_boundary.md |
The Application vs Meta-Tooling split (load-bearing) |
docs/guide_agent_memory_dimensions.md |
The 4 memory dimensions (which dim does my feature touch?) |
conductor/code_styleguides/data_oriented_design.md |
The 3 defaults to reject; the simplification pass; the final self-check |
AGENTS.md (project root) |
The project-root agent-facing rules |
This file (.docs/AGENTS.md) |
The docs structure |
Tier 1 does NOT typically read: guide_*.md for the specific subsystems (T2 reads those).
Tier 2 (Tech Lead) — what to read
For track design, ticket generation, and architecture:
| Read | Why |
|---|---|
| All of Tier 1's reads | (foundational) |
docs/guide_app_controller.md |
The headless controller; the _predefined_callbacks and _gettable_fields registries |
docs/guide_ai_client.md |
The LLM client; the providers; the cache strategy |
docs/guide_mma.md |
The 4-tier MMA; the DAG engine; the worker pool |
docs/guide_tools.md |
The MCP tool inventory; the Hook API; the 3-layer security |
conductor/code_styleguides/agent_memory_dimensions.md |
(for memory-touching tracks) |
conductor/code_styleguides/cache_friendly_context.md |
(for context-building tracks) |
Tier 2 does NOT typically read: guide_context_curation.md, guide_personas.md, guide_rag.md, guide_gui_2.md (T3 reads those).
Tier 3 (Worker) — what to read
For surgical implementation:
| Read | Why |
|---|---|
| All of Tier 2's reads (selectively) | (the system context) |
The 1-2 guide_*.md files for the specific layers the ticket touches |
(the implementation surface) |
The 1-2 code_styleguides/...md files for the patterns the ticket uses |
(the convention) |
The ticket itself (conductor/tracks/<id>/plan.md) |
(the specific task) |
Tier 3 reads in depth, not in breadth. A typical T3 worker reads 2-4 docs total.
Tier 4 (QA) — what to read
For error analysis and bug reproduction:
| Read | Why |
|---|---|
| All of Tier 2's reads (selectively) | (the system context) |
The 1-2 guide_*.md files for the failing layer |
(the reproduction surface) |
| The test file (if any) | (the verification surface) |
The audit scripts (scripts/audit_*.py) |
(the static analysis surface) |
Tier 4 reads narrowly. The bug is in 1-2 files; the read is in 1-2 docs.
4. The 4 memory dimensions (the cross-cutting lens, in detail)
Most features touch 1-2 dimensions. Use this decision tree:
Q: What is the *data* the feature needs?
│
├── "How to render a file" ──► Curation (FileItem)
├── "What was said in this chat" ──► Discussion (disc_entries)
├── "What similar content exists" ──► RAG (RAGEngine.search) [opt-in]
└── "What we learned from past runs" ──► Knowledge (knowledge/digest.md)
Pick the matching dimension. If the feature needs 2+, use 2+ — but be explicit about which is primary and which is secondary.
The wrong shape for the right question is a common mistake:
- "Where does X happen?" → RAG (semantic search)
- "How do I configure how file Y is rendered?" → Curation (FileItem)
- "What was the user asking about 3 turns ago?" → Discussion (disc_entries)
- "What did we decide last time about Z?" → Knowledge (digest)
See docs/guide_agent_memory_dimensions.md for the full cross-cutting guide.
5. The caching strategy (the cross-cutting concern)
If the feature builds the initial context (in aggregate.py:run) or calls the LLM (in ai_client.py:send), the cache strategy matters.
The 12-layer model:
| # | Layer | Stable across turns? | Where the cache hits |
|---|---|---|---|
| 1-7 | Role instructions, function-calling schema, tool descriptions, system prompt, persona, project context, knowledge digest | YES (cacheable) | Anthropic cache_control, Gemini cachedContent, OpenAI implicit |
| 8-12 | Discussion metadata, active preset, per-file details, prior tool results, user message | NO (per turn) | NOT cached |
The byte-comparison test (the design contract for the stable prefix):
def test_aggregate_stable_to_volatile_ordering():
"""The first N characters of the context should be identical across turns
of the same conversation, when no stable-layer inputs change."""
...
The provider-specific TTLs:
| Provider | Default TTL | Configurable? |
|---|---|---|
| Anthropic ephemeral | 5 min | yes (per-provider control surface) |
| Gemini explicit | 1 h | yes (per-discussion override) |
| OpenAI implicit | 5-10 min (provider-managed) | no |
The GUI exposure is a "Caching" Operations Hub sub-panel (per the v2.3 §5.3 sketch). See docs/guide_caching_strategy.md for the full guide and conductor/code_styleguides/cache_friendly_context.md for the styleguide.
6. The knowledge harvest (the durable layer)
The 4th memory dimension (knowledge) is opt-in but encouraged — it's the durable, user-editable, provenance-aware store of facts / decisions / questions / playbooks / per-file notes.
The directory layout (per the user's ~/.manual_slop/knowledge/):
knowledge/
├── facts.md # - {statement} {provenance}
├── decisions.md # - {statement, reason} {provenance}
├── questions.md # - {question} {provenance}
├── playbooks.md # - **{name}**: {steps} {provenance}
├── tasks.md # ## Open / ## Done
├── files/{file_id}.md # per-file notes (keyed by inode)
├── digest.md # bounded 4KB; the projection; "delete to turn off"
├── ledger.json # sha256-of-content audit log
└── prompts/harvest-conversation.md # user-editable
The harvest CLI: python -m src.knowledge_harvest [--apply] [--no-harvest] [--max-harvest-bytes N]. Default: dry-run.
The LLM output is strict JSON (no prose, no markdown fence) with 7 categories. The retry budget is 2 attempts.
The "delete to turn off" pattern: rm ~/.manual_slop/knowledge/digest.md → no {knowledge} block injected. Re-enable by running the harvest.
See docs/guide_knowledge_curation.md for the full guide and conductor/code_styleguides/knowledge_artifacts.md for the styleguide.
7. The RAG discipline (the opt-in fuzzy dimension)
RAG is the fuzzy semantic search dimension. It's opt-in (default-off in new projects). The 6 rules:
- Opt-in. Default-off in new projects
- Complements; never replaces. RAG is one of 4 dimensions, not a substitute
- Provenance required. Every result shows file + chunk
- No mutation. RAG results never write to
disc_entries,FileItem, or disk - Feature-gated. A feature must explicitly request RAG in its scope
- Graceful failure. Failed search returns empty; the request continues
See docs/guide_rag.md for the full RAG guide and conductor/code_styleguides/rag_integration_discipline.md for the styleguide.
8. The feature flag patterns (when to use what)
When adding a new feature with an "on/off" toggle, choose the right pattern:
| Pattern | When to use | Example |
|---|---|---|
| File presence ("delete to turn off") | The feature produces a side artifact; the user might want to clean up by rm-ing it |
~/.manual_slop/knowledge/digest.md |
| Config flag | The feature is always on; the flag is a persistent preference | [ai_settings.toml] rag.enabled |
| CLI flag | The feature is invoked from the CLI; the flag is a one-shot override | python -m src.knowledge_harvest --apply |
| Track metadata flag | The track's implementation uses a feature; this is static documentation | metadata.json: {"uses_rag": true} |
See conductor/code_styleguides/feature_flags.md for the full guide.
9. The cross-cutting principles (the data-oriented foundation)
All 14 docs and 6 styleguides share the same foundation (per data_oriented_design.md):
- The data is the thing. The conversation, the file items, the knowledge digest — these are the source of truth
- Behavior is transformation over data. Not object graphs; not hidden state; not opaque handles
- Avoid hidden mutable state. Errors are data, not exceptions. State is on disk, not in memory
- Separate durable artifacts from temporary execution. Workers are disposable; artifacts are durable
- Optimize the shape, availability, and maintenance of the data. Editable, provenance-aware, user-editable
When in doubt, read conductor/code_styleguides/data_oriented_design.md first.
10. The reading path (the 1-page summary)
For an agent scoping a feature:
- Read this file (10 min)
- Read the 1-2
guide_*.mdfor the layers your feature touches (5-10 min each) - Read the 1-2
code_styleguides/...mdfor the patterns your feature uses (5-10 min each) - Read the ticket (
conductor/tracks/<id>/plan.md) for the specific task (variable)
Total: 20-45 min for a typical feature. The investment pays back across the feature's lifetime.
If a guide is missing or stale, that's a bug; file a docs issue (or update the guide inline, per the project's "edit the source of truth, not this file" pattern).
End of agent-facing mirror.