Private

Public Access

T

ed 1c565da7a0 feat(gui): wrap immapp.run in try/except + add /api/gui_health endpoint

PR2 of the test_full_live_workflow_imgui_assert fix sequence.

When an ImGui scope mismatch (IM_ASSERT(Missing End())) fires in
immapp.run (e.g. after cumulative state corruption from prior sims'
panel renders), the RuntimeError propagates out of app.run(). The
controller's _io_pool gets shut down via __del__/finalization. The
hook server (separate ThreadingHTTPServer) survives. Subsequent test
clicks fail with 'cannot schedule new futures after shutdown' and
the test times out after 120s with no clear signal of what went
wrong.

This commit:
1. Wraps immapp.run in try/except RuntimeError in gui_2.py:618.
   On assertion: logs the error to stderr (NOT silent), records
   it on controller._gui_degraded_reason and _last_imgui_assert,
   and returns from run() so the hook server keeps serving.
2. Adds _gui_degraded_reason and _last_imgui_assert to
   AppController.__init__ (initialized to None).
3. Adds /api/gui_health endpoint in api_hooks.py:148. Returns
   {healthy, degraded_reason, last_assert, io_pool_alive}.
4. Adds ApiHookClient.get_gui_health() with the matching unit
   tests (3 mocked tests + 1 live test).

Per user feedback 2026-06-08:
- The wrap does NOT silently swallow the error. It logs at ERROR
  level and surfaces it via the health endpoint.
- Tests can call client.get_gui_health() to detect a degraded GUI
  and fail fast with a clear message.

TDD: tests written first, confirmed to fail, then fix applied.
34/34 unit tests pass. 1/1 live test passes (live_gui health
endpoint reports healthy=True on fresh subprocess).

2026-06-08 20:46:41 -04:00

.agents

setup agy?

2026-05-20 06:32:49 -04:00

.claude

improved startup first frame boot

2026-06-07 01:08:31 -04:00

.gemini

chore(config): synchronize gemini tools with mcp_client.py

2026-05-13 22:11:30 -04:00

.opencode

fix(opencode): Remove invalid MCP tools block, add timeout/env, grant subagent access

2026-06-06 15:44:52 -04:00

assets/fonts

fonts

2026-03-08 23:24:13 -04:00

conductor

conductor(track-update): data_oriented_error_handling - nagent_review + docs refresh

2026-06-08 20:41:00 -04:00

docs

ascii gui comms worflow ideation

2026-06-08 20:32:42 -04:00

gallery

update progress snapshot

2026-03-11 00:38:21 -04:00

mma-orchestrator

chore(gemini): Encode surgical methodology into all Gemini MMA skills

2026-03-01 10:13:29 -05:00

scripts

fix(run_tests_batched): detect batch failure from output when proc.returncode is wrong

2026-06-08 02:03:50 -04:00

simulation

Progress on context composition

2026-05-17 06:43:19 -04:00

src

feat(gui): wrap immapp.run in try/except + add /api/gui_health endpoint

2026-06-08 20:46:41 -04:00

tests

feat(gui): wrap immapp.run in try/except + add /api/gui_health endpoint

2026-06-08 20:46:41 -04:00

themes

config

2026-06-08 00:40:33 -04:00

thirdparty

improvements to defer

2026-05-13 07:47:55 -04:00

_manual_run.py

fix(gui_2): use str sentinel not bytes in _capture_workspace_profile

2026-06-05 19:24:12 -04:00

.coverage

chore: Checkpoint commit of unstaged changes, including new tests and debug scripts

2026-02-26 21:39:03 -05:00

.dockerignore

feat(docker): add Dockerfile and .dockerignore for containerized deployment

2026-06-03 08:22:17 -04:00

.editorconfig

fix(conductor): Apply review suggestions for track 'mma_utilization_refinement_20260226'

2026-02-26 08:35:50 -05:00

.gitignore

chore: gitignore tests/.test_durations.json (developer-local cache)

2026-06-08 01:14:51 -04:00

.mcp.json

fix(mcp): wire run_powershell and MCP server for Windows/Scoop environment

2026-02-28 15:00:05 -05:00

AGENTS.md

docs(agents): reference skip-marker policy from workflow.md

2026-06-07 16:59:37 -04:00

CLAUDE.md

docs(claude): replace with deprecation stub; project migrated to Gemini CLI + OpenCode

2026-06-02 20:46:43 -04:00

config.toml

chore: delete legacy run_tests_batched.py (was preserved for one cycle)

2026-06-08 01:15:12 -04:00

docker-compose.yml

feat(docker): add docker-compose.yml for Unraid deployment

2026-06-03 08:23:01 -04:00

Dockerfile

fix(docker): add tk/X11 deps for headless; improve sloppy.py web mode

2026-06-03 10:13:57 -04:00

GEMINI.md

docs(gemini): refresh to reflect current providers (5), entry point (sloppy.py), and key modules

2026-06-02 20:48:56 -04:00

gencpp_manual_slop_template.toml

refactor: remove dead main_context field from Project Settings

2026-05-10 16:23:21 -04:00

imgui.ini

fixed.

2026-03-13 13:23:31 -04:00

JOURNAL.md

adjustments + new tracks + tasks.md reduction of usage

2026-03-08 03:31:15 -04:00

manual_slop_history.toml

config

2026-06-07 02:15:36 -04:00

manual_slop_test.ini

checkpoint: fixing ux with window frame bar

2026-03-13 13:13:35 -04:00

manual_slop.toml

config

2026-06-08 00:40:33 -04:00

manualslop_layout.ini

chore: delete legacy run_tests_batched.py (was preserved for one cycle)

2026-06-08 01:15:12 -04:00

mcp_env.toml

fix(mcp): wire run_powershell and MCP server for Windows/Scoop environment

2026-02-28 15:00:05 -05:00

mcp_paths.toml

feat(mcp): add mcp_paths.toml for multi-project access

2026-05-10 16:03:17 -04:00

opencode.json

fix(opencode): Remove invalid MCP tools block, add timeout/env, grant subagent access

2026-06-06 15:44:52 -04:00

personas.toml

fix(gui): limit Files & Media collapsible child height to reasonable max

2026-05-10 16:30:54 -04:00

presets.toml

Getting there..

2026-05-10 21:04:24 -04:00

project_history.toml

fix(io_pool): increase worker count from 4 to 8 to prevent test hangs

2026-06-08 17:49:34 -04:00

project.toml

chore: delete legacy run_tests_batched.py (was preserved for one cycle)

2026-06-08 01:15:12 -04:00

pyproject.toml

chore(pyproject): register pytest.mark.live marker

2026-06-08 13:59:18 -04:00

Readme.md

grammar

2026-06-07 18:17:26 -04:00

run_all_tests.ps1

fk it

2026-03-06 00:11:35 -05:00

sloppy.py

typo

2026-06-07 02:03:31 -04:00

TASKS.md

adjustments + new tracks + tasks.md reduction of usage

2026-03-08 03:31:15 -04:00

tool_presets.toml

checkpoint: I have to fix try/finally spam by this ai

2026-03-11 21:43:19 -04:00

Readme.md

Manual Slop

Note by the Human behind this

I see the potential of AI as both an invaluable learning, percise techinical writing and code generation tool when handled with care and deep curation. This repo is both a proof of concept of this assertion and a tool to achieve this because every single paid or vested "AI Agenic developer" seems to not be interested in these principles.

Why did you do this in Python

TLDR: I apologize it was out of sheer practicality with time allocation and resources available. I really don't like python.

Before I winged this project on a whim and frustration, I had tried AI with various langauges, unfortuantely python did remarkably well.

Attic-Greek-TTS - ~3 kloc TTS tool for a dead language, with spectrograph anaylsis for verification.
forth_bootslop - Used scripts to gather and curate large amounts information and data from sources into formats it could digest.

Prior to making this tool I had very dissapointing performance with more favaorable langauges: C11, Odin, or Jai (Which I don't have direct access to).

I don't enjoy web browser sandboxed runtimes so I didn't use javascript. I haven't attempted AI with lua much but that was the alternative, and I knew python had the next best support for AI toolchain bindings along with an imgui package. So based purely on these factors alone I resolved to attempt this in Python.

Summary

A high-density GUI orchestrator for local LLM-driven coding sessions. Manual Slop bridges high-latency AI reasoning with a low-latency ImGui render loop via a thread-safe asynchronous pipeline, ensuring every AI-generated payload passes through a human-auditable gate before execution.

Design Philosophy: Full manual control over vendor API metrics, agent capabilities, and context memory usage. High information density, tactile interactions, and explicit confirmation for destructive actions.

Tech Stack: Python 3.11+, ImGui Bundle (Dear ImGui + imgui-node-editor + imgui_markdown + ImGuiColorTextEdit), FastAPI, Uvicorn, tree-sitter (Python, C, C++), chromadb (RAG), pywin32 (Windows window frame), psutil (telemetry), pydantic, dolt (Beads) Providers: Gemini API, Anthropic API, DeepSeek, Gemini CLI (headless), MiniMax Platform: Windows (PowerShell) — single developer, local use

Key Features

Multi-Provider Integration

Gemini SDK: Server-side context caching with TTL management, automatic cache rebuilding at 90% TTL
Anthropic: Ephemeral prompt caching with 4-breakpoint system, automatic history truncation at 180K tokens
DeepSeek: Dedicated SDK for code-optimized reasoning
Gemini CLI: Headless adapter with full functional parity, synchronous HITL bridge
MiniMax: Alternative provider support

4-Tier MMA Orchestration

Hierarchical task decomposition with specialized models and strict token firewalling:

Tier 1 (Orchestrator): Product alignment, epic → tracks
Tier 2 (Tech Lead): Track → tickets (DAG), persistent context
Tier 3 (Worker): Stateless TDD implementation, context amnesia
Tier 4 (QA): Stateless error analysis, no fixes

Strict Human-in-the-Loop (HITL)

Execution Clutch: All destructive actions suspend on threading.Condition pending GUI approval
Three Dialog Types: ConfirmDialog (scripts), MMAApprovalDialog (steps), MMASpawnApprovalDialog (workers)
Editable Payloads: Review, modify, or reject any AI-generated content before execution

45 MCP Tools with Sandboxing

Three-layer security model: Allowlist Construction → Path Validation → Resolution Gate

File I/O: read, list, search, slice, edit, tree
AST-Based (Python): skeleton, outline, definition, signature, class summary, docstring, var declaration, hierarchy, imports, syntax check, find usages
AST-Based (C/C++): tree-sitter powered skeleton, outline, definition, signature, and surgical update tools for C and C++
File Editing: surgical string match (edit_file) preserving indentation and line endings
Analysis: summary, git diff, find usages, imports, syntax check, hierarchy, derive code path
Network: web search, URL fetch (dependency-free, stdlib only)
Runtime: UI performance metrics
Beads: bd_create, bd_list, bd_ready, bd_update for Dolt-backed issue tracking

See docs/guide_tools.md for the full inventory.

Parallel Tool Execution

Multiple independent tool calls within a single AI turn execute concurrently via asyncio.gather, significantly reducing latency.

AST-Based Context Management

Skeleton View: Signatures + docstrings, bodies replaced with ...
Curated View: Preserves @core_logic decorated functions and [HOT] comment blocks
Targeted View: Extracts only specified symbols and their dependencies
Heuristic Summaries: Token-efficient structural descriptions without AI calls

Architecture at a Glance

Four thread domains operate concurrently: the ImGui main loop, an asyncio worker for AI calls, a HookServer (HTTP on :8999) for external automation, and transient threads for model fetching. Background threads never write GUI state directly — they serialize task dicts into lock-guarded lists that the main thread drains once per frame (details).

The Execution Clutch suspends the AI execution thread on a threading.Condition when a destructive action (PowerShell script, sub-agent spawn) is requested. The GUI renders a modal where the user can read, edit, or reject the payload. On approval, the condition is signaled and execution resumes (details).

The MMA (Multi-Model Agent) system decomposes epics into tracks, tracks into DAG-ordered tickets, and executes each ticket with a stateless Tier 3 worker that starts from ai_client.reset_session() — no conversational bleed between tickets (details).

Test Coverage

The project has 273 test files with 98.9% pass rate (272/273 in the latest batched run; the 1 failure is a pre-existing flake in test_rag_phase4_stress that passes in isolation). Most failures are caught and fixed via the 4-tier MMA test-harden track system. See docs/guide_testing.md for the full testing contract.

Documentation

Guide	Scope
Readme	Documentation index, GUI panel reference, configuration files, environment variables
Architecture	Threading model, event system, AI client multi-provider architecture (Gemini, Anthropic, DeepSeek, Gemini CLI, MiniMax), HITL mechanism, comms logging, RAG integration, Tier 4 patch flow
Tools & IPC	MCP Bridge 3-layer security, 45-tool inventory, Hook API endpoints, ApiHookClient reference, shell runner, Beads tools
MMA Orchestration	4-tier hierarchy, Ticket/Track/WorkerContext data structures, DAG engine, ConductorEngine, worker lifecycle, persona application, abort propagation
Simulations	`live_gui` fixture, Puppeteer pattern, mock provider, visual verification, test areas by subsystem, headless service
Context Curation	AST masking, fuzzy anchor slices, structural file editor, view presets, history snapshotting
Shaders & Window	Hybrid shader injection, custom window frame, NERV theme effects
Themes	TOML-based theming, `[colors]` table, 4-syntax-palette upstream limit, `load_themes_from_disk` / `apply_syntax_palette` API, color-callable convention
Meta-Boundary	Application vs Meta-Tooling domains, inter-domain bridges, cross-tool abstractions

Subsystem Index

Subsystem	Guide	Primary Module(s)
Multi-provider LLM client	Architecture	`src/ai_client.py`
4-Tier MMA orchestration	MMA	`src/multi_agent_conductor.py`, `src/dag_engine.py`
DAG engine & ticket lifecycle	MMA	`src/dag_engine.py`
MCP tools & Hook API	Tools & IPC	`src/mcp_client.py`, `src/api_hooks.py`
Execution Clutch (HITL)	Architecture	`src/app_controller.py`
Context composition & aggregation	Context Curation	`src/aggregate.py`, `src/file_cache.py`
AST inspection & slicing	Context Curation	`src/file_cache.py`, `src/fuzzy_anchor.py`
Personas (unified profiles)	See guide_mma.md; dedicated guide pending	`src/personas.py`
Tool bias engine	See guide_tools.md; dedicated guide pending	`src/tool_bias.py`
RAG (Retrieval-Augmented Generation)	See guide_architecture.md; dedicated guide pending	`src/rag_engine.py`
Beads mode (Dolt issue tracking)	See guide_tools.md; dedicated guide pending	`src/beads_client.py`
Hot reload (state-preserving)	Dedicated guide pending	`src/hot_reloader.py`
Discussion metrics & compression	Architecture	`src/ai_client.py`
Test infrastructure & simulations	Simulations	`tests/conftest.py`, `simulation/`
Headless service (FastAPI)	Simulations	`src/api_hooks.py`
NERV theme & visual effects	Shaders & Window	`src/theme_nerv.py`, `src/theme_nerv_fx.py`
TOML theme system (palette + syntax)	Themes	`src/theme_2.py`, `src/theme_models.py`
Custom window frame	Shaders & Window	`src/gui_2.py`
Workspace profiles (docking layouts)	Dedicated guide pending	`src/workspace_manager.py`
History (undo/redo)	Context Curation	`src/history.py`
External MCP integration	Tools & IPC	`src/mcp_client.py`
Telemetry & performance monitoring	Architecture	`src/performance_monitor.py`
Session logging	Tools & IPC	`src/session_logger.py`
MMA dashboard & node editor	MMA	`src/gui_2.py:_render_mma_dashboard`
Cross-tool abstractions (conductor)	Meta-Boundary	`conductor/`

Subsystems marked "dedicated guide pending" are slated for dedicated docs/guide_*.md files in upcoming docs work. For now, their details live inline in the guides listed under Documentation above.

Setup

Prerequisites

Python 3.11+
uv for package management

Installation

git clone <repo>
cd manual_slop
uv sync

Credentials

Configure in credentials.toml:

[gemini]
api_key = "YOUR_KEY"

[anthropic]
api_key = "YOUR_KEY"

[deepseek]
api_key = "YOUR_KEY"

[minimax]
api_key = "YOUR_KEY"

Each provider's key is loaded by the corresponding _ensure_<provider>_client() in src/ai_client.py. The credentials.toml is blacklisted by the MCP allowlist — AI tools cannot read it under any circumstance.

Running

uv run sloppy.py                        # Normal mode
uv run sloppy.py --enable-test-hooks    # With Hook API on :8999

Running Tests

uv run pytest tests/ -v

Note: See the Structural Testing Contract for rules regarding mock patching, live_gui standard usage, and artifact isolation (logs are generated in tests/logs/ and tests/artifacts/).

MMA 4-Tier Architecture

The Multi-Model Agent system uses hierarchical task decomposition with specialized models at each tier:

Tier	Role	Model	Responsibility
Tier 1	Orchestrator	`gemini-3.1-pro-preview`	Product alignment, epic → tracks, track initialization
Tier 2	Tech Lead	`gemini-3-flash-preview`	Track → tickets (DAG), architectural oversight, persistent context
Tier 3	Worker	`gemini-2.5-flash-lite` / `deepseek-v3`	Stateless TDD implementation per ticket, context amnesia
Tier 4	QA	`gemini-2.5-flash-lite` / `deepseek-v3`	Stateless error analysis, diagnostics only (no fixes)

Key Principles:

Context Amnesia: Tier 3/4 workers start with ai_client.reset_session() — no history bleed
Token Firewalling: Each tier receives only the context it needs
Model Escalation: Failed tickets automatically retry with more capable models
WorkerPool: Bounded concurrency (default: 4 workers) with semaphore gating

Module by Domain

src/ — Core implementation (53 modules)

File	Role
`src/gui_2.py`	Primary ImGui interface — App class, frame-sync, HITL dialogs, event system
`src/app_controller.py`	Headless controller; bridges GUI and async AI workers
`src/ai_client.py`	Multi-provider LLM abstraction (Gemini, Anthropic, DeepSeek, MiniMax)
`src/mcp_client.py`	45 MCP tools with 3-layer filesystem security and tool dispatch
`src/api_hooks.py`	HookServer — REST API on `127.0.0.1:8999` for external automation
`src/api_hook_client.py`	Python client for the Hook API (used by tests and external tooling)
`src/multi_agent_conductor.py`	ConductorEngine — Tier 2 orchestration loop with DAG execution
`src/dag_engine.py`	TrackDAG (dependency graph) + ExecutionEngine (tick-based state machine)
`src/models.py`	Ticket, Track, WorkerContext, Metadata, Persona, WorkspaceProfile, etc.
`src/events.py`	EventEmitter, AsyncEventQueue, UserRequestEvent
`src/project_manager.py`	TOML config persistence, discussion management, track state
`src/session_logger.py`	JSON-L + markdown audit trails (comms, tools, CLI, hooks)
`src/rag_engine.py`	RAG subsystem (ChromaDB + embedding providers)
`src/beads_client.py`	Beads/Dolt-backed issue tracking client
`src/hot_reloader.py`	State-preserving module reloader
`src/personas.py`	Unified agent profile manager
`src/presets.py`	System prompt preset manager
`src/context_presets.py`	Context composition preset manager
`src/tool_presets.py`	Tool preset manager
`src/tool_bias.py`	Tool bias engine (semantic nudging + dynamic strategy)
`src/command_palette.py`	Command palette + fuzzy matcher + registry
`src/commands.py`	32 registered commands (toggle, theme, layout, AI, project, tools)
`src/workspace_manager.py`	Workspace profile save/load with scope inheritance
`src/theme_2.py`	Theme system (palette/font/etc.)
`src/theme_nerv.py`	NERV Tactical Console theme
`src/theme_nerv_fx.py`	NERV FX (scanlines, flicker, alert)
`src/shell_runner.py`	PowerShell execution with timeout, env config, QA callback
`src/file_cache.py`	ASTParser (tree-sitter) — skeleton, curated, targeted views
`src/fuzzy_anchor.py`	Fuzzy anchor slice algorithm
`src/history.py`	Undo/redo HistoryManager with UISnapshot
`src/imgui_scopes.py`	ImGui context managers (imscope) for the UI delegation pattern
`src/performance_monitor.py`	FPS, frame time, CPU, input lag tracking
`src/log_registry.py`	Session metadata persistence
`src/log_pruner.py`	Automated log cleanup based on age and whitelist
`src/paths.py`	Centralized path resolution with environment variable overrides
`src/cost_tracker.py`	Token cost estimation for API calls
`src/gemini_cli_adapter.py`	CLI subprocess adapter with session management
`src/mma_prompts.py`	Tier-specific system prompts for MMA orchestration
`src/summarize.py`	Heuristic file summaries (imports, classes, functions)
`src/outline_tool.py`	Hierarchical code outline via stdlib `ast`
`src/summary_cache.py`	SHA256-keyed summary LRU cache
`src/markdown_helper.py`	Markdown rendering helpers
`src/patch_modal.py`	Patch approval modal
`src/diff_viewer.py`	Diff rendering
`src/external_editor.py`	External editor integration (VSCode, etc.)
`src/orchestrator_pm.py`	Orchestrator project manager
`src/conductor_tech_lead.py`	Tier 2 ticket generation from track briefs
`src/synthesis_formatter.py`	Multi-take synthesis
`src/thinking_parser.py`	AI thinking-trace extraction

Simulation modules in simulation/:

File	Role
`simulation/sim_base.py`	BaseSimulation class with setup/teardown lifecycle
`simulation/workflow_sim.py`	WorkflowSimulator — high-level GUI automation
`simulation/user_agent.py`	UserSimAgent — simulated user behavior (reading time, thinking delays)

Setup

The MCP Bridge implements a three-layer security model in mcp_client.py:

Every tool accessing the filesystem passes through _resolve_and_check(path) before any I/O.

Layer 1: Allowlist Construction (`configure`)

Called by ai_client before each send cycle:

Resets _allowed_paths and _base_dirs to empty sets
Sets _primary_base_dir from extra_base_dirs[0]
Iterates file_items, resolving paths, adding to allowlist
Blacklist check: history.toml, *_history.toml, config.toml, credentials.toml are NEVER allowed

Layer 2: Path Validation (`_is_allowed`)

Checks run in order:

Blacklist: history.toml, *_history.toml → hard deny
Explicit allowlist: Path in _allowed_paths → allow
CWD fallback: If no base dirs, allow cwd() subpaths
Base containment: Must be subpath of _base_dirs
Default deny: All other paths rejected

Layer 3: Resolution Gate (`_resolve_and_check`)

Convert raw path string to Path
If not absolute, prepend _primary_base_dir
Resolve to absolute (follows symlinks)
Call _is_allowed()
Return (resolved_path, "") on success or (None, error_message) on failure

All paths are resolved (following symlinks) before comparison, preventing symlink-based traversal attacks.

Security Model

The MCP Bridge implements a three-layer security model in mcp_client.py. Every tool accessing the filesystem passes through _resolve_and_check(path) before any I/O.

Layer 1: Allowlist Construction (`configure`)

Called by ai_client before each send cycle:

Resets _allowed_paths and _base_dirs to empty sets.
Sets _primary_base_dir from extra_base_dirs[0] (resolved) or falls back to cwd().
Iterates file_items, resolving each path to an absolute path, adding to _allowed_paths; its parent directory is added to _base_dirs.
Any entries in extra_base_dirs that are valid directories are also added to _base_dirs.

Layer 2: Path Validation (`_is_allowed`)

Checks run in this exact order:

Blacklist: history.toml, *_history.toml, config, credentials → hard deny
Explicit allowlist: Path in _allowed_paths → allow
CWD fallback: If no base dirs, any under cwd() is allowed (fail-safe for projects without explicit base dirs)
Base containment: Must be a subpath of at least one entry in _base_dirs (via relative_to())
Default deny: All other paths rejected All paths are resolved (following symlinks) before comparison, preventing symlink-based traversal attacks.

Layer 3: Resolution Gate (`_resolve_and_check`)

Every tool call passes through this:

Convert raw path string to Path.
If not absolute, prepend _primary_base_dir.
Resolve to absolute.
Call _is_allowed().
Return (resolved_path, "") on success, (None, error_message) on failure All paths are resolved (following symlinks) before comparison, preventing symlink-based traversal attacks.

Conductor SystemThe project uses a spec-driven track system in `conductor/` for structured development:

conductor/
├── workflow.md           # Task lifecycle, TDD protocol, phase verification
├── tech-stack.md         # Technology constraints and patterns
├── product.md            # Product vision and guidelines
├── product-guidelines.md # Code standards, UX principles
└── tracks/
    └── <track_name>_<YYYYMMDD>/
        ├── spec.md       # Track specification
        ├── plan.md       # Implementation plan with checkbox tasks
        ├── metadata.json # Track metadata
        └── state.toml    # Structured state with task list

Key Concepts:

Tracks: Self-contained implementation units with spec, plan, and state
TDD Protocol: Red (failing tests) → Green (pass) → Refactor
Phase Checkpoints: Verification gates with git notes for audit trails
MMA Delegation: Tracks are executed via the 4-tier agent hierarchy

See conductor/workflow.md for the full development workflow.

Project Configuration

Projects are stored as <name>.toml files. The discussion history is split into a sibling <name>_history.toml to keep the main config lean.

[project]
name = "my_project"
git_dir = "./my_repo"
system_prompt = ""

[files]
base_dir = "./my_repo"
paths = ["src/**/*.py", "README.md"]

[screenshots]
base_dir = "./my_repo"
paths = []

[output]
output_dir = "./md_gen"

[gemini_cli]
binary_path = "gemini"

[agent.tools]
run_powershell = true
read_file = true
# ... 26 tool flags

Quick Reference

Hook API Endpoints (port 8999)

Endpoint	Method	Description
`/status`	GET	Health check
`/api/project`	GET/POST	Project config
`/api/session`	GET/POST	Discussion entries
`/api/gui`	POST	GUI task queue
`/api/gui/mma_status`	GET	Full MMA state
`/api/gui/value/<tag>`	GET	Read GUI field
`/api/ask`	POST	Blocking HITL dialog

MCP Tool Categories

Category	Tools
File I/O	`read_file`, `list_directory`, `search_files`, `get_tree`, `get_file_slice`, `set_file_slice`, `edit_file`
AST (Python)	`py_get_skeleton`, `py_get_code_outline`, `py_get_definition`, `py_update_definition`, `py_get_signature`, `py_set_signature`, `py_get_class_summary`, `py_get_var_declaration`, `py_set_var_declaration`, `py_get_docstring`
Analysis	`get_file_summary`, `get_git_diff`, `py_find_usages`, `py_get_imports`, `py_check_syntax`, `py_get_hierarchy`
Network	`web_search`, `fetch_url`
Runtime	`get_ui_performance`

Languages

Python 81.6%

C++ 17%

PowerShell 0.7%

C 0.6%

Shell 0.1%

Readme.md

Manual Slop

Note by the Human behind this

Why did you do this in Python

Summary

Key Features

Multi-Provider Integration

4-Tier MMA Orchestration

Strict Human-in-the-Loop (HITL)

45 MCP Tools with Sandboxing

Parallel Tool Execution

AST-Based Context Management

Architecture at a Glance

Test Coverage

Documentation

Subsystem Index

Setup

Prerequisites

Installation

Credentials

Running

Running Tests

MMA 4-Tier Architecture

Module by Domain

src/ — Core implementation (53 modules)

Setup

Layer 1: Allowlist Construction (configure)

Layer 2: Path Validation (_is_allowed)

Layer 3: Resolution Gate (_resolve_and_check)

Security Model

Layer 1: Allowlist Construction (configure)

Layer 2: Path Validation (_is_allowed)

Layer 3: Resolution Gate (_resolve_and_check)

Conductor SystemThe project uses a spec-driven track system in conductor/ for structured development:

Project Configuration

Quick Reference

Hook API Endpoints (port 8999)

MCP Tool Categories

Layer 1: Allowlist Construction (`configure`)

Layer 2: Path Validation (`_is_allowed`)

Layer 3: Resolution Gate (`_resolve_and_check`)

Layer 1: Allowlist Construction (`configure`)

Layer 2: Path Validation (`_is_allowed`)

Layer 3: Resolution Gate (`_resolve_and_check`)

Conductor SystemThe project uses a spec-driven track system in `conductor/` for structured development: