Go to file

Ed_ 52a463d13f conductor: Encode surgical spec methodology into Tier 1 skills for Claude and Gemini

Distills what made this session's track specs high-quality into reusable
methodology for both Claude and Gemini Tier 1 orchestrators:

Key additions to conductor-new-track.md:
- MANDATORY Step 2: Deep Codebase Audit before writing any spec
- 'Current State Audit' section template (Already Implemented + Gaps)
- 6 rules for writing worker-ready tasks (WHERE/WHAT/HOW/SAFETY)
- Anti-patterns section (vague specs, no line refs, no audit, etc.)
- Architecture doc fallback references

Key additions to mma-tier1-orchestrator.md (Claude + Gemini):
- 'The Surgical Methodology' section with 6 protocols
- Spec template with REQUIRED sections (Current State Audit is mandatory)
- Plan template with REQUIRED task format (file:line refs + API calls)
- Root cause analysis requirement for fix tracks
- Cross-track dependency mapping requirement
- Added py_get_definition to Gemini's tool list (was missing)

The core insight: the quality gap between this session's output and previous
track specs came from (1) reading actual code before writing specs, (2) listing
what EXISTS before what's MISSING, and (3) specifying exact locations and APIs
in tasks so lesser models don't have to search or guess.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-01 10:08:25 -05:00

.claude

conductor: Encode surgical spec methodology into Tier 1 skills for Claude and Gemini

2026-03-01 10:08:25 -05:00

.gemini

conductor: Encode surgical spec methodology into Tier 1 skills for Claude and Gemini

2026-03-01 10:08:25 -05:00

conductor

chore(conductor): Add index.md to new tracks, archive completed/superseded tracks

2026-03-01 10:00:49 -05:00

docs

docs: Complete documentation rewrite at gencpp/VEFontCache reference quality

2026-03-01 09:44:50 -05:00

gallery

latest ux and readme update

2026-02-21 23:38:24 -05:00

MMA_Support

chore(conductor): Fix SKILL.md and documentation typos to correctly use the new Role-Based sub-agent protocol

2026-02-25 09:15:25 -05:00

mma-orchestrator

Fixes to mma and conductor.

2026-02-28 21:59:28 -05:00

scripts

dont use flash-lite for tier 3

2026-03-01 09:07:17 -05:00

simulation

feat(taxonomy): Redirect logs and artifacts to dedicated sub-folders

2026-03-01 09:03:02 -05:00

tests

feat(taxonomy): Redirect logs and artifacts to dedicated sub-folders

2026-03-01 09:03:02 -05:00

.coverage

chore: Checkpoint commit of unstaged changes, including new tests and debug scripts

2026-02-26 21:39:03 -05:00

.dockerignore

feat(headless): Implement Phase 5 - Dockerization

2026-02-25 13:23:04 -05:00

.editorconfig

fix(conductor): Apply review suggestions for track 'mma_utilization_refinement_20260226'

2026-02-26 08:35:50 -05:00

.gitignore

chore(conductor): Fix .gitignore corruption and add artifact/log dirs

2026-03-01 08:58:45 -05:00

.mcp.json

fix(mcp): wire run_powershell and MCP server for Windows/Scoop environment

2026-02-28 15:00:05 -05:00

aggregate.py

fix(conductor): Apply review suggestions for track 'python_style_refactor_20260227'

2026-02-28 20:53:03 -05:00

ai_client.py

fix(mma): Unblock visual simulation - event routing, loop passing, adapter preservation

2026-03-01 08:32:31 -05:00

api_hook_client.py

chore(checkpoint): Phase 6 Test Suite Stabilization complete. 257/261 tests PASS. Resolved run_linear drift, formatter expectations, and Hook Server startup.

2026-02-28 20:42:54 -05:00

api_hooks.py

fix(mma): Unblock visual simulation - event routing, loop passing, adapter preservation

2026-03-01 08:32:31 -05:00

ARCHITECTURE.md

checkpoint: Claude Code integration + implement missing MCP var tools

2026-02-28 10:47:42 -05:00

BUILD.md

checkpoint: Claude Code integration + implement missing MCP var tools

2026-02-28 10:47:42 -05:00

CLAUDE.md

fix(mcp): wire run_powershell and MCP server for Windows/Scoop environment

2026-02-28 15:00:05 -05:00

conductor_tech_lead.py

checkpoint: this is a mess... need to define stricter DSL or system for how the AI devices sims and hookup api for tests.

2026-02-28 22:50:14 -05:00

CONDUCTOR.md

checkpoint: Claude Code integration + implement missing MCP var tools

2026-02-28 10:47:42 -05:00

config.toml

checkpoint: this is a mess... need to define stricter DSL or system for how the AI devices sims and hookup api for tests.

2026-02-28 22:50:14 -05:00

dag_engine.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

debug_ast_2.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

debug_ast.py

checkpoint: massive refactor

2026-02-28 09:06:45 -05:00

Dockerfile

feat(headless): Implement Phase 5 - Dockerization

2026-02-25 13:23:04 -05:00

events.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

file_cache.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

fix_task.toml

chore(conductor): Archive track 'Add support for the deepseek api as a provider.'

2026-02-25 23:34:46 -05:00

gemini_cli_adapter.py

refactor(types): Phase 4 type hint sweep — core modules

2026-02-28 15:13:55 -05:00

GEMINI.md

docs: Update entry point to gui_2.py

2026-02-24 20:37:20 -05:00

gemini.py

refactor(types): Phase 4 type hint sweep — core modules

2026-02-28 15:13:55 -05:00

get_file_summary.bat

TOOLS

2026-02-27 22:10:46 -05:00

gui_2.py

feat(taxonomy): Redirect logs and artifacts to dedicated sub-folders

2026-03-01 09:03:02 -05:00

gui_legacy.py

refactor(indentation): Apply codebase-wide 1-space ultra-compact refactor. Formatted 21 core modules and tests.

2026-02-28 19:36:38 -05:00

hello.ps1

chore(checkpoint): Phase 6 Test Suite Stabilization complete. 257/261 tests PASS. Resolved run_linear drift, formatter expectations, and Hook Server startup.

2026-02-28 20:42:54 -05:00

inspect_ast.py

checkpoint: massive refactor

2026-02-28 09:06:45 -05:00

JOURNAL.md

checkpoint: Claude Code integration + implement missing MCP var tools

2026-02-28 10:47:42 -05:00

log_pruner.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

log_registry.py

refactor(types): Phase 4 type hint sweep — core modules

2026-02-28 15:13:55 -05:00

MainContext.md

feat(gui): Rename gui.py to gui_legacy.py and update references

2026-02-24 20:36:04 -05:00

manual_slop_history.toml

checkpoint

2026-02-27 18:35:11 -05:00

manual_slop.toml

docs(conductor): Expert-level architectural documentation refresh

2026-03-01 09:19:48 -05:00

manualslop_layout.ini

checkpoint: this is a mess... need to define stricter DSL or system for how the AI devices sims and hookup api for tests.

2026-02-28 22:50:14 -05:00

mcp_client.py

refactor(indentation): Apply codebase-wide 1-space ultra-compact refactor. Formatted 21 core modules and tests.

2026-02-28 19:36:38 -05:00

mcp_env.toml

fix(mcp): wire run_powershell and MCP server for Windows/Scoop environment

2026-02-28 15:00:05 -05:00

mma_prompts.py

chore(checkpoint): Phase 4 Codebase-Wide Type Hint Sweep complete. Total fixes: ~400+. Verification status: 230 pass, 16 fail (pre-existing API drift), 29 error (live_gui env).

2026-02-28 19:35:46 -05:00

MMA_UX_SPEC.md

docs(mma): Add Phase 7 UX specification and update track plan

2026-02-26 21:37:45 -05:00

models.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

multi_agent_conductor.py

fix(mma): Unblock visual simulation - event routing, loop passing, adapter preservation

2026-03-01 08:32:31 -05:00

orchestrator_pm.py

refactor(types): Phase 4 type hint sweep — core modules

2026-02-28 15:13:55 -05:00

outline_tool.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

performance_monitor.py

refactor(types): Phase 4 type hint sweep — core modules

2026-02-28 15:13:55 -05:00

project_history.toml

checkpoint

2026-02-28 20:53:46 -05:00

project_manager.py

refactor(indentation): Apply codebase-wide 1-space ultra-compact refactor. Formatted 21 core modules and tests.

2026-02-28 19:36:38 -05:00

project.toml

feat(ui): Support multiple concurrent AI response streams and strategy visualization

2026-02-27 22:56:40 -05:00

pyproject.toml

checkpoint: Claude Code integration + implement missing MCP var tools

2026-02-28 10:47:42 -05:00

Readme.md

docs: Complete documentation rewrite at gencpp/VEFontCache reference quality

2026-03-01 09:44:50 -05:00

refactor_ui_task.toml

checkpoint: massive refactor

2026-02-28 09:06:45 -05:00

reproduce_issue.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

reproduce_missing_hints.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

requirements.txt

feat(mma): Decouple UI from API calls using UserRequestEvent and AsyncEventQueue

2026-02-26 20:45:23 -05:00

run_tests.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

sanity_task.toml

chore(conductor): Archive track 'Add support for the deepseek api as a provider.'

2026-02-25 23:34:46 -05:00

scan_report.txt

chore(checkpoint): Phase 4 Codebase-Wide Type Hint Sweep complete. Total fixes: ~400+. Verification status: 230 pass, 16 fail (pre-existing API drift), 29 error (live_gui env).

2026-02-28 19:35:46 -05:00

session_logger.py

feat(taxonomy): Redirect logs and artifacts to dedicated sub-folders

2026-03-01 09:03:02 -05:00

shell_runner.py

refactor(indentation): Apply codebase-wide 1-space ultra-compact refactor. Formatted 21 core modules and tests.

2026-02-28 19:36:38 -05:00

summarize.py

refactor(types): Phase 4 type hint sweep — core modules

2026-02-28 15:13:55 -05:00

task.toml

chore(conductor): Archive track 'Add support for the deepseek api as a provider.'

2026-02-25 23:34:46 -05:00

test_mma_persistence.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

tests.toml

chore(tests): Move meta-infrastructure tests to conductor/tests/ for permanent isolation

2026-02-27 19:01:12 -05:00

theme_2.py

refactor(types): Phase 4 type hint sweep — core modules

2026-02-28 15:13:55 -05:00

theme.py

refactor(types): auto -> None sweep across entire codebase

2026-02-28 11:16:56 -05:00

verify_pm_changes.py

checkpoint!

2026-02-27 20:21:52 -05:00

Readme.md

Manual Slop

A GUI orchestrator for local LLM-driven coding sessions. Manual Slop bridges high-latency AI reasoning with a low-latency ImGui render loop via a thread-safe asynchronous pipeline, ensuring every AI-generated payload passes through a human-auditable gate before execution.

Tech Stack: Python 3.11+, Dear PyGui / ImGui, FastAPI, Uvicorn Providers: Gemini API, Anthropic API, DeepSeek, Gemini CLI (headless) Platform: Windows (PowerShell) — single developer, local use

Architecture at a Glance

Four thread domains operate concurrently: the ImGui main loop, an asyncio worker for AI calls, a HookServer (HTTP on :8999) for external automation, and transient threads for model fetching. Background threads never write GUI state directly — they serialize task dicts into lock-guarded lists that the main thread drains once per frame (details).

The Execution Clutch suspends the AI execution thread on a threading.Condition when a destructive action (PowerShell script, sub-agent spawn) is requested. The GUI renders a modal where the user can read, edit, or reject the payload. On approval, the condition is signaled and execution resumes (details).

The MMA (Multi-Model Agent) system decomposes epics into tracks, tracks into DAG-ordered tickets, and executes each ticket with a stateless Tier 3 worker that starts from ai_client.reset_session() — no conversational bleed between tickets (details).

Documentation

Guide	Scope
Architecture	Threading model, event system, AI client multi-provider architecture, HITL mechanism, comms logging
Tools & IPC	MCP Bridge security model, all 26 native tools, Hook API endpoints, ApiHookClient reference, shell runner
MMA Orchestration	4-tier hierarchy, Ticket/Track data structures, DAG engine, ConductorEngine execution loop, worker lifecycle
Simulations	`live_gui` fixture, Puppeteer pattern, mock provider, visual verification patterns, ASTParser / summarizer

Module Map

File	Lines	Role
`gui_2.py`	~3080	Primary ImGui interface — App class, frame-sync, HITL dialogs
`ai_client.py`	~1800	Multi-provider LLM abstraction (Gemini, Anthropic, DeepSeek, Gemini CLI)
`mcp_client.py`	~870	26 MCP tools with filesystem sandboxing and tool dispatch
`api_hooks.py`	~330	HookServer — REST API for external automation on `:8999`
`api_hook_client.py`	~245	Python client for the Hook API (used by tests and external tooling)
`multi_agent_conductor.py`	~250	ConductorEngine — Tier 2 orchestration loop with DAG execution
`conductor_tech_lead.py`	~100	Tier 2 ticket generation from track briefs
`dag_engine.py`	~100	TrackDAG (dependency graph) + ExecutionEngine (tick-based state machine)
`models.py`	~100	Ticket, Track, WorkerContext dataclasses
`events.py`	~89	EventEmitter, AsyncEventQueue, UserRequestEvent
`project_manager.py`	~300	TOML config persistence, discussion management, track state
`session_logger.py`	~200	JSON-L + markdown audit trails (comms, tools, CLI, hooks)
`shell_runner.py`	~100	PowerShell execution with timeout, env config, QA callback
`file_cache.py`	~150	ASTParser (tree-sitter) — skeleton and curated views
`summarize.py`	~120	Heuristic file summaries (imports, classes, functions)
`outline_tool.py`	~80	Hierarchical code outline via stdlib `ast`

Setup

Prerequisites

Python 3.11+
uv for package management

Installation

git clone <repo>
cd manual_slop
uv sync

Credentials

Configure in credentials.toml:

[gemini]
api_key = "YOUR_KEY"

[anthropic]
api_key = "YOUR_KEY"

[deepseek]
api_key = "YOUR_KEY"

Running

uv run gui_2.py                        # Normal mode
uv run gui_2.py --enable-test-hooks    # With Hook API on :8999

Running Tests

uv run pytest tests/ -v

Project Configuration

Projects are stored as <name>.toml files. The discussion history is split into a sibling <name>_history.toml to keep the main config lean.

[project]
name = "my_project"
git_dir = "./my_repo"
system_prompt = ""

[files]
base_dir = "./my_repo"
paths = ["src/**/*.py", "README.md"]

[screenshots]
base_dir = "./my_repo"
paths = []

[output]
output_dir = "./md_gen"

[gemini_cli]
binary_path = "gemini"

[agent.tools]
run_powershell = true
read_file = true
# ... 26 tool flags