5.2 KiB
TASKS.md
Active Tracks
(none — all planned tracks queued below)
Completed This Session
mma_agent_focus_ux_20260302— Per-tier source_tier tagging on comms+tool entries; Focus Agent combo UI; filter logic in comms+tool panels; [tier] label per comms entry. 18 tests. Checkpoint:b30e563.feature_bleed_cleanup_20260302— Removed dead comms panel dup, dead menubar block, duplicate init vars; added working Quit; fixed Token Budget layout. All phases verified. Checkpoint:0d081a2.context_token_viz_20260301— Token budget panel (color bar, breakdown table, trim warning, cache status, auto-refresh). All phases verified. Commit:d577457.tech_debt_and_test_cleanup_20260302— [BOTCHED/ARCHIVED] Centralized fixtures but exposed deep asyncio flaws.
Planned: The Strict Execution Queue
All previously loose backlog items have been rigorously spec'd and initialized as Conductor Tracks. They MUST be executed in this exact order.
1. test_stabilization_20260302 (Active/Next)
- Status: Initialized / Looked Over
- Priority: High
- Goal: Stabilize
asyncioerrors, ban mock-rot, completely removegui_legacy.py, and consolidate testing paradigms.
2. strict_static_analysis_and_typing_20260302
- Status: Initialized / Looked Over
- Priority: High
- Goal: Resolve 512+ mypy errors and remaining ruff violations to secure the foundation before refactoring. Add pre-commit hooks.
3. codebase_migration_20260302
- Status: Initialized / Looked Over
- Priority: High
- Goal: Restructure directories to a
src/layout. Doing this after static analysis ensures no hidden import bugs are introduced. Createssloppy.pyentry point.
4. gui_decoupling_controller_20260302
- Status: Initialized / Looked Over
- Priority: High
- Goal: Extract the state machine and core lifecycle into a headless
app_controller.py, leavinggui_2.pyas a pure, immediate-mode view.
5. hook_api_ui_state_verification_20260302
- Status: Initialized / Looked Over
- Priority: Medium
- Goal: Add a
/api/gui/stateGET endpoint. Wire UI state into_settable_fieldsto enable programmaticlive_guitesting without user confirmation.
6. robust_json_parsing_tech_lead_20260302
- Status: Initialized / Looked Over
- Priority: Medium
- Goal: Implement an auto-retry loop that catches
JSONDecodeErrorand feeds the traceback to the Tier 2 model for self-correction.
7. concurrent_tier_source_tier_20260302
- Status: Initialized / Looked Over
- Priority: Low
- Goal: Replace global state with
threading.local()or explicit context passing to guarantee thread-safe logging when multiple Tier 3 workers process tickets in parallel.
8. test_suite_performance_and_flakiness_20260302
- Status: Initialized / Looked Over
- Priority: Low
- Goal: Replace
time.sleep()with deterministic polling orthreading.Event()triggers. Mark exceptionally heavy tests with@pytest.mark.slow.
9. manual_ux_validation_20260302
- Status: Initialized / Looked Over
- Priority: Medium
- Goal: Highly interactive human-in-the-loop track to review and adjust GUI UX, animations, popups, and layout structures based on slow-interval simulation feedback.
Phase 3: Future Horizons (Post-Hardening Backlog)
To be evaluated in a future Tier 1 session once the Strict Execution Queue is cleared and the architectural foundation is stabilized.
1. True Parallel Worker Execution (The DAG Realization)
Goal: Implement true concurrency for the DAG engine. Once threading.local() is in place, the ExecutionEngine should spawn independent Tier 3 workers in parallel (e.g., 4 workers handling 4 isolated tests simultaneously). Requires strict file-locking or a Git-based diff-merging strategy to prevent AST collision.
2. Deep AST-Driven Context Pruning (RAG for Code)
Goal: Before dispatching a Tier 3 worker, use tree_sitter to automatically parse the target file's AST, strip out unrelated function bodies, and inject a surgically condensed skeleton into the worker's prompt. Guarantees the AI only "sees" what it needs to edit, drastically reducing token burn.
3. Visual DAG & Interactive Ticket Editing
Goal: Replace the linear ticket list in the GUI with an interactive Node Graph using ImGui Bundle's node editor. Allow the user to visually drag dependency lines, split nodes, or delete tasks before clicking "Execute Pipeline."
4. Advanced Tier 4 QA Auto-Patching
Goal: Elevate Tier 4 from a log summarizer to an auto-patcher. When a verification test fails, Tier 4 generates a .patch file. The GUI intercepts this and presents a side-by-side Diff Viewer. The user clicks "Apply Patch" to instantly resume the pipeline.
5. Transitioning to a Native Orchestrator
Goal: Absorb the Conductor extension entirely into the core application. Manual Slop should natively read/write plan.md, manage the metadata.json, and orchestrate the MMA tiers in pure Python, removing the dependency on external CLI shell executions (mma_exec.py).