Files
manual_slop/TASKS.md

5.2 KiB

TASKS.md

Active Tracks

(none — all planned tracks queued below)

Completed This Session

  • mma_agent_focus_ux_20260302 — Per-tier source_tier tagging on comms+tool entries; Focus Agent combo UI; filter logic in comms+tool panels; [tier] label per comms entry. 18 tests. Checkpoint: b30e563.
  • feature_bleed_cleanup_20260302 — Removed dead comms panel dup, dead menubar block, duplicate init vars; added working Quit; fixed Token Budget layout. All phases verified. Checkpoint: 0d081a2.
  • context_token_viz_20260301 — Token budget panel (color bar, breakdown table, trim warning, cache status, auto-refresh). All phases verified. Commit: d577457.
  • tech_debt_and_test_cleanup_20260302 — [BOTCHED/ARCHIVED] Centralized fixtures but exposed deep asyncio flaws.

Planned: The Strict Execution Queue

All previously loose backlog items have been rigorously spec'd and initialized as Conductor Tracks. They MUST be executed in this exact order.

1. test_stabilization_20260302 (Active/Next)

  • Status: Initialized / Looked Over
  • Priority: High
  • Goal: Stabilize asyncio errors, ban mock-rot, completely remove gui_legacy.py, and consolidate testing paradigms.

2. strict_static_analysis_and_typing_20260302

  • Status: Initialized / Looked Over
  • Priority: High
  • Goal: Resolve 512+ mypy errors and remaining ruff violations to secure the foundation before refactoring. Add pre-commit hooks.

3. codebase_migration_20260302

  • Status: Initialized / Looked Over
  • Priority: High
  • Goal: Restructure directories to a src/ layout. Doing this after static analysis ensures no hidden import bugs are introduced. Creates sloppy.py entry point.

4. gui_decoupling_controller_20260302

  • Status: Initialized / Looked Over
  • Priority: High
  • Goal: Extract the state machine and core lifecycle into a headless app_controller.py, leaving gui_2.py as a pure, immediate-mode view.

5. hook_api_ui_state_verification_20260302

  • Status: Initialized / Looked Over
  • Priority: Medium
  • Goal: Add a /api/gui/state GET endpoint. Wire UI state into _settable_fields to enable programmatic live_gui testing without user confirmation.

6. robust_json_parsing_tech_lead_20260302

  • Status: Initialized / Looked Over
  • Priority: Medium
  • Goal: Implement an auto-retry loop that catches JSONDecodeError and feeds the traceback to the Tier 2 model for self-correction.

7. concurrent_tier_source_tier_20260302

  • Status: Initialized / Looked Over
  • Priority: Low
  • Goal: Replace global state with threading.local() or explicit context passing to guarantee thread-safe logging when multiple Tier 3 workers process tickets in parallel.

8. test_suite_performance_and_flakiness_20260302

  • Status: Initialized / Looked Over
  • Priority: Low
  • Goal: Replace time.sleep() with deterministic polling or threading.Event() triggers. Mark exceptionally heavy tests with @pytest.mark.slow.

9. manual_ux_validation_20260302

  • Status: Initialized / Looked Over
  • Priority: Medium
  • Goal: Highly interactive human-in-the-loop track to review and adjust GUI UX, animations, popups, and layout structures based on slow-interval simulation feedback.

Phase 3: Future Horizons (Post-Hardening Backlog)

To be evaluated in a future Tier 1 session once the Strict Execution Queue is cleared and the architectural foundation is stabilized.

1. True Parallel Worker Execution (The DAG Realization)

Goal: Implement true concurrency for the DAG engine. Once threading.local() is in place, the ExecutionEngine should spawn independent Tier 3 workers in parallel (e.g., 4 workers handling 4 isolated tests simultaneously). Requires strict file-locking or a Git-based diff-merging strategy to prevent AST collision.

2. Deep AST-Driven Context Pruning (RAG for Code)

Goal: Before dispatching a Tier 3 worker, use tree_sitter to automatically parse the target file's AST, strip out unrelated function bodies, and inject a surgically condensed skeleton into the worker's prompt. Guarantees the AI only "sees" what it needs to edit, drastically reducing token burn.

3. Visual DAG & Interactive Ticket Editing

Goal: Replace the linear ticket list in the GUI with an interactive Node Graph using ImGui Bundle's node editor. Allow the user to visually drag dependency lines, split nodes, or delete tasks before clicking "Execute Pipeline."

4. Advanced Tier 4 QA Auto-Patching

Goal: Elevate Tier 4 from a log summarizer to an auto-patcher. When a verification test fails, Tier 4 generates a .patch file. The GUI intercepts this and presents a side-by-side Diff Viewer. The user clicks "Apply Patch" to instantly resume the pipeline.

5. Transitioning to a Native Orchestrator

Goal: Absorb the Conductor extension entirely into the core application. Manual Slop should natively read/write plan.md, manage the metadata.json, and orchestrate the MMA tiers in pure Python, removing the dependency on external CLI shell executions (mma_exec.py).