chore(conductor): Checkpoint Phase 1: Research and Inventory for test curation track

2026-02-25 20:52:45 -05:00
parent edae93498d
commit be689ad1e9
3 changed files with 75 additions and 5 deletions
--- a/conductor/tracks.md
+++ b/conductor/tracks.md
@@ -45,7 +45,7 @@ This file tracks all major tracks for the project. Each track has its own detail
 ---
- [ ] **Track: Test Suite Curation and Organization**
+- [~] **Track: Test Suite Curation and Organization**
 *Link: [./tracks/test_curation_20260225/](./tracks/test_curation_20260225/)*
--- a/conductor/tracks/test_curation_20260225/inventory.md
+++ b/conductor/tracks/test_curation_20260225/inventory.md
@@ -0,0 +1,70 @@
 # Test Suite Inventory - manual_slop
 ## Categories
 ### Manual Slop Core/GUI
 - `tests/test_ai_context_history.py`
 - `tests/test_api_events.py`
 - `tests/test_gui_diagnostics.py`
 - `tests/test_gui_events.py`
 - `tests/test_gui_performance_requirements.py`
 - `tests/test_gui_stress_performance.py`
 - `tests/test_gui_updates.py`
 - `tests/test_gui2_events.py`
 - `tests/test_gui2_layout.py`
 - `tests/test_gui2_mcp.py`
 - `tests/test_gui2_parity.py`
 - `tests/test_gui2_performance.py`
 - `tests/test_headless_api.py`
 - `tests/test_headless_dependencies.py`
 - `tests/test_headless_startup.py`
 - `tests/test_history_blacklist.py`
 - `tests/test_history_bleed.py` (FAILING)
 - `tests/test_history_migration.py`
 - `tests/test_history_persistence.py`
 - `tests/test_history_truncation.py`
 - `tests/test_performance_monitor.py`
 - `tests/test_token_usage.py`
 - `tests/test_layout_reorganization.py`
 ### Conductor/MMA (To be Blacklisted from core runs)
 - `tests/test_mma_exec.py`
 - `tests/test_mma_skeleton.py`
 - `tests/test_conductor_api_hook_integration.py`
 - `tests/conductor/test_infrastructure.py`
 - `tests/test_gemini_cli_adapter.py`
 - `tests/test_gemini_cli_integration.py` (FAILING)
 - `tests/test_ai_client_cli.py`
 - `tests/test_cli_tool_bridge.py` (FAILING)
 - `tests/test_gemini_metrics.py`
 ### MCP/Integrations
 - `tests/test_api_hook_client.py`
 - `tests/test_api_hook_extensions.py`
 - `tests/test_hooks.py`
 - `tests/test_sync_hooks.py`
 - `tests/test_mcp_perf_tool.py`
 ### Simulation/Workflows
 - `tests/test_sim_ai_settings.py`
 - `tests/test_sim_base.py`
 - `tests/test_sim_context.py`
 - `tests/test_sim_execution.py`
 - `tests/test_sim_tools.py`
 - `tests/test_workflow_sim.py`
 - `tests/test_extended_sims.py`
 - `tests/test_user_agent.py`
 - `tests/test_live_workflow.py`
 - `tests/test_agent_capabilities.py`
 - `tests/test_agent_tools_wiring.py`
 ## Redundancy Observations
 - GUI tests are split between `gui` and `gui2`. Since `gui_2.py` is the current focus, legacy `gui` tests should be reviewed for relevance.
 - History tests are highly fragmented (5+ files).
 - Headless tests are fragmented (3 files).
 - Simulation tests are fragmented (10+ files).
 ## Failure Summary
 - `tests/test_cli_tool_bridge.py`: `test_deny_decision` and `test_unreachable_hook_server` failing (wrong decision returned).
 - `tests/test_gemini_cli_integration.py`: Integration with `gui_2.py` failing to find mock response in history.
 - `tests/test_history_bleed.py`: `test_get_history_bleed_stats_basic` failing (assert 0 == 900000).
--- a/conductor/tracks/test_curation_20260225/plan.md
+++ b/conductor/tracks/test_curation_20260225/plan.md
@@ -3,10 +3,10 @@
 This plan outlines the process for categorizing, organizing, and curating the existing test suite using a central manifest and exhaustive review.
 ## Phase 1: Research and Inventory
- [ ] Task: Initialize MMA Environment `activate_skill mma-orchestrator`
+- [x] Task: Initialize MMA Environment `activate_skill mma-orchestrator`
- [ ] Task: Inventory all existing tests in `tests/` and mapping them to categories
+- [x] Task: Inventory all existing tests in `tests/` and mapping them to categories
- [ ] Task: Identify failing and redundant tests through a full execution sweep
+- [x] Task: Identify failing and redundant tests through a full execution sweep
- [ ] Task: Conductor - User Manual Verification 'Phase 1: Research and Inventory' (Protocol in workflow.md)
+- [x] Task: Conductor - User Manual Verification 'Phase 1: Research and Inventory' (Protocol in workflow.md)
 ## Phase 2: Manifest and Tooling
 - [ ] Task: Design and create `tests.toml` manifest file