chore(conductor): Add new track 'Fix 45 failing test files across 12 batches'

2026-05-14 20:05:09 -04:00
parent ef91c64a65
commit 62628b0878
5 changed files with 68 additions and 4 deletions
@@ -62,6 +62,13 @@ This file tracks all major tracks for the project. Each track has its own detail
    *Link: [./tracks/gui_refactor_stabilization_20260512/](./tracks/gui_refactor_stabilization_20260512/)*
    *Goal: Refactor gui_2.py to fix regressions and enforce better imgui scoping patterns.*
 14. [x] **Track: I started to do a large cleanup to ./src/gui_2.py. I want you to study it and derive more information on how to maintain and write code for the python codebase. Please update product guidlines or the python code_styleguidleines based on what you discover. Also we may need to make some changes the mcp_tools for better structural awareness of annotations or other conventions with these python files. There is still more orgnaizatoin to be done like annotation/organizing the __init__ method's declarations, among other nitpicks.**
 *Link: [./tracks/gui_2_cleanup_20260513/](./tracks/gui_2_cleanup_20260513/)*
 ---
 15. [x] **Track: Add Python structural MCP tools (py_remove_def, py_add_def, py_move_def, py_region_wrap)**
 *Link: [./tracks/python_structural_mcp_tools_20260513/](./tracks/python_structural_mcp_tools_20260513/)*
 ---
 ## Hot Reload Feature
@@ -258,9 +265,7 @@ This file tracks all major tracks for the project. Each track has its own detail
 ---
 - [x] **Track: I started to do a large cleanup to ./src/gui_2.py. I want you to study it and derive more information on how to maintain and write code for the python codebase. Please update product guidlines or the python code_styleguidleines based on what you discover. Also we may need to make some changes the mcp_tools for better structural awareness of annotations or other conventions with these python files. There is still more orgnaizatoin to be done like annotation/organizing the __init__ method's declarations, among other nitpicks.**
 *Link: [./tracks/gui_2_cleanup_20260513/](./tracks/gui_2_cleanup_20260513/)*
 ---
- [x] **Track: Add Python structural MCP tools (py_remove_def, py_add_def, py_move_def, py_region_wrap)**
+- [ ] **Track: Fix 45 failing test files across 12 batches**
-*Link: [./tracks/python_structural_mcp_tools_20260513/](./tracks/python_structural_mcp_tools_20260513/)*
+*Link: [./tracks/fix_test_suite_failures_20260514/](./tracks/fix_test_suite_failures_20260514/)*
@@ -0,0 +1,5 @@
 # Track fix_test_suite_failures_20260514 Context
 - [Specification](./spec.md)
 - [Implementation Plan](./plan.md)
 - [Metadata](./metadata.json)
@@ -0,0 +1,8 @@
 {
  "track_id": "fix_test_suite_failures_20260514",
  "type": "bug",
  "status": "new",
  "created_at": "2026-05-14T00:00:00Z",
  "updated_at": "2026-05-14T00:00:00Z",
  "description": "Fix 45 failing test files across 12 batches"
 }
@@ -0,0 +1,28 @@
 # Implementation Plan: Fix All Remaining Test Failures
 ## Phase 1: GUI and Layout Tests
 - [ ] Task: Fix `test_gui_discussion_tabs.py` (`AssertionError: 'Original###main' in []`).
 - [ ] Task: Fix `test_gui_window_controls.py` (`ValueError: not enough values to unpack`).
 - [ ] Task: Fix `test_project_settings_rename.py` (Verify "Project Settings" rename in code).
 - [ ] Task: Fix `test_session_hub_merge.py` (Add missing tabs Context Composition, Snapshot, Takes to Discussion Hub).
 - [ ] Task: Fix `test_preset_windows_layout.py` (Address `None == 'ok'` and timeout issues).
 - [ ] Task: Fix `test_shader_live_editor.py` (Ensure `imgui.begin` is called correctly).
 - [ ] Task: Conductor - User Manual Verification 'Phase 1: GUI and Layout Tests' (Protocol in workflow.md)
 ## Phase 2: RAG System Tests
 - [ ] Task: Fix `test_rag_engine.py` (Resolve `SentenceTransformer` attribute error).
 - [ ] Task: Fix `test_rag_gui_presence.py` (Ensure `_render_rag_panel` is called in `_gui_func`).
 - [ ] Task: Fix `test_rag_phase4_final_verify.py` (Address indexing timeout).
 - [ ] Task: Fix `test_rag_phase4_stress.py` (Optimize incremental indexing).
 - [ ] Task: Conductor - User Manual Verification 'Phase 2: RAG System Tests' (Protocol in workflow.md)
 ## Phase 3: MMA, Workflow, and Negative Flow Tests
 - [ ] Task: Fix `test_auto_switch_sim.py` (`AttributeError: 'NoneType' object has no attribute 'get'`).
 - [ ] Task: Fix `test_mma_approval_indicators.py` (Ensure 'APPROVAL PENDING' badge is displayed correctly).
 - [ ] Task: Fix `test_history_manager.py` (Provide missing `context_files` argument to `UISnapshot.__init__`).
 - [ ] Task: Fix `test_z_negative_flows.py` (Fix response event and subprocess timeouts).
 - [ ] Task: Conductor - User Manual Verification 'Phase 3: MMA, Workflow, and Negative Flow Tests' (Protocol in workflow.md)
 ## Phase 4: Remaining Batched Failures
 - [ ] Task: Run the test suite and verify if any remaining files from the 45 failed batches still have underlying issues not covered above.
 - [ ] Task: Conductor - User Manual Verification 'Phase 4: Remaining Batched Failures' (Protocol in workflow.md)
@@ -0,0 +1,18 @@
 # Specification: Fix All Remaining Test Failures
 ## 1. Overview
 The current test suite has 45 failing test files across 12 batches. The objective of this track is to systematically analyze and resolve all test failures, returning the test suite to a 100% passing state.
 ## 2. Scope
 - **In Scope:**
  - Analyzing the root cause of each failing test.
  - Modifying application code where bugs or regressions are identified.
  - Updating test code where tests are outdated, flaky, or no longer align with intentional architectural changes.
  - Using Tier 3/4 workers for deep diagnostics and targeted fixes as per MMA guidelines.
 - **Out of Scope:**
  - Adding new features.
  - Extensive refactoring of systems unrelated to the test failures.
  - Rewriting the test suite or testing framework.
 ## 3. Success Criteria
 - Running `uv run .\scripts\run_tests_batched.py` results in 0 failed batches and all tests passing.