3.2 KiB
3.2 KiB
Implementation Plan: Agent Tool Preference & Bias Tuning
Phase 1: Data Model & Storage Extension [checkpoint: 77a0b38]
- Task: Extend the
ToolPresetandToolmodels.77a0b38 - Task: Implement Global Bias Profiles.
77a0b38 - Task: Write unit tests for the extended data model and storage logic.
77a0b38 - Task: Conductor - User Manual Verification 'Phase 1: Data Model Extension' (Protocol in workflow.md)
77a0b38
Phase 2: Orchestration & Nudging Logic [checkpoint: cad04bf]
- Task: Implement the
ToolBiasEngineinsrc/ai_client.py(or a new module).cad04bf - Task: Integrate the bias engine into the AI client
send()loop.cad04bf- Ensure that for every agent turn, the tool definitions and system instructions are dynamically biased based on the active agent's role and selected preset.
cad04bf
- Ensure that for every agent turn, the tool definitions and system instructions are dynamically biased based on the active agent's role and selected preset.
- Task: Write integration tests for the bias generation logic.
cad04bf - Task: Conductor - User Manual Verification 'Phase 2: Orchestration Logic' (Protocol in workflow.md)
cad04bf
Phase 3: GUI Integration [checkpoint: 1c83b3e]
- Task: Update the Tool Preset Manager UI.
1c83b3e - Task: Enhance tool list visualization.
1c83b3e- Implement color-coded priority badges in the Operations panel and tool settings.
1c83b3e
- Implement color-coded priority badges in the Operations panel and tool settings.
- Task: Implement the "Bias Override" in the agent focus modal.
1c83b3e- Add a dropdown to select a global bias profile or a specific preset override before spawning a worker.
1c83b3e
- Add a dropdown to select a global bias profile or a specific preset override before spawning a worker.
- Task: Write integration tests for the new UI data flow.
1c83b3e - Task: Conductor - User Manual Verification 'Phase 3: GUI Integration' (Protocol in workflow.md)
1c83b3e
Phase 4: Verification & Final Polish [checkpoint: 85ae409]
- Task: Create a Bias Efficacy Simulation.
85ae409- Implement a specialized simulation test where two tools could solve a problem, and verify the agent chooses the one with higher weight.
85ae409
- Implement a specialized simulation test where two tools could solve a problem, and verify the agent chooses the one with higher weight.
- Task: Final UI polish (spacing, icons, tooltips explaining the bias system).
85ae409 - Task: Run full suite of relevant tests.
85ae409 - Task: Conductor - User Manual Verification 'Phase 4: Verification & Polish' (Protocol in workflow.md)
85ae409