3.1 KiB
3.1 KiB
Implementation Plan: Agent Tool Preference & Bias Tuning
Phase 1: Data Model & Storage Extension [checkpoint: 77a0b38]
- Task: Extend the
ToolPresetandToolmodels.77a0b38 - Task: Implement Global Bias Profiles.
77a0b38 - Task: Write unit tests for the extended data model and storage logic.
77a0b38 - Task: Conductor - User Manual Verification 'Phase 1: Data Model Extension' (Protocol in workflow.md)
77a0b38
Phase 2: Orchestration & Nudging Logic [checkpoint: cad04bf]
- Task: Implement the
ToolBiasEngineinsrc/ai_client.py(or a new module).cad04bf - Task: Integrate the bias engine into the AI client
send()loop.cad04bf- Ensure that for every agent turn, the tool definitions and system instructions are dynamically biased based on the active agent's role and selected preset.
cad04bf
- Ensure that for every agent turn, the tool definitions and system instructions are dynamically biased based on the active agent's role and selected preset.
- Task: Write integration tests for the bias generation logic.
cad04bf - Task: Conductor - User Manual Verification 'Phase 2: Orchestration Logic' (Protocol in workflow.md)
cad04bf
Phase 3: GUI Integration
- Task: Update the Tool Preset Manager UI.
- Add
imgui.slider_intfor each tool to adjust its weight. - Add a sub-menu or modal for editing parameter-level bias.
- Add
- Task: Enhance tool list visualization.
- Implement color-coded priority badges in the Operations panel and tool settings.
- Task: Implement the "Bias Override" in the agent focus modal.
- Add a dropdown to select a global bias profile or a specific preset override before spawning a worker.
- Task: Write visual regression tests using
live_guito verify the new UI components. - Task: Conductor - User Manual Verification 'Phase 3: GUI Integration' (Protocol in workflow.md)
Phase 4: Verification & Final Polish
- Task: Create a Bias Efficacy Simulation.
- Implement a specialized simulation test where two tools could solve a problem, and verify the agent chooses the one with higher weight.
- Task: Final UI polish (spacing, icons, tooltips explaining the bias system).
- Task: Run full suite of relevant tests.
- Task: Conductor - User Manual Verification 'Phase 4: Verification & Polish' (Protocol in workflow.md)