Improve tool selection
What does this MR do and why?
Fix gitlab-org/modelops/ai-model-validation-and-research/ai-evaluation/ai-experiments#11 (closed)
<example>
tag
Remove explicit "Here is an example of using this tool:" in This removes most of failures.
Remove tool list numbering
This removes (potentially) one failure. I think XML having text node may add cognitive complexity to LLM.
MR acceptance checklist
Please evaluate this MR against the MR acceptance checklist. It helps you analyze changes to reduce risks in quality, performance, reliability, security, and maintainability.
Screenshots or screen recordings
Screenshots are required for UI changes, and strongly recommended for all other merge requests.
Before | After |
---|---|
How to set up and validate locally
Numbered steps to set up and validate the change are strongly suggested.