Update evaluations for Work Item


Once we introduce the new Work Item tool, we need to update our evaluations to test the newly supported cases.

We need to update the regression evaluation. First we should verify whether it already covers those cases.

We also need a separate dataset that covers the different types of work item objects.
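
As a rough sketch of what such a dataset could look like, the snippet below groups cases by work item type and reports which types have no case yet. Everything in it is an assumption for illustration: the `EvalCase` fields, the `missing_types` helper, the list of types, and the placeholder references are hypothetical and do not reflect the actual evaluation framework or real work items.

```python
from dataclasses import dataclass

# Hypothetical list of work item types the dataset should cover; adjust it to
# whatever the Work Item tool ends up supporting.
WORK_ITEM_TYPES = ["issue", "task", "epic", "incident"]


@dataclass(frozen=True)
class EvalCase:
    work_item_type: str  # one of WORK_ITEM_TYPES
    reference: str       # placeholder identifier, not a real work item
    question: str        # prompt sent to the system under evaluation


# Placeholder cases for illustration only.
DATASET = [
    EvalCase("issue", "example-issue-1", "Summarize this issue."),
    EvalCase("task", "example-task-1", "What is the status of this task?"),
    EvalCase("epic", "example-epic-1", "List the open child items of this epic."),
    EvalCase("incident", "example-incident-1", "Summarize this incident."),
]


def missing_types(cases, expected_types=WORK_ITEM_TYPES):
    """Return the expected work item types that have no case in the dataset."""
    covered = {case.work_item_type for case in cases}
    return [t for t in expected_types if t not in covered]
```

A check like `missing_types(DATASET)` could run before the evaluation itself, so that a newly supported type without any test case is noticed early.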

Note: this issue can be implemented after the Work Item support is done, since we can still use manual testing in the meantime to verify that it works correctly.
