- feat: add gemini client (!113, 0 of 2 checklist items completed)
- feat: add openai as cs eval client (!112, 0 of 2 checklist items completed)
- feat: add groq provider for CS (!111, 0 of 2 checklist items completed)
- feat: update code-suggestions evaluation scripts (!110, 0 of 2 checklist items completed)
- Update the fix-broken-pipeline evaluation for Duo Workflow (1 of 2 checklist items completed)
- Evaluate Duo Chat on an issue/epic-related QA dataset (0 of 2 checklist items completed)
- Draft: Codestral latency test scripts (0 of 2 checklist items completed)
- Draft: Resolve "Implement functional correctness for mbpp" (0 of 2 checklist items completed)
- Draft: Update LLM evaluator (0 of 2 checklist items completed)