Experiment with LLM judge for code search results

Experiment with some models as LLM judges to see how well they can rerank code search results.

Edited Jul 29, 2024 by Ben Venker

Assignee Loading

Time tracking Loading