Evaluate using AI to evaluate search responses

Background

We want to make changes to Elasticsearch queries to improve relevance of results, but have no automated way to evaluate before/after.

Proposals

Evaluate whether an AI platform could be used to evaluate search query/response accuracy. I'll list a few options below (feel free to add):

Edited by Terri Chu