Daily Run with a Subset of Data in Prod - RCA

🚀 Plan B: Running Daily Subset in Production

📋 Overview

As work in the staging environment progresses (Use a non-prod environment for evaluating GitLa... (#346 - closed)), we plan to kick off Plan B: running a subset of the daily runs in Production via the new RCA chat endpoint.

💡 Proposal

To accomplish this, we need to:

  • 🔑 Generate PATs (Personal Access Tokens) for three users

  • 🔗 Integrate with the slash-troubleshoot endpoint (powered by Claude 3.5 Sonnet) !584 (merged)

  • 🔄 Execute daily runs

  • 🔄 Increase daily runs up to ~1,400 prompts (target date: July 19th)

  • 📊 Update the Daily Run Dashboard

  • 🔍 Conduct spot-check analysis and troubleshoot scores of 1 caused by the error limit

  • 🔍 Conduct spot-check analysis on the prompt and the foundational model to provide a rough benchmark
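
The steps above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual daily-run pipeline: the function names, the `score` field, the token placeholders, and the round-robin use of the three PATs are all hypothetical, and no real endpoint is called.

```python
import random
from datetime import date
from itertools import cycle


def select_daily_subset(prompts, run_date, size):
    """Pick a reproducible sample of prompts for one day's run.

    Seeding with the run date means re-running the same day
    yields the same subset (hypothetical design choice).
    """
    rng = random.Random(run_date.isoformat())
    return rng.sample(prompts, min(size, len(prompts)))


def assign_tokens(subset, tokens):
    """Spread prompts round-robin across the available PATs (assumed usage)."""
    return list(zip(subset, cycle(tokens)))


def flag_for_spot_check(results, failing_score=1):
    """Return results whose score equals the failing score, for manual review."""
    return [r for r in results if r["score"] == failing_score]


if __name__ == "__main__":
    # Placeholder data: real prompts and tokens come from the evaluation setup.
    prompts = [f"prompt-{i}" for i in range(5000)]
    tokens = ["pat-user-a", "pat-user-b", "pat-user-c"]

    subset = select_daily_subset(prompts, date.today(), size=1400)
    work = assign_tokens(subset, tokens)
    print(len(work), "prompts scheduled")
```

Seeding the sampler with the run date keeps a given day's subset stable across retries, which makes it easier to tell whether a score of 1 comes from the response itself or from hitting an error limit.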

📚 Further Details

RCA Dashboard: https://lookerstudio.google.com/reporting/e0af7354-fbbd-46db-a541-e791621b49d7

Edited by Mon Ray