Monitor for high percentage of non-200 requests to the AI gateway
A high percentage of non-200 requests to the AI gateway can signify something wrong with the service.
We need to detect that early and be alerted asap to mitigate.
Extracted from the recent incident discussion: production#18064 (comment 1932891015)