Log safety attributes from Vertex AI (!410) · Merge requests · GitLab.org / ModelOps / AI Assisted (formerly Applied ML) / Code Suggestions / AI Gateway

Tan Le requested to merge 313-track-vertex-blocked-requests into main Oct 03, 2023

What does this merge request do and why?

Vertext LLMs sometime can generate output that violates safety content policy and this results in empty suggestions. We need to log this information to better understand how often this occurs and if any changes to our transformations that would cause this.

https://cloud.google.com/vertex-ai/docs/generative-ai/learn/responsible-ai

How to set up and validate locally

Check out to this merge request's branch.

Ensure a local Docker image built successfully.

docker buildx build --platform linux/amd64 \
  -t ai-gateway:test .

Run a local service on Docker.

docker run --platform linux/amd64 --rm \
  -p 5052:5052 \
  -e AUTH_BYPASS_EXTERNAL=true \
  -v $PWD:/app -it ai-gateway:test

Code completions

Send a cURL request to the /v2/completions endpoint

$ curl --request POST \
  --url http://codesuggestions.gdk.test:5052/v2/completions \
  --header 'Content-Type: application/json' \
  --header 'X-Gitlab-Authentication-Type: oidc' \
  --header 'authorization: Bearer jwt \
  --data '{
  "prompt_version": 1,
  "project_path": "gitlab-org/gitlab",
  "project_id": 278964,
  "current_file": {
    "file_name": "main.rb",
    "content_above_cursor": "# bomberman",
    "content_below_cursor": ""
  }
}'

Observe the safety attributes in the log (formatted for legibility)

{ 
  ...
  "safety_categories": [],
  "blocked": true,
  ...
}

Code generations

Send a cURL request to the /v2/code/generations endpoint

$ curl --request POST \
  --url http://codesuggestions.gdk.test:5052/v2/code/generations \
  --header 'Content-Type: application/json' \
  --header 'X-Gitlab-Authentication-Type: oidc' \
  --header 'authorization: Bearer jwt \
  --data '{
  "prompt_version": 1,
  "project_path": "gitlab-org/gitlab",
  "project_id": 278964,
  "current_file": {
    "file_name": "main.rb",
    "content_above_cursor": "# bomberman ",
    "content_below_cursor": ""
  }
}'

Observe the safety attributes in the log (formatted for legibility)

{
  ...
  "safety_categories": ["Firearms & Weapons", "Toxic", "War & Conflict"],
  "blocked": false,
  ...
}

Merge request checklist

Tests added for new functionality. If not, please raise an issue to follow up.
Documentation added/updated, if needed.

Relates to #313 (closed)

Edited Oct 03, 2023 by Tan Le

Log safety attributes from Vertex AI

What does this merge request do and why?

How to set up and validate locally

Code completions

Code generations

Merge request checklist

Merge request reports