Monitor thread activities (!596) · Merge requests · GitLab.org / ModelOps / AI Assisted (formerly Applied ML) / Code Suggestions / AI Gateway

Shinya Maeda requested to merge monitoring-threads into main Jan 17, 2024

What does this merge request do and why?

This MR introduces a new task for monitoring the thread activities, which is ai_gateway/instrumentators/threads.py module. So that we can understand how many threads are running on the server and what they are doing. This metric can be later used for Execute CodeParser.from_language_id in a separa... (!595 - merged) and adjusting concurrent requests at Cloud Run according to the Cloud Run doc.

How to set up and validate locally

Checkout this branch.
Set the following environment variables to .env:

# Instrumentators
AIGW_INSTRUMENTATOR__THREAD_MONITORING_ENABLED=True
AIGW_INSTRUMENTATOR__THREAD_MONITORING_INTERVAL=1

poetry run ai_gateway
Run the following request:

curl -X 'POST'   'http://0.0.0.0:5052/v2/completions'   -H 'accept: application/json'   -H 'Content-Type: application/json'   -d '{ 
  "project_path": "string",
  "project_id": 0,
  "current_file": {
    "file_name": "main.py",
    "language_identifier": "python",
    "content_above_cursor": "print",
    "content_below_cursor": ""
  },
  "model_provider": "vertex-ai",
  "model_name": "code-gecko@002",
  "telemetry": [],
  "stream": false,
  "prompt_version": 1
}'

While the request is being executed, check http://0.0.0.0:8082/ to see the ai_gateway_threads_count.
You can also check threads logger in modelgateway_debug.log to see the details of the thread activities.

Merge request checklist

Tests added for new functionality. If not, please raise an issue to follow up.
Documentation added/updated, if needed.

Edited Jan 17, 2024 by Shinya Maeda

Monitor thread activities

What does this merge request do and why?

How to set up and validate locally

Further reading

Merge request checklist

Merge request reports