  1. Nov 12, 2024
  2. Nov 08, 2024
    • Fix LLM AI client not returning an HTTP response with 204 responses · 98e578c9
      Stan Hu authored
      Previously `Llm::AiGateway::Client#request` returned `nil` if the AI
      Gateway returned a `204 No Content` response. This made it
      impossible to discern whether the request succeeded or whether the
      server returned a 5xx error.
      
      This happened because `run_retry_with_exponential_backoff`
      returned `nil` if the response body was blank. To fix this, return the
      HTTParty response even when the body is blank, and ensure that callers
      handle this case.
      
      Changelog: fixed
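      A minimal sketch of the pattern this commit describes (the `Response` struct below is a stand-in for an `HTTParty::Response`; the real GitLab code is more involved): always return the response object, even with an empty body, so callers can tell a `204` apart from a failure by checking the status rather than testing for `nil`.

      ```ruby
      # Stand-in for an HTTParty::Response: carries a status code and a body.
      Response = Struct.new(:code, :body) do
        def success?
          (200..299).cover?(code)
        end
      end

      def run_retry_with_exponential_backoff
        response = yield
        # Before the fix, a blank body made this method return nil, which
        # collapsed "204 No Content" and "request failed" into the same value.
        # After the fix, the response object is always returned.
        response
      end

      # Callers now check the status instead of assuming nil means failure:
      response = run_retry_with_exponential_backoff { Response.new(204, "") }
      raise "request failed" unless response&.success?
      ```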
  3. Sep 20, 2024
  4. Jun 06, 2024
  5. May 20, 2024
  6. Apr 22, 2024
  7. Dec 14, 2023
  8. Nov 24, 2023
  9. Nov 02, 2023
    • Use token usage from Vertex response · cd5f964d
      Nicolas Dular authored
      Instead of relying on an estimate of 4 characters per token, we now
      use the actual token counts we receive from the Vertex API.
      In addition, we now track embeddings as a separate action and no
      longer count their usage twice, as both input and output.
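      An illustrative sketch of reading token counts from a Vertex-style response (the nested field names here are assumptions based on the shape of Vertex text-generation responses, not confirmed by this commit):

      ```ruby
      # Extract actual token counts from a parsed Vertex API response hash.
      # Field names ('metadata' -> 'tokenMetadata' -> ...) are assumed for
      # illustration; the real response shape may differ.
      def token_usage(vertex_response)
        metadata = vertex_response.dig('metadata', 'tokenMetadata') || {}

        {
          prompt_tokens: metadata.dig('inputTokenCount', 'totalTokens').to_i,
          completion_tokens: metadata.dig('outputTokenCount', 'totalTokens').to_i
        }
      end

      response = {
        'metadata' => {
          'tokenMetadata' => {
            'inputTokenCount' => { 'totalTokens' => 12 },
            'outputTokenCount' => { 'totalTokens' => 34 }
          }
        }
      }
      token_usage(response)
      # => { prompt_tokens: 12, completion_tokens: 34 }
      ```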
  10. Sep 11, 2023
  11. Sep 04, 2023
    • Track AI feature token usage · e1632fba
      Nicolas Dular authored and Alexandru Croitor committed
      Adds tracking to AI features with an approximate measurement of our
      token usage for Anthropic and Vertex.
      
      It enables us to group token usage per feature or per user.
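      The approximation mentioned above can be sketched in a few lines (the constant and method names are illustrative, not the actual GitLab tracking code):

      ```ruby
      # Rough heuristic used before real token counts were available:
      # treat every 4 characters of text as roughly 1 token.
      CHARACTERS_PER_TOKEN = 4

      def approximate_token_count(text)
        (text.length / CHARACTERS_PER_TOKEN.to_f).ceil
      end

      approximate_token_count('How do I rebase a branch?')  # 25 chars => 7 tokens
      ```

      This is the estimate that the later "Use token usage from Vertex response" commit replaces with actual counts from the API.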
  12. Aug 30, 2023
    • Update AI client SLI · 95c65597
      Jan Provaznik authored and Gosia Ksionek committed
      * With this change, the success ratio of AI requests is measured
        outside of the exponential backoff logic.
      * The `llm_chat_answers` SLI is replaced with the more generic
        `llm_completion`, which tracks the error ratio of all AI actions.
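      The first point can be sketched as follows (method and counter names are illustrative, not the actual GitLab SLI API): the success/error counters wrap the whole retried call, so a request that succeeds after several retries still counts as exactly one success.

      ```ruby
      # Count one success or one error per logical completion, measured
      # around the entire retry loop rather than per attempt.
      def track_llm_completion(counters)
        result = yield   # the full request, including exponential backoff retries
        counters[:success] += 1
        result
      rescue StandardError
        counters[:error] += 1
        raise
      end

      counters = Hash.new(0)
      track_llm_completion(counters) { 'answer' }
      counters  # => {:success=>1}
      ```

      Placing the measurement inside the backoff loop would instead record every failed attempt, inflating the error ratio for requests that eventually succeed.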
  13. Aug 28, 2023
  14. Aug 10, 2023
  15. Aug 02, 2023
  16. Aug 01, 2023
  17. Jun 21, 2023
  18. Jun 12, 2023
  19. May 18, 2023
  20. May 16, 2023
  21. May 11, 2023
    • Define different service names per LLM client · 401d4dae
      Patrick Bajao authored
      `Gitlab::Llm::Concerns::CircuitBreaker` requires `service_name` to
      be defined.
      
      Before this change, we used a single `service_name`, which meant
      all clients shared a single circuit. If one provider failed and the
      circuit opened, all providers would be affected.
      
      To prevent that, since we have different clients (e.g. OpenAI, Vertex,
      Anthropic), we now define a specific service name per client.
      
      This also includes a fix to the `ExponentialBackoff` concern to raise
      the correct exception and avoid a `NameError`.
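      A hedged sketch of the idea (the circuit-breaker internals below are simplified stand-ins for `Gitlab::Llm::Concerns::CircuitBreaker`, and the client classes are illustrative): keying circuit state by `service_name` gives each provider its own circuit, so an OpenAI outage no longer opens the circuit used by the Vertex or Anthropic clients.

      ```ruby
      module CircuitBreaker
        # One circuit-state entry per service_name; the block supplies a
        # fresh closed circuit the first time each service is seen.
        CIRCUITS = Hash.new { |hash, key| hash[key] = { open: false, failures: 0 } }

        def circuit
          CIRCUITS[service_name]
        end
      end

      class OpenAiClient
        include CircuitBreaker

        def service_name
          :open_ai
        end
      end

      class VertexClient
        include CircuitBreaker

        def service_name
          :vertex_ai
        end
      end

      # Tripping the OpenAI circuit leaves the Vertex circuit closed:
      OpenAiClient.new.circuit[:open] = true
      VertexClient.new.circuit[:open]  # => false
      ```

      With a single shared `service_name`, both clients would have received the same hash above, and the Vertex check would have returned `true`.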
  22. May 10, 2023
  23. May 05, 2023
  24. May 04, 2023
  25. Apr 26, 2023