- Mar 29, 2024
Jessie Young authored
- This is behind a feature flag for testing
- Uses request body format needed for messages API, which is now supported by the AI Gateway: gitlab-org/modelops/applied-ml/code-suggestions/ai-assist!668
- #444629
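For reference, a messages-API request body in the Anthropic style generally has the shape sketched below; the model name, token limit, and prompt are illustrative placeholders rather than what the flagged code path actually sends.

```ruby
require 'json'

# Illustrative messages-API payload (field names follow Anthropic's public
# messages API; the concrete values are placeholders).
body = {
  model: 'claude-3-sonnet-20240229',
  max_tokens: 1024,
  messages: [
    { role: 'user', content: 'Explain what this merge request changes.' }
  ]
}

puts JSON.generate(body)
```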
- Dec 04, 2023
This adds client information to our AI action tracking by passing the user agent forward and parsing it. For now we only distinguish between vscode and web.
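A minimal sketch of this kind of client detection, assuming a hypothetical helper; the matching rule is an illustration, not the exact parser.

```ruby
# Hypothetical helper: bucket a raw User-Agent header into the two client
# kinds mentioned above. Anything that is not clearly VS Code counts as web.
def client_from_user_agent(user_agent)
  user_agent.to_s.match?(/vscode/i) ? 'vscode' : 'web'
end

client_from_user_agent('vscode-extension-example/1.0')    # => "vscode"
client_from_user_agent('Mozilla/5.0 (X11; Linux x86_64)') # => "web"
```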
- Nov 30, 2023
Pavel Shutsin authored
- Nov 15, 2023
Pavel Shutsin authored
This model will contain the related system state for a specific message.
- Nov 14, 2023
Alejandro Rodríguez authored
- Nov 08, 2023
Pavel Shutsin authored
- Nov 07, 2023
Pavel Shutsin authored
Pavel Shutsin authored
Pure refactoring
- Oct 31, 2023
Alexandru Croitor authored
This replaces the legacy openai_experimentation FF usages with the more generic ai_global_switch ops FF.
Changelog: changed
EE: true
- Oct 25, 2023
This FF is added to replace the legacy openai_experimentation FF.
- Oct 12, 2023
It makes completion classes aware of the prompt message object. Pure refactoring, no user-facing changes.
- Oct 09, 2023
It reworks AI response emitting to work with the AiMessage model. No user-facing changes.
- Oct 05, 2023
Pavel Shutsin authored
- Sep 28, 2023
Pavel Shutsin authored
Related functionality was removed, so this code path is no longer used.
- Sep 27, 2023
Sidekiq can cause issues with code reloading in development. To overcome this, it's now possible to set `LLM_DEVELOPMENT_SYNC_EXECUTION=1` in development, which executes AI actions synchronously.
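A sketch of how such a switch could be wired up; `CompletionWorker` is mentioned elsewhere in this history, but the dispatching method and its argument list here are assumptions.

```ruby
# Hypothetical dispatch: run the AI action inline when the development
# toggle is set, otherwise enqueue the Sidekiq worker as usual.
def schedule_completion(user_id, resource_id, options = {})
  if ENV['LLM_DEVELOPMENT_SYNC_EXECUTION'] == '1'
    CompletionWorker.new.perform(user_id, resource_id, options)
  else
    CompletionWorker.perform_async(user_id, resource_id, options)
  end
end
```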
- Sep 05, 2023
* adds SLI for overall time needed to serve user's request (currently 20s)
* adds histogram for overall worker duration (with service/category labels)
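To illustrate the histogram half of this change, here is a sketch using the standalone prometheus-client gem; the metric name, buckets, and label values are placeholders, and GitLab's own metrics wrappers are not shown.

```ruby
require 'prometheus/client'

registry = Prometheus::Client.registry

# Placeholder histogram for overall worker duration, labelled by provider
# and feature category; bucket boundaries are illustrative.
duration = registry.histogram(
  :llm_completion_worker_duration_seconds,
  docstring: 'Total time spent serving an AI request',
  labels: [:service, :category],
  buckets: [1, 5, 10, 20, 30, 60]
)

started = Process.clock_gettime(Process::CLOCK_MONOTONIC)
# ... perform the AI action ...
elapsed = Process.clock_gettime(Process::CLOCK_MONOTONIC) - started

duration.observe(elapsed, labels: { service: 'anthropic', category: 'duo_chat' })
```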
- Sep 04, 2023
Adds tracking to AI features with an approximate measurement of our token usage for Anthropic and Vertex. It enables us to group token usage per feature or per user.
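The approximation itself could be as simple as the sketch below; dividing character counts by four is a common rule of thumb and is only meant to convey the idea, not the exact formula used.

```ruby
# Rough token estimate: ~4 characters per token is a common heuristic for
# English text, good enough for aggregate metrics but not for billing.
def approximate_token_count(text)
  (text.to_s.length / 4.0).ceil
end

# Hypothetical aggregation: bucket usage per feature so it can be grouped later.
usage = Hash.new(0)
usage['summarize_comments'] += approximate_token_count('Please summarize the discussion.')
usage['duo_chat']           += approximate_token_count('How do I revert a commit?')
```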
- Aug 30, 2023
* with this change, the success ratio of AI requests is measured outside of the exponential backoff
* the llm_chat_answers SLI is replaced with the more generic llm_completion, which tracks the error ratio of all AI actions
- Aug 22, 2023
This adds an optional client_subscription_id to the ai_completion_response subscription. In addition, it fixes the GraphqlTriggers to be able to deal with optional subscription arguments. This prepares us to allow listening only to a specific client_subscription_id on the websocket, and to only broadcast messages based on a user_id. This is important for the chat as well as other aiActions. This has no breaking changes, nor does it change how the subscription gets used.
Changelog: changed
EE: true
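As an illustration, triggering the subscription with an optional argument might look like the sketch below; the event and argument names come from this entry, while compacting nil arguments and the `GitlabSchema` call site are assumptions about the wiring.

```ruby
# Hypothetical trigger: broadcast an AI completion to subscribers.
# graphql-ruby matches subscribers on the argument hash, so optional
# arguments that were not provided are dropped before triggering.
def trigger_ai_completion_response(user_id:, response:, client_subscription_id: nil)
  arguments = {
    user_id: user_id,
    client_subscription_id: client_subscription_id
  }.compact

  GitlabSchema.subscriptions.trigger('aiCompletionResponse', arguments, response)
end
```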
- Aug 18, 2023
If the current user is viewing some code blob, the user should be able to ask the chat to explain the code. We will inject the blob's code into the zeroshot executor's prompt and ask the LLMs to directly explain the code when instructed.

To make that possible, we will make use of the Referer header to detect if a user is viewing a Blob. The referer url will be added as an option to be extracted by CompletionWorker. CompletionWorker will then attempt to resolve and authorize the blob pointed to by the referer url. If the blob is found and authorized, it will be available as a context attribute, 'extra_resource'. The zeroshot executor can then use the attribute to include the code blob and additional prompt.

The change is guarded with a feature flag.
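A simplified sketch of the resolution step described above; the route pattern, the `:read_code` ability, and the overall flow are assumptions about how such a lookup could be written, not the actual implementation.

```ruby
require 'uri'

# Hypothetical resolver: given the Referer URL, find the blob being viewed
# and expose it only if the acting user may read the project's code.
def extra_resource_from_referer(user, referer)
  path = URI.parse(referer.to_s).path
  # Illustrative route shape: /<namespace>/<project>/-/blob/<ref>/<file path>
  match = path.match(%r{\A/(?<project_path>.+)/-/blob/(?<ref>[^/]+)/(?<file>.+)\z})
  return unless match

  project = Project.find_by_full_path(match[:project_path])
  return unless project && Ability.allowed?(user, :read_code, project)

  project.repository.blob_at(match[:ref], match[:file])
rescue URI::InvalidURIError
  nil
end
```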
- Aug 04, 2023
Nicolas Dular authored
We fixed this before by setting `skip_cache: true` by default in `ExecuteMethodService`. However, `SummarizeSubmittedReviewService` was not going through the `ExecuteMethodService`. As we only want to store messages from the `chat` action, and to fix this for the future, the logic is now reversed and it's required to set `cache_response: true` explicitly.
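A sketch of the reversed default described here, with hypothetical option plumbing: nothing is cached unless a feature opts in, and only the chat action does.

```ruby
# Hypothetical options builder: responses are only stored for the chat action;
# every other AI action must opt in to caching explicitly.
def completion_options(action_name, extra = {})
  { cache_response: action_name == :chat }.merge(extra)
end

completion_options(:chat)                       # => { cache_response: true }
completion_options(:summarize_submitted_review) # => { cache_response: false }
```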
- Jul 27, 2023
* also updates the outdated product_analytics category
- Jul 26, 2023
Nicolas Dular authored
We no longer want to store and show AI messages on the chat if not explicitly enabled by the feature. It is now only enabled for the `chat` AI action. We do this by setting `skip_cache = true` by default. It also fixes a bug where `skip_cache` was not passed along properly to the GraphqlSubscriptionResponseService.
Changelog: fixed
EE: true
- Jul 13, 2023
Deal with optional resource_id in service and worker
Makes sure we allow using nil as the resource_id in the service and the worker.
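Allowing a nil resource_id could be handled as in this sketch; `GlobalID::Locator` is used purely as an illustration of resolving a polymorphic id, and the helper name is made up.

```ruby
require 'globalid'

# Hypothetical lookup: a nil resource_id is valid and simply means the AI
# action is not tied to any particular record.
def find_resource(resource_id)
  return if resource_id.nil?

  GlobalID::Locator.locate(resource_id)
end
```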
- Jun 20, 2023
Nicolas Dular authored
When the Agent picks the `SummarizeComments` tool, we are internally calling the `GenerateSummaryService`, which also stores the response to the cache and broadcasts it via the GraphQL subscription. With `skip_cache: true` we were already able to not store the response to the cache. This change renames the `skip_cache` option to `internal_request` and also no longer broadcasts the response to the client, which previously resulted in duplicated responses in the chat.
- Jun 13, 2023
Jan Provaznik authored
It's possible that some AI chain tools use the completion service (e.g. the SummarizeComments tool); in this case we want to avoid storing the request/response in the cache because it's only an intermediate step.
- May 31, 2023
It removes a redundant method from the code.
- May 23, 2023
- May 16, 2023
Adds a new logger and debugging statements for AI features.
- May 09, 2023
* for each AI mutation it generates a unique ID
* this ID is also part of the subscription message so clients can pair responses with original requests
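Generating and echoing the ID could look like this sketch; `SecureRandom.uuid` is a natural choice, while the payload shape is an assumption.

```ruby
require 'securerandom'

# Hypothetical flow: the mutation generates a unique id, returns it to the
# client, and the same id is included in every subscription message for
# that request so the client can pair them up.
request_id = SecureRandom.uuid

subscription_payload = {
  request_id: request_id,
  response_body: 'Here is the summary you asked for...'
}

pp subscription_payload
```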
- May 04, 2023
Alexandru Croitor authored
- Moved the Completions::Factory to a more generic module
- Moved ExponentialBackoff to a generic AI concern
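As a generic illustration of the ExponentialBackoff idea (not GitLab's concern itself), a retry helper might look like this; the retry count and base delay are placeholder values.

```ruby
# Generic sketch: retry a block with doubling delays between attempts.
# Real code would typically add jitter and respect the provider's
# rate-limit headers as well.
def with_exponential_backoff(max_retries: 3, base_delay: 1.0)
  attempts = 0
  begin
    yield
  rescue StandardError
    attempts += 1
    raise if attempts > max_retries

    sleep(base_delay * (2**(attempts - 1)))
    retry
  end
end

# with_exponential_backoff { call_ai_provider }  # call_ai_provider is hypothetical
```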
- Apr 13, 2023
Patrick Bajao authored
Before we make a call to the AI API, we need to check if the user who actually executed the action can read the resource and if the resource can actually be sent to AI (utilize `#send_to_ai?`). We already have the permission check at the mutation level and the `#send_to_ai?` check in `Llm::BaseService`. But it is possible that those permissions change after the job is enqueued.
No changelog since this is still behind a feature flag.
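A sketch of the kind of guard this describes, re-run inside the worker; `#send_to_ai?` comes from the entry above, while the ability name and helper are illustrative.

```ruby
# Hypothetical guard in the worker: permissions may have changed between
# enqueueing the job and executing it, so check again before calling the API.
def authorized_for_ai?(user, resource)
  return false unless resource.respond_to?(:send_to_ai?) && resource.send_to_ai?

  user.can?(:read_resource, resource) # ability name is a placeholder
end
```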
- Apr 12, 2023
This adds backend services and a Sidekiq job to handle AI completion requests to the OpenAI API, with the potential to extend it beyond OpenAI.
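To make the shape of such a job concrete, here is a minimal Sidekiq worker sketch; the class name, arguments, and steps are hypothetical stand-ins, not the actual GitLab classes.

```ruby
require 'sidekiq'

# Hypothetical completion worker: enqueued by a backend service, it builds
# the prompt, calls the AI provider, and hands the response onwards.
class AiCompletionWorker
  include Sidekiq::Job

  def perform(user_id, resource_id, method_name)
    # 1. Load the user and the (optional) resource.
    # 2. Build the prompt for the requested AI method.
    # 3. Call the provider API and broadcast/store the response.
  end
end

# Enqueued from a service object:
# AiCompletionWorker.perform_async(user.id, resource&.id, 'summarize_comments')
```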