Add LiteLLM client to handle requests to self-managed OpenAI compatible models
What does this merge request do and why?
Mistral and Mixtral models are expected to be deployed behind an OpenAI-compatible API server (such as vLLM). Let's add a handler for these requests.
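A minimal sketch of how such a client could build its request parameters. This is illustrative, not the MR's actual implementation: the class name, the `openai/` provider prefix (which LiteLLM uses to route a model through its OpenAI-compatible code path), and the endpoint URL are assumptions.

```python
# Hypothetical sketch: wrap request construction for a self-managed
# OpenAI-compatible server (e.g. vLLM) behind a small client.

class LiteLlmClient:
    # LiteLLM routes a model through its OpenAI-compatible handler
    # when the model name carries the "openai/" provider prefix.
    PROVIDER_PREFIX = "openai"

    def __init__(self, api_base: str, api_key: str = ""):
        self.api_base = api_base  # e.g. "http://localhost:8000/v1" (vLLM)
        self.api_key = api_key    # may be empty for self-managed servers

    def completion_params(self, model: str, messages: list[dict]) -> dict:
        """Build kwargs suitable for litellm.completion(**params)."""
        return {
            "model": f"{self.PROVIDER_PREFIX}/{model}",
            "messages": messages,
            "api_base": self.api_base,
            "api_key": self.api_key,
        }


client = LiteLlmClient(api_base="http://localhost:8000/v1")
params = client.completion_params(
    "mistral", [{"role": "user", "content": "Hello"}]
)
```

The resulting dict would then be passed to `litellm.completion(**params)`, letting LiteLLM talk to the self-managed server instead of a hosted provider.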
Related issues:
Edited by Igor Drozdov