Verified Commit 1a8363e3 authored by Bob Van Landuyt

Add blurb about deployments

parent 0f64632e
1 merge request!166AI-gateway API & architecture
@@ -401,3 +401,22 @@ When storing the embedding, we should make sure we include the model
and provider data. When embeddings are used to generate a prompt, we
could include that metadata in the payload so we can judge the quality
of the embedding.
## Deployment
Currently, the model-gateway that will become the AI-gateway is being
deployed using Helm from the project repository in
[`gitlab-org/modelops/applied-ml/code-suggestions/ai-assist`](https://gitlab.com/gitlab-org/modelops/applied-ml/code-suggestions/ai-assist).
It is deployed to a Kubernetes cluster in its own project. There is a
staging environment that is currently used directly by engineers for
testing.
In the future, this will be deployed using
[Runway](https://gitlab.com/gitlab-com/gl-infra/platform/runway/). At
that time, there will be a production and a staging deployment. The
staging deployment can be used for automated QA runs that can stop a
deployment from reaching production.
Further testing strategy is being discussed in
[#114](https://gitlab.com/gitlab-org/modelops/applied-ml/code-suggestions/ai-assist/-/issues/114).
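
The staging gate described above could be sketched roughly as follows.
This is only an illustration of the idea: `QAResult`, `may_promote`,
and `run_pipeline` are hypothetical names, not part of Runway or of the
existing Helm deployment, and the real promotion logic is still to be
decided in the issue linked above.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class QAResult:
    """Outcome of one automated QA check run against staging (hypothetical)."""
    name: str
    passed: bool


def may_promote(results: Iterable[QAResult]) -> bool:
    """Return True only if at least one check ran and every check passed."""
    results = list(results)
    return bool(results) and all(r.passed for r in results)


def run_pipeline(
    deploy: Callable[[str], None],
    run_qa: Callable[[], Iterable[QAResult]],
) -> str:
    """Deploy to staging, run QA there, and promote only on success."""
    deploy("staging")
    if not may_promote(run_qa()):
        # A failed (or empty) QA run stops the deployment before production.
        return "blocked"
    deploy("production")
    return "promoted"
```

Treating an empty QA run as a failure is a deliberate assumption here:
a pipeline in which no checks ran should not count as a passing one.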