Skip to content

PoC: Embedding storage and AI in Elastic

This issue captures findings from a proof of concept to explore the use of Elasticsearch (or OpenSearch) to store embeddings that are generated by a model on ES (instead of on postgres) and generating an answer to a question by using ES exclusively. In other words, replace the back-and-forth process Duo Chat follows to answer questions about gitlab docs.

Given the architecture used by Duo chat currently:

Screenshot_2024-02-01_at_14.48.06

Elastic can help with the following use-cases: Screenshot_2024-02-01_at_14.48.13

Screenshot_2024-02-01_at_14.48.23

Screenshot_2024-02-01_at_14.48.43Example:

Screenshot 2024-02-01 at 13.59.03.png

Screenshot_2024-02-01_at_14.48.52

Screenshot_2024-02-01_at_14.49.02

The AI Gateway can potentially be used to serve models to .com, dedicated instances and SM instances.

Screenshot_2024-02-01_at_14.49.15

Screenshot_2024-02-01_at_15.42.33

Edited by Madelein van Niekerk