Code Suggestions Performance improvements
This epic collects the issues and discussions aimed specifically at improving performance for users of Code Suggestions. Performance here covers both latency improvements and perceived performance.

## Possible improvement topics

### Latency improvements

* Look for improvements in
  * Rails layer
  * Model gateway
  * Infrastructure setup
* Prompt improvements for code generation
* Fix long-running prompts
* WebSockets might be a possibility to cut out DNS + Rails time
* gRPC from the GitLab instance to the Model Gateway
* Local code LLM (next to the SM instance, self-hosted)
* Local machine code completion LLM

### Perceived performance improvements

* [Better IDE UX indicator when we request next to line](https://gitlab.com/gitlab-org/gitlab/-/issues/418999)
* [Larger, better status bar indicator is in latest version](https://gitlab.com/gitlab-org/gitlab-vscode-extension/-/issues/765#note_1672638789)
* Streaming for code generation at least (later maybe another code completion model supports streaming) if we move forward
* Better local caching, type-in-cache
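To make the "type-in-cache" idea concrete, here is a minimal sketch of one way it could work: if the user keeps typing characters that match a suggestion already received, the remainder is served locally instead of triggering a new request. The names `TypeAheadCache`, `CachedSuggestion`, `store`, and `lookup` are hypothetical and are not taken from any GitLab extension; the eviction policy (keep the most recent 20 entries) is also an assumption.

```typescript
// Hypothetical sketch of a type-ahead ("type-in") cache for code suggestions.
// Nothing here reflects the actual GitLab extension code.

interface CachedSuggestion {
  prefix: string;     // text before the cursor when the suggestion was requested
  completion: string; // text the model proposed after that prefix
}

class TypeAheadCache {
  private entries: CachedSuggestion[] = [];
  private readonly maxEntries: number;

  constructor(maxEntries: number = 20) {
    this.maxEntries = maxEntries;
  }

  // Remember a suggestion; the newest entry goes first, the oldest is evicted.
  store(prefix: string, completion: string): void {
    this.entries.unshift({ prefix, completion });
    if (this.entries.length > this.maxEntries) {
      this.entries.pop();
    }
  }

  // Return the still-unused part of a cached completion if everything the
  // user typed since the request matches the start of that completion.
  lookup(currentPrefix: string): string | undefined {
    for (const { prefix, completion } of this.entries) {
      if (!currentPrefix.startsWith(prefix)) continue;
      const typed = currentPrefix.slice(prefix.length);
      if (typed.length < completion.length && completion.startsWith(typed)) {
        return completion.slice(typed.length);
      }
    }
    return undefined; // cache miss: the caller would request a new suggestion
  }
}

// Usage: the user typed "ite" after the original request, which matches the
// cached completion, so the rest can be shown without a network round trip.
const suggestionCache = new TypeAheadCache();
suggestionCache.store("const total = ", "items.length;");
const remainder = suggestionCache.lookup("const total = ite");
console.log(remainder); // "ms.length;"
```

A cache like this only helps perceived performance while the user types along the suggested text; once the input diverges, `lookup` returns `undefined` and a normal request is still needed.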
epic