Retry simulation ingress rule creation

Alejo Carballude requested to merge feature/simulation_ingress_rule_retry into develop

Related tasks

Context

Simulations sometimes fail to connect to the websocket server during launch, returning the following error:

Error: &{5505 500 RunningSimulation node could not be created. [] websocket: bad handshake

A previous MR made the websocket creation retry a certain number of times to give the load balancer enough time to set the rule.

In some cases, however, the Kubernetes cluster rejects updates to the ingress rule because another change is already in progress.

Change

This MR retries creating the simulation's ingress rule whenever the Kubernetes server rejects the update. Retries use exponential backoff and continue for up to 10 minutes.

Other information

Additional documentation

Edited by Alejo Carballude
