This repository has been archived by the owner on Feb 6, 2024. It is now read-only.
Current situation
Today, the benchmark deploys 64 emojivoto instances (3 pods each). It then calls instance 0 for a fixed duration, then instance 1 for the same duration, and so on. Each call sends 128 requests to the server at once, waits briefly (depending on the configured RPS), then sends another 128 requests.
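For concreteness, the burst cadence described above can be sketched as follows. `BURST_SIZE` and the RPS-to-wait conversion are illustrative assumptions about how the pause is derived, not the benchmark's actual code:

```python
BURST_SIZE = 128  # requests fired back-to-back per burst, per the description above

def burst_interval(target_rps: float) -> float:
    """Seconds the client idles between bursts so the long-run average
    request rate matches the configured RPS (a sketch, not the
    benchmark's real implementation)."""
    return BURST_SIZE / target_rps

# At a configured 1000 RPS the client fires 128 requests at once,
# then waits ~0.128 s before the next burst.
```

The key point: the server sees the full 128-request burst land in one instant, regardless of how low the configured RPS is — only the idle gap between bursts changes.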
Impact
This does not align with real-world scenarios. Traffic this 'bursty' is extremely unlikely in production, and it is uncommon to send requests to only one namespace at a time.
Ideal future situation
Requests should be sent to all namespaces at a uniform release rate. Most load clients can do this — IIRC from looking at it a while back, the wrk2 fork used here actually introduces the batching behavior; standard wrk2 does not behave this way.
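A uniform open-loop schedule of the kind proposed here could be sketched like this; the namespace names and round-robin target selection are illustrative assumptions, not how wrk2 or the benchmark actually picks targets:

```python
import itertools

# Illustrative namespace names; the real benchmark's naming may differ.
NAMESPACES = [f"emojivoto-{i}" for i in range(64)]

def uniform_schedule(total_rps: float, n_requests: int):
    """Yield (send_offset_seconds, namespace) pairs: one request every
    1/total_rps seconds, round-robined across all namespaces instead of
    128-request bursts aimed at a single one."""
    interval = 1.0 / total_rps
    targets = itertools.cycle(NAMESPACES)
    for i in range(n_requests):
        yield i * interval, next(targets)
```

At 1000 RPS this releases one request per millisecond, spread evenly across namespaces — the constant-throughput pacing that upstream wrk2 is built around, rather than batches.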
An alternative would be to run multiple replicas within a single namespace rather than 64 namespaces — for example, 6 namespaces with 10 replicas of each pod. This is both more realistic and gives better load-balancing algorithms an opportunity to shine.
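As a quick sanity check on scale (assuming emojivoto's usual 3 deployments per namespace), the proposed layout keeps total pod count comparable to today's:

```python
# 64 namespaces x 3 single-replica pods (current layout)
current_pods = 64 * 3
# 6 namespaces x 3 pods x 10 replicas each (proposed layout)
proposed_pods = 6 * 3 * 10
print(current_pods, proposed_pods)  # 192 vs. 180 pods: similar cluster load
```

So the change shifts where replication happens (within a namespace instead of across namespaces) without materially changing the resource footprint of the benchmark.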