[Research] Possible performance improvements in Lens expressions #182551

markov00 · 2024-04-30T14:31:04Z

markov00
Apr 30, 2024
Collaborator

I will add here some of the findings related to performance bottlenecks the current Lens/SearchStrategy architecture and possible improvements to these solutions.
Please add comments on each of these findings, or add new ones if you found something that is worth considering as performance improvement

Current Test environment and parameters:

Test 1: single chart with date_histogram and terms aggregation

ES running on a single node locally on M1 Max Mac.
10M documents ingested inside ES
breakdown by 30 series (no other)
date_histogram over 3 days at with buckets every 5 minutes: ~864 date buckets per series
a total of 864 * 30 data points (counting also empty buckets): ~26k data points

Panel JSON

[
    {
        "type": "lens",
        "gridData": {
            "x": 0,
            "y": 0,
            "w": 48,
            "h": 15,
            "i": "9f528643-9481-418b-b2df-ccb1ec708e0c"
        },
        "panelIndex": "9f528643-9481-418b-b2df-ccb1ec708e0c",
        "embeddableConfig": {
            "attributes": {
                "title": "check",
                "description": "",
                "visualizationType": "lnsXY",
                "type": "lens",
                "references": [
                    {
                        "type": "index-pattern",
                        "id": "cffc1414-a0b2-4ea8-b1d0-690289326c93",
                        "name": "indexpattern-datasource-layer-c8abd0c4-a2b4-43b8-97c8-116e65fefc99"
                    }
                ],
                "state": {
                    "visualization": {
                        "legend": {
                            "isVisible": true,
                            "position": "right"
                        },
                        "valueLabels": "hide",
                        "fittingFunction": "Linear",
                        "axisTitlesVisibilitySettings": {
                            "x": true,
                            "yLeft": true,
                            "yRight": true
                        },
                        "tickLabelsVisibilitySettings": {
                            "x": true,
                            "yLeft": true,
                            "yRight": true
                        },
                        "labelsOrientation": {
                            "x": 0,
                            "yLeft": 0,
                            "yRight": 0
                        },
                        "gridlinesVisibilitySettings": {
                            "x": true,
                            "yLeft": true,
                            "yRight": true
                        },
                        "preferredSeriesType": "line",
                        "layers": [
                            {
                                "layerId": "c8abd0c4-a2b4-43b8-97c8-116e65fefc99",
                                "accessors": [
                                    "e0dee44e-457f-48a4-afc2-5221722e3ea7",
                                    "0ad75b08-3083-4e1e-bd49-feedc82ed9b6"
                                ],
                                "position": "top",
                                "seriesType": "line",
                                "showGridlines": false,
                                "layerType": "data",
                                "colorMapping": {
                                    "assignments": [],
                                    "specialAssignments": [
                                        {
                                            "rule": {
                                                "type": "other"
                                            },
                                            "color": {
                                                "type": "loop"
                                            },
                                            "touched": false
                                        }
                                    ],
                                    "paletteId": "eui_amsterdam_color_blind",
                                    "colorMode": {
                                        "type": "categorical"
                                    }
                                },
                                "xAccessor": "b1b9ee06-1bef-4b77-b56a-5fae4e0e40db",
                                "splitAccessor": "783bd726-3eae-41a9-bdd1-d2aebbb0446d"
                            }
                        ],
                        "emphasizeFitting": true
                    },
                    "query": {
                        "query": "",
                        "language": "kuery"
                    },
                    "filters": [],
                    "datasourceStates": {
                        "formBased": {
                            "layers": {
                                "c8abd0c4-a2b4-43b8-97c8-116e65fefc99": {
                                    "columns": {
                                        "b1b9ee06-1bef-4b77-b56a-5fae4e0e40db": {
                                            "label": "@timestamp",
                                            "dataType": "date",
                                            "operationType": "date_histogram",
                                            "sourceField": "@timestamp",
                                            "isBucketed": true,
                                            "scale": "interval",
                                            "params": {
                                                "interval": "auto",
                                                "includeEmptyRows": true,
                                                "dropPartials": false
                                            }
                                        },
                                        "e0dee44e-457f-48a4-afc2-5221722e3ea7": {
                                            "label": "Average of bytes",
                                            "dataType": "number",
                                            "operationType": "average",
                                            "sourceField": "bytes",
                                            "isBucketed": false,
                                            "scale": "ratio",
                                            "params": {
                                                "emptyAsNull": true
                                            }
                                        },
                                        "783bd726-3eae-41a9-bdd1-d2aebbb0446d": {
                                            "label": "Top 30 values of geo.dest",
                                            "dataType": "string",
                                            "operationType": "terms",
                                            "scale": "ordinal",
                                            "sourceField": "geo.dest",
                                            "isBucketed": true,
                                            "params": {
                                                "size": 30,
                                                "orderBy": {
                                                    "type": "alphabetical",
                                                    "fallback": false
                                                },
                                                "orderDirection": "asc",
                                                "otherBucket": false,
                                                "missingBucket": false,
                                                "parentFormat": {
                                                    "id": "terms"
                                                },
                                                "include": [],
                                                "exclude": [],
                                                "includeIsRegex": false,
                                                "excludeIsRegex": false
                                            }
                                        },
                                        "0ad75b08-3083-4e1e-bd49-feedc82ed9b6": {
                                            "label": "Maximum of machine.ram",
                                            "dataType": "number",
                                            "operationType": "max",
                                            "sourceField": "machine.ram",
                                            "isBucketed": false,
                                            "scale": "ratio",
                                            "params": {
                                                "emptyAsNull": true
                                            }
                                        }
                                    },
                                    "columnOrder": [
                                        "783bd726-3eae-41a9-bdd1-d2aebbb0446d",
                                        "b1b9ee06-1bef-4b77-b56a-5fae4e0e40db",
                                        "e0dee44e-457f-48a4-afc2-5221722e3ea7",
                                        "0ad75b08-3083-4e1e-bd49-feedc82ed9b6"
                                    ],
                                    "incompleteColumns": {},
                                    "sampling": 1
                                }
                            }
                        },
                        "indexpattern": {
                            "layers": {}
                        },
                        "textBased": {
                            "layers": {}
                        }
                    },
                    "internalReferences": [],
                    "adHocDataViews": {}
                }
            },
            "enhancements": {}
        }
    }
]

ES query

POST /logstash-*/_async_search?batched_reduce_size=64&ccs_minimize_roundtrips=true&wait_for_completion_timeout=1000ms&keep_on_completion=true&keep_alive=60000ms&ignore_unavailable=true&preference=1714730816670
{
  "aggs": {
    "0": {
      "terms": {
        "field": "geo.dest",
        "order": {
          "_key": "asc"
        },
        "size": 30
      },
      "aggs": {
        "1": {
          "date_histogram": {
            "field": "@timestamp",
            "fixed_interval": "5m",
            "time_zone": "Europe/Rome",
            "extended_bounds": {
              "min": 1714341600000,
              "max": 1714600800000
            }
          },
          "aggs": {
            "2": {
              "avg": {
                "field": "bytes"
              }
            },
            "3": {
              "max": {
                "field": "machine.ram"
              }
            }
          }
        }
      }
    }
  },
  "size": 0,
  "fields": [
    {
      "field": "@timestamp",
      "format": "date_time"
    },
    {
      "field": "relatedContent.article:modified_time",
      "format": "date_time"
    },
    {
      "field": "relatedContent.article:published_time",
      "format": "date_time"
    },
    {
      "field": "utc_time",
      "format": "date_time"
    }
  ],
  "script_fields": {},
  "stored_fields": [
    "*"
  ],
  "runtime_mappings": {},
  "_source": {
    "excludes": []
  },
  "query": {
    "bool": {
      "must": [],
      "filter": [
        {
          "range": {
            "@timestamp": {
              "format": "strict_date_optional_time",
              "gte": "2024-04-28T22:00:00.000Z",
              "lte": "2024-05-01T22:00:00.000Z"
            }
          }
        }
      ],
      "should": [],
      "must_not": []
    }
  }
}

Profile file

Profiling results with hot ES cache and client cache from a click of the refresh button to render-complete event

The overall process takes ~ 1.2seconds to complete.

~60ms between the click event and the call to the fetch function. Looks like most time is spent on: dashboard: forceRefresh, on embeddable: setInitializationFinished and on lens: renderUserMessages.

~550ms from fetch request to tabified data. As described in #182551 (comment) and #182551 (comment) we have a fetch request due to mentioned reasons.

~650ms A long time is spent on Elastic-charts to compute the geometries to render. We can probably skip this passage, avoid generating geometries objects, and render the values directly, but require an alternative method to detect geometries under the mouse cursor.

markov00 · 2024-05-03T09:44:21Z

markov00
May 3, 2024
Collaborator Author

Unnecessary multiple requests with the same esagg query (FIXED in #182919)

When a search is sent to ES and a response is received, the search service is looking if the request needs a post-flight request.

kibana/src/plugins/data/common/search/search_source/search_source.ts

Lines 539 to 548 in 74fdd1b

    
           if (!this.hasPostFlightRequests()) { 
        
             obs.next(this.postFlightTransform(response)); 
        
             obs.complete(); 
        
           } else { 
        
             // Treat the complete response as partial, then run the postFlightRequests. 
        
             obs.next({ 
        
               ...this.postFlightTransform(response), 
        
               isPartial: true, 
        
               isRunning: true, 
        
             });

If needed, it transforms the response to a partial response and update the body with the postflight request.
This works correctly if the postflight is actually necessary, but due to the current implementation the postflight request is always "applied" even if not needed, causing a subsequent request to be sent to ES.
This results to an increase of:

more time spent unnecessary before returning the results to the client
1 more unnecessary search strategy that cache check
1 more unnecessary run of tabify

Analysis
The current method that checks if a request needs a subsequent post-flight request relies on a loose check from the function hasPostFlightRequests. This function checks if the agg property type.postFlightRequest is a function.

kibana/src/plugins/data/common/search/search_source/search_source.ts

Lines 474 to 483 in 74fdd1b

    
           private hasPostFlightRequests() { 
        
             const aggs = this.getField('aggs'); 
        
             if (aggs instanceof AggConfigs) { 
        
               return aggs.aggs.some( 
        
                 (agg) => agg.enabled && typeof agg.type.postFlightRequest === 'function' 
        
               ); 
        
             } else { 
        
               return false; 
        
             } 
        
           }

This function is there even if is not required. For example in a terms aggregation without the other bucket the function is still there but just return its identity

kibana/src/plugins/data/common/search/aggs/buckets/terms.ts

Line 93 in 74fdd1b

    
           postFlightRequest: createOtherBucketPostFlightRequest(constructSingleTermOtherFilter),

All the other cases this is defaulted to an identity function, so the hasPostFlightRequests function will always return true.

kibana/src/plugins/data/common/search/aggs/agg_type.ts

Line 311 in 74fdd1b

this.postFlightRequest = config.postFlightRequest || identity;

3 replies

ppisljar May 6, 2024
Collaborator

lets fix this

thomasneirynck May 10, 2024
Maintainer

I created this issue to track #183199

markov00 May 13, 2024
Collaborator Author

Peter already created an issue and a PR for that #182919

markov00 · 2024-05-03T09:44:32Z

markov00
May 3, 2024
Collaborator Author

`wait_for_completion_timeout` value is too low and can't process, without delays, a full response

This parameter, used in async search, describes the timeout before returning asynch search with a partial result..
This parameter is currently set to 200ms.

kibana/src/plugins/data/config.ts

Line 58 in b8d8c73

waitForCompletion: schema.duration({ defaultValue: '200ms' }),

After this 200ms interval the polling mechanism kicks in and the results then are just delayed everytime by at least ~300ms

kibana/src/plugins/data/common/search/poll_search.ts

Lines 20 to 35 in b8d8c73

    
           const getPollInterval = (elapsedTime: number): number => { 
        
             if (typeof pollInterval === 'number') return pollInterval; 
        
             else { 
        
               // if static pollInterval is not provided, then use default back-off logic 
        
               switch (true) { 
        
                 case elapsedTime < 1500: 
        
                   return 300; 
        
                 case elapsedTime < 5000: 
        
                   return 1000; 
        
                 case elapsedTime < 20000: 
        
                   return 2500; 
        
                 default: 
        
                   return 5000; 
        
               } 
        
             } 
        
           };

Probably I don't have enough knowledge in that, but I don't see any major drawback to increase this value to at least 1s as proposed here #157837 (comment) or even more.
The main drawback with that is an open connection between ES and Kibana that last for ~1 second, instead of opening and closing a new one 5 times in the same time interval.

3 replies

markov00 May 3, 2024
Collaborator Author

from @dej611

I think it makes sense to increase it.
From some experiments we saw, in the past, that "quick" responses from ES where within 150ms but it the test didn't have any specific statistical significance but it was enough to increase it from 100ms to 200ms. I think pushing it to 400/500ms might be worth. wdyt @ppisljar ?

ppisljar May 6, 2024
Collaborator

i think we need to be careful with increasing this values as they might lead to substantial increase of open requests to elasticsearch. Lets discuss with es team before going forward with this.

thomasneirynck May 10, 2024
Maintainer

++ @ppisljar for recommendation to double check with ES-team. I do hope we can "push down" this concern. Either we initiate unnecessary polls or we keep the default open longer. The former is the current situation, and there is quite some additional overhead in a full round-trip in multiple layers of the stack.

markov00 · 2024-05-03T09:44:43Z

markov00
May 3, 2024
Collaborator Author

`getXDomain` can be speeded up

When using cartesian charts, we compute the x domain. If that domain is big, the time to compute is pretty relevant. For example for a 50k data point dataset it tooks ~40ms. This can probably reduced by half if we adopt a better strategy on data processing, avoiding multiple array scans to sort, filter, map values and we just loop once with a reduce.

1 reply

thomasneirynck May 10, 2024
Maintainer

+1 for using a simple for-loop, functional programming-style by nature introduces overhead. Maybe we can remove the sort to (?), I'd assume data comes back sorted from ES already (?)

markov00 · 2024-05-03T13:35:59Z

markov00
May 3, 2024
Collaborator Author

Test 1 utopic baseline

I've implemented the most simple HTML page that covers: fetch directly to ES, a bit of data processing, and rendering on canvas.
This is a utopic/unrealistic situation because it doesn't consider: other DOM element rendering, interactivity support (and logic behind the interactivity) other data processing steps required to setup axes, labels, legend etc, and query creation etc.

Fetch: 202.56 ms // 176.68ms on fetch and 25.88ms response.json() time
ES response took: 130 // took time shown in the ES query result, 46.68ms of other network I/O
Tabify: 4.35009765625 ms // simplified tabify
Rendering: 3.890869140625 ms // simplified rendering
Overall time: 211ms

This time can't be achieved in Kibana but at least can be considered as the super-optimal baseline.
Our current Kibana implementation is 6x times slower than this baseline.

Simple demo page

<!DOCTYPE html>
<html>

<head>
    <title>My Empty Page</title>
</head>

<body>
    <button onclick="run()" style="position:absolute; right: 0;">RUN</button>
    <canvas id="chart"></canvas>

    <script>
        async function run() {
            const maxSeries = 1;
            const canvasWidth = 1000;
            const chartHeight = 200;
            const canvasHeight = (chartHeight) * maxSeries;
            const canvas = document.getElementById('chart');
            const ctx = canvas.getContext('2d');
            ctx.scale(window.devicePixelRatio, window.devicePixelRatio);
            canvas.width = canvasWidth * window.devicePixelRatio;
            canvas.height = canvasHeight * window.devicePixelRatio;
            canvas.style.width = `${canvasWidth}px`;
            canvas.style.height = `${canvasHeight}px`;

            function linearScale(value, inputDomain, outputRange) {
                const [inputMin, inputMax] = inputDomain;
                const [outputMin, outputMax] = outputRange;
                const inputRange = inputMax - inputMin;
                const outputDiff = outputMax - outputMin;
                const scaledValue = ((value - inputMin) / inputRange) * outputDiff + outputMin;
                return scaledValue;
            }

            const overallTimes = [];

            async function getData(i, t) {
                const startDate = `2024-04-28T22:10:27.454Z`;
                const endDate = `2024-05-01T21:00:48.656Z`;
                const startTime = performance.now();
                console.time(`Overall [${i}]`);
                console.time(`Fetch [${i}]`);
                const response = await fetch("http://localhost:9200/_search?max_concurrent_shard_requests=10", {
                    "headers": {
                        'Authorization': `Basic ${btoa("elastic:changeme")}`,
                        "content-type": "application/json",
                    },
                    "body": JSON.stringify({
                        "aggs": {
                            "byTerms": {
                                "terms": {
                                    "field": "geo.dest",
                                    "order": {
                                        "_key": "asc"
                                    },
                                    "size": 30
                                },
                                "aggs": {
                                    "byDate": {
                                        "date_histogram": {
                                            "field": "@timestamp",
                                            "fixed_interval": "5m",
                                            "time_zone": "Europe/Rome",
                                            "extended_bounds": {
                                                "min": new Date(startDate).valueOf(),
                                                "max": new Date(endDate).valueOf()
                                            }
                                        },
                                        "aggs": {
                                            "avgMetric": {
                                                "avg": {
                                                    "field": "bytes"
                                                }
                                            },
                                            "maxMetric": {
                                                "max": {
                                                    "field": "machine.ram"
                                                }
                                            }
                                        }
                                    }
                                }
                            }
                        },
                        "size": 0,
                        "query": {
                            "bool": {
                                "filter": [
                                    {
                                        "range": {
                                            "@timestamp": {
                                                "format": "strict_date_optional_time",
                                                "gte": startDate,
                                                "lte": endDate
                                            }
                                        }
                                    }
                                ]
                            }
                        }
                    }
                    ),
                    "method": "POST",
                });
                console.timeLog(`Fetch [${i}]`, 'response');
                const data = await response.json();
                console.timeEnd(`Fetch [${i}]`);
                console.log(`ES response took: ${data.took}`);

                console.time(`Tabify [${i}]`);
                const series = data.aggregations.byTerms.buckets.reduce((acc, bucket) => {
                    const { key, byDate: { buckets } } = bucket;

                    const seriesData = buckets.reduce((bucketAcc, d) => {
                        bucketAcc.dataArray.push(
                            {
                                key,
                                x: d.key,
                                maxMetric: d.maxMetric.value,
                                avgMetric: d.avgMetric.value,
                            });
                        bucketAcc.xDomain[0] = Math.min(bucketAcc.xDomain[0], d.key);
                        bucketAcc.xDomain[1] = Math.max(bucketAcc.xDomain[1], d.key);
                        bucketAcc.yDomain[0] = Math.min(bucketAcc.yDomain[0], d.maxMetric.value, d.avgMetric.value);
                        bucketAcc.yDomain[1] = Math.max(bucketAcc.yDomain[1], d.maxMetric.value, d.avgMetric.value);
                        return bucketAcc;
                    }, { dataArray: [], xDomain: [Infinity, -Infinity], yDomain: [Infinity, -Infinity] })

                    acc.series.set(key, seriesData.dataArray);
                    acc.xDomain[0] = Math.min(acc.xDomain[0], seriesData.xDomain[0]);
                    acc.xDomain[1] = Math.max(acc.xDomain[1], seriesData.xDomain[1]);

                    acc.yDomain[0] = Math.min(acc.yDomain[0], seriesData.yDomain[0]);
                    acc.yDomain[1] = Math.max(acc.yDomain[1], seriesData.yDomain[1]);

                    return acc;
                }, { series: new Map(), xDomain: [Infinity, -Infinity], yDomain: [Infinity, -Infinity] });
                console.timeEnd(`Tabify [${i}]`);
                console.time(`Rendering [${i}]`);

                // ctx.fillStyle = 'white';
                // ctx.fillRect(0, 0, canvasWidth, canvasHeight);

                ctx.save();
                ctx.translate(0, (chartHeight) * i);
                const colors = [
                    '#54B399',
                    '#6092C0',
                    '#9170B8',
                    '#CA8EAE',
                    '#D36086',
                    '#E7664C',
                    '#AA6556',
                    '#DA8B45',
                    '#B9A888',
                    '#D6BF57',
                ];
                [...series.series.values()].forEach((s, i) => {
                    let firstMoveA = false;
                    let firstMoveB = false;
                    ctx.strokeStyle = colors[i % colors.length];
                    ctx.beginPath();
                    s.forEach((d, i) => {
                        if (d.maxMetric !== null && !firstMoveA) {
                            ctx.moveTo(linearScale(d.x, series.xDomain, [0, canvasWidth]), linearScale(d.maxMetric, series.yDomain, [chartHeight, 0]));
                            firstMoveA = true;
                        } else if (d.maxMetric !== null && firstMoveA) {
                            ctx.lineTo(linearScale(d.x, series.xDomain, [0, canvasWidth]), linearScale(d.maxMetric, series.yDomain, [chartHeight, 0]));
                        }
                    });
                    ctx.stroke();
                    ctx.beginPath();
                    s.forEach((d, i) => {
                        if (d.avgMetric !== null && !firstMoveB) {
                            ctx.moveTo(linearScale(d.x, series.xDomain, [0, canvasWidth]), linearScale(d.avgMetric, series.yDomain, [chartHeight, 0]));
                            firstMoveA = true;
                        } else if (d.avgMetric !== null && firstMoveB) {
                            ctx.lineTo(linearScale(d.x, series.xDomain, [0, canvasWidth]), linearScale(d.avgMetric, series.yDomain, [chartHeight, 0]));
                        }
                    });
                    ctx.stroke();


                });

                ctx.restore();

                console.timeEnd(`Rendering [${i}]`);
                console.timeEnd(`Overall [${i}]`);
                console.log(`-------`);
                overallTimes.push(performance.now() - startTime);
            }

            console.time('Total time parallel');
            const requests = Array.from({ length: maxSeries }, (d, i) => getData(i));
            await Promise.all(requests);
            console.timeEnd('Total time parallel');

            // console.time('Total time sequential');
            // for(let i = 0; i < maxSeries; i++) {

            //     await getData(i, 0);

            // }
            // console.timeEnd('Total time sequential');

            console.log(`Overall avg time: ${overallTimes.reduce((s, c) => { s += c; return s }, 0) / overallTimes.length}ms`);

        }
    </script>
</body>
</html>

1 reply

thomasneirynck May 10, 2024
Maintainer

This is pretty great illustration :) . It also shows there is quite some room for improvement.

markov00 · 2024-05-03T13:42:16Z

markov00
May 3, 2024
Collaborator Author

Kibana Server doesn't reply with 304 to cached requests

During the testing, I've noticed that our server doesn't reply with 304 HTTP statuses even if the request is actually the same. When we have cases where nothing changed in a dashboard we should probably respond with 304 instead of querying ES again and waiting for its cached response.
ES on its own doesn't seem to respond with 304 either.

3 replies

ppisljar May 6, 2024
Collaborator

this sounds like it would need a cache on kibana server, which can be a memory problem. (as es doesnt respond with 304 kibana server would need to keep previous response in memory to figure out that the new one looks the same, or rely on request params to decide the response will not change). i would prefer to avoid this.

dej611 May 6, 2024
Collaborator

Maybe we could evaluate case by case where to apply a 304. bsearch is something with a lower probability of repeating the same request, but things like asking the list of indexes (local or remote) is something that happens very frequently and often the response is the same.

thomasneirynck May 10, 2024
Maintainer

@ppisljar agreed we'd likely want to avoid introducing more caches on Kibana server, especially for data responses.

@dej611 wrt asking the list of indices (so not wrt bsearch)

How do we load this now?

Overall, rather than necessarily returning 304s, it may also be something we could side-step and rely on cache headers and have the browser figure it out. e.g. using http-cache headers is how Maps ends up not re-requesting identical data.

e.g. see how pbf -reqeusts are loaded from disc

It does require loading data with GET for this to work "automagically" I think.

@mattkime do you any thoughts here? Some of the issue here feels similar to what ended up for field-cap caching, although that did introduce more caching server-side I believe (but was also needed for server-side alert-usage so ⚖️ )

markov00 · 2024-05-03T13:55:38Z

markov00
May 3, 2024
Collaborator Author

Polling search

As described earlier, we have a polling mechanism that is used to check for search completion when using async searches.
This poll mechanism is implemented on both side: one from the browser to the kibana server and another polling is from kibana server to ES.
Still in research, but we believe that this double polling mechanism could go "out-of-sync" when the ES load is higher (when ES is actually responding after the fist back-off interval) causing longer delays than expected.

kibana/src/plugins/data/common/search/poll_search.ts

Lines 20 to 35 in b8d8c73

    
           const getPollInterval = (elapsedTime: number): number => { 
        
             if (typeof pollInterval === 'number') return pollInterval; 
        
             else { 
        
               // if static pollInterval is not provided, then use default back-off logic 
        
               switch (true) { 
        
                 case elapsedTime < 1500: 
        
                   return 300; 
        
                 case elapsedTime < 5000: 
        
                   return 1000; 
        
                 case elapsedTime < 20000: 
        
                   return 2500; 
        
                 default: 
        
                   return 5000; 
        
               } 
        
             } 
        
           };

Could be interesting knowing how many HTTP open connection we can keep between the ES and Kibana server and between the Kibana server and the client so we can use server-side push or just keep a long living tunnel to receive the ES responses without any polling mechanism in place.

4 replies

ppisljar May 6, 2024
Collaborator

6 open connections between kibana client and server, we work around this by using bfetch

dej611 May 6, 2024
Collaborator

I think @markov00 was also asking for server to server connections limit (which I suspect is way more than 6, correct?)

thomasneirynck May 10, 2024
Maintainer

@ppisljar wrt open connections, we are looking to remove this altogether by moving to http2 eventually #7104 (comment).

@dej611 agreed. That is also the assumption we make for doing a trial run and turning of bsearch in Serverless, where the proxy has http2 support.

cc @lukasolson

vadimkibana Jun 5, 2024
Collaborator

... just keep a long living tunnel to receive the ES responses without any polling mechanism in place.

On the client-side: we could use just the streaming functionality of the bfetch plugin (not using the batching functionality) to send pushes from the server. It has plugins.bfetch.addStreamingResponseRoute() method which allows to create an HTTP request which never ends—can push unlimited number of messages.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Research] Possible performance improvements in Lens expressions #182551

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 6 comments 15 replies

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

[Research] Possible performance improvements in Lens expressions #182551

markov00 Apr 30, 2024 Collaborator

Test 1: single chart with date_histogram and terms aggregation

Profiling results with hot ES cache and client cache from a click of the refresh button to render-complete event

Replies: 6 comments · 15 replies

markov00 May 3, 2024 Collaborator Author

Unnecessary multiple requests with the same esagg query (FIXED in #182919)

ppisljar May 6, 2024 Collaborator

thomasneirynck May 10, 2024 Maintainer

markov00 May 13, 2024 Collaborator Author

markov00 May 3, 2024 Collaborator Author

wait_for_completion_timeout value is too low and can't process, without delays, a full response

markov00 May 3, 2024 Collaborator Author

ppisljar May 6, 2024 Collaborator

thomasneirynck May 10, 2024 Maintainer

markov00 May 3, 2024 Collaborator Author

getXDomain can be speeded up

thomasneirynck May 10, 2024 Maintainer

markov00 May 3, 2024 Collaborator Author

Test 1 utopic baseline

thomasneirynck May 10, 2024 Maintainer

markov00 May 3, 2024 Collaborator Author

Kibana Server doesn't reply with 304 to cached requests

ppisljar May 6, 2024 Collaborator

dej611 May 6, 2024 Collaborator

thomasneirynck May 10, 2024 Maintainer

markov00 May 3, 2024 Collaborator Author

Polling search

ppisljar May 6, 2024 Collaborator

dej611 May 6, 2024 Collaborator

thomasneirynck May 10, 2024 Maintainer

vadimkibana Jun 5, 2024 Collaborator

markov00
Apr 30, 2024
Collaborator

Replies: 6 comments 15 replies

markov00
May 3, 2024
Collaborator Author

ppisljar May 6, 2024
Collaborator

thomasneirynck May 10, 2024
Maintainer

markov00 May 13, 2024
Collaborator Author

markov00
May 3, 2024
Collaborator Author

`wait_for_completion_timeout` value is too low and can't process, without delays, a full response

markov00 May 3, 2024
Collaborator Author

ppisljar May 6, 2024
Collaborator

thomasneirynck May 10, 2024
Maintainer

markov00
May 3, 2024
Collaborator Author

`getXDomain` can be speeded up

thomasneirynck May 10, 2024
Maintainer

markov00
May 3, 2024
Collaborator Author

thomasneirynck May 10, 2024
Maintainer

markov00
May 3, 2024
Collaborator Author

ppisljar May 6, 2024
Collaborator

dej611 May 6, 2024
Collaborator

thomasneirynck May 10, 2024
Maintainer

markov00
May 3, 2024
Collaborator Author

ppisljar May 6, 2024
Collaborator

dej611 May 6, 2024
Collaborator

thomasneirynck May 10, 2024
Maintainer

vadimkibana Jun 5, 2024
Collaborator