Feature Request: ensure topo calls use `--topo_read_concurrency` #17275

timvaillancourt · 2024-11-22T19:21:07Z

Feature Description

The --topo_read_concurrency flag is intended (based on it's help message and usage in the code) to limit how many reads a Vitess component does concurrently to the topo

Unfortunately in #14693 and #17071 (and at least 2 x more PRs for VTOrc to come) Vitess code is not always respecting this concurrency limit and is simply launching N x goroutines with sync.WaitGroups

Another issue is sometimes methods do respect the --topo_read_concurrency limit, but only local to the the method execution, and that code is called concurrently, significantly exceeding the limit

Instead of having the "callers" of go/vt/topo methods (internal or external) deal with concurrency limits all over the code, this feature request discusses making the concurrency limit built-in to all read-related struct methods of go/vt/topo's Server struct

I believe if this is done correctly the concurrency limit should be respected by all methods, including if they're stacked or called repeatedly/concurrently in other methods/funcs

Your thoughts appreciated 🙇

Use Case(s)

Regular vitess usage

The text was updated successfully, but these errors were encountered:

timvaillancourt · 2024-11-22T20:00:08Z

A question for all: the default --topo_read_concurrency is 32

Do we want both local and global topo calls to use that same limit, or should each have their own semaphore == to --topo_read_concurrency (each has 32)? I'm leaning towards the later but we should make it clear

deepthi · 2024-11-22T20:03:58Z

I'd lean towards each having their own semaphore. And yes, we should document it.

timvaillancourt · 2024-11-23T15:25:13Z

@deepthi sounds good, this is implemented here: #17276

The "cell" topo reads share a single semaphore that defaults to 32. A drawback to this approach is if 1 x cell is really slow it may starve the semaphore for "cell" reads. One way to avoid this is to give each cell connection a new semaphore that is a subset of the total. I'm curious what your thoughts were?

deepthi · 2024-12-08T17:40:10Z

Each cell should have its own limit so that different cells can be read in parallel. The intent of cell topo is that it is independent of global topo, and ideally each cell has its own topo server. I realize that people don't always deploy it that way, but we should not put a limit across cells.

timvaillancourt added Type: Internal Cleanup Type: Enhancement Logical improvement (somewhere between a bug and feature) Type: Performance Component: VTorc Vitess Orchestrator integration Component: Throttler Component: VTTablet Component: VTGate labels Nov 22, 2024

timvaillancourt self-assigned this Nov 22, 2024

timvaillancourt mentioned this issue Nov 23, 2024

Ensure all topo read calls consider --topo_read_concurrency #17276

Merged

5 tasks

timvaillancourt mentioned this issue Dec 3, 2024

Feature Request: vtorc to support --topo_read_concurrency limit #17073

Closed

timvaillancourt closed this as completed in #17276 Dec 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: ensure topo calls use `--topo_read_concurrency` #17275

Feature Request: ensure topo calls use `--topo_read_concurrency` #17275

timvaillancourt commented Nov 22, 2024 •

edited

Loading

timvaillancourt commented Nov 22, 2024 •

edited

Loading

deepthi commented Nov 22, 2024 •

edited

Loading

timvaillancourt commented Nov 23, 2024

deepthi commented Dec 8, 2024

Feature Request: ensure topo calls use --topo_read_concurrency #17275

Feature Request: ensure topo calls use --topo_read_concurrency #17275

Comments

timvaillancourt commented Nov 22, 2024 • edited Loading

Feature Description

Use Case(s)

timvaillancourt commented Nov 22, 2024 • edited Loading

deepthi commented Nov 22, 2024 • edited Loading

timvaillancourt commented Nov 23, 2024

deepthi commented Dec 8, 2024

Feature Request: ensure topo calls use `--topo_read_concurrency` #17275

Feature Request: ensure topo calls use `--topo_read_concurrency` #17275

timvaillancourt commented Nov 22, 2024 •

edited

Loading

timvaillancourt commented Nov 22, 2024 •

edited

Loading

deepthi commented Nov 22, 2024 •

edited

Loading