RFC: Move VDiff to vttablet #9216

ajm188 · 2021-11-12T18:46:11Z

Feature Description

Currently vdiff runs at the vtctld layer, which is responsible for contacting the tablets involved in a workflow, streaming the rows into memory, doing the diff, and outputting the results. This works, but it does not scale well when running many vdiffs, or large vdiffs, at once on vtctlds (which often have much less beefy hardware than vttablets).

If we were able to push down the work to just the tablets involved in the workflow, and leave the vtctld to only coordinate and collect results from the tablets, we could achieve much better vdiff throughput.
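To make the proposed split concrete, here is a minimal Go sketch of the coordinate-and-collect shape described above. It is not Vitess code: `TabletDiffSummary`, `DiffFunc`, `CoordinateDiff`, and `TotalMismatches` are hypothetical names invented for illustration, and the per-tablet call is a plain function standing in for whatever RPC an actual design would choose. The only point it shows is that the coordinator holds one small summary per tablet rather than the streamed rows.

```go
package main

import (
	"fmt"
	"sync"
)

// TabletDiffSummary is a hypothetical per-tablet result. Only counts travel
// back to the coordinator, not the rows themselves.
type TabletDiffSummary struct {
	Tablet         string
	RowsCompared   int
	MismatchedRows int
	Err            error
}

// DiffFunc is a hypothetical stand-in for "ask this tablet to run its part
// of the diff locally". In a real system this would be an RPC.
type DiffFunc func(tablet string) TabletDiffSummary

// CoordinateDiff fans the work out to every tablet concurrently and collects
// one summary per tablet, in the same order as the input slice.
func CoordinateDiff(tablets []string, run DiffFunc) []TabletDiffSummary {
	results := make([]TabletDiffSummary, len(tablets))
	var wg sync.WaitGroup
	for i, tablet := range tablets {
		wg.Add(1)
		go func(i int, tablet string) {
			defer wg.Done()
			// Each goroutine writes to its own slot, so no lock is needed.
			results[i] = run(tablet)
		}(i, tablet)
	}
	wg.Wait()
	return results
}

// TotalMismatches sums the mismatched row counts across all summaries.
func TotalMismatches(summaries []TabletDiffSummary) int {
	total := 0
	for _, s := range summaries {
		total += s.MismatchedRows
	}
	return total
}

func main() {
	// A fake tablet that reports no mismatches, for demonstration only.
	fake := func(tablet string) TabletDiffSummary {
		return TabletDiffSummary{Tablet: tablet, RowsCompared: 1000}
	}
	summaries := CoordinateDiff([]string{"tablet-100", "tablet-200"}, fake)
	fmt.Println(len(summaries), TotalMismatches(summaries))
}
```

In this shape the coordinator's memory use grows with the number of tablets rather than the number of rows, which is the scaling property the proposal is after.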

Design

This issue is a placeholder to start the discussion around how best to achieve this. We should eventually write up a design before doing significant work here.

mattlord · 2022-06-14T23:54:31Z

Closing this as fixed in: #10382

Planned follow-up work can be seen here: #10494

Please feel free to re-open this if I missed or misunderstood something.

ajm188 added the Type: Feature, Component: Cluster management, and Component: VReplication labels on Nov 12, 2021

mattlord closed this as completed Jun 14, 2022
