experiment: use pipeline mode #2707

robx · 2023-03-16T15:41:26Z

This pulls in an exploratory implementation of pipeline mode (#2295). The bulk of the change is in the supporting libraries:

postgresql-libpq is extended to wrap the libpq pipeline mode API
hasql is hacked to allow queueing up pipelined statements, with pipeline synchronization and result reading (ignoring) deferred until the next result is required

Some notes as to the state of this:

the hasql change is very much just a minimal hack to allow us to evaluate pipeline mode in postgrest -- it makes little sense as some kind of "pipeline mode for a hasql" feature
the implementation seems mostly correct, but error handling is at least a bit broken (we don't treat aborted pipelines quite right), and there's a decent chance that some failure scenarios actually mess up the connection state, though I haven't seen that
this includes running postgrest-loadtest with artificial latency #2682 to allow simulating slow postgres

robx · 2023-03-16T15:55:54Z

Some performance results, using postgrest-loadtest.

	pipeline=yes	pipeline=no	main branch
pgdelay=0	300	287	294
pgdelay=1ms	62	55	55
pgdelay=5ms pgrst_delay=5ms	17.3	14.5	14.5
pgdelay=1ms pgrst_delay=10ms	26.6	24.8	24.7
pgdelay=10ms	12.1	9.6	9.6
pgdelay=50ms	2.7	2.2	2.2

the number is request rate from loadtest output -- it isn't particularly stable, e.g. I wouldn't trust the 300 > 294 in the first row to signal an improvement. But the overall improvement between the columns seem consistent.
pipeline=yes/no is on this branch, with usePipeline set to True or False; main branch is with unmodified dependencies
command line is e.g. PGRST_BUILD_CABAL=1 PGDELAY=1ms PGRST_DELAY=10ms postgrest-loadtest
PGRST_BUILD_CABAL=1 says to build using postgrest-build for quicker iteration (there's a supporting change in this PR)

Regarding the results:

there's some overhead in hasql that comes with supporting pipeline mode; this seem to be noticeable as a slight performance cost in the undelayed scenario
as soon as there's a bit of a latency towards postgresql (regardless of how that latency compares to the http client latency), pipeline mode does seem to provide a measurable benefit

robx · 2023-03-16T16:07:21Z

The library changes:

The postgresql-libpq change is essentially good to go upstream, but I haven't filed it yet.

wolfgangwalther · 2023-03-16T16:09:23Z

Very cool!

steve-chavez · 2023-03-20T07:35:32Z

src/PostgREST/Query.hs

+usePipeline :: Bool
+usePipeline = True
+
+queuePipelineStatement :: params -> SQL.Statement params () -> SQL.Transaction ()
+queuePipelineStatement params stmt =
+  if usePipeline then SQL.inTransaction $ Session.queuePipelineStatement params stmt
+                 else SQL.statement params stmt


So awesome that is such a small change.

the implementation seems mostly correct, but error handling is at least a bit broken (we don't treat aborted pipelines quite right), and there's a decent chance that some failure scenarios actually mess up the connection state, though I haven't seen that

So since there could be some unknown unknowns(pipeline mode also new to me), I think we should make this configurable. Name could be db-pipeline-mode. True by default.

I mean, the large change is in hasql. And I don't think it's a particularly sensible implementation there API-wise so far.

But yes gating this by a configuration option makes sense.

steve-chavez · 2023-03-21T08:25:45Z

Just FYI, I'm leaving the pgbench pipeline test on steve-chavez@682930d.

robx mentioned this pull request Mar 16, 2023

pipeline mode hack robx/hasql#1

Closed

robx mentioned this pull request Mar 17, 2023

Add pipeline mode API haskellari/postgresql-libpq#42

Closed

steve-chavez reviewed Mar 20, 2023

View reviewed changes

robx mentioned this pull request Mar 20, 2023

running postgrest-loadtest with artificial latency #2682

Merged

robx added 3 commits May 3, 2023 11:59

fmt

ee019b3

use forked dependencies that enable use of pipeline mode

9443c53

use pipeline mode

24b7def

robx force-pushed the pipeline branch from e8d50d3 to 24b7def Compare May 3, 2023 10:19

robx added 3 commits May 5, 2023 11:08

hasql-pipeline update

9ab6239

nix sha256 format change due to update-...

b523b2a

fix import aliasing warning

33d7d26

steve-chavez mentioned this pull request Aug 14, 2023

CSV bulk insert performance #1206

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

experiment: use pipeline mode #2707

experiment: use pipeline mode #2707

robx commented Mar 16, 2023

robx commented Mar 16, 2023

robx commented Mar 16, 2023

wolfgangwalther commented Mar 16, 2023

steve-chavez Mar 20, 2023 •

edited

Loading

robx Mar 20, 2023

steve-chavez commented Mar 21, 2023

experiment: use pipeline mode #2707

Are you sure you want to change the base?

experiment: use pipeline mode #2707

Conversation

robx commented Mar 16, 2023

robx commented Mar 16, 2023

robx commented Mar 16, 2023

wolfgangwalther commented Mar 16, 2023

steve-chavez Mar 20, 2023 • edited Loading

Choose a reason for hiding this comment

robx Mar 20, 2023

Choose a reason for hiding this comment

steve-chavez commented Mar 21, 2023

steve-chavez Mar 20, 2023 •

edited

Loading