Edits to pgd_bench content

EnterpriseDB · Sep 19, 2023 · 281a447 · 281a447
1 parent 5cb7557
commit 281a447
Show file tree

Hide file tree

Showing 2 changed files with 110 additions and 119 deletions.
diff --git a/product_docs/docs/pgd/5/reference/testingandtuning.mdx b/product_docs/docs/pgd/5/reference/testingandtuning.mdx
@@ -4,117 +4,116 @@ navTitle: Testing and tuning
 indexdepth: 2
 ---
 
-EDB Postgres Distributed has tools which help with testing and tuning of your PGD clusters. For background, read the [Testing and Tuning](../testingandtuning) section.
+EDB Postgres Distributed has tools that help with testing and tuning your PGD clusters. For background, see [Testing and tuning](../testingandtuning).
 
 
-## `pgd_bench`
+## pgd_bench
 
 ### Synopsis
 
-A benchmarking tool for PGD enhanced PostgreSQL.
+A benchmarking tool for PGD-enhanced PostgreSQL.
 
 ```shell
 pgd_bench [OPTION]... [DBNAME] [DBNAME2]
 ```
 
-`DBNAME` may be a conninfo string of the format:
+`DBNAME` can be a conninfo string of the format:
   `"host=10.1.1.2 user=postgres dbname=master"`
 
-Consult the [Testing and Tuning - Pgd_bench](../testingandtuning#pgd_bench) section for examples
-of `pgd_bench` options and usage.
+See [pgd_bench in Testing and tuning](../testingandtuning#pgd_bench) for examples
+of pgd_bench options and usage.
 
 ### Options
 
-`pgd_bench` specific options include:
+pgd_bench-specific options include the following.
 
 #### Setting mode
 
 `-m` or `--mode`
 
-Which can be set to `regular`, `camo`, or `failover`. It defaults to `regular`.
+The mode can be set to `regular`, `camo`, or `failover`. The default is `regular`.
 
-* regular &mdash; Only a single node is needed to run `pgd_bench`
-* camo &mdash; A second node must be specified to act as the CAMO-partner (CAMO should be set up)
-* failover &mdash; A second node must be specified to act as the failover.
+* `regular` &mdash; Only a single node is needed to run pgd_bench.
+* `camo` &mdash; A second node must be specified to act as the CAMO partner. (CAMO must be set up.)
+* `failover` &mdash; A second node must be specified to act as the failover.
 
-When using `-m failover`, an additional option `--retry` is available. This will
-instruct `pgd_bench` to retry transactions when there is a failover. The `--retry` 
-option is automatically enabled with `-m camo`.
+When using `-m failover`, an additional option `--retry` is available. This option
+instructs pgd_bench to retry transactions when there's a failover. The `--retry` 
+option is enabled with `-m camo`.
 
 #### Setting GUC variables
 
  `-o` or `--set-option`
 
-This option is followed by `NAME=VALUE` entries, which will be applied using the 
-Postgresql [`SET`](https://www.postgresql.org/docs/current/sql-set.html) command on each server, and only those servers, that `pgd_bench` connects to. 
-
-The other options are identical to the Community PostgreSQL `pgbench`. For more
-details, consult the official documentation on
-[`pgbench`](https://www.postgresql.org/docs/current/pgbench.html).
-
-We list all the options (`pgd_bench` and `pgbench`) below for completeness.
-
-#### Initialization options:
--   `-i, --initialize`  &mdash; invokes initialization mode
--   `-I, --init-steps=[dtgGvpf]+` (default `"dtgvp"`)  &mdash; run selected initialization steps
-    -   `d`  &mdash; drop any existing `pgbench` tables
-    -   `t`  &mdash; create the tables used by the standard `pgbench` scenario
-    -   `g`  &mdash; generate data client-side and load it into the standard tables, replacing any data already present
-    -   `G`  &mdash; generate data server-side and load it into the standard tables, replacing any data already present
-    -   `v`  &mdash; invoke `VACUUM` on the standard tables
-    -   `p`  &mdash; create primary key indexes on the standard tables
-    -   `f`  &mdash; create foreign key constraints between the standard tables
--   `-F, --fillfactor=NUM` &mdash; set fill factor
--   `-n, --no-vacuum` &mdash; do not run `VACUUM` during initialization
--   `-q, --quiet` &mdash; quiet logging (one message each 5 seconds)
--   `-s, --scale=NUM` &mdash; scaling factor
--   `--foreign-keys` &mdash; create foreign key constraints between tables
--   `--index-tablespace=TABLESPACE` &mdash; create indexes in the specified tablespace
--   `--partition-method=(range|hash)` &mdash; partition `pgbench_accounts` with this method (default: range)
--   `--partitions=NUM` &mdash; partition `pgbench_accounts` into `NUM` parts (default: 0)
--   `--tablespace=TABLESPACE` &mdash; create tables in the specified tablespace
--   `--unlogged-tables` &mdash; create tables as unlogged tables (Note: unlogged tables are not replicated)
-
-#### Options to select what to run:
--   `-b, --builtin=NAME[@W]` &mdash; add builtin script NAME weighted at W (default: 1). Use `-b list` to list available scripts.
--   `-f, --file=FILENAME[@W]` &mdash; add script `FILENAME` weighted at W (default: 1)
--   `-N, --skip-some-updates` &mdash;  updates of pgbench_tellers and pgbench_branches. Same as `-b simple-update`
--   `-S, --select-only` &mdash; perform SELECT-only transactions. Same as `-b select-only`
-
-#### Benchmarking options:
--   `-c, --client=NUM` &mdash; number of concurrent database clients (default: 1)
--   `-C, --connect` &mdash; establish new connection for each transaction
--   `-D, --define=VARNAME=VALUE` &mdash;  define variable for use by custom script
--   `-j, --jobs=NUM` &mdash; number of threads (default: 1)
--   `-l, --log` &mdash; write transaction times to log file
--   `-L, --latency-limit=NUM` &mdash; count transactions lasting more than NUM ms as late
--   `-m, --mode=regular|camo|failover` &mdash; mode in which pgbench should run (default: `regular`)
--   `-M, --protocol=simple|extended|prepared` &mdash; protocol for submitting queries (default: `simple`)
--   `-n, --no-vacuum` &mdash; do not run `VACUUM` before tests
--   `-o, --set-option=NAME=VALUE` &mdash; specify runtime SET option
--   `-P, --progress=NUM`  &mdash; show thread progress report every NUM seconds
--   `-r, --report-per-command`  &mdash;  latencies, failures and retries per command
--   `-R, --rate=NUM`  &mdash; target rate in transactions per second
--   `-s, --scale=NUM`  &mdash; report this scale factor in output
--   `-t, --transactions=NUM`  &mdash; number of transactions each client runs (default: 10)
--   `-T, --time=NUM`  &mdash; duration of benchmark test in seconds
--   `-v, --vacuum-all`  &mdash; vacuum all four standard tables before tests
--   `--aggregate-interval=NUM`  &mdash;  data over NUM seconds
--   `--failures-detailed`  &mdash; report the failures grouped by basic types
--   `--log-prefix=PREFIX`  &mdash; prefix for transaction time log file (default: `pgbench_log`)
--   `--max-tries=NUM`  &mdash; max number of tries to run transaction (default: 1)
--   `--progress-timestamp`  &mdash; use Unix epoch timestamps for progress
--   `--random-seed=SEED`  &mdash; set random seed ("time", "rand", integer)
--   `--retry`  &mdash; retry transactions on failover, used with "-m"
--   `--sampling-rate=NUM`  &mdash; fraction of transactions to log (e.g., 0.01 for 1%)
--   `--show-script=NAME`  &mdash; show builtin script code, then exit
--   `--verbose-errors`  &mdash; print messages of all errors
+This option is followed by `NAME=VALUE` entries, which are applied using the 
+PostgreSQL [`SET`](https://www.postgresql.org/docs/current/sql-set.html) command on each server that pgd_bench connects to, and only those servers. 
+
+The other options are identical to the PostgreSQL pgd_bench command. For
+details, see the PostgreSQL 
+[pgd_bench](https://www.postgresql.org/docs/current/pgbench.html) documentation.
+
+The complete list of options (pgd_bench and pgbench) follow.
+
+#### Initialization options
+-   `-i, --initialize`  &mdash; Invoke initialization mode.
+-   `-I, --init-steps=[dtgGvpf]+` (default `"dtgvp"`)  &mdash; Run selected initialization steps.
+    -   `d`  &mdash; Drop any existing pgd_bench tables.
+    -   `t`  &mdash; Create the tables used by the standard pgd_bench scenario.
+    -   `g`  &mdash; Generate data client-side and load it into the standard tables, replacing any data already present.
+    -   `G`  &mdash; Generate data server-side and load it into the standard tables, replacing any data already present.
+    -   `v`  &mdash; Invoke `VACUUM` on the standard tables.
+    -   `p`  &mdash; Create primary key indexes on the standard tables.
+    -   `f`  &mdash; Create foreign key constraints between the standard tables.
+-   `-F, --fillfactor=NUM` &mdash; Set fill factor.
+-   `-n, --no-vacuum` &mdash; Don't run `VACUUM` during initialization.
+-   `-q, --quiet` &mdash; Quiet logging (one message every 5 seconds).
+-   `-s, --scale=NUM` &mdash; Scaling factor.
+-   `--foreign-keys` &mdash; Create foreign key constraints between tables.
+-   `--index-tablespace=TABLESPACE` &mdash; Create indexes in the specified tablespace.
+-   `--partition-method=(range|hash)` &mdash; Partition `pgbench_accounts` with this method. The default is `range`.
+-   `--partitions=NUM` &mdash; Partition `pgbench_accounts` into `NUM` parts. The default is `0`.
+-   `--tablespace=TABLESPACE` &mdash; Create tables in the specified tablespace.
+-   `--unlogged-tables` &mdash; Create tables as unlogged tables. (Note: Unlogged tables aren't replicated.)
+
+#### Options to select what to run
+-   `-b, --builtin=NAME[@W]` &mdash; Add built-in script NAME weighted at W. The default is 1. Use `-b list` to list available scripts.
+-   `-f, --file=FILENAME[@W]` &mdash; Add script `FILENAME` weighted at W. The default is 1.
+-   `-N, --skip-some-updates` &mdash;  Updates of pgbench_tellers and pgbench_branches. Same as `-b simple-update`.
+-   `-S, --select-only` &mdash; Perform SELECT-only transactions. Same as `-b select-only`.
+
+#### Benchmarking options
+-   `-c, --client=NUM` &mdash; Number of concurrent database clients. The default is 1.
+-   `-C, --connect` &mdash; Establish new connection for each transaction.
+-   `-D, --define=VARNAME=VALUE` &mdash; Define variable for use by custom script.
+-   `-j, --jobs=NUM` &mdash; Number of threads. The default is 1.
+-   `-l, --log` &mdash; Write transaction times to log file.
+-   `-L, --latency-limit=NUM` &mdash; Count transactions lasting more than NUM ms as late.
+-   `-m, --mode=regular|camo|failover` &mdash; Mode in which to run pgbench. The default is `regular`.
+-   `-M, --protocol=simple|extended|prepared` &mdash; Protocol for submitting queries. The default is `simple`.
+-   `-n, --no-vacuum` &mdash; Don't run `VACUUM` before tests.
+-   `-o, --set-option=NAME=VALUE` &mdash; Specify runtime `SET` option.
+-   `-P, --progress=NUM`  &mdash; Show thread progress report every NUM seconds.
+-   `-r, --report-per-command`  &mdash; Latencies, failures, and retries per command.
+-   `-R, --rate=NUM` &mdash; Target rate in transactions per second.
+-   `-s, --scale=NUM` &mdash; Report this scale factor in output.
+-   `-t, --transactions=NUM` &mdash; Number of transactions each client runs. The default is 10.
+-   `-T, --time=NUM` &mdash; Duration of benchmark test, in seconds.
+-   `-v, --vacuum-all` &mdash; Vacuum all four standard tables before tests.
+-   `--aggregate-interval=NUM` &mdash; Data over NUM seconds.
+-   `--failures-detailed` &mdash; Report the failures grouped by basic types.
+-   `--log-prefix=PREFIX` &mdash; Prefix for transaction time log file. The default is `pgbench_log`.
+-   `--max-tries=NUM` &mdash; Max number of tries to run transaction. The default is `1`.
+-   `--progress-timestamp` &mdash; Use Unix epoch timestamps for progress.
+-   `--random-seed=SEED` &mdash; Set random seed (`time`, `rand`, `integer`).
+-   `--retry` &mdash; Retry transactions on failover, used with `-m`.
+-   `--sampling-rate=NUM` &mdash; Fraction of transactions to log, for example, 0.01 for 1%.
+-   `--show-script=NAME` &mdash; Show built-in script code, then exit.
+-   `--verbose-errors` &mdash; Print messages of all errors.
 
 #### Common options:
--   `-d, --debug` &mdash; print debugging output
--   `-h, --host=HOSTNAME` &mdash; database server host or socket directory
--   `-p, --port=PORT` &mdash; database server port number
--   `-U, --username=USERNAME` &mdash; connect as specified database user
--   `-V, --version` &mdash; output version information, then exit
--   `-?, --help` &mdash; show help, then exit
-
+-   `-d, --debug` &mdash; Print debugging output.
+-   `-h, --host=HOSTNAME` &mdash; Database server host or socket directory.
+-   `-p, --port=PORT` &mdash; Database server port number.
+-   `-U, --username=USERNAME` &mdash; Connect as specified database user.
+-   `-V, --version` &mdash; Output version information, then exit.
+-   `-?, --help` &mdash; Show help, then exit.
diff --git a/product_docs/docs/pgd/5/testingandtuning.mdx b/product_docs/docs/pgd/5/testingandtuning.mdx
@@ -1,6 +1,6 @@
 ---
-title: Testing and Tuning PGD clusters
-navTitle: Testing and Tuning
+title: Testing and tuning PGD clusters
+navTitle: Testing and tuning
 ---
 
 You can test PGD applications using the following approaches:
@@ -29,26 +29,26 @@ of the application.
 ### pgd_bench
 
 The Postgres benchmarking application
-[`pgbench`](https://www.postgresql.org/docs/current/pgbench.html) has been
-extended in PGD 5.0 in the form of a new applications: `pgd_bench`.
+[`pgbench`](https://www.postgresql.org/docs/current/pgbench.html) was
+extended in PGD 5.0 in the form of a new applications: pgd_bench.
 
-[`pgd_bench`](/pgd/latest/reference/testingandtuning#pgd_bench) is a regular command-line utility that's added to PostgreSQL's bin
-directory. The utility is based on the Community PostgreSQL `pgbench` tool but
-supports benchmarking CAMO transactions and PGD specific workloads.
+[pgd_bench](/pgd/latest/reference/testingandtuning#pgd_bench) is a regular command-line utility that's added to the PostgreSQL bin
+directory. The utility is based on the PostgreSQL pgbench tool but
+supports benchmarking CAMO transactions and PGD-specific workloads.
 
-Functionality of the `pgd_bench` is a superset of those of `pgbench` but
-requires the BDR extension to be installed in order to work properly.
+Functionality of pgd_bench is a superset of those of pgbench but
+requires the BDR extension to be installed to work properly.
 
 Key differences include:
 
 -   Adjustments to the initialization (`-i` flag) with the standard 
-    `pgbench` scenario to prevent global lock timeouts in certain cases
--   `VACUUM` command in the standard scenario is executed on all nodes
--   `pgd_bench` releases are tied to the releases of the BDR extension
-    and are built against the corresponding PostgreSQL flavour (this is
-    reflected in the output of `--version` flag)
+    pgbench scenario to prevent global lock timeouts in certain cases.
+-   `VACUUM` command in the standard scenario is executed on all nodes.
+-   pgd_bench releases are tied to the releases of the BDR extension
+    and are built against the corresponding PostgreSQL flavor. This is
+    reflected in the output of the `--version` flag.
 
-The current version allows users to run failover tests while using CAMO or
+The current version allows you to run failover tests while using CAMO or
 regular PGD deployments. 
 
 The following options were added:
@@ -93,24 +93,22 @@ transactions.
 
 ### Notes on pgd_bench usage
 
--   When using custom init-scripts it is important to understand implications behind the DDL commands.
-It is generally recommended to wait for the secondary nodes to catch-up on the data-load steps
-before proceeding with DDL operations such as `CREATE INDEX`. The latter acquire global locks which
-can't be acquired until the data-load is complete and thus may time out.
-
--   No extra steps are taken to suppress client messages, such as `NOTICE`s and `WARNING`s emitted
-by PostgreSQL and or any possible extensions including the BDR extension. It is the user's
-responsibility to suppress them by setting appropriate variables (e.g. `client_min_messages`, 
-`bdr.camo_enable_client_warnings ` etc.).
-
+-   When using custom init-scripts, it's important to understand implications behind the DDL commands.
+We generally recommend waiting for the secondary nodes to catch up on the data-load steps
+before proceeding with DDL operations such as `CREATE INDEX`. The latter acquire global locks that
+can't be acquired until the data load is complete and thus might time out.
 
+-   No extra steps are taken to suppress client messages, such as `NOTICE` and `WARNING` messages emitted
+by PostgreSQL and or any possible extensions, including the BDR extension. It's your
+responsibility to suppress them by setting appropriate variables, such as `client_min_messages`, 
+`bdr.camo_enable_client_warnings`, and so on.
 
 ## Performance testing and tuning
 
 PGD allows you to issue write transactions onto multiple master nodes. Bringing
-those writes back together onto each node has a cost in performance.
+those writes back together onto each node has a performance cost.
 
-First, replaying changes from another node has a CPU cost, an I/O cost,
+First, replaying changes from another node has a CPU cost an an I/O cost,
 and it generates WAL records. The resource use is usually less
 than in the original transaction since CPU overheads are lower as a result
 of not needing to reexecute SQL. In the case of UPDATE and DELETE
@@ -135,7 +133,7 @@ If PGD is running slow, then we suggest the following:
 1.  Write a custom test script for pgd_bench, as close as you can make it
     to the production system's problem case.
 2.  Run the script on one node to give you a baseline figure.
-3.  Run the script on as many nodes as occurs in production, using the
+3.  Run the script on as many nodes as occur in production, using the
     same number of sessions in total as you did on one node. This technique
     shows you the effect of moving to multiple nodes.
 4.  Increase the number of sessions for these two tests so you can
@@ -145,9 +143,3 @@ If PGD is running slow, then we suggest the following:
 
 Use all of the normal Postgres tuning features to improve the speed
 of critical parts of your application.
-
-
-
-
-
-