Merge pull request #5270 from EnterpriseDB/release/2024-02-15b

Release/2024-02-15 (b)
EnterpriseDB · Feb 15, 2024 · ad41f31 · ad41f31 · github-actions · Feb 15, 2024
2 parents a44251d + 35c4e5f
commit ad41f31
Show file tree

Hide file tree

Showing 65 changed files with 3,590 additions and 369 deletions.
diff --git a/advocacy_docs/pg_extensions/extensionrefs.json b/advocacy_docs/pg_extensions/extensionrefs.json
@@ -117,5 +117,6 @@
     "mysql_fdw": "https://github.com/EnterpriseDB/mysql_fdw",
     "bdr_5.x":"https://www.enterprisedb.com/docs/pgd/5/",
     "pg_cron":"https://github.com/citusdata/pg_cron",
-    "sql_profiler":"https://www.enterprisedb.com/docs/pem/latest/profiling_workloads/using_sql_profiler/"
+    "sql_profiler":"https://www.enterprisedb.com/docs/pem/latest/profiling_workloads/using_sql_profiler/",
+    "pg_squeeze":"https://www.enterprisedb.com/docs/pg_extensions/pg_squeeze/"
 }
diff --git a/advocacy_docs/pg_extensions/index.mdx b/advocacy_docs/pg_extensions/index.mdx
diff --git a/advocacy_docs/pg_extensions/pg_squeeze/configuring.mdx b/advocacy_docs/pg_extensions/pg_squeeze/configuring.mdx
@@ -0,0 +1,33 @@
+---
+title: Configuring pg_squeeze 
+navTitle: Configuring
+---
+
+1. Add `pg_squeeze` to the `shared_preload_libraries` parameter: 
+
+   ```ini
+   shared_preload_libraries = 'pg_squeeze'
+   ```
+
+   !!! Note
+   If `shared_preload_libraries` has other extensions, then you can add `pg_squeeze` to the list. The order doesn't matter. 
+   !!!
+
+1. Add these changes to `postgresql.conf`:
+
+  ```shell
+  wal_level = logical
+  max_replication_slots = 1 # ... or add 1 to the current value.
+  ```
+
+1. Restart Postgres.
+
+1. Create the `pg_squeeze` extension in your database:
+
+  ```shell
+  CREATE EXTENSION pg_squeeze;
+  ```
+
+!!! Note 
+    When upgrading a database cluster with `pg_squeeze` installed (either using `pg_dumpall`/`pg_restore` or `pg_upgrade`), make sure that the new cluster has `pg_squeeze` in `shared_preload_libraries` before you upgrade. Otherwise the upgrade fails.
+
diff --git a/advocacy_docs/pg_extensions/pg_squeeze/index.mdx b/advocacy_docs/pg_extensions/pg_squeeze/index.mdx
@@ -0,0 +1,27 @@
+---
+title: 'PG Squeeze'
+indexCards: none
+navigation:
+   - rel_notes
+   - installing
+   - configuring
+   - using
+directoryDefaults:
+  product: PG Squeeze
+---
+
+PG Squeeze (`pg_squeeze`) is an extension released as open source software under the PostgreSQL License. 
+
+`pg_squeeze` removes unused space from a table and optionally sorts tuples according to a particular index (as if a `CLUSTER` command were executed concurrently with regular reads and writes). In fact, `pg_squeeze` tries to replace the `pg_repack` extension. While providing very similar functionality, `pg_squeeze` takes a different approach by:
+
+- Implementing functionality solely on the server side. This approach makes both configuration and use simpler than `pg_repack`, which uses both server- and client-side code. Server-side implementation also allows for smooth implementation of unattended processing using background workers.
+
+- Using recent improvements of the PostgreSQL database server. Most notably, besides the use of background workers, `pg_squeeze` uses logical decode instead of triggers to capture concurrent changes. 
+
+For more information about `pg_squeeze`, see:
+
+- [Installing pg_squeeze](installing.mdx)
+- [Configuring pg_squeeze](configuring.mdx)
+- [Using pg_squeeze](using.mdx)
+
+
diff --git a/advocacy_docs/pg_extensions/pg_squeeze/installing.mdx b/advocacy_docs/pg_extensions/pg_squeeze/installing.mdx
@@ -0,0 +1,71 @@
+---
+title: Installing pg_squeeze 
+navTitle: Installing
+---
+
+`pg_squeeze` is supported on the same platforms as the Postgres distribution you're using. Support for `pg_squeeze` starts with Postgres 12. For details, see:
+
+- [EDB Postgres Advanced Server Product Compatibility](https://www.enterprisedb.com/platform-compatibility#epas)
+- [PostgreSQL Product Compatibility](https://www.enterprisedb.com/resources/platform-compatibility#pg)
+- [EDB Postgres Extended Server Product Compatibility](https://www.enterprisedb.com/resources/platform-compatibility#epas_extended)
+
+## Installation
+
+Before you begin the installation process:
+
+- Install Postgres. See: 
+
+  - [Installing EDB Postgres Advanced Server](/epas/latest/installing/)
+
+  - [Installing PostgreSQL](https://www.postgresql.org/download/) 
+
+  - [Installing EDB Postgres Extended Server](/pge/latest/installing/)
+
+- Set up the repository.
+
+  Setting up the repository is a one-time task. If you've already set up your repository, you don't need to perform this step.
+
+  To set up the repository, go to [EDB repositories](https://www.enterprisedb.com/repos-downloads) and follow the instructions provided there.
+
+## Install the package
+
+The syntax for the package install command is:
+
+```shell
+# For SLES, CentOS, RHEL and its derivatives
+sudo <package-manager> -y install edb-<postgres><postgres_version>-pg-squeeze1
+
+# For Debian and Ubuntu
+sudo <package-manager> -y install edb-<postgres><postgres_version>-pg-squeeze-1
+```
+
+Where: 
+- `<package-manager>`is the package manager used with your operating system:
+
+  | Package manager |             Operating system     |
+  | --------------- | -------------------------------- |
+  | dnf             | RHEL 8/9 and derivatives         |
+  | yum             | RHEL 7 and derivatives, CentOS 7 |
+  | zypper          | SLES                             |
+  | apt-get         | Debian and derivatives           |
+
+- `<postgres>` is the distribution of Postgres you're using:
+
+ |    Postgres distribution     |   Value    |
+ | ---------------------------- | ---------- |
+ | PostgreSQL                   | pg         |
+ | EDB Postgres Advanced Server | as         |
+ | EDB Postgres Extended Server | postgresextended |
+
+- `<postgres_version>` is the version of Postgres you're using.
+
+For example, to install pg_squeeze for EDB Postgres Advanced Server 15 on a RHEL 8 platform:
+
+```shell
+sudo dnf -y install edb-as15-pg-squeeze1
+```
+
+
+
+
+
diff --git a/advocacy_docs/pg_extensions/pg_squeeze/rel_notes/index.mdx b/advocacy_docs/pg_extensions/pg_squeeze/rel_notes/index.mdx
@@ -0,0 +1,10 @@
+---
+title: 'pg_squeeze release notes'
+navTitle: "Release notes"
+indexCards: none
+---
+The pg_squeeze documentation describes the latest version of pg_squeeze, including minor releases and patches. These release notes cover what was new in each release. For new functionality introduced in a minor or patch release, there are also indicators in the content about the release that introduced the feature.
+
+|               Version               | Release Date |
+| ----------------------------------- | ------------ |
+| [1.6.1](pg_squeeze_1.6.1_rel_notes) | 15 Feb 2024  |
diff --git a/advocacy_docs/pg_extensions/pg_squeeze/rel_notes/pg_squeeze_1.6.1_rel_notes.mdx b/advocacy_docs/pg_extensions/pg_squeeze/rel_notes/pg_squeeze_1.6.1_rel_notes.mdx
@@ -0,0 +1,10 @@
+---
+title: Release notes for pg_squeeze version 1.6.1
+navTitle: "Version 1.6.1"
+---
+
+This release of pg_squeeze includes:
+
+|  Type   |                                Description                                 |
+| ------- | -------------------------------------------------------------------------- |
+| Feature | This is the initial release.                                               |
diff --git a/advocacy_docs/pg_extensions/pg_squeeze/using.mdx b/advocacy_docs/pg_extensions/pg_squeeze/using.mdx
@@ -0,0 +1,190 @@
+---
+title: Using pg_squeeze
+navTitle: Using
+---
+
+## Registering a table for regular processing
+
+First, ensure that your table has either a primary key or a unique constraint. This is necessary to process changes resulting from other transactions  while `pg_squeeze` is doing its work.
+
+To make the `pg_squeeze` extension aware of the table, you must insert a record into the `squeeze.tables` table. After it is added, table statistics are checked periodically. When the table meets the necessary criteria to be squeezed, a task is added to a queue. The tasks are processed sequentially, in the order they are created.
+
+Here is an example of a simple registration:
+
+```sql
+INSERT INTO squeeze.tables (tabschema, tabname, schedule)
+VALUES ('public', 'foo', ('{30}', '{22}', NULL, NULL, '{3, 5}'));
+```
+
+Additional columns can be specified optionally, for example:
+
+```sql
+INSERT INTO squeeze.tables (
+    tabschema,
+    tabname,
+    schedule,
+    free_space_extra,
+    vacuum_max_age,
+    max_retry
+)
+VALUES (
+    'public',
+    'bar',
+    ('{30}', '{22}', NULL, NULL, '{3, 5}'),
+    30,
+    '2 hours',
+    2
+);
+```
+
+The following list describes the table metadata:
+
+- `tabschema` &mdash; Schema name.
+
+- `tabname` &mdash; Table name.
+
+- `schedule` &mdash; Specifies when the table should be checked and possibly squeezed. The schedule is described by a value of the following composite data type, which resembles a crontab entry:
+
+    ```shell
+    CREATE TYPE schedule AS (
+        minutes       minute[],
+        hours         hour[],
+        days_of_month dom[],
+        months        month[],
+        days_of_week  dow[]
+    );
+    ```
+
+    In this data type `minutes` (0-59) and `hours` (0-23) determine the time when the check occurs within a day, while `days_of_month` (1-31), `months` (1-12) and `days_of_week` (0-7, where both 0 and 7 stand for Sunday) determine the day of the check.
+
+    The check is performed if the `minutes`, `hours`, and `months` all match the current timestamp. `NULL` means any minute, hour, and month, respectively. Either `days_of_month` or `days_of_week` must match the current timestamp or both must be `NULL` for the check to take place.
+
+    For example, the entries in the sample registration specify to check the table `public.bar` every Wednesday and Friday at 22:30.
+
+- `free_space_extra` &mdash; Minimum percentage of extra free space needed to trigger processing of the table. The `extra` adjective refers to the fact that free space derived from `fillfactor` isn't a reason to squeeze the table.
+
+    For example, if `fillfactor` equals 60, then at least 40 percent of each page stays free during normal operation. If you want to ensure that 70 percent of free space makes `pg_squeeze` interested in the table, set `free_space_extra` to 30 (that is, 70 percent required to be free minus the 40 percent free due to the `fillfactor`).
+
+    The default value of `free_space_extra` is 50.
+
+- `min_size` &mdash; Minimum disk space in megabytes that the table must occupy to be eligible for processing. The default value is 8.
+
+- `vacuum_max_age` &mdash; Maximum time since the completion of the last `VACUUM` to consider the free space map (FSM) fresh. After this interval has elapsed, the portion of dead tuples might be significant. In this case, you must spend more effort to evaluate the potential effect of `pg_squeeze` beyond checking the FSM. The default value is 1 hour.
+
+- `max_retry` &mdash; Maximum number of extra attempts to squeeze a table if the first processing of the corresponding task fails. The typical reason to retry processing is that the table definition changed while the table was being squeezed. If the number of retries is achieved, processing of the table is considered complete. The next task is created at the next scheduled time.
+    The default value of `max_retry` is 0 (that is, don't retry).
+
+!!! note
+    The `squeeze.table` is the only table to modify. If you want to change anything else, make sure you understand what you're doing.
+
+## Ad hoc processing for any table
+
+It's possible to squeeze tables manually without registering (that is, without inserting the corresponding record into `squeeze.tables`) and without prior checking of the actual bloat.
+
+Function signature:
+
+```shell
+squeeze.squeeze_table(
+    tabschema name,
+    tabname name,
+    clustering_index name,
+    rel_tablespace name,
+    ind_tablespaces name[]
+)
+```
+
+The following list describes the table metadata for ad hoc processing:
+
+- `clustering_index` &mdash; Index of the processed table. After processing finishes, tuples of the table are physically sorted by the key of this index.
+
+- `rel_tablespace` &mdash; Existing tablespace into which to move the table. `NULL` means to leave the table where it is.
+
+- `ind_tablespaces` &mdash; Two-dimensional array in which each row specifies tablespace mapping of an index. The first and the second columns represent the index name and tablespace name, respectively. All indexes for which no mapping is specified remain in the original tablespace.
+    !!! note
+        If a tablespace is specified for the table but not for indexes, the table gets moved to that tablespace, but the indexes remain in the original tablespace. In other words, the tablespace of the table isn't the default for indexes.
+
+### Sample execution
+
+```shell
+SELECT squeeze.squeeze_table('public', 'pgbench_accounts');
+```
+
+## Enabling or disabling table processing
+
+To enable processing of bloated tables, run this statement as superuser:
+
+```shell
+SELECT squeeze.start_worker();
+```
+
+The function starts a background worker (`scheduler worker`) that periodically checks which of the registered tables to check for bloat and creates a task for each. Another worker (`squeeze worker`) is launched whenever a task exists for particular database.
+
+If the scheduler worker is already running for the current database, the function doesn't report any error, and the new worker exits immediately.
+
+If the workers are running for the current database, you can use the following statement to stop them:
+
+```shell
+SELECT squeeze.stop_worker();
+```
+
+!!! note
+    Only the functions mentioned in this documentation are considered part of the user interface. If you want to call any other function, make sure you understand what you're doing.
+
+If you want the background workers to start during startup of the whole PostgreSQL cluster, add entries like the following to the  `postgresql.conf` file:
+
+```shell
+squeeze.worker_autostart = 'my_database your_database'
+squeeze.worker_role = postgres
+```
+
+The next time you start the cluster, two or more workers (that is, one scheduler worker and one or more squeeze workers) are launched for `my_database` and the same for `your_database`. If you take this approach, any worker doesn't start or stop without doing any work if either:
+
+- The `pg_squeeze` extension doesn't exist in the database.
+
+- The `squeeze.worker_role` parameter specifies a role that doesn't have superuser privileges.
+
+Although there are actually two workers, the functions and configuration variables described here use a singular form of the word `worker`. This is because only one `worker` existed in the previous versions of `pg_squeeze`, which ensured both scheduling and execution of the tasks. This implementation change, then, doesn't force all users to adjust their configuration files during upgrade.
+
+## Controlling impact on other backends
+
+Although the table being squeezed is available for both read and write operations by other transactions most of the time, an exclusive lock is needed to finalize processing. If `pg_squeeze` occasionally seems to block access to tables, consider setting the `squeeze.max_xlock_time` GUC parameter. For example:
+
+```shell
+SET squeeze.max_xlock_time TO 100;
+```
+
+This example specifies not to hold the exclusive lock for more than 0.1 second (100 milliseconds). If more time is needed for the final stage, `pg_squeeze` releases the exclusive lock, processes changes committed by other transactions in between, and tries the final stage again. An error is reported if the lock duration is exceeded a few more times. If that happens, either increase the setting or schedule processing of the problematic table for a different time when write activity is lower.
+
+## Running multiple workers per database
+
+If you think that a single squeeze worker doesn't cope with the load, consider setting the `squeeze.workers_per_database` configuration variable to a value higher than 1. Then the `pg_squeeze` extension can process multiple tables simultaneously: one table per squeeze worker. 
+
+However, be aware that this setting affects all databases in which you actively use the `pg_squeeze` extension. The total number of all the squeeze workers in the cluster (including the scheduler workers) can't exceed the in-core configuration variable `max_worker_processes`.
+
+## Monitoring
+
+The `squeeze.log` table contains one entry per successfully squeezed table.
+
+The columns `tabschema` and `tabname` identify the processed table. The columns `started` and `finished` report when the processing started and finished. `ins_initial` is the number of tuples inserted into the new table storage during the initial load stage, that is, the number of tuples present in the table before the processing started. On the other hand, `ins`, `upd`, and `del` are the numbers of tuples inserted, updated, and deleted by applications during the table processing. (These concurrent data changes must also be incorporated into the squeezed table. Otherwise they'd get lost.)
+
+The `squeeze.errors` table contains errors that happen during squeezing. A common problem is that someone changes the definition of the table whose processing is in progress (that is, someone adds or removes a column).
+
+The `squeeze.get_active_workers()` function returns a table of squeeze workers that are processing tables in the current database.
+
+The `pid` column contains the system PID of the worker process. The other columns have the same meaning as their counterparts in the `squeeze.log` table. While the `squeeze.log` table shows information only on the completed squeeze operations, the `squeeze.get_active_workers()` function lets you check the progress during processing.
+
+## Unregistering a table
+
+If a particular table is no longer subject to periodical squeezes, delete the corresponding row from the `squeeze.tables` table.
+
+It's also a good practice to unregister a table that you're going to drop, although the background worker does unregister non-existing tables periodically.
+
+## Concurrency
+
+The `pg_squeeze` extension doesn't prevent other transactions from altering a table at certain stages of the processing. If a disruptive command (that is, `ALTER TABLE`, `VACUUM FULL`, `CLUSTER`, or `TRUNCATE`) manages to commit before the squeeze finishes, the `squeeze_table()` function aborts and all changes made to the table are rolled back. The `max_retry` column of the `squeeze.tables` table determines how many times the squeeze worker retries. Changing your schedule might help to avoid these kind of disruptions.
+
+Like `pg_repack`, `pg_squeeze` also changes visibility of rows and thus allows for the MVCC-unsafe behavior described in the first paragraph of [Caveats](https://www.postgresql.org/docs/current/mvcc-caveats.html) for MVCC.
+
+## Disk space requirements
+
+Performing a full-table squeeze requires twice as much free disk space as the target table and its indexes. For example, if the total size of the tables and indexes to be squeezed is 1GB, you need an additional 2GB of disk space.
diff --git a/advocacy_docs/pg_extensions/wait_states/index.mdx b/advocacy_docs/pg_extensions/wait_states/index.mdx
@@ -4,6 +4,7 @@ indexCards: none
 directoryDefaults:
   product: EDB Wait States
 navigation:
+  - rel_notes
   - installing
   - using
 ---

diff --git a/advocacy_docs/pg_extensions/wait_states/rel_notes/index.mdx b/advocacy_docs/pg_extensions/wait_states/rel_notes/index.mdx
@@ -0,0 +1,11 @@
+---
+title: 'EDB Wait States release notes'
+navTitle: "Release notes"
+indexCards: none
+---
+
+The EDB Wait States documentation describes the latest version of EDB Wait States, including minor releases and patches. These release notes cover what was new in each release. For new functionality introduced in a minor or patch release, there are also indicators in the content about the release that introduced the feature.
+
+|             Version              | Release Date |
+| -------------------------------- | ------------ |
+| [1.2](wait_states_1.2_rel_notes) | 15 Feb 2024  |
diff --git a/advocacy_docs/pg_extensions/wait_states/rel_notes/wait_states_1.2_rel_notes.mdx b/advocacy_docs/pg_extensions/wait_states/rel_notes/wait_states_1.2_rel_notes.mdx
@@ -0,0 +1,11 @@
+---
+title: Release notes for Wait States version 1.2
+navTitle: "Version 1.2"
+---
+
+This release of Wait States includes:
+
+| Type        | Description                                                                                                                                                                          |
+|-------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| Enhancement | Added new parameters to functions `edb_wait_states_samples()` and `edn_wait_states_session()`.                                                                                       |
+| Enhancement | Added new functions: `edb_wait_states_system_info`, `edb_wait_states_wait_events`, `edb_wait_states_sql_statements`, `edb_wait_states_cluster_stats`, and `edb_wait_states_cluster`. |