Parallel Query Usage and Limitations in PostgreSQL

linkend="bgworker">background worker processes</link> equal to the number of workers chosen by the planner. The number of background workers that the planner will consider using is limited to at most <xref linkend="guc-max-parallel-workers-per-gather"/>. The total number of background workers that can exist at any one time is limited by both <xref linkend="guc-max-worker-processes"/> and <xref linkend="guc-max-parallel-workers"/>. Therefore, it is possible for a parallel query to run with fewer workers than planned, or even with no workers at all. The optimal plan may depend on the number of workers that are available, so this can result in poor query performance. If this occurrence is frequent, consider increasing <varname>max_worker_processes</varname> and <varname>max_parallel_workers</varname> so that more workers can be run simultaneously or alternatively reducing <varname>max_parallel_workers_per_gather</varname> so that the planner requests fewer workers. </para> <para> Every background worker process that is successfully started for a given parallel query will execute the parallel portion of the plan. The leader will also execute that portion of the plan, but it has an additional responsibility: it must also read all of the tuples generated by the workers. When the parallel portion of the plan generates only a small number of tuples, the leader will often behave very much like an additional worker, speeding up query execution. Conversely, when the parallel portion of the plan generates a large number of tuples, the leader may be almost entirely occupied with reading the tuples generated by the workers and performing any further processing steps that are required by plan nodes above the level of the <literal>Gather</literal> node or <literal>Gather Merge</literal> node. In such cases, the leader will do very little of the work of executing the parallel portion of the plan. </para> <para> When the node at the top of the parallel portion of the plan is <literal>Gather Merge</literal> rather than <literal>Gather</literal>, it indicates that each process executing the parallel portion of the plan is producing tuples in sorted order, and that the leader is performing an order-preserving merge. In contrast, <literal>Gather</literal> reads tuples from the workers in whatever order is convenient, destroying any sort order that may have existed. </para> </sect1> <sect1 id="when-can-parallel-query-be-used"> <title>When Can Parallel Query Be Used?</title> <para> There are several settings that can cause the query planner not to generate a parallel query plan under any circumstances. In order for any parallel query plans whatsoever to be generated, the following settings must be configured as indicated. </para> <itemizedlist> <listitem> <para> <xref linkend="guc-max-parallel-workers-per-gather"/> must be set to a value that is greater than zero. This is a special case of the more general principle that no more workers should be used than the number configured via <varname>max_parallel_workers_per_gather</varname>. </para> </listitem> </itemizedlist> <para> In addition, the system must not be running in single-user mode. Since the entire database system is running as a single process in this situation, no background workers will be available. </para> <para> Even when it is in general possible for parallel query plans to be generated, the planner will not generate them for a given query if any of the following are true: </para> <itemizedlist> <listitem> <para> The query writes any data or locks any database rows. If a query contains a data-modifying operation either at the top level or within a CTE, no parallel plans

The parallel query feature in PostgreSQL utilizes background worker processes to speed up query execution, but its usage is limited by various settings and conditions, such as the number of available workers, system mode, and query type, and the planner will not generate parallel plans for queries that write data or lock database rows.