Initialization Fork and Database Page Layout

a bit is not set, it might or might not be true. Visibility map bits are only set by vacuum, but are cleared by any data-modifying operations on a page. </para> <para> The <xref linkend="pgvisibility"/> module can be used to examine the information stored in the visibility map. </para> </sect1> <sect1 id="storage-init"> <title>The Initialization Fork</title> <indexterm> <primary>Initialization Fork</primary> </indexterm> <para> Each unlogged table, and each index on an unlogged table, has an initialization fork. The initialization fork is an empty table or index of the appropriate type. When an unlogged table must be reset to empty due to a crash, the initialization fork is copied over the main fork, and any other forks are erased (they will be recreated automatically as needed). </para> </sect1> <sect1 id="storage-page-layout"> <title>Database Page Layout</title> <para> This section provides an overview of the page format used within <productname>PostgreSQL</productname> tables and indexes.<footnote> <para> Actually, use of this page format is not required for either table or index access methods. The <literal>heap</literal> table access method always uses this format. All the existing index methods also use the basic format, but the data kept on index metapages usually doesn't follow the item layout rules. </para> </footnote> Sequences and <acronym>TOAST</acronym> tables are formatted just like a regular table. </para> <para> In the following explanation, a <firstterm>byte</firstterm> is assumed to contain 8 bits. In addition, the term <firstterm>item</firstterm> refers to an individual data value that is stored on a page. In a table, an item is a row; in an index, an item is an index entry. </para> <para> Every table and index is stored as an array of <firstterm>pages</firstterm> of a fixed size (usually 8 kB, although a different page size can be selected when compiling the server). In a table, all the pages are logically equivalent, so a particular item (row) can be stored in any page. In indexes, the first page is generally reserved as a <firstterm>metapage</firstterm> holding control information, and there can be different types of pages within the index, depending on the index access method. </para> <para> <xref linkend="page-table"/> shows the overall layout of a page. There are five parts to each page. </para> <table tocentry="1" id="page-table"> <title>Overall Page Layout</title> <titleabbrev>Page Layout</titleabbrev> <tgroup cols="2"> <thead> <row> <entry> Item </entry> <entry>Description</entry> </row> </thead> <tbody> <row> <entry>PageHeaderData</entry> <entry>24 bytes long. Contains general information about the page, including free space pointers.</entry> </row> <row> <entry>ItemIdData</entry> <entry>Array of item identifiers pointing to the actual items. Each entry is an (offset,length) pair. 4 bytes per item.</entry> </row> <row> <entry>Free space</entry> <entry>The unallocated space. New item identifiers are allocated from the start of this area, new items from the end.</entry> </row> <row> <entry>Items</entry> <entry>The actual items themselves.</entry> </row> <row> <entry>Special space</entry> <entry>Index access method specific data. Different methods store different data. Empty in ordinary tables.</entry> </row> </tbody> </tgroup> </table> <para> The first 24 bytes of each page consists of a page header (<structname>PageHeaderData</structname>). Its format is detailed in <xref linkend="pageheaderdata-table"/>. The first field tracks the most recent WAL entry related to this page. The second field contains the page checksum if <xref linkend="app-initdb-data-checksums"/> are enabled. Next is a 2-byte field containing flag bits. This is followed by three 2-byte integer fields (<structfield>pd_lower</structfield>, <structfield>pd_upper</structfield>, and <structfield>pd_special</structfield>). These contain byte offsets from the

This section describes the Initialization Fork, used for unlogged tables to provide an empty copy for recovery after a crash. It then transitions into explaining the layout of database pages in PostgreSQL, covering the overall structure and the five main components: PageHeaderData, ItemIdData, Free space, Items, and Special space. It highlights that sequences and TOAST tables share the same format as regular tables and that the page format is not required for table or index access methods. The section also introduces key concepts like 'byte' and 'item' and provides an overview of how tables and indexes are stored as arrays of fixed-size pages.