Transformation Process and the PostgreSQL Rule System

documents dealing with <application>flex</application> and <application>bison</application>. You should be familiar with <application>bison</application> before you start to study the grammar given in <filename>gram.y</filename> otherwise you won't understand what happens there. </para> </sect2> <sect2 id="parser-stage-transformation-process"> <title>Transformation Process</title> <para> The parser stage creates a parse tree using only fixed rules about the syntactic structure of SQL. It does not make any lookups in the system catalogs, so there is no possibility to understand the detailed semantics of the requested operations. After the parser completes, the <firstterm>transformation process</firstterm> takes the tree handed back by the parser as input and does the semantic interpretation needed to understand which tables, functions, and operators are referenced by the query. The data structure that is built to represent this information is called the <firstterm>query tree</firstterm>. </para> <para> The reason for separating raw parsing from semantic analysis is that system catalog lookups can only be done within a transaction, and we do not wish to start a transaction immediately upon receiving a query string. The raw parsing stage is sufficient to identify the transaction control commands (<command>BEGIN</command>, <command>ROLLBACK</command>, etc.), and these can then be correctly executed without any further analysis. Once we know that we are dealing with an actual query (such as <command>SELECT</command> or <command>UPDATE</command>), it is okay to start a transaction if we're not already in one. Only then can the transformation process be invoked. </para> <para> The query tree created by the transformation process is structurally similar to the raw parse tree in most places, but it has many differences in detail. For example, a <structname>FuncCall</structname> node in the parse tree represents something that looks syntactically like a function call. This might be transformed to either a <structname>FuncExpr</structname> or <structname>Aggref</structname> node depending on whether the referenced name turns out to be an ordinary function or an aggregate function. Also, information about the actual data types of columns and expression results is added to the query tree. </para> </sect2> </sect1> <sect1 id="rule-system"> <title>The <productname>PostgreSQL</productname> Rule System</title> <para> <productname>PostgreSQL</productname> supports a powerful <firstterm>rule system</firstterm> for the specification of <firstterm>views</firstterm> and ambiguous <firstterm>view updates</firstterm>. Originally the <productname>PostgreSQL</productname> rule system consisted of two implementations: <itemizedlist> <listitem> <para> The first one worked using <firstterm>row level</firstterm> processing and was implemented deep in the <firstterm>executor</firstterm>. The rule system was called whenever an individual row had been accessed. This implementation was removed in 1995 when the last official release of the <productname>Berkeley Postgres</productname> project was transformed into <productname>Postgres95</productname>. </para> </listitem> <listitem> <para> The second implementation of the rule system is a technique called <firstterm>query rewriting</firstterm>. The <firstterm>rewrite system</firstterm> is a module that exists between the <firstterm>parser stage</firstterm> and the <firstterm>planner/optimizer</firstterm>. This technique is still implemented. </para> </listitem> </itemizedlist> </para> <para> The query rewriter is discussed in some detail

This excerpt details the transformation process in PostgreSQL, which semantically interprets the parse tree created by the parser, identifying tables, functions, and operators. This process constructs a query tree. The separation of parsing and semantic analysis allows transaction control commands to be identified and executed without starting a transaction immediately. The query tree contains detailed information, such as data types. The excerpt also introduces the PostgreSQL rule system, highlighting its evolution from row-level processing to query rewriting, which is the currently implemented technique.