regexp_split Examples and regexp

<para> The <function>regexp_split_to_array</function> function behaves the same as <function>regexp_split_to_table</function>, except that <function>regexp_split_to_array</function> returns its result as an array of <type>text</type>. It has the syntax <function>regexp_split_to_array</function>(<replaceable>string</replaceable>, <replaceable>pattern</replaceable> <optional>, <replaceable>flags</replaceable> </optional>). The parameters are the same as for <function>regexp_split_to_table</function>. </para> <para> Some examples: <programlisting> SELECT foo FROM regexp_split_to_table('the quick brown fox jumps over the lazy dog', '\s+') AS foo; foo ------- the quick brown fox jumps over the lazy dog (9 rows) SELECT regexp_split_to_array('the quick brown fox jumps over the lazy dog', '\s+'); regexp_split_to_array ----------------------------------------------- {the,quick,brown,fox,jumps,over,the,lazy,dog} (1 row) SELECT foo FROM regexp_split_to_table('the quick brown fox', '\s*') AS foo; foo ----- t h e q u i c k b r o w n f o x (16 rows) </programlisting> </para> <para> As the last example demonstrates, the regexp split functions ignore zero-length matches that occur at the start or end of the string or immediately after a previous match. This is contrary to the strict definition of regexp matching that is implemented by the other regexp functions, but is usually the most convenient behavior in practice. Other software systems such as Perl use similar definitions. </para> <para> The <function>regexp_substr</function> function returns the substring that matches a POSIX regular expression pattern, or <literal>NULL</literal> if there is no match. It has the syntax <function>regexp_substr</function>(<replaceable>string</replaceable>, <replaceable>pattern</replaceable> <optional>, <replaceable>start</replaceable> <optional>, <replaceable>N</replaceable> <optional>, <replaceable>flags</replaceable> <optional>, <replaceable>subexpr</replaceable> </optional></optional></optional></optional>). <replaceable>pattern</replaceable> is searched for in <replaceable>string</replaceable>, normally from the beginning of the string, but if the <replaceable>start</replaceable> parameter is provided then beginning from that character index. If <replaceable>N</replaceable> is specified then the <replaceable>N</replaceable>'th match of the pattern is returned, otherwise the first match is returned. The <replaceable>flags</replaceable> parameter is an optional text string containing zero or more single-letter flags that change the function's behavior. Supported flags are described in <xref linkend="posix-embedded-options-table"/>. For a pattern containing parenthesized subexpressions, <replaceable>subexpr</replaceable> is an integer indicating which subexpression is of interest: the result is the substring matching that subexpression. Subexpressions are numbered in the order of their leading parentheses. When <replaceable>subexpr</replaceable> is omitted or zero, the result is the whole match regardless of parenthesized subexpressions. </para> <para> Some examples: <programlisting> regexp_substr('number of your street, town zip, FR', '[^,]+', 1, 2) <lineannotation> town zip</lineannotation> regexp_substr('ABCDEFGHI', '(c..)(...)', 1, 1, 'i', 2) <lineannotation>FGH</lineannotation> </programlisting> </para>  <sect3 id="posix-syntax-details"> <title>Regular Expression Details</title> <para> <productname>PostgreSQL</productname>'s regular expressions are implemented using a software package written by Henry Spencer. Much of the description

The text provides examples of using `regexp_split_to_table` and `regexp_split_to_array` functions in PostgreSQL. It clarifies how zero-length matches are handled. Then, it introduces the `regexp_substr` function, which returns the substring matching a POSIX regular expression pattern. It explains the function's syntax, parameters like start position, N'th match, flags, and subexpression, and provides examples of its usage.