Home Explore Blog CI



postgresql

21th chunk of `doc/src/sgml/charset.sgml`
325f0b89fe89d9a9b5288c0f4f83a512765e3ec9c9c8d4b00000000100000fa6
 </entry>
        </row>

        <row>
         <entry><literal>kn</literal></entry>
         <entry><literal>true</literal>, <literal>false</literal></entry>
         <entry><literal>false</literal></entry>
         <entry>
          If set to <literal>true</literal>, numbers within a string are
          treated as a single numeric value rather than a sequence of
          digits. For example, <literal>'id-45'</literal> sorts before
          <literal>'id-123'</literal>.
         </entry>
        </row>

        <row>
         <entry><literal>kk</literal></entry>
         <entry><literal>true</literal>, <literal>false</literal></entry>
         <entry><literal>false</literal></entry>
         <entry>
          <para>
           Enable full normalization; may affect performance. Basic
           normalization is performed even when set to
           <literal>false</literal>. Locales for languages that require full
           normalization typically enable it by default.
          </para>
          <para>
           Full normalization is important in some cases, such as when
           multiple accents are applied to a single character. For example,
           the code point sequences <literal>U&amp;'\0065\0323\0302'</literal>
           and <literal>U&amp;'\0065\0302\0323'</literal> represent
           an <literal>e</literal> with circumflex and dot-below accents
           applied in different orders. With full normalization
           on, these code point sequences are treated as equal; otherwise they
           are unequal.
          </para>
         </entry>
        </row>

        <row>
         <entry><literal>kr</literal></entry>
         <entry>
          <literal>space</literal>, <literal>punct</literal>,
          <literal>symbol</literal>, <literal>currency</literal>,
          <literal>digit</literal>, <replaceable>script-id</replaceable>
         </entry>
         <entry></entry>
         <entry>
          <para>
           Set to one or more of the valid values, or any BCP 47
           <replaceable>script-id</replaceable>, e.g. <literal>latn</literal>
           ("Latin") or <literal>grek</literal> ("Greek"). Multiple values are
           separated by "<literal>-</literal>".
          </para>
          <para>
           Redefines the ordering of classes of characters; those characters
           belonging to a class earlier in the list sort before characters
           belonging to a class later in the list. For instance, the value
           <literal>digit-currency-space</literal> (as part of a language tag
           like <literal>und-u-kr-digit-currency-space</literal>) sorts
           punctuation before digits and spaces.
          </para>
         </entry>
        </row>

        <row>
         <entry><literal>ks</literal></entry>
         <entry><literal>level1</literal>, <literal>level2</literal>, <literal>level3</literal>, <literal>level4</literal>, <literal>identic</literal></entry>
         <entry><literal>level3</literal></entry>
         <entry>
          Sensitivity (or "strength") when determining equality, with
          <literal>level1</literal> the least sensitive to differences and
          <literal>identic</literal> the most sensitive to differences. See
          <xref linkend="icu-collation-levels"/> for details.
         </entry>
        </row>

        <row>
         <entry><literal>kv</literal></entry>
         <entry>
          <literal>space</literal>, <literal>punct</literal>,
          <literal>symbol</literal>, <literal>currency</literal>
         </entry>
         <entry><literal>punct</literal></entry>
         <entry>
          Classes of characters ignored during comparison at level 3. Setting
          to a later value includes earlier values;
          e.g. <literal>symbol</literal> also includes
          <literal>punct</literal> and <literal>space</literal> in the
          characters to be ignored. Key <literal>ka</literal> must be set to
          <literal>shifted</literal>

Title: ICU Collation Settings
Summary
The ICU collation settings continue to provide options for customizing the collation process, including numeric sorting, full normalization, redefinition of character class ordering, sensitivity levels, and character classes to ignore during comparison, with keys such as kn, kk, kr, ks, and kv, each with specific values and default settings that can be used to fine-tune the collation behavior for various use cases.