ICU Collation Settings

</entry> </row> <row> <entry><literal>kn</literal></entry> <entry><literal>true</literal>, <literal>false</literal></entry> <entry><literal>false</literal></entry> <entry> If set to <literal>true</literal>, numbers within a string are treated as a single numeric value rather than a sequence of digits. For example, <literal>'id-45'</literal> sorts before <literal>'id-123'</literal>. </entry> </row> <row> <entry><literal>kk</literal></entry> <entry><literal>true</literal>, <literal>false</literal></entry> <entry><literal>false</literal></entry> <entry> <para> Enable full normalization; may affect performance. Basic normalization is performed even when set to <literal>false</literal>. Locales for languages that require full normalization typically enable it by default. </para> <para> Full normalization is important in some cases, such as when multiple accents are applied to a single character. For example, the code point sequences <literal>U&'\0065\0323\0302'</literal> and <literal>U&'\0065\0302\0323'</literal> represent an <literal>e</literal> with circumflex and dot-below accents applied in different orders. With full normalization on, these code point sequences are treated as equal; otherwise they are unequal. </para> </entry> </row> <row> <entry><literal>kr</literal></entry> <entry> <literal>space</literal>, <literal>punct</literal>, <literal>symbol</literal>, <literal>currency</literal>, <literal>digit</literal>, <replaceable>script-id</replaceable> </entry> <entry></entry> <entry> <para> Set to one or more of the valid values, or any BCP 47 <replaceable>script-id</replaceable>, e.g. <literal>latn</literal> ("Latin") or <literal>grek</literal> ("Greek"). Multiple values are separated by "<literal>-</literal>". </para> <para> Redefines the ordering of classes of characters; those characters belonging to a class earlier in the list sort before characters belonging to a class later in the list. For instance, the value <literal>digit-currency-space</literal> (as part of a language tag like <literal>und-u-kr-digit-currency-space</literal>) sorts punctuation before digits and spaces. </para> </entry> </row> <row> <entry><literal>ks</literal></entry> <entry><literal>level1</literal>, <literal>level2</literal>, <literal>level3</literal>, <literal>level4</literal>, <literal>identic</literal></entry> <entry><literal>level3</literal></entry> <entry> Sensitivity (or "strength") when determining equality, with <literal>level1</literal> the least sensitive to differences and <literal>identic</literal> the most sensitive to differences. See <xref linkend="icu-collation-levels"/> for details. </entry> </row> <row> <entry><literal>kv</literal></entry> <entry> <literal>space</literal>, <literal>punct</literal>, <literal>symbol</literal>, <literal>currency</literal> </entry> <entry><literal>punct</literal></entry> <entry> Classes of characters ignored during comparison at level 3. Setting to a later value includes earlier values; e.g. <literal>symbol</literal> also includes <literal>punct</literal> and <literal>space</literal> in the characters to be ignored. Key <literal>ka</literal> must be set to <literal>shifted</literal>

The ICU collation settings continue to provide options for customizing the collation process, including numeric sorting, full normalization, redefinition of character class ordering, sensitivity levels, and character classes to ignore during comparison, with keys such as kn, kk, kr, ks, and kv, each with specific values and default settings that can be used to fine-tune the collation behavior for various use cases.