forked from docs/doc-exports
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com> Co-authored-by: Lu, Huayi <luhuayi@huawei.com> Co-committed-by: Lu, Huayi <luhuayi@huawei.com>
132 lines
23 KiB
HTML
<a name="EN-US_TOPIC_0000001213045052"></a><a name="EN-US_TOPIC_0000001213045052"></a>
|
|
|
|
<h1 class="topictitle1">Collation Support</h1>
|
|
<div id="body0000001213045052"><p id="EN-US_TOPIC_0000001213045052__p8060118">The collation feature allows specifying the data sorting order and data classification rules in a character set. This alleviates the restriction that the <strong id="EN-US_TOPIC_0000001213045052__b679593018019">LC_COLLATE</strong> and <strong id="EN-US_TOPIC_0000001213045052__b1332243615020">LC_CTYPE</strong> settings of a database cannot be changed after its creation.</p>
|
|
<div class="section" id="EN-US_TOPIC_0000001213045052__section1523891573513"><h4 class="sectiontitle">Overview</h4><p id="EN-US_TOPIC_0000001213045052__p1089013283912">Every expression of a collatable data type has a collation. (The built-in collatable data types are text, varchar, and char. User-defined base types can also be marked collatable, and of course a domain over a collatable data type is collatable.) If the expression is a column reference, the collation of the expression is the defined collation of the column. If the expression is a constant, the collation is the default collation of the data type of the constant. The collation of a more complex expression is derived from the collations of its inputs.</p>
|
|
</div>
|
|
<div class="section" id="EN-US_TOPIC_0000001213045052__section15437634113612"><h4 class="sectiontitle">Collation Combination Principles</h4><ul id="EN-US_TOPIC_0000001213045052__ul1384063516372"><li id="EN-US_TOPIC_0000001213045052__li136308531491">The collation of an expression can be the default collation, which means the locale settings defined for the database. It is also possible for an expression's collation to be indeterminate. In such cases, ordering operations and other operations that need to know the collation will fail.</li><li id="EN-US_TOPIC_0000001213045052__li1363018536911">For a function or operator call, the collation that is derived by examining the argument collations is used at run time for performing the specified operation. If the result of the function or operator call is of a collatable data type, the collation is also used as the defined collation of the function or operator expression, in case there is a surrounding expression that requires knowledge of its collation.</li><li id="EN-US_TOPIC_0000001213045052__li363117531294">The collation derivation of an expression can be implicit or explicit. This distinction affects how collations are combined when multiple different collations appear in an expression. An explicit collation derivation occurs when a <strong id="EN-US_TOPIC_0000001213045052__b2606162781511">COLLATE </strong>clause is used; all other collation derivations are implicit. When multiple collations need to be combined, the following rules are used:<ul id="EN-US_TOPIC_0000001213045052__ul194403581916"><li id="EN-US_TOPIC_0000001213045052__li1784063511371">If any input expression has an explicit collation derivation, then all explicitly derived collations among the input expressions must be the same, otherwise an error is raised. 
If any explicitly derived collation is present, that is the result of the collation combination.</li><li id="EN-US_TOPIC_0000001213045052__li18401735123712">Otherwise, all input expressions must have the same implicit collation derivation or the default collation. If any non-default collation is present, that is the result of the collation combination. Otherwise, the result is the default collation.</li><li id="EN-US_TOPIC_0000001213045052__li8840835183710">If there are conflicting non-default implicit collations among the input expressions, then the combination is deemed to have an indeterminate collation. This is not an error condition unless the particular function being invoked requires knowledge of the collation it should apply. If it does, an error will be raised at run time.</li></ul>
|
|
</li><li id="EN-US_TOPIC_0000001213045052__li2767113601014">In a CASE expression, each comparison follows the COLLATE setting in its WHEN clause.</li><li id="EN-US_TOPIC_0000001213045052__li8519163941020">An explicit COLLATE derivation takes effect only within the query level (CTE or subquery) where it is specified. Outside that query level, the collation is treated as implicitly derived.</li></ul>
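<p>The combination rules above can be sketched with a small example. This is an illustrative sketch rather than output captured from a cluster: the table <strong>collate_demo</strong> and its columns are hypothetical, and the sketch assumes that the <strong>"C"</strong> and <strong>"POSIX"</strong> collations are available in the database.</p>
<div class="codecoloring" codetype="Sql"><pre>-- Hypothetical table used only for illustration.
CREATE TABLE collate_demo (
    c_col text COLLATE "C",
    p_col text COLLATE "POSIX",
    d_col text                     -- no COLLATE clause: uses the default collation
);

-- One non-default implicit collation ("C") combined with the default:
-- the comparison is expected to use "C".
SELECT c_col &lt; d_col FROM collate_demo;

-- Conflicting non-default implicit collations: the result is indeterminate,
-- and because "&lt;" needs a collation, an error is expected.
SELECT c_col &lt; p_col FROM collate_demo;

-- An explicit COLLATE clause on one operand takes precedence over the
-- implicit collations, so the comparison is expected to use "C".
SELECT c_col &lt; p_col COLLATE "C" FROM collate_demo;

-- Two different explicitly derived collations: an error is expected.
SELECT c_col COLLATE "C" &lt; p_col COLLATE "POSIX" FROM collate_demo;

-- Concatenation does not need to know the collation, so the conflicting
-- implicit collations are not expected to cause an error here.
SELECT c_col || p_col FROM collate_demo;
</pre></div>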
|
|
</div>
|
|
<div class="section" id="EN-US_TOPIC_0000001213045052__section1751119285181"><h4 class="sectiontitle">Collation Tips</h4><ul id="EN-US_TOPIC_0000001213045052__ul951119286183"><li id="EN-US_TOPIC_0000001213045052__li155105287188">Do not use multiple collations in the same query statement. Otherwise, unexpected result sets may be returned.</li><li id="EN-US_TOPIC_0000001213045052__li851152814186">Do not use multiple COLLATE clauses to specify a collation.</li></ul>
|
|
</div>
|
|
<div class="section" id="EN-US_TOPIC_0000001213045052__section653695253714"><h4 class="sectiontitle">Case-insensitive Collation Support</h4><p id="EN-US_TOPIC_0000001213045052__p73339397397">Cluster versions 8.1.3 and later of GaussDB(DWS) provide the built-in case_insensitive collation, which makes certain operations on character types (such as sorting, comparison, and hashing) case-insensitive.</p>
|
|
<p id="EN-US_TOPIC_0000001213045052__p12464424015">Constraints:</p>
|
|
<ul id="EN-US_TOPIC_0000001213045052__ul1468520471406"><li id="EN-US_TOPIC_0000001213045052__li96856473402">Supported character types: char, character, nchar, varchar, character varying, varchar2, nvarchar2, clob, and text.</li><li id="EN-US_TOPIC_0000001213045052__li1068564784010">The character types <strong id="EN-US_TOPIC_0000001213045052__b333142119295">&quot;char&quot;</strong> (the type name written with double quotation marks) and <strong id="EN-US_TOPIC_0000001213045052__b847162432912">name</strong> are not supported.</li><li id="EN-US_TOPIC_0000001213045052__li126851547114019">The following encodings are not supported: PG_EUC_JIS_2004, PG_MULE_INTERNAL, PG_LATIN10, and PG_WIN874.</li><li id="EN-US_TOPIC_0000001213045052__li10685147174017">It cannot be specified as the <strong id="EN-US_TOPIC_0000001213045052__b6149165210523">LC_COLLATE</strong> value when <strong id="EN-US_TOPIC_0000001213045052__b20885757165219">CREATE DATABASE</strong> is executed.</li><li id="EN-US_TOPIC_0000001213045052__li146861847164012">Regular expressions are not supported.</li><li id="EN-US_TOPIC_0000001213045052__li168634713404">Record comparison of the character type (for example, <strong id="EN-US_TOPIC_0000001213045052__b65481028143214">record_eq</strong>) is not supported.</li><li id="EN-US_TOPIC_0000001213045052__li189910138121">Time series tables are not supported.</li><li id="EN-US_TOPIC_0000001213045052__li2421105512180">Skew optimization is not supported.</li><li id="EN-US_TOPIC_0000001213045052__li123741824101914">RoughCheck optimization is not supported.</li></ul>
|
|
</div>
|
|
<div class="section" id="EN-US_TOPIC_0000001213045052__section0994727174110"><h4 class="sectiontitle">Examples</h4><p id="EN-US_TOPIC_0000001213045052__p8902175472215">Specify the COLLATE clause directly in a statement.</p>
|
|
<div class="codecoloring" codetype="Sql" id="EN-US_TOPIC_0000001213045052__screen69091111260"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal">1</span>
|
|
<span class="normal">2</span>
|
|
<span class="normal">3</span>
|
|
<span class="normal">4</span>
|
|
<span class="normal">5</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">SELECT</span><span class="w"> </span><span class="s1">'a'</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="s1">'A'</span><span class="p">,</span><span class="w"> </span><span class="s1">'a'</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="s1">'A'</span><span class="w"> </span><span class="k">COLLATE</span><span class="w"> </span><span class="n">case_insensitive</span><span class="p">;</span>
|
|
<span class="w"> </span><span class="o">?</span><span class="k">column</span><span class="o">?</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="o">?</span><span class="k">column</span><span class="o">?</span>
|
|
<span class="c1">----------+----------</span>
|
|
<span class="w"> </span><span class="n">f</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">t</span>
|
|
<span class="p">(</span><span class="mi">1</span><span class="w"> </span><span class="k">row</span><span class="p">)</span>
|
|
</pre></div></td></tr></table></div>
|
|
|
|
</div>
|
|
<p id="EN-US_TOPIC_0000001213045052__p987628112320">Set the collation of a column to <strong id="EN-US_TOPIC_0000001213045052__b91914806241110">case_insensitive</strong> when creating a table.</p>
|
|
<div class="codecoloring" codetype="Sql" id="EN-US_TOPIC_0000001213045052__screen23861823248"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal"> 1</span>
|
|
<span class="normal"> 2</span>
|
|
<span class="normal"> 3</span>
|
|
<span class="normal"> 4</span>
|
|
<span class="normal"> 5</span>
|
|
<span class="normal"> 6</span>
|
|
<span class="normal"> 7</span>
|
|
<span class="normal"> 8</span>
|
|
<span class="normal"> 9</span>
|
|
<span class="normal">10</span>
|
|
<span class="normal">11</span>
|
|
<span class="normal">12</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span><span class="w"> </span><span class="n">t1</span><span class="w"> </span><span class="p">(</span><span class="n">a</span><span class="w"> </span><span class="nb">text</span><span class="w"> </span><span class="k">collate</span><span class="w"> </span><span class="n">case_insensitive</span><span class="p">);</span>
|
|
<span class="n">NOTICE</span><span class="p">:</span><span class="w"> </span><span class="n">The</span><span class="w"> </span><span class="s1">'DISTRIBUTE BY'</span><span class="w"> </span><span class="n">clause</span><span class="w"> </span><span class="k">is</span><span class="w"> </span><span class="k">not</span><span class="w"> </span><span class="n">specified</span><span class="p">.</span><span class="w"> </span><span class="k">Using</span><span class="w"> </span><span class="n">round</span><span class="o">-</span><span class="n">robin</span><span class="w"> </span><span class="k">as</span><span class="w"> </span><span class="n">the</span><span class="w"> </span><span class="n">distribution</span><span class="w"> </span><span class="k">mode</span><span class="w"> </span><span class="k">by</span><span class="w"> </span><span class="k">default</span><span class="p">.</span>
|
|
<span class="n">HINT</span><span class="p">:</span><span class="w"> </span><span class="n">Please</span><span class="w"> </span><span class="n">use</span><span class="w"> </span><span class="s1">'DISTRIBUTE BY'</span><span class="w"> </span><span class="n">clause</span><span class="w"> </span><span class="k">to</span><span class="w"> </span><span class="n">specify</span><span class="w"> </span><span class="n">suitable</span><span class="w"> </span><span class="k">data</span><span class="w"> </span><span class="n">distribution</span><span class="w"> </span><span class="k">column</span><span class="p">.</span>
|
|
<span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span>
|
|
<span class="err">\</span><span class="n">d</span><span class="w"> </span><span class="n">t1</span>
|
|
<span class="w"> </span><span class="k">Table</span><span class="w"> </span><span class="ss">"public.t1"</span>
|
|
<span class="w"> </span><span class="k">Column</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="k">Type</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">Modifiers</span>
|
|
<span class="c1">--------+------+--------------------------</span>
|
|
<span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="nb">text</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="k">collate</span><span class="w"> </span><span class="n">case_insensitive</span>
|
|
|
|
<span class="k">INSERT</span><span class="w"> </span><span class="k">INTO</span><span class="w"> </span><span class="n">t1</span><span class="w"> </span><span class="k">values</span><span class="p">(</span><span class="s1">'a'</span><span class="p">),(</span><span class="s1">'A'</span><span class="p">),(</span><span class="s1">'b'</span><span class="p">),(</span><span class="s1">'B'</span><span class="p">);</span>
|
|
<span class="k">INSERT</span><span class="w"> </span><span class="mi">0</span><span class="w"> </span><span class="mi">4</span>
|
|
</pre></div></td></tr></table></div>
|
|
|
|
</div>
|
|
<p id="EN-US_TOPIC_0000001213045052__p72468591233">Because the collation is specified in the column definition at table creation, it does not need to be specified again in queries.</p>
|
|
<div class="codecoloring" codetype="Sql" id="EN-US_TOPIC_0000001213045052__screen186040269244"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal"> 1</span>
|
|
<span class="normal"> 2</span>
|
|
<span class="normal"> 3</span>
|
|
<span class="normal"> 4</span>
|
|
<span class="normal"> 5</span>
|
|
<span class="normal"> 6</span>
|
|
<span class="normal"> 7</span>
|
|
<span class="normal"> 8</span>
|
|
<span class="normal"> 9</span>
|
|
<span class="normal">10</span>
|
|
<span class="normal">11</span>
|
|
<span class="normal">12</span>
|
|
<span class="normal">13</span>
|
|
<span class="normal">14</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">SELECT</span><span class="w"> </span><span class="n">a</span><span class="p">,</span><span class="w"> </span><span class="n">a</span><span class="o">=</span><span class="s1">'a'</span><span class="w"> </span><span class="k">FROM</span><span class="w"> </span><span class="n">t1</span><span class="p">;</span>
|
|
<span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="o">?</span><span class="k">column</span><span class="o">?</span>
|
|
<span class="c1">---+----------</span>
|
|
<span class="w"> </span><span class="n">A</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">t</span>
|
|
<span class="w"> </span><span class="n">B</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">f</span>
|
|
<span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">t</span>
|
|
<span class="w"> </span><span class="n">b</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">f</span>
|
|
<span class="p">(</span><span class="mi">4</span><span class="w"> </span><span class="k">rows</span><span class="p">)</span>
|
|
<span class="k">SELECT</span><span class="w"> </span><span class="n">a</span><span class="p">,</span><span class="w"> </span><span class="k">count</span><span class="p">(</span><span class="mi">1</span><span class="p">)</span><span class="w"> </span><span class="k">FROM</span><span class="w"> </span><span class="n">t1</span><span class="w"> </span><span class="k">GROUP</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="n">a</span><span class="p">;</span>
|
|
<span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="k">count</span>
|
|
<span class="c1">---+-------</span>
|
|
<span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="mi">2</span>
|
|
<span class="w"> </span><span class="n">B</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="mi">2</span>
|
|
<span class="p">(</span><span class="mi">2</span><span class="w"> </span><span class="k">rows</span><span class="p">)</span>
|
|
</pre></div></td></tr></table></div>
|
|
|
|
</div>
|
|
<p id="EN-US_TOPIC_0000001213045052__p17304131517241">In a CASE expression, each comparison follows the COLLATE setting in its WHEN clause.</p>
|
|
<div class="codecoloring" codetype="Sql" id="EN-US_TOPIC_0000001213045052__screen631291792418"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal">1</span>
|
|
<span class="normal">2</span>
|
|
<span class="normal">3</span>
|
|
<span class="normal">4</span>
|
|
<span class="normal">5</span>
|
|
<span class="normal">6</span>
|
|
<span class="normal">7</span>
|
|
<span class="normal">8</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">SELECT</span><span class="w"> </span><span class="n">a</span><span class="p">,</span><span class="k">case</span><span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="k">when</span><span class="w"> </span><span class="s1">'a'</span><span class="w"> </span><span class="k">collate</span><span class="w"> </span><span class="n">case_insensitive</span><span class="w"> </span><span class="k">then</span><span class="w"> </span><span class="s1">'case1'</span><span class="w"> </span><span class="k">when</span><span class="w"> </span><span class="s1">'b'</span><span class="w"> </span><span class="k">collate</span><span class="w"> </span><span class="ss">"C"</span><span class="w"> </span><span class="k">then</span><span class="w"> </span><span class="s1">'case2'</span><span class="w"> </span><span class="k">else</span><span class="w"> </span><span class="s1">'case3'</span><span class="w"> </span><span class="k">end</span><span class="w"> </span><span class="k">FROM</span><span class="w"> </span><span class="n">t1</span><span class="p">;</span>
|
|
<span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="k">case</span>
|
|
<span class="c1">---+-------</span>
|
|
<span class="w"> </span><span class="n">A</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">case1</span>
|
|
<span class="w"> </span><span class="n">B</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">case3</span>
|
|
<span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">case1</span>
|
|
<span class="w"> </span><span class="n">b</span><span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">case2</span>
|
|
<span class="p">(</span><span class="mi">4</span><span class="w"> </span><span class="k">rows</span><span class="p">)</span>
|
|
</pre></div></td></tr></table></div>
|
|
|
|
</div>
|
|
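<p id="EN-US_TOPIC_0000001213045052__p61061642132513">Implicit derivation across subqueries. Outside the subquery, the explicit COLLATE "C" is treated as implicitly derived, so comparing it with the case_insensitive column of t1 produces conflicting implicit collations and an error.</p>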
|
|
<div class="codecoloring" codetype="Sql" id="EN-US_TOPIC_0000001213045052__screen028117872410"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal">1</span>
|
|
<span class="normal">2</span>
|
|
<span class="normal">3</span>
|
|
<span class="normal">4</span>
|
|
<span class="normal">5</span>
|
|
<span class="normal">6</span>
|
|
<span class="normal">7</span>
|
|
<span class="normal">8</span>
|
|
<span class="normal">9</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">SELECT</span><span class="w"> </span><span class="o">*</span><span class="w"> </span><span class="k">FROM</span><span class="w"> </span><span class="p">(</span><span class="k">SELECT</span><span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="k">collate</span><span class="w"> </span><span class="ss">"C"</span><span class="w"> </span><span class="k">from</span><span class="w"> </span><span class="n">t1</span><span class="p">)</span><span class="w"> </span><span class="k">WHERE</span><span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="k">in</span><span class="w"> </span><span class="p">(</span><span class="s1">'a'</span><span class="p">,</span><span class="s1">'b'</span><span class="p">);</span>
|
|
<span class="w"> </span><span class="n">a</span>
|
|
<span class="c1">---</span>
|
|
<span class="w"> </span><span class="n">a</span>
|
|
<span class="w"> </span><span class="n">b</span>
|
|
<span class="p">(</span><span class="mi">2</span><span class="w"> </span><span class="k">rows</span><span class="p">)</span>
|
|
<span class="k">SELECT</span><span class="w"> </span><span class="o">*</span><span class="w"> </span><span class="k">FROM</span><span class="w"> </span><span class="n">t1</span><span class="p">,(</span><span class="k">SELECT</span><span class="w"> </span><span class="n">a</span><span class="w"> </span><span class="k">collate</span><span class="w"> </span><span class="ss">"C"</span><span class="w"> </span><span class="k">from</span><span class="w"> </span><span class="n">t1</span><span class="p">)</span><span class="w"> </span><span class="n">t2</span><span class="w"> </span><span class="k">WHERE</span><span class="w"> </span><span class="n">t1</span><span class="p">.</span><span class="n">a</span><span class="o">=</span><span class="n">t2</span><span class="p">.</span><span class="n">a</span><span class="p">;</span>
|
|
<span class="n">ERROR</span><span class="p">:</span><span class="w"> </span><span class="n">could</span><span class="w"> </span><span class="k">not</span><span class="w"> </span><span class="n">determine</span><span class="w"> </span><span class="n">which</span><span class="w"> </span><span class="k">collation</span><span class="w"> </span><span class="k">to</span><span class="w"> </span><span class="n">use</span><span class="w"> </span><span class="k">for</span><span class="w"> </span><span class="n">string</span><span class="w"> </span><span class="n">hashing</span>
|
|
<span class="n">HINT</span><span class="p">:</span><span class="w"> </span><span class="n">Use</span><span class="w"> </span><span class="n">the</span><span class="w"> </span><span class="k">COLLATE</span><span class="w"> </span><span class="n">clause</span><span class="w"> </span><span class="k">to</span><span class="w"> </span><span class="k">set</span><span class="w"> </span><span class="n">the</span><span class="w"> </span><span class="k">collation</span><span class="w"> </span><span class="n">explicitly</span><span class="p">.</span>
|
|
</pre></div></td></tr></table></div>
|
|
|
|
</div>
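<p>The HINT in the error above points to one possible fix. As a hedged sketch (not output verified on a cluster), adding an explicit COLLATE clause to one side of the join condition gives the comparison a single explicitly derived collation, which, by the combination rules, takes precedence over the two conflicting implicit collations. Which collation to choose depends on whether the join should be case-sensitive.</p>
<div class="codecoloring" codetype="Sql"><pre>-- Join case-insensitively: the explicit clause decides the collation used by "=".
SELECT * FROM t1, (SELECT a COLLATE "C" FROM t1) t2 WHERE t1.a = t2.a COLLATE case_insensitive;

-- Alternatively, join case-sensitively.
SELECT * FROM t1, (SELECT a COLLATE "C" FROM t1) t2 WHERE t1.a = t2.a COLLATE "C";
</pre></div>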
|
|
<div class="caution" id="EN-US_TOPIC_0000001213045052__note18725121420431"><span class="cautiontitle"><img src="public_sys-resources/caution_3.0-en-us.png"> </span><div class="cautionbody"><ul id="EN-US_TOPIC_0000001213045052__ul553910328173"><li id="EN-US_TOPIC_0000001213045052__li16540153211718"><strong id="EN-US_TOPIC_0000001213045052__b413061912553">collate case_insensitive</strong> sorts without regard to case, so the relative order of values that differ only in case is not deterministic. If case-sensitive sorting is applied after <strong id="EN-US_TOPIC_0000001213045052__b2200631175518">collate case_insensitive</strong> sorting, the result set may be unstable. Therefore, do not mix case-sensitive and case-insensitive sorting in the same statement.</li><li id="EN-US_TOPIC_0000001213045052__li1054016322179">Using <strong id="EN-US_TOPIC_0000001213045052__b208731014245">collate case_insensitive</strong> to make character operations case-insensitive affects performance. If you require high performance, use this collation with caution.</li></ul>
|
|
</div></div>
|
|
</div>
|
|
</div>
|
|
|