Yahoo Web Search

Search results

      • The Script property itself assigns single script values to all Unicode code points, identifying a primary script association, where possible. The Script_Extensions property assigns sets of Script property values, providing more detail for cases where characters are commonly used with multiple scripts.
      www.unicode.org › reports › tr24
  1. People also ask

  2. Special script property values. In addition to explicit or specific script properties, Unicode uses three special values: Common Unicode can assign a character in the UCS to a single script only. However, many characters—those that are not part of a formal natural-language writing system or are unified across many writing systems—may be ...

    • 1 Introduction
    • 2 Usage Model
    • 3 Values
    • Acknowledgments
    • References

    The majority of characters encoded in the Unicode Standard [Unicode] are elements of collections called scripts. Exceptions include symbols, punctuation characters intended for use with multiple scripts, and characters that do not have a stand-alone script identity because they are intended to be used in combination with another character. Therefor...

    2.1 Special vs. Explicit Script Property Values

    The script property values form a full partition of the codespace: every code point is assigned a single script property value. This value is either the value for a specific script, such as Cyrillic,or is one of the following three special values: 1. Inherited—for characters that may be used with multiple scripts, and that inherit their script from the preceding characters. These include nonspacing marks, enclosing marks, and the zero width joiner/non-joiner characters. 2. Common—for other ch...

    2.2 Handling Characters with the Common Script Property

    In determining the boundaries of a run of text in a given script, programs must resolve any of the special script property values, such as Common,based on the context of the surrounding characters. A simple heuristic uses the script of the preceding character, which works well in many cases. However, this may not always produce optimal results. For example, in the text “... gamma (γ) is ...”, this heuristic would cause matching parentheses to be in different scripts. Generally, paired punctua...

    2.3 Handling Combining Marks

    Implementations that determine the boundaries between characters of given scripts should never break between a combining mark (a character with General_Category value of Mc, Mn or Me) and its base character. Thus, for boundary determinations and similar sorts of processing, a combining mark—whatever its script property value—should inherit the script property value of its base character. Spacing combining marks are typically only used with one script and have the corresponding script property...

    Table 2 illustrates some of the script property values used in the Scripts.txt data file. The short name for the Unicode script property value matches the ISO 15924 code. Further subdivisions of scripts by ISO 15924 into varieties are shown in parentheses. For a complete list of values and short names, see PropertyValueAliases.txt [PropValue]. As w...

    Thanks to Julie Allen for comments on this annex, including earlier versions. Asmus Freytag added significant sections to the text for Revisions 7 and 9 and assisted in the rewrite of Section 3 for Revision 13. Eric Muller added Section 2.4 (now 2.5) for Revision 11 and suggested modifications for Section 2.3.

    For references for this annex, see Unicode Standard Annex #41, “Common References for Unicode Standard Annexes.”

  3. Aug 14, 2023 · Summary. This annex describes two related Unicode code point properties. Both properties share the use of Script property values. The Script property itself assigns single script values to all Unicode code points, identifying a primary script association, where possible.

  4. en.wikipedia.org › wiki › UnicodeUnicode - Wikipedia

    The Unicode Standard has imposed rules intended to guarantee stability. Depending on the strictness of a rule, a change can be prohibited or allowed. For example, a "name" given to a code point cannot and will not change. But a "script" property is more flexible, by Unicode's own rules.

  1. People also search for