TEI meets Unicode

Identification of languages and scripts

  • Unicode characters have a `script' property. The term script here is not identical to script as used in the Guidelines and elsewhere in linguistic theory. For this, KW suggested `orthography', to avoid confusion among TEI users, who are also confronted with Unicode/XML terminology.
  • Script codes exist in ISO 15924. Indicating the script(s) used in a document (with these codes) will facilitate the selection of appropriate fonts and rendering.
  • TEI could provide a better mechanism for describing the language used in, e.g., the `rend' attribute, which specifies local exceptions to the general rendering of an element. This could, for example, be the language used in the (X)HTML `style' attribute and in CSS stylesheets.
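
As a rough illustration of how ISO 15924 script codes could drive font selection, the sketch below pulls the four-letter script code out of a language tag such as `sr-Cyrl' and looks up a font for it. It assumes the script is recorded as a subtag of a BCP 47-style language tag, which is only one possible way of indicating it. The function names (script_of, pick_font) and all font names are hypothetical and serve only as placeholders.

```python
import re

# Illustrative subset of ISO 15924 codes; the font names are hypothetical.
FONT_FOR_SCRIPT = {
    "Latn": "Example Latin Serif",
    "Cyrl": "Example Cyrillic Serif",
    "Hant": "Example Traditional Han",
}

# ISO 15924 script codes are four letters (e.g. Latn, Cyrl, Hant).
_FOUR_LETTERS = re.compile(r"^[A-Za-z]{4}$")


def script_of(lang_tag):
    """Return the script code found in a tag like 'sr-Cyrl', or None."""
    subtags = lang_tag.split("-")
    for subtag in subtags[1:]:      # skip the primary language subtag
        if len(subtag) == 1:        # singleton: extension or private use follows
            break
        if _FOUR_LETTERS.match(subtag):
            return subtag.title()   # normalise case: 'hant' -> 'Hant'
    return None


def pick_font(lang_tag, default="Example Default Font"):
    """Choose a font for the tag's script, falling back to a default."""
    return FONT_FOR_SCRIPT.get(script_of(lang_tag), default)
```

Calling pick_font("sr-Cyrl") selects the Cyrillic entry from the table, while a tag with no script subtag, such as "en-GB", falls back to the default font.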