Why is CJK called "CJK" and not CJ, or CJKV?

In computer systems, there are the concepts of complex layouting, and complex input.

  1. Kanji characters are simple-layout (Wikipedia: “[possible to] display each character one after another”).
  2. Arabic characters are complex-layout (letters shapeshift in response to surrounding letters).
  3. Arabic characters are simple-input. Strike a key, get your result.
  4. Diacritics (ä)/ligatures (Æ) and certain digraphs (きょ) take a special place and we never ;-) talk about them, whether the script is Latin, Arabic, Hebrew or something else.
  5. Hiragana/Katakana are syllabic and thus simple-input. However, most people likely employ complex input for Hira/Kana because it allows them to reuse their muscle memory and not learn a new key arrangement.
  6. Hanzi/Kanji/Hanja are generally complex-input. That has not stopped Google from making a simple-input (but practically-complex) round awkward version.
  7. Korean is technically simple-input because you could input jamo and then have them complex-layouted. (ㄴㅏ'ㄴㅡㄴ => 나는) Complex layouting is more work than complex input, so there was an incentive to store Korean in precomposed form (as syllabic blocks) rather than as jamo.
  8. Japanese computer systems existed as early as 1982, Korean encoding standards date 1987; DOS/V (pictured) debuted in 1990. Complex input was easily possible. Complex layouting however… it seems like Sakhr Arabize Windows 3.1 did it in 1993. Indic layouting seems to not have appeared until Windows 2000.

All of that could have contributed to CJK being CJK rather than CJ, CJKV (CJK+Vietnamese), or CJKA (= CJK+Arabic) / CJKI (= CJK+Indic).

Posted 2024-11-16 00:11 / Tags: I18n, Internationalization. / link

Notes on photo sphere/panoramic EXIF metadata

/ Tags: Photo.