The orthography of a language specifies a standardized way of using a specific writing system (script) to write the language. Where more than one writing system is used for a language, for example Kurdish, Uyghur or Serbian, there can be more than one orthography. Orthography is distinct from typography
Orthography in English comes from orthographie (French, 13c.), from Latin: orthographia, from Greek ὀρθός orthós, “correct”, and γράφειν gráphein, “to write”.
While “orthography” colloquially is often used synonymously with spelling, spelling is only part of orthography. Other elements of the field of orthography are hyphenation, capitalization, word breaks, emphasis, and punctuation. Orthography describes or defines the set of symbols (graphemes and diacritics) used, and the rules about how to write these symbols.
Most natural languages developed as oral-aural languages, and writing systems have usually been crafted or adapted afterwards as representations of the spoken language. In an etic sense, the rules for writing systems are arbitrary, which is to say that any set of rules could be considered “correct” if the users of the language mutually agreed to convene upon that set of rules as the standard way to represent the spoken language. However, as standardization takes stronger hold, an emic epistemology of “right and wrong” develops, in which compliance with, or violations of, the standards are viewed as right, or wrong, in a way analogous to moral right and wrong, and in which each word has a written identity that is no less standardized than its oral-aural identity, which is emically unitary. The term orthography is sometimes used in a linguistic sense to refer to any method of writing a language, without judgment as to right and wrong, with a scientific understanding that orthographic standardization exists on a spectrum of strength of convention. But the original sense of the word stem, which evolved long before linguistic science, implies a dichotomy of correct and incorrect, and the word stem is still most often used to refer not just to a way of writing a language but more specifically to the thoroughly standardized (emically “correct”) way of writing it.
An orthography may be described as “efficient” if it has one grapheme per phoneme (distinctive speech sound) and vice versa. An orthography may also have varying degrees of efficiency for reading or writing. For example, diverse letter, digraph, and diacritic shapes contribute to diverse word shapes, which aid fluent reading, while heavy use of apostrophes or diacritics makes writing slow, and the use of symbols not found on standard keyboards makes computer or cell phone input awkward.
Typology of spelling systems
A phonemic orthography is an orthography that has a dedicated symbol or sequence of symbols for each phoneme (distinctive speech sound) and vice versa, that is, graphemes and phonemes are bijective functions of one another. Spanish and Italian are very close to being phonemic, and English is among the least phonemic.
A morpho-phonemic orthography considers not only what is phonemic, as above, but also the underlying structure of the words. For example, in English, /s/ and /z/ are distinct phonemes, so in a phonemic orthography the plurals of cat and dog would be cats and dogz. However, English orthography recognizes that the /s/ sound in cats and the /z/ sound in dogs are the same element (archiphoneme), automatically pronounced differently depending on its environment, and therefore writes them the same despite their differing pronunciation. German and Russian are morpho-phonemic in this sense, whereas Turkish is purely phonemic. Korean hangul has changed over the centuries from a highly phonemic to a largely morpho-phonemic orthography, and there are moves in Turkey to make that script more morpho-phonemic as well.
A “defective orthography” is one in which there is not a one-to-one correspondence between the letters and the phonemes in the language, such as those of English or Arabic. Most languages of western Europe (which are written with the Latin alphabet), as well as the modern Greek language to a lesser extent (written with the Greek alphabet), have defective scripts. In some of these, there are sounds with more than one possible spelling, usually for etymological or morpho-phonemic reasons (like /dʒ/ in English, which can be written with ‹j›, ‹g›, ‹dg›, ‹dge›, or ‹ge›). In other cases, there are not enough letters in the alphabet to represent all phonemes. The remaining ones must then be represented by using such devices as diacritics, digraphs that reuse letters with different values (like ‹th› in English, whose sound value is normally not /t/ + /h/), or simply inferred from the context (for example the short vowels in abjads like the Arabic and Hebrew alphabets, which are normally left unwritten).
Another term to describe this characteristic is “deep orthography”. (Note that the term “defective orthography” should not indicate that the writing system is flawed; some defects, such as the aforementioned absence of short vowels in abjads for Semitic languages, serve the languages better than a supposedly “perfect” orthography would.) Deep orthographies are writing systems that do not have a full correspondence between the spoken phoneme and the written grapheme (as listed above). Shallow orthographies, however, have a one-to-one relationship between graphemes and phonemes. The syllabary systems of Japanese (hiragana and katakana) are examples of shallow orthography.
Complex orthographies often combine different types of scripts and/or utilize many different complex punctuation rules. Some widely accepted examples of languages with complex orthographies include Thai, Chinese, Japanese, and Khmer.