Religion:Arabic diacritics

From HandWiki
Short description: Diacritics used in the Arabic script
Early written Arabic used only rasm (in black). Later, Arabic added i‘jām diacritics (examples in red) so that letters such as these five ـيـ ,ـنـ ,ـثـ ,ـتـ ,ـبـ (b, t, th, n, y) could be distinguished. Ḥarakāt diacritics (examples in blue)—which is used in the Qur'an but not in most written Arabic—indicate short vowels, long consonants, and some other vocalizations.

Arabic script has numerous diacritics, which include consonant pointing known as Script error: The function "transl" does not exist. (إِعْجَام), and supplementary diacritics known as Script error: The function "transl" does not exist. (تَشْكِيل). The latter include the vowel marks termed Script error: The function "transl" does not exist. (حَرَكَات; singular: حَرَكَة, Script error: The function "transl" does not exist.).

The Arabic script is a modified abjad, where short consonants and long vowels are represented by letters but short vowels and consonant length are not generally indicated in writing. Script error: The function "transl" does not exist. is optional to represent missing vowels and consonant length. Modern Arabic is always written with the i‘jām—consonant pointing, but only religious texts, children's books and works for learners are written with the full tashkīl—vowel guides and consonant length. It is however not uncommon for authors to add diacritics to a word or letter when the grammatical case or the meaning is deemed otherwise ambiguous. In addition, classical works and historic documents rendered to the general public are often rendered with the full tashkīl, to compensate for the gap in understanding resulting from stylistic changes over the centuries.

Tashkil (marks used as phonetic guides)

The literal meaning of تَشْكِيل Script error: The function "transl" does not exist. is 'forming'. As the normal Arabic text does not provide enough information about the correct pronunciation, the main purpose of Script error: The function "transl" does not exist. (and Script error: The function "transl" does not exist.) is to provide a phonetic guide or a phonetic aid; i.e. show the correct pronunciation for children who are learning to read or foreign learners.

The bulk of Arabic script is written without Script error: The function "transl" does not exist. (or short vowels). However, they are commonly used in texts that demand strict adherence to exact pronunciation. This is true, primarily, of the Qur'an ⟨ٱلْقُرْآن⟩ (Script error: The function "transl" does not exist.) and poetry. It is also quite common to add Script error: The function "transl" does not exist. to hadithsٱلْحَدِيث⟩ (Script error: The function "transl" does not exist.; plural: Script error: The function "transl" does not exist.) and the Bible. Another use is in children's literature. Moreover, Script error: The function "transl" does not exist. are used in ordinary texts in individual words when an ambiguity of pronunciation cannot easily be resolved from context alone. Arabic dictionaries with vowel marks provide information about the correct pronunciation to both native and foreign Arabic speakers. In art and calligraphy, Script error: The function "transl" does not exist. might be used simply because their writing is considered aesthetically pleasing.

An example of a fully vocalised (vowelised or vowelled) Arabic from the Bismillah:

بِسْمِ ٱللَّٰهِ ٱلرَّحْمَٰنِ ٱلرَّحِيمِ
Script error: The function "transl" does not exist.
In the name of God, the All-Merciful, the Especially-Merciful.

Some Arabic textbooks for foreigners now use Script error: The function "transl" does not exist. as a phonetic guide to make learning reading Arabic easier. The other method used in textbooks is phonetic romanisation of unvocalised texts. Fully vocalised Arabic texts (i.e. Arabic texts with Script error: The function "transl" does not exist./diacritics) are sought after by learners of Arabic. Some online bilingual dictionaries also provide Script error: The function "transl" does not exist. as a phonetic guide similarly to English dictionaries providing transcription.

Harakat (short vowel marks)

The Script error: The function "transl" does not exist. حَرَكَات, which literally means 'motions', are the short vowel marks. There is some ambiguity as to which Script error: The function "transl" does not exist. are also Script error: The function "transl" does not exist.; the Script error: The function "transl" does not exist., for example, are markers for both vowels and consonants.

Fatḥah

ـَ

The Script error: The function "transl" does not exist.فَتْحَة⟩ is a small diagonal line placed above a letter, and represents a short /a/ (like the /a/ sound in the English word "cat"). The word Script error: The function "transl" does not exist. itself (فَتْحَة) means opening and refers to the opening of the mouth when producing an /a/. For example, with Script error: The function "transl" does not exist. (henceforth, the base consonant in the following examples): ⟨دَ/da/.

When a Script error: The function "transl" does not exist. is placed before a plain letter ⟨ا⟩ (Script error: The function "transl" does not exist.) (i.e. one having no hamza or vowel of its own), it represents a long /aː/ (close to the sound of "a" in the English word "dad", with an open front vowel /æː/, not back /ɑː/ as in "father"). For example: ⟨دَا/daː/. The Script error: The function "transl" does not exist. is not usually written in such cases. When a fathah is placed before the letter ⟨⟩ (yā’), it creates an /aj/ (as in "lie"); and when placed before the letter ⟨و⟩ (wāw), it creates an /aw/ (as in "cow").

Although paired with a plain letter creates an open front vowel (/a/), often realized as near-open (/æ/), the standard also allows for variations, especially under certain surrounding conditions. Usually, in order to have the more central (/ä/) or back (/ɑ/) pronunciation, the word features a nearby back consonant, such as the emphatics, as well as Script error: The function "transl" does not exist., or Script error: The function "transl" does not exist.. A similar "back" quality is undergone by other vowels as well in the presence of such consonants, however not as drastically realized as in the case of Script error: The function "transl" does not exist..[1][2][3]

Kasrah

ـِ

A similar diagonal line below a letter is called a Script error: The function "transl" does not exist.كَسْرَة⟩ and designates a short /i/ (as in "me", "be") and its allophones [i, ɪ, e, e̞, ɛ] (as in "Tim", "sit"). For example: ⟨دِ/di/.[4]

When a Script error: The function "transl" does not exist. is placed before a plain letter ⟨⟩ (Script error: The function "transl" does not exist.), it represents a long /iː/ (as in the English word "steed"). For example: ⟨دِي/diː/. The Script error: The function "transl" does not exist. is usually not written in such cases, but if Script error: The function "transl" does not exist. is pronounced as a diphthong /aj/, Script error: The function "transl" does not exist. should be written on the preceding consonant to avoid mispronunciation. The word Script error: The function "transl" does not exist. means 'breaking'.[1]

Ḍammah

ـُ

The Script error: The function "transl" does not exist.ضَمَّة⟩ is a small curl-like diacritic placed above a letter to represent a short /u/ (as in "duke", shorter "you") and its allophones [u, ʊ, o, o̞, ɔ] (as in "put", or "bull"). For example: ⟨دُ/du/.[4]

When a Script error: The function "transl" does not exist. is placed before a plain letter ⟨و⟩ (Script error: The function "transl" does not exist.), it represents a long /uː/ (like the 'oo' sound in the English word "swoop"). For example: ⟨دُو/duː/. The Script error: The function "transl" does not exist. is usually not written in such cases, but if Script error: The function "transl" does not exist. is pronounced as a diphthong /aw/, Script error: The function "transl" does not exist. should be written on the preceding consonant to avoid mispronunciation.[1]

The word ḍammah (ضَمَّة) in this context means rounding, since it is the only rounded vowel in the vowel inventory of Arabic.

Alif Khanjariyah

ــٰ

The superscript (or dagger) Script error: The function "transl" does not exist.أَلِف خَنْجَرِيَّة⟩ (Script error: The function "transl" does not exist.), is written as short vertical stroke on top of a consonant. It indicates a long /aː/ sound for which Script error: The function "transl" does not exist. is normally not written. For example: ⟨هَٰذَا⟩ (Script error: The function "transl" does not exist.) or ⟨رَحْمَٰن⟩ (Script error: The function "transl" does not exist.).

The dagger Script error: The function "transl" does not exist. occurs in only a few words, but they include some common ones; it is seldom written, however, even in fully vocalised texts. Most keyboards do not have dagger Script error: The function "transl" does not exist.. The word Allahالله⟩ (Script error: The function "transl" does not exist.) is usually produced automatically by entering Script error: The function "transl" does not exist.. The word consists of Script error: The function "transl" does not exist. + ligature of doubled Script error: The function "transl" does not exist. with a Script error: The function "transl" does not exist. and a dagger Script error: The function "transl" does not exist. above Script error: The function "transl" does not exist..

Maddah

ـٓآ

The Script error: The function "transl" does not exist.مَدَّة⟩ is a tilde-shaped diacritic, which can only appear on top of an Script error: The function "transl" does not exist. (آ) and indicates a glottal stop /ʔ/ followed by a long /aː/.

In theory, the same sequence /ʔaː/ could also be represented by two Script error: The function "transl" does not exist.s, as in *⟨أَا⟩, where a hamza above the first Script error: The function "transl" does not exist. represents the /ʔ/ while the second Script error: The function "transl" does not exist. represents the /aː/. However, consecutive Script error: The function "transl" does not exist.s are never used in the Arabic orthography. Instead, this sequence must always be written as a single Script error: The function "transl" does not exist. with a Script error: The function "transl" does not exist. above it, the combination known as an Script error: The function "transl" does not exist.. For example: ⟨قُرْآن/qurˈʔaːn/.

Alif waslah

ٱ

The Script error: The function "transl" does not exist.وَصْلَة⟩, Script error: The function "transl" does not exist.أَلِف وَصْلَة⟩ or Script error: The function "transl" does not exist.هَمْزَة وَصْل⟩ looks like a small letter Script error: The function "transl" does not exist. on top of an Script error: The function "transl" does not exist.ٱ⟩ (also indicated by an Script error: The function "transl" does not exist.ا⟩ without a Script error: The function "transl" does not exist.). It means that the Script error: The function "transl" does not exist. is not pronounced when its word does not begin a sentence. For example: ⟨بِٱسْمِ⟩ (Script error: The function "transl" does not exist.), but ⟨ٱمْشُوا۟⟩ (imshū not mshū). This is because no Arab word can start with a vowel-less consonant: If the second letter from the Script error: The function "transl" does not exist. has a kasrah, the alif-waslah makes the sound /i/. However, when the second letter from it has a dammah, it makes the sound /u/.

It occurs only in the beginning of words, but it can occur after prepositions and the definite article. It is commonly found in imperative verbs, the perfective aspect of verb stems VII to X and their verbal nouns (Script error: The function "transl" does not exist.). The alif of the definite article is considered a Script error: The function "transl" does not exist..

It occurs in phrases and sentences (connected speech, not isolated/dictionary forms):

  • To replace the elided hamza whose alif-seat has assimilated to the previous vowel. For example: فِي ٱلْيَمَن or في اليمن (Script error: The function "transl" does not exist.) ‘in Yemen’.
  • In hamza-initial imperative forms following a vowel, especially following the conjunction ⟨و⟩ (Script error: The function "transl" does not exist.) ‘and’. For example: َقُمْ وَٱشْرَبِ ٱلْمَاءَ (Script error: The function "transl" does not exist.) ‘rise and then drink the water’.

Like the superscript alif, it is not written in fully vocalized scripts, except for sacred texts, like the Quran and Arabized Bible.

Sukūn

ـْـ

The Script error: The function "transl" does not exist.سُكُونْ⟩ is a circle-shaped diacritic placed above a letter ( ْ). It indicates that the consonant to which it is attached is not followed by a vowel, i.e., zero-vowel.

It is a necessary symbol for writing consonant-vowel-consonant syllables, which are very common in Arabic. For example: ⟨دَدْ⟩ (Script error: The function "transl" does not exist.).

The Script error: The function "transl" does not exist. may also be used to help represent a diphthong. A Script error: The function "transl" does not exist. followed by the letter ⟨⟩ (Script error: The function "transl" does not exist.) with a Script error: The function "transl" does not exist. over it (ـَيْ) indicates the diphthong Script error: The function "transl" does not exist. (IPA /aj/). A Script error: The function "transl" does not exist., followed by the letter ⟨⟩ (Script error: The function "transl" does not exist.) with a Script error: The function "transl" does not exist., (ـَوْ) indicates /aw/.

ـۡـ

The Script error: The function "transl" does not exist. may have also an alternative form of the small high head of Script error: The function "transl" does not exist. (U+06E1 ۡ ), particularly in some Qurans. Other shapes may exist as well (for example, like a small comma above ⟨ʼ⟩ or like a circumflex ⟨ˆ⟩ in Script error: The function "transl" does not exist.).[5]

Tanwin (final postnasalized or long vowels)

ـٌ‎  ـٍ‎  ـً

The three vowel diacritics may be doubled at the end of a word to indicate that the vowel is followed by the consonant n. They may or may not be considered Script error: The function "transl" does not exist. and are known as Script error: The function "transl" does not exist.تَنْوِين⟩, or nunation. The signs indicate, from left to right, Script error: The function "transl" does not exist..

These endings are used as non-pausal grammatical indefinite case endings in Literary Arabic or classical Arabic (triptotes only). In a vocalised text, they may be written even if they are not pronounced (see pausa). See Script error: The function "transl" does not exist. for more details. In many spoken Arabic dialects, the endings are absent. Many Arabic textbooks introduce standard Arabic without these endings. The grammatical endings may not be written in some vocalized Arabic texts, as knowledge of Script error: The function "transl" does not exist. varies from country to country, and there is a trend towards simplifying Arabic grammar.

The sign ⟨ـً⟩ is most commonly written in combination with ⟨ـًا⟩ (Script error: The function "transl" does not exist.), ⟨ةً⟩ (Script error: The function "transl" does not exist.), ⟨أً⟩ (alif hamzah) or stand-alone ⟨ءً⟩ (Script error: The function "transl" does not exist.). Script error: The function "transl" does not exist. should always be written (except for words ending in Script error: The function "transl" does not exist. or diptotes) even if Script error: The function "transl" does not exist. is not. Grammatical cases and Script error: The function "transl" does not exist. endings in indefinite triptote forms:

  • Script error: The function "transl" does not exist.: nominative case;
  • Script error: The function "transl" does not exist.: accusative case, also serves as an adverbial marker;
  • Script error: The function "transl" does not exist.: genitive case.

Shaddah (consonant gemination mark)

ـّـ

The shadda or shaddahشَدَّة⟩ (Script error: The function "transl" does not exist.), or tashdidتَشْدِيد⟩ (Script error: The function "transl" does not exist.), is a diacritic shaped like a small written Latin "w".

It is used to indicate gemination (consonant doubling or extra length), which is phonemic in Arabic. It is written above the consonant which is to be doubled. It is the only Script error: The function "transl" does not exist. that is commonly used in ordinary spelling to avoid ambiguity. For example: ⟨دّ/dd/; Script error: The function "transl" does not exist.مَدْرَسَة⟩ ('school') vs. Script error: The function "transl" does not exist.مُدَرِّسَة⟩ ('teacher', female).

I‘jām (phonetic distinctions of consonants)

7th-century kufic script without any Script error: The function "transl" does not exist. or Script error: The function "transl" does not exist..

The i‘jāmإِعْجَام⟩ (sometimes also called nuqaṭ)[6] are the diacritic points that distinguish various consonants that have the same form (Script error: The function "transl" does not exist.), such as ⟨ـبـ/b/ ب, ⟨ـتـ/t/ ت, ⟨ـثـ/θ/ ث, ⟨ـنـ/n/ ن, and ⟨ـيـ/j/ ي. Typically i‘jām are not considered diacritics but part of the letter.

Early manuscripts of the Script error: The function "transl" does not exist. did not use diacritics either for vowels or to distinguish the different values of the Script error: The function "transl" does not exist.. Vowel pointing was introduced first, as a red dot placed above, below, or beside the Script error: The function "transl" does not exist., and later consonant pointing was introduced, as thin, short black single or multiple dashes placed above or below the rasm (image). These i‘jām became black dots about the same time as the Script error: The function "transl" does not exist. became small black letters or strokes.

Typically, Egyptians do not use dots under final Script error: The function "transl" does not exist.ي⟩, which looks exactly like Script error: The function "transl" does not exist.ى⟩ in handwriting and in print. This practice is also used in copies of the Script error: The function "transl" does not exist. (Qurʾān) scribed by Script error: The function "transl" does not exist.. The same unification of Script error: The function "transl" does not exist. and Script error: The function "transl" does not exist. has happened in Persian, resulting in what the Unicode Standard calls "Arabic Letter Farsi Yeh", that looks exactly the same as Script error: The function "transl" does not exist. in initial and medial forms, but exactly the same as Script error: The function "transl" does not exist. in final and isolated forms ⟨یـ  ـیـ  ـی⟩.

Isolated kāf with ‘alāmātu-l-ihmāl and without top stroke next to initial kāf with top stroke.
سۡ سۜ سۣ سٚ ڛ
Several ways of writing /s/.

At the time when the i‘jām was optional, unpointed letters were ambiguous. To clarify that a letter would lack i‘jām in pointed text (i.e. ⟨ح/ħ/, ⟨د/d/, ⟨ر/r/, ⟨س/s/, ⟨ص/sˤ/, ⟨ط/tˤ/, ⟨ع/ʕ/, ⟨ل/l/, ⟨ه/h/), the letter could be marked with a small v- or seagull-shaped diacritic above, also a superscript semicircle (crescent), a subscript dot (except in the case of ⟨ح⟩; three dots were used with ⟨س⟩), or a subscript miniature of the letter itself. A superscript stroke known as jarrah, resembling a long fatħah, was used for a contracted (assimilated) sin. Thus ⟨ڛ سۣ سۡ سٚ⟩ were all used to indicate that the letter in question was truly ⟨س⟩ and not ⟨ش⟩.[7] These signs, collectively known as ‘alāmātu-l-ihmāl, are still occasionally used in modern Arabic calligraphy, either for their original purpose (i.e. marking letters without i‘jām), or often as purely decorative space-fillers. The small ک above the kāf in its final and isolated forms ⟨ك  ـك⟩ was originally an ‘alāmatu-l-ihmāl that became a permanent part of the letter. Previously this sign could also appear above the medial form of kāf, when that letter was written without the stroke on its ascender. When kaf was written without that stroke, it could be mistaken for lam, thus kaf was distinguished with a superscript kaf or a small superscript hamza (nabrah), and lam with a superscript l-a-m (lam-alif-mim).[8]

Hamza (glottal stop semi-consonant)

ئ  ؤ  إ  أ ء

Although normally a diacritic is not considered a letter of the alphabet, the hamza هَمْزة (Script error: The function "transl" does not exist., glottal stop), often stands as a separate letter in writing, is written in unpointed texts and is not considered a Script error: The function "transl" does not exist.. It may appear as a letter by itself or as a diacritic over or under an Script error: The function "transl" does not exist., Script error: The function "transl" does not exist., or Script error: The function "transl" does not exist..

Which letter is to be used to support the Script error: The function "transl" does not exist. depends on the quality of the adjacent vowels;

  • If the glottal stop occurs at the beginning of the word, it is always indicated by hamza on an Script error: The function "transl" does not exist.: above if the following vowel is /a/ or /u/ and below if it is /i/.
  • If the glottal stop occurs in the middle of the word, Script error: The function "transl" does not exist. above Script error: The function "transl" does not exist. is used only if it is not preceded or followed by /i/ or /u/:
    • If /i/ is before or after the glottal stop, a Script error: The function "transl" does not exist. with a Script error: The function "transl" does not exist. is used (the two dots which are usually beneath the Script error: The function "transl" does not exist. disappear in this case): ⟨ئ⟩.
    • Otherwise, if /u/ is before or after the glottal stop, a Script error: The function "transl" does not exist. with a Script error: The function "transl" does not exist. is used: ⟨ؤ⟩.
  • If the glottal stop occurs at the end of the word (ignoring any grammatical suffixes), if it follows a short vowel it is written above Script error: The function "transl" does not exist., Script error: The function "transl" does not exist., or Script error: The function "transl" does not exist. the same as for a medial case; otherwise on the line (i.e. if it follows a long vowel, diphthong or consonant).
  • Two Script error: The function "transl" does not exist.s in succession are never allowed: /ʔaː/ is written with Script error: The function "transl" does not exist.آ⟩ and /aːʔ/ is written with a free Script error: The function "transl" does not exist. on the line ⟨اء⟩.

Consider the following words: ⟨أَخ/ʔax/ ("brother"), ⟨إسْماعِيل/ʔismaːʕiːl/ ("Ismael"), ⟨أُمّ/ʔumm/ ("mother"). All three of above words "begin" with a vowel opening the syllable, and in each case, Script error: The function "transl" does not exist. is used to designate the initial glottal stop (the actual beginning). But if we consider middle syllables "beginning" with a vowel: ⟨نَشْأة/naʃʔa/ ("origin"), ⟨أَفْئِدة/ʔafʔida/ ("hearts"—notice the /ʔi/ syllable; singular ⟨فُؤاد/fuʔaːd/), ⟨رُؤُوس/ruʔuːs/ ("heads", singular ⟨رَأْس/raʔs/), the situation is different, as noted above. See the comprehensive article on Script error: The function "transl" does not exist. for more details.

Tone markers

Historically Arabic script has been adopted and used by many tonal languages, examples include Xiao'erjing for Mandarin Chinese as well as Ajami script adopted for writing various languages of Western Africa. However, one of the shortcomings of Arabic, especially in comparison to Latin-derived scripts or other indigenous writing systems was that Arabic did not have a way of indicating tones.

However, in the adoption of the Arabic Script for Rohingya language, known as Rohingya Fonna, 3 tone markers have been developed and used in manuscripts. These tone markers form part of the standardized and accepted orthographic convention of Rohingya. This is the only known instance of tone markers within the Arabic script.[9][10]

Tone markers act as "modifiers" of vowel diacritics. In simpler words, they are "diacritics for the diacritics". They are written "outside" of the word, meaning that they are written above the vowel diacritic if the diacritic is written above the word, and they are written below the diacritic if the diacritic is written below the word. They are only ever written where there are vowel diacritics. This is important to note, as without the diacritic present, there is no way to distinguish between tone markers and I‘jām i.e. dots that are used for purpose of phonetic distinctions of consonants.

◌࣪ / ◌࣭

The Hārbāy as it is called in Rohingya, is a single dot that's placed on top of Fatḥah and Ḍammah, or curly Fatḥah and curly Ḍammah (vowel diacritics unique to Rohinghya), or their respective Fatḥatan and Ḍammatan versions, and it's placed underneath Kasrah or curly Kasrah, or their respective Kasratan version. (e.g. دً࣪ / دٌ࣪ / دࣨ࣪ / دٍ࣭) This tone marker indicates a short high tone (/˥/).[9][10]

◌࣫ / ◌࣮

The Ṭelā as it is called in Rohingya, is two dots that are placed on top of Fatḥah and Ḍammah, or curly Fatḥah and curly Ḍammah, or their respective Fatḥatan and Ḍammatan versions, and it's placed underneath Kasrah or curly Kasrah, or their respective Kasratan version. (e.g. دَ࣫ / دُ࣫ / دِ࣮) This tone marker indicates a long falling tone (/˥˩/).[9][10]

◌࣬ / ◌࣯

The Ṭāna as it is called in Rohingya, is a fish-like looping line that is placed on top of Fatḥah and Ḍammah, or curly Fatḥah and curly Ḍammah, or their respective Fatḥatan and Ḍammatan versions, and it's placed underneath Kasrah or curly Kasrah, or their respective Kasratan version. (e.g. دࣤ࣬ / دࣥ࣬ / دࣦ࣯) This tone marker indicates a long rising tone (/˨˦/).[9][10]

History

Evolution of early Arabic calligraphy (9th–11th century). The Basmala was taken as an example, from kufic Script error: The function "transl" does not exist. manuscripts. (1) Early 9th century, script with no dots or diacritic marks (see image of early Basmala Kufic);
(2) and (3) 9th–10th century under Abbasid dynasty, Abu al-Aswad's system established red dots with each arrangement or position indicating a different short vowel; later, a second black-dot system was used to differentiate between letters like Script error: The function "transl" does not exist. and Script error: The function "transl" does not exist. (see image of middle Kufic);
(4) 11th century, in al-Farāhídi's system (system we know today) dots were changed into shapes resembling the letters to transcribe the corresponding long vowels (see image of modern Kufic in Qur'an).

According to tradition, the first to commission a system of harakat was Ali who appointed Abu al-Aswad al-Du'ali for the task. Abu al-Aswad devised a system of dots to signal the three short vowels (along with their respective allophones) of Arabic. This system of dots predates the Script error: The function "transl" does not exist., dots used to distinguish between different consonants.

Abu al-Aswad's system

Abu al-Aswad's system of Harakat was different from the system we know today. The system used red dots with each arrangement or position indicating a different short vowel.

A dot above a letter indicated the vowel Script error: The function "transl" does not exist., a dot below indicated the vowel Script error: The function "transl" does not exist., a dot on the side of a letter stood for the vowel Script error: The function "transl" does not exist., and two dots stood for the Script error: The function "transl" does not exist..

However, the early manuscripts of the Qur'an did not use the vowel signs for every letter requiring them, but only for letters where they were necessary for a correct reading.

Al Farahidi's system

The precursor to the system we know today is Al Farahidi's system. Script error: The function "transl" does not exist. found that the task of writing using two different colours was tedious and impractical. Another complication was that the Script error: The function "transl" does not exist. had been introduced by then, which, while they were short strokes rather than the round dots seen today, meant that without a color distinction the two could become confused.

Accordingly, he replaced the Script error: The function "transl" does not exist. with small superscript letters: small alif, yā’, and wāw for the short vowels corresponding to the long vowels written with those letters, a small s(h)īn for shaddah (geminate), a small khā’ for khafīf (short consonant; no longer used). His system is essentially the one we know today.[11]

Automatic diacritization

The process of automatically restoring diacritical marks is called diacritization or diacritic restoration. It is useful to avoid ambiguity in applications such as Arabic machine translation, text-to-speech, and information retrieval. Automatic diacritization algorithms have been developed.[12][13] For Modern Standard Arabic, the state-of-the-art algorithm has a word error rate (WER) of 4.79%. The most common mistakes are proper nouns and case endings.[14] Similar algorithms exist for other varieties of Arabic.[15]

See also

  • Arabic alphabet:
    • Script error: The function "transl" does not exist. (إِعْرَاب), the case system of Arabic
    • Script error: The function "transl" does not exist. (رَسْم), the basic system of Arabic consonants
    • Script error: The function "transl" does not exist. (تَجْوِيد), the phonetic rules of recitation of Qur'an in Arabic
  • Hebrew:
    • Hebrew diacritics, the Hebrew equivalent
    • Niqqud, the Hebrew equivalent of Script error: The function "transl" does not exist.
    • Dagesh, the Hebrew diacritic similar to Arabic Script error: The function "transl" does not exist. and shaddah

References

  1. 1.0 1.1 1.2 Karin C. Ryding, "A Reference Grammar of Modern Standard Arabic", Cambridge University Press, 2005, pgs. 25-34, specifically “Chapter 2, Section 4: Vowels”
  2. Anatole Lyovin, Brett Kessler, William Ronald Leben, "An Introduction to the Languages of the World", "5.6 Sketch of Modern Standard Arabic", Oxford University Press, 2017, pg. 255, Edition 2, specifically “5.6.2.2 Vowels”
  3. Amine Bouchentouf, Arabic For Dummies®, John Wiley & Sons, 2018, 3rd Edition, specifically section "All About Vowels"
  4. 4.0 4.1 "Introduction to Written Arabic". http://web.uvic.ca/hrd/hist455/vowels/vowels_pres.htm. 
  5. "Arabic character notes". https://r12a.github.io/scripts/arabic/block#char0652. 
  6. Ibn Warraq (2002). Ibn Warraq. ed. What the Koran Really Says : Language, Text & Commentary. New York: Prometheus. p. 64. ISBN 1-57392-945-X. https://www.ebooklibs.co/book/view/1m41/what-the-koran-really-says.html. Retrieved 9 April 2019. 
  7. Gacek, Adam (2009). "Unpointed letters". Arabic Manuscripts: A Vademecum for Readers. BRILL. p. 286. ISBN 978-90-04-17036-0. https://books.google.com/books?id=NeaHnLb6RdUC&pg=PA286. 
  8. Gacek, Adam (1989). "Technical Practices and Recommendations Recorded by Classical and Post-Classical Arabic Scholars Concerning the Copying and Correction of Manuscripts". in Déroche, François. Les manuscrits du Moyen-Orient: essais de codicologie et de paléographie. Actes du colloque d'Istanbul (Istanbul 26–29 mai 1986). p. 57 (§ 8. Diacritical marks and vowelisation). http://www.islamicmanuscripts.info/reference/books/Deroche-1989-MMO/MMO-1989-051-060-Gacek.pdf. 
  9. 9.0 9.1 9.2 9.3 Priest, Lorna A.; Hosken, Martin (10 August 2010). "Proposal to add Arabic script characters for African and Asian languages". https://www.unicode.org/L2/L2010/10288r-arabic-proposal.pdf. 
  10. 10.0 10.1 10.2 10.3 Pandey, Anshuman (27 October 2015). "Proposal to encode the Hanifi Rohingya script in Unicode". http://unicode.org/L2/L2015/15278-hanifi-rohingya.pdf. 
  11. Versteegh, C. H. M. (1997). The Arabic Language. Columbia University Press. pp. 56ff. ISBN 978-0-231-11152-2. https://books.google.com/books?id=2tghviSsrF8C. 
  12. Azmi, Aqil M.; Almajed, Reham S. (2013-10-10). "A survey of automatic Arabic diacritization techniques" (in en). Natural Language Engineering 21 (3): 477–495. doi:10.1017/S1351324913000284. ISSN 1351-3249. https://www.cambridge.org/core/product/identifier/S1351324913000284/type/journal_article. 
  13. Almanea, Manar (2021). "Automatic Methods and Neural Networks in Arabic Texts Diacritization: A Comprehensive Survey". IEEE Access 9: 145012–145032. doi:10.1109/ACCESS.2021.3122977. ISSN 2169-3536. 
  14. Thompson, Brian; Alshehri, Ali (2021-09-28). "Improving Arabic Diacritization by Learning to Diacritize and Translate". arXiv:2109.14150 [cs.CL].
  15. Masmoudi, Abir; Aloulou, Chafik; Abdellahi, Abdel Ghader Sidi; Belguith, Lamia Hadrich (2021-08-08). "Automatic diacritization of Tunisian dialect text using SMT model" (in en). International Journal of Speech Technology 25: 89–104. doi:10.1007/s10772-021-09864-6. ISSN 1572-8110. https://doi.org/10.1007/s10772-021-09864-6.