Tagged: persian

Persian loanwords in Arabic


Pahlavi inscription from the mid-Sassanian period. The Pahlavi script was based on Aramaic and is a testament to the cultural exchange between Iranian and Semitic languages. Source.

Much has been said about the influence of Arabic, as the language through which Islam was spread, on the speech of majority Muslim peoples. However, not nearly as much attention has been paid to how other languages have shaped and changed Arabic. Persian, mostly through word borrowings, has been one of the most important — if not the most important — of these languages, and what follows is a brief description of the nature of Persian’s lexical influence and how Arabic adapted its sound and word structure.


The first thing to note is not all Persian words entered into Arabic directly. Many of them were borrowed through Aramaic, which was the major lingua franca and trade language of the region (as well as serving as the official language of the Achaemenid dynasty of ancient Persia). As such, many of the Arabic words for spices, plants, precious stones, and other common goods are Persian in origin. Examples include the following:

  • Arab. ‹ballūr› ‘crystal’ < Pers. ‹belūr›
  • ‹fayrūz› ‘turquoise’ < ‹pīrūze›
  • ‹hāl›/‹hayl› ‘cardamom’ < ‹hel›
  • ‹ˀibrīq› ‘water jug’ < ultimately from ‹āb› ‘water’ + ‹rīxtan› ‘to pour’
  • ‹kanz› ‘treasure’ < Mid. Pers. ‹ganj› via Aram.
  • ‹lāzaward› ‘lapis lazuli’ < ‹lāj(a)vard›
  • ‹līmūn› ‘lemon’ < ‹līmū›
  • ‹marjān› ‘coral’ < Mid. Pers. ‹murvārīt› ‘pearl’ via Aram. ‹margānītā›
  • ‹mawz› ‘banana’ < Mid. Pers. ‹mōz›
  • ‹misk› ‘musk’ < Mid. Pers. ‹mušk›
  • ‹nisrīn› ‘dog rose’ < ‹nasrīn›
  • ‹sabānix› ‘spinach’ < ‹aspanāx›
  • ‹sunbul› ‘hyacinth’ < ‹sonbol›
  • ‹šabat›/‹šibitt› ‘dill’ < ‹ševīd›
  • ‹xiyār› ‘cucumber’ < ‹xiyār›
  • ‹yāqūt› ‘ruby’ < ‹yāqūt›
  • ‹yašb› ‘jasper’ < ‹yashp›
  • ‹yasmīn› ‘jasmine’ < ‹yāsamīn›
  • ‹zanjabīl› ‘ginger’ < Mid. Pers.‹singavēr› via Aram.
  • ‹zumurrud› ‘emerald’ < ‹zomorrod›

The defining characteristic of the majority of these loanwords is that they diverge from the typical native Arabic word structure, which is usually composed of only three root consonants that are modified according to specific, productive patterns. Words like <lāzaward> and <zanjabīl> immediately stand out in this regard.

Some of the words that seem to look like Arabic words conflict in meaning. For example, the word ‹misk› seems to have the root m-s-k, but there is an existing native root meaning ‘touch, grasp.’ Similarly, ‹xiyār› ‘cucumber’ conflicts with the root x-y-r meaning ‘choice.’


Persian and Arabic, although having a long history of interaction and mutual influence, are not related languages. The former is a member of the Indo-European language family and is genetically related to Greek, Latin, Armenian, Sanskrit, English, etc. Arabic, on the other hand, is a member of the Afro-Asiatic language family, like Aramaic, Ge‘ez, Tamazight, Hausa, Somali, etc. There are significant differences between the two, including in their respective sound systems, which means that when words are borrowed, they must undergo a process of (sometimes drastic) adaptation to the sound, syllable, and word structure of the borrowing language.

This happens in Arabic borrowings into Persian; for example, Arabic ‹ádab› ‘discipline, politeness’ (stress on the first syllable in accordance with Arabic stress rules) becomes ‹adáb› in Persian (stress on the second syllable in accordance with Persian stress rules). And certainly the reverse is true in that Arabic also adapted the pronunciation and structure of Persian words into a format compatible with its grammar. The following three sections highlight some of these changes.


Many words in Middle Persian ended in ‹-g›; for example, the name of the language was ‹pārsīg›, ‘plan’ was ‹barnāmag›, and ‘pistachio’ was ‹pistag›. Middle Persian was spoken up until the 9th century, meaning that Arabic borrowed many words from this stage of Persian, including ones ending in ‹-g›. Both Standard Arabic (whether modern or classical) lacks a ‹g›, so the reflex of the Persian is usually ‹j› or ‹q›. Thus Middle Persian ‹barnāmag› became Arabic ‹barnāmij› and ‹pistag› became ‹fustuq›. It should be noted that ‹j› developed from an original ‹g› and that ‹q› is pronounced as ‹g› in many varieties of Arabic, including ancient dialects from the pre-Islamic and early Islamic eras. So the alternative forms that we see in Arabic are not random substitutions.

As with all languages, Middle Persian experienced changes and eventually developed into New (i.e. modern) Persian. Among the changes was that the final ‹-g› was dropped. In modern Persian, the name of the language is ‹pārsī› or <fārsī>, ‘plan’ is ‹barnāme›, and ‘pistachio’ is ‹peste›. But this sound change only occurred in Persian, meaning it had no effect on any Persian loanwords in Arabic. The result is that Arabic contains “fossilized” forms of many Persian words. Other examples include the following:

  • Arab. ‹banafsaj› ‘violet’ versus Pers. ‹banafše›
  • ‹baydaq› ‘pawn (chess)’ versus ‹piyāde›
  • ‹dībāj› ‘silk brocade’ versus ‹dībā›
  • ‹namūḏaj› ‘example’ versus ‹namūne›
  • ‹ṭāzaj› ‘fresh’ versus ‹tāze›

Interestingly, in some spoken Arabic varieties, the word for ‘fresh’ is ‹ṭāza›, lacking any evidence of the Middle Persian final ‹-g›. Ostensibly, this would suggest that Persian loans in Arabic were indeed affected by the sound change. However, this is actually an instance in which Arabic re-borrowed the word – this second time however via Turkish, which had adopted it from Persian after the final ‹-g› had disappeared.

CHANGE OF ‹č› TO ‹ṣ› OR ‹s›

Persian has a ‹č› sound (as in ‘chair’) which standard Arabic and many colloquial varieties lack. Normally, in borrowings from other languages, this sound is usually changed to ‹š› (as in ‘share’), such as Arabic ‹šekk› from English ‘check.’ However, in words of Persian origin, ‹č› tends to correspond to Arabic ‹ṣ› or occasionally ‹s›.

At first glance, this is strange because ‹ṣ› is an emphatic sound, meaning it is pronounced by creating a constriction in the back of the mouth (anywhere from the velum to the pharynx) simultaneously as the sound is produced in the mouth. Compare the plain ‹s> of sūs› (‘licorice’) and its pharyngealized counterpart ‹ṣ in ṣūṣ› (‘chick’). The latter sounds very like a deep ‹s› but nothing at all like ‹č›. This is likely due to the tradition of Iranian languages, such as Sogdian and Pahlavi, to use the Aramaic letter representing ‹ṣ› for ‹č› (in turn due to the fact that ‹ṣ› itself was originally an affricate). Aramaic speakers would have likely read the letter as ‹ṣ›, which then was passed on to Arabic speakers. There are a number of loanwords that exhibit this change, including the following:

  • Arab. ‹jaṣṣ› ‘plaster, gypsum’ < Pers. ‹gač›
  • ‹raṣāṣ› ‘lead’ < Mid. Pers. ‹arčīč›
  • ‹ṣandal› ‘sandal, sandalwood’ < ‹čandal›
  • ‹ṣārūj› ‘mortar’ < Mid. Pers. ‹čārūg›
  • ‹ṣihrīj› ‘cistern’ < Mid. Pers. ‹čahrēg›
  • ‹ṣīn› ‘China’ < ‹čīn›
  • ‹sirāj› ‘lamp’ < ‹čerāġ›


Arabic, being a Semitic language, has a root-and-pattern system of morphology, meaning that roots composed of consonants are applied to pre-existing patterns to form words. An example of this is ‹kitāb› ‘book’ and ‹maktab› ‘desk,’ both from the root k-t-b ‘write.’ The plurals of many words are also determined by pre-existing patterns, so that ‘books’ is ‹kutub› and ‘desks’ is ‹makātib›. Notice that the consonants stay the same, but the vowels are altered to indicate pluralization.

Some Persian words that were borrowed into Arabic very strongly resembled certain plural patterns, and indeed Arabic speakers interpreted these words as plurals. But by reinterpreting originally singular words as plurals, speakers created a lexical ‘gap’ where a singular ought to have been. The solution for Arabic speakers was to back-form new singular forms from plurals based on the forms that exist for native Arabic words.

To illustrate, the Middle Persian word for ‘pawn’ (as in the chess piece) was ‹payādag›, which was borrowed as Arabic ‹bayādiq›. Comparing ‹bayādiq› to ‹makātib› ‘desks,’ one can immediately recognize that while the consonants differ, the vowels are identical. Thus Arabic speakers interpreted ‹bayādiq› not as ‘pawn’ but as ‘pawns.’ Based on this interpretation, if ‹bayādiq› is a plural whose form matches ‹makātib›, then it would follow that the singular form of the former would match that of the latter. Thus, ‹baydaq› ‘pawn’ — having the same vowels and structure as ‹maktab› ‘desk’ — was back-formed as the new singular form.

Some other words that exhibit this back-formation include:

  • ‹firdaws› ‘paradise’ from Old Iranian *‹paridaiza›, which was borrowed as ‹farādīs› ‘paradises.’
  • ‹jāmūs› ‘water buffalo’ from Middle Persian ‹gāwmeš›, which was borrowed as ‹jawāmīs› ‘water buffalos.’
  • ‹nibr› ‘warehouse’ from Middle Persian ‹anbār›, which was borrowed as ‹ˀanbār› ‘warehouses.’

Persian nasta‘līq calligraphy by Maqsud ibn Mahmud, 1708. Nasta‘līq is a calligraphic style invented by Persians for their adapted Arabic script in the 14th-15th century. Source.


Many if not most of the loanwords discussed above were clearly borrowed in pre-Islamic or early Islamic times. This makes sense, given that Arabic was not an established language of prestige in that era unlike Persian, which was the language of one of the two most powerful empires in the region at the time. As Arabic ascended in influence through association with Islam, Persian borrowings into Arabic decreased.

However, they did not stop, and many spoken Arabic varieties continued to borrow Persian words. Obviously, those varieties spoken near or in Iran, such as Iraqi Arabic, contain more Persian loanwords than those that are not. Two examples from my own dialect (urban Palestinian) are ‹bābūj› ‘slipper’ from ‹pāpūč› and ‹šākūš› ‘hammer’ from ‹čakoš›. Neither of these words exists in standard Arabic, but they are both widely used in many spoken varieties of Arabic. The existence of two words for ‘fresh’ borrowed from two different eras of Persian discussed above also demonstrates the continued influence of the language on Arabic.


Iranian Loanwords in Arabic.’ Encyclopædia Iranica.

Jeffrey, A. (1938). Foreign Vocabulary of the Qur’an.

Pharyngealization in Arabic.’ UCLA Phonetics Lab.

On the etymology of ‘orange’

The word for ‘orange’ in most European languages ultimately comes from the Sanskrit नारङ्ग ‹nāraṅga›. This was borrowed into Persian as نارنگ ‹nârang›, which in turn was entered into Arabic as نارنج ‹nāranj›. Spanish and Portuguese borrowed this word from Arabic with minimal modification as ‹naranja› and ‹laranja›, respectively. The English form “orange” comes from Old Provençal ‹auranja› via Old French ‹orenge›. The dropping of the initial n in these latter examples is most likely a result of rebracketing, in which a phrase like ‹une norenge› was interpreted by speakers as ‹une orenge›.

41ff786842e866f27eef96ed55fc9061Despite such historical influence, Arabic ‹nāranj› has become obsolete in the modern language, and the current word is برتقال ‹burtuqāl›, which is a medieval transliteration of “Portugal.” A number of other languages also derive their word for ‘orange’ from the name Portugal. Examples include Albanian ‹portokall›, Amharic ብርቱካን ‹bərtukan›, Georgian ფორთოხალი ‹p’ort’oxali›, Greek πορτοκάλι ‹portokáli›, Kurdish پرته‌قاڵ ‹pirteqal›, Neapolitan ‹purtuallo›, Persian پرتقال ‹porteqāl›, and Turkish ‹portakal›.

The reason for this intriguing quirk is that Portugal was once the largest exporter of oranges in the Mediterranean. The fruit became so associated with the country that it took on its name in virtually all the languages spoken in the wider region.

In a way, the spread of this name is a medieval-era meme. It demonstrates how ideas (for example, the association of orange and Portugal) migrate through language itself.