Centre for Digital Humanities

Datahub SSH

Arabic corpora

In this project, two large Arabic corpora will be prepared for analysis:

  1. The massive Shi’i encyclopedia of legal texts, Oceans of Light (written around 1700, in 110 volumes).
  2. A collection of fourteen encyclopedic anthologies of poetry and belles-lettres, all written in the Sunni world between the 9th and 18th centuries.

The legal texts will primarily be analyzed for references to the Qur’an. For this purpose a tool, Footprinter, is being developed. In time, the tool will be hosted by the Digital Humanities Lab.

The anthologies will be subjected to a sentiment analysis, specifically targeting the diachronic appreciation of the five bodily senses.