The World Federation · Position Paper

Preserving the Shia Textual Heritage in the Digital Age

al-Makhzan (المخزن) — a unified, multilingual repository for Islamic scholarship

The condition of the corpus

The Shia scholarly tradition stretches across more than a thousand years and produces texts in at least six living languages — Arabic, Persian, English, Urdu, Gujarati, and others. Despite the scale of this corpus, there is today no platform on which a scholar can search across it as a whole. The texts that constitute the tradition are present in digital form, but they are dispersed: lib.eshia.ir and the Ahlulbayt Library each hold significant Arabic and Persian holdings; Noorlib hosts a comprehensive Iranian platform whose long-term jurisdictional and technical durability is not assured; English Shia material is scattered across community sites with no shared preservation guarantee. A scholar pursuing a question in fiqh, kalam, or rijāl must navigate four or five platforms in three languages and cross-reference results by hand.

The asymmetry with Sunni scholarship is telling. Through projects such as al-Maktaba al-Shamila, the Sunni textual corpus has been digitised, indexed, and made searchable on durable, community-maintained infrastructure used daily by tens of thousands of scholars. The Shia community has produced individual digital libraries of comparable ambition, but none of equivalent reach or institutional permanence. The everyday research conditions of Shia scholarship — at the Hawza in Najaf, Qom, and London, in university Islamic-studies departments, and among independent researchers — quietly lag those of neighbouring traditions, not for want of texts, but for want of unified access to them.

A historic precedent

Every major Shia compilation project of the last four centuries was a response to dispersion. Allama al-Majlisi's Bihar al-Anwar gathered hadith from scattered manuscripts at risk of loss. Shaykh al-Hurr al-Amili's Wasa'il al-Shia organised legal traditions into a single workable corpus, transforming the practice of istinbāṭ. Aqa Buzurg al-Tihrani's al-Dhari'a indexed an entire bibliographic tradition that might otherwise have fallen into obscurity. Each was the act of a single scholar or small circle responding to a fragmentation that threatened the coherence of inherited knowledge.

The digital age demands that act anew. The dispersion is no longer one of manuscripts in scattered libraries but of digital archives in scattered jurisdictions and platforms — equally fragile, often more so. al-Makhzan is conceived in that lineage: a unifying repository, in the form the present age requires, undertaken not as innovation but as the continuation of a recognisable scholarly responsibility.

The act of compilation has, throughout Shia history, been the act by which a tradition refuses dispersion. al-Makhzan is that act in the medium of our time.

What al-Makhzan is

al-Makhzan (المخزن — "the repository") is a unified, multilingual digital corpus of Shia and Islamic texts, with a research interface designed for scholars and the Hawza. A working platform is live at al-makhzan.org, containing 14,075 books and 11.35 million pages of full searchable text drawn from the existing Shia digital libraries. Work is underway to expand the corpus through systematic ingestion of further Arabic, Persian, English, and Urdu sources — including the World Federation's own ~700 publications, the al-Islam.org library, and the major Sunni textual collections — toward an estimated 35,000–40,000 unique works after deduplication. The platform supports full-text search across the whole corpus, source-preserving citations, page-level reading with annotation, and a research workspace in which a scholar's notes and bookmarks accumulate across the entire tradition. Cross-language and conceptual retrieval, supported by carefully scoped AI capabilities, are being added in a manner that preserves rather than abstracts the integrity of the source texts.

What this enables

A unified, searchable, multilingual corpus changes what is practically possible. Cross-tradition inquiry — fiqh comparisons, hadith parallels, terminological histories — becomes a query rather than a year of footwork. The transmission of a single concept like wilāya can be traced through Arabic hadith, Persian commentary, and English academic writing in a single view. The geographic barrier collapses: an emerging scholar in Lagos, Karachi, or Sydney has the same access to the corpus as a researcher in Qom or Najaf. Methodologically, the tradition's own commitments — to isnād, rijāl, and traceable attribution — are well served by a system that preserves source and provenance per page rather than dissolving everything into anonymous snippets. What the great compilation projects achieved for the manuscript age, al-Makhzan undertakes for the digital one.

Approach and principles

Source preservation
Every page carries its provenance; no text is divorced from the library and edition it came from.
Multilingual at parity
Arabic and Persian as primary, with English, Urdu, and Gujarati supported as full citizens.
Cross-tradition reach
Shia sources alongside Sunni — because serious Shia inquiry never proceeds in isolation from the wider Islamic intellectual tradition.
Editorial curation
A built-in review system through which trusted scholars verify text quality, deduplicate editions, and flag errors.
Institutional durability
Hosted, maintained, and stewarded under the World Federation, not dependent on any individual maintainer.
AI in service of the text
AI capabilities applied to support scholarship — cross-language search, conceptual retrieval — never as a substitute for the source.

An invitation

The World Federation invites Hawza institutions, university Islamic-studies departments, foundations, marāji'-affiliated bodies, and senior scholars to engage with this work — as readers, collaborators, contributors, and endorsers. The act of compilation has, in every age of this tradition, been a communal one; al-Makhzan is no exception. Correspondence on use, integration of additional sources, or institutional partnership is welcomed at the address below.