Review Comment:
This paper presents the construction of an ontology and a knowledge graph representing Tafsir al-Tabari. A Tafsir is a body of commentary and explication, aimed at explaining the meanings of the Qur'an. Muhammad ibn Jarir al-Tabari is a Persian scholar.
This work is the continuation of a work published in the Semantic Web journal on the construction of a knowledge graph representing the hadith corpus. A hadith is an action of the prophet Mohammed, reported by a chain of narrators.
In general, the paper is well written and pedagogically presented and its content fits in well with the topics of the journal and the produced KG is interesting for cultural heritage.
However the paper presents several major shortcomings:
First, there is a lot of repetition and generalities in the article. The content could be presented in a much shorter paper.
Second, there is an overlap with the paper previously published by the authors in the SWJ on the SemanticHadith KG. There is for sure a delta but it is unclear, e.g. the new ontology comprises terms from the previous one, some competency questions are the same, the general structure of the papers and positioning are very similar.
Third, I could not access the SPARQL endpoint (nor that of the SemanticHadith). And I do not find the KG on GitHub.
Fourth, the SPARQL queries implementing the competency questions are not discussed in the paper (nor in the previous paper). There is no evaluation of the produced KG (nor of the SemanticHadith KG, which is left for future work in the published paper).
Detailed comments:
Introduction
P2 line 6-7, there shoud not be a distinction here between Linked data application and KG application, medecine should go in the list of domains line 4.
Background
This section presents some redundancies with the introduction, in particular sub-section 2.4. Merging both sections would avoid it.
SemanticTafsir Ontology
The positioning wrt the Semantic Hadith ontology is unclear.
P5 line 48 a namespace should be chosen (whatever the prefix), which is not given in the paper. The ontology should be published, with dereferenceable URIs.
P6 The overlap between competency questions that can be answered on the SemanticHadith and those specific to the Semantic Tafsir should be clearly stated.
P8 “concept classes” is not correct, nor “object type properties”, and a class does not comprise properties.
P9 Modelling decisions: similar to that of the previous paper.
access to ref 62 and 63 dates back to 2022. The references are not complete and the date is not correct.
The modelling choice is very debatable, in my view there should be 3 subclasses of Narrator, and narrator individuals should be declared as instances of them or of the superclass Narrator.
P10 Figure 3 does not reflect the n-ary patterns: there should be a refersTo relation between a ThematicVerseFragmentReference and a VerseFragment, and property hasSubTheme should be hasTheme.
Also in my view the model lacks a thesaurus for special individuals: hadith, narrators, etc.
Methodology
The text lines 27-44 is far too long, describing obvious notions for SWJ readers.
Subsection 4.1 should go in section 3
P11 lines 23-26 the text is redundant with what is already detailed in sections 1 and 2.
Section 4.2 is redundant with what is already descrubed in section 3.
P12-13 data class, concept class, object class do not make sense, nor a class equipped with properties.
Section 4.3 should be shortened. Technical staff should not be described. Obvious things should just be deleted, e.g. subsection 4.3.1 and most of 4.3.2. Subsection is not precise enough, which tools have been used? Who are the experts? 4.4 are generalities on GitHub that can be avoided.
Results and Discussion
Subsection 5.1 are technical details that are not challenges to be discussed
What is described in subsection 5.1 is not an evaluation, it is usual staff in ontology engineering that should not be described in a SWJ paper.
The statistics on the KG in Table 2 should go in Section 4. In this table, 3 millions of axioms must be a typo. The table should report the numbers of links between classes and between individuals (with a clear distinction).
A presentation of the SPARQL queries implementing the competency questions and a discussion on their results are missing.
The paper would benefit from a presentation of interesting outputs in the field of Islamic studies produced by querying the KG.
There are many assertions which are not precise enough, e.g. “significant advancement”: what are the KPI which enable to measure it? “revealing underlying patterns in the Tafsir’s interpretative framework”: which are they?
How the visualisations were created from the KG is not discussed.
The text in subsection 5.4 before and at the beginning of subsection 5.4.1 is far too general, redundant with the first sections.
Conclusion
I disagree with the assertion that a “rigorous evaluation” was presented.
In my view, the paper is better suited to a publication in a journal in Digital Humanities.
|