Review Comment:
This paper describes (1) Japnization and RDFization of two representative resources: WordNet and DBpedia, and (2) their linking as LOD. The resulting combined resource can definitely be an important and useful one for intelligent computational processes such as multilingual natural language processing or semantic search. The current description of the resource, as described in the draft, could be further improved by considering the following points.
(1) Detailed information about the Japanese WordNet and DBpedia Japan (as specified in the "Submission Types" section of http://www.semantic-web-journal.net/authors) have to be presented in an organized way. This can improve the completeness of the descriptions.
(2) There are almost no self-evaluations about the results summarized in Table 1 and 2. This could lead a situation where the potential readers cannot judge the usefulness of the resulting resource. The authors should state whether these results are satisfying/promising or not. Besides, the paper should argue possible methodologies to improve the results, if the current figures do not achieve desired levels of coverage and accuracy.
(3) As far as the reviewer sees, one of the technically interesting decisions is the use of skos:closeMatch instead of owl:sameAs. Although the footnote#5 discusses this choice in one way, the discussion could be further detailed probably by referencing related work, such as (Garcia et al., 2012).
- Ref: J. Gracia, et al. "Challenges for the multilingual web of data.", Journal of Web Semantics, 11:63-71, 2012.
The following are less important and/or detailed issues.
[p.1, left]
- The intention and meaning of the last two sentences in the first paragraph are unclear. What does it mean by "incoherent to development in other languages"? Besides, the intention behind the introduction of EDR here is far from the reviewer's understanding.
- It should be reminded: the originators of WordNet say (Miller and Fellbaum, 07) that Princeton WordNet is not an ontology.
- Miller, G., and Fallbaum, C. (2007). WordNet than and now. LR&E, 41:209-124.
[p.1, right]
- The reviewer is not very sure that the use of the term "cross-media data" is appropriate for the context here.
- NICT Japanese WordNet: "NICT" should be mentioned in the place where the Japanese WordNet is first introduced, p.1, right-top.
[p.2, left ~ right]
- The content of section 2.2 can be included in section 2.3. It was a bit odd for the reviewer to see an independent section was given to this topic.
[p.2, right]
- The issue described by the last sentence of section 2.3 could an issue of WN-ja, not an issue of conversion to RDF.
- In section 3, the notion of "ontology building in DBpedia Japan" is quite hard to understand, making the potential readers to interpret the results in Table 1 difficult.
- The reason why the mapping rates for Japanese were around 50% of that of English should be discussed.
[p.3, left]
- The logic behind the last sentence of the first paragraph is hard to follow. What are the connections between "to link literally" and the "basic dataset"?
- The reviewer supposes that "to link literally" can achieve high precision (accuracy) at the cost of recall (coverage). The results shown in Table 1 may reveal the fact of low recall. Nevertheless, perfect accuracy would not be attained even with the "to link literally" strategy, due to potential semantic ambiguities. Therefore some evaluation for the accuracy, even it is partial, would be necessary to acclaim the usefulness of the resulting resource.
- The meaning of the last sentence of section 4.1 is unclear: maybe "over" should be deleted? Even so, the phrase, "the viewpoint of OWL, syntactically and semantically", should be properly explained.
[p.3, right]
- The text structure of section 4.3 may be flawed, especially the first two paragraphs. The materials there should be reorganized to improve the readability.
- In the first paragraph of section 4.3, the wording "needless ambiguity" or "unnecessary ambiguity" could introduce "unnecessary misunderstanding." It might be rather far clear and fair to simply say, for example, "simply selected only noun concepts for interlinking."
[p.4, left]
- "CC-BY" and "CC-BY-SA" should be properly described.
|