Review Comment:
This paper presents a study using mixed methods for investigating the topics and trends in Semantic Web research. The authors identified top-down research topics from four seminal papers and used three data analysis tools to extract bottom-up research topics from Semantic Web research papers. The top-down and bottom-up topics are then compared and the results are discussed.
Pros:
This paper presents an original and interesting study on the main topics in the field of Semantic Web using mixed methods. The insights summarized and discovered from the seminal papers and data-driven analysis can provide useful references for scholars in this field. This paper is also well-written.
Cons:
There are two major issues with this paper. First, the authors didn't discuss the impact of parameter selection on the data-driven analysis results. Do the three tools require some input parameters, such as the number of topics? What are the parameters selected for your experiments and why do you select these values? It is also possible that the tools may not require any parameters from the users. In either case, the authors may need to provide some explanations to enhance the reproducibility of this work. Second, this paper overall is presented in a qualitative manner although it took a mixed methods approach. Is there anyway that the authors can, e.g., quantitatively compare the rankings of the topics output by the three tools (e.g., using Spearman's correlation coefficient), and explain why the ranking is different?
Some more detailed comments are listed as below:
- Abstract: "Overall, we provide a reflectional study on the past decade of Semantic Web research, however the findings and the..." should be put into two sentences: "...past decade of Semantic Web research. However the findings and the..."
- Page 3: "Another common solution is the adoption of probabilistic topic models, such
as Latent Dirichlet Analysis (LDA)" LDA refers to "Latent Dirichlet Allocation" not "Latent Dirichlet Analysis".
- PoolParty seems to be a commercial service. Did you purchase their service for this research or they have a free academic version? Some explanation is necessary here.
- Figure 10: what do the different colors of the nodes represent? If you use a metric (e.g., degree of centrality) to define the colors, you will need to add a legend here.
- Page 19: "Among the topics experiencing strong variations through time, Web Service is a declining one. After experiencing a peak of use in 2008 with a 40% distribution in the documents, it then dropped to less than 20% in 2015." It is important to note that fewer explicit mentions do not necessarily mean less popular. At the beginning of Web, people often say "World Wide Web" but nowadays people just say "Web". One more sentence explanation should be added here.
- Paper length: overall, this paper is too long. The authors may consider shortening some discussions.
|