An Improved Semantic Query Expansion Approach Using Incremental User Tag Profile for Efficient Information Retrieval

Authors

  • Muhammad Ahsan Raza Department of Information Technology, Bahauddin Zakariya University, Multan 60000, Pakistan
  • Muhammad Ali Department of Information Technology, Bahauddin Zakariya University, Multan 60000, Pakistan
  • Maruf Pasha Department of Information Technology, Bahauddin Zakariya University, Multan 60000, Pakistan
  • Mubashir Ali Department of Software Engineering, Lahore Garrison University, Lahore 54000, Pakistan.

DOI:

https://doi.org/10.21015/vtse.v10i3.1136

Abstract

The World Wide Web (WWW) comprises a wide range of information, and it is mainly operated on the principles of keyword matching which often reduces accurate information retrieval. The Keyword matching mechanism faces word mismatch problems while retrieving relevant information. Furthermore, the inherent ambiguity of short keyword queries demands enhanced methods for Web retrieval. Ontological-based query expansion is one of the primary methods for Web retrieval, and it handles the vocabulary mismatch problem by computing semantics from the ontology knowledgebase. However, the retrieval of information relevant to user interests is a major challenge. In this paper, we seek to improve retrieval performance by leveraging user preferences and ontology semantics in the process of query expansion. The expansion words are added to the user query using WordNet lexicon and domain ontology. Additionally, the search intent of the user is also added as expansion words by exploiting a tag-based user profile. When it comes to obtaining relevant documents, the proposed framework outperforms the keyword-based approach by achieving a 76% F1-score. This noticeable improvement accurately reflects the importance of including user intents in the process of semantic query expansion.

References

S. Pani, B. Sahu, J. Mishra, S. N. Mohanty, and A. Panigrahi, "Pragmatic Analysis of Social Web Components on Semantic Web Mining," in Social Network Analysis, ed, 2022, pp. 83-108.

P. Khare, "User Profile Mining and Personalization of Web Services," International Journal of Computer Applications, vol. 105, pp. 12-15, 11/14 2014.

M. Raza, R. Mokhtar, N. Ahmad, M. Pasha, and U. Pasha, "A Taxonomy and Survey of Semantic Approaches for Query Expansion," IEEE Access, vol. PP, pp. 1-1, 01/24 2019.

D. K. Sharma, R. Pamula, and D. Chauhan, "Semantic approaches for query expansion," Evolutionary Intelligence, vol. 14, pp. 1101-1116, 2021.

D. Malhotra and O. P. Rishi, "A comprehensive review from hyperlink to intelligent technologies based personalized search systems," Journal of Management Analytics, vol. 6, pp. 365-389, 2019/10/02 2019.

R. Ojha and G. Deepak, "Metadata Driven Semantically Aware Medical Query Expansion," Cham, 2021, pp. 223-233.

S. Jain, K. R. Seeja, and R. Jindal, "A fuzzy ontology framework in information retrieval using semantic query expansion," International Journal of Information Management Data Insights, vol. 1, p. 100009, 2021/04/01/ 2021.

A. Allahim, A. Cherif, and A. Imine, "A Hybrid Approach for Optimizing Arabic Semantic Query Expansion," in 2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA), 2021, pp. 1-8.

M. Khedr, F. El-Licy, and A. Salah, "Ontology based Semantic Query Expansion for Searching Queries in Programming Domain," International Journal of Advanced Computer Science and Applications, vol. 12, 01/01 2021.

N. Wu and Y. Pan, "Semantic query expansion method based on pay-as-yougo fashion for graph model," Journal of Physics: Conference Series, vol. 1971, 2021.

H. K. Azad, A. Deepak, C. Chakraborty, and K. Abhishek, "Improving query expansion using pseudo-relevant web knowledge for information retrieval," Pattern Recognition Letters, vol. 158, pp. 148-156, 2022/06/01/ 2022.

Z. Sun, S. Lu, C. Ma, X. Liu, and C. Guo, Query Expansion and Entity Weighting for Query Reformulation Retrieval in Voice Assistant Systems, 2022.

M. R. Bouadjenek, H. Hacid, and M. Bouzeghoub, "Personalized Social Query Expansion Using Social Annotations," in Transactions on Large-Scale Data- and Knowledge-Centered Systems XL, A. Hameurlain, R. Wagner, F. Morvan, and L. Tamine, Eds., ed Berlin, Heidelberg: Springer Berlin Heidelberg, 2019, pp. 1-25.

I. H. Witten, A. Moffat, and T. C. Bell, "Managing Gigabytes: Compressing and Indexing Documents and Images," IEEE Transactions on Information Theory, vol. 41, p. 2101, 1995. DOI: https://doi.org/10.1109/TIT.1995.476344

M. F. Porter, "An algorithm for suffix stripping," Program, vol. 14, pp. 130-137, 1980. DOI: https://doi.org/10.1108/eb046814

M. Raza, R. Mokhtar, N. Ahmad, and M. Ashraf, "Sensual Semantic Analysis for Effective Query Expansion," International Journal of Advanced Computer Science and Applications, vol. 9, 01/01 2018.

S. Jupp et al., "A new Ontology Lookup Service at EMBL-EBI," In: Malone, J. et al. (eds.) Proceedings of SWAT4LS International Conference 2015, 2015.

D. Guessoum, M. Miraoui, and C. Tadj, "A modification of wu and palmer semantic similarity measure," in The Tenth International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies, 2016, pp. 42-46.

M. W. Berry, Z. Drmac, and E. R. Jessup, "Matrices, Vector Spaces, and Information Retrieval," SIAM Rev., vol. 41, pp. 335–362, 1999. DOI: https://doi.org/10.1137/S0036144598347035

A. I. F. AI, A. Goldbloom, P. Lin, P. Mooney, C. Schoenick, S. Kohlmeier, et al. (30-May-2022). COVID-19 Open Research Dataset Challenge (CORD-19). Available: https://www.kaggle.com/datasets/allen-institute-for-ai/CORD-19-research-challenge

Downloads

Published

2022-09-21

How to Cite

Raza, M. A., Ali, M., Pasha, M., & Ali, M. (2022). An Improved Semantic Query Expansion Approach Using Incremental User Tag Profile for Efficient Information Retrieval. VFAST Transactions on Software Engineering, 10(3), 1–9. https://doi.org/10.21015/vtse.v10i3.1136