An Improved Semantic Query Expansion Approach Using Incremental User Tag Profile for Efficient Information Retrieval
DOI:
https://doi.org/10.21015/vtse.v10i3.1136Abstract
The World Wide Web (WWW) comprises a wide range of information, and it is mainly operated on the principles of keyword matching which often reduces accurate information retrieval. The Keyword matching mechanism faces word mismatch problems while retrieving relevant information. Furthermore, the inherent ambiguity of short keyword queries demands enhanced methods for Web retrieval. Ontological-based query expansion is one of the primary methods for Web retrieval, and it handles the vocabulary mismatch problem by computing semantics from the ontology knowledgebase. However, the retrieval of information relevant to user interests is a major challenge. In this paper, we seek to improve retrieval performance by leveraging user preferences and ontology semantics in the process of query expansion. The expansion words are added to the user query using WordNet lexicon and domain ontology. Additionally, the search intent of the user is also added as expansion words by exploiting a tag-based user profile. When it comes to obtaining relevant documents, the proposed framework outperforms the keyword-based approach by achieving a 76% F1-score. This noticeable improvement accurately reflects the importance of including user intents in the process of semantic query expansion.
References
S. Pani, B. Sahu, J. Mishra, S. N. Mohanty, and A. Panigrahi, "Pragmatic Analysis of Social Web Components on Semantic Web Mining," in Social Network Analysis, ed, 2022, pp. 83-108.
P. Khare, "User Profile Mining and Personalization of Web Services," International Journal of Computer Applications, vol. 105, pp. 12-15, 11/14 2014.
M. Raza, R. Mokhtar, N. Ahmad, M. Pasha, and U. Pasha, "A Taxonomy and Survey of Semantic Approaches for Query Expansion," IEEE Access, vol. PP, pp. 1-1, 01/24 2019.
D. K. Sharma, R. Pamula, and D. Chauhan, "Semantic approaches for query expansion," Evolutionary Intelligence, vol. 14, pp. 1101-1116, 2021.
D. Malhotra and O. P. Rishi, "A comprehensive review from hyperlink to intelligent technologies based personalized search systems," Journal of Management Analytics, vol. 6, pp. 365-389, 2019/10/02 2019.
R. Ojha and G. Deepak, "Metadata Driven Semantically Aware Medical Query Expansion," Cham, 2021, pp. 223-233.
S. Jain, K. R. Seeja, and R. Jindal, "A fuzzy ontology framework in information retrieval using semantic query expansion," International Journal of Information Management Data Insights, vol. 1, p. 100009, 2021/04/01/ 2021.
A. Allahim, A. Cherif, and A. Imine, "A Hybrid Approach for Optimizing Arabic Semantic Query Expansion," in 2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA), 2021, pp. 1-8.
M. Khedr, F. El-Licy, and A. Salah, "Ontology based Semantic Query Expansion for Searching Queries in Programming Domain," International Journal of Advanced Computer Science and Applications, vol. 12, 01/01 2021.
N. Wu and Y. Pan, "Semantic query expansion method based on pay-as-yougo fashion for graph model," Journal of Physics: Conference Series, vol. 1971, 2021.
H. K. Azad, A. Deepak, C. Chakraborty, and K. Abhishek, "Improving query expansion using pseudo-relevant web knowledge for information retrieval," Pattern Recognition Letters, vol. 158, pp. 148-156, 2022/06/01/ 2022.
Z. Sun, S. Lu, C. Ma, X. Liu, and C. Guo, Query Expansion and Entity Weighting for Query Reformulation Retrieval in Voice Assistant Systems, 2022.
M. R. Bouadjenek, H. Hacid, and M. Bouzeghoub, "Personalized Social Query Expansion Using Social Annotations," in Transactions on Large-Scale Data- and Knowledge-Centered Systems XL, A. Hameurlain, R. Wagner, F. Morvan, and L. Tamine, Eds., ed Berlin, Heidelberg: Springer Berlin Heidelberg, 2019, pp. 1-25.
I. H. Witten, A. Moffat, and T. C. Bell, "Managing Gigabytes: Compressing and Indexing Documents and Images," IEEE Transactions on Information Theory, vol. 41, p. 2101, 1995. DOI: https://doi.org/10.1109/TIT.1995.476344
M. F. Porter, "An algorithm for suffix stripping," Program, vol. 14, pp. 130-137, 1980. DOI: https://doi.org/10.1108/eb046814
M. Raza, R. Mokhtar, N. Ahmad, and M. Ashraf, "Sensual Semantic Analysis for Effective Query Expansion," International Journal of Advanced Computer Science and Applications, vol. 9, 01/01 2018.
S. Jupp et al., "A new Ontology Lookup Service at EMBL-EBI," In: Malone, J. et al. (eds.) Proceedings of SWAT4LS International Conference 2015, 2015.
D. Guessoum, M. Miraoui, and C. Tadj, "A modification of wu and palmer semantic similarity measure," in The Tenth International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies, 2016, pp. 42-46.
M. W. Berry, Z. Drmac, and E. R. Jessup, "Matrices, Vector Spaces, and Information Retrieval," SIAM Rev., vol. 41, pp. 335–362, 1999. DOI: https://doi.org/10.1137/S0036144598347035
A. I. F. AI, A. Goldbloom, P. Lin, P. Mooney, C. Schoenick, S. Kohlmeier, et al. (30-May-2022). COVID-19 Open Research Dataset Challenge (CORD-19). Available: https://www.kaggle.com/datasets/allen-institute-for-ai/CORD-19-research-challenge
Downloads
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC-By) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
This work is licensed under a Creative Commons Attribution License CC BY