ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

A quantum-like semantic model for text retrieval in Arabic

Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol.21, No. 1)

Publication Date:

Authors : ;

Page : 102-108

Keywords : Bell inequality; quantum entanglement; information retrieval; HAL; IR algorithms; quantum theory; Arabic language; natural language processing;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

The subject of study. The paper focuses on the extraction of semantics from texts in Arabic. In particular, the applicability of the Bell test to word pairs is investigated as a measure of the semantic words relatedness in a context. The study applies the quantum formalism to the task of information retrieval in Arabic texts and presents the results of this work. The authors also examine the influence of the context width on the effectiveness of information retrieval. Method. The research is based on the vector representation of the context. It uses the well-known approach based on the HAL (Hyperspace Analogue to Language) matrix and Bell test. The HAL matrix allows taking into account both the frequency of the words occurrence in the context and the distance to the target word. Quantum theory operates with probability density matrices. Quantum theory allows describing probabilities in the vector space in a more natural way, i.e., words can be represented as vectors. Main results. The results demonstrate that using the Bell's test for texts in Arabic provides a better ranking of search results compared to the results of search services. Practical significance. The research results can be used in the development of the information retrieval systems, as well as for the further development of methods based on the distributive hypothesis.

Last modified: 2021-03-05 01:22:59