A quantum-like semantic model for text retrieval in Arabic
Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol.21, No. 1)Publication Date: 2021-25-02
Authors : Shaker A. Bessmertny I.A. Miroslavskaya L.A. Koroleva Ju.A.;
Page : 102-108
Keywords : Bell inequality; quantum entanglement; information retrieval; HAL; IR algorithms; quantum theory; Arabic language; natural language processing;
Abstract
The subject of study. The paper focuses on the extraction of semantics from texts in Arabic. In particular, the applicability of the Bell test to word pairs is investigated as a measure of the semantic words relatedness in a context. The study applies the quantum formalism to the task of information retrieval in Arabic texts and presents the results of this work. The authors also examine the influence of the context width on the effectiveness of information retrieval. Method. The research is based on the vector representation of the context. It uses the well-known approach based on the HAL (Hyperspace Analogue to Language) matrix and Bell test. The HAL matrix allows taking into account both the frequency of the words occurrence in the context and the distance to the target word. Quantum theory operates with probability density matrices. Quantum theory allows describing probabilities in the vector space in a more natural way, i.e., words can be represented as vectors. Main results. The results demonstrate that using the Bell's test for texts in Arabic provides a better ranking of search results compared to the results of search services. Practical significance. The research results can be used in the development of the information retrieval systems, as well as for the further development of methods based on the distributive hypothesis.
Other Latest Articles
- Goodpoint: unsupervised learning of key point detection and description
- Human psyche creation by application of natural language processing technologies
- Application prospects for unmanned transport ships in the seas of the Russian Arctic Basin
- Defocus impact analysis on telescope wavefront reconstruction by scattering spot with parametric optimization technique
- Fourier spectroscopy in blood plasma study with type two diabetes
Last modified: 2021-03-05 01:22:59