ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Using Stochastic Automaton for Data Consolidation

Journal: Naukovi Visti NTUU KPI (Vol.21, No. 2)

Publication Date:

Authors : ; ; ;

Page : 29-36

Keywords : Open data sources; Data consolidation; Information-analytical systems; Information retrieval systems; Probabilistic models; Relevance; Big data tasks;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Background. Development of methods and algorithms for efficient search of relevant information on demand. The article deals with the consolidation of data for subsequent use in the information and analytical systems. Objective. The aim of the paper is to identify capabilities and build relevant information search algorithms from disparate sources by analyzing the probability information identifying the possible presence of relevant documents in these sources. Methods. To find the relevant information for search queries the approach based on the use of probability estimates of relevant documents available in the sources of further increasing the number of selected documents from these sources to analyze their relevance to the query is used. Results. A stochastic programmable automaton structure to ensure selection of the most possible information sources by relevance parameters and information retrieval algorithm based on the use of stochastic automaton were developed. Conclusions. The described algorithm using stochastic automaton for data consolidation allows developing a set of software tools, provides plenty full and holistic data consolidation problem-solving for diverse systems which search for information from information sources different in composition and presentation type.

Last modified: 2017-05-15 21:26:00