Dialogue system based on spoken conversations with access to an unstructured knowledge base
Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol.23, No. 1)Publication Date: 2023-02-16
Authors : Masliukhin S.M.;
Page : 88-95
Keywords : dialogue systems; conversational agents; information retrieval; text augmentation; retrieval augmented generation;
Abstract
This paper describes an approach for constructing a task-oriented dialog system (a conversational agent) with an unstructured knowledge access based on spoken conversations including: written speech augmentation that simulates the speech recognition results; combination of classifiers; retrieval augmented text generation. The proposed approach provides the training data augmentation in two ways: by converting the original texts into sound waves by a text-tospeech model and then transforming back into texts by an automated speech recognition model; injecting artificially generated errors based on phonetic similarity. A dialogue system with access to the unstructured knowledge base solves the task of detecting a turn, which requires searching for additional information in an unstructured knowledge base. For this purpose, the Support Vector Machine, Convolutional Neural Network, Bidirectional Encoder Representations from Transformers, and Generative Pre-trained Transformer 2 models were trained. The best of the presented models are used in the weighted combination. Next, a suitable text fragment is selected from the knowledge base and a reasonable answer is generated. The tasks are solved by adapting the retrieval augmented text generation model Retrieval Augmented Generation. The proposed method tested on the data from the 10th Dialogue System Technology Challenge. In all metrics, except Precision, the new approach significantly outperformed the results of the basic models proposed by the organizers of the competition. The results of the work can be used to create chat-bot systems that provide automatic processing of user requests in natural language based on an unstructured knowledge access, such as a database of answers to frequently asked questions.
Other Latest Articles
- Computational methods to increase the speed of FPGA-based discrete wavelet transforms
- Investigation on impact and wear behavior of Al6061 (SiC + Al2O3) and Al7075 (SiC + Al2O3) hybrid composites
- Improvement of the automatic temperature stabilisation process in the cryovacuum unit
- Method for identification of sinusoidal signal parameters with variable unknown amplitude
- A method of optimizing the structure of hierarchical distributed control systems
Last modified: 2023-02-16 17:08:14