Multi-agent adaptive routing by multi-head-attention-based twin agents using reinforcement learning
Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol. 22, No. 6)
Publication Date: 2022-12-22
Authors: Gribanov T.A., Filchenkov A.A., Azarov A.A., Shalyto A.A.
Page: 1178-1186
Keywords: routing; multi-agent learning; reinforcement learning; adaptive routing
Abstract
Graph variability is a common condition in packet routing, cargo transportation, and flow control. Reinforcement-learning-based adaptive routing algorithms are designed to solve the routing problem under this condition. However, when the graph changes significantly, existing routing algorithms require complete retraining. To address this challenge, we propose a novel method based on multi-agent modeling with twin agents, for which we introduce a new neural network architecture with multi-head self-attention, pre-trained within the multi-view learning paradigm. In this paradigm, an agent takes a vertex as input; twins of the main agent are placed at the vertices of the graph, and each selects the neighbor to which the object should be transferred. We carried out a comparative analysis against the existing DQN-LE-routing multi-agent routing algorithm in two stages: pre-training and simulation. In both stages, we considered runs in which the topology changed during testing or simulation. The experiments show that the proposed adaptability enhancement method provides global adaptability: delivery time increases by only 14.5% after global changes occur. The proposed method can be used to solve routing problems with complex path evaluation functions and dynamically changing graph topologies, for example in transport logistics and in conveyor belt control in production.
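The idea of twin agents placed at vertices, each choosing a neighbor for the next hop, can be illustrated with a minimal sketch. This is not the authors' architecture: the class `TwinAgent`, the random untrained weights, the vertex embeddings, and the choice to average attention weights across heads are all assumptions made for illustration. The sketch only shows how one shared set of parameters could score the neighbors of any vertex with a simplified multi-head, scaled dot-product attention.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def matvec(matrix, vec):
    return [dot(row, vec) for row in matrix]


def softmax(scores):
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class TwinAgent:
    """Hypothetical shared policy; every vertex holds a twin with the same weights."""

    def __init__(self, dim, heads, seed=0):
        rng = random.Random(seed)
        self.dim = dim
        self.heads = heads

        def rand_matrix():
            return [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(dim)]

        # Per-head query and key projections (random here, i.e. untrained).
        self.w_query = [rand_matrix() for _ in range(heads)]
        self.w_key = [rand_matrix() for _ in range(heads)]

    def neighbor_probs(self, dest_emb, neighbor_embs):
        """Attention of the destination (query) over neighbors (keys), averaged over heads."""
        per_head = []
        for wq, wk in zip(self.w_query, self.w_key):
            query = matvec(wq, dest_emb)
            scores = [
                dot(query, matvec(wk, emb)) / math.sqrt(self.dim)
                for emb in neighbor_embs
            ]
            per_head.append(softmax(scores))
        count = len(neighbor_embs)
        return [sum(head[i] for head in per_head) / self.heads for i in range(count)]

    def choose(self, dest, neighbors, embs):
        """Pick the neighbor with the highest averaged attention weight."""
        probs = self.neighbor_probs(embs[dest], [embs[v] for v in neighbors])
        best = max(range(len(neighbors)), key=lambda i: probs[i])
        return neighbors[best]


# Toy graph and assumed random vertex embeddings (illustrative only).
graph = {"a": ["b", "c"], "b": ["a", "d"], "c": ["a", "d"], "d": ["b", "c"]}
emb_rng = random.Random(0)
embs = {v: [emb_rng.gauss(0, 1) for _ in range(4)] for v in graph}

agent = TwinAgent(dim=4, heads=2, seed=1)
probs = agent.neighbor_probs(embs["d"], [embs[v] for v in graph["a"]])
next_hop = agent.choose("d", graph["a"], embs)
```

Because the weights are shared and the agent only sees the embeddings of the current neighbors, the same policy can be queried after the neighbor list of a vertex changes, which is the property the abstract refers to as adaptability. A trained model would replace the random projections; the routing quality of this untrained sketch is meaningless.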