Developing a computer system for student learning based on vision-language models
Journal: Discrete and Continuous Models and Applied Computational Science (Vol.32, No. 2)Publication Date: 2024-11-02
Authors : Eugeny Shchetinin; Anastasia Glushkova; Anastasia Demidova;
Page : 234-241
Keywords : deep learning; vision-language learning model; neural networks-transformers; throughchannel attention module;
Abstract
In recent years, artificial intelligence methods have been developed in various fields, particularly in education. The development of computer systems for student learning is an important task and can significantly improve student learning. The development and implementation of deep learning methods in the educational process has gained immense popularity. The most successful among them are models that consider the multimodal nature of information, in particular the combination of text, sound, images, and video. The difficulty in processing such data is that combining multimodal input data by different channel concatenation methods that ignore the heterogeneity of different modalities is an inefficient approach. To solve this problem, an inter-channel attention module is proposed in this paper. The paper presents a computer vision-linguistic system of student learning process based on the concatenation of multimodal input data using the inter-channel attention module. It is shown that the creation of effective and flexible learning systems and technologies based on such models allows to adapt the educational process to the individual needs of students and increase its efficiency.
Other Latest Articles
- Well-posedness of the microwave heating problem
- Statistical causality analysis
- Clenshaw algorithm in the interpolation problem by the Chebyshev collocation method
- The recent progress in terahertz channel characterization and system design
- Solution of a two-dimensional time-dependent Schrödinger equation describing two interacting atoms in an optical trap
Last modified: 2024-11-02 04:40:31