Segmentation of word gestures in sign language video
Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol.23, No. 5)Publication Date: 2023-10-23
Authors : Dang Khanh Bessmertny I.A.;
Page : 980-988
Keywords : sign language; word gesture segmentation; MediaPipe; LSTM; thresholding method; sign language recognition;
Abstract
Despite the widespread use of automatic speech recognition and video subtitles, sign language is still a significant communication channel for people with hearing impairments. An important task in the process of automatic recognition of sign language is the segmentation of video into fragments corresponding to individual words. In contrast to the known methods of segmentation of sign language words, the paper proposes an approach that does not require the use of sensors (accelerometers). To segment the video into words in this study, an assessment of the dynamics of the image is used, and the boundary between words is determined using a threshold value. Since in addition to the speaker, there may be other moving objects in the frame that create noise, the dynamics in the work is estimated by the average change from frame to frame of the Euclidean distance between the coordinate characteristics of the hand, forearm, eyes and mouth. The calculation of the coordinate characteristics of the hands and head is carried out using the MediaPipe library. The developed algorithm was tested for the Vietnamese sign language on an open set of 4364 videos collected at the Vietnamese Sign Language Training Center, and demonstrated accuracy comparable to manual segmentation of video by an operator and low resource consumption, which will allow using the algorithm for automatic gesture recognition in real time. The experiments have shown that the task of segmentation of sign language, unlike the known methods, can be effectively solved without the use of sensors. Like other methods of gesture segmentation, the proposed algorithm does not work satisfactorily at a high speed of sign language when words overlap each other. This problem is the subject of further research.
Other Latest Articles
- Clustering in big data analytics: a systematic review and comparative analysis (review article)
- A new efficient adaptive rood pattern search motion estimation algorithm
- Method for testing NLP models with text adversarial examples
- The use of anthropometric points to introduce restrictions into the synthesis of a 3D model of the human body using SMPL
- Method for optimization of camera installation parameters for video monitoring of arbitrary surveillance zone
Last modified: 2023-10-24 18:33:43