Approaches and tools for Russian text linguistic profiling
Journal: Russian Language Studies (Vol.22, No. 4)Publication Date: 2025-02-22
Authors : Marina Solnyshkina; Valery Solovyev; Yulia Ebzeeva;
Page : 501-517
Keywords : linguistic analysis; text profiler RuLingva; text complexity; educational text; typological passport of the text; complexity predictors;
Abstract
Approaches and tools for assessing linguistic and cognitive complexity of educational texts are in demand both in science and teaching. Predicting difficulties of perception and understanding and ranking texts by classes, i.e. the number of years of learning or levels of language proficiency (A1-C2), are of particular importance for education. The study is aimed at demonstrating modern methodologies, algorithms, and tools for analyzing Russian texts in text profiler and automatic analyzer RuLingva and at presenting articles from the thematic issue on comprehensive analysis of Russian language textbooks for Russian and Belarusian schools. The research demonstrates that the modern paradigm of discourse complexology is based on the methods of stylistic statistics, which identifies functional characteristics of language units and verifies them using big data. The services on RuLingva are designed for teachers and researchers; they automatically analyze educational texts and predict their target audience based on readability, lexical diversity, abstractness, frequency, and terminological density. In “Russian as a Foreign Language” mode, RuLingva downloads lists of words from the text according to each level of language proficiency and estimates their proportion. This provides material for pre- and post-text work. RuLingva algorithm is based on the typology of educational texts and is to be supplied with tools for assessing a person’s verbal intelligence and reading literacy. The nearest prospect of RuLingva lies in widening the range of complexity predictors and installing automatic subject area discriminator. Both directions are planned to be implemented using neural networks, classification models, “typological passports” of educational texts with different complexity, and thematic orientation.
Other Latest Articles
- Numerical simulation of physica lparameters in the atmosphere applying the WRF model to analyze the weather with inTiquipaya municipality (Bolivia)
- Theoretical study of hemiacetal synthesis catalyzed by Lewis acids
- Behavior and analysis of insolation and solar irradiation of the Uru-Chipaya Andeanhousing: Revaluation of ancestral knowledge of andean architecture from digital simulationsfor bioclimatic design
- EDITORIAL NOTE
- Identification of people without a mask using Deep Learning
Last modified: 2025-02-22 07:43:18