Электронный архив

COMPARATIVE ANALYSIS OF LEXICAL DENSITY, LEXICAL DIVERSITY, AND MULTIWORD EXPRESSIONS IN RUSSIAN, ENGLISH, AND FRENCH LEGAL TEXTS: IMPLICATIONS FOR READABILITY AND UNDERSTANDABILITY

Показать сокращенную информацию

dc.contributor Казанский федеральный университет
dc.contributor.author Mariko Mohamed Lamine Lamin
dc.date.accessioned 2025-07-11T11:28:40Z
dc.date.available 2025-07-11T11:28:40Z
dc.date.issued 2025
dc.identifier.citation Mariko M. Comparative analysis of lexical density, lexical diversity, and multiword expressions in Russian, English, and French legal texts: implications for readability and understandability / M. Mariko // Russian Linguistic Bulletin. - 2025. - № 7 (67). - URL: https://rulb.org/en/archive/7-67-2025-july/10.60797/RULB.2025.67.5. - DOI: 10.60797/RULB.2025.67.5
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/185323
dc.description.abstract Lexical density is closely related to the notion of information packaging as content words in a text; therefore, texts with a higher proportion of content words are dense as they contain more information as opposed to texts that have a higher proportion of function words [10, P. 61-79]. Type-token ratio (TTR), also known as vocabulary size divided by text length, is a simple measure of lexical diversity. Lexical diversity refers to how varied the vocabulary used in a text is. For texts of similar length, the traditional type-token ratio can be used, which is the number of different words (types) in a text divided by the total number of words (tokens) [1, P. 185-207]. Multiword expressions refer to a diverse group of linguistic phenomena, connected by the fact that they do not fit neatly into the word-phrase dichotomy. Like phrases, they appear to be made up of multiple words. In our research, we analyzed text materials with lexical density, TTR, and multiword expressions from legal texts (texts of the United Nations). We compared the automatic analysis results to three linguistic measures in Russian, English, and French. A 60,000-word-based corpus was built for the analysis. Our research aimed at examining the lexical density, TTR, and multiword expressions of Russian, English, and French UN texts. To reach that goal, we used Rulingva, TextInspector, and LancsBox to compute our data. The results showed that the linguistic features selected for the investigation could impact complexity on account of lexical richness, being a multidimensional concept that encompasses several aspects of lexis use [12, P. 19].
dc.language.iso en
dc.relation.ispartofseries Russian Linguistic Bulletin
dc.rights открытый доступ
dc.subject French UN texts
dc.subject English UN texts
dc.subject Russian UN texts
dc.subject TTR
dc.subject lexical density
dc.subject multi-word expressions
dc.subject n-grams
dc.subject readability.
dc.subject.other Языкознание
dc.title COMPARATIVE ANALYSIS OF LEXICAL DENSITY, LEXICAL DIVERSITY, AND MULTIWORD EXPRESSIONS IN RUSSIAN, ENGLISH, AND FRENCH LEGAL TEXTS: IMPLICATIONS FOR READABILITY AND UNDERSTANDABILITY
dc.type Article
dc.contributor.org Институт филологии и межкультурной коммуникации
dc.description.pages 1-4
dc.relation.ispartofseries-issue 7 (67)
dc.pub-id 315822
dc.identifier.doi 10.60797/RULB.2025.67.5


Файлы в этом документе

Данный элемент включен в следующие коллекции

Показать сокращенную информацию

Поиск в электронном архиве


Расширенный поиск

Просмотр

Моя учетная запись

Статистика