dc.contributor |
Казанский федеральный университет |
|
dc.contributor.author |
Mariko Mohamed Lamine Lamin |
|
dc.date.accessioned |
2025-07-11T11:28:40Z |
|
dc.date.available |
2025-07-11T11:28:40Z |
|
dc.date.issued |
2025 |
|
dc.identifier.citation |
Mariko M. Comparative analysis of lexical density, lexical diversity, and multiword expressions in Russian, English, and French legal texts: implications for readability and understandability / M. Mariko // Russian Linguistic Bulletin. - 2025. - № 7 (67). - URL: https://rulb.org/en/archive/7-67-2025-july/10.60797/RULB.2025.67.5. - DOI: 10.60797/RULB.2025.67.5 |
|
dc.identifier.uri |
https://dspace.kpfu.ru/xmlui/handle/net/185323 |
|
dc.description.abstract |
Lexical density is closely related to the notion of information packaging as content words in a text; therefore, texts with a
higher proportion of content words are dense as they contain more information as opposed to texts that have a higher
proportion of function words [10, P. 61-79]. Type-token ratio (TTR), also known as vocabulary size divided by text length, is a
simple measure of lexical diversity. Lexical diversity refers to how varied the vocabulary used in a text is. For texts of similar
length, the traditional type-token ratio can be used, which is the number of different words (types) in a text divided by the total
number of words (tokens) [1, P. 185-207]. Multiword expressions refer to a diverse group of linguistic phenomena, connected
by the fact that they do not fit neatly into the word-phrase dichotomy. Like phrases, they appear to be made up of multiple
words. In our research, we analyzed text materials with lexical density, TTR, and multiword expressions from legal texts (texts
of the United Nations). We compared the automatic analysis results to three linguistic measures in Russian, English, and
French. A 60,000-word-based corpus was built for the analysis. Our research aimed at examining the lexical density, TTR, and
multiword expressions of Russian, English, and French UN texts. To reach that goal, we used Rulingva, TextInspector, and
LancsBox to compute our data. The results showed that the linguistic features selected for the investigation could impact
complexity on account of lexical richness, being a multidimensional concept that encompasses several aspects of lexis use [12,
P. 19]. |
|
dc.language.iso |
en |
|
dc.relation.ispartofseries |
Russian Linguistic Bulletin |
|
dc.rights |
открытый доступ |
|
dc.subject |
French UN texts |
|
dc.subject |
English UN texts |
|
dc.subject |
Russian UN texts |
|
dc.subject |
TTR |
|
dc.subject |
lexical density |
|
dc.subject |
multi-word expressions |
|
dc.subject |
n-grams |
|
dc.subject |
readability. |
|
dc.subject.other |
Языкознание |
|
dc.title |
COMPARATIVE ANALYSIS OF LEXICAL DENSITY, LEXICAL DIVERSITY, AND MULTIWORD EXPRESSIONS IN RUSSIAN, ENGLISH, AND FRENCH LEGAL TEXTS: IMPLICATIONS FOR READABILITY AND UNDERSTANDABILITY |
|
dc.type |
Article |
|
dc.contributor.org |
Институт филологии и межкультурной коммуникации |
|
dc.description.pages |
1-4 |
|
dc.relation.ispartofseries-issue |
7 (67) |
|
dc.pub-id |
315822 |
|
dc.identifier.doi |
10.60797/RULB.2025.67.5 |
|