dc.contributor | Kazan Federal University | |
dc.contributor.author | Mariko Mohamed Lamine Lamin | |
dc.date.accessioned | 2025-07-11T11:28:40Z | |
dc.date.available | 2025-07-11T11:28:40Z | |
dc.date.issued | 2025 | |
dc.identifier.citation | Mariko M. Comparative analysis of lexical density, lexical diversity, and multiword expressions in Russian, English, and French legal texts: implications for readability and understandability / M. Mariko // Russian Linguistic Bulletin. - 2025. - № 7 (67). - URL: https://rulb.org/en/archive/7-67-2025-july/10.60797/RULB.2025.67.5. - DOI: 10.60797/RULB.2025.67.5 | |
dc.identifier.uri | https://dspace.kpfu.ru/xmlui/handle/net/185323 | |
dc.description.abstract | Lexical density is closely related to the notion of information packaging: content words carry most of a text's information, so texts with a higher proportion of content words are denser than texts with a higher proportion of function words [10, P. 61-79]. Lexical diversity refers to how varied the vocabulary used in a text is. The type-token ratio (TTR), vocabulary size divided by text length, is a simple measure of it: for texts of similar length, the traditional TTR can be used, which is the number of different words (types) in a text divided by the total number of words (tokens) [1, P. 185-207]. Multiword expressions are a diverse group of linguistic phenomena that do not fit neatly into the word-phrase dichotomy; like phrases, they appear to be made up of multiple words. Our research aimed to examine the lexical density, TTR, and multiword expressions of Russian, English, and French legal texts (texts of the United Nations). A corpus of 60,000 words was built for the analysis, and Rulingva, TextInspector, and LancsBox were used to compute the data. We compared the automatic analysis results for the three linguistic measures across Russian, English, and French. The results showed that the linguistic features selected for the investigation could affect text complexity through lexical richness, a multidimensional concept that encompasses several aspects of lexis use [12, P. 19]. | |
dc.language.iso | en | |
dc.relation.ispartofseries | Russian Linguistic Bulletin | |
dc.rights | open access | |
dc.subject | French UN texts | |
dc.subject | English UN texts | |
dc.subject | Russian UN texts | |
dc.subject | TTR | |
dc.subject | lexical density | |
dc.subject | multi-word expressions | |
dc.subject | n-grams | |
dc.subject | readability | |
dc.subject.other | Linguistics | |
dc.title | COMPARATIVE ANALYSIS OF LEXICAL DENSITY, LEXICAL DIVERSITY, AND MULTIWORD EXPRESSIONS IN RUSSIAN, ENGLISH, AND FRENCH LEGAL TEXTS: IMPLICATIONS FOR READABILITY AND UNDERSTANDABILITY | |
dc.type | Article | |
dc.contributor.org | Institute of Philology and Intercultural Communication | |
dc.description.pages | 1-4 | |
dc.relation.ispartofseries-issue | 7 (67) | |
dc.pub-id | 315822 | |
dc.identifier.doi | 10.60797/RULB.2025.67.5 |
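
The two ratio measures defined in the abstract can be illustrated with a minimal Python sketch. It is not the procedure used by Rulingva, TextInspector, or LancsBox, whose tokenization and word-class rules are not described in this record. The tokenizer, the small English function-word list, and the example sentence are all assumptions made for illustration.

```python
import re

# Hypothetical, deliberately tiny English function-word list (an assumption
# for illustration; real tools rely on part-of-speech tagging or full lists).
FUNCTION_WORDS = {
    "the", "a", "an", "of", "to", "in", "and", "or", "is", "are",
    "by", "on", "for", "with", "that", "this", "be", "shall",
}


def tokenize(text):
    """Lowercase the text and return runs of letters as tokens."""
    return re.findall(r"[^\W\d_]+", text.lower())


def type_token_ratio(tokens):
    """Number of distinct words (types) divided by total words (tokens)."""
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


def lexical_density(tokens, function_words=FUNCTION_WORDS):
    """Proportion of tokens that are not in the function-word list."""
    if not tokens:
        return 0.0
    content = [t for t in tokens if t not in function_words]
    return len(content) / len(tokens)


# Made-up example sentence, not taken from the study's corpus.
sentence = "The rights of the child shall be protected by the state"
tokens = tokenize(sentence)
ttr = type_token_ratio(tokens)
density = lexical_density(tokens)
print(f"tokens={len(tokens)} TTR={ttr:.3f} density={density:.3f}")
```

Because TTR falls as texts get longer, the sketch is only meaningful for comparing texts of similar length, as the abstract notes.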