Kazan Federal University Digital Repository

Methods and software tools of morphological disambiguation in the texts in tatar

Show simple item record

dc.contributor.author Gataullin R.
dc.contributor.author Gilmullin R.
dc.contributor.author Suleymanov D.
dc.date.accessioned 2018-09-18T20:12:26Z
dc.date.available 2018-09-18T20:12:26Z
dc.date.issued 2015
dc.identifier.issn 0973-4562
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/137559
dc.description.abstract © Research India Publications. This article provides a review of analytical methods for resolving the problem of morphological ambiguity and analysis of their applicability to the Tatar language. Since the task was set still in the 50-60-ies of XX century, the methods of solution have been accumulated quite a lot. Basically they can be divided into methods of rule-based and statistical and probabilistic methods. Methods are mainly language independent, each has its advantages and disadvantages, and their accuracy varies from one language to another. For example, for the English language, which has a poor morphology and the fixed order of the words, the accuracy reaches 94-96%. And for the Russian language with free word order, such accuracy is difficult. To resolve the ambiguity in morphological Tatar language in terms of the characteristics of the language such as agglutinative feature and free word order, it is offered a fusion of these methods, by which a high precision resolution is supposed to be achieved. At the moment, the research is still in progress, the tools for the development of contextual rules have been designed, subcorpus for statistical machine learning and probabilistic models is also being elaborated. In addition to the methods, the article describes the current state of the electronic corpus of the Tatar language, and discusses the problems and possible solutions to the problem of polysemanticism in the corpus markings.
dc.relation.ispartofseries International Journal of Applied Engineering Research
dc.subject Electronic corpus of the language
dc.subject Resolving morphological ambiguity
dc.subject The Tatar language
dc.title Methods and software tools of morphological disambiguation in the texts in tatar
dc.type Article
dc.relation.ispartofseries-issue 24
dc.relation.ispartofseries-volume 10
dc.collection Публикации сотрудников КФУ
dc.relation.startpage 44795
dc.source.id SCOPUS09734562-2015-10-24-SID84955584925


Files in this item

This item appears in the following Collection(s)

  • Публикации сотрудников КФУ Scopus [24551]
    Коллекция содержит публикации сотрудников Казанского федерального (до 2010 года Казанского государственного) университета, проиндексированные в БД Scopus, начиная с 1970г.

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics