Electronic Archive

Neural Network Recognition of Russian Noun and Adjective Cases in the Google Books Ngram Corpus

dc.contributor.author Savinkov A.V.
dc.contributor.author Bochkarev V.V.
dc.contributor.author Shevlyakova A.V.
dc.contributor.author Khristoforov S.V.
dc.date.accessioned 2022-02-09T20:33:46Z
dc.date.available 2022-02-09T20:33:46Z
dc.date.issued 2021
dc.identifier.issn 0302-9743
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/169033
dc.description.abstract The article proposes a solution to the problem of automatically recognizing the cases of Russian nouns and adjectives in the Google Books Ngram corpus. Recognition relied on word co-occurrence statistics extracted from the corpus. Explicit Word Vectors, composed of the frequencies of ordinary and syntactic bigrams that include a given word, were fed to the recognizer. Several types of vector representation and of preliminary data normalization were compared. The trained model was a multi-layer perceptron with a softmax output layer. To train and test the model, we selected the 50,000 adjectives and 50,000 nouns most frequently used in the Google Books Ngram Russian subcorpus between 1920 and 2009. Parts of speech and cases were determined using the OpenCorpora electronic morphological dictionary. The case recognition accuracy of the trained neural network model was 96.45% for nouns and 99.63% for adjectives.
dc.relation.ispartofseries Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.subject Disambiguation
dc.subject Google Books Ngram
dc.subject Neural networks
dc.subject Russian cases
dc.title Neural Network Recognition of Russian Noun and Adjective Cases in the Google Books Ngram Corpus
dc.type Conference Proceeding
dc.relation.ispartofseries-volume 12997 LNAI
dc.collection KFU Staff Publications
dc.relation.startpage 626
dc.source.id SCOPUS03029743-2021-12997-SID85116357619
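
The pipeline described in the abstract (bigram-frequency vectors, normalization, a perceptron with a softmax output) can be illustrated with a minimal sketch. Everything below is an assumption for illustration and not the authors' implementation: the toy bigram counts, the three context words, the log-plus-L2 normalization, the single ReLU hidden layer, the untrained random weights, and the six-case label set are all hypothetical. Syntactic bigrams are omitted; only ordinary left and right neighbours are counted.

```python
import math
import random

# Hypothetical label set: the six main Russian cases (an assumption).
CASES = ["nomn", "gent", "datv", "accs", "ablt", "loct"]

def explicit_word_vector(word, bigram_counts, context_words):
    """Frequencies of bigrams (word, c) followed by (c, word) for each context word c."""
    right = [bigram_counts.get((word, c), 0) for c in context_words]
    left = [bigram_counts.get((c, word), 0) for c in context_words]
    return right + left

def normalize(vec):
    """Log-scale, then L2-normalize (one possible normalization, assumed here)."""
    logged = [math.log1p(v) for v in vec]
    norm = math.sqrt(sum(v * v for v in logged))
    return [v / norm for v in logged] if norm > 0 else logged

def softmax(scores):
    """Numerically stable softmax."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def mlp_forward(x, w_hidden, b_hidden, w_out, b_out):
    """One ReLU hidden layer followed by a softmax output layer."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(w_hidden, b_hidden)]
    scores = [sum(w * h for w, h in zip(row, hidden)) + b
              for row, b in zip(w_out, b_out)]
    return softmax(scores)

def make_weights(rows, cols, rng):
    """Small random weights; a real model would learn these by training."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

# Toy counts, invented for the example (not corpus data).
context_words = ["в", "на", "новая"]
bigram_counts = {("в", "книге"): 120, ("на", "книге"): 30, ("книге", "в"): 5}

x = normalize(explicit_word_vector("книге", bigram_counts, context_words))

rng = random.Random(0)
w_hidden, b_hidden = make_weights(4, len(x), rng), [0.0] * 4
w_out, b_out = make_weights(len(CASES), 4, rng), [0.0] * len(CASES)

probs = mlp_forward(x, w_hidden, b_hidden, w_out, b_out)
predicted = CASES[probs.index(max(probs))]
print(predicted, [round(p, 3) for p in probs])
```

With untrained weights the predicted label is meaningless; the sketch only shows how a word's bigram frequencies become a fixed-length input and how the softmax layer turns the network's scores into a probability distribution over cases.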


This item appears in the following collections

  • KFU Staff Publications Scopus [24551]
    The collection contains publications by staff of Kazan Federal University (Kazan State University until 2010) indexed in the Scopus database, starting from 1970.
