Electronic archive

Introducing baselines for Russian named entity recognition


dc.contributor.author Gareev R.
dc.contributor.author Tkachenko M.
dc.contributor.author Solovyev V.
dc.contributor.author Simanovsky A.
dc.contributor.author Ivanov V.
dc.date.accessioned 2018-09-18T20:08:19Z
dc.date.available 2018-09-18T20:08:19Z
dc.date.issued 2013
dc.identifier.issn 0302-9743
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/136885
dc.description.abstract Current research efforts in Named Entity Recognition deal mostly with the English language. Although interest in multi-language Information Extraction is growing, only a few works report results for the Russian language. This paper introduces quality baselines for the Russian NER task. We propose a corpus which was manually annotated with organization and person names. The main purpose of this corpus is to provide a gold standard for evaluation. We implemented and evaluated two approaches to NER: knowledge-based and statistical. The first one comprises several components: dictionary matching, pattern matching and rule-based search of lexical representations of entity names within a document. We assembled a set of linguistic resources and evaluated their impact on performance. For the data-driven approach we utilized our implementation of a linear-chain CRF which uses a rich set of features. The performance of both systems is promising (62.17% and 75.05% F1 measure), although they do not employ morphological or syntactic analysis. © 2013 Springer-Verlag.
dc.relation.ispartofseries Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.title Introducing baselines for Russian named entity recognition
dc.type Conference Paper
dc.relation.ispartofseries-issue PART 1
dc.relation.ispartofseries-volume 7816 LNCS
dc.collection Publications of KFU staff
dc.relation.startpage 329
dc.source.id SCOPUS03029743-2013-7816-1-SID84875516972
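
The abstract names dictionary matching as one component of the knowledge-based system and reports results as F1. The sketch below illustrates both ideas in a minimal form. It is not the authors' implementation: the gazetteer contents, the greedy longest-match strategy, the `dictionary_match` and `f1_score` names, and the exact-span scoring scheme are all assumptions made for illustration. Like the systems described, it matches surface forms only and uses no morphological analysis.

```python
from typing import Dict, List, Tuple

# (start token index, end token index (exclusive), label)
Span = Tuple[int, int, str]


def dictionary_match(tokens: List[str],
                     gazetteer: Dict[Tuple[str, ...], str]) -> List[Span]:
    """Greedy longest-match lookup of token sequences in a gazetteer.

    Assumption: exact surface-form matching, no lemmatization.
    """
    max_len = max((len(key) for key in gazetteer), default=0)
    spans: List[Span] = []
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate first, then shrink the window.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            key = tuple(tokens[i:i + n])
            if key in gazetteer:
                spans.append((i, i + n, gazetteer[key]))
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return spans


def f1_score(predicted: List[Span], gold: List[Span]) -> float:
    """Entity-level F1, counting a prediction as correct only when
    its boundaries and label both match a gold span (an assumed scheme)."""
    pred_set, gold_set = set(predicted), set(gold)
    true_positives = len(pred_set & gold_set)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(pred_set)
    recall = true_positives / len(gold_set)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data, not taken from the paper's corpus.
tokens = ["Иван", "Петров", "работает", "в", "Яндекс"]
gazetteer = {("Иван", "Петров"): "PER", ("Яндекс",): "ORG"}
gold = [(0, 2, "PER")]

predicted = dictionary_match(tokens, gazetteer)
score = f1_score(predicted, gold)
```

Here the matcher finds two spans while the toy gold annotation contains only one, so precision is 0.5, recall is 1.0, and `score` is 2/3.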



This item appears in the following collections

  • Publications of KFU staff: Scopus [24551]
    The collection contains publications by staff of Kazan Federal University (Kazan State University until 2010) indexed in the Scopus database, starting from 1970.
