Kazan Federal University Digital Repository

Introducing baselines for Russian named entity recognition

dc.contributor.author Gareev R.
dc.contributor.author Tkachenko M.
dc.contributor.author Solovyev V.
dc.contributor.author Simanovsky A.
dc.contributor.author Ivanov V.
dc.date.accessioned 2018-09-18T20:08:19Z
dc.date.available 2018-09-18T20:08:19Z
dc.date.issued 2013
dc.identifier.issn 0302-9743
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/136885
dc.description.abstract Current research efforts in Named Entity Recognition deal mostly with the English language. Even though interest in multi-language Information Extraction is growing, only a few works report results for the Russian language. This paper introduces quality baselines for the Russian NER task. We propose a corpus that was manually annotated with organization and person names. The main purpose of this corpus is to provide a gold standard for evaluation. We implemented and evaluated two approaches to NER: knowledge-based and statistical. The first one comprises several components: dictionary matching, pattern matching, and rule-based search for lexical representations of entity names within a document. We assembled a set of linguistic resources and evaluated their impact on performance. For the data-driven approach we used our implementation of a linear-chain CRF with a rich set of features. The performance of both systems is promising (62.17% and 75.05% F1 measure), although they do not employ morphological or syntactic analysis. © 2013 Springer-Verlag.
dc.relation.ispartofseries Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.title Introducing baselines for Russian named entity recognition
dc.type Conference Paper
dc.relation.ispartofseries-issue PART 1
dc.relation.ispartofseries-volume 7816 LNCS
dc.collection Publications of KFU staff
dc.relation.startpage 329
dc.source.id SCOPUS03029743-2013-7816-1-SID84875516972


This item appears in the following Collection(s)

  • Publications of KFU staff, Scopus [24551]
    This collection contains publications by staff of Kazan Federal University (Kazan State University until 2010) indexed in the Scopus database, from 1970 onward.
