Электронный архив

Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification

Показать сокращенную информацию

dc.contributor.author Alekseev A.
dc.contributor.author Tutubalina E.
dc.contributor.author Malykh V.
dc.contributor.author Nikolenko S.
dc.date.accessioned 2021-02-25T20:37:43Z
dc.date.available 2021-02-25T20:37:43Z
dc.date.issued 2020
dc.identifier.issn 1064-1246
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/162084
dc.description.abstract © 2020 - IOS Press and the authors. All rights reserved. Deep learning architectures based on self-attention have recently achieved and surpassed state of the art results in the task of unsupervised aspect extraction and topic modeling. While models such as neural attention-based aspect extraction (ABAE) have been successfully applied to user-generated texts, they are less coherent when applied to traditional data sources such as news articles and newsgroup documents. In this work, we introduce a simple approach based on sentence filtering in order to improve topical aspects learned from newsgroups-based content without modifying the basic mechanism of ABAE. We train a probabilistic classifier to distinguish between out-of-domain texts (outer dataset) and in-domain texts (target dataset). Then, during data preparation we filter out sentences that have a low probability of being in-domain and train the neural model on the remaining sentences. The positive effect of sentence filtering on topic coherence is demonstrated in comparison to aspect extraction models trained on unfiltered texts.
dc.relation.ispartofseries Journal of Intelligent and Fuzzy Systems
dc.subject Aspect extraction
dc.subject deep learning
dc.subject out-of-domain classification
dc.subject topic coherence
dc.subject topic models
dc.title Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification
dc.type Article
dc.relation.ispartofseries-issue 2
dc.relation.ispartofseries-volume 39
dc.collection Публикации сотрудников КФУ
dc.relation.startpage 2487
dc.source.id SCOPUS10641246-2020-39-2-SID85091103086


Файлы в этом документе

Данный элемент включен в следующие коллекции

  • Публикации сотрудников КФУ Scopus [24551]
    Коллекция содержит публикации сотрудников Казанского федерального (до 2010 года Казанского государственного) университета, проиндексированные в БД Scopus, начиная с 1970г.

Показать сокращенную информацию

Поиск в электронном архиве


Расширенный поиск

Просмотр

Моя учетная запись

Статистика