Электронный архив

Speaker Diarization through Waveform and Neural Net

Показать сокращенную информацию

dc.contributor.author Latypov R.
dc.contributor.author Stolov E.
dc.date.accessioned 2022-02-09T20:47:48Z
dc.date.available 2022-02-09T20:47:48Z
dc.date.issued 2021
dc.identifier.issn 2305-7254
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/170349
dc.description.abstract This paper presents an approach to the speaker diarization problem based on speech local waveform analysis. We assume that the recorded sound scene consists of a known number of sources and that the single microphone is utilized for recording. The research goal is to develop an algorithm for speaker diarization in online mode. The most significant attention is paid to limiting computer resources when solving the problem. We suppose that the speech file is already segmented so that any segment belongs to a single speaker. Our method is as follows. We divide each part into non-overlapping fragments of the constant length and change any sample in the piece to its absolute value. A particular technique is used to choose a threshold value Thr. After that, we select the portions of the fragments that exceed Thr and implement coding to describe the source signal's revealed parts as normalized cumulative sums containing the same number of items. These sums are used as input vectors for two types of neural networks. For comparison, we also developed a simple algorithm that does not leverage the neural net but fits the problem. The experiment shows that the end-to-end neural classification of the fragments brings acceptable results.
dc.relation.ispartofseries Conference of Open Innovation Association, FRUCT
dc.title Speaker Diarization through Waveform and Neural Net
dc.type Conference Proceeding
dc.relation.ispartofseries-volume 2021-May
dc.collection Публикации сотрудников КФУ
dc.relation.startpage 234
dc.source.id SCOPUS23057254-2021-2021-SID85107441138


Файлы в этом документе

Данный элемент включен в следующие коллекции

  • Публикации сотрудников КФУ Scopus [24551]
    Коллекция содержит публикации сотрудников Казанского федерального (до 2010 года Казанского государственного) университета, проиндексированные в БД Scopus, начиная с 1970г.

Показать сокращенную информацию

Поиск в электронном архиве


Расширенный поиск

Просмотр

Моя учетная запись

Статистика