The DReaM corpus: A multilingual annotated corpus of grammars for the world's languages

dc.contributor.author	Virk S.M.
dc.contributor.author	Hammarström H.
dc.contributor.author	Forsberg M.
dc.contributor.author	Wichmann S.
dc.date.accessioned	2021-02-25T06:54:59Z
dc.date.available	2021-02-25T06:54:59Z
dc.date.issued	2020
dc.identifier.uri	https://dspace.kpfu.ru/xmlui/handle/net/161498
dc.description.abstract	© European Language Resources Association (ELRA), licensed under CC-BY-NC There exist as many as 7000 natural languages in the world, and a huge number of documents describing those languages have been produced over the years. Most of those documents are in paper format. Any attempts to use modern computational techniques and tools to process those documents will require them to be digitized first. In this paper, we report a multilingual digitized version of thousands of such documents searchable through some well-established corpus infrastructures. The corpus is annotated with various meta, word, and text level attributes to make searching and analysis easier and more useful.
dc.subject	Corpus
dc.subject	Grammatical descriptions
dc.subject	Natural languages
dc.subject	World's languages
dc.title	The DReaM corpus: A multilingual annotated corpus of grammars for the world's languages
dc.type	Conference Paper
dc.collection	Публикации сотрудников КФУ
dc.relation.startpage	878
dc.source.id	SCOPUS-2020-SID85096574636

Файлы в этом документе

Размер: 46.85Kb

Формат: PDF

Публикации сотрудников КФУ Scopus [24551]
Коллекция содержит публикации сотрудников Казанского федерального (до 2010 года Казанского государственного) университета, проиндексированные в БД Scopus, начиная с 1970г.