Kazan Federal University Digital Repository

The DReaM corpus: A multilingual annotated corpus of grammars for the world's languages

Show simple item record

dc.contributor.author Virk S.M.
dc.contributor.author Hammarström H.
dc.contributor.author Forsberg M.
dc.contributor.author Wichmann S.
dc.date.accessioned 2021-02-25T06:54:59Z
dc.date.available 2021-02-25T06:54:59Z
dc.date.issued 2020
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/161498
dc.description.abstract © European Language Resources Association (ELRA), licensed under CC-BY-NC There exist as many as 7000 natural languages in the world, and a huge number of documents describing those languages have been produced over the years. Most of those documents are in paper format. Any attempts to use modern computational techniques and tools to process those documents will require them to be digitized first. In this paper, we report a multilingual digitized version of thousands of such documents searchable through some well-established corpus infrastructures. The corpus is annotated with various meta, word, and text level attributes to make searching and analysis easier and more useful.
dc.subject Corpus
dc.subject Grammatical descriptions
dc.subject Natural languages
dc.subject World's languages
dc.title The DReaM corpus: A multilingual annotated corpus of grammars for the world's languages
dc.type Conference Paper
dc.collection Публикации сотрудников КФУ
dc.relation.startpage 878
dc.source.id SCOPUS-2020-SID85096574636


Files in this item

This item appears in the following Collection(s)

  • Публикации сотрудников КФУ Scopus [24551]
    Коллекция содержит публикации сотрудников Казанского федерального (до 2010 года Казанского государственного) университета, проиндексированные в БД Scopus, начиная с 1970г.

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics