Показать сокращенную информацию
dc.contributor.author | Virk S.M. | |
dc.contributor.author | Hammarström H. | |
dc.contributor.author | Forsberg M. | |
dc.contributor.author | Wichmann S. | |
dc.date.accessioned | 2021-02-25T06:54:59Z | |
dc.date.available | 2021-02-25T06:54:59Z | |
dc.date.issued | 2020 | |
dc.identifier.uri | https://dspace.kpfu.ru/xmlui/handle/net/161498 | |
dc.description.abstract | © European Language Resources Association (ELRA), licensed under CC-BY-NC There exist as many as 7000 natural languages in the world, and a huge number of documents describing those languages have been produced over the years. Most of those documents are in paper format. Any attempts to use modern computational techniques and tools to process those documents will require them to be digitized first. In this paper, we report a multilingual digitized version of thousands of such documents searchable through some well-established corpus infrastructures. The corpus is annotated with various meta, word, and text level attributes to make searching and analysis easier and more useful. | |
dc.subject | Corpus | |
dc.subject | Grammatical descriptions | |
dc.subject | Natural languages | |
dc.subject | World's languages | |
dc.title | The DReaM corpus: A multilingual annotated corpus of grammars for the world's languages | |
dc.type | Conference Paper | |
dc.collection | Публикации сотрудников КФУ | |
dc.relation.startpage | 878 | |
dc.source.id | SCOPUS-2020-SID85096574636 |