Using FLOSS for Storing, Processing and Linking Corpus Data

Nevzorova O.; Mukhamedshin D.; Kirillovich A.

Using FLOSS for Storing, Processing and Linking Corpus Data

Mukhamedshin D.; Nevzorova O.; Kirillovich A.

URI: https://dspace.kpfu.ru/xmlui/handle/net/161415

Date: 2020

Abstract:

© 2020, IFIP International Federation for Information Processing. Corpus data is widely used to solve different linguistic, educational and applied problems. The Tatar corpus management system (http://tugantel.tatar) is specifically developed for Turkic languages. The functionality of our corpus management system includes a search of lexical units, morphological and lexical search, a search of syntactic units, a search of N-grams and others. The search is performed using open source tools (database management system MariaDB, Redis data store). This article describes the process of choosing FLOSS for the main components of our system and also processing a search query and building a linked open dataset based on corpus data.

Show full item record

Files in this item

Name: SCOPUS18684238-20 ...

Size: 83.03Kb

Format: PDF

View/Open

This item appears in the following Collection(s)

Публикации сотрудников КФУ Scopus [24551]
Коллекция содержит публикации сотрудников Казанского федерального (до 2010 года Казанского государственного) университета, проиндексированные в БД Scopus, начиная с 1970г.