Kazan Federal University Digital Repository

Vkontakte’ local friendship networks: Identifying the missed residence of users in profile data

Show simple item record

dc.date.accessioned 2019-01-22T20:53:04Z
dc.date.available 2019-01-22T20:53:04Z
dc.date.issued 2018
dc.identifier.uri https://dspace.kpfu.ru/xmlui/handle/net/149251
dc.description.abstract © 2018 Russian Public Opinion Research Center, VCIOM. All rights reserved. Online social networks (e. g. the most popular Russian website ‘VKontakte’) are a source of available information about users due to the open data policy. Therefore, researchers have great opportunities to study the topology of interaction networks in the online environment using a social network analysis. However, the personal data that users provide in their public profiles are often incomplete: sections on gender, age or city may be missed inadvertently or skipped intentionally. At the same time, these essential characteristics serve as ‘nodes’ (i. e. users) and help single out clusters of similar agents and their behavior patterns. The absence of some data can significantly affect network metrics (e. g. size of network, average path length between two participants, distribution of the number of connections between them, etc.) and cause distorted results. In this regard, there is a need to fill gaps in data. The paper presents a case study on the design and applications of a classifier which would determine whether a VKontakte user whose location was not specified in the profile is a resident of a particular city. The classifier was created and tested for the Izhevsk city user network. It is based on the decision tree method which gradually filters the accounts by a series of questions. The paper explains the choice of the main indicators helping the classifier to determine the user’s city, describes the algorithm and shows how the network topology changes as the missing data on user’s location are added.
dc.subject Big data
dc.subject Missing data
dc.subject Network homophily
dc.subject Network topology
dc.subject Online communities
dc.subject Social network analysis
dc.subject Using R for data analysis
dc.subject VKontakte
dc.title Vkontakte’ local friendship networks: Identifying the missed residence of users in profile data
dc.type Article
dc.relation.ispartofseries-issue 3
dc.relation.ispartofseries-volume 145
dc.collection Публикации сотрудников КФУ
dc.relation.startpage 78
dc.source.id SCOPUS-2018-145-3-SID85050077117


Files in this item

This item appears in the following Collection(s)

  • Публикации сотрудников КФУ Scopus [24551]
    Коллекция содержит публикации сотрудников Казанского федерального (до 2010 года Казанского государственного) университета, проиндексированные в БД Scopus, начиная с 1970г.

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics