Robustness of network centrality metrics in the context of digital communication data

Ju Sung Lee, Juergen Pfeffer

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

4 Scopus citations

Abstract

Social media data and other web-based network data are large and dynamic, which makes identifying structural changes in such systems a hard problem. Online data typically streams continuously and is therefore incomplete, so it is necessary to understand the robustness of network metrics on partial or sampled network data. In this paper, we examine the effects of sampling on key network centrality metrics using two empirical communication datasets. Correlations between network metrics of original and sampled nodes offer a measure of sampling accuracy. The relationship between sampling and accuracy is convergent and amenable to nonlinear analysis. Naturally, larger edge samples induce sampled graphs that are more representative of the original graph. However, this effect is attenuated when larger sets of nodes are recovered in the samples. We also find that graph structure plays a prominent role in sampling accuracy: centralized graphs, in which fewer nodes enjoy higher centrality scores, offer more representative samples.
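
The sampling-accuracy idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes uniform random edge sampling, uses plain degree as a stand-in for the centrality metrics studied, and compares only the nodes recovered in the sample. The toy hub-and-chain graph and the names `degree_counts`, `pearson`, and `sampling_accuracy` are hypothetical and chosen for illustration only.

```python
import math
import random
from collections import Counter


def degree_counts(edges):
    """Count the degree of every node that appears in an undirected edge list."""
    deg = Counter()
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return deg


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (nan if either is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return float("nan")
    return cov / math.sqrt(vx * vy)


def sampling_accuracy(edges, fraction, rng):
    """Correlate degree in the full graph with degree in a random edge sample.

    Assumption: accuracy is measured only over nodes recovered in the sample.
    """
    k = max(1, round(fraction * len(edges)))
    sampled = rng.sample(edges, k)  # uniform edge sample without replacement
    full = degree_counts(edges)
    part = degree_counts(sampled)
    nodes = sorted(part)
    return pearson([full[n] for n in nodes], [part[n] for n in nodes])


# Hypothetical toy graph: one hub (node 0) linked to 29 nodes, plus a chain among them.
edges = [(0, i) for i in range(1, 30)] + [(i, i + 1) for i in range(1, 29)]

if __name__ == "__main__":
    rng = random.Random(0)
    for fraction in (0.1, 0.25, 0.5, 0.75, 1.0):
        print(fraction, sampling_accuracy(edges, fraction, rng))
```

Running the loop over increasing fractions gives one accuracy value per sample size. Repeating it over many random samples and over graphs with different degrees of centralization would be the natural next step toward the kind of comparison the abstract describes.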

Original language: English
Title of host publication: Proceedings of the 48th Annual Hawaii International Conference on System Sciences, HICSS 2015
Editors: Tung X. Bui, Ralph H. Sprague
Publisher: IEEE Computer Society
Pages: 1798-1807
Number of pages: 10
ISBN (Electronic): 9781479973675
State: Published - 26 Mar 2015
Externally published: Yes
Event: 48th Annual Hawaii International Conference on System Sciences, HICSS 2015 - Kauai, United States
Duration: 5 Jan 2015 to 8 Jan 2015

Publication series

Name: Proceedings of the Annual Hawaii International Conference on System Sciences
Volume: 2015-March
ISSN (Print): 1530-1605

Conference

Conference: 48th Annual Hawaii International Conference on System Sciences, HICSS 2015
Country/Territory: United States
City: Kauai
Period: 5/01/15 to 8/01/15

Keywords

  • Digital communication
  • Network analysis
  • Sampling
