Author Archives: bolozna

Polarized and nonpolarized Twitter networks from the 2019 Finnish Parliamentary Elections

This dataset includes 183 Twitter retweet networks collected during the 2019 Finnish Parliamentary Elections. The nodes in the networks represent anonymized Twitter accounts, and directed ties indicate retweet endorsements on specific topics. Each file contains three columns: retweeter, retweeted, and weight. The are citation networks used in the articles:

Chen, T. H. Y., Salloum, A., Gronow, A., Ylä-Anttila, T., & Kivelä, M. (2021). Polarization of climate politics results from partisan sorting: Evidence from Finnish Twittersphere. Global Environmental Change, 71, 102348. https://doi.org/10.1016/j.gloenvcha.2021.102348

Salloum, A., Chen, T. H. Y., & Kivelä, M. (2022). Separating polarization from noise: comparison and normalization of structural polarization measures. Proceedings of the ACM on human-computer interaction, 6(CSCW1), 1-33. https://doi.org/10.1145/3512962

Get the data from here: https://zenodo.org/records/8434372

License: Creative Commons Attribution Non Commercial 4.0 International

Networks of Nokia investors (1998-2002)

A network of correlations among investors in the Nokia security. Nodes are anonymized investors on the Helsinki stock exchange, divided into three groups: households, financial institutions, and non-financial institutions. Edges are pairwise correlations between the time series of daily net volumes of the individual investors, spanning 1998-2002. Data used in the article:

S Ranganathan, M Kivelä, and J Kanniainen, “Dynamics of investor spanning trees around dot-com bubble”, PLoS ONE 13 (6), e0198807 (2018)

Get the data from here: https://doi.org/10.5061/dryad.5b8n621

Licence:  CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

100 years of citation networks

Citation networks between journals with moving time window (10/5 years) starting from 1900 and ending in 2013. Contains metadata on journal classifications. Built from a database of 630 million citations. The are citation networks used in the article:

Darko Hric, Kimmo Kaski, Mikko Kivelä: “Stochastic Block Model Reveals Maps of Citation Patterns and Their Evolution in Time”

Get the data from here: https://zenodo.org/record/1255837

License: Creative Commons Attribution Non Commercial 4.0 International

Finnish last.fm social network

Friendship network of around 8k Finnish users in last.fm. This is the “lastfm” network used in the article:

Toivonen, R., Kovanen, L., Kivelä, M., Onnela, J. P., Saramäki, J., & Kaski, K. (2009). A comparative study of social network models: Network evolution models and nodal attribute models. Social networks, 31(4), 240-254.

Get the data from here: https://zenodo.org/record/3416186

Licence: “Creative Commons Attribution-NonCommercial-ShareAlike 2.0 UK: England & Wales”

Last.fm social network and genders

Friendship network of around 189k lasf.fm users and their genders. This is the “Last.fm” network used in the article:

A. Asikainen, G. Iñiguez,  J. Ureña-Carrión,  K. Kaski, M. Kivelä. Cumulative effects of triadic closure and homophily in social networks. Science Advances, Vol. 6, no. 19, eaax7310 (2020)

Get the data from here: https://zenodo.org/record/3726824

Licence: “Creative Commons Attribution-NonCommercial-ShareAlike 2.0 UK: England & Wales”

Mapping temporal-network percolation to weighted, static event graphs

M Kivelä, J Cambe, J Saramäki, M Karsai

The dynamics of diffusion-like processes on temporal networks are influenced by correlations in the times of contacts. This influence is particularly strong for processes where the spreading agent has a limited lifetime at nodes: disease spreading (recovery time), diffusion of rumors (lifetime of information), and passenger routing (maximum acceptable time between transfers). We introduce weighted event graphs as a powerful and fast framework for studying connectivity determined by time-respecting paths where the allowed waiting times between contacts have an upper limit. We study percolation on the weighted event graphs and in the underlying temporal networks, with simulated and real-world networks. We show that this type of temporal-network percolation is analogous to directed percolation, and that it can be characterized by multiple order parameters.

Read the full text

Stochastic block model reveals maps of citation patterns and their evolution in time

D Hric, K Kaski, M Kivelä

In this study we map out the large-scale structure of citation networks of science journals and follow their evolution in time by using stochastic block models (SBMs). The SBM fitting procedures are principled methods that can be used to find hierarchical grouping of journals that show similar incoming and outgoing citations patterns. These methods work directly on the citation network without the need to construct auxiliary networks based on similarity of nodes. We fit the SBMs to the networks of journals we have constructed from the data set of around 630 million citations and find a variety of different types of groups, such as communities, bridges, sources, and sinks. In addition we use a recent generalization of SBMs to determine how much a manually curated classification of journals into subfields of science is related to the group structure of the journal network and how this relationship changes in time. The SBM method tries to find a network of blocks that is the best high-level representation of the network of journals, and we illustrate how these block networks (at various levels of resolution) can be used as maps of science.

Read the full text

Multilayer Networks Library

example1
Main features

  • Pure Python
  • Can handle general multilayer networks
  • Multilayer networks and multiplex networks (with automatically generated lazy-evaluation coupling edges)
  • Functionality: Analysis, transformations, reading and writing networks, network models etc.
  • Visualization (using Matplotlib or D3)
  • Integration with NetworkX for monoplex network analysis

Get it from Github.

Documentation

Tutorial

Isomorphisms in Multilayer Networks

M. Kivelä, M. A. Porter

We extend the concept of graph isomorphisms to multilayer networks with any number of “aspects” (i.e., types of layering), and we identify multiple types of isomorphisms. For example, in multilayer networks with a single aspect, permuting vertex labels, layer labels, and both vertex labels and layer labels each yield different isomorphism relations between multilayer networks. Multilayer network isomorphisms lead naturally to defining isomorphisms in any of the numerous types of network that can be represented as a multilayer network, and we thereby obtain isomorphisms for multiplex networks, temporal networks, networks with both of these features, and more. We reduce each of the multilayer network isomorphism problems to a graph isomorphism problem such that the size of the graph isomorphism problem grows linearly with the size of the multilayer network isomorphism problem. One can thus use software that has been developed to solve graph isomorphism problems as a practical means for solving multilayer network isomorphism problems. Our theory lays a foundation for extending many network analysis methods such as motifs, graphlets, structural roles, and network alignment to any multilayer network.

Read the full text

Estimating inter-event time distributions from finite observation periods in communication networks

M. Kivelä, M. A. Porter

A diverse variety of processes — including recurrent disease episodes, neuron firing, and communication patterns among humans — can be described using inter-event time (IET) distributions. Many such processes are ongoing, although event sequences are only available during a finite observation window. Because the observation time window is more likely to begin or end during long IETs than during short ones, the analysis of such data is susceptible to a bias induced by the finite observation period. In this paper, we illustrate how this length bias is born and how it can be corrected without assuming any particular shape for the IET distribution. To do this, we model event sequences using stationary renewal processes, and we formulate simple heuristics for determining the severity of the bias. To illustrate our results, we focus on the example of empirical communication networks, which are temporal networks that are constructed from communication events. The IET distributions of such systems guide efforts to build models of human behavior, and the variance of IETs is very important for estimating the spreading rate of information in networks of temporal interactions. We analyze several well-known data sets from the literature, and we find that the resulting bias can lead to systematic underestimates of the variance in the IET distributions and that correcting for the bias can lead to qualitatively different results for the tails of the IET distributions.

Physical Review E 92 052813

Read the full text