. 2024 Jan 19;19(1):e0296088.

doi: 10.1371/journal.pone.0296088. eCollection 2024.

Identifying the perceived local properties of networks reconstructed from biased random walks

Lucas Guerreiro¹, Filipi Nascimento Silva², Diego Raphael Amancio¹

Affiliations

¹ Institute of Mathematics and Computer Science - USP, Avenida Trabalhador São-carlense, São Carlos, SP, Brazil.
² Indiana University Network Science Institute, Bloomington, Indiana, United States of America.

PMID: 38241390
PMCID: PMC10798465
DOI: 10.1371/journal.pone.0296088

Identifying the perceived local properties of networks reconstructed from biased random walks

Lucas Guerreiro et al. PLoS One. 2024.

. 2024 Jan 19;19(1):e0296088.

doi: 10.1371/journal.pone.0296088. eCollection 2024.

Authors

Lucas Guerreiro¹, Filipi Nascimento Silva², Diego Raphael Amancio¹

Affiliations

¹ Institute of Mathematics and Computer Science - USP, Avenida Trabalhador São-carlense, São Carlos, SP, Brazil.
² Indiana University Network Science Institute, Bloomington, Indiana, United States of America.

PMID: 38241390
PMCID: PMC10798465
DOI: 10.1371/journal.pone.0296088

Abstract

Many real-world systems give rise to a time series of symbols. The elements in a sequence can be generated by agents walking over a networked space so that whenever a node is visited the corresponding symbol is generated. In many situations the underlying network is hidden, and one aims to recover its original structure and/or properties. For example, when analyzing texts, the underlying network structure generating a particular sequence of words is not available. In this paper, we analyze whether one can recover the underlying local properties of networks generating sequences of symbols for different combinations of random walks and network topologies. We found that the reconstruction performance is influenced by the bias of the agent dynamics. When the walker is biased toward high-degree neighbors, the best performance was obtained for most of the network models and properties. Surprisingly, this same effect is not observed for the clustering coefficient and eccentric, even when large sequences are considered. We also found that the true self-avoiding displayed similar performance as the one preferring highly-connected nodes, with the advantage of yielding competitive performance to recover the clustering coefficient. Our results may have implications for the construction and interpretation of networks generated from sequences.

Copyright: © 2024 Guerreiro et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. Example of a subgraph (highlighted in red) representing the limited information observed by a random walk starting at node A.**

**Fig 2. Schematics of the methodology.**
First, we look into the original network and annotate the concerned properties for each node. In the next step, we iterate over the network with the desired dynamics in order to generate a sequence of symbols. In the third step, we reconstruct the network using the discovered nodes and edges, and we annotate the node’s properties in this reconstructed network. Finally, we build correlations among the reconstructed and original nodes by comparing each node i in the reconstructed network to its respective node in the original network.

**Fig 3. Scatter-plot of node degree obtained in original vs. reconstructed LFR networks (with mixing parameter m = 0.05).**
Each subpanel corresponds to a different configuration of agent dynamics and walks length (w). The walk length is distributed between 100 and 50,000 steps.

**Fig 4. Evolution of the efficacy in recovering network metrics in reconstructed networks.**
The x-axis represents the walk length used to generate the sequence, and the y-axis is the Pearson correlation for local metrics obtained in the original vs. reconstructed networks. The last columns are the NMI and ARI, which are used to compare the partitions in networks with community structure. While the coreness is not defined in BA and LFR networks, the NMI and ARI are computed only for networks with community structure.

**Fig 5. Pearson correlation for the selected real networks.**
Each data point represents the correlation of the same nodes for the original and reconstructed network from sequences acquired by dynamics for a given walk length, w (100 ≤ w ≤ 50000). The last columns represent the average NMI and ARI of each model obtained with the Leiden algorithm.

**Fig 6. Efficiency in recovering network metrics in for the network models.**
Each data point represents the correlation of local network metrics between the original and reconstructed network from sequences acquired by dynamics for a given walk length, where the x-axis represents the knowledge acquired for each walk length (w).

**Fig 7. Pearson correlation for the selected real networks.**
Each data point represents the correlation of the same nodes for the original and reconstructed network from sequences acquired by dynamics for a given walk length, where the x-axis represents the knowledge acquired for each walk length, w, where 100 ≤ w ≤ 50000.

See this image and copyright information in PMC

Cited by

Comparing random walks in graph embedding and link prediction.
Vital A Jr, Silva FN, Amancio DR. Vital A Jr, et al. PLoS One. 2024 Nov 6;19(11):e0312863. doi: 10.1371/journal.pone.0312863. eCollection 2024. PLoS One. 2024. PMID: 39504339 Free PMC article.

References

1. Sizemore AE, Karuza EA, Giusti C, Bassett DS. Knowledge gaps in the early growth of semantic feature networks. Nature human behaviour. 2018;2(9):682–692. doi: 10.1038/s41562-018-0422-4 - DOI - PMC - PubMed
1. Arruda HF, Silva FN, Costa LF, Amancio DR. Knowledge acquisition: A Complex networks approach. Information Sciences. 2017;421:154–166. doi: 10.1016/j.ins.2017.08.091 - DOI
1. Cantwell GT, Kirkley A, Newman M. The friendship paradox in real and model networks. Journal of Complex Networks. 2021;9(2):cnab011. doi: 10.1093/comnet/cnab011 - DOI
1. Rzhetsky A, Foster JG, Foster IT, Evans JA. Choosing experiments to accelerate collective discovery. Proceedings of the National Academy of Sciences. 2015;112(47):14569–14574. doi: 10.1073/pnas.1509757112 - DOI - PMC - PubMed
1. Costa LdF, Oliveira ON Jr, Travieso G, Rodrigues FA, Villas Boas PR, Antiqueira L, et al. Analyzing and modeling real-world phenomena with complex networks: a survey of applications. Advances in Physics. 2011;60(3):329–412. doi: 10.1080/00018732.2011.572452 - DOI

MeSH terms

Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Identifying the perceived local properties of networks reconstructed from biased random walks

Affiliations

Identifying the perceived local properties of networks reconstructed from biased random walks

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

LinkOut - more resources

Full Text Sources

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

Related information

LinkOut - more resources

Full Text Sources