Statistical clustering of temporal networks through a dynamic stochastic block model
Summary
Statistical node clustering in discrete time dynamic networks is an emerging field that raises many challenges. Here, we explore statistical properties and frequentist inference in a model that combines a stochastic block model for its static part with independent Markov chains for the evolution of the nodes groups through time. We model binary data as well as weighted dynamic random graphs (with discrete or continuous edges values). Our approach, motivated by the importance of controlling for label switching issues across the different time steps, focuses on detecting groups characterized by a stable within‐group connectivity behaviour. We study identifiability of the model parameters and propose an inference procedure based on a variational expectation–maximization algorithm as well as a model selection criterion to select the number of groups. We carefully discuss our initialization strategy which plays an important role in the method and we compare our procedure with existing procedures on synthetic data sets. We also illustrate our approach on dynamic contact networks: one of encounters between high school students and two others on animal interactions. An implementation of the method is available as an R package called dynsbm.
Citing Literature
Number of times cited according to CrossRef: 59
- Vincent Miele, Rodrigo Ramos‐Jiliberto, Diego P. Vázquez, Core–periphery dynamics in a plant–pollinator network, Journal of Animal Ecology, 10.1111/1365-2656.13217, 89, 7, (1670-1677), (2020).
- Neelu Chaudhary, Hardeo Kumar Thakur, Parametric Classification of Dynamic Community Detection Techniques, Micro-Electronics and Telecommunication Engineering, 10.1007/978-981-15-2329-8_34, (333-340), (2020).
- Sharmodeep Bhattacharyya, Shirshendu Chatterjee, Consistent Recovery of Communities from Sparse Multi-relational Networks: A Scalable Algorithm with Optimal Recovery Conditions, Complex Networks XI, 10.1007/978-3-030-40943-2_9, (92-103), (2020).
- Samia Mohand Arab, Noria Taghezout, Fatima Zohra Benkaddour, A novel approach for choosing dynamic community detection algorithms using PROMETHEE II, Multiagent and Grid Systems, 10.3233/MGS-190315, 15, 4, (327-341), (2020).
- Dragana M. Pavlović, Bryan R.L. Guillaume, Soroosh Afyouni, Thomas E. Nichols, Multi‐subject stochastic blockmodels with mixed effects for adaptive analysis of individual differences in human brain network cluster structure, Statistica Neerlandica, 10.1111/stan.12219, 74, 3, (363-396), (2020).
- Joshua Daniel Loyal, Yuguo Chen, Statistical Network Analysis: A Review with Applications to the Coronavirus Disease 2019 Pandemic, International Statistical Review, 10.1111/insr.12398, 88, 2, (419-440), (2020).
- Matthew Ludkin, Inference for a generalised stochastic block model with unknown number of blocks and non-conjugate edge models, Computational Statistics & Data Analysis, 10.1016/j.csda.2020.107051, (107051), (2020).
- Giulio Rossetti, ANGEL: efficient, and effective, node-centric community discovery in static and dynamic networks, Applied Network Science, 10.1007/s41109-020-00270-6, 5, 1, (2020).
- Dragana M. Pavlović, Bryan R.L. Guillaume, Emma K. Towlson, Nicole M.Y. Kuek, Soroosh Afyouni, Petra E. Vértes, B.T. Thomas Yeo, Edward T. Bullmore, Thomas E. Nichols, Multi-subject Stochastic Blockmodels for adaptive analysis of individual differences in human brain network cluster structure, NeuroImage, 10.1016/j.neuroimage.2020.116611, (116611), (2020).
- Francesco Bartolucci, Silvia Pandolfi, An exact algorithm for time-dependent variational inference for the dynamic stochastic block model, Pattern Recognition Letters, 10.1016/j.patrec.2020.07.014, (2020).
- Jun Liu, Jiangzhou Wang, Binghui Liu, Community Detection of Multi-Layer Attributed Networks via Penalized Alternating Factorization, Mathematics, 10.3390/math8020239, 8, 2, (239), (2020).
- Jose Cadena, Ana Paula Sales, Doris Lam, Heather A. Enright, Elizabeth K. Wheeler, Nicholas O. Fischer, Modeling the temporal network dynamics of neuronal cultures, PLOS Computational Biology, 10.1371/journal.pcbi.1007834, 16, 5, (e1007834), (2020).
- Abir El Haj, Yousri Slaoui, Pierre-Yves Louis, Zaher Khraibani, Estimation in a binomial stochastic blockmodel for a weighted graph by a variational expectation maximization algorithm, Communications in Statistics - Simulation and Computation, 10.1080/03610918.2020.1743858, (1-20), (2020).
- Mirai Igarashi, Nobuhiko Terui, Characterization of topic-based online communities by combining network data and user generated content, Statistics and Computing, 10.1007/s11222-020-09947-5, (2020).
- Riccardo Rastelli, Michael Fop, A stochastic block model for interaction lengths, Advances in Data Analysis and Classification, 10.1007/s11634-020-00403-w, (2020).
- Tianxi Li, Lihua Lei, Sharmodeep Bhattacharyya, Koen Van den Berge, Purnamrita Sarkar, Peter J. Bicke, Elizaveta Levina, Hierarchical community detection by recursive partitioning, Journal of the American Statistical Association, 10.1080/01621459.2020.1833888, (1-39), (2020).
- Louise M. J. O’Connor, Laura J. Pollock, João Braga, Gentile Francesco Ficetola, Luigi Maiorano, Camille Martinez‐Almoyna, Alessandro Montemaggiori, Marc Ohlmann, Wilfried Thuiller, Unveiling the food webs of tetrapods across Europe through the prism of the Eltonian niche, Journal of Biogeography, 10.1111/jbi.13773, 47, 1, (181-192), (2019).
- Brenda Betancourt, Abel Rodríguez, Naomi Boyd, Modelling and prediction of financial trading networks: an application to the New York Mercantile Exchange natural gas futures market, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/rssc.12387, 69, 1, (195-218), (2019).
- Kevin H. Lee, Lingzhou Xue, David R. Hunter, Model-based clustering of time-evolving networks through temporal exponential-family random graph models, Journal of Multivariate Analysis, 10.1016/j.jmva.2019.104540, (104540), (2019).
- P. Mazzarisi, P. Barucca, F. Lillo, D. Tantari, A dynamic network model with persistent links and node-specific latent variables, with an application to the interbank market, European Journal of Operational Research, 10.1016/j.ejor.2019.07.024, (2019).
- S. Balqis Samdin, Chee-Ming Ting, Hernando Ombao, undefined, 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), 10.1109/ISBI.2019.8759405, (1483-1487), (2019).
- Ann-Kristin Becker, Hajo Holzmann, Nonparametric Identification in the Dynamic Stochastic Block Model, IEEE Transactions on Information Theory, 10.1109/TIT.2019.2893947, 65, 7, (4335-4344), (2019).
- Thomas Magelinski, Kathleen M. Carley, Community-based time segmentation from network snapshots, Applied Network Science, 10.1007/s41109-019-0136-1, 4, 1, (2019).
- Qiangjuan Huang, Chengli Zhao, Xue Zhang, Dongyun Yi, Community discovering in temporal network with spectral fusion, Chaos: An Interdisciplinary Journal of Nonlinear Science, 10.1063/1.5086769, 29, 4, (043122), (2019).
- Lin Zhang, Alexander Gorovits, Petko Bogdanov, undefined, 2019 IEEE International Conference on Data Mining (ICDM), 10.1109/ICDM.2019.00092, (816-825), (2019).
- Clement Lee, Darren J. Wilkinson, A review of stochastic block models and extensions for graph clustering, Applied Network Science, 10.1007/s41109-019-0232-2, 4, 1, (2019).
- Marjan Cugmas, Anuška Ferligoj, Luka Kronegger, Scientific Co‐Authorship Networks, Advances in Network Clustering and Blockmodeling, 10.1002/9781119483298, (363-387), (2019).
- Jack Leitch, Kathleen A. Alexander, Srijan Sengupta, Toward epidemic thresholds on temporal networks: a review and open questions, Applied Network Science, 10.1007/s41109-019-0230-4, 4, 1, (2019).
- Remy Cazabet, Giulio Rossetti, Challenges in Community Discovery on Temporal Networks, Temporal Network Theory, 10.1007/978-3-030-23495-9_10, (181-197), (2019).
- Sune Lehmann, Fundamental Structures in Temporal Communication Networks, Temporal Network Theory, 10.1007/978-3-030-23495-9_2, (25-48), (2019).
- Mario Molina, Filiz Garip, Machine Learning for Sociology, Annual Review of Sociology, 10.1146/annurev-soc-073117-041106, 45, 1, (27-45), (2019).
- Charles Bouveyron, Gilles Celeux, T. Brendan Murphy, Adrian E. Raftery, , Model-Based Clustering and Classification for Data Science, 10.1017/9781108644181, (2019).
- Jing Lei, Kehui Chen, Brian Lynch, Consistent community detection in multi-layer network data, Biometrika, 10.1093/biomet/asz068, (2019).
- Vincent Miele, Catherine Matias, Stéphane Robin, Stéphane Dray, Nine quick tips for analyzing network data, PLOS Computational Biology, 10.1371/journal.pcbi.1007434, 15, 12, (e1007434), (2019).
- Mirko Signorelli, Ernst C. Wit, Model-based clustering for populations of networks, Statistical Modelling, 10.1177/1471082X19871128, (1471082X1987112), (2019).
- Shi Gu, Cedric Huchuan Xia, Rastko Ciric, Tyler M Moore, Ruben C Gur, Raquel E Gur, Theodore D Satterthwaite, Danielle S Bassett, Unifying the Notions of Modularity and Core–Periphery Structure in Functional Brain Networks during Youth, Cerebral Cortex, 10.1093/cercor/bhz150, (2019).
- Thorben Funke, Till Becker, Stochastic block models: A comparison of variants and inference methods, PLOS ONE, 10.1371/journal.pone.0215296, 14, 4, (e0215296), (2019).
- Nathan B. Wikle, Ephraim M. Hanks, David P. Hughes, A Dynamic Individual-Based Model for High-Resolution Ant Interactions, Journal of Agricultural, Biological and Environmental Statistics, 10.1007/s13253-019-00363-5, (2019).
- Amal Agarwal, Lingzhou Xue, Model-Based Clustering of Nonparametric Weighted Networks With Application to Water Pollution Analysis, Technometrics, 10.1080/00401706.2019.1623076, (1-21), (2019).
- Daizaburo Shizuka, Allison E Johnson, How demographic processes shape animal social networks, Behavioral Ecology, 10.1093/beheco/arz083, (2019).
- Jingfei Zhang, Will Wei Sun, Lexin Li, Mixed-Effect Time-Varying Network Model and Application in Brain Connectivity Analysis, Journal of the American Statistical Association, 10.1080/01621459.2019.1677242, (1-15), (2019).
- Timothée Tabouy, Pierre Barbillon, Julien Chiquet, Variational Inference for Stochastic Block Models From Sampled Data, Journal of the American Statistical Association, 10.1080/01621459.2018.1562934, (1-23), (2019).
- Marco Corneli, Charles Bouveyron, Pierre Latouche, Fabrice Rossi, The dynamic stochastic topic block model for dynamic networks with textual edges, Statistics and Computing, 10.1007/s11222-018-9832-4, 29, 4, (677-695), (2018).
- Matthieu Latapy, Tiphaine Viard, Clémence Magnien, Stream graphs and link streams for the modeling of interactions over time, Social Network Analysis and Mining, 10.1007/s13278-018-0537-7, 8, 1, (2018).
- Pengfei Jiao, Wenjun Wang, Di Jin, Constrained common cluster based model for community detection in temporal and multiplex networks, Neurocomputing, 10.1016/j.neucom.2017.09.013, 275, (768-780), (2018).
- Ulf Aslak, Martin Rosvall, Sune Lehmann, Constrained information flows in temporal networks reveal intermittent communities, Physical Review E, 10.1103/PhysRevE.97.062312, 97, 6, (2018).
- Francesco Bartolucci, Maria Francesca Marino, Silvia Pandolfi, Dealing with reciprocity in dynamic stochastic block models, Computational Statistics & Data Analysis, 10.1016/j.csda.2018.01.010, 123, (86-100), (2018).
- Zhiwei Yang, Weigang Wu, Yishun Chen, Xiaola Lin, Jiannong Cao, ( Q, S )- distance model and counting algorithms in dynamic distributed systems , International Journal of Distributed Sensor Networks, 10.1177/1550147718756872, 14, 1, (155014771875687), (2018).
- Roseric Azondekon, Zachary James Harper, Fiacre Rodrigue Agossa, Charles Michael Welzig, Susan McRoy, Scientific authorship and collaboration network analysis on malaria research in Benin: papers indexed in the web of science (1996–2016), Global Health Research and Policy, 10.1186/s41256-018-0067-x, 3, 1, (2018).
- Fuchen Liu, David Choi, Lu Xie, Kathryn Roeder, Global spectral clustering in dynamic networks, Proceedings of the National Academy of Sciences, 10.1073/pnas.1718449115, 115, 5, (927-932), (2018).
- Giulio Rossetti, Rémy Cazabet, Community Discovery in Dynamic Networks, ACM Computing Surveys, 10.1145/3172867, 51, 2, (1-37), (2018).
- RICCARDO RASTELLI, PIERRE LATOUCHE, NIAL FRIEL, Choosing the number of groups in a latent stochastic blockmodel for dynamic networks, Network Science, 10.1017/nws.2018.19, (1-25), (2018).
- C Matias, T Rebafka, F Villers, A semiparametric extension of the stochastic block model for longitudinal networks, Biometrika, 10.1093/biomet/asy016, (2018).
- Vincent Miele, Catherine Matias, Revealing the hidden structure of dynamic ecological networks, Royal Society Open Science, 10.1098/rsos.170251, 4, 6, (170251), (2017).
- Marco Corneli, Pierre Latouche, Fabrice Rossi, Multiple change points detection and clustering in dynamic networks, Statistics and Computing, 10.1007/s11222-017-9775-1, (2017).
- Matthew Ludkin, Idris Eckley, Peter Neal, Dynamic stochastic block models: parameter estimation and detection of changes in community structure, Statistics and Computing, 10.1007/s11222-017-9788-9, (2017).
- C. Bouveyron, P. Latouche, R. Zreik, The stochastic topic block model for the clustering of vertices in networks with textual edges, Statistics and Computing, 10.1007/s11222-016-9713-7, 28, 1, (11-31), (2016).
- Wenjun Wang, Pengfei Jiao, Dongxiao He, Di Jin, Lin Pan, Bogdan Gabrys, Autonomous overlapping community detection in temporal networks: A dynamic Bayesian nonnegative matrix factorization approach, Knowledge-Based Systems, 10.1016/j.knosys.2016.07.021, 110, (121-134), (2016).
- Lata Kodali, Srijan Sengupta, Leanna House, William H. Woodall, The value of summary statistics for anomaly detection in temporally evolving networks: A performance evaluation study, Applied Stochastic Models in Business and Industry, 10.1002/asmb.2548, 0, 0, (undefined).




