Ascent‐based Monte Carlo expectation–maximization
Abstract
Summary. The expectation–maximization (EM) algorithm is a popular tool for maximizing likelihood functions in the presence of missing data. Unfortunately, EM often requires the evaluation of analytically intractable and high dimensional integrals. The Monte Carlo EM (MCEM) algorithm is the natural extension of EM that employs Monte Carlo methods to estimate the relevant integrals. Typically, a very large Monte Carlo sample size is required to estimate these integrals within an acceptable tolerance when the algorithm is near convergence. Even if this sample size were known at the onset of implementation of MCEM, its use throughout all iterations is wasteful, especially when accurate starting values are not available. We propose a data‐driven strategy for controlling Monte Carlo resources in MCEM. The algorithm proposed improves on similar existing methods by recovering EM's ascent (i.e. likelihood increasing) property with high probability, being more robust to the effect of user‐defined inputs and handling classical Monte Carlo and Markov chain Monte Carlo methods within a common framework. Because of the first of these properties we refer to the algorithm as ‘ascent‐based MCEM’. We apply ascent‐based MCEM to a variety of examples, including one where it is used to accelerate the convergence of deterministic EM dramatically.
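The ascent idea described in the summary can be illustrated with a minimal sketch. It is a simplified illustration under stated assumptions, not the authors' exact procedure. The toy model (latent u_i ~ N(θ, 1), observed y_i | u_i ~ N(u_i, 1)), the function name `ascent_mcem`, the 50% sample-growth rule, the cap `m_max` and the simple independent-draw standard error are all hypothetical choices made for illustration. A candidate update is accepted only when a lower confidence bound on the estimated change in the Q-function is positive; otherwise the Monte Carlo sample is enlarged and the update is recomputed.

```python
import numpy as np

def ascent_mcem(y, theta0=0.0, m0=20, z=1.645, n_iter=25, m_max=5000, seed=0):
    """Toy ascent-style MCEM for u_i ~ N(theta, 1), y_i | u_i ~ N(u_i, 1)."""
    rng = np.random.default_rng(seed)
    theta, m = theta0, m0
    for _ in range(n_iter):
        # Monte Carlo E-step: u_i | y_i, theta ~ N((y_i + theta) / 2, 1/2)
        post_mean = (y + theta) / 2.0
        u = rng.normal(post_mean, np.sqrt(0.5), size=(m, y.size))
        while True:
            # M-step: the maximizer of the Monte Carlo Q-function is the grand mean
            cand = u.mean()
            # Per-draw change in complete-data log-likelihood, candidate vs. current
            d = -0.5 * (((u - cand) ** 2).sum(axis=1) - ((u - theta) ** 2).sum(axis=1))
            # Lower confidence bound on the change in Q (simplified standard error)
            lower = d.mean() - z * d.std(ddof=1) / np.sqrt(d.size)
            # Accept if ascent is likely; the cap m_max is an assumed safeguard
            # that forces acceptance once the sample is large enough
            if lower > 0 or d.size >= m_max:
                break
            # Otherwise enlarge the sample (assumed growth rule: add 50% more draws)
            extra = rng.normal(post_mean, np.sqrt(0.5), size=(max(1, d.size // 2), y.size))
            u = np.vstack([u, extra])
        theta, m = cand, u.shape[0]
    return theta, m

# Usage: simulated data; for this toy model the exact MLE is the sample mean of y
data_rng = np.random.default_rng(1)
y = data_rng.normal(2.0, np.sqrt(2.0), size=50)
theta_hat, m_final = ascent_mcem(y)
print(theta_hat, y.mean(), m_final)
```

In this sketch the sample size stays small while the iterates are far from the maximizer, where the change in Q is large relative to its Monte Carlo error, and grows only near convergence. This is the resource-allocation behaviour the summary motivates.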