Volume 67, Issue 2

Ascent‐based Monte Carlo expectation– maximization

Brian S. Caffo

Johns Hopkins University, Baltimore, USA

Search for more papers by this author
Wolfgang Jank

University of Maryland, College Park, USA

Search for more papers by this author
Galin L. Jones

University of Minnesota, Minneapolis, USA

Search for more papers by this author
First published: 09 March 2005
Citations: 51
Brian S. Caffo, Department of Biostatistics, Johns Hopkins University, 615 Wolfe Street, Baltimore, MD 21205, USA.
E‐mail: bcaffo@jhsph.edu

Abstract

Summary. The expectation–maximization (EM) algorithm is a popular tool for maximizing likelihood functions in the presence of missing data. Unfortunately, EM often requires the evaluation of analytically intractable and high dimensional integrals. The Monte Carlo EM (MCEM) algorithm is the natural extension of EM that employs Monte Carlo methods to estimate the relevant integrals. Typically, a very large Monte Carlo sample size is required to estimate these integrals within an acceptable tolerance when the algorithm is near convergence. Even if this sample size were known at the onset of implementation of MCEM, its use throughout all iterations is wasteful, especially when accurate starting values are not available. We propose a data‐driven strategy for controlling Monte Carlo resources in MCEM. The algorithm proposed improves on similar existing methods by recovering EM's ascent (i.e. likelihood increasing) property with high probability, being more robust to the effect of user‐defined inputs and handling classical Monte Carlo and Markov chain Monte Carlo methods within a common framework. Because of the first of these properties we refer to the algorithm as ‘ascent‐based MCEM’. We apply ascent‐based MCEM to a variety of examples, including one where it is used to accelerate the convergence of deterministic EM dramatically.

Number of times cited according to CrossRef: 51

  • Analyzing Markov chain Monte Carlo output, WIREs Computational Statistics , 10.1002/wics.1501, 12, 4, (2020).
  • An Analysis by Synthesis Method that Allows Accurate Spatial Modeling of Thickness of Cortical Bone from Clinical QCT, Medical Image Computing and Computer Assisted Intervention – MICCAI 2020, 10.1007/978-3-030-59725-2_62, (641-651), (2020).
  • Clumped-MCEM: Inference for multistep transcriptional processes, Computational Biology and Chemistry, 10.1016/j.compbiolchem.2019.107092, (2019).
  • A pathway for multivariate analysis of ecological communities using copulas, Ecology and Evolution, 10.1002/ece3.4948, 9, 6, (3276-3294), (2019).
  • Gaussian process regression for survival time prediction with genome-wide gene expression, Biostatistics, 10.1093/biostatistics/kxz023, (2019).
  • Modeling Heterogeneous Farm Responses to European Union Biofuel Support with a Random Parameter Multicrop Model, American Journal of Agricultural Economics, 10.1093/ajae/aax091, 100, 2, (434-455), (2018).
  • Inferring Transcriptional Dynamics with Time-Dependent Reaction Rates Using Stochastic Simulation, Recent Findings in Intelligent Computing Techniques, 10.1007/978-981-10-8636-6_58, (549-556), (2018).
  • Semiparametric regression analysis of clustered survival data with semi-competing risks, Computational Statistics & Data Analysis, 10.1016/j.csda.2018.02.003, 124, (53-70), (2018).
  • Transcriptional processes: Models and inference, Journal of Bioinformatics and Computational Biology, 10.1142/S0219720018500233, 16, 05, (1850023), (2018).
  • Partial Device Fingerprints, Machine Learning and Knowledge Discovery in Databases, 10.1007/978-3-319-71246-8_14, (222-237), (2017).
  • Learning Bayesian network parameters under equivalence constraints, Artificial Intelligence, 10.1016/j.artint.2015.05.007, 244, (239-257), (2017).
  • EM-Type algorithms for heavy-tailed logistic mixed models, Journal of Statistical Computation and Simulation, 10.1080/00949655.2017.1350678, 87, 15, (2940-2961), (2017).
  • Variable Selection in a Log–Linear Birnbaum–Saunders Regression Model for High-Dimensional Survival Data via the Elastic-Net and Stochastic EM, Technometrics, 10.1080/00401706.2015.1133457, 58, 3, (383-392), (2016).
  • A Practical Sequential Stopping Rule for High-Dimensional Markov Chain Monte Carlo, Journal of Computational and Graphical Statistics, 10.1080/10618600.2015.1044092, 25, 3, (684-700), (2016).
  • A Unified Maximum Likelihood Approach for Analyzing Structural Equation Models With Missing Nonstandard Data, Sociological Methods & Research, 10.1177/0049124106292357, 35, 3, (352-381), (2016).
  • A non linear mixed effects model of plant growth and estimation via stochastic variants of the EM algorithm, Communications in Statistics - Theory and Methods, 10.1080/03610926.2014.930909, 45, 6, (1643-1669), (2015).
  • Inference of the Transition Matrix in Convolved Hidden Markov Models and the Generalized Baum–Welch Algorithm, IEEE Transactions on Geoscience and Remote Sensing, 10.1109/TGRS.2015.2440415, 53, 12, (6443-6456), (2015).
  • An extended hazard model with longitudinal covariates, Biometrika, 10.1093/biomet/asu058, 102, 1, (135-150), (2015).
  • Estimating a graphical intra-class correlation coefficient (GICC) using multivariate probit-linear mixed models, Computational Statistics & Data Analysis, 10.1016/j.csda.2015.02.012, 89, (126-133), (2015).
  • Improved Estimation and Uncertainty Quantification Using Monte Carlo-Based Optimization Algorithms, Journal of Computational and Graphical Statistics, 10.1080/10618600.2014.927361, 24, 3, (771-791), (2015).
  • Measurement and verification of building systems under uncertain data: A Gaussian process modeling approach, Energy and Buildings, 10.1016/j.enbuild.2014.01.048, 75, (189-198), (2014).
  • Parameter estimation via stochastic variants of the ECM algorithm with applications to plant growth modeling, Computational Statistics & Data Analysis, 10.1016/j.csda.2014.04.004, 78, (82-99), (2014).
  • Maximum likelihood estimation of the Markov-switching GARCH model, Computational Statistics & Data Analysis, 10.1016/j.csda.2013.01.026, 76, (61-75), (2014).
  • undefined, 2014 48th Annual Conference on Information Sciences and Systems (CISS), 10.1109/CISS.2014.6814110, (1-6), (2014).
  • A Sequential Monte Carlo Approach for MLE in a Plant Growth Model, Journal of Agricultural, Biological, and Environmental Statistics, 10.1007/s13253-013-0134-1, 18, 2, (250-270), (2013).
  • Frailty modeling via the empirical Bayes–Hastings sampler, Computational Statistics & Data Analysis, 10.1016/j.csda.2011.09.004, 56, 6, (1303-1318), (2012).
  • Accelerated maximum likelihood parameter estimation for stochastic biochemical systems, BMC Bioinformatics, 10.1186/1471-2105-13-68, 13, 1, (68), (2012).
  • Frequentist analysis of hierarchical models for population dynamics and demographic data, Journal of Ornithology, 10.1007/s10336-010-0642-5, 152, S2, (393-408), (2011).
  • Approximate repeated-measures shrinkage, Computational Statistics & Data Analysis, 10.1016/j.csda.2010.09.014, 55, 2, (1150-1159), (2011).
  • Conditional Logistic Regression With Longitudinal Follow-up and Individual-Level Random Coefficients: A Stable and Efficient Two-Step Estimation Method, Journal of Computational and Graphical Statistics, 10.1198/jcgs.2011.09189, 20, 3, (767-784), (2011).
  • Quantifying Simulator Discrepancy in Discrete-Time Dynamical Simulators, Journal of Agricultural, Biological, and Environmental Statistics, 10.1007/s13253-011-0077-3, 16, 4, (554-570), (2011).
  • Fisher scoring: An interpolation family and its Monte Carlo implementations, Computational Statistics & Data Analysis, 10.1016/j.csda.2010.02.006, 54, 7, (1744-1755), (2010).
  • Asymptotic properties of a stochastic EM algorithm for mixtures with censored data, Journal of Statistical Planning and Inference, 10.1016/j.jspi.2009.06.014, 140, 1, (111-127), (2010).
  • A self-consistency approach to multinomial logit model with random effects, Journal of Statistical Planning and Inference, 10.1016/j.jspi.2010.01.034, 140, 7, (1939-1947), (2010).
  • An EM and a Stochastic Version of the EM Algorithm for Nonparametric Hidden Semi-Markov Models, Communications in Statistics - Simulation and Computation, 10.1080/03610910903411185, 39, 2, (240-261), (2009).
  • Robustness for general design mixed models using the t -distribution , Statistical Modelling: An International Journal, 10.1177/1471082X0800900304, 9, 3, (235-255), (2009).
  • Estimation in the probit normal model for binary outcomes using the SAEM algorithm, Computational Statistics & Data Analysis, 10.1016/j.csda.2008.11.024, 53, 4, (1350-1360), (2009).
  • New Global Optimization Algorithms for Model-Based Clustering, SSRN Electronic Journal, 10.2139/ssrn.1434390, (2009).
  • New global optimization algorithms for model-based clustering, Computational Statistics & Data Analysis, 10.1016/j.csda.2009.07.007, 53, 12, (3999-4017), (2009).
  • A statistical model for the temporal pattern of individual automated teller machine withdrawals, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/j.1467-9876.2007.00599.x, 57, 1, (43-59), (2008).
  • On casting random‐effects models in a survival framework, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/j.1467-9868.2007.00652.x, 70, 3, (629-642), (2008).
  • Gaussian Process Dynamical Models for Human Motion, IEEE Transactions on Pattern Analysis and Machine Intelligence, 10.1109/TPAMI.2007.1167, 30, 2, (283-298), (2008).
  • A Markov chain Monte Carlo Expectation Maximization Algorithm for Statistical Analysis of DNA Sequence Evolution with Neighbor-Dependent Substitution Rates, Journal of Computational and Graphical Statistics, 10.1198/106186008X289010, 17, 1, (138-162), (2008).
  • Network Inference From Co-Occurrences, IEEE Transactions on Information Theory, 10.1109/TIT.2008.926315, 54, 9, (4053-4068), (2008).
  • Quasi-Monte Carlo for Highly Structured Generalised Response Models, Methodology and Computing in Applied Probability, 10.1007/s11009-007-9045-3, 10, 2, (239-275), (2007).
  • Exploring Fast Computational Strategies for Probabilistic Phylogenetic Analysis, Systematic Biology, 10.1080/10635150701611258, 56, 5, (711-726), (2007).
  • Generalized linear mixed models with informative dropouts and missing covariates, Metrika, 10.1007/s00184-006-0083-6, 66, 1, (1-18), (2006).
  • The EM Algorithm, Its Randomized Implementation and Global Optimization: Some Challenges and Opportunities for Operations Research, Perspectives in Operations Research, 10.1007/978-0-387-39934-8, (367-392), (2006).
  • Ascent EM for fast and global solutions to finite mixtures: An application to curve-clustering of online auctions, Computational Statistics & Data Analysis, 10.1016/j.csda.2006.03.013, 51, 2, (747-761), (2006).
  • Implementing and Diagnosing the Stochastic Approximation EM Algorithm, Journal of Computational and Graphical Statistics, 10.1198/106186006X157469, 15, 4, (803-829), (2006).
  • Understanding Geographical Markets of Online Firms Using Spatial Models of Customer Choice, Marketing Science, 10.1287/mksc.1050.0145, 24, 4, (623-634), (2005).