Volume 79, Issue 2
Original Article

Probabilistic multi‐resolution scanning for two‐sample differences

Jacopo Soriano

Corresponding Author

E-mail address: jacopo.soriano@duke.edu

Duke University, Durham, USA

Address for correspondence: Jacopo Soriano, Department of Statistical Science, Box 90251, Duke University, Durham, NC 27768‐0251, USA. E‐mail: jacopo.soriano@duke.eduSearch for more papers by this author
Li Ma

Duke University, Durham, USA

Search for more papers by this author
First published: 27 April 2016
Citations: 5

Summary

We propose a multi‐resolution scanning approach to identifying two‐sample differences. Windows of multiple scales are constructed through nested dyadic partitioning on the sample space and a hypothesis regarding the two‐sample difference is defined on each window. Instead of testing the hypotheses on different windows independently, we adopt a joint graphical model, namely a Markov tree, on the null or alternative states of these hypotheses to incorporate spatial correlation across windows. The induced dependence allows borrowing strength across nearby and nested windows, which we show is critical for detecting high resolution local differences. We evaluate the performance of the method through simulation and show that it substantially outperforms other state of the art two‐sample tests when the two‐sample difference is local, involving only a small subset of the data. We then apply it to a flow cytometry data set from immunology, in which it successfully identifies highly local differences. In addition, we show how to control properly for multiple testing in a decision theoretic approach as well as how to summarize and report the inferred two‐sample difference. We also construct hierarchical extensions of the framework to incorporate adaptivity into the construction of the scanning windows to improve inference further.

Number of times cited according to CrossRef: 5

  • A Bayesian nonparametric testing procedure for paired samples, Biometrics, 10.1111/biom.13234, 0, 0, (2020).
  • A Bayesian hierarchical model for related densities by using Pólya trees, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/rssb.12346, 82, 1, (127-153), (2019).
  • Bayesian Graphical Compositional Regression for Microbiome Data, Journal of the American Statistical Association, 10.1080/01621459.2019.1647212, (1-15), (2019).
  • Analysis of Distributional Variation Through Graphical Multi-Scale Beta-Binomial Models, Journal of Computational and Graphical Statistics, 10.1080/10618600.2017.1402774, 27, 3, (529-541), (2018).
  • Efficient Functional ANOVA Through Wavelet-Domain Markov Groves, Journal of the American Statistical Association, 10.1080/01621459.2017.1286241, 113, 522, (802-818), (2018).