Analysis of longitudinal data with irregular, outcome‐dependent follow‐up
Abstract
Summary. A frequent problem in longitudinal studies is that subjects may miss scheduled visits or be assessed at self‐selected points in time. As a result, observed outcome data may be highly unbalanced and the availability of the data may be directly related to the outcome measure and/or some auxiliary factors that are associated with the outcome. If the follow‐up visit and outcome processes are correlated, then marginal regression analyses will produce biased estimates. Building on the work of Robins, Rotnitzky and Zhao, we propose a class of inverse intensity‐of‐visit process‐weighted estimators in marginal regression models for longitudinal responses that may be observed in continuous time. This allows us to handle arbitrary patterns of missing data as embedded in a subject's visit process. We derive the large sample distribution for our inverse visit‐intensity‐weighted estimators and investigate their finite sample behaviour by simulation. Our approach is illustrated with a data set from a health services research study in which homeless people with mental illness were randomized to three different treatments and measures of homelessness (as percentage days homeless in the past 3 months) and other auxiliary factors were recorded at follow‐up times that are not fixed by design.
Citing Literature
Number of times cited according to CrossRef: 50
- Eleanor M. Pullenayegum, Meeting the Assumptions of Inverse-Intensity Weighting for Longitudinal Data Subject to Irregular Follow-Up: Suggestions for the Design and Analysis of Clinic-Based Cohort Studies, Epidemiologic Methods, 10.1515/em-2018-0016, 0, 0, (2020).
- Armend Lokku, Lily S. Lim, Catherine S. Birken, Eleanor M. Pullenayegum, Summarizing the extent of visit irregularity in longitudinal data, BMC Medical Research Methodology, 10.1186/s12874-020-01023-w, 20, 1, (2020).
- JangDong Seo, Joint Models of Longitudinal Outcomes and Informative Time, Journal of Modern Applied Statistical Methods, 10.22237/jmasm/1556670000, 18, 1, (2020).
- Andrew Kouri, Janet Yamada, Joanna E M Sale, Sharon E Straus, Samir Gupta, Primary Care Pre-Visit Electronic Patient Questionnaire for Asthma: Uptake Analysis and Predictor Modeling, Journal of Medical Internet Research, 10.2196/19358, 22, 9, (e19358), (2020).
- Özgür Asar, David Bolin, Peter J. Diggle, Jonas Wallin, Linear mixed effects models for non‐Gaussian continuous repeated measurement data, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/rssc.12405, (2020).
- Lily Siok Hoon Lim, Brian M. Feldman, Using Registry Data to Understand Disease Evolution in Inflammatory Myositis and Other Rheumatic Diseases, Current Rheumatology Reports, 10.1007/s11926-019-0874-1, 22, 1, (2019).
- Manxia Liu, Fabio Stella, Arjen Hommersom, Peter J.F. Lucas, Lonneke Boer, Erik Bischoff, A comparison between discrete and continuous time Bayesian networks in learning from clinical time series data with irregularity, Artificial Intelligence in Medicine, 10.1016/j.artmed.2018.10.002, (2019).
- Richard J Cook, Jerald F Lawless, Independence conditions and the analysis of life history studies with intermittent observation, Biostatistics, 10.1093/biostatistics/kxz047, (2019).
- Susan M Shortreed, Andrea J Cook, R Yates Coley, Jennifer F Bobb, Jennifer C Nelson, Challenges and Opportunities for Using Big Health Care Data to Advance Medical Science and Public Health, American Journal of Epidemiology, 10.1093/aje/kwy292, (2019).
- Marlena Maziarz, Tianxi Cai, Li Qi, Anna S Lok, Yingye Zheng, Evaluating longitudinal markers under two-phase study designs, Biostatistics, 10.1093/biostatistics/kxy013, 20, 3, (485-498), (2018).
- Lianqiang Qu, Liuquan Sun, Xinyuan Song, A Joint Modeling Approach for Longitudinal Data with Informative Observation Times and a Terminal Event, Statistics in Biosciences, 10.1007/s12561-018-9221-8, 10, 3, (609-633), (2018).
- D.R. Cox, Christiana Kartsonaki, Ruth H. Keogh, Big data: Some statistical issues, Statistics & Probability Letters, 10.1016/j.spl.2018.02.015, 136, (111-115), (2018).
- Amy E. Wahlquist, Lutfiyya N. Muhammad, Teri Lynn Herbert, Viswanathan Ramakrishnan, Paul J. Nietert, Dissemination of novel biostatistics methods: Impact of programming code availability and other characteristics on article citations, PLOS ONE, 10.1371/journal.pone.0201590, 13, 8, (e0201590), (2018).
- Grace Y. Yi, Grace Y. Yi, Longitudinal Data with Covariate Measurement Error, Statistical Analysis with Measurement Error or Misclassification, 10.1007/978-1-4939-6640-0_5, (193-256), (2017).
- Richard J. Cook, Jerald F. Lawless, Analysis of Chronic Disease Processes Based on Cohort and Registry Data, Mathematical and Statistical Applications in Life Sciences and Engineering, 10.1007/978-981-10-5370-2, (305-325), (2017).
- Delaram Farzanfar, Asmaa Abumuamar, Jayoon Kim, Emily Sirotich, Yue Wang, Eleanor Pullenayegum, Longitudinal studies that use data collected as part of usual care risk reporting biased results: a systematic review, BMC Medical Research Methodology, 10.1186/s12874-017-0418-1, 17, 1, (2017).
- Wen Su, Hangjin Jiang, Semiparametric analysis of longitudinal data with informative observation times and censoring times, Journal of Applied Statistics, 10.1080/02664763.2017.1403574, (1-16), (2017).
- Eleanor M Pullenayegum, Lily SH Lim, Longitudinal data subject to irregular observation: A review of methods with a focus on visit processes, assumptions, and study design, Statistical Methods in Medical Research, 10.1177/0962280214536537, 25, 6, (2992-3014), (2016).
- YanBo Pei, Ting Du, LiuQuan Sun, Time-varying latent model for longitudinal data with informative observation and terminal event times, Science China Mathematics, 10.1007/s11425-016-0112-6, 59, 12, (2393-2410), (2016).
- Rui Miao, Xin Chen, Liu-quan Sun, Analyzing longitudinal data with informative observation and terminal event times, Acta Mathematicae Applicatae Sinica, English Series, 10.1007/s10255-016-0624-3, 32, 4, (1035-1052), (2016).
- Ting Du, Jieli Ding, Liuquan Sun, Joint modeling and estimation for longitudinal data with informative observation and terminal event times, Communications in Statistics - Theory and Methods, 10.1080/03610926.2014.960589, 45, 22, (6521-6539), (2016).
- T.-F.C. Lu, C.-M. Hsu, K.-H. Shu, S.-C. Weng, C.-M. Chen, Joint analysis of longitudinal data and competing terminal events in the presence of dependent observation times with application to chronic kidney disease, Journal of Applied Statistics, 10.1080/02664763.2016.1155202, 43, 16, (2922-2940), (2016).
- Sha Fang, Haixiang Zhang, Liuquan Sun, Joint analysis of longitudinal data with additive mixed effect model for informative observation times, Journal of Statistical Planning and Inference, 10.1016/j.jspi.2015.08.001, 169, (43-55), (2016).
- Sui He, Ting Du, Liuquan Sun, Joint modeling of longitudinal data with a dependent terminal event, Communications in Statistics - Theory and Methods, 10.1080/03610926.2013.851237, 45, 3, (813-835), (2016).
- Yong Chen, Jing Ning, Chunyan Cai, Regression analysis of longitudinal data with irregular and informative observation times, Biostatistics, 10.1093/biostatistics/kxv008, 16, 4, (727-739), (2015).
- J. F. Lawless, N. Nazeri Rad, Estimation and assessment of markov multistate models with intermittent observations on individuals, Lifetime Data Analysis, 10.1007/s10985-014-9310-z, 21, 2, (160-179), (2014).
- Liang Zhu, Hui Zhao, Jianguo Sun, Stanley Pounds, A Conditional Approach for Regression Analysis of Longitudinal Data with Informative Observation Time and Non-negligible Observation Duration, Communications in Statistics - Theory and Methods, 10.1080/03610926.2012.738841, 43, 23, (4998-5011), (2014).
- Jianguo Sun, Xingqiu Zhao, Jianguo Sun, Xingqiu Zhao, Regression Analysis of Panel Count Data II, Statistical Analysis of Panel Count Data, 10.1007/978-1-4614-8715-9_6, (121-153), (2013).
- M. C. Donohue, P. S. Aisen, Mixed model of repeated measures versus slope models in Alzheimer’s disease clinical trials, The journal of nutrition, health & aging, 10.1007/s12603-012-0047-7, 16, 4, (360-364), (2012).
- Liuquan Sun, Xinyuan Song, Jie Zhou, Lei Liu, Joint Analysis of Longitudinal Data With Informative Observation Times and a Dependent Terminal Event, Journal of the American Statistical Association, 10.1080/01621459.2012.682528, 107, 498, (688-700), (2012).
- Sehee Kim, Donglin Zeng, Lloyd Chambless, Yi Li, Joint Models of Longitudinal Data and Recurrent Events with Informative Terminal Event, Statistics in Biosciences, 10.1007/s12561-012-9061-x, 4, 2, (262-281), (2012).
- Liuquan Sun, Xinyuan Song, Jie Zhou, Regression analysis of longitudinal data with time-dependent covariates in the presence of informative observation and censoring times, Journal of Statistical Planning and Inference, 10.1016/j.jspi.2011.03.013, 141, 8, (2902-2919), (2011).
- Liu-quan Sun, Xiao-yun Mu, Zhi-hua Sun, Xing-wei Tong, Semiparametric analysis of longitudinal data with informative observation times, Acta Mathematicae Applicatae Sinica, English Series, 10.1007/s10255-011-0037-2, 27, 1, (29-42), (2010).
- Peter J. Diggle, Raquel Menezes, Ting‐li Su, Geostatistical inference under preferential sampling, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/j.1467-9876.2009.00701.x, 59, 2, (191-232), (2010).
- Stijn Vansteelandt, James Carpenter, Michael G. Kenward, Analysis of Incomplete Data Using Inverse Probability Weighting and Doubly Robust Estimators, Methodology, 10.1027/1614-2241/a000005, 6, 1, (37-48), (2010).
- P. Buzkova, E. R. Brown, G. C. John-Stewart, Longitudinal Data Analysis for Generalized Linear Models Under Participant-Driven Informative Follow-up: An Application in Maternal Health Epidemiology, American Journal of Epidemiology, 10.1093/aje/kwp353, 171, 2, (189-197), (2009).
- P. H. Van Ness, H. G. Allore, T. R. Fried, H. Lin, Inverse Intensity Weighting in Generalized Linear Models as an Option for Analyzing Longitudinal Data with Triggered Observations, American Journal of Epidemiology, 10.1093/aje/kwp333, 171, 1, (105-112), (2009).
- Liuquan Sun, Xingwei Tong, Analyzing longitudinal data with informative observation times under biased sampling, Statistics & Probability Letters, 10.1016/j.spl.2008.12.022, 79, 9, (1162-1168), (2009).
- Jamie Perin, John S. Preisser, Paul J. Rathouz, Semiparametric Efficient Estimation for Incomplete Longitudinal Binary Data, With Application to Smoking Trends, Journal of the American Statistical Association, 10.1198/jasa.2009.ap08527, 104, 488, (1373-1384), (2009).
- Liuquan Sun, Shaojun Guo, Min Chen, Marginal Regression Model with Time-Varying Coefficients for Panel Data, Communications in Statistics - Theory and Methods, 10.1080/03610920802395686, 38, 8, (1241-1261), (2009).
- Peter McCullagh, Sampling bias and logistic models, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/j.1467-9868.2007.00660.x, 70, 4, (643-677), (2008).
- A. Wade, J. Kurmanavicius, Creating unbiased cross-sectional covariate-related reference ranges from serial correlated measurements, Biostatistics, 10.1093/biostatistics/kxn022, 10, 1, (147-154), (2008).
- Peter Diggle, Daniel Farewell, Robin Henderson, Analysis of longitudinal data with drop‐out: objectives, assumptions and a proposal, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/j.1467-9876.2007.00590.x, 56, 5, (499-550), (2007).
- An-Lin Cheng, Haiqun Lin, Wesley Kasprow, Robert A. Rosenheck, Impact of Supported Housing on Clinical Outcomes, The Journal of Nervous and Mental Disease, 10.1097/01.nmd.0000252313.49043.f2, 195, 1, (83-88), (2007).
- Jianguo Sun, Liuquan Sun, Dandan Liu, Regression Analysis of Longitudinal Data in the Presence of Informative Observation and Censoring Times, Journal of the American Statistical Association, 10.1198/016214507000000851, 102, 480, (1397-1406), (2007).
- Peter H. Van Ness, Terrence E. Murphy, Katy L.B. Araujo, Margaret A. Pisani, Heather G. Allore, The use of missingness screens in clinical epidemiologic research has implications for regression modeling, Journal of Clinical Epidemiology, 10.1016/j.jclinepi.2007.03.006, 60, 12, (1239-1245), (2007).
- Analysis of Panel Count Data, The Statistical Analysis of Interval-censored Failure Time Data, 10.1007/0-387-37119-2, (205-228), (2006).
- John Copas, Shinto Eguchi, Local model uncertainty and incomplete‐data bias (with discussion), Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/j.1467-9868.2005.00512.x, 67, 4, (459-513), (2005).
- Tor Jacobson, Jesper Lindé, Kasper Roszbach, Credit Risk Versus Capital Requirements under Basel II: Are SME Loans and Retail Credit Really Different?, Journal of Financial Services Research, 10.1007/s10693-005-4356-4, 28, 1-3, (43-75), (2005).
- Tor Jacobson, Jesper Lindé, Kasper Roszbach, Credit Risk versus Capital Requirements under Basel II: Are SME Loans and Retail Credit Really Different?, SSRN Electronic Journal, 10.2139/ssrn.498882, (2004).




