Volume 66, Issue 3

Analysis of longitudinal data with irregular, outcome‐dependent follow‐up

Haiqun Lin

Yale University, New Haven, USA

Search for more papers by this author
Daniel O. Scharfstein

Johns Hopkins Bloomberg School of Public Health, Baltimore, USA

Search for more papers by this author
Robert A. Rosenheck

Veterans Affairs Northeast Program Evaluation Center and Yale University, West Haven, USA

Search for more papers by this author
First published: 15 July 2004
Citations: 50
Haiqun Lin, Division of Biostatistics, Department of Epidemiology and Public Health, Yale University, Room 208 LEPH, 60 College Street, New Haven, CT 06520, USA.
E‐mail: haiqun.lin@yale.edu

Abstract

Summary. A frequent problem in longitudinal studies is that subjects may miss scheduled visits or be assessed at self‐selected points in time. As a result, observed outcome data may be highly unbalanced and the availability of the data may be directly related to the outcome measure and/or some auxiliary factors that are associated with the outcome. If the follow‐up visit and outcome processes are correlated, then marginal regression analyses will produce biased estimates. Building on the work of Robins, Rotnitzky and Zhao, we propose a class of inverse intensity‐of‐visit process‐weighted estimators in marginal regression models for longitudinal responses that may be observed in continuous time. This allows us to handle arbitrary patterns of missing data as embedded in a subject's visit process. We derive the large sample distribution for our inverse visit‐intensity‐weighted estimators and investigate their finite sample behaviour by simulation. Our approach is illustrated with a data set from a health services research study in which homeless people with mental illness were randomized to three different treatments and measures of homelessness (as percentage days homeless in the past 3 months) and other auxiliary factors were recorded at follow‐up times that are not fixed by design.

Number of times cited according to CrossRef: 50

  • Meeting the Assumptions of Inverse-Intensity Weighting for Longitudinal Data Subject to Irregular Follow-Up: Suggestions for the Design and Analysis of Clinic-Based Cohort Studies, Epidemiologic Methods, 10.1515/em-2018-0016, 0, 0, (2020).
  • Summarizing the extent of visit irregularity in longitudinal data, BMC Medical Research Methodology, 10.1186/s12874-020-01023-w, 20, 1, (2020).
  • Joint Models of Longitudinal Outcomes and Informative Time, Journal of Modern Applied Statistical Methods, 10.22237/jmasm/1556670000, 18, 1, (2020).
  • Primary Care Pre-Visit Electronic Patient Questionnaire for Asthma: Uptake Analysis and Predictor Modeling, Journal of Medical Internet Research, 10.2196/19358, 22, 9, (e19358), (2020).
  • Linear mixed effects models for non‐Gaussian continuous repeated measurement data, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/rssc.12405, (2020).
  • Using Registry Data to Understand Disease Evolution in Inflammatory Myositis and Other Rheumatic Diseases, Current Rheumatology Reports, 10.1007/s11926-019-0874-1, 22, 1, (2019).
  • A comparison between discrete and continuous time Bayesian networks in learning from clinical time series data with irregularity, Artificial Intelligence in Medicine, 10.1016/j.artmed.2018.10.002, (2019).
  • Independence conditions and the analysis of life history studies with intermittent observation, Biostatistics, 10.1093/biostatistics/kxz047, (2019).
  • Challenges and Opportunities for Using Big Health Care Data to Advance Medical Science and Public Health, American Journal of Epidemiology, 10.1093/aje/kwy292, (2019).
  • Evaluating longitudinal markers under two-phase study designs, Biostatistics, 10.1093/biostatistics/kxy013, 20, 3, (485-498), (2018).
  • A Joint Modeling Approach for Longitudinal Data with Informative Observation Times and a Terminal Event, Statistics in Biosciences, 10.1007/s12561-018-9221-8, 10, 3, (609-633), (2018).
  • Big data: Some statistical issues, Statistics & Probability Letters, 10.1016/j.spl.2018.02.015, 136, (111-115), (2018).
  • Dissemination of novel biostatistics methods: Impact of programming code availability and other characteristics on article citations, PLOS ONE, 10.1371/journal.pone.0201590, 13, 8, (e0201590), (2018).
  • Longitudinal Data with Covariate Measurement Error, Statistical Analysis with Measurement Error or Misclassification, 10.1007/978-1-4939-6640-0_5, (193-256), (2017).
  • Analysis of Chronic Disease Processes Based on Cohort and Registry Data, Mathematical and Statistical Applications in Life Sciences and Engineering, 10.1007/978-981-10-5370-2, (305-325), (2017).
  • Longitudinal studies that use data collected as part of usual care risk reporting biased results: a systematic review, BMC Medical Research Methodology, 10.1186/s12874-017-0418-1, 17, 1, (2017).
  • Semiparametric analysis of longitudinal data with informative observation times and censoring times, Journal of Applied Statistics, 10.1080/02664763.2017.1403574, (1-16), (2017).
  • Longitudinal data subject to irregular observation: A review of methods with a focus on visit processes, assumptions, and study design, Statistical Methods in Medical Research, 10.1177/0962280214536537, 25, 6, (2992-3014), (2016).
  • Time-varying latent model for longitudinal data with informative observation and terminal event times, Science China Mathematics, 10.1007/s11425-016-0112-6, 59, 12, (2393-2410), (2016).
  • Analyzing longitudinal data with informative observation and terminal event times, Acta Mathematicae Applicatae Sinica, English Series, 10.1007/s10255-016-0624-3, 32, 4, (1035-1052), (2016).
  • Joint modeling and estimation for longitudinal data with informative observation and terminal event times, Communications in Statistics - Theory and Methods, 10.1080/03610926.2014.960589, 45, 22, (6521-6539), (2016).
  • Joint analysis of longitudinal data and competing terminal events in the presence of dependent observation times with application to chronic kidney disease, Journal of Applied Statistics, 10.1080/02664763.2016.1155202, 43, 16, (2922-2940), (2016).
  • Joint analysis of longitudinal data with additive mixed effect model for informative observation times, Journal of Statistical Planning and Inference, 10.1016/j.jspi.2015.08.001, 169, (43-55), (2016).
  • Joint modeling of longitudinal data with a dependent terminal event, Communications in Statistics - Theory and Methods, 10.1080/03610926.2013.851237, 45, 3, (813-835), (2016).
  • Regression analysis of longitudinal data with irregular and informative observation times, Biostatistics, 10.1093/biostatistics/kxv008, 16, 4, (727-739), (2015).
  • Estimation and assessment of markov multistate models with intermittent observations on individuals, Lifetime Data Analysis, 10.1007/s10985-014-9310-z, 21, 2, (160-179), (2014).
  • A Conditional Approach for Regression Analysis of Longitudinal Data with Informative Observation Time and Non-negligible Observation Duration, Communications in Statistics - Theory and Methods, 10.1080/03610926.2012.738841, 43, 23, (4998-5011), (2014).
  • Regression Analysis of Panel Count Data II, Statistical Analysis of Panel Count Data, 10.1007/978-1-4614-8715-9_6, (121-153), (2013).
  • Mixed model of repeated measures versus slope models in Alzheimer’s disease clinical trials, The journal of nutrition, health & aging, 10.1007/s12603-012-0047-7, 16, 4, (360-364), (2012).
  • Joint Analysis of Longitudinal Data With Informative Observation Times and a Dependent Terminal Event, Journal of the American Statistical Association, 10.1080/01621459.2012.682528, 107, 498, (688-700), (2012).
  • Joint Models of Longitudinal Data and Recurrent Events with Informative Terminal Event, Statistics in Biosciences, 10.1007/s12561-012-9061-x, 4, 2, (262-281), (2012).
  • Regression analysis of longitudinal data with time-dependent covariates in the presence of informative observation and censoring times, Journal of Statistical Planning and Inference, 10.1016/j.jspi.2011.03.013, 141, 8, (2902-2919), (2011).
  • Semiparametric analysis of longitudinal data with informative observation times, Acta Mathematicae Applicatae Sinica, English Series, 10.1007/s10255-011-0037-2, 27, 1, (29-42), (2010).
  • Geostatistical inference under preferential sampling, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/j.1467-9876.2009.00701.x, 59, 2, (191-232), (2010).
  • Analysis of Incomplete Data Using Inverse Probability Weighting and Doubly Robust Estimators, Methodology, 10.1027/1614-2241/a000005, 6, 1, (37-48), (2010).
  • Longitudinal Data Analysis for Generalized Linear Models Under Participant-Driven Informative Follow-up: An Application in Maternal Health Epidemiology, American Journal of Epidemiology, 10.1093/aje/kwp353, 171, 2, (189-197), (2009).
  • Inverse Intensity Weighting in Generalized Linear Models as an Option for Analyzing Longitudinal Data with Triggered Observations, American Journal of Epidemiology, 10.1093/aje/kwp333, 171, 1, (105-112), (2009).
  • Analyzing longitudinal data with informative observation times under biased sampling, Statistics & Probability Letters, 10.1016/j.spl.2008.12.022, 79, 9, (1162-1168), (2009).
  • Semiparametric Efficient Estimation for Incomplete Longitudinal Binary Data, With Application to Smoking Trends, Journal of the American Statistical Association, 10.1198/jasa.2009.ap08527, 104, 488, (1373-1384), (2009).
  • Marginal Regression Model with Time-Varying Coefficients for Panel Data, Communications in Statistics - Theory and Methods, 10.1080/03610920802395686, 38, 8, (1241-1261), (2009).
  • Sampling bias and logistic models, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/j.1467-9868.2007.00660.x, 70, 4, (643-677), (2008).
  • Creating unbiased cross-sectional covariate-related reference ranges from serial correlated measurements, Biostatistics, 10.1093/biostatistics/kxn022, 10, 1, (147-154), (2008).
  • Analysis of longitudinal data with drop‐out: objectives, assumptions and a proposal, Journal of the Royal Statistical Society: Series C (Applied Statistics), 10.1111/j.1467-9876.2007.00590.x, 56, 5, (499-550), (2007).
  • Impact of Supported Housing on Clinical Outcomes, The Journal of Nervous and Mental Disease, 10.1097/01.nmd.0000252313.49043.f2, 195, 1, (83-88), (2007).
  • Regression Analysis of Longitudinal Data in the Presence of Informative Observation and Censoring Times, Journal of the American Statistical Association, 10.1198/016214507000000851, 102, 480, (1397-1406), (2007).
  • The use of missingness screens in clinical epidemiologic research has implications for regression modeling, Journal of Clinical Epidemiology, 10.1016/j.jclinepi.2007.03.006, 60, 12, (1239-1245), (2007).
  • Analysis of Panel Count Data, The Statistical Analysis of Interval-censored Failure Time Data, 10.1007/0-387-37119-2, (205-228), (2006).
  • Local model uncertainty and incomplete‐data bias (with discussion), Journal of the Royal Statistical Society: Series B (Statistical Methodology), 10.1111/j.1467-9868.2005.00512.x, 67, 4, (459-513), (2005).
  • Credit Risk Versus Capital Requirements under Basel II: Are SME Loans and Retail Credit Really Different?, Journal of Financial Services Research, 10.1007/s10693-005-4356-4, 28, 1-3, (43-75), (2005).
  • Credit Risk versus Capital Requirements under Basel II: Are SME Loans and Retail Credit Really Different?, SSRN Electronic Journal, 10.2139/ssrn.498882, (2004).