Volume 76, Issue 1
Original Article

Covariate balancing propensity score

Kosuke Imai

Corresponding Author

Princeton University, USA

Address for correspondence: Kosuke Imai, Department of Politics, Princeton University, Princeton, NJ 08544, USA. E-mail: kimai@princeton.edu
First published: 03 July 2013
Citations: 216

Summary

The propensity score plays a central role in a variety of causal inference settings. In particular, matching and weighting methods based on the estimated propensity score have become increasingly common in the analysis of observational data. Despite their popularity and theoretical appeal, the main practical difficulty of these methods is that the propensity score must be estimated. Researchers have found that slight misspecification of the propensity score model can result in substantial bias of estimated treatment effects. We introduce covariate balancing propensity score (CBPS) methodology, which models treatment assignment while optimizing the covariate balance. The CBPS exploits the dual characteristics of the propensity score as a covariate balancing score and the conditional probability of treatment assignment. The estimation of the CBPS is done within the generalized method‐of‐moments or empirical likelihood framework. We find that the CBPS dramatically improves the poor empirical performance of propensity score matching and weighting methods reported in the literature. We also show that the CBPS can be extended to other important settings, including the estimation of the generalized propensity score for non‐binary treatments and the generalization of experimental estimates to a target population. Open source software is available for implementing the methods proposed.
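To make the idea of "modelling treatment assignment while optimizing the covariate balance" concrete, here is a minimal sketch. It is not the authors' software. It assumes a logistic propensity score model, a binary treatment, the average treatment effect as the target, and the just-identified case in which only the balance conditions are used. Under those assumptions the coefficients solve the sample moment condition that the inverse-probability-weighted covariate totals of the treated and control groups coincide. The function names, the simulated data, and the use of `scipy.optimize.root` as the solver are illustrative choices rather than anything specified in the summary.

```python
import numpy as np
from scipy.optimize import root


def propensity(beta, X):
    """Logistic propensity score pi_i = 1 / (1 + exp(-x_i' beta))."""
    return 1.0 / (1.0 + np.exp(-X @ beta))


def balance_moments(beta, X, t):
    """Sample covariate balancing conditions for the ATE.

    Returns (1/n) * sum_i [t_i / pi_i - (1 - t_i) / (1 - pi_i)] * x_i,
    which is zero when the weighted covariate totals of the two groups match.
    """
    pi = propensity(beta, X)
    w = t / pi - (1 - t) / (1 - pi)
    return X.T @ w / len(t)


def fit_balancing_score(X, t):
    """Solve the just-identified balance conditions for beta (illustrative)."""
    beta0 = np.zeros(X.shape[1])
    sol = root(balance_moments, beta0, args=(X, t))
    return sol.x, sol.success


def weighted_means(X, t, pi):
    """Inverse-probability-weighted covariate means for treated and controls."""
    w1 = t / pi
    w0 = (1 - t) / (1 - pi)
    return (X.T @ w1) / w1.sum(), (X.T @ w0) / w0.sum()


# Simulated data (hypothetical): an intercept plus two covariates.
rng = np.random.default_rng(0)
n = 2000
Z = rng.normal(size=(n, 2))
X = np.column_stack([np.ones(n), Z])
true_beta = np.array([-0.3, 0.8, -0.5])
t = rng.binomial(1, propensity(true_beta, X))

beta_hat, converged = fit_balancing_score(X, t)
mean_treated, mean_control = weighted_means(X, t, propensity(beta_hat, X))
```

Because the intercept is one of the balanced covariates, the two sets of weights have the same total, so the weighted covariate means of the treated and control groups agree at the solution. The over-identified version described in the summary would add the likelihood score conditions and estimate the coefficients by the generalized method of moments or empirical likelihood, which this sketch does not attempt.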
