TY - JOUR
T1 - Covariate selection in propensity score matching
T2 - A case study of how the Shinkansen has impacted population changes in Japan
AU - Wang, Jingyuan
AU - Terabe, Shintaro
AU - Yaginuma, Hideki
N1 - Publisher Copyright:
© 2025
PY - 2025/3
Y1 - 2025/3
N2 - This study presents a novel covariate selection method that combines Maximum Likelihood Estimation (MLE) with the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC), focusing on achieving a balance between model fit and complexity. Our findings emphasize the significant impact of covariates on estimated results in Propensity Score Matching (PSM) analyses. Through case studies, we validate the effectiveness of our proposed method across various PSM approaches, including one-to-one matching, K-nearest neighbors matching, radius matching, kernel matching, and Inverse Probability Weighting (IPW). For researchers constrained to cross-sectional data, our comparisons among different PSM methodologies provide valuable insights. Additionally, we explore the applicability of our method to PSM extensions such as Covariate Balancing Propensity Score (CBPS) and PSM-Difference-in-Differences (DID). Our case study reveals significant causal effects of Japan's Shinkansen on population changes, with notable growth observed in both cross-sectional and panel data analyses.These findings hold important implications for transportation policy, and we offer recommendations for relevant policymakers based on our results.
AB - This study presents a novel covariate selection method that combines Maximum Likelihood Estimation (MLE) with the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC), focusing on achieving a balance between model fit and complexity. Our findings emphasize the significant impact of covariates on estimated results in Propensity Score Matching (PSM) analyses. Through case studies, we validate the effectiveness of our proposed method across various PSM approaches, including one-to-one matching, K-nearest neighbors matching, radius matching, kernel matching, and Inverse Probability Weighting (IPW). For researchers constrained to cross-sectional data, our comparisons among different PSM methodologies provide valuable insights. Additionally, we explore the applicability of our method to PSM extensions such as Covariate Balancing Propensity Score (CBPS) and PSM-Difference-in-Differences (DID). Our case study reveals significant causal effects of Japan's Shinkansen on population changes, with notable growth observed in both cross-sectional and panel data analyses.These findings hold important implications for transportation policy, and we offer recommendations for relevant policymakers based on our results.
KW - Causal Inference
KW - Covariate Selection
KW - Double/Debiased Machine Learning
KW - High-speed railway
KW - Propensity Score
UR - http://www.scopus.com/inward/record.url?scp=85217372457&partnerID=8YFLogxK
U2 - 10.1016/j.cstp.2025.101389
DO - 10.1016/j.cstp.2025.101389
M3 - Article
AN - SCOPUS:85217372457
SN - 2213-624X
VL - 19
JO - Case Studies on Transport Policy
JF - Case Studies on Transport Policy
M1 - 101389
ER -