
























Abstract:Standard maximum likelihood estimation of polychoric correlations is highly sensitive to contamination in survey data, including response errors, interviewer effects, and careless responding, yet assigns equal weight to all observations regardless of data quality. We develop robust estimators for polychoric correlation under complex survey designs based on two minimum divergence criteria -- Hellinger distance (HD) and negative exponential disparity (NED) -- incorporating survey weights through Horvitz--Thompson adjusted cell frequencies. For HD, we propose penalized Ridge and Lasso variants that regularize nuisance parameters while leaving the correlation unpenalized, and establish consistency and asymptotic normality with a sandwich covariance reflecting the sampling design. The influence function is finite but not uniformly bounded, reflecting Hellinger's sensitivity to sparse cells. Simulations under Poisson proportional-to-size sampling examine three contamination geometries -- concordant upper, concordant lower, and discordant mixed corner -- crossed with standard and non-standard latent marginals. The two estimator classes offer complementary advantages: penalized HD methods achieve the lowest mean squared error under concordant contamination, while NED performs best under discordant contamination and under compound misspecification--contamination effects. We provide practical guidelines for method selection based on anticipated contamination patterns in survey practice.
From: David Kepplinger [view email]
[v1]
Wed, 24 Jun 2026 19:49:04 UTC (202 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。