




















Abstract:We study clinically motivated capacity-constrained treatment-adherence outreach through a belief-state restless multi-armed bandit model, in which each patient is a partially observed two-state Markov decision process and interventions induce reset-type belief dynamics. For the discounted criterion, partial conservation law (PCL)-based conditions are used to establish single-patient threshold-policy optimality and indexability (threshold-indexability) and yield a closed-form Whittle index, threshold performance metrics, and an explicit optimal threshold map. We also prove a single-patient long-run average analogue on the invariant belief core and obtain an explicit average-criterion Whittle index. For the multi-patient model, the PCL-derived formulas give an analytic Lagrangian relaxation, efficient dual bounds, and computable Lagrangian index benchmark policies, including a forced-capacity variant. We analyze how the Whittle index depends on lapse and spontaneous-recovery probabilities. In large-scale experiments with two-type, three-type, and jittered finite-mixture populations, the Whittle and forced-capacity Lagrangian index policies are the strongest performers, while myopic prioritization can be substantially worse under tight capacity.
From: José Niño-Mora [view email]
[v1]
Sun, 11 Jan 2026 16:18:07 UTC (332 KB)
[v2]
Mon, 15 Jun 2026 21:30:34 UTC (325 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。