
























Abstract:We study estimation of a class prior for unlabeled target samples which possibly differs from that of source population. Moreover, it is assumed that the source data is partially observable: only samples from the positive class and from the whole population are available (PU learning scenario). We introduce a novel direct estimator of a class prior which avoids estimation of posterior probabilities in both populations and has a simple geometric interpretation. It is based on a distribution matching technique together with kernel embedding in a Reproducing Kernel Hilbert Space and is obtained as an explicit solution to an optimisation task. We establish its asymptotic consistency as well as an explicit non-asymptotic bound on its deviation from the unknown prior, which is calculable in practice. We study finite sample behaviour for synthetic and real data and show that the proposal works consistently on par or better than its competitors.
| Subjects: | Machine Learning (stat.ML); Machine Learning (cs.LG) |
| Cite as: | arXiv:2502.21194 [stat.ML] |
| (or arXiv:2502.21194v3 [stat.ML] for this version) | |
| https://doi.org/10.48550/arXiv.2502.21194 arXiv-issued DOI via DataCite |
From: Paweł Teisseyre [view email]
[v1]
Fri, 28 Feb 2025 16:12:53 UTC (1,571 KB)
[v2]
Fri, 12 Sep 2025 08:49:56 UTC (1,299 KB)
[v3]
Thu, 21 May 2026 06:57:56 UTC (1,303 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。