惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

A
Arctic Wolf
V
V2EX
P
Proofpoint News Feed
The Hacker News
The Hacker News
GbyAI
GbyAI
G
Google Developers Blog
S
Schneier on Security
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
W
WeLiveSecurity
Security Archives - TechRepublic
Security Archives - TechRepublic
博客园 - Franky
Recent Announcements
Recent Announcements
腾讯CDC
Hacker News - Newest:
Hacker News - Newest: "LLM"
K
Kaspersky official blog
U
Unit 42
Engineering at Meta
Engineering at Meta
J
Java Code Geeks
Google Online Security Blog
Google Online Security Blog
Last Week in AI
Last Week in AI
V
Vulnerabilities – Threatpost
N
News and Events Feed by Topic
O
OpenAI News
量子位
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
Y
Y Combinator Blog
博客园 - 【当耐特】
Vercel News
Vercel News
Hacker News: Ask HN
Hacker News: Ask HN
T
Tor Project blog
Apple Machine Learning Research
Apple Machine Learning Research
Microsoft Security Blog
Microsoft Security Blog
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
AWS News Blog
AWS News Blog
MongoDB | Blog
MongoDB | Blog
S
Security Affairs
A
About on SuperTechFans
Project Zero
Project Zero
D
Darknet – Hacking Tools, Hacker News & Cyber Security
博客园 - 聂微东
Webroot Blog
Webroot Blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
Cloudbric
Cloudbric
T
Tenable Blog
月光博客
月光博客
C
Check Point Blog
宝玉的分享
宝玉的分享
V
Visual Studio Blog
T
The Blog of Author Tim Ferriss
NISL@THU
NISL@THU

math.ST updates on arXiv.org

A Polyak-Ruppert Central Limit Theorem for SA-Adam with Momentum and Non-Convergent Adaptive Preconditioning Inference Optimal Long Run Variance Estimation with Lugsail Kernels Non-asymptotic Tail Bounds for the Kostlan--Shub--Smale Field: Tensor PCA and Spherical $k$-Spin Complexity Conformal Prediction Intervals with Tail-Specific Guarantees Statistical Foundations of LLM-based A/B Testing: A Surrogacy Framework for Human Causal Inference Tight $L_\infty$ Sample Complexity for Low-Degree and Sparse Boolean Polynomials Dependent Censoring Based on Geometric Optimization Proximal Mediation Analysis with Hidden Recanting Witnesses Spectral recovery of a planted triangle-dense subgraph On Response-Adaptive Targeting Strategies for Multi-Treatment Experiments Statistical Advantages of Oblique Randomized Decision Trees and Forests Consistency of Variational Inference for Nonlinear Inverse Problems of Partial Differential Equations Active Subsampling for Measurement-Constrained M-Estimation of Individualized Thresholds with High-Dimensional Data Parameter Estimation for Partially Observed Affine and Polynomial Processes Moving Least Squares without Quasi-Uniformity: A Stochastic Approach Tests for categorical data beyond Pearson: A distance covariance and energy distance approach Randomized Midpoint Method for Log-Concave Sampling under Constraints Convergence rate of Euler--Maruyama scheme to the invariant probability measure under total variation distance for the SDEs Learning Survival Models with Right-Censored Reporting Delays Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach Learning from Biased and Costly Data Sources: Minimax-optimal Data Collection under a Budget Splitting schemes and estimators for stochastic differential equations with Hölder multiplicative noise Limit theorems of Azadkia-Chatterjee's conditional graph correlation Phase Transition in Convex Relaxations for Graph Alignment Information Gap and Feasibility-Aware Inference in Binomial Logistic Mixtures Minimax Synthesis of Network Mechanisms Wild bootstrap for mean response inference in functional linear regression models On the Geometry of Separation in Finite Gaussian Mixtures Moment-Free Kunchenko Stochastic Polynomials via Empirical Characteristic Function Higher-order spectral perturbation expansions II: Kernel matrices and manifold learning Calibrating the Brody exponent as a quantitative measure of short-range exclusion in 2D spatial point processes Optimal Multiscale Learning of Linear Operators Learning the Geometry of Data: A Mathematical Review of Shape Space Analysis A Decision-Theoretic View of Test-Time Training: When, How Far, and Which Directions to Adapt Non-Equilibrium Model Selection via Finite-Time Thermodynamics Euler Stratifications of Second Hypersimplices via Delta-matroids Testing for a Hidden Geometry in Random Graphs Filtered Conformal Ellipsoids for Graph-Native Time Series Separate Exchangeability as Modeling Principle in Bayesian Nonparametrics Estimation of High-Dimensional Normal Means through Inferential Models KL-BSS: Rethinking optimality for neighbourhood selection in structural equation models TrIM: Transformed Iterative Mondrian Forests for Gradient-based Dimension Reduction and High-Dimensional Regression Change Point Detection in Precision Matrices with D-trace Loss Functional Extreme-PLS A Necessary and Sufficient Condition for Size Controllability of Heteroskedasticity Robust Test Statistics Optimal structure learning and conditional independence testing Kernel Two-Sample Testing via Directional Components Analysis Robustified Gaussian quasi-likelihood inference for volatility Matching correlated VAR time series Debiased Inference for High-Dimensional Regression Models Based on Profile M-Estimation Stein's method for the matrix normal distribution Detecting Where Effects Occur by Testing Hypotheses in Order Treatment effect estimation under convergent network interference Robustified Gaussian quasi-BIC for volatility Sharp One-Dimensional Sub-Gaussian Comparison in Convex Order Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model Data augmented bootstrap: Unifying confidence interval construction by approximate invariance Parametrically Adaptive Transition Polynomial: a Signed-Parity Continuous-alpha Extension of Kunchenko Stochastic Polynomials Trace-Class Results for MCMC Algorithms for Student-t Regression Models Adaptive Sequential Change Detection using Mixtures of Predictive Distributions Asymptotically Optimal Sequential Testing with Markovian Data Eigen-Spike Emergence and Quadratic Equivalents for Conjugate Kernels on Nonlinearly Separable Data The 'Right' Extension of Type-I Error to Data-Dependent Levels Safe and Sharp Honest Inference for Nonparametric Estimation via Empirical Bernstein Calibration High-Dimensional Robust Change-Point Detection via Angular Kernel Statistics Imbalanced Classification under Capacity Constraints Optimism Stabilizes Thompson Sampling for Adaptive Inference
Extended feature allocation models
[Submitted on 14 Feb 2025 (v1), last revised 16 Jun 2026 (this v · 2026-06-17 · via math.ST updates on arXiv.org

View PDF HTML (experimental)

Abstract:Feature allocation models are Bayesian nonparametric tools tailored to data in which each observation can simultaneously exhibit multiple characteristics, or features. A fundamental limitation of standard formulations is that feature labels are assumed to be independent and identically distributed, and therefore play no role in posterior inference. The present paper introduces a unified Bayesian framework for extended feature allocation models, in which feature labels and proportions are modeled jointly, thereby enabling the simultaneous discovery of features and learning of dependencies among their labels. Building on point process theory, we develop a full Bayesian analysis of these models. Within this general setting, we also characterize previously proposed priors as those leading to poor predictive distributions, which cannot capture label dependencies and are insensitive to the observed frequency spectrum. Our methodology is designed to move beyond such standard formulations by leveraging the information carried by feature labels. We demonstrate the usefulness of our approach by introducing: (i) a Cox process prior that clusters genomic variant embeddings while predicting new variants and new variant clusters; (ii) a determinantal point process prior for repeated forest surveys, where prediction concerns both the number and the locations of unobserved trees.

Submission history

From: Lorenzo Ghilotti [view email]
[v1] Fri, 14 Feb 2025 16:08:42 UTC (1,566 KB)
[v2] Mon, 3 Mar 2025 10:11:44 UTC (1,566 KB)
[v3] Tue, 16 Jun 2026 09:05:33 UTC (1,261 KB)