惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

cs updates on arXiv.org

End-to-End Intracortical Speech Decoding from Neural Activity When Does Synthetic Patent Data Help? Volume-Fidelity Trade-offs in Low-Resource Multi-Label Classification SliceWorld: A Predictive and Controllable World-State Model for CT Report Generation From One-Pass SGD to Data Reuse: Mini-Batch Scaling Laws in Sketched Linear Regression PAIRED: A Process-Anchored Framework for Transparent Reporting of AI Contributions in Scientific Research Terrain-Adaptive Grouser Wheel for Optimal Planetary Exploration: Design and Experimental Investigation Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence Ant Backpressure Routing for Dynamic Wireless Multi-hop Networks with Mixed Traffic Patterns Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows Structure-Aware RAG: Structured Retrieval Augmented Generation from Noisy Data for Conversational Agents Improving the Accuracy of the Exponentially Fitted Scheme on Piecewise Uniform Meshes Partner-Aware Hierarchical Skill Discovery for Robust Human-AI Collaboration Me, Myself, and My Voice: Exploring Cultural and Linguistic Identity in AAC AI-generated Voices AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust Treatment Effect Estimation with Differentiated Networked Effect on Graph Data CurveRL: Principled Distribution-Aware Context Reweighting for LLM Reasoning Distinguishing Right from Wrong in Debates: Attribution Analysis of Chinese Harmful Memes Enhancing Reliability in LLM-Based Secure Code Generation CRISP -- Clustering-Based Redundancy-Reduced Instance Sampling for Pathology Case Representation and Retrieval Unlocking Apple's Private Cloud Compute: An Analysis of Privacy-Preserving Artificial Intelligence Concept Drift Adaptation Using Self-Supervised and Reinforcement Learning In Android Malware Detection Faithfulness as Information Flow: Evaluating and Training Faithful Chain-of-Thought Reasoning SinFormer: A Tailored Transformer for Robust Radio Frequency Fingerprint Identification Five Queries Are Enough: Query-Efficient and Surrogate-Free Membership Inference Attacks on RAG via Entailment Benchmarking Patent Embeddings: A Multi-Task Evaluation of 22 Models Across Retrieval, Classification, and Clustering ScaleAcross Explorer: Exploring Communication Optimization for Scale-Across AI Model Training Contested Temporalities in Critical Minerals and Resource Extraction for Electric Vehicles ViViD-5K: Vineyard vision dataset for field-based berry detection and segmentation and grape cluster closure estimation Causal Physics Steering in Video World Models via Concept Activation Vectors LEARNT: A Practical Estimator for Cardinality of LIKE Queries with Formal Accuracy Guarantees A lift for input-convex neural network training Discovering Lexical Gaps Using Embeddings from Multilingual LLMs Omissive Bias in Religious Representation: Benchmarking LLM Answers to Everyday Ethical Decision-making Learning Laplacian Eigenspace with Mass-Aware Neural Operators on Point Clouds GEESE: Genotype-aware End-to-End Spatio-temporal Embedding for Behavioral Phenotyping ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale Side-by-side Comparison Amplifies Dialect Bias in Language Models ECo-MoE: Embodiment-Conditioned Mixture of Experts Increases the Evolvability of Robots Polar: Agentic RL on Any Harness at Scale Rubato: Transcribing Piano Music with Timestamps MeVer at CheckThat! 2026: Cluster-Aware Hard-Negative Mining for Multilingual Scientific-Source Retrieval Safety-Oriented Routing Analysis of Mixtral MoE Under Benign and Harmful Prompts Resident KV Claims: A Conformance Contract for Future Reuse under Active KV Pressure Bayesian Rational Search Engine User Gaussian Rank-Based Neighborhood Degree for Graph Neural Networks in Image Classification How Much Structure Do LLMs Need? Evaluating LLMs for Bibliometric Cluster Description Modernizing User Privacy Preference Measurement through GPPI: A GDPR-aligned Privacy Preference Item Bank RxGS: Receiver-Generalizable 3D Gaussian Splatting for Radio-Frequency Data Synthesis Designs, linear codes, plateaued functions, and their interconnections Network Digital Twin for Congestion-Aware Predictive Traffic Routing using Graph MPNNs Spectral analysis and sine transform based preconditioning for a structure preserving stabilized scheme approximating the space-fractional Allen Cahn equation with logarithmic potential Reframing LLM Agent Security as an Agent-Human Interaction Problem Cross-Modal Action Recognition in Egocentric Video Using Mamba: Integrating RGB and Hand Skeleton Streams via CLS Token Fusion Strategies IsaacIPC: Coupling High-Fidelity Simulation and Realistic Rendering for Contact-Rich Robotic Systems Tacit Signal Infrastructure: Towards AI Systems that Model Expert Sensing Over Time Geometry-Preserving Nudged Elastic Band and Dimer Methods under Anisotropic Force Uncertainty PACT: Proactive Asking for Continual Task Assistance in Human-Robot Collaboration JT-SAFE-V2: Safety-by-Design Foundation Model with World-Context Data SparseWorld: Enhancing End-to-End Autonomous Driving via World Models with Sparse Scene Representation Adaptive Human-AI Coordination via Hierarchical Action Disentanglement Unified 3D Scene Understanding Through Physical World Modeling On Permutation Groups of Cyclic Codes over Finite Fields ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions TUBE: Tangent Upper Bound on Evidence for Discrete Diffusion Language Models Fourier Feature Pyramids for Physics-Informed Neural Networks PrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasets DRInQ: Evaluating Conversational Implicature with Controlled Context Variation ChainzRule: Sample-Efficient, Robust Deep Learning Across Tabular, NLP, and Vision Tasks Interdomain Attention: Beyond Token-Level Key-Value Memory LLMs Show No Signs Of Individuated Metacognition Refined Analysis of Entropy-Regularized Actor-Critic AvAtar: Learning to Align via Active Optimal Transport Assessing the Operational Viability of Foundation Models for Time Series Forecasting Evolving Robustness--Exploration Trade-off in Online Reinforcement Learning via Quantile Bayesian Risk MDPs Private Adaptive Covariance Estimation via Gaussian Graphical Models An Interactive Paradigm for Deep Research An Empirical Evaluation of LLM-Generated Code Security Across Prompting Methods Synheart Capacity: A Theory-Driven Physiological Representation of Cognitive Capacity Dynamics from Wearable Signals Identifying and Mitigating Systemic Measurement Bias in Production LLM Inference Benchmarks Sketch Bug: Using Sketch-Based Input for Interactive Code Debugging Analyzing the Effects of Two-Stage Peer Evaluation The Model Is Not the Product: A Dual-Pillar Architecture for Local-First Psychological Coaching A Unified Python Framework for Direct PPO-based Control of AHUs with Economizer Logic and CO2-Constrained Ventilation GIBLy: Improving 3D Semantic Segmentation through an Architecture-Agnostic Lightweight Geometric Inductive Bias Layer Toward Enactive Artificial Intelligence Accuracy Analysis of the Proxy Point Method with Applications to Some Toeplitz Matrices Attested Tool-Server Admission: A Security Extension to the Model Context Protocol Plume Segmentation from MethaneSAT with Cross-Sensor Transfer Learning and Physics-Informed Postprocessing Program Synthesis for Non-Linear Real Arithmetic: Going Beyond Realizability Deep-Research Agents Can Be Poisoned via User-Generated Content How Well Do Models Follow Their Constitutions? Generative OOD-regularized Model-based Policy Optimization Improving Labeling Consistency with Detailed Constitutional Definitions and AI-Driven Evaluation Humans Cannot Detect AI-Generated Media But Communities May -- For Now: Collaborative AI Detection in r/RealOrAI on Reddit CoDA: Color Distribution Probing for Efficient and Generalizable AI-Generated Image Detection ArtSplat: Feed-Forward Articulated 3D Gaussian Splatting from Sparse Multi-State Uncalibrated Views Can Graph-Based Microservice Performance Detection Be Used for Microservice Intrusion Detection? A Comprehensive Evaluation of Vertex Elimination Algorithms for Algorithmic Differentiation Learning regime-dependent governing equations: A symbolic decision tree approach
Dual Prototype-Conditioned Diffusion Model for Scalable Multi-Class Unsupervised Anomaly Detection in Large Category Spaces
Yaoxuan Feng · 2026-05-26 · via cs updates on arXiv.org

View PDF HTML (experimental)

Abstract:Multi-class anomaly detection aims to build unified models across diverse product categories. However, as the number of categories grows, its performance often degrades due to increasingly complex and heterogeneous normal distributions. To address this challenge, we propose DPDiff-AD, a Dual Prototype-conditioned Diffusion model for large-scale multi-class Anomaly Detection. DPDiff-AD models heterogeneous normal distributions through complementary local and global prototypes. Local prototypes capture representative fine-grained structural patterns via nearest-prototype aggregation, while global prototypes regulate holistic feature geometry through optimal transport regularization. Together, these dual-scale representations define a structured normality space. This space is refined through diffusion-based reconstruction conditioned on both local and global prototypes via prototype-aware attention. By jointly leveraging dual prototypes during generation, DPDiff-AD achieves precise normality modeling, preserves structured separability as category cardinality grows, and enables scalable anomaly discrimination. Extensive experiments across five benchmarks demonstrate the effectiveness and scalability of DPDiff-AD. On the 160-category large-scale dataset, it improves image- and pixel-level AUROC by 5.3 and 2.9 points over the previous state-of-the-art method Dinomaly+, while maintaining stable performance as category cardinality increases.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2605.24402 [cs.CV]
  (or arXiv:2605.24402v1 [cs.CV] for this version)
  https://doi.org/10.48550/arXiv.2605.24402

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yaoxuan Fen [view email]
[v1] Sat, 23 May 2026 05:10:18 UTC (23,766 KB)