惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

cs updates on arXiv.org

End-to-End Intracortical Speech Decoding from Neural Activity Attested Tool-Server Admission: A Security Extension to the Model Context Protocol Enhancing Reliability in LLM-Based Secure Code Generation TUBE: Tangent Upper Bound on Evidence for Discrete Diffusion Language Models Cross-Modal Action Recognition in Egocentric Video Using Mamba: Integrating RGB and Hand Skeleton Streams via CLS Token Fusion Strategies A Comprehensive Evaluation of Vertex Elimination Algorithms for Algorithmic Differentiation Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence How Well Do Models Follow Their Constitutions? Polar: Agentic RL on Any Harness at Scale PrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasets Program Synthesis for Non-Linear Real Arithmetic: Going Beyond Realizability Modernizing User Privacy Preference Measurement through GPPI: A GDPR-aligned Privacy Preference Item Bank CRISP -- Clustering-Based Redundancy-Reduced Instance Sampling for Pathology Case Representation and Retrieval ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale Private Adaptive Covariance Estimation via Gaussian Graphical Models Discovering Lexical Gaps Using Embeddings from Multilingual LLMs Identifying and Mitigating Systemic Measurement Bias in Production LLM Inference Benchmarks Ant Backpressure Routing for Dynamic Wireless Multi-hop Networks with Mixed Traffic Patterns Accuracy Analysis of the Proxy Point Method with Applications to Some Toeplitz Matrices An Empirical Evaluation of LLM-Generated Code Security Across Prompting Methods A lift for input-convex neural network training RxGS: Receiver-Generalizable 3D Gaussian Splatting for Radio-Frequency Data Synthesis Can Graph-Based Microservice Performance Detection Be Used for Microservice Intrusion Detection? Deep-Research Agents Can Be Poisoned via User-Generated Content ArtSplat: Feed-Forward Articulated 3D Gaussian Splatting from Sparse Multi-State Uncalibrated Views AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust Safety-Oriented Routing Analysis of Mixtral MoE Under Benign and Harmful Prompts Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions Faithfulness as Information Flow: Evaluating and Training Faithful Chain-of-Thought Reasoning From One-Pass SGD to Data Reuse: Mini-Batch Scaling Laws in Sketched Linear Regression LLMs Show No Signs Of Individuated Metacognition Fourier Feature Pyramids for Physics-Informed Neural Networks DRInQ: Evaluating Conversational Implicature with Controlled Context Variation Analyzing the Effects of Two-Stage Peer Evaluation Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows Concept Drift Adaptation Using Self-Supervised and Reinforcement Learning In Android Malware Detection Sketch Bug: Using Sketch-Based Input for Interactive Code Debugging MeVer at CheckThat! 2026: Cluster-Aware Hard-Negative Mining for Multilingual Scientific-Source Retrieval Bayesian Rational Search Engine User ECo-MoE: Embodiment-Conditioned Mixture of Experts Increases the Evolvability of Robots Rubato: Transcribing Piano Music with Timestamps An Interactive Paradigm for Deep Research Resident KV Claims: A Conformance Contract for Future Reuse under Active KV Pressure GIBLy: Improving 3D Semantic Segmentation through an Architecture-Agnostic Lightweight Geometric Inductive Bias Layer Learning regime-dependent governing equations: A symbolic decision tree approach When Does Synthetic Patent Data Help? Volume-Fidelity Trade-offs in Low-Resource Multi-Label Classification Humans Cannot Detect AI-Generated Media But Communities May -- For Now: Collaborative AI Detection in r/RealOrAI on Reddit Plume Segmentation from MethaneSAT with Cross-Sensor Transfer Learning and Physics-Informed Postprocessing Unlocking Apple's Private Cloud Compute: An Analysis of Privacy-Preserving Artificial Intelligence LEARNT: A Practical Estimator for Cardinality of LIKE Queries with Formal Accuracy Guarantees CoDA: Color Distribution Probing for Efficient and Generalizable AI-Generated Image Detection On Permutation Groups of Cyclic Codes over Finite Fields Terrain-Adaptive Grouser Wheel for Optimal Planetary Exploration: Design and Experimental Investigation Reframing LLM Agent Security as an Agent-Human Interaction Problem Benchmarking Patent Embeddings: A Multi-Task Evaluation of 22 Models Across Retrieval, Classification, and Clustering Toward Enactive Artificial Intelligence Improving Labeling Consistency with Detailed Constitutional Definitions and AI-Driven Evaluation QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks A Survey of Text and Speech Resources for Hausa and Fongbe: Availability, Quality, and Gaps for NLP Development How Far Will They Go? Red-Teaming Online Influence with Large Language Models RAS: Reflection-Augmented Scaling with In-Context Learning for Executable Cypher Query Generation A Reproducible Universal Dependencies-Style Pipeline for Katharevousa Greek Parliamentary Text Memorization Dynamics of Fill-in-the-Middle Pretraining A Proactive Multi-Agent Dialogue Framework for Assessing Social Language Disorder Traits in Autism Brain-LLM Alignment Tracks Training Data, Not Typology DreamerNLplus: Interpretable Modeling of Mental Health Dynamics from Social Media Timelines using Hybrid Rule-Based and RAG Methods NeuroNL2LTL: A Neurosymbolic Framework for Natural Language Translation of Linear Temporal Logic RMA: an Agentic System for Research-Level Mathematical Problems DFKI-MLT at SemEval-2026 TASK 7: Steering Multilingual Models Towards Cultural Knowledge SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research The Efficiency Frontier: A Unified Framework for Cost-Performance Optimization in LLM Context Management Energy per Successful Goal: Goal-Level Energy Accounting for Agentic AI Systems A Comparative Evaluation of Structural Topic Models and BERTopic for Short, Open-Ended Survey Responses ImProver 2: Iteratively Self-Improving LMs for Neurosymbolic Proof Optimization Self-Improving In-Context Learning Redrawing the AI Map: A Theory of Accountability Boundaries in Agentic Ecosystems Foundation Protocol: A Coordination Layer for Agentic Society Convergence Without Understanding: When Language Models Agree on Representations but Disagree on Reasoning GENSTRAT: Toward a Science of Strategic Reasoning in Large Language Models AraHopeCorpus: Annotation Guidelines and Dataset for Hope Speech in Arabic Social Media Crisis Discourse Design and Report Benchmarks for Knowledge Work ClimateChat-300K: A Multi-Modal Facebook Dataset for Understanding Diverse Perspectives in Climate Communication Parallel Context Compaction for Long-Horizon LLM Agent Serving Emotion Recognition in Sign Language Conversation Ontological Knowledge Blocks: Executable Compliance and Profile-Based Validation for Trustworthy AI Systems GEM-4D: Geometry-Enhanced Video World Models for Robot Manipulation Cultural Adaptation in Large Language Models for Political Discourse DART: Semantic Recoverability for Structured Tool Agents Seeing without Looking: Do Vision-Language Benchmarks Really Test Vision? From Correctness to Preference: A Framework for Personalized Agentic Reinforcement Learning Human-in-the-Loop Multi-Agent Ventilator Decision Support with Contextual Bandit Preference Learning Suicide Risk Assessment from AI-powered Video Surveillance: An Interpretable Framework for Prevention in Metro Stations Beyond Binary Edits Robust Multimodal Knowledge Editing with Adversarial Subspace Alignment How Human-Like Are Large Language Models? A Register-Aware Linguistic Evaluation Framework OnePred: Next-Query Prediction via Recursive Intent Memory in Multi-Turn Conversations Exploiting Longitudinal Context in Clinician-Verified Interactive Lesion Tracking ChartFI: Benchmarking Faithfulness and Insightfulness of Chart Descriptions from Multimodal Large Language Models An AI-Driven Framework for Energy-Efficient Environmental Monitoring in Smart Cities Using Edge Intelligence VisAnalog: A Diagnostic Suite for Visual Concept Transfer on Natural Images
Five Queries Are Enough: Query-Efficient and Surrogate-Free Membership Inference Attacks on RAG via Entailment
Nguyen Linh · 2026-05-26 · via cs updates on arXiv.org

View PDF HTML (experimental)

Abstract:Retrieval-augmented generation (RAG) has become central to large language model (LLM) deployments, grounding responses in enterprise or proprietary data to reduce hallucinations. However, this design introduces a new privacy risk: model outputs may signal the presence of specific documents in the retrieval corpus, enabling membership inference attacks (MIAs) that leak sensitive information. Existing MIAs are feasible, but they often rely on easily detected templated queries or require many non-templated yet costly and repetitive queries, limiting practicality. We ask: Can an adversary launch a limited-budget, surrogate-free, stealthy, and defense-agnostic membership inference attack using non-templated queries? We present MEntA (Membership Entailment Attack), a query-efficient MIA that leverages natural-language entailment to maximize information gained per query. By asking low-cost, broad, information-seeking questions and measuring entailment between model responses and candidate documents, MEntA eliminates the need for costly shadow models and large query budgets. Across NFCorpus, SCIDOCS, and TREC-COVID, MEntA achieves up to 0.991 AUC with only 5 queries, outperforming prior methods by 0.20 to 0.50 AUC under equivalent conditions. It remains effective under state-of-the-art (SOTA) RAG defenses, while current detectors either miss MEntA or flag benign queries at high rates. Regarding cost, MEntA reduces total attack cost by up to 65 $\times$ lower compared to SOTA attacks under the same attack setting. Our findings expose the feasibility of realistic, low-cost privacy leakage in RAG systems and highlight the urgent need for privacy-aware retrieval and defense mechanisms.
Comments: Accepted to USENIX Security 2026
Subjects: Cryptography and Security (cs.CR)
Cite as: arXiv:2605.24312 [cs.CR]
  (or arXiv:2605.24312v1 [cs.CR] for this version)
  https://doi.org/10.48550/arXiv.2605.24312

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Nguyen Linh Bao Nguyen [view email]
[v1] Sat, 23 May 2026 00:38:59 UTC (1,300 KB)