惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Jina AI
Jina AI
博客园_首页
C
Check Point Blog
博客园 - 三生石上(FineUI控件)
爱范儿
爱范儿
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
人人都是产品经理
人人都是产品经理
腾讯CDC
N
News and Events Feed by Topic
O
OpenAI News
T
Troy Hunt's Blog
Help Net Security
Help Net Security
雷峰网
雷峰网
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
量子位
Hacker News - Newest:
Hacker News - Newest: "LLM"
Schneier on Security
Schneier on Security
N
News and Events Feed by Topic
酷 壳 – CoolShell
酷 壳 – CoolShell
Security Archives - TechRepublic
Security Archives - TechRepublic
TaoSecurity Blog
TaoSecurity Blog
Google Online Security Blog
Google Online Security Blog
T
Tenable Blog
NISL@THU
NISL@THU
L
LINUX DO - 最新话题
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
I
Intezer
小众软件
小众软件
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
S
Security @ Cisco Blogs
V
V2EX
Apple Machine Learning Research
Apple Machine Learning Research
The Cloudflare Blog
大猫的无限游戏
大猫的无限游戏
The Hacker News
The Hacker News
T
Tailwind CSS Blog
Google DeepMind News
Google DeepMind News
T
Threatpost
宝玉的分享
宝玉的分享
WordPress大学
WordPress大学
P
Palo Alto Networks Blog
Forbes - Security
Forbes - Security
博客园 - 司徒正美
罗磊的独立博客
博客园 - 叶小钗
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
V
Visual Studio Blog
C
Cisco Blogs

cs updates on arXiv.org

Beyond Binary Edits Robust Multimodal Knowledge Editing with Adversarial Subspace Alignment Agentic Proving for Program Verification MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection ChartFI: Benchmarking Faithfulness and Insightfulness of Chart Descriptions from Multimodal Large Language Models OnePred: Next-Query Prediction via Recursive Intent Memory in Multi-Turn Conversations OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents One Policy, Infinite NPCs: Persona-Traceable Shared RL Policies for Scalable Game Agents How Human-Like Are Large Language Models? A Register-Aware Linguistic Evaluation Framework Benchmarking Google Embeddings 2 against Open-Source Models for Multilingual Dense Retrieval and RAG Systems Structure-Guided Entity Resolution: Fine-Tuning LLMs for Robust Name Matching in Complex Linguistic Contexts Solving the Aircraft Disassembly Scheduling Problem Co-ReAct: Rubrics as Step-Level Collaborators for ReAct Agents CP or DP? Why Not Both: A Case Study in the Partial Shop Scheduling Problem Asking For An Old Friend: Diagnosing and Mitigating Temporal Failure Modes in LLM-based Statutory Question Answering EDGE-OPD: Internalizing Privileged Context with Evidence Guided On-Policy Distillation ARES: Automated Rubric Synthesis for Scalable LLM Reinforcement Learning SSDAU: Structured Semantic Data Augmentation for Joint Entity and Relation Extraction Naturalistic measure of social norms alignment Articulatory strategy as a source of variation in acoustic vowel dynamics When Planning Fails Despite Correct Execution: On Epistemic Calibration for LLM-Based Multi-Agent Systems EquiSumm : A Gender Bias-Aware Framework for Inclusive Tweet Summarization Metacognition as Reward: Reinforcing LLM Reasoning via Knowledge and Regulation Signals From Correctness to Preference: A Framework for Personalized Agentic Reinforcement Learning Cultural Adaptation in Large Language Models for Political Discourse Emotion Recognition in Sign Language Conversation ClimateChat-300K: A Multi-Modal Facebook Dataset for Understanding Diverse Perspectives in Climate Communication AraHopeCorpus: Annotation Guidelines and Dataset for Hope Speech in Arabic Social Media Crisis Discourse Human-in-the-Loop Multi-Agent Ventilator Decision Support with Contextual Bandit Preference Learning Convergence Without Understanding: When Language Models Agree on Representations but Disagree on Reasoning DART: Semantic Recoverability for Structured Tool Agents Ontological Knowledge Blocks: Executable Compliance and Profile-Based Validation for Trustworthy AI Systems Parallel Context Compaction for Long-Horizon LLM Agent Serving When Is Next-Token Prediction Useful? Marginalization, Ergodicity, Mixture Identifiability, Local Sufficiency, RAG, Tools, and Programming Design and Report Benchmarks for Knowledge Work GENSTRAT: Toward a Science of Strategic Reasoning in Large Language Models Foundation Protocol: A Coordination Layer for Agentic Society AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery Hidden Human-Like Nature of Machine-Generated Texts: Theory and Detection Enhancement Self-Improving In-Context Learning Redrawing the AI Map: A Theory of Accountability Boundaries in Agentic Ecosystems Positional Failures in Long-Context LLMs: A Blind Spot in Reasoning Benchmarks Fast-dDrive: Efficient Block-Diffusion VLM for Autonomous Driving Same Model, Different Weakness: How Language and Modality Reshape the Jailbreak Attack Surface in Frontier MLLMs When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening As X, Do Y: How Persona and Task Combine in Instruction-Tuned LLMs VisAnalog: A Diagnostic Suite for Visual Concept Transfer on Natural Images Exploiting Longitudinal Context in Clinician-Verified Interactive Lesion Tracking CoReVAD: A Contextual Reasoning Framework for Training-Free Video Anomaly Detection Inconsistency-aware Multimodal Schrödinger Bridge for Deepfake Localization Inductive Deductive Synthesis: Enabling AI to Generate Formally Verified Systems A Fine-Tuned BERT Classifier for Personal-Letter Titles in Late-Ming and Early-Qing Collected Works A Comparative Evaluation of Structural Topic Models and BERTopic for Short, Open-Ended Survey Responses PathCal: State-Aware Reflection-Marker Calibration for Efficient Reasoning The Efficiency Frontier: A Unified Framework for Cost-Performance Optimization in LLM Context Management Flow Mismatching: Unsupervised Anomaly Detection via Velocity Discrepancies in Flow Matching Models DFKI-MLT at SemEval-2026 TASK 7: Steering Multilingual Models Towards Cultural Knowledge RoboSurg-VQA: A Multimodal Benchmark for Surgical Segmentation-Aware Visual Question Answering What Training Data Teaches RL Memory Agents: An Empirical Study of Curriculum Effects in Memory-Augmented QA Dithering Defense: Adversarial Robustness of Vision Foundation Models via Multi-Level Floyd-Steinberg Dithering Millimeter-wave Imaging for Anthropometric Body Measurement Model Collapse as Cultural Evolution DreamerNLplus: Interpretable Modeling of Mental Health Dynamics from Social Media Timelines using Hybrid Rule-Based and RAG Methods The TIME Machine: On The Power of Motion for Efficient Perception HawkesLLM: Semantic Uncertainty Propagation in Agentic Text Simulation Do Language Models Know What Not to Say? Causal Evidence for Statistical Preemption in LLMs Multilingual Steering by Design: Multilingual Sparse Autoencoders and Principled Layer Selection Sparse Autoencoders Map Brain-LLM Alignment onto Cortical Semantic Topography Brain-LLM Alignment Tracks Training Data, Not Typology The Deterministic Horizon: Impossibility Results as Design Specifications for Trustworthy AI Systems Scene Reconstruction as Mapping Priors for 3D Detection CoMoGen: COntrollable MOtion Dynamics and Interactions with Mask-Guided Video GENeration A Proactive Multi-Agent Dialogue Framework for Assessing Social Language Disorder Traits in Autism Memorization Dynamics of Fill-in-the-Middle Pretraining A Reproducible Universal Dependencies-Style Pipeline for Katharevousa Greek Parliamentary Text When AI Takes Sides on Questions of Faith: Persistent Asymmetries in AI-Mediated Faith Guidance Can AI Guess What You Know? Performance Comparison of Large Language Models for Human Domain Knowledge Estimation From Communication Logs Graph Alignment Topology as an Inductive Bias for Grounding Detection GazeBehavior Annotation Toolkit (GBAT): AI-powered toolkit for automatic annotation of egocentric eye-tracking and video data of child-caregiver interaction Improved Vision-to-Chart Buoy Association with Learned World-to-Image Projection Learnability-Informed Fine-Tuning of Diffusion Language Models RAS: Reflection-Augmented Scaling with In-Context Learning for Executable Cypher Query Generation VideoOdyssey: A Benchmark for Ultra-Long-Context and Omni-Modal Video Understanding EVE-Agent: Evidence-Verifiable Self-Evolving Agents Suicide Risk Assessment from AI-powered Video Surveillance: An Interpretable Framework for Prevention in Metro Stations Seeing without Looking: Do Vision-Language Benchmarks Really Test Vision? Mediative Fuzzy Logic: From Type-1 Foundations to Type-2, Type-3 and Quantum Extensions ImProver 2: Iteratively Self-Improving LMs for Neurosymbolic Proof Optimization Energy per Successful Goal: Goal-Level Energy Accounting for Agentic AI Systems GEM-4D: Geometry-Enhanced Video World Models for Robot Manipulation How Far Will They Go? Red-Teaming Online Influence with Large Language Models SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research RMA: an Agentic System for Research-Level Mathematical Problems NeuroNL2LTL: A Neurosymbolic Framework for Natural Language Translation of Linear Temporal Logic BOHM: Zero-Cost Hierarchical Attribution for Compound AI Systems GAGPO: Generalized Advantage Grouped Policy Optimization Knowledge Distillation for Low-Resource Open-source Text-to-SQL Model Query-Adaptive Semantic Chunking for Retrieval-Augmented Generation: A Dynamic Strategy with Contextual Window Expansion A Survey of Text and Speech Resources for Hausa and Fongbe: Availability, Quality, and Gaps for NLP Development Evaluating Large Language Models in a Complex Hidden Role Game An AI-Driven Framework for Energy-Efficient Environmental Monitoring in Smart Cities Using Edge Intelligence
CloakLM: Obfuscating GPU Memory Layout to Mitigate Model Ex-filtration for Serving
[Submitted on 16 Jun 2026] · 2026-06-18 · via cs updates on arXiv.org

View PDF

Abstract:Large foundation models deployed on third-party and shared accelerator infrastructure face a practical risk of model exfiltration that existing defenses do not fully address. In common serving deployments, model providers control the VM or bare-metal serving stack but not the surrounding hardware substrate. The host to GPU interconnect, accelerator fabric, and neighboring infrastructure components remain outside the tenant's trust boundary and have been shown to be exploitable. Hermes demonstrates lossless DNN reconstruction from passive PCIe observation, while TunnelS exfiltrates HBM contents at high throughput via driver-level access without disrupting inference. Co-tenant VMs can further access memory-mapped interfaces or misconfigured RDMA regions without physical co-location. These attacks exploit a common property of ML systems: model weights are stored in large, contiguous, and repeatedly accessed memory regions, making intercepted PCIe transfers and HBM dumps rich enough to reveal model structure and parameters.
We present CloakLM, a software-only memory-obfuscation framework that removes this structural regularity without changing the inference stack's logical view of memory. CloakLM combines three mechanisms: PCIe traffic shaping, inter- and intra-layer weight shuffling, and physical HBM page remapping. Authorized execution retains a valid virtual memory layout with negligible overhead, while unauthorized observers see fragmented and semantically incoherent state.
CloakLM integrates with vLLM and PyTorch, requires no hardware changes, and complements confidential computing. Evaluation on distributed inference workloads using LLaMA and Qwen models shows near-native performance while significantly increasing resistance to PCIe snooping and HBM dump attacks, making inference-time model exfiltration substantially less practical.

Submission history

From: Kunal Jain [view email]
[v1] Tue, 16 Jun 2026 18:47:47 UTC (2,488 KB)