惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

N
News and Events Feed by Topic
Malwarebytes
Malwarebytes
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Cybersecurity and Infrastructure Security Agency CISA
F
Future of Privacy Forum
C
Cisco Blogs
T
The Exploit Database - CXSecurity.com
A
Arctic Wolf
S
Securelist
K
Kaspersky official blog
S
Schneier on Security
T
ThreatConnect
T
Tenable Blog
Spread Privacy
Spread Privacy
T
True Tiger Recordings
AWS News Blog
AWS News Blog
F
Fox-IT International blog
量子位
T
Threatpost
V
Vulnerabilities – Threatpost
C
CERT Recently Published Vulnerability Notes
Cisco Talos Blog
Cisco Talos Blog
GbyAI
GbyAI
宝玉的分享
宝玉的分享
腾讯CDC
G
Google Developers Blog
aimingoo的专栏
aimingoo的专栏
Cyberwarzone
Cyberwarzone
有赞技术团队
有赞技术团队
S
SegmentFault 最新的问题
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
V
Visual Studio Blog
U
Unit 42
雷峰网
雷峰网
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Simon Willison's Weblog
Simon Willison's Weblog
O
OpenAI News
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
The GitHub Blog
The GitHub Blog
The Register - Security
The Register - Security
MyScale Blog
MyScale Blog
小众软件
小众软件
A
About on SuperTechFans
Last Week in AI
Last Week in AI
Y
Y Combinator Blog
博客园 - 三生石上(FineUI控件)
美团技术团队
Google Online Security Blog
Google Online Security Blog
P
Proofpoint News Feed
MongoDB | Blog
MongoDB | Blog

cs updates on arXiv.org

Mathematical Foundations for Peer-to-Peer Lattice Computation CultivAgents: Cultivating Relationship-Centered Multi-Agent Systems for Personalized Gardening Smoothed Elicitation Complexity for Approximate $Γ$-calibration of Discrete Classification Tasks FastKernels: Benchmarking GPU Kernel Generation in Production PACE: Two-Timescale Self-Evolution for Small Language Model Agents World Machine: Towards Generative World Modeling for Time-Series Uncovering the Latent Potential of Deep Intermediate Representations Open Multimodal Datasets and Open-Source Software for Data-Driven Modeling of Multiphase Transport and Thermal Systems Steered Generation via Gradient-Based Optimization on Sparse Query Features LLM Code Smells: A Taxonomy and Detection Approach Anytime Training with Schedule-Free Spectral Optimization Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations The Attribution Contract: Feature Attribution for Generative Language Models ThriftAttention: Selective Mixed Precision for Long-Context FP4 Attention The Implicit Bias of Depth: From Neural Collapse to Softmax Codes Whose Good, Whose Place? The Moral Geography of Agentic AI for Social Good Dreaming Smoothly and Sample Efficiently with Gradient Penalized Latent Dynamics Robust OT-Guided Generative Residual Domain Adaptation for Bike-Sharing Demand Prediction under Temporal Domain Shift When Determinants Are Not Enough: Private Rare Switching Archimedean Copula Inference via Taylor-Mode AD CALAD: Channel-Aware contrastive Learning for multivariate time series Anomaly Detection Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness Any-Dimensional Invariant Universality Understanding and Improving Noisy Embedding Techniques in Instruction Finetuning Pure Exploration for a Good Policy in Reinforcement Learning with Bandit Feedback Empirical Bayes Conformal Prediction for Vision and Language Models Expand More, Shrink Less: Shaping Effective-Rank Dynamics for Dense Scaling in Recommendation Scalable Heterogeneous Graph Foundation Models for Data-Driven Optimal Power Flow in Smart Grids Label-Efficient Dataset Pruning via Semi-Supervised Pseudo-Labeling Adaptive Mass-Segmented KV Compression for Long-Context Reasoning DRL-Driven Edge-Aware Utility Optimization for Multi-Slice 6G Networks A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification PaP-NF: Probabilistic Long-Term Time Series Forecasting via Prefix-as-Prompt Reprogramming and Normalizing Flows WMAttack: Automated Attack Search for Adversarial Evaluation of World-Model Agents Assessing Predictive Models for Fairness Based on Movement Patterns Convex Low-resource Accent-Robust Language Detection in Speech Recognition Self-supervised Adversarial Purification for Graph Neural Networks RelPrism: A Multi-Faceted Pre-training Framework with Self-Generated Tasks for Relational Databases Security of LLM-generated Code: A Comparative Analysis Convex Optimization for Alignment and Preference Learning on a Single GPU Accelerating Divisible Load Processing Through Machine Learning: A Practical Framework for Large-Scale Workloads Enhancing Deep Neural Network Reliability with Refinement and Calibration Philosophical Dispositions as Behavioral Constraints for AI-Assisted Code Review: An Empirical Study Defining AI Fatigue in Academic Contexts: Dimensions, Indicators, and a Stage-Based Model Using Grounded Theory Extending Deep Event Visual Odometry with Sparse Point-Cloud Export PoisonForge: Task-Level Targeted Poisoning Benchmark for Instruction-Tuned LLMs Semantic-Aware Guided Drone Exploration for Language-Conditioned 3D Indoor Mapping Are Frontier LLMs Ready for Cybersecurity? Evidence for Vertical Foundation Models from Dual-Mode Vulnerability Benchmarks Cross-attention-based bipartite graph neural network for coupled nodal and elemental field prediction in large-deformation sheet material forming From Simulation to Discovery: AI Enabled Probabilistic Emulation of Mechanistic Crop Systems Resilience Characterization of AI-Native Wireless Receivers via Persistent Homology SCRIPT: Scalable Diffusion Policy with Multi-stage Training for Language-driven Physics-Based Humanoid Control Orbax: Distributed Checkpointing with JAX Encrypted Neural Networks without Overflows Entropy Equivalence Testing Intercloud: Eventual Consistency for Decentralised Economies via Chilling-Effect Consensus Monte Cimone v3: Where RISC-V Stands in High-Performance Computing Step in Tine: Forking Processes in Functional Choreographies Cogniscope: A Synthetic Longitudinal Benchmark and Browser-Based Evaluation Framework for Early-Risk Cognitive AI Systems Signal Temporal Logic Motion Planning via Graphs of Convex Sets MASQ: Accelerating Masked Diffusion via Stage-Wise Multi-Precision Quantization Experimental Evaluation of Data Upload Efficiency and Guiding Challenges for a Vehicular-to-Road System Using 60-GHz mmWave Ultra-Spots MixFake: Benchmarking and Enhancing Audio Deepfake Detection in Diverse Real-world Mixed Audio Prompt Overflow: What the Guardrail Inspects Is Not What the Model Infers SpikingMoE: SDPrompt-Guided Dynamic Expert Fusion in Spiking Neural Networks Cognitive offloading and the speedup illusion in human-AI interaction Effective information gathering for ore estimation, evaluation and perspectives on adaptive sampling SolarChain: Bridging Physical Law, Verifiable Trust, and Sustainable Markets for Urban Energy Resilience Orchestrating Data Collection and Computation in Green IoT Networks The Impact of AI Coding Assistants on Software Engineering: A Longitudinal Study From Preventive to Reactive: How AI Coding Assistants Transform Developers' Security Awareness Deception and Counter Deception in Adversarial Graph Traversal Game $π_0$-EqM: Equilibrium Matching for Closed-Loop Vision-Language-Action Control Combined Radar and Magnetometer Sensor Network with LoRa-Mediated Awareness for Wildlife-Vehicle Collision Prevention: A Monte Carlo Analysis Conceptual Schema Inference for Tabular Datasets using Large Language Models Query Lower Bounds for Correlation Clustering under Memory Constraints Four Simple Proprioceptive Estimators for Legged Robots SVR-MAD: A Bayesian-Inspired Framework for Posterior-Guided Multi-Agent Debate UfM*: Uncertainty from Motion* for DNN Depth Estimation Using Gaussians YASPS: A Symbolic Framework for Extensible, High-Performance IPC Simulation Remind Me To Check The Stove Before I Leave The House: Authoring Personalized Context-Aware Smart Home Reminders Using Everyday Language Positional Identifiability from Pairwise Collision Data BYOT-CPS: A Hybrid Cyber-Physical Systems Testbed for IoT Security Assessment and Platform Evaluation Open-Source METANET Calibration for Reproducible Freeway Traffic Macroscopic Simulation Holistic Grid-Forming Control to Enhance the Frequency Support from HVDC-Connected Offshore Wind Power Plants Which Superconducting Qubit Model is Good Enough? From Effective Two-Level to Circuit-Based Hamiltonians for Pulse-Level Simulation Mode-Shape Expansion Using Physics-Constrained Gaussian Process Regression ACALSim: A Scalable Parallel Simulation Framework for High-Performance System Design Space Exploration High-order Conservative Discontinuous Galerkin Methods via Implicit Penalization for the Generalized Korteweg-de Vries Equation and the Hirota-Satsuma KdV System The Closure of LCD-to-GI Reductions via Generalized Inner Products Remote Teleoperation of Endovascular Intervention Robots: A Systematic Review Convex Hybrid Modeling: An Operator-Based Approach Improved Torn Paper Coding via Local Alignment Multi-Dimensional Matching in Market Design An Axiomatic Theory of Tie-Breaking: Impossibility, Characterization, and Decomposition The Geometry of Cooperative Game Solutions: Stratified Egalitarian Shapley Values On Reed-Muller subcodes, Grassmannian partitions and sum-free functions On APN Exponents and the Differential and Boomerang Properties of Binomials in Characteristic 3 Budgeted Dynamic Trace Structures for Token-Efficient Sequential Computation StanBKT: Rethinking Parameter Estimation in Bayesian Knowledge Tracing
PIMbot: A Self-Adaptive Attack Framework for Adversarial Manipulation of Multi-Robot Reinforcement Learning
Zexin Li, Zi · 2026-05-25 · via cs updates on arXiv.org

View PDF HTML (experimental)

Abstract:Recent research has demonstrated the potential of reinforcement learning in effective multi-robot collaboration, particularly in social dilemmas where robots face a trade-off between self-interest and collective benefits. However, environmental factors such as miscommunication and adversarial robots can impact cooperation, making it crucial to explore how multi-robot communication can be manipulated to achieve different outcomes. This paper presents PIMbot, a framework that manipulates outcomes via two complementary levers: (i) incentive manipulation of the reward channel and (ii) policy manipulation of an agent's own actions. An adaptive multi-objective controller balances these levers in an online manner. Our work introduces a novel approach to manipulation in recent multi-agent RL social dilemmas that utilize a unique reward function for incentivization. By utilizing our proposed PIMbot mechanisms, a robot is able to manipulate the social dilemma environment effectively. Comprehensive experimental results demonstrate the effectiveness of our proposed methods in the Gazebo-simulated multi-robot environment. Moreover, a real embedded device case study on NVIDIA Jetson Orin Nano quantifies system cost and validates PIMbot's effectiveness on realistic autonomous embedded systems scenarios beyond simulation. Together, these results position PIMbot as a rigorous stress-test tool exposing critical vulnerabilities in multi-robot cooperative tasks.
Comments: Extension version of IROS'23
Subjects: Robotics (cs.RO)
Cite as: arXiv:2605.23027 [cs.RO]
  (or arXiv:2605.23027v1 [cs.RO] for this version)
  https://doi.org/10.48550/arXiv.2605.23027

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Zexin Li [view email]
[v1] Thu, 21 May 2026 20:50:47 UTC (6,691 KB)