惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

The Cloudflare Blog
G
GRAHAM CLULEY
Spread Privacy
Spread Privacy
V
Vulnerabilities – Threatpost
Security Latest
Security Latest
T
Threatpost
Scott Helme
Scott Helme
K
KPMG report finds enterprise disconnect between AI and its ROI | CIO
Cisco Talos Blog
Cisco Talos Blog
T
The Exploit Database - CXSecurity.com
C
Cisco Blogs
Attack and Defense Labs
Attack and Defense Labs
Hacker News - Newest:
Hacker News - Newest: "LLM"
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
Intezer
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
P
Privacy International News Feed
Project Zero
Project Zero
Google Online Security Blog
Google Online Security Blog
O
OpenAI News
Forbes - Security
Forbes - Security
C
CERT Recently Published Vulnerability Notes
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
The Hacker News
The Hacker News
T
Threat Research - Cisco Blogs
Security Archives - TechRepublic
Security Archives - TechRepublic
C
Cybersecurity and Infrastructure Security Agency CISA
T
Tenable Blog
Webroot Blog
Webroot Blog
A
Arctic Wolf
S
Schneier on Security
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
Google DeepMind News
Google DeepMind News
爱范儿
爱范儿
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
V
V2EX
Help Net Security
Help Net Security
大猫的无限游戏
大猫的无限游戏
宝玉的分享
宝玉的分享
雷峰网
雷峰网
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
罗磊的独立博客
IT之家
IT之家
Know Your Adversary
Know Your Adversary
博客园_首页
有赞技术团队
有赞技术团队
月光博客
月光博客

cs.AI updates on arXiv.org

Physics-Informed State Space Models for Reliable Solar Irradiance Forecasting in Off-Grid Systems Detecting Safety Violations Across Many Agent Traces Solving Physics Olympiad via Reinforcement Learning on Physics Simulators Budget-Aware Uncertainty for Radiotherapy Segmentation QA Using nnU-Net C-ReD: A Comprehensive Chinese Benchmark for AI-Generated Text Detection Derived from Real-World Prompts A Mechanistic Analysis of Looped Reasoning Language Models ClawGuard: A Runtime Security Framework for Tool-Augmented LLM Agents Against Indirect Prompt Injection GenTac: Generative Modeling and Forecasting of Soccer Tactics ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks Grounded World Model for Semantically Generalizable Planning Discourse Diversity in Multi-Turn Empathic Dialogue Endogenous Information in Routing Games: Memory-Constrained Equilibria, Recall Braess Paradoxes, and Memory Design Evaluating Cooperation in LLM Social Groups through Elected Leadership On the Robustness of Watermarking for Autoregressive Image Generation SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context A Mamba-Based Multimodal Network for Multiscale Blast-Induced Rapid Structural Damage Assessment Agentic Driving Coach: Robustness and Determinism of Agentic AI-Powered Human-in-the-Loop Cyber-Physical Systems Fairness is Not Flat: Geometric Phase Transitions Against Shortcut Learning Legal2LogicICL: Improving Generalization in Transforming Legal Cases to Logical Formulas via Diverse Few-Shot Learning AffordSim: A Scalable Data Generator and Benchmark for Affordance-Aware Robotic Manipulation Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents SCNO: Spiking Compositional Neural Operator -- Towards a Neuromorphic Foundation Model for Nuclear PDE Solving CUTEv2: Unified and Configurable Matrix Extension for Diverse CPU Architectures with Minimal Design Overhead Symmetry Reveals Layerwise Dynamics: How Transformers Perform In-Context Classification A Triadic Suffix Tokenization Scheme for Numerical Reasoning Minimizing classical resources in variational measurement-based quantum computation for generative modeling Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo bacpipe: a Python package to make bioacoustic deep learning models accessible FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning Time is Not a Label: Continuous Phase Rotation for Temporal Knowledge Graphs and Agentic Memory NovBench: Evaluating Large Language Models on Academic Paper Novelty Assessment A collaborative agent with two lightweight synergistic models for autonomous crystal materials research Problem Reductions at Scale: Agentic Integration of Computationally Hard Problems Limited Perfect Monotonical Surrogates constructed using low-cost recursive linkage discovery with guaranteed output From Translation to Superset: Benchmark-Driven Evolution of a Production AI Agent from Rust to Python EdgeCIM: A Hardware-Software Co-Design for CIM-Based Acceleration of Small Language Models Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization Not All Forgetting Is Equal: Architecture-Dependent Retention Dynamics in Fine-Tuned Image Classifiers Deep Learning for Sequential Decision Making under Uncertainty: Foundations, Frameworks, and Frontiers Lectures on AI for Mathematics METER: Evaluating Multi-Level Contextual Causal Reasoning in Large Language Models Quantization Dominates Rank Reduction for KV-Cache Compression ADD for Multi-Bit Image Watermarking Anthropogenic Regional Adaptation in Multimodal Vision-Language Model SLALOM: Simulation Lifecycle Analysis via Longitudinal Observation Metrics for Social Simulation Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration Think Before you Write: QA-Guided Reasoning for Character Descriptions in Books Hardening x402: PII-Safe Agentic Payments via Pre-Execution Metadata Filtering METRO: Towards Strategy Induction from Expert Dialogue Transcripts for Non-collaborative Dialogues Emulating Non-Differentiable Metrics via Knowledge-Guided Learning: Introducing the Minkowski Image Loss Efficient Emotion-Aware Iconic Gesture Prediction for Robot Co-Speech Retrieval as Generation: A Unified Framework with Self-Triggered Information Planning One Scale at a Time: Scale-Autoregressive Modeling for Fluid Flow Distributions From Agent Loops to Structured Graphs:A Scheduler-Theoretic Framework for LLM Agent Execution From Redaction to Restoration: Deep Learning for Medical Image Anonymization and Reconstruction Minimal Embodiment Enables Efficient Learning of Number Concepts in Robot Learning from Contrasts: Synthesizing Reasoning Paths from Diverse Search Trajectories The Missing Knowledge Layer in Cognitive Architectures for AI Agents CoRe-ECG: Advancing Self-Supervised Representation Learning for 12-Lead ECG via Contrastive and Reconstructive Synergy Governance by Design: A Parsonian Institutional Architecture for Internet-Wide Agent Societies Do LLMs Know Tool Irrelevance? Demystifying Structural Alignment Bias in Tool Invocations S$^3$: Structured Sparsity Specification Network Effects and Agreement Drift in LLM Debates The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems Learning to Forget -- Hierarchical Episodic Memory for Lifelong Robot Deployment BankerToolBench: Evaluating AI Agents in End-to-End Investment Banking Workflows 3D-Anchored Lookahead Planning for Persistent Robotic Scene Memory via World-Model-Based MCTS Enhancing Multimodal Large Language Models for Ancient Chinese Character Evolution Analysis via Glyph-Driven Fine-Tuning The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping Consistency of AI-Generated Exercise Prescriptions: A Repeated Generation Study Using a Large Language Model THEIA: Learning Complete Kleene Three-Valued Logic in a Pure-Neural Modular Architecture RECIPER: A Dual-View Retrieval Pipeline for Procedure-Oriented Materials Question Answering Regional Explanations: Bridging Local and Global Variable Importance Exploring Knowledge Conflicts for Faithful LLM Reasoning: Benchmark and Method CocoaBench: Evaluating Unified Digital Agents in the Wild MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis Taking a Pulse on How Generative AI is Reshaping the Software Engineering Research Landscape EmbodiedGovBench: A Benchmark for Governance, Recovery, and Upgrade Safety in Embodied Agent Systems Cost-optimal Sequential Testing via Doubly Robust Q-learning Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model From Answers to Arguments: Toward Trustworthy Clinical Diagnostic Reasoning with Toulmin-Guided Curriculum Goal-Conditioned Learning BoxTuning: Directly Injecting the Object Box for Multimodal Model Fine-Tuning Semantic-Geometric Dual Compression: Training-Free Visual Token Reduction for Ultra-High-Resolution Remote Sensing Understanding Use of AI Tools: Guidelines to Maintain Academic Integrity in Computing Colleges Frugal Knowledge Graph Construction with Local LLMs: A Zero-Shot Pipeline, Self-Consistency and Wisdom of Artificial Crowds ActorMind: Emulating Human Actor Reasoning for Speech Role-Playing Efficient Training for Cross-lingual Speech Language Models Bottleneck Tokens for Unified Multimodal Retrieval E2E-REME: Towards End-to-End Microservices Auto-Remediation via Experience-Simulation Reinforcement Fine-Tuning Guardrails Beat Guidance: A Large-Scale Study of Rules, Skills, and Persistent Configuration for Coding Agents Towards Proactive Information Probing: Customer Service Chatbots Harvesting Value from Conversation Hodoscope: Unsupervised Monitoring for AI Misbehaviors Lightweight Low-Light Image Enhancement via Distribution-Normalizing Preprocessing and Depthwise U-Net PRISM Risk Signal Framework: Hierarchy-Based Red Lines for AI Behavioral Risk AI Integrity: A New Paradigm for Verifiable AI Governance Shared Emotion Geometry Across Small Language Models: A Cross-Architecture Study of Representation, Behavior, and Methodological Confounds A Systematic Analysis of the Impact of Persona Steering on LLM Capabilities Intelligent Approval of Access Control Flow in Office Automation Systems via Relational Modeling
Gaussian DP for Reporting Differential Privacy Guarantees in Machine Learning
Juan Felipe Gomez, Bogdan Kulynych, Georgios Kaissis, Flavio P. · 2025-03-14 · via cs.AI updates on arXiv.org

Current practices for reporting differential privacy (DP) guarantees for machine learning (ML) algorithms such as DP-SGD provide an incomplete and potentially misleading picture. For instance, if only a single $(\varepsilon, δ)$ is known about a mechanism, standard analyses show that there could exist highly accurate inference attacks against training data records, when, upon a more careful analysis, such accurate attacks do not exist for most practical mechanisms. In this position paper, we argue that using _non-asymptotic_ Gaussian Differential Privacy (GDP) as the primary means of communicating DP guarantees in ML avoids these potential downsides. Using two recent developments in the DP literature: (i) open-source numerical accountants capable of computing the privacy profile and $f$-DP curves of DP-SGD to arbitrary accuracy, and (ii) a decision-theoretic metric over DP representations, we show how to provide non-asymptotic bounds on GDP using numerical accountants, and show that GDP can capture the entire privacy profile of DP-SGD and related algorithms with virtually no error, as quantified by the metric. To support our claims, we investigate the privacy profiles of state-of-the-art DP large-scale image classification, and the TopDown algorithm for the U.S. Decennial Census, observing that GDP fits their profiles remarkably well in all cases. We conclude with a discussion on the strengths and weaknesses of this approach, and discuss which other privacy mechanisms could benefit from GDP.