ATWL: A Formal Language for Representing, Comparing, and Reusing Visual Analytics Workflows
Natalia Andr · 2026-05-26 · via cs.AI updates on arXiv.org

Abstract: Visual analytics (VA) workflows are inherently complex, involving data transformation, feature engineering, visual representation, and human interpretation. They are typically described in unstructured prose, hindering systematic comparison, reuse of proven strategies, and training of novices. We present Artifact-Transform Workflow Language (ATWL), a domain-agnostic, declarative language that formally represents VA workflows by capturing their structure and underlying analytical intent. ATWL is built upon a modular ontology of eight artifact types (entities, features, arrangements, visualisations, patterns, models, knowledge, specifications) and transforms characterised by standardised intents (e.g., define-unit, characterise, contextualise, abstract). To show that formalisation effort need not impede adoption, we extract workflows from research papers through supervised interaction with LLM agents, reducing the human role to review and refinement. Using this process, we constructed a library of seventeen ATWL workflows from published VA papers. Cross-workflow analysis reveals structural regularities -- a recurrent meta-structure, recurring motifs, reusable building blocks, diverse iterative strategies, and cross-domain equivalences -- that remain invisible in prose. We further evaluate practical utility through a controlled experiment in which the same LLM addressed two analytical problems with the library supplied either as original papers or as ATWL representations. Both forms enabled useful recommendations, but the formal representation systematically added explicit iteration structure, typed data flow, fragment-level adaptation provenance, and compactness supporting scaling beyond what prose libraries can fit in an LLM's context. ATWL enables a transition from narrative descriptions to formally represented, comparable, and reusable analytical knowledge.
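
As a rough illustration of what "typed data flow" over artifacts and transforms could look like, here is a minimal Python sketch. It is not ATWL syntax, which the abstract does not show. The eight artifact types and the intent names come from the abstract; the class names (`ArtifactType`, `Transform`, `Workflow`), the `check_data_flow` method, and the pairing of each intent with particular input and output types are assumptions made only for this example.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from enum import Enum


class ArtifactType(Enum):
    # The eight artifact types listed in the abstract.
    ENTITIES = "entities"
    FEATURES = "features"
    ARRANGEMENTS = "arrangements"
    VISUALISATIONS = "visualisations"
    PATTERNS = "patterns"
    MODELS = "models"
    KNOWLEDGE = "knowledge"
    SPECIFICATIONS = "specifications"


@dataclass(frozen=True)
class Transform:
    # Hypothetical record: an intent label, typed inputs, and one typed output.
    intent: str
    inputs: tuple[ArtifactType, ...]
    output: ArtifactType


@dataclass
class Workflow:
    # Hypothetical container: an ordered list of transforms.
    name: str
    steps: list[Transform] = field(default_factory=list)

    def check_data_flow(self, initial: set[ArtifactType]) -> bool:
        """Return True if each step's inputs are available when it runs."""
        available = set(initial)
        for step in self.steps:
            if not set(step.inputs) <= available:
                return False
            available.add(step.output)
        return True

    def intents(self) -> list[str]:
        """Return the sequence of intent labels, e.g. for comparing workflows."""
        return [step.intent for step in self.steps]


# Illustrative workflow; the input/output types per intent are assumed.
demo = Workflow(
    name="demo",
    steps=[
        Transform("define-unit", (ArtifactType.SPECIFICATIONS,), ArtifactType.ENTITIES),
        Transform("characterise", (ArtifactType.ENTITIES,), ArtifactType.FEATURES),
        Transform("abstract", (ArtifactType.FEATURES,), ArtifactType.PATTERNS),
    ],
)

if __name__ == "__main__":
    print(demo.intents())
    print(demo.check_data_flow({ArtifactType.SPECIFICATIONS}))
```

Representing each step as a typed record is what makes checks like `check_data_flow` and comparisons of intent sequences mechanical, which is harder to do when a workflow exists only as prose.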
Subjects: Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
Cite as: arXiv:2605.25489 [cs.AI]
  (or arXiv:2605.25489v1 [cs.AI] for this version)
  https://doi.org/10.48550/arXiv.2605.25489

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Gennady Andrienko
[v1] Mon, 25 May 2026 06:51:02 UTC (109 KB)