惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

U
Unit 42
V
V2EX
Martin Fowler
Martin Fowler
博客园 - Franky
P
Proofpoint News Feed
P
Palo Alto Networks Blog
H
Hackread – Cybersecurity News, Data Breaches, AI and More
B
Blog
The Register - Security
The Register - Security
Latest news
Latest news
S
Security @ Cisco Blogs
Simon Willison's Weblog
Simon Willison's Weblog
Recorded Future
Recorded Future
大猫的无限游戏
大猫的无限游戏
M
Microsoft Research Blog - Microsoft Research
Scott Helme
Scott Helme
T
Tailwind CSS Blog
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Application and Cybersecurity Blog
Application and Cybersecurity Blog
T
True Tiger Recordings
有赞技术团队
有赞技术团队
I
Intezer
Cisco Talos Blog
Cisco Talos Blog
Hacker News - Newest:
Hacker News - Newest: "LLM"
The GitHub Blog
The GitHub Blog
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
T
Tenable Blog
博客园 - 叶小钗
Hugging Face - Blog
Hugging Face - Blog
Hacker News: Ask HN
Hacker News: Ask HN
S
Security Archives - TechRepublic
F
Future of Privacy Forum
爱范儿
爱范儿
PCI Perspectives
PCI Perspectives
H
Help Net Security
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
T
The Blog of Author Tim Ferriss
MyScale Blog
MyScale Blog
N
Netflix TechBlog - Medium
罗磊的独立博客
Apple Machine Learning Research
Apple Machine Learning Research
MongoDB | Blog
MongoDB | Blog
Security Latest
Security Latest
美团技术团队
博客园 - 三生石上(FineUI控件)
S
Schneier on Security
量子位
C
CERT Recently Published Vulnerability Notes
SecWiki News
SecWiki News

cs.AI updates on arXiv.org

The Impact of AI Usage and Informativeness on Skill Development in Logical Reasoning LLM-Metrics: Measuring Research Impact Through Large Language Model Memory WorkstreamBench: Evaluating LLM Agents on End-to-End Spreadsheet Tasks in Finance AOP-Wiki EMOD 3.0: Data Model Expansions and Content Evaluation Framework for Using Agentic AI to Improve Integration between AOPs and New Approach Methodologies (NAMs) Addressing the Synergy Gap: The Six Elements of the Design Space When Are Teacher Tokens Reliable? Position-Weighted On-Policy Self-Distillation for Reasoning Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs S2ED: From Story to Executable Descriptions for Consistency-Aware Story Illustration A Camera-Cooperative ISAC Framework for Multimodal Non-Cooperative UAVs Sensing Can AI Make Conflicts Worse? An Alignment Failure in LLM Deployment Across Conflict Contexts Toward AI VIS Co-Scientists: A General and End-to-End Agent Harness for Solving Complex Data Visualization Tasks Is Capability a Liability? More Capable Language Models Make Worse Forecasts When It Matters Most The Shape of Testimony: A Scalable Framework for Oral History Archive Comparison Faster Completion, Less Learning: Generative AI Reduced Study Time on Math Problems and the Knowledge They Build Investigating Concept Alignment Using Implausible Category Members The Attribution Impossibility: No Feature Ranking Is Faithful, Stable, and Complete Under Collinearity Implicit Safety Alignment from Crowd Preferences Trace2Skill: Verifier-Guided Skill Evolution for Long-Context EDA Agents Towards a General Intelligence and Interface for Wearable Health Data IdleSpec: Exploiting Idle Time via Speculative Planning for LLM Agents RefusalBench: Why Refusal Rate Misranks Frontier LLMs on Biological Research Prompts MindLoom: Composing Thought Modes for Frontier-Level Reasoning Data Synthesis Scaling Observation-aware Planning in Uncertain Domains Unlocking Proactivity in Task-Oriented Dialogue OPPO: Bayesian Value Recursion for Token-Level Credit Assignment in LLM Reasoning Understanding Perspectives of Patients, Caregivers and Clinicians towards Emerging Collaborative-decision Making Technologies AttuneBench: A Conversation-Based Benchmark for LLM Emotional Intelligence A Causal Argumentation Method for Explainability of Machine Learning Models LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems TBP-mHC: full expressivity for manifold-constrained hyper connections through transportation polytopes Protein Thoughts: Interpretable Reasoning with Tree of Thoughts and Embedding-Space Flow Matching for Protein-Protein Interaction Discovery Benchmarking and Improving Monitors for Out-Of-Distribution Alignment Failure in LLMs The Log is the Agent: Event-Sourced Reactive Graphs for Auditable, Forkable Agentic Systems Latent-space Attacks for Refusal Evasion in Language Models Beyond the Org Chart: AI and the Transformation of Invisible Work Towards a compositional semantics for quantitative confidence assessment in assurance arguments Evaluating Large Language Models as Live Strategic Agents: Provider Performance, Hybrid Decomposition, and Operational Gaps in Timed Risk Play Active Evidence-Seeking and Diagnostic Reasoning in Large Language Models for Clinical Decision Support Adapting the Interface, Not the Model: Runtime Harness Adaptation for Deterministic LLM Agents Harnesses for Inference-Time Alignment over Execution Trajectories Visibility nowcasting in South Korea: a machine learning approach to class imbalance and distribution shift SGR-Bench: Benchmarking Search Agents on State-Gated Retrieval Graph neural network explanations reveal a topological signature of disease-associated hubs in biological networks Advancing Mathematics Research with AI-Driven Formal Proof Search Evaluation of Pipelines for Data Integration into Knowledge Graphs Ex-GraphRAG: Interpretable Evidence Routing for Graph-Augmented LLMs AtelierEval: Agentic Evaluation of Humans & LLMs as Text-to-Image Prompters ExComm: Exploration-Stage Communication for Error-Resilient Agentic Test-Time Scaling Learning Altruistic Collaboration in Heterogeneous Multi-Team Systems Towards Direct Evaluation of Harness Optimizers via Priority Ranking Who Uses AI? Platforms, Workforce, and AI Exposure Adversarial Trust Poisoning in Vehicular Collaborative Perception Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning Forecasting Scientific Progress with Artificial Intelligence CausalGuard: Conformal Inference under Graph Uncertainty Deep Reinforcement Learning for Flexible Job Shop Scheduling with Random Job Arrivals Support-aware offline policy selection for advertising marketplaces Patch Hierarchical Attention Transformer for Efficient Particle Jet Tagging Planning, Scheduling, and Behavior in EV Charging Systems: A Critical Survey and Trilemma Framework Multivariate Financial Forecasting using the Chronos Time Series Foundation Models EvoScene-VLA: Evolving Scene Beliefs Inside the Action Decoder for Chunked Robot Control LLM Retrieval for Stable and Predictable Ad Recommendations The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation SMDD-Bench: Can LLMs Solve Real-World Small Molecule Drug Design Tasks? A Subjective Logic-based method for runtime confidence updates in safety arguments What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct Compiling Agentic Workflows into LLM Weights: Near-Frontier Quality at Two Orders of Magnitude Less Cost Think Thrice Before You Speak: Dual knowledge-enhanced Theory-of-Mind Reasoning for Persuasive Agents ArborKV: Structure-Aware KV Cache Management for Scaling Tree-based LLM Reasoning ECPO: Evidence-Coupled Policy Optimization for Evidence-Certified Candidate Ranking Skill Weaving: Efficient LLM Improvement via Modular Skillpacks Parametric Modular Answer Set Programs Made Declarative AI-Enabled Serious Games: Integrating Intelligence and Adaptivity in Training Systems Format-Constraint Coupling in Knowledge Graph Construction from Statistical Tables Knowledge Graph Re-engineering Along the Ontological Continuum (extended version) Meta-Soft: Leveraging Composable Meta-Tokens for Context-Preserving KV Cache Compression Autonomous LLM Agents & CTFs: A Second Look SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules Memory-Induced Supra-Competitive Outcomes Between Deep Reinforcement Learning Agents in Optimal Trade Execution MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems Measuring Cross-Modal Synergy: A Benchmark for VLM Explainability Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention CLORE: Content-Level Optimization for Reasoning Efficiency Scalable On-Policy Reinforcement Learning via Adaptive Batch Scaling Predicting Performance of Symbolic and Prompt Programs with Examples PocketAgents: A Manifest-Driven Library of Autonomous Defense Agents High-speed Networking for Giga-Scale AI Factories A Reproducible Log-Driven AutoML Framework for Interpretable Pipeline Optimization in Healthcare Risk Prediction Engineering Hybrid Physics-Informed Neural Networks for Next-Generation Electricity Systems: A State-of-the-Art Review Local Covariate Selection for Average Causal Effect Estimation without Pretreatment and Causal Sufficiency Assumptions TO-Agents: A Multi-Agent AI Pipeline for Preference-Guided Topology Optimization HarnessAPI: A Skill-First Framework for Unified Streaming APIs and MCP Tools TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks Claw AI Lab: An Autonomous Multi-Agent Research Team Meta-Learning for Rapid Adaptation in Reference Tracking of Uncertain Nonlinear Systems Thermodynamic Irreversibility of Training Algorithms KAPPS: A knowledge-based CPPS Architecture for the Circular Factory PEARL: Unbiased Percentile Estimation via Contrastive Learning for Industrial-Scale Livestream Recommendation Cross-domain benchmarks reveal when coordinated AI agents improve scientific inference from partial evidence FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation
ChronoMedicalWorld: A Medical World Model for Learning Patient Trajectories from Longitudinal Care Data
Jiangyuan Wa · 2026-05-23 · via cs.AI updates on arXiv.org

View PDF HTML (experimental)

Abstract:Long-horizon clinical simulation -- predicting how a patient's physiology evolves over years under specified interventions -- is central to chronic-disease care, yet existing electronic health record (EHR) models are predominantly discriminative, and general-purpose large language models drift under repeated interventions. We propose the \textbf{ChronoMedicalWorld Model (CMWM)}, an action-conditioned latent world-model framework for learning patient trajectories from longitudinal care data. CMWM couples a joint-embedding state encoder with a wide action encoder that admits both structured intervention indicators and free-text communication embeddings, and trains a recurrent latent transition module under a six-term objective: next-observation supervision, next-latent prediction, SIGReg latent regularisation, and three physiology-aware shape priors (slope, continuity, large-jump penalty). A closed-loop rollout-prefix protocol matches training to deployment, so the model is optimised against the same multi-step error it exhibits at inference. As a concrete case study, we instantiate CMWM for annual estimated glomerular filtration rate (eGFR) trajectory forecasting in chronic kidney disease (CKD). On a 2{,}232-patient nephrology cohort, the CKD instantiation achieves a dynamic-50\% history rollout test mean absolute error (MAE) of 7.384 and root-mean-square error (RMSE) of 10.256, against 7.964 and 11.069 for a tuned GPT-5.5 structured-prompting baseline ($-7.28\%$ MAE, $-7.35\%$ RMSE), with the gain dominated by the dialogue portion of patient--health-coach communication. The framework is not CKD-specific: its architecture, loss design, and training protocol apply to any chronic condition that can be cast as periodic clinical state interleaved with structured and conversational interventions.
Comments: 14 pages, 2 figures, 6 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as: arXiv:2605.21963 [cs.LG]
  (or arXiv:2605.21963v1 [cs.LG] for this version)
  https://doi.org/10.48550/arXiv.2605.21963

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Jiangyuan Wang [view email]
[v1] Thu, 21 May 2026 03:50:17 UTC (23 KB)