惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

U
Unit 42
V
V2EX
Martin Fowler
Martin Fowler
博客园 - Franky
P
Proofpoint News Feed
P
Palo Alto Networks Blog
H
Hackread – Cybersecurity News, Data Breaches, AI and More
B
Blog
The Register - Security
The Register - Security
Latest news
Latest news
S
Security @ Cisco Blogs
Simon Willison's Weblog
Simon Willison's Weblog
Recorded Future
Recorded Future
大猫的无限游戏
大猫的无限游戏
M
Microsoft Research Blog - Microsoft Research
Scott Helme
Scott Helme
T
Tailwind CSS Blog
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Application and Cybersecurity Blog
Application and Cybersecurity Blog
T
True Tiger Recordings
有赞技术团队
有赞技术团队
I
Intezer
Cisco Talos Blog
Cisco Talos Blog
Hacker News - Newest:
Hacker News - Newest: "LLM"
The GitHub Blog
The GitHub Blog
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
T
Tenable Blog
博客园 - 叶小钗
Hugging Face - Blog
Hugging Face - Blog
Hacker News: Ask HN
Hacker News: Ask HN
S
Security Archives - TechRepublic
F
Future of Privacy Forum
爱范儿
爱范儿
PCI Perspectives
PCI Perspectives
H
Help Net Security
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
T
The Blog of Author Tim Ferriss
MyScale Blog
MyScale Blog
N
Netflix TechBlog - Medium
罗磊的独立博客
Apple Machine Learning Research
Apple Machine Learning Research
MongoDB | Blog
MongoDB | Blog
Security Latest
Security Latest
美团技术团队
博客园 - 三生石上(FineUI控件)
S
Schneier on Security
量子位
C
CERT Recently Published Vulnerability Notes
SecWiki News
SecWiki News

cs.AI updates on arXiv.org

Evaluation of Pipelines for Data Integration into Knowledge Graphs LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules The Log is the Agent: Event-Sourced Reactive Graphs for Auditable, Forkable Agentic Systems SMDD-Bench: Can LLMs Solve Real-World Small Molecule Drug Design Tasks? Investigating Concept Alignment Using Implausible Category Members A Reproducible Log-Driven AutoML Framework for Interpretable Pipeline Optimization in Healthcare Risk Prediction Forecasting Scientific Progress with Artificial Intelligence TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks Evaluating Large Language Models as Live Strategic Agents: Provider Performance, Hybrid Decomposition, and Operational Gaps in Timed Risk Play Adapting the Interface, Not the Model: Runtime Harness Adaptation for Deterministic LLM Agents Cross-domain benchmarks reveal when coordinated AI agents improve scientific inference from partial evidence AOP-Wiki EMOD 3.0: Data Model Expansions and Content Evaluation Framework for Using Agentic AI to Improve Integration between AOPs and New Approach Methodologies (NAMs) A Subjective Logic-based method for runtime confidence updates in safety arguments Meta-Learning for Rapid Adaptation in Reference Tracking of Uncertain Nonlinear Systems Towards a General Intelligence and Interface for Wearable Health Data The Impact of AI Usage and Informativeness on Skill Development in Logical Reasoning Unlocking Proactivity in Task-Oriented Dialogue Harnesses for Inference-Time Alignment over Execution Trajectories Autonomous LLM Agents & CTFs: A Second Look Active Evidence-Seeking and Diagnostic Reasoning in Large Language Models for Clinical Decision Support IdleSpec: Exploiting Idle Time via Speculative Planning for LLM Agents Scaling Observation-aware Planning in Uncertain Domains Compiling Agentic Workflows into LLM Weights: Near-Frontier Quality at Two Orders of Magnitude Less Cost ECPO: Evidence-Coupled Policy Optimization for Evidence-Certified Candidate Ranking A Causal Argumentation Method for Explainability of Machine Learning Models Towards a compositional semantics for quantitative confidence assessment in assurance arguments WorkstreamBench: Evaluating LLM Agents on End-to-End Spreadsheet Tasks in Finance Meta-Soft: Leveraging Composable Meta-Tokens for Context-Preserving KV Cache Compression Memory-Induced Supra-Competitive Outcomes Between Deep Reinforcement Learning Agents in Optimal Trade Execution The Attribution Impossibility: No Feature Ranking Is Faithful, Stable, and Complete Under Collinearity Is Capability a Liability? More Capable Language Models Make Worse Forecasts When It Matters Most Advancing Mathematics Research with AI-Driven Formal Proof Search The Shape of Testimony: A Scalable Framework for Oral History Archive Comparison MindLoom: Composing Thought Modes for Frontier-Level Reasoning Data Synthesis High-speed Networking for Giga-Scale AI Factories Visibility nowcasting in South Korea: a machine learning approach to class imbalance and distribution shift LLM-Metrics: Measuring Research Impact Through Large Language Model Memory Trace2Skill: Verifier-Guided Skill Evolution for Long-Context EDA Agents Can AI Make Conflicts Worse? An Alignment Failure in LLM Deployment Across Conflict Contexts Toward AI VIS Co-Scientists: A General and End-to-End Agent Harness for Solving Complex Data Visualization Tasks KAPPS: A knowledge-based CPPS Architecture for the Circular Factory CLORE: Content-Level Optimization for Reasoning Efficiency ExComm: Exploration-Stage Communication for Error-Resilient Agentic Test-Time Scaling AI-Enabled Serious Games: Integrating Intelligence and Adaptivity in Training Systems Knowledge Graph Re-engineering Along the Ontological Continuum (extended version) FLUID: From Ephemeral IDs to Multimodal Semantic Codes for Industrial-Scale Livestreaming Recommendation ArborKV: Structure-Aware KV Cache Management for Scaling Tree-based LLM Reasoning Measuring Cross-Modal Synergy: A Benchmark for VLM Explainability A Camera-Cooperative ISAC Framework for Multimodal Non-Cooperative UAVs Sensing S2ED: From Story to Executable Descriptions for Consistency-Aware Story Illustration Towards Direct Evaluation of Harness Optimizers via Priority Ranking HarnessAPI: A Skill-First Framework for Unified Streaming APIs and MCP Tools Claw AI Lab: An Autonomous Multi-Agent Research Team Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning AtelierEval: Agentic Evaluation of Humans & LLMs as Text-to-Image Prompters Beyond the Org Chart: AI and the Transformation of Invisible Work Predicting Performance of Symbolic and Prompt Programs with Examples Benchmarking and Improving Monitors for Out-Of-Distribution Alignment Failure in LLMs Graph neural network explanations reveal a topological signature of disease-associated hubs in biological networks Deep Reinforcement Learning for Flexible Job Shop Scheduling with Random Job Arrivals MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems AttuneBench: A Conversation-Based Benchmark for LLM Emotional Intelligence TO-Agents: A Multi-Agent AI Pipeline for Preference-Guided Topology Optimization Implicit Safety Alignment from Crowd Preferences What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct Latent-space Attacks for Refusal Evasion in Language Models MPDocBench-Parse: Benchmarking Practical Multi-page Document Parsing Who Uses AI? Platforms, Workforce, and AI Exposure SGR-Bench: Benchmarking Search Agents on State-Gated Retrieval Skill Weaving: Efficient LLM Improvement via Modular Skillpacks Format-Constraint Coupling in Knowledge Graph Construction from Statistical Tables Multivariate Financial Forecasting using the Chronos Time Series Foundation Models Think Thrice Before You Speak: Dual knowledge-enhanced Theory-of-Mind Reasoning for Persuasive Agents Parametric Modular Answer Set Programs Made Declarative Ratchet: A Minimal Hygiene Recipe for Self-Evolving LLM Agents Planning in the LLM Era: Building for Reliability and Efficiency Detecting Synthetic Political Narratives in Cross-Platform Social Media Discourse Enhancing Visual Token Representations for Video Large Language Models via Training-Free Spatial-Temporal Pooling and Gridding Search-E1: Self-Distillation Drives Self-Evolution in Search-Augmented Reasoning HealthCraft: A Reinforcement Learning Safety Environment for Emergency Medicine ST-SimDiff: Balancing Spatiotemporal Similarity and Difference for Efficient Video Understanding with MLLMs AMEL: Accumulated Message Effects on LLM Judgments Teaching Language Models to Forecast Research Success Through Comparative Idea Evaluation Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings LACO: Adaptive Latent Communication for Collaborative Driving Efficient Agentic Reasoning Through Self-Regulated Simulative Planning Don't Collapse Your Features: Why CenterLoss Hurts OOD Detection and Multi-Scale Mahalanobis Wins Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models Tackle CSM in JPEG Steganalysis with Data Adaptation Echo: Learning from Experience Data via User-Driven Refinement Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? Code Generation by Differential Test Time Scaling NeuroQA: A Large-Scale Image-Grounded Benchmark for 3D Brain MRI Understanding STELLAR: Scaling 3D Perception Large Models for Autonomous Driving Multi-agent Collaboration with State Management \ECUAS{n}: A family of metrics for principled evaluation of uncertainty-augmented systems AgentAtlas: Beyond Outcome Leaderboards for LLM Agents DEL: Digit Entropy Loss for Numerical Learning of Large Language Models
Protein Thoughts: Interpretable Reasoning with Tree of Thoughts and Embedding-Space Flow Matching for Protein-Protein Interaction Discovery
Kingsley Yeo · 2026-05-23 · via cs.AI updates on arXiv.org

View PDF HTML (experimental)

Abstract:Protein-protein interactions (PPIs) govern nearly all cellular processes, yet computational methods for identifying binding partners typically produce ranked predictions without mechanistic justification. This creates a fundamental barrier to adoption because biologists cannot assess whether predictions reflect genuine biochemical insight or spurious correlations. We present \textbf{Protein Thoughts}, a framework that reformulates PPI discovery as an interpretable search problem with explicit reasoning. The system decomposes binding evidence into four biologically meaningful signals: sequence similarity reflecting evolutionary relationships, structural complementarity capturing geometric fit, interface balance, and chemical compatibility encoding residue-level interactions. Rather than collapsing these signals into an opaque score, we preserve their individual contributions through a transparent value function that enables both ranking and auditing. To navigate large candidate spaces efficiently, we introduce hypothesis-guided entropy-regularized Tree-of-Thoughts search. A fine-tuned language model generates search directives from embedding-derived features, classifying candidates as high-priority, exploratory, or skippable. These directives condition a Boltzmann policy that balances exploitation with entropy-driven exploration, while hypothesis-aware pruning prevents premature abandonment of promising candidates. For candidates exhibiting score disagreement, hypothesis-conditioned embedding-space flow matching transports protein embeddings toward the binder manifold. On the SHS148k benchmark, Protein Thoughts achieves mean best-binder rank of 11.2 versus 47.7 for an entropic tree search baseline, a 76% improvement, and for binding prediction the trained value function achieves $91.08 \pm 0.19$ Micro-F1, outperforming existing PPI methods on the same dataset.
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2605.21522 [q-bio.QM]
  (or arXiv:2605.21522v1 [q-bio.QM] for this version)
  https://doi.org/10.48550/arXiv.2605.21522

arXiv-issued DOI via DataCite

Submission history

From: Kingsley Yeon [view email]
[v1] Tue, 19 May 2026 04:14:06 UTC (6,221 KB)