The Long Delay to Arithmetic Generalization: When Learned Representations Outrun Behavior
Laura Gomezjurado Gonzalez · 2026-03-31 · via cs.AI updates on arXiv.org

Grokking in transformers trained on algorithmic tasks is characterized by a long delay between training-set fit and abrupt generalization, but the source of that delay remains poorly understood. In encoder-decoder arithmetic models, we argue that this delay reflects limited access to already learned structure rather than failure to acquire that structure in the first place. We study one-step Collatz prediction and find that the encoder organizes parity and residue structure within the first few thousand training steps, while output accuracy remains near chance for tens of thousands more. Causal interventions support the decoder bottleneck hypothesis. Transplanting a trained encoder into a fresh model accelerates grokking by 2.75 times, while transplanting a trained decoder actively hurts. Freezing a converged encoder and retraining only the decoder eliminates the plateau entirely and yields 97.6% accuracy, compared to 86.1% for joint training. What makes the decoder's job harder or easier depends on numeral representation. Across 15 bases, those whose factorization aligns with the Collatz map's arithmetic (e.g., base 24) reach 99.8% accuracy, while binary fails completely because its representations collapse and never recover. The choice of base acts as an inductive bias that controls how much local digit structure the decoder can exploit, producing large differences in learnability from the same underlying task.
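To make the task setup concrete, here is a minimal sketch of how a one-step Collatz prediction example could be serialized as digit sequences in different bases. The abstract does not specify the exact formulation, so several things here are assumptions: the standard map (n/2 for even n, 3n+1 for odd n) rather than a shortcut variant, most-significant-digit-first ordering, and the helper names `collatz_step`, `to_digits`, and `make_example`, which are hypothetical and not taken from the paper.

```python
from typing import List, Tuple


def collatz_step(n: int) -> int:
    """Apply one step of the standard Collatz map (assumed variant)."""
    if n <= 0:
        raise ValueError("n must be a positive integer")
    return n // 2 if n % 2 == 0 else 3 * n + 1


def to_digits(n: int, base: int) -> List[int]:
    """Return the digits of n in the given base, most significant first."""
    if base < 2:
        raise ValueError("base must be at least 2")
    if n < 0:
        raise ValueError("n must be non-negative")
    if n == 0:
        return [0]
    digits = []
    while n > 0:
        n, remainder = divmod(n, base)
        digits.append(remainder)
    return digits[::-1]


def make_example(n: int, base: int) -> Tuple[List[int], List[int]]:
    """Build one (input digits, target digits) pair for a seq2seq model."""
    return to_digits(n, base), to_digits(collatz_step(n), base)


if __name__ == "__main__":
    # Same underlying task, three different numeral representations.
    for base in (2, 10, 24):
        source, target = make_example(27, base)
        print(f"base {base:>2}: {source} -> {target}")
```

The same integer pair (27, 82) yields sequences of very different lengths and digit alphabets depending on the base. In base 24 the last digit alone determines n mod 2 and n mod 3, since 24 is divisible by both, which is one plausible reading of a base whose factorization "aligns with the Collatz map's arithmetic".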