慣性聚合 関心のあるブログ、ニュース、テクノロジーを効率的に追跡
原文を読む 慣性聚合で開く

おすすめ購読元

N
Netflix TechBlog - Medium
Microsoft Azure Blog
Microsoft Azure Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
S
Security Archives - TechRepublic
Cyberwarzone
Cyberwarzone
D
Darknet – Hacking Tools, Hacker News & Cyber Security
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
博客园 - 【当耐特】
A
About on SuperTechFans
T
ThreatConnect
IT之家
IT之家
阮一峰的网络日志
阮一峰的网络日志
B
Blog
T
Tailwind CSS Blog
G
GRAHAM CLULEY
F
Future of Privacy Forum
V
Vulnerabilities – Threatpost
J
Java Code Geeks
量子位
博客园 - 叶小钗
Last Week in AI
Last Week in AI
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Stack Overflow Blog
Stack Overflow Blog
李成银的技术随笔
Spread Privacy
Spread Privacy
The Hacker News
The Hacker News
S
Schneier on Security
T
True Tiger Recordings
Vercel News
Vercel News
C
CXSECURITY Database RSS Feed - CXSecurity.com
C
Cybersecurity and Infrastructure Security Agency CISA
Latest news
Latest news
F
Fox-IT International blog
The Register - Security
The Register - Security
MongoDB | Blog
MongoDB | Blog
博客园 - 聂微东
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Know Your Adversary
Know Your Adversary
GbyAI
GbyAI
L
LangChain Blog
MyScale Blog
MyScale Blog
AWS News Blog
AWS News Blog
D
Docker
小众软件
小众软件
Stack Overflow Blog
Stack Overflow Blog
Microsoft Security Blog
Microsoft Security Blog
T
Tor Project blog
T
The Exploit Database - CXSecurity.com
P
Palo Alto Networks Blog
Malwarebytes
Malwarebytes

cs.LG updates on arXiv.org

Personalized Generative Models for Contextual Debiasing From Privacy to Generalization: Linear Max-Information Bounds for DP-SGD When Does Deep RL Beat Calibrated Baselines? A Benchmark Study on Adaptive Resource Control Amortized Factor Inference Networks for Posterior Inference Classification and detection of multiple UAVs using rational Gaussian wavelet neural networks Planning Neural Dynamics with Lie Group Embedding through Supervised Projective Manifold Learning Modeling Dynamic Mixtures of Time-Delay Systems from Streaming Time Series AirCast-SR: A Foundation Model for Kilometer-Scale Atmospheric Super-Resolution via Latent Consistency Diffusion Neural Bayesian Sequential Routing GAC: Noise-Aware Adaptive Mixing for Hybrid SFT-RL Post-Training Provably Communication-Efficient and Privacy-Preserving Federated Graph Neural Networks Function-Valued Causal Influence in Nonlinear Time Series The Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Works Balancing Plasticity and Stability with Fast and Slow Successor Features InfoQuant: Shaping Activation Distributions for Low-Bit LLM Quantization FM-fMRI: Event Conditioned Flow Matching for Rest-to-Task fMRI Time-Series Synthesis TrackRef3D: Multi-View Consistent Track-then-Label for Open-World Referring Segmentation in 3D Gaussian Splatting TSFMAudit: Data Contamination Auditing in Forecasting Time Series Foundation Models On the Push-Based Asynchronous Federated Learning: A Bias-Correction Aggregation Approach CSV-ViT: A Vision Transformer with the Variable-sized Cortical Supervertices for Detection of Alzheimer's Disease Pathologies Max-Window Scale Estimation for Near-Lossless HiF8 W8A8 Quantization-Aware Training Online Learning on Hidden-Convex Losses via Algorithmic Equivalence: Optimal Regret, Geometric Barrier, and Bandit Feedback Curriculum Learning for Safety Alignment A PAC-Bayesian View of Generalisation for Physics-Informed Machine Learning Dynamic Link Prediction with Temporally Enhanced Signed Graph Neural Networks GEM: Geometric Entropy Mixing for Optimal LLM Data Curation Reparametrizing Shampoo and SOAP for Subspace Basis Updates and BFloat16 Storage Unified Neural Scaling Laws Semigroup Consistency as a Diagnostic for Learned Physics Simulators QAM-W: Joint 2D Codebook Quantization for LLM Weights via Hadamard Rotation and Activation-Aware Scaling HRVConformer: Neonatal Hypoxic-Ischemic Encephalopathy Classification from the Heart Rate signals Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization A Hybrid Vision-Language Architecture for Automated Defect Reasoning and Report Generation in Industrial Inspection Quantized Keys Steal Attention: Bias Correction for KV-Cache Compression in Video Diffusion BioFact-MoE: Biologically Factorized Mixture of Experts for Vision-Language Prognostic Modeling in Hepatocellular Carcinoma Bridging Classification and Reconstruction: Cooperative Time Series Anomaly Detection SilIF: Silhouette-Augmented Isolation Forest for Unsupervised Transaction Fraud Detection Co-folding model guided by structural proteomics Energy-Gated Attention and Wavelet Positional Encoding: Complementary Inductive Biases for Transformer Attention Stateful Inference for Low-Latency Multi-Agent Tool Calling Two-Parameter Flows for Learning Population Dynamics of Physical Systems On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series A Fast and Generic Energy-Shifting Transformer for Hybrid Monte Carlo Radiotherapy Calculation ARBITER: Reasoning Trajectory Basins and Majority Vote Failures in Test-Time Sampling When Rule Violations Are Rare: Chimera Training for Logical Anomaly Detection The Constraint Tax: Measuring Validity-Correctness Tradeoffs in Structured Outputs for Small Language Models When Correct Demonstrations Hurt: Rethinking the Role of Exemplars in In-Context Learning Benchmarking Convolutional, Transformer, Hybrid, and Vision Language Models for Multi Disease Retinal Screening Rotation-Invariant Spherical Watermarking via Third-Order SO(3) Representation Coupling MULTISEISMO: A Multimodal Seismic Dataset and Model for Cross-Modal Seismic Understanding
Dense Retrievalのためのテスト時計算:フローズンエンコーディングモデルによるエージェントプログラム生成
Han Xiao · 2026-05-13 · via cs.LG updates on arXiv.org

PDFを表示 HTML(実験的)

要約:テスト時の計算は広く大規模な推論モデルだけが利益を得ると信じられています。私たちはそれが小規模な埋め込みモデルにも役立つことを示しました。現代の埋め込みモデルはLLMのバックボーンから派生しているため、フリーズされたエンコーダは再訓練なしで追加の推論計算から利益を得るべきです。エージェントプログラム-サーチループはフリーズされたエンコーダAPIを探索し、144の候補プログラムから十二つのパレート最適プログラムを生成し、単一パスベースラインに対してコスト比率が$c=1.2$から$14.7$までの範囲をカバーします。サーチは独立してRocchio疑似関連フィードバック、ColBERTスタイルのMaxSimを文粒度で、逆位相融合、Fisher線形判別を再発見し、すべてのトレーニング可能なパラメータや外部モデルなしで行われました。各フロンティアプログラムは、法的、金融、長文、一般のドメインをカバーする14のMMTEB検索タスク全体において、フリーズされたベースラインよりもnDCG@10を向上させます。プログラムは変更なしで未知のエンコーダファミリーと19の除外された検索タスクに転移し、68%のモデル-タスクペアは少なくとも1つのフロンティアプログラムがコサインベースラインよりも改善することを認めます。
コメント: 16ページ、4図
分野: 機械学習 (cs.LG); 計算と言語 (cs.CL); 情報検索 (cs.IR)
引用: arXiv:2605.11374 [cs.LG]
  (または arXiv:2605.11374v3 [cs.LG]) このバージョン用)
  https://doi.org/10.48550/arXiv.2605.11374

arXiv発行のDOI via DataCite

提出履歴

送信者: Han Xiao [メールを表示]
[v1] 火, 2026年5月12日 00:56:34 UTC (215 KB)
[v2] 水曜日、2026年5月13日 00:56:03 UTC (126 KB)
[v3] 火曜日、2026年5月26日 14:57:14 UTC (264 KB)