인셔셔RSS 관심 있는 블로그, 뉴스, 기술 정보를 효율적으로 추적하고 읽으세요
원문 읽기 InertiaRSS에서 열기

추천 피드

The GitHub Blog
The GitHub Blog
T
ThreatConnect
C
Check Point Blog
T
The Exploit Database - CXSecurity.com
U
Unit 42
云风的 BLOG
云风的 BLOG
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
T
Tenable Blog
博客园 - 叶小钗
D
Docker
T
Threatpost
WordPress大学
WordPress大学
腾讯CDC
I
Intezer
T
Tailwind CSS Blog
Engineering at Meta
Engineering at Meta
D
Darknet – Hacking Tools, Hacker News & Cyber Security
Hugging Face - Blog
Hugging Face - Blog
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
The Register - Security
The Register - Security
Stack Overflow Blog
Stack Overflow Blog
PCI Perspectives
PCI Perspectives
S
Security Archives - TechRepublic
Simon Willison's Weblog
Simon Willison's Weblog
A
Arctic Wolf
MongoDB | Blog
MongoDB | Blog
小众软件
小众软件
Hacker News: Ask HN
Hacker News: Ask HN
O
OpenAI News
博客园 - 【当耐特】
L
LINUX DO - 最新话题
C
Comments on: Blog
S
Securelist
月光博客
月光博客
S
Secure Thoughts
Security Latest
Security Latest
MyScale Blog
MyScale Blog
NISL@THU
NISL@THU
F
Full Disclosure
M
Microsoft Research Blog - Microsoft Research
T
True Tiger Recordings
SecWiki News
SecWiki News
aimingoo的专栏
aimingoo的专栏
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 热门话题
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
AWS News Blog
AWS News Blog
Hacker News - Newest:
Hacker News - Newest: "LLM"
L
Lohrmann on Cybersecurity
H
Help Net Security

cs.LG updates on arXiv.org

Personalized Generative Models for Contextual Debiasing From Privacy to Generalization: Linear Max-Information Bounds for DP-SGD When Does Deep RL Beat Calibrated Baselines? A Benchmark Study on Adaptive Resource Control Amortized Factor Inference Networks for Posterior Inference Classification and detection of multiple UAVs using rational Gaussian wavelet neural networks Planning Neural Dynamics with Lie Group Embedding through Supervised Projective Manifold Learning Modeling Dynamic Mixtures of Time-Delay Systems from Streaming Time Series AirCast-SR: A Foundation Model for Kilometer-Scale Atmospheric Super-Resolution via Latent Consistency Diffusion Neural Bayesian Sequential Routing GAC: Noise-Aware Adaptive Mixing for Hybrid SFT-RL Post-Training Provably Communication-Efficient and Privacy-Preserving Federated Graph Neural Networks Function-Valued Causal Influence in Nonlinear Time Series The Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Works Balancing Plasticity and Stability with Fast and Slow Successor Features InfoQuant: Shaping Activation Distributions for Low-Bit LLM Quantization FM-fMRI: Event Conditioned Flow Matching for Rest-to-Task fMRI Time-Series Synthesis TrackRef3D: Multi-View Consistent Track-then-Label for Open-World Referring Segmentation in 3D Gaussian Splatting TSFMAudit: Data Contamination Auditing in Forecasting Time Series Foundation Models On the Push-Based Asynchronous Federated Learning: A Bias-Correction Aggregation Approach CSV-ViT: A Vision Transformer with the Variable-sized Cortical Supervertices for Detection of Alzheimer's Disease Pathologies Max-Window Scale Estimation for Near-Lossless HiF8 W8A8 Quantization-Aware Training Online Learning on Hidden-Convex Losses via Algorithmic Equivalence: Optimal Regret, Geometric Barrier, and Bandit Feedback Curriculum Learning for Safety Alignment A PAC-Bayesian View of Generalisation for Physics-Informed Machine Learning Dynamic Link Prediction with Temporally Enhanced Signed Graph Neural Networks GEM: Geometric Entropy Mixing for Optimal LLM Data Curation Reparametrizing Shampoo and SOAP for Subspace Basis Updates and BFloat16 Storage Unified Neural Scaling Laws Semigroup Consistency as a Diagnostic for Learned Physics Simulators QAM-W: Joint 2D Codebook Quantization for LLM Weights via Hadamard Rotation and Activation-Aware Scaling HRVConformer: Neonatal Hypoxic-Ischemic Encephalopathy Classification from the Heart Rate signals Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization A Hybrid Vision-Language Architecture for Automated Defect Reasoning and Report Generation in Industrial Inspection Quantized Keys Steal Attention: Bias Correction for KV-Cache Compression in Video Diffusion BioFact-MoE: Biologically Factorized Mixture of Experts for Vision-Language Prognostic Modeling in Hepatocellular Carcinoma Bridging Classification and Reconstruction: Cooperative Time Series Anomaly Detection SilIF: Silhouette-Augmented Isolation Forest for Unsupervised Transaction Fraud Detection Co-folding model guided by structural proteomics Energy-Gated Attention and Wavelet Positional Encoding: Complementary Inductive Biases for Transformer Attention Stateful Inference for Low-Latency Multi-Agent Tool Calling Two-Parameter Flows for Learning Population Dynamics of Physical Systems On the Role of Inductive Bias in Time-Series Pretraining: A Case Study in Learning Generalizable Representations for Clinical Time Series A Fast and Generic Energy-Shifting Transformer for Hybrid Monte Carlo Radiotherapy Calculation ARBITER: Reasoning Trajectory Basins and Majority Vote Failures in Test-Time Sampling When Rule Violations Are Rare: Chimera Training for Logical Anomaly Detection The Constraint Tax: Measuring Validity-Correctness Tradeoffs in Structured Outputs for Small Language Models When Correct Demonstrations Hurt: Rethinking the Role of Exemplars in In-Context Learning Benchmarking Convolutional, Transformer, Hybrid, and Vision Language Models for Multi Disease Retinal Screening Rotation-Invariant Spherical Watermarking via Third-Order SO(3) Representation Coupling MULTISEISMO: A Multimodal Seismic Dataset and Model for Cross-Modal Seismic Understanding
Few-Step Generative Models를 샘플 기반 변분 추론을 분산시켜 정렬하기
Jaewoo Lee, · 2026-05-27 · via cs.LG updates on arXiv.org

PDF 보기 HTML (실험 중)

요약: 몇 단계 생성 모델을 맞추는 것은 어려운 일입니다. 그 이유는 기존의 맞추기 프레임워크가 일반적으로 제한적인 가정에 의존하기 때문입니다: 처리 가능한 확률, 특정 ODE/SDE 해결기, 또는 특정 모델 가족. 우리는 FAV, 샘플 기반 변이 추론을 통한 몇 단계 생성 모델 맞추기(Few-step Generative Models Alignment via Sample-based Variational Inference), 일반적인 맞추기 프레임워크를 소개합니다. 이 프레임워크는 생성기와 참조 분포에 대한 샘플 접근만 필요합니다. 우리는 맞추기를 참조 분포에 고정된 보상 기울기 분포에서 샘플링으로 설정합니다. 우리는 스테인 변이 경사 하강법을 샘플 기반 변이 추론 스케마로 활용하고, 고정점 회귀를 통해 그 파티클 업데이트를 생성기 매개변수에 분산시킵니다. 우리는 로봇 조작 및 이미지 생성기 맞추기 두 도메인에서 FAV를 평가합니다. 로봇 조작 생성 정책 맞추기에서 FAV는 56개 오프라인 및 30개 오프라인-온라인 RL 작업에 걸쳐 주류 정책 추출 기준을 모두 능가합니다. 이미지 생성기 맞추기에 대해서는 FAV가 GAN, 이동 모델, 일관성 모델 및 흐름 맵을 포함한 다양한 몇 단계 백본을 미세 조정하며, ImageNet-$256$에서 1024$^2$ 텍스트-이미지 합성으로 확장합니다. 코드는 다음에 제공됩니다.이 https URL.
コメント: レビュー中
トピック: 機械学習 (cs.LG); 人工知能 (cs.AI)
引用: arXiv:2605.26552 [cs.LG]
  (または arXiv:2605.26552v1 [cs.LG]) for this version)
  https://doi.org/10.48550/arXiv.2605.26552

DataCite를 통한 arXiv 발행 DOI (등록 예정)

제출 이력

From: Jaewoo Lee [이메일 보기]
[v1] 화, 26 5월 2026 05:02:49 UTC (18,419 KB)