惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Help Net Security
Help Net Security
P
Privacy & Cybersecurity Law Blog
A
Arctic Wolf
P
Privacy International News Feed
Security Latest
Security Latest
L
Lohrmann on Cybersecurity
Stack Overflow Blog
Stack Overflow Blog
WordPress大学
WordPress大学
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
大猫的无限游戏
大猫的无限游戏
酷 壳 – CoolShell
酷 壳 – CoolShell
Know Your Adversary
Know Your Adversary
爱范儿
爱范儿
Simon Willison's Weblog
Simon Willison's Weblog
C
Cisco Blogs
T
Tailwind CSS Blog
T
Tor Project blog
博客园_首页
S
Schneier on Security
I
InfoQ
D
Docker
L
LINUX DO - 热门话题
Apple Machine Learning Research
Apple Machine Learning Research
D
DataBreaches.Net
阮一峰的网络日志
阮一峰的网络日志
Microsoft Security Blog
Microsoft Security Blog
M
MIT News - Artificial intelligence
美团技术团队
Project Zero
Project Zero
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
C
CXSECURITY Database RSS Feed - CXSecurity.com
GbyAI
GbyAI
NISL@THU
NISL@THU
C
Cyber Attacks, Cyber Crime and Cyber Security
AWS News Blog
AWS News Blog
Cyberwarzone
Cyberwarzone
S
SegmentFault 最新的问题
aimingoo的专栏
aimingoo的专栏
T
Threatpost
H
Help Net Security
PCI Perspectives
PCI Perspectives
K
Kaspersky official blog
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
A
About on SuperTechFans
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Jina AI
Jina AI
云风的 BLOG
云风的 BLOG
G
Google Developers Blog
L
LangChain Blog
S
Secure Thoughts

cs.LG updates on arXiv.org

EfficientSign: An Attention-Enhanced Lightweight Architecture for Indian Sign Language Recognition Unified Multimodal Uncertain Inference Low-Data Supervised Adaptation Outperforms Prompting for Cloud Segmentation Under Domain Shift Detecting Diffusion-generated Images via Dynamic Assembly Forests FIRE-CIR: Fine-grained Reasoning for Composed Fashion Image Retrieval CausalVAD: De-confounding End-to-End Autonomous Driving via Causal Intervention R3PM-Net: Real-time, Robust, Real-world Point Matching Network Needle in a Haystack: One-Class Representation Learning for Detecting Rare Malignant Cells in Computational Cytology Generative 3D Gaussian Splatting for Arbitrary-ResolutionAtmospheric Downscaling and Forecasting When & How to Write for Personalized Demand-aware Query Rewriting in Video Search HaloProbe: Bayesian Detection and Mitigation of Object Hallucinations in Vision-Language Models Memory-Guided Trust-Region Bayesian Optimization (MG-TuRBO) for High Dimensions EngageTriBoost: Predictive Modeling of User Engagement in Digital Mental Health Intervention Using Explainable Machine Learning Reservoir observer enhanced with residual calibration and attention mechanism Creator Incentives in Recommender Systems: A Cooperative Game-Theoretic Approach for Stable and Fair Collaboration in Multi-Agent Bandits Skip-Connected Policy Optimization for Implicit Advantage Efficient RL Training for LLMs with Experience Replay Wireless Communication Enhanced Value Decomposition for Multi-Agent Reinforcement Learning A Little Rank Goes a Long Way: Random Scaffolds with LoRA Adapters Are All You Need Adversarial Sensor Errors for Safe and Robust Wind Turbine Fleet Control PRAGMA: Revolut Foundation Model IKKA: Inversion Classification via Critical Anomalies for Robust Visual Servoing Adaptive Simulation Experiment for LLM Policy Optimization $p1$: Better Prompt Optimization with Fewer Prompts EvoLen: Evolution-Guided Tokenization for DNA Language Model Smartwatch-Based Sitting Time Estimation in Real-World Office Settings Structural Evaluation Metrics for SVG Generation via Leave-One-Out Analysis Loom: A Scalable Analytical Neural Computer Architecture Post-Hoc Guidance for Consistency Models by Joint Flow Distribution Learning Hierarchical Kernel Transformer: Multi-Scale Attention with an Information-Theoretic Approximation Analysis Spectral Geometry of LoRA Adapters Encodes Training Objective and Predicts Harmful Compliance Finite-Sample Analysis of Nonlinear Independent Component Analysis:Sample Complexity and Identifiability Bounds How does Chain of Thought decompose complex tasks? Uncertainty-Aware Transformers: Conformal Prediction for Language Models Adaptive Candidate Point Thompson Sampling for High-Dimensional Bayesian Optimization Using Synthetic Data for Machine Learning-based Childhood Vaccination Prediction in Narok, Kenya Delve into the Applicability of Advanced Optimizers for Multi-Task Learning Bridging SFT and RL: Dynamic Policy Optimization for Robust Reasoning Multi-Agent Decision-Focused Learning via Value-Aware Sequential Communication Predictive Entropy Links Calibration and Paraphrase Sensitivity in Medical Vision-Language Models Efficient Hierarchical Implicit Flow Q-learning for Offline Goal-conditioned Reinforcement Learning Modality-Aware Zero-Shot Pruning and Sparse Attention for Efficient Multimodal Edge Inference The nextAI Solution to the NeurIPS 2023 LLM Efficiency Challenge Feature-Label Modal Alignment for Robust Partial Multi-Label Learning Integrated electro-optic attention nonlinearities for transformers Toward World Models for Epidemiology Tracing the Chain: Deep Learning for Stepping-Stone Intrusion Detection Policy-Aware Design of Large-Scale Factorial Experiments R2G: A Multi-View Circuit Graph Benchmark Suite from RTL to GDSII Beyond Augmented-Action Surrogates for Multi-Expert Learning-to-Defer Continuous Orthogonal Mode Decomposition: Haptic Signal Prediction in Tactile Internet FIT-GNN: Faster Inference Time for GNNs that 'FIT' in Memory Using Coarsening Batch Distillation Data for Developing Machine Learning Anomaly Detection Methods Predicting Metabolic Dysfunction-Associated Steatotic Liver Disease using Machine Learning Methods: A Retrospective Cohort Study Adaptive Tuning of Parameterized Traffic Controllers via Multi-Agent Reinforcement Learning Bandwidth-constrained Variational Message Encoding for Cooperative Multi-agent Reinforcement Learning Neural Two-Stage Stochastic Optimization for Solving Unit Commitment Problem Measurement-Consistent Langevin Corrector for Stabilizing Latent Diffusion Inverse Problem Solvers SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning Rays as Pixels: Learning A Joint Distribution of Videos and Camera Trajectories PhysInOne: Visual Physics Learning and Reasoning in One Suite PDE-regularized Dynamics-informed Diffusion with Uncertainty-aware Filtering for Long-Horizon Dynamics Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Injection Regime-Conditional Retrieval: Theory and a Transferable Router for Two-Hop QA Identification and Anonymization of Named Entities in Unstructured Information Sources for Use in Social Engineering Detection Hypergraph Neural Networks Accelerate MUS Enumeration ASTRA: Adaptive Semantic Tree Reasoning Architecture for Complex Table Question Answering Neighbourhood Transformer: Switchable Attention for Monophily-Aware Graph Learning WOMBET: World Model-Based Experience Transfer for Robust and Sample-efficient Reinforcement Learning Revisiting the Capacity Gap in Chain-of-Thought Distillation from a Practical Perspective A Mathematical Framework for Temporal Modeling and Counterfactual Policy Simulation of Student Dropout Temporal Dropout Risk in Learning Analytics: A Harmonized Survival Benchmark Across Dynamic and Early-Window Representations MedFormer-UR: Uncertainty-Routed Transformer for Medical Image Classification Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs SenBen: Sensitive Scene Graphs for Explainable Content Moderation Deep Learning-Based Tracking and Lineage Reconstruction of Ligament Breakup Every Response Counts: Quantifying Uncertainty of LLM-based Multi-Agent Systems through Tensor Decomposition 3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding Evidential Transformation Network: Turning Pretrained Models into Evidential Models for Post-hoc Uncertainty Estimation StructRL: Recovering Dynamic Programming Structure from Learning Dynamics in Distributional Reinforcement Learning From Selection to Scheduling: Federated Geometry-Aware Correction Makes Exemplar Replay Work Better under Continual Dynamic Heterogeneity Detection of Hate and Threat in Digital Forensics: A Case-Driven Multimodal Approach Semantic Intent Fragmentation: A Single-Shot Compositional Attack on Multi-Agent AI Pipelines Joint Interference Detection and Identification via Adversarial Multi-task Learning From Dispersion to Attraction: Spectral Dynamics of Hallucination Across Whisper Model Scales AlphaLab: Autonomous Multi-Agent Research Across Optimization Domains with Frontier LLMs Act or Escalate? Evaluating Escalation Behavior in Automation with Language Models Multivariate Time Series Anomaly Detection via Dual-Branch Reconstruction and Autoregressive Flow-based Residual Density Estimation On the Spectral Geometry of Cross-Modal Representations: A Functional Map Diagnostic for Multimodal Alignment Structured Exploration and Exploitation of Label Functions for Automated Data Annotation MolPaQ: Modular Quantum-Classical Patch Learning for Interpretable Molecular Generation Reinforcement-aware Knowledge Distillation for LLM Reasoning SubQuad: Near-Quadratic-Free Structure Inference with Distribution-Balanced Objectives in Adaptive Receptor framework A Horizon-Aware Decision-Support Framework for Demand Forecasting Model Selection in Resilient Production Planning Multi-agent Adaptive Mechanism Design Relational Visual Similarity From Navigation to Refinement: Revealing the Two-Stage Nature of Flow-based Diffusion Models through Oracle Velocity On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs STCast: Adaptive Boundary Alignment for Global and Regional Weather Forecasting OmniPrism: Learning Disentangled Visual Concept for Image Generation
Deep Doubly Debiased Longitudinal Effect Estimation with ICE G-Computation
[Submitted on 12 Feb 2026 (v1), last revised 12 Jun 2026 (this v · 2026-06-15 · via cs.LG updates on arXiv.org

View PDF HTML (experimental)

Abstract:Estimating longitudinal treatment effects is essential for sequential decision-making but is challenging due to treatment-confounder feedback. While Iterative Conditional Expectation (ICE) G-computation offers a principled approach, its recursive structure suffers from error propagation, corrupting the learned outcome regression models. We propose D3-Net, a framework that mitigates error propagation in ICE training and then applies a robust final correction. First, to interrupt error propagation during learning, we train the ICE sequence using Sequential Doubly Robust (SDR) pseudo-outcomes, which provide bias-corrected targets for each regression. Second, we employ a multi-task transformer with a covariate simulator head for auxiliary supervision, regularizing representation learning, and a target network to stabilize training dynamics. For the final estimate, we discard the SDR correction and instead use the uncorrected nuisance models to perform Longitudinal Targeted Minimum Loss-Based Estimation (LTMLE) on the original outcomes. This second-stage, targeted debiasing ensures robustness and optimal finite-sample properties. Comprehensive experiments demonstrate that our model, D3-Net, robustly reduces bias and variance across different horizons, counterfactuals, and time-varying confoundings, compared to existing state-of-the-art ICE-based estimators.

Submission history

From: Wenxin Chen [view email]
[v1] Thu, 12 Feb 2026 20:16:27 UTC (905 KB)
[v2] Fri, 12 Jun 2026 07:25:09 UTC (1,474 KB)