惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

cs.AI updates on arXiv.org

Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation Summoning the Oracle to Slay It: Mitigating Look-Ahead Bias in Financial Backtesting with Large Language Models Feature Lottery? A Bifurcation Theory of Concept Emergence CausaLab: A Scalable Environment for Interactive Causal Discovery Toward AI Scientists Fundamental Limitation in Explaining AI GlobalDentBench: A Multinational Benchmark for Evaluating LLM Clinical Reasoning in Dentistry with Expert Calibration When Correct Beliefs Collapse: Epistemic Resilience of LLMs under Clinical Pressure CITYREP: A Unified Benchmark for Urban Representations Across Cities, Tasks, and Modalities Diff-Instruct with Diffused Reward: Towards Principled One-step Generator RL Cascade-KDE: Robust Time-Series Restoration under Out-of-Distribution Impulse Corruptions AvalancheBench: Evaluating Enterprise Data Agents Through Latent World Recovery Truthful Online Preference Aggregation for LLM Fine-Tuning in Mobile Crowdsourcing BoxLitE: A Faithful Knowledge Base Embedding Based on Convex Optimization Learning to Reason Efficiently with A* Post-Training Residual Drift Dominates Contradiction in Multi-Turn Constraint Reasoning Context: Proactive Goal-Directed Intelligence via Composable Sandboxed Programs, Declarative Wiring, and Structured Interaction Concept Drift Adaptation Using Self-Supervised and Reinforcement Learning In Android Malware Detection Reason--Imagine--Act: Closed-Loop LLM Decision Making with World Models for Autonomous Driving Generative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusion The Model Is Not the Product: A Dual-Pillar Architecture for Local-First Psychological Coaching Knowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysis Authority Inversion in LLM-Mediated Ubiquitous Systems: When Models Trust Users Over Sensors Market Regime Council for Dynamic Credit Assignment in Multi-Agent LLM Decision Systems A Signal-Language Foundation Model for Broad-Spectrum Cardiovascular Assessment from Routine Electrocardiography Automated Detection and Classification of Delusion-related Content in Naturalistic Audio Diaries Using Multi-Agent Language Models Privacy-Preserving Local Language Models for Longitudinal Data Retrieval in Chronic Dermatologic Disease: Implementation in Pemphigus Patients Towards trustworthy agentic AI: a comprehensive survey of safety, robustness, privacy, and system security TRACER: A Semantic-Aware Framework for Fine-Grained Contamination Detection in Code LLMs Hidden-State Privacy Has an Empty Middle Machine Psychometrics: A Mathematical Psychology of Artificial Intelligence Breaking the Chains of Probability: Neutrosophic Logic as a New Framework for Epistemic Uncertainty in Large Language Models Raon-Speech Technical Report Enhancing Reliability in LLM-Based Secure Code Generation Low-Cost Labels, Reliable Choices: Rollout-Calibrated Hyper-Heuristics for Job Shop Scheduling LLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMs IVR-R1: Refining Trajectories through Iterative Visual-Grounded Reasoning in Reinforcement Learning Mitigating Object Hallucinations in Vision-Language Models through Region-Aware Attention Recalibration Distributionally Robust Transfer Learning with Structurally Missing Covariates, with Application to Cross-National Cardiac Arrest Prediction When Mean CE Fails: Median CE Can Better Track Language Model Quality LC-ERD: Mining Latent Logic for Self-Evolving Reasoning via Consistency-Regulated Reward Decomposition Spectral Probe-Circuits: A Three-Step Recipe for Identifying Attention-Head Circuits in Pretrained Transformers CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection Not All Transitions Matter: Evidence from PPO FLOATBench: A Dataset and Benchmark for Floating Offshore Wind Turbine Tower Fatigue Confidence Calibration in Large Language Models Extracting Training Data from Diffusion Language Models via Infilling An Interpretable CF-RL-TOPSIS Fusion Model for Skills-Aware Talent Recommendation Jailbreak to Protect: Buffering and Reinforcing via Temporary Jailbreaking for Safe Fine-Tuning in Large Language Models An Interactive Paradigm for Deep Research High-Risk AI Systems and the Problem of Identity in the European AI Act Fuzzy, Neutrosophic, and Uncertain Graph Theory: Properties and Applications Improving Labeling Consistency with Detailed Constitutional Definitions and AI-Driven Evaluation Parameter Efficient Multi-Class Intelligent Scheduling for Multimodal Online Distributed Industrial Anomaly Detection MuCRASP: Multimodal Chain-of-thought Reasoning aware Structured Pruning Signs Beat Floats: Low-Rank Double-Binary Adaptation for On-Device Fine-Tuning Exploration of Perceptual Speech Features for Clinical Decision-Support in Mental Health Care Trust but Verify: Prover-Verifier Deliberation for Selective LLM Prediction Measuring Reasoning Quality in LLMs: A Multi-Dimensional Behavioral Framework Filtered Posterior Mean Collections: A Unified Framework for Analytical Models of Diffusion Generalization Clustering as Reasoning: A $k$-Means Interpretation of Chain-of-Thought Graph Learning MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs DRIVE: Modeling Skills at the Reasoning and Interaction Levels for Web Agents under Continual Learning Mixture of Complementary Agents for Robust LLM Ensemble Nano World Models: A Minimalist Implementation of Future Video Prediction Inference Time Context Sparsity: Illusion or Opportunity? From Accuracy to Auditability: A Survey of Determinism in Financial AI Systems When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs Multimodal Alignment and Preference Optimization for Zero-Shot Conditional RNA Generation TriVAL: A Tri-Validation Framework for Faithful Automatic Optimization Modeling Towards Evaluation Engineering: An Empirical Study of ML Evaluation Harnesses in the Wild An Empirical Evaluation of LLM-Generated Code Security Across Prompting Methods SPACE: Unifying Symmetric and Asymmetric Routing Problems for Generalist Neural Solver Insuring Every Action: An Authority Frontier Framework for Runtime Actuarial Control of Autonomous AI Agents In Search of the Ingredients of Open-Endedness: Replicating Picbreeder with Large Vision-Language Models Beyond the Frontier: Stochastic Backtracking for Efficient Test-Time Scaling Overcoming "Physics Shock" in Earth Observation A Heteroscedastic Uncertainty Framework for PINN-based Flood Inference ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale MDIA: A Multi-Agent Diagnostic Intelligence Pipeline on HealthBench Professional Federated Learning over Human-Body Communication for On-Body Edge Intelligence: A Survey, Taxonomy, and BODYFED-HBC Scheduling Vignette Lattice theory and algebraic models for deep convolutional learning based on mathematical morphology Why We Need World Models for AGI: Where LLMs Fail and How World Models May Outperform Catching The Correct Answer Trap: Characterising AI Tutor Blind Spots When Analysing Student Reasoning Hypothesis Generation and Inductive Inference in Children and Language Models Agent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoning EchoDistill:Alignment Noisy-to-Clean Self-Distillation for Robust Audio LLMs How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning Iterative Refinement Neural Operators are Learned Fixed-Point Solvers: A Principled Approach to Spectral Bias Mitigation Geo-Expert: Towards Expert-Level Geological Reasoning via Parameter-Efficient Fine-Tuning Verified SHAP: Provable Bounds for Exact Shapley Values of Neural Networks RECTOR: Priority-Aware Rule-Based Reranking for Compliance-Aware Autonomous Driving Trajectory Selection A Large-Scale Dataset and Benchmark: Do Protein-Ligand Models Learn Binding Sites or Just Binding Likelihood? HeartBeatAI: An Interpretable and Robust Deep Learning Framework for Multi-Label ECG Arrhythmia Detection Quantum Frog: Emergent Cooperation and Difficulty Scaling in a Quantized-Time Cooperative Game Document Classification Pattern Recognition via Information Fusion: A Systematic Review of Multimodal and Multiview Representation Approaches Second Guess: Detecting Uncertainty Through Abstention and Answer Stability in Small Language Models Human-AI Collaboration in Science at Scale: A Global Large-scale Randomized Field Experiment QUIVER: A Formal Framework for Quantifying Perturbation Propagation and Bifurcation in Compound AI Systems
Task-Aligned Self-Supervised Learning for Medical Image Analysis: A Systematic Review and Practical Design Guidelines
Chathura Wim · 2026-05-26 · via cs.AI updates on arXiv.org

View PDF HTML (experimental)

Abstract:Self-supervised learning (SSL) has emerged as a promising paradigm for addressing the annotation bottleneck in medical imaging by learning representations from unlabeled data. However, its effectiveness depends heavily on the design of the pretext task and its alignment with the downstream clinical objective. We present a systematic, task-oriented review of SSL in medical imaging, examining how different pretext-task formulations influence performance across classification, segmentation, detection, and other tasks. Following PRISMA guidelines, we analyze 75 studies published between 2017 and 2025 and organize them into four paradigms: contrastive, non-contrastive and predictive, generative and reconstruction-based, and hybrid learning. Rather than cataloguing methods by architecture, we map each paradigm to the downstream objectives it best supports. Our analysis shows there is no universally optimal SSL strategy; instead, performance is governed by the alignment between the pretext task, the imaging modality, and the target task. Contrastive methods learn global discriminative features and align well with classification, but may overlook subtle pathological patterns. Generative and spatial prediction-based approaches better preserve local anatomical structure, making them more suitable for segmentation and other dense prediction tasks, while hybrid methods offer the most balanced performance. We further show that modality-specific design is critical and that SSL provides its greatest benefit in low-label and few-shot regimes. Finally, we distill these findings into practical design guidelines and outline open challenges, including pathology-aware pretext task design, resource-efficient training for high-dimensional data, and standardized evaluation protocols. This work offers practical guidance for designing more effective and clinically relevant SSL frameworks in medical imaging.
Comments: This manuscript is 31 pages with 4 tables and 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
MSC classes: 68T07, 68T45, 92C55, 68-02
ACM classes: I.2.6; I.4.0; I.5.4; I.2.10
Cite as: arXiv:2605.23995 [cs.CV]
  (or arXiv:2605.23995v1 [cs.CV] for this version)
  https://doi.org/10.48550/arXiv.2605.23995

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Kanakka Hewage Chathura Thimanka Wimalasiri [view email]
[v1] Mon, 18 May 2026 04:47:50 UTC (355 KB)