惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Vercel News
Vercel News
L
LangChain Blog
The Register - Security
The Register - Security
H
Hackread – Cybersecurity News, Data Breaches, AI and More
Engineering at Meta
Engineering at Meta
Microsoft Security Blog
Microsoft Security Blog
C
Check Point Blog
U
Unit 42
B
Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
F
Fortinet All Blogs
D
DataBreaches.Net
H
Help Net Security
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
G
Google Developers Blog
人人都是产品经理
人人都是产品经理
MyScale Blog
MyScale Blog
Martin Fowler
Martin Fowler
P
Proofpoint News Feed
aimingoo的专栏
aimingoo的专栏
N
Netflix TechBlog - Medium
雷峰网
雷峰网
I
InfoQ
T
The Blog of Author Tim Ferriss
博客园 - 叶小钗
Recent Announcements
Recent Announcements
博客园 - 司徒正美
Blog — PlanetScale
Blog — PlanetScale
美团技术团队
Recorded Future
Recorded Future
大猫的无限游戏
大猫的无限游戏
博客园 - 聂微东
WordPress大学
WordPress大学
罗磊的独立博客
小众软件
小众软件
Stack Overflow Blog
Stack Overflow Blog
The GitHub Blog
The GitHub Blog
博客园 - 【当耐特】
月光博客
月光博客
Apple Machine Learning Research
Apple Machine Learning Research
F
Full Disclosure
T
Tailwind CSS Blog
量子位
D
Docker
V
Visual Studio Blog
J
Java Code Geeks
Hugging Face - Blog
Hugging Face - Blog
V
V2EX
有赞技术团队
有赞技术团队
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com

cs.DC updates on arXiv.org

Poster: EdgeCitadel -- Hybrid NATS-MQTT Orchestration for Edge Multi-Agent Systems CoAgent: Concurrency Control for Multi-Agent Systems Green SARC: Predictive Cost and Carbon Governance for Agentic AI Systems Quantifying the Impact of Lossy Compression on Neural Generative Surrogate Modeling PreLort: Prefix-Nested LoRA for Federated Fine-Tuning under Rank Heterogeneity SMEPilot: Characterizing and Optimizing LLM Inference with Scalable Matrix Extensions Federated Medical Image Segmentation under Real-World Label Noise: A Benchmark Suite for Noisy Label Learning Method Selection Evaluating Gemma4 Models as AI Teaching Assistants for Introductory Parallel Programming: A DataRaceBench Study The Essence of Entity Component System Censorship-Resistant Sealed-Bid Auctions on Blockchains NEURON-Fabric: CXL-Side Low-Bit Gradient Aggregation for Distributed Training Solyx AI Grid: Hardware-Telemetry-Aware Routing Across Geographically Distributed GPU Clusters A RAG-Enhanced Bi-Level Cognitive Orchestration Framework for LEO Satellite Networks uringscope: Portable, Low-Overhead Observability for io_uring Coordinated Scheduling for MoE LLM Serving Generation Quality-Latency Tradeoff-Aware Inference Offloading for Multimodal LLMs in Cloud-Edge Continuum Adaptive Resource Management and Quality Control for Streaming Video Generation Is RISC-V Ready for Massively Parallel Astrophysical Codes? SDVDiag: Multimodal Causal Discovery for Online Diagnosis in Software-defined Vehicles Incentives and Evidence in Learned Service Orchestration SWARM-LLM: Collaborative Inference for Edge-based Small Language Models Secure and Low-Latency IoT Analytics Using an Edge-Based Streaming Architecture Raiders of the Lost Log: Synchronous Parallel In-Place Models and Algorithms SwiftCache: Efficient LLM Serving for Multi-turn Conversations with Heterogeneous KV Cache Sharing StorRep: Storage Research Experiment Patterns on Chameleon Cloud and Trovi Efficient Data Availability Sampling via Coded Distributed Arrays Tropical: Enhancing SLO Attainment in Disaggregated LLM Serving via SLO-Aware Multiplexing DRIFT: Risk-Constrained Diffusion with Imitation Priors for Mixed-Autonomy Traffic Generation Robust and Automated Reconfiguration of Byzantine Wide-Area Replication From the NYU Ultracomputer to Modern Exascale: A Historical and Architectural Survey of In-Network Computing and Scalable Synchronization CacheWise: Understanding Workloads and Optimizing KVCache Management for Efficiently Serving LLM Coding Agents A Unified Constant-Time Switch Rule for Constructing Edge-Disjoint Hamiltonian Cycles in Gaussian Networks Tangram: Hiding GPU Heterogeneity for Efficient LLM Parallelization Re-Rooting-Based Fault-Tolerant Broadcasting in Dense Gaussian Networks Beyond CPU-GPU Frequency: Memory-Clock and Tail Effects in Edge Inference Latency Estimation did:crdt: Coordination-Free Decentralised Identifiers via Signed CRDTs Single-Connection Mixed-Criticality Transport with CATS: Bounded Guarantees, Three Structural Limits, and a QUIC Escape Diagonal-Budgeted Trotterization for Efficient Quantum Hamiltonian Simulation Distributed Load Balancing with Workload-Dependent Service Rates Stannic: Systolic STochAstic ONliNe SchedulIng AcCelerator Complete CALM: A Coordination Criterion for Specifications HeRo: Adaptive Orchestration of Agentic RAG on Heterogeneous Mobile SoC RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure Nightjar: Dynamic Adaptive Speculative Decoding for Large Language Models Serving
Generated, Parallel, Scalable? A Study of Agentic AI-Generated Julia Code on Supercomputers
[Submitted on 15 Jun 2026] · 2026-06-16 · via cs.DC updates on arXiv.org

View PDF HTML (experimental)

Abstract:Julia is increasingly used in hpc as a single-language alternative to combining high-level scripting with low-level systems languages, but achieving scalable performance still requires expertise in parallel programming. llms are increasingly used for code generation and are advancing rapidly with each new version. Yet, existing studies focus on single-shot prompting rather than agentic settings, in which an llm autonomously plans, generates, and refines code through tool use.
Using an OpenCode-based agent extended with a Julia-documentation mcp server, we study agentic generation of parallel Julia code, focusing on task-based execution with this http URL. We evaluate three llms, OpenAI GPT-5.5, Anthropic Claude Opus 4.7, and the open-weight Qwen3-Coder-Next, on three problems with distinct parallel structures: {\pi} approximation, tiled general matrix multiplication, and tiled Cholesky decomposition. The generated this http URL implementations are compared against agent-generated this http URL and this http URL baselines, with shared-memory experiments scaling to 192 cores and distributed-memory experiments on two nodes.
The agents reliably produce executable code for small inputs but fail at larger scales due to deadlocks, oversubscription, or out-of-memory errors, with the open-weight model affected most severely. The two commercial models scale comparably on this http URL and this http URL, while their this http URL implementations expose recurring weaknesses in task dependencies, granularity, and scheduling. Agentic AI is promising for producing parallel Julia code, but generating robust, performance-aware implementations for large-scale hpc systems remains an open challenge.

Submission history

From: Jonas Posner [view email]
[v1] Mon, 15 Jun 2026 10:37:54 UTC (92 KB)