

















Abstract: Knowledge graphs (KGs) can provide structured scientific context to language models, but it remains unclear which graph facts actually shape the generated hypotheses. We study KG-guided hypothesis generation for battery materials across Mistral-7B, Llama-3.1-70B, and Gemini 2.5 Flash. We perturb local KGs by varying density, ontology richness, topology, and control structure, and evaluate outputs with both provided-graph and fixed-reference metrics. Across models, KG utility is selective and model-dependent: graph context changes outputs, but no-KG outputs also recover substantial graph content from model priors. Compact top-k subgraphs often approximate full-KG behavior, including when claimed-outcome triples are held out. At the same time, compression is not unique to one semantic ranking rule: random and topology-based subsets can also recover much of the signal. These results support a redundancy-aware Compressive KG hypothesis: useful KG signal is often recoverable from compact, scientifically structured subgraphs rather than requiring the full local graph.
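
The abstract contrasts three ways of compressing a local KG into a compact subgraph: a ranked top-k selection, a random subset, and a topology-based subset. The following is a minimal sketch of what such selectors could look like, not the authors' implementation. All function names, the per-triple relevance scores, the use of node degree as the topology criterion, and the entity-coverage measure are assumptions made for illustration; the paper's actual ranking rule and metrics are not specified in the abstract.

```python
import random
from collections import Counter
from typing import List, Sequence, Tuple

Triple = Tuple[str, str, str]  # (head, relation, tail)


def top_k_by_score(triples: Sequence[Triple], scores: Sequence[float], k: int) -> List[Triple]:
    """Keep the k triples with the highest (hypothetical) relevance score."""
    ranked = sorted(zip(triples, scores), key=lambda pair: pair[1], reverse=True)
    return [triple for triple, _ in ranked[:k]]


def random_k(triples: Sequence[Triple], k: int, seed: int = 0) -> List[Triple]:
    """Keep k triples sampled uniformly without replacement (control baseline)."""
    rng = random.Random(seed)
    return rng.sample(list(triples), min(k, len(triples)))


def top_k_by_degree(triples: Sequence[Triple], k: int) -> List[Triple]:
    """Keep the k triples whose endpoints have the highest combined degree.

    Degree is used here as one assumed example of a topology-based criterion.
    """
    degree = Counter()
    for head, _, tail in triples:
        degree[head] += 1
        degree[tail] += 1
    ranked = sorted(triples, key=lambda t: degree[t[0]] + degree[t[2]], reverse=True)
    return list(ranked[:k])


def entity_coverage(subset: Sequence[Triple], full: Sequence[Triple]) -> float:
    """Fraction of entities in the full graph that also appear in the subset."""
    full_entities = {e for h, _, t in full for e in (h, t)}
    sub_entities = {e for h, _, t in subset for e in (h, t)}
    return len(sub_entities & full_entities) / len(full_entities) if full_entities else 0.0
```

Comparing `entity_coverage` (or any downstream metric) across the three selectors at the same `k` is one simple way to probe whether a compact subgraph's usefulness depends on the ranking rule or mostly on redundancy in the graph.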
| Subjects: | Artificial Intelligence (cs.AI) |
| Cite as: | arXiv:2605.27176 [cs.AI] (or arXiv:2605.27176v1 [cs.AI] for this version) |
| DOI: | https://doi.org/10.48550/arXiv.2605.27176 (arXiv-issued DOI via DataCite, pending registration) |
From: Sanjay Das
[v1]
Tue, 26 May 2026 15:29:41 UTC (2,579 KB)