
























Abstract:Molecular property regression struggles with cases in chemically relevant target ranges that are underrepresented in datasets. Standard average error minimization approaches underperform in these highly relevant cases, and oversampling approaches lead to meaningless molecular representations. In this paper, we propose SPECTRA, a spectral, domain-aware graph generation method designed to improve the prediction of underrepresented but relevant molecular property values. It combines a rarity-aware budgeting scheme to focus generation where data are scarce, target-neighbors graph alignment to establish structural correspondence, and interpolation of Laplacian spectra, node features, and targets. Coupled with spectral GNN using edge-aware Chebyshev convolutions, SPECTRA shows its effectiveness in property prediction benchmarks with competitive performance over leading state-of-the-art methods in relevant target ranges, while requiring ~4x less computational time.
| Subjects: | Machine Learning (cs.LG); Spectral Theory (math.SP); Molecular Networks (q-bio.MN) |
| Cite as: | arXiv:2511.04838 [cs.LG] |
| (or arXiv:2511.04838v2 [cs.LG] for this version) | |
| https://doi.org/10.48550/arXiv.2511.04838 arXiv-issued DOI via DataCite |
From: Brenda Nogueira [view email]
[v1]
Thu, 6 Nov 2025 21:57:21 UTC (14,810 KB)
[v2]
Wed, 20 May 2026 19:04:15 UTC (6,118 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。