





















Abstract:State-of-the-art models represent proteins and molecules in separate embedding manifolds, limiting the modeling of systemic biological processes. We introduce ReactEmbed, a lightweight, plug-and-play module that bridges this gap. ReactEmbed leverages biochemical reaction networks as a source of functional context, based on the principle that co-participation in reactions defines a shared functional scope. The module aligns frozen embeddings from models like ESM-3 and MolFormer into a unified space using a weighted reaction graph and a specialized sampling strategy. This process enriches unimodal embeddings and enables strong performance on cross-domain benchmarks. ReactEmbed offers a practical method to unify biological representations without costly retraining. The code and database are available for open use\footnote{this https URL}.
| Subjects: | Machine Learning (cs.LG) |
| Cite as: | arXiv:2501.18278 [cs.LG] |
| (or arXiv:2501.18278v3 [cs.LG] for this version) | |
| https://doi.org/10.48550/arXiv.2501.18278 arXiv-issued DOI via DataCite |
From: Amitay Sicherman [view email]
[v1]
Thu, 30 Jan 2025 11:34:03 UTC (2,897 KB)
[v2]
Thu, 6 Feb 2025 11:18:35 UTC (2,898 KB)
[v3]
Sun, 24 May 2026 06:21:55 UTC (2,737 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。