





















Abstract:Chromatin regulators can alter transcriptional programs by modifying the accessibility of regulatory DNA elements. Understanding how regulatory sequences differ between wild-type (WT) and knockout (KO) conditions is crucial for deciphering transcriptional control. Here, we applied a convolutional neural network, \textbf{WTKO-CNN} with an attention mechanism to classify DNA sequences as WT or KO, achieving high predictive performance. To interpret the model, we generated saliency maps to identify nucleotide positions most influential for the classification decision. From these high-saliency regions, we extracted and clustered k-mers, enabling de novo motif discovery. Sequence logos and consensus motifs derived from the CNN filters revealed biologically meaningful patterns, which are further validated using MEME, TOMTOM, and HOMER against known transcription factor binding sites. Our analysis identified motifs associated with transcription factor families that discriminate WT from KO sequences, demonstrating that CNN-guided saliency mapping is a powerful approach for uncovering functional sequence features.
| Subjects: | Genomics (q-bio.GN); Artificial Intelligence (cs.AI) |
| Cite as: | arXiv:2605.24034 [q-bio.GN] |
| (or arXiv:2605.24034v1 [q-bio.GN] for this version) | |
| https://doi.org/10.48550/arXiv.2605.24034 arXiv-issued DOI via DataCite (pending registration) |
From: Lopamudra Dey [view email]
[v1]
Thu, 21 May 2026 07:46:50 UTC (2,244 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。