惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

MongoDB | Blog
MongoDB | Blog
IT之家
IT之家
J
Java Code Geeks
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Recent Announcements
Recent Announcements
博客园 - 三生石上(FineUI控件)
博客园_首页
MyScale Blog
MyScale Blog
腾讯CDC
I
InfoQ
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
人人都是产品经理
人人都是产品经理
Vercel News
Vercel News
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
量子位
爱范儿
爱范儿
U
Unit 42
aimingoo的专栏
aimingoo的专栏
B
Blog RSS Feed
云风的 BLOG
云风的 BLOG
M
MIT News - Artificial intelligence
A
About on SuperTechFans
T
The Blog of Author Tim Ferriss
Blog — PlanetScale
Blog — PlanetScale
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Engineering at Meta
Engineering at Meta
博客园 - 叶小钗
小众软件
小众软件
Jina AI
Jina AI
Hugging Face - Blog
Hugging Face - Blog
Google DeepMind News
Google DeepMind News
The Cloudflare Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
D
Docker
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
博客园 - 【当耐特】
博客园 - Franky
H
Help Net Security
Stack Overflow Blog
Stack Overflow Blog
阮一峰的网络日志
阮一峰的网络日志
C
Check Point Blog
C
CERT Recently Published Vulnerability Notes
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
Cisco Talos Blog
Cisco Talos Blog
H
Hackread – Cybersecurity News, Data Breaches, AI and More
I
Intezer
Latest news
Latest news
D
Darknet – Hacking Tools, Hacker News & Cyber Security
博客园 - 司徒正美
Microsoft Security Blog
Microsoft Security Blog

cs.IR updates on arXiv.org

S1-DeepResearch: Beyond Search, Toward Real-World Long-Horizon Research Agents An Empirical Analysis of Optimization Dynamics and Sparsity Boundaries in Large-Scale Pedestrian Attribute Recognition Combining Retrieval-Augmented Text Generation with LLMs for Reading Content Recommendations Co-Scraper: query-aware DOM Pruning and Reusable Scraper Synthesis for Lightweight Web Data Extraction Edu-Theater: A Data-Efficient Agent Framework for Scalable Learner Behavior Simulation through Staging Roll-Call Guiding Federated Graph Recommendation with LLM-encoded knowledge HoloRec: Holistic Encoding and Interleaved Reasoning for Generative Recommendation Retrievable Gradients: Continual Post-Training Without Cumulative Weight Drift MAGE-RAG: Multigranular Adaptive Graph Evidence for Agentic Multimodal RAG in Long-Document QA Entity Labels Are Not Entity Signals: A Framework for Observable Relevance in Document Re-Ranking Theorem-Grounded Execution Ontologies for Interpretable Machine Reasoning RL-Index: Reinforcement Learning for Retrieval Index Reasoning How Much Do Reviews Really Contribute? A Study on Text-Enriched Matrix Factorization for Recommendations Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus Transfer Learning for FHIR Questionnaire Terminology Binding MVEB: Massive Video Embedding Benchmark Interactor: Agentic RL oriented Iterative Creation for Ad Description Generation in Sponsored Search Understanding the Behaviors of Environment-aware Information Retrieval How Much Can We Trust LLM Search Agents? Measuring Endorsement Vulnerability to Web Content Manipulation Benchmarking LLM Agents on Meta-Analysis Articles from Nature Portfolio SCAR: Semantic Continuity-Aware Retrieval for Efficient Context Expansion in RAG Retrieval-as-a-Service:A System-Oriented Analysis of Industrial Retrieval Pipelines in Web Systems Beyond Positive Signals: Unlocking Implicit Negative Behaviors for Enhanced Sequential User Modeling OneBar: An End-to-End Content-Grounded Generative Query Recommendation Framework for E-Commerce Video Feeds Confidence-Based Stopping Methods for Systematic Reviews EventConnector: Mining Social Event Relations through Temporal Graphs One Sequential Recommendation Model Pretrained from Synthetic Priors Predicts Multiple Datasets Intelligent Multimodal Retrieval and Reasoning for Geospatial Knowledge Discovery on the I-GUIDE Platform Leveraging Code-Mixed Product Metadata and User Feedback for Personalized Recommendation on Daraz Bangladesh PIANO: Personalized Reranking via Information Aggregation Node for Music Search Optimization Harmonizing Semantic and Collaborative in LLMs: Reasoning-based Embedding Generator for Sequential Recommendation OneRank: Unified Transformer-Native Ranking Architecture for Multi-Task Recommendation A Theoretical Framework for Risk Analysis of Stochastic Rankers Viral Images: Identifying Reprintings within 1.5 Million Photographs in Chronicling America LLM-Driven Usefulness Judgment for Web Search Evaluation Beyond Chunks and Graphs: Retrieval-Augmented Generation through Triplet-Driven Thinking Ascend-RaBitQ: Heterogeneous NPU-CPU Acceleration of Billion-Scale Similarity Search with 1-bit Quantization RAGR: Review-Augmented Generative Recommendation Skill Is Not Document: A Query-Conditional Benchmark and Two-Stage Retriever for LLM Agent Skill Routing MARS: Multi-rate Aggregation of Recency Signals for Sequential Recommendation across Sparse and Dense Regimes When Does Latent Reasoning Help? MeRa: Metric-Space Bias for Spatial Prediction Charge as a Construct-Validity Factor in Chinese Legal Case Retrieval: A Cross-Benchmark Audit MolE-RAG: Molecular Structure-Enhanced Retrieval-Augmented Generation for Chemistry OneFeed: A Unified Generative Framework for Feed ContentEnhancement and Query Generation Beyond Retrieval: Learning Compact User Representations for Scalable LLM Personalization TechGraphRAG: An Agentic Graph-Augmented RAG Framework for Technical Literature Reasoning Bridging Passive and Active: Enhancing Conversation Starter Recommendation via Active Expression Modeling Empirical Study of Pop and Jazz Mix Ratios for Genre-Adaptive Chord Generation RoTRAG: Rule of Thumb Reasoning for Conversation Harm Detection with Retrieval-Augmented Generation Beyond Predefined Schemas: TRACE-KG for Context-Enriched Knowledge Graphs from Complex Documents AI Co-Scientist for Ranking: Discovering Novel Search Ranking Models alongside LLM-based AI Agents with Cloud Computing Access AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents All-Mem: Agentic Lifelong Memory via Dynamic Topology Evolution Orcheo: A Modular Full-Stack Platform for Conversational Search Self-Supervised Learning as Discrete Communication Beyond Case Law: Evaluating Structure-Aware Retrieval and Safety in Statute-Centric Legal QA MIRAGE: Runtime Scheduling for Multi-Vector Image Retrieval with Hierarchical Decomposition Peeking inside the Black-Box: Reinforcement Learning for Explainable and Accurate Relation Extraction Projection and Quantisation: A Unifying View of Learning to Hash, from Random Projections to the RAG Era Learning Unified User Quantized Tokenizers for User Representation A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task
KnowML: Improving Generalization of ML-NIDS with Attack Knowledge Graphs
[Submitted on 24 Jun 2025 (v1), last revised 14 Jun 2026 (this v · 2026-06-16 · via cs.IR updates on arXiv.org

View PDF HTML (experimental)

Abstract:Anomaly-based ML-NIDS (A-NIDS) model normal network behavior from benign data and classify deviations from this baseline as anomalies, theoretically enabling the detection of evolving attack variants without labeled attack data. The ability of A-NIDS to generalize critically depends on the quality of the feature space representing network behavior. However, the requirement for feature spaces that encode attack-relevant semantics has received little attention and remains poorly understood. As a consequence, these systems still struggle to meet practical operational constraints (low false positive rates without compromising detection performance and generalization to attack variants). We identify two limitations in the current feature spaces. First, Out-of-Dimension Blindness, where features do not capture essential attack mechanism properties. Second, Attack Strategy Aggregation Failure, where features cannot encode composite attack behaviors. Moreover, we demonstrate that two SotA data-driven generalization frameworks (based on incremental and contrastive learning) cannot compensate for these feature-level shortcomings. To bridge this gap, we present KnowML, a framework that encodes attack domain knowledge directly into the feature space. For each attack family, our method employs LLMs to construct a corresponding Knowledge Graph (KG) from attack implementations. Symbolic reasoning is then applied over the KG to enumerate potential attack strategies and their compositions. The resulting Knowledge-Augmented Feature Space enables effective generalization even when trained exclusively on benign traffic, a capability beyond current approaches. Systematic empirical evaluations show that KnowML achieves up to 99% detection rates while maintaining false positive rates at or below 0.0137%, substantially outperforming contemporary feature-based baselines across diverse attack variants.

Submission history

From: Xin Fan Guo [view email]
[v1] Tue, 24 Jun 2025 17:08:58 UTC (400 KB)
[v2] Sun, 14 Jun 2026 20:42:06 UTC (811 KB)