惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

SecWiki News
SecWiki News
D
Darknet – Hacking Tools, Hacker News & Cyber Security
I
Intezer
月光博客
月光博客
Cyberwarzone
Cyberwarzone
雷峰网
雷峰网
Security Latest
Security Latest
量子位
博客园 - 聂微东
小众软件
小众软件
NISL@THU
NISL@THU
C
Cisco Blogs
The GitHub Blog
The GitHub Blog
C
Cybersecurity and Infrastructure Security Agency CISA
T
Tor Project blog
Y
Y Combinator Blog
V
V2EX
博客园 - 三生石上(FineUI控件)
P
Privacy & Cybersecurity Law Blog
F
Full Disclosure
Cisco Talos Blog
Cisco Talos Blog
Microsoft Security Blog
Microsoft Security Blog
S
Security @ Cisco Blogs
The Register - Security
The Register - Security
Google DeepMind News
Google DeepMind News
J
Java Code Geeks
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
IT之家
IT之家
Webroot Blog
Webroot Blog
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
aimingoo的专栏
aimingoo的专栏
腾讯CDC
S
Schneier on Security
L
LINUX DO - 最新话题
Latest news
Latest news
Simon Willison's Weblog
Simon Willison's Weblog
罗磊的独立博客
A
Arctic Wolf
MyScale Blog
MyScale Blog
云风的 BLOG
云风的 BLOG
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
S
Secure Thoughts
S
Securelist
Stack Overflow Blog
Stack Overflow Blog
T
Troy Hunt's Blog
Recorded Future
Recorded Future
I
InfoQ
The Cloudflare Blog
H
Heimdal Security Blog
Hugging Face - Blog
Hugging Face - Blog

Wiz Blog | RSS feed

Meet Wiz for M365: Bringing SaaS into the Security Graph How to Harden GitHub Actions: An Updated Guide Bringing Security Visibility to Vercel with Wiz Axios NPM Distribution Compromised in Supply Chain Attack Tracking TeamPCP: Investigating Post-Compromise Attacks Seen in the Wild The Wiz Blue Agent, now Generally Available Beyond the Badge: What Achieving Microsoft’s Certified Software Designation Means for Your Cloud Security Introducing the Green Agent: AI-Powered Remediation for the Cloud Three’s a Crowd: TeamPCP trojanizes LiteLLM in Continuation of Campaign KICS GitHub Action Compromised: TeamPCP Strikes Again in Supply Chain Attack Introducing the Wiz Red Agent- AI-Powered Attacker Introducing Wiz AI Application Protection Platform (AI-APP) Introducing Wiz Agents & Workflows: Security at the Speed of AI AI Runtime Threat Detection: From Input to Real-World Impact Trivy Compromised: Everything You Need to Know about the Latest Supply Chain Attack It’s Official: Wiz Joins Google Understanding and Reducing AI Risk in Modern Applications Introducing Wiz Tenant Manager: Multi-Tenant Management for Federated Organizations The Agile FedRAMP Playbook, Part 4: Reactive Risk Management through Enriched Incident Response Wiz Achieves CPSTIC Certification in Spain Seeing AI Clearly: Building Visibility Across Modern AI Applications The Agile FedRAMP Playbook, Part 3: Preventative Risk Management by building Secure by Design Wiz Leads the 2026 Latio Application Security Report with awards in 4 categories Building an Agentic Cloud Security Ecosystem: A Reference Architecture with Wiz MCP and Infosys Cyber Next The Agile FedRAMP Playbook, Part 2: Proactive Risk Management with Continuous Monitoring Cloud-native Security for your Windows environment: Announcing the Wiz Runtime Sensor for Windows Would You Click ‘Accept’? Automatically detecting malicious Azure OAuth applications using LLMs Wiz Named a Leader in The Forrester Wave™: Cloud Native Application Protection Solutions, Q1 2026 From Detection to Remediation: It’s Time to Rethink AppSec Around Exploitability and Root Cause Fixes The Agile FedRAMP Playbook, Part 1: Why Risk is Your Best Starting Point Introducing AI Cyber Model Arena: A Real-World Benchmark for AI Agents in Cybersecurity Wiz + Spotify Backstage: Security at the Developer’s Desk Building AI Security Together: New Ways to Partner with Wiz for AI Security in 2026 Hacking Moltbook: The AI Social Network Any Human Can Control The Year in Wiz Research: 2025 Most Read Blogs WizExtend is Here: AI and Cloud Security Insights in Your Daily Workflow From Detection to Remediation: Wiz in Your JetBrains IDE Agentic Browser Security: 2025 Year-End Review CodeBreach: Infiltrating the AWS Console Supply Chain and Hijacking AWS GitHub Repositories via CodeBuild A 90-Day Action Plan to Turn Resolutions into Results with Wiz Introducing the Wiz Partner Alliance: A New Chapter for Partner Success Preparing for Post-Quantum Cryptography Wiz Recognized as a 2025 Customers’ Choice in the Gartner® Peer Insights™ Voice of the Customer for CNAPP Expanding the Zero Critical Club to set a new standard for AppSec and SecOps teams Snipping the Long Tail of Shai-Hulud 2.0 Protecting Against Zero-Day Vulnerabilities with SOC-Level ASM Alert MongoBleed (CVE-2025-14847) exploited in the wild: everything you need to know The Kenna Transition: Your Strategic Shift to Exposure Management From MCP to Vibe Coding: Full Endpoint Visibility in Wiz AI Security Bringing Oracle Cloud Identity to Wiz Zero‑Days in the Age of AI: Behind the Scenes of ZeroDay.cloud 2025, with a Record High of CVEs in Critical Cloud Infra Gogs 0-Day Exploited in the Wild Code to Cloud Attacks: From Github PAT to Cloud Control Plane Top AWS re:Invent Announcements for Security Teams in 2025 React2Shell: Technical Deep-Dive & In-the-Wild Exploitation of CVE-2025-55182 React2Shell (CVE-2025-55182): Everything You Need to Know About the Critical React Vulnerability Wiz Product Announcements at re:Invent 2025: Expanding Visibility from Code to Cloud Introducing Wiz SAST: Where Code Risk Meets Cloud Context Wiz Becomes Fastest Security ISV to Reach $1 Billion in AWS Marketplace Lifetime Sales It's Here! Wiz Exposure Management is Now GA Shai-Hulud 2.0 Aftermath: Trends, Victimology and Impact Service Catalog is Here: Expand Risk Visibility for Your Service and Its Dependencies, Simplify Issue Ownership WizOS: Powering Secured Image Adoption with AI 3 OAuth TTPs Seen This Month — and How to Detect Them with Entra ID Logs Mastering Software Governance with Hosted Technologies Inventory Shai-Hulud 2.0 Supply Chain Attack: 25K+ Repos Exposing Secrets Get Certified on Wiz Defend for Threat Detection and Response Blueprint for Security: A Guide to Code, Governance, and Response Frameworks Google Unified Security Recommended Program Names Wiz Among First 3 Strategic Partners Introducing Posture Issues: Transform Security Findings into Actionable Outcomes Empower and Accelerate Your SOC with the Blue Agent Exposure Report: 65% of Leading AI Companies Found with Verified Secret Leaks Wizdom 2025 Product Announcements: Extending the Cloud Operating Model When AI Becomes the Heart of Security: Powering a Future You Can Trust AI-Powered Wiz: From Agents to Everyday Intelligence Defend Agentless Workload Detection: Bringing Visibility to Blind Spots in Threat Detection Securing AI Agents with Wiz AI-SPM Introducing Wiz ASM: Context-Driven Attack Surface Management Securing Critical Infrastructure in the Cloud Era: A Policy and Technology Blueprint How CISOs Should Plan Security Budgets for 2026 Beyond the Checkbox: How Wiz Transforms SOC 2 into a Security Powerhouse Bringing Visibility to Kubernetes: Unified Inventory and Network Insight The Foundation Modern AppSec Is Still Missing: Code to Cloud, Rebuilt the Right Way Dismantling a Critical Supply Chain Risk in VSCode Extension Marketplaces Introducing HoneyBee: How We Automate Honeypot Deployment for Threat Research RediShell: Critical Remote Code Execution Vulnerability (CVE-2025-49844) in Redis, 10 CVSS score Defending against database ransomware attacks AI Security 101: Mapping the AI Attack Surface Introducing zeroday.cloud: First-of-its-kind cloud and AI hacking competition Unifying Cloud Risk and Network Defense: Wiz and Check Point The emerging use of malware invoking AI Wiz achieves FedRAMP High authorization Wiz + HCP Terraform: Close the IaC-to-Cloud Infrastructure Security Gap IMDS Abused: Hunting Rare Behaviors to Uncover Exploits Beyond CVEs: The Exploitation of Everyday Misconfigurations Wiz Research Discovers One in Five Organizations Exposed to Systemic Risks in Vibe-Coded Applications - Here's How to Secure Them Introducing Wiz Incident Response: Your Expert Partner for Cloud Security Incidents Shai-Hulud: Ongoing Package Supply Chain Worm Delivering Data-Stealing Malware DORA Compliance in the Cloud Era: Insights from Deloitte and Wiz How Wiz Customers like Brex and FICO See AI Changing Security
AskAI – Text to Security Graph Query
Daniel Lazarev, Erez Harush · 2024-10-23 · via Wiz Blog | RSS feed

TL; DR 

In this blog post, we take you through our journey of developing a user-friendly text-to-query search engine for Wiz's Security Graph. We’ll discuss the challenges we faced and the innovative solutions we implemented to make complex cybersecurity data accessible to everyone. Whether you're a tech enthusiast or new to the field, we break it down in a way that’s easy to follow, highlighting how we leveraged advanced technologies to simplify the process of querying intricate cloud security data. 

Intro 

These days, LLMs seem to solve almost everything, but some tasks still throw curveballs—especially when the model hasn’t been trained on the topic. There’s a lot of ways to fill those gaps, so in this post, we will explain from concept to production how our engineering and research teams handled building a text search engine over Wiz’s unique Security Graph.  

Challenges of querying over the Security Graph 

In the cybersecurity domain, it's common to encounter custom query languages designed to interact with specialized databases and platforms. These languages enable security professionals to perform complex queries tailored to their organization's specific needs. However, mastering these custom query languages can be daunting due to their complexity and steep learning curve. Even experienced professionals may find it challenging to construct queries that fully leverage the capabilities of these systems. 

Enter the Wiz Security Graph— a powerful cybersecurity tool that provides a comprehensive view of an organization's cloud infrastructure and potential security risks. It leverages advanced data analysis and machine learning to map relationships between various cloud assets, configurations, and vulnerabilities across multiple platforms. Implemented as a graph database without a structured schema, the Security Graph presents numerous combinations and paths. 

As cloud ecosystems grow increasingly complex, security teams often struggle to efficiently search for the most critical and relevant data within their vast infrastructure. To navigate this complexity, we've developed a specialized Wiz Query Language for querying and searching over the graph. But even with a handy UI, getting to the heart of the data still poses challenges, even for graph pros. 

Here’s an example query (JSON) for “virtual machines with unencrypted disks”: 

Our method, while specifically tailored to Wiz's Security Graph Query Language, offers a generalizable approach applicable to other custom security query languages. By leveraging LLMs and advanced data retrieval techniques, we aim to simplify the process of constructing complex queries. This approach makes querying more accessible to users who may not be familiar with the intricacies of custom query syntaxes, ultimately enhancing efficiency and effectiveness in the cybersecurity domain. 

  However, the dynamic relationships between entities, possible paths of vertices and edges, constraints, and the constant addition of new technologies, vulnerabilities, and features pose significant challenges to an LLM—especially when tasked with generating a query JSON it isn't familiar with. Additionally, the continuous updates of cloud resources, attack vectors, malware, and more create challenges in keeping pace with the ever-changing cloud security domain. 

Therefore, we adopted an LLM-based approach to simplify the workflow with the graph for both customers and internal Wiz users. This method not only addresses the complexities of the Wiz Security Graph but also offers insights into how similar challenges can be tackled in other custom security query languages. 

Choosing the Right LLM: Sonnet 3.5 via Amazon Bedrock 

 One of the critical decisions in our approach was selecting the appropriate LLM that balances performance, cost, and latency. We chose Anthropic’s Sonnet 3.5 accessed through Amazon Bedrock as our LLM. It gave us the perfect balance of speed, performance, and cost-efficiency, making it an ideal choice for our users.

At Anthropic, we're focused on building models that can effectively balance performance, cost, and latency across real-world use cases. Collaborating with Wiz on Sonnet 3.5 through Amazon Bedrock highlights how these models can be leveraged to simplify complex queries while maintaining high levels of accuracy and operational efficiency. We're excited to see Sonnet 3.5 powering innovative applications in cybersecurity.

Garvan Doyle, Applied AI Lead at Anthropic

It's all about the prompt 

The secret to getting the LLM to play nice with our Security Graph? The right prompts. 

Instead of just asking the model to generate queries from scratch, we guided it with carefully designed prompts. This meant giving it context and constraints. Here’s how we did it: 

  • Zero-shot learning: We asked the LLM to generate a query based on a natural language description without any prior examples. 

  • Few-shot learning: In addition, we provided examples of successful queries and then asked the LLM to build on those. 

  • RAG (Retrieval-Augmented Generation): We didn’t rely on the LLM alone. Using RAG, we retrieved the most relevant data from our knowledge base and fed it to the model, ensuring it had all the context it needed to generate the perfect query. 

For example, when a user types “show me all VMs with unpatched vulnerabilities,” we first pull related metadata from our security graph and enrich it with LLM-generated suggestions. The result? A tight, accurate query that digs up exactly what the user is looking for—fast. 

Divide-and-conquer approach 

To ensure the model constructs queries that are both syntactically correct and contextually accurate, we needed to break down the process into manageable steps. Instead of just providing a static explanation of query syntax and structure, we offered the model the key building blocks to guide it. Here’s how we approached it: 

  1. From the user's input, we retrieve the most similar query examples using cosine similarity with embeddings. 

  2. We then extract all the relevant graph entities from these queries and retrieve their properties and descriptions. 

  3. Next, we gather all the allowed operators and value types associated with those entities. 

  4. For each property, we retrieve the allowed values. 

  5. We also retrieve descriptions and unique IDs for all relevant technologies. 

  6. Finally, we calculate possible paths and allowed relationships based on the retrieved information. 

Each of these steps is crucial, as they are interdependent. An error in one step could significantly impact the accuracy of the final query. The structure of the prompt, which includes all of this information, is also critical. Since it’s a large amount of context to provide to the model, we went through several iterations of prompt engineering. We even collaborated with Anthropic's research team to ensure we were on the right path.

Initial review 

We started by compiling a benchmark dataset of 200 query examples, each paired with its corresponding JSON. The initial results showed 80% accuracy, verified through exact matches or manual review. This performance gave us the confidence to move forward with a feature preview, sharing it with both internal teams and external users to gather feedback in real-world scenarios. 

However, once data from live environments started coming in, we quickly encountered gaps between our assumptions and reality. The way real-world queries were phrased—terminology, query structures, and even the distribution of topics—diverged from the examples in our demonstration database.   

To address these discrepancies, we immediately began refining our prompts and validation process. While these improvements led to slight gains in accuracy, it became clear that more extensive research was necessary. We needed a deeper understanding of the nature of real-world questions, so we could fine-tune the data and examples provided to the model. 

Data First approach 

Our approach centers around the initial retrieval of examples that closely match the semantic meaning of the user's input. The foundation of our model’s effectiveness lies in constructing a robust dataset filled with diverse examples that reflect the real-life questions users are likely to ask. 
 

Take, for instance, the query, “Find me the AWS service account AROA111222333444ABCDE.” While this request is straightforward, users expect to retrieve the same results if they input just “AROA111222333444ABCDE.” By providing a wealth of examples in our database, we enable the model to infer that this string is indeed an AWS entity. 
 

Additionally, we must remain vigilant about the constant changes within Wiz, including new vulnerabilities, emerging resources, and ongoing developments within our security graph. 

We also recognize the importance of continuously reviewing our current production models alongside any new iterations. This ongoing evaluation process drives us to enhance our examples database, refine our retrieval-augmented generation (RAG) techniques, and optimize our prompt structures. 

Clustering using UMAP & DBSCAN  

To gain meaningful insights into the topics being queried through our LLM search engine and identify potential blind spots, we embarked on a clustering analysis of anonymized customer requests. Importantly, the anonymization process preserved the semantic meaning of each query. 

We began by embedding all input queries into vectors that represent their content. Utilizing UMAP (Uniform Manifold Approximation and Projection), a powerful dimensionality reduction algorithm, we transformed our high-dimensional data into a lower-dimensional space. This technique not only simplifies visualization but also maintains the local and global structures of the data, allowing us to uncover hidden patterns. With UMAP, clusters within our data became much more apparent. 

Once we distilled our data into a two-dimensional representation, we employed DBSCAN (Density-Based Spatial Clustering of Applications with Noise) to identify approximately 150 unique topics. To further analyze these clusters, we randomly selected 50 queries from each and leveraged an LLM to categorize them. For example, we might label a cluster as “EC2 Related Queries.” This categorization enabled us to examine the distribution of topics and identify areas that our example database might not adequately cover.

Query generation  

With a deeper understanding of the topics and nuances of real query requests, we initiated a two-pronged approach to enrich our example database: 

  1. Manual Insertion: Our dedicated team of researchers, product managers, and security graph experts meticulously crafted query examples that closely mirror the tone and style of actual user inquiries. This hands-on effort ensures that the examples resonate with the real-world context of our users. 

  2. LLM-Generated Queries: Leveraging insights from our clustering analysis, we utilized LLMs to generate new query descriptions. Each topic generated a set of representative queries that reflected diverse real-life scenarios, closely aligned with previously observed examples. To ensure accuracy, our researchers and graph experts completed the corresponding JSON structures, creating a comprehensive dataset. 

In addition, we implemented an automated query generation process to keep pace with ongoing developments. This system scans our graph schema to generate generic queries that reflect new features and entities, ensuring that every addition is represented in our example database. 

These concerted efforts significantly increased the volume of relevant examples and allowed us to eliminate those that caused confusion for the LLM during JSON generation. As a result, we observed a substantial boost in accuracy, supported by positive feedback from users and expert validation through random sampling of requests. 

Re-Ranking Using MMR 

While our extensive research has significantly improved the accuracy of our example database, we recognize that there’s always room for refinement. The initial retrieval process can yield examples that, although relevant, may be overly similar or redundant. To enhance the quality of our generated responses, it’s essential to provide the model with not just relevant information but also a diverse array of inputs. This is where re-ranking becomes crucial. 

Various techniques exist for re-ranking, but we must balance cost and performance, as some methods involve employing another LLM, which can lead to increased expenses and latency. To strike this balance, we adopted a straightforward yet effective approach. 

Maximal Marginal Relevance (MMR) is an information retrieval technique designed to optimize the selection of documents or pieces of information by balancing relevance and diversity. In the context of our Retrieval-Augmented Generation (RAG) models, MMR enhances the retrieval process by ensuring that the selected query examples are not only pertinent to the user's query but also varied, effectively reducing redundancy. 

For example, consider the input: 

“A virtual machine containing sensitive data and an AI model that is exposed to the internet.” 

Without MMR, we might retrieve the following examples: 

With MMR, however, the retrieved examples would look like this: 

The MMR-retrieved examples showcase greater variance and, most importantly, introduce another example that provides our LLM with relevant context regarding AI models. Without MMR, that critical context would be absent. 

Summary 

To elevate the accuracy of generating queries over the Wiz Security Graph with LLMs, we honed in on enhancing our examples database. Our approach involved a three-pronged strategy: manually adding query examples sourced from our team of experts, generating new descriptions through LLMs based on the clustering of real-world queries, and automatically creating generic queries for new graph features. 

 To ensure the model benefitted from both diversity and relevance, we integrated Maximal Marginal Relevance (MMR) into our example selection process. Furthermore, we employed UMAP and DBSCAN to cluster anonymized user inputs, enabling us to pinpoint underrepresented topics and adapt our data accordingly. 

 This comprehensive approach has significantly bolstered our model's accuracy, ensuring it resonates more closely with the genuine queries posed by our users in the field.