惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
Scott Helme
Scott Helme
爱范儿
爱范儿
WordPress大学
WordPress大学
博客园 - 三生石上(FineUI控件)
阮一峰的网络日志
阮一峰的网络日志
博客园 - Franky
V
V2EX
腾讯CDC
博客园_首页
博客园 - 司徒正美
酷 壳 – CoolShell
酷 壳 – CoolShell
T
Tailwind CSS Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
小众软件
小众软件
J
Java Code Geeks
大猫的无限游戏
大猫的无限游戏
月光博客
月光博客
Microsoft Azure Blog
Microsoft Azure Blog
B
Blog
雷峰网
雷峰网
Stack Overflow Blog
Stack Overflow Blog
IT之家
IT之家
罗磊的独立博客
Recorded Future
Recorded Future
博客园 - 聂微东
O
OpenAI News
S
Secure Thoughts
Hacker News: Ask HN
Hacker News: Ask HN
S
Schneier on Security
Hacker News - Newest:
Hacker News - Newest: "LLM"
Y
Y Combinator Blog
C
Cyber Attacks, Cyber Crime and Cyber Security
Project Zero
Project Zero
宝玉的分享
宝玉的分享
K
Kaspersky official blog
N
Netflix TechBlog - Medium
T
The Exploit Database - CXSecurity.com
Google Online Security Blog
Google Online Security Blog
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Webroot Blog
Webroot Blog
云风的 BLOG
云风的 BLOG
Simon Willison's Weblog
Simon Willison's Weblog
C
Check Point Blog
D
Darknet – Hacking Tools, Hacker News & Cyber Security
L
LINUX DO - 热门话题
美团技术团队
L
Lohrmann on Cybersecurity

Datadog | The Monitor blog

Introducing our open source AI-native SAST Instrument and monitor Boomi integration flows with OpenTelemetry and Datadog Not all index scans are equal: How we cut query latency by over 99% Platform engineering metrics: What to measure and what to ignore Integrate Recorded Future threat intelligence with Datadog Cloud SIEM CI/CD security: threat modeling using a MITRE-style threat matrix CI/CD security: How to secure your GitHub ecosystem Ingress NGINX is EOL: A practical guide for migrating to Kubernetes Gateway API Operating agentic AI with Amazon Bedrock AgentCore and Datadog LLM Observability: Lessons from NTT DATA Introducing the Datadog Code Security MCP Capture and analyze custom heatmaps in Session Replay Understand session replays faster with AI summaries and smart chapters Monitor ClickHouse query performance with Datadog Database Monitoring How we designed empathetic alert sounds for on-call engineers Search and act across Datadog to resolve issues faster with Bits Assistant Measure the business impact of every product change with Datadog Experiments Analyzing round trip query latency Configuring JavaScript caches for better performance Introducing Bits AI Dev Agent for Code Security Datadog achieves ISO 42001 certification for responsible AI Monitor Nutanix clusters, hosts, and VMs with Datadog Monitor Juniper Mist in Datadog A new Host Map for modern infrastructure Annotate traces to improve LLM quality with Datadog LLM Observability What’s new in Cloud SIEM: AI-powered investigations, enhanced threat intelligence, and scalable security operations Explore Kubernetes with native OpenTelemetry data Monitor Oracle Fusion Cloud Applications with Datadog Announcing the Datadog Terraform provider v4.0.0 Scaling Kubernetes workloads on custom metrics How to design cloud environments for AI-powered threat analysis Monitor Aruba Central in Datadog How we centralize and remediate risks with Datadog Case Management Accelerate incident response with Datadog and ServiceNow Monitor your application and network load balancer logs Understanding Karpenter architecture for Kubernetes autoscaling Tools for collecting metrics and logs from Karpenter Monitor Karpenter with Datadog What your product data is actually saying Key metrics for monitoring Karpenter Securing Datadog’s platform in the AI age: The role of observability data Four ways engineering teams use the Datadog MCP Server to power AI agents Approaching your observability migration with the right mindset Meet the new Bits AI SRE: Deeper reasoning, twice as fast Key learnings from the 2026 State of DevSecOps study Use plain English to query your multi-cloud infrastructure in Resource Catalog Simplifying troubleshooting across the user journey with Datadog Synthetic Monitoring Protect your OCI resources with Datadog Cloud Security This Month in Datadog - February 2026 Amazon EC2 security: How misconfigured and public AMIs expand your cloud attack surface Enable end-to-end visibility into your Java apps with a single command Measure and improve mobile app startup performance with Datadog RUM Evaluating our AI Guard application to improve quality and control cost Identify untested code across every level of your codebase Make use of guardrail metrics and stop babysitting your releases Monitor Versa Networks SD-WAN performance in Datadog Improve performance and reliability with APM Recommendations Remediate transitive vulnerabilities faster with Datadog Software Composition Analysis Generate audit-ready vulnerability and compliance reports with Datadog Sheets Monitor Fortinet FortiManager performance in Datadog Improve test coverage across codebases with Datadog Code Coverage Move fast, don’t break things: Consistent testing standards at scale Enrich logs with ServiceNow CMDB context before routing to any SIEM or logging tool Monitor Lustre with Datadog Make faster, better product decisions with Datadog Product Analytics Surface and remediate runtime posture issues with Workload Protection Findings Protect agentic AI applications with Datadog AI Guard How to optimize JavaScript code with CSS Trace Google Pub/Sub workloads in Cloud Run with Datadog Detect human names in logs with ML in Sensitive Data Scanner How we cut our NLQ agent debugging time from hours to minutes with LLM Observability Debug PostgreSQL query latency faster with EXPLAIN ANALYZE in Datadog Database Monitoring Datadog acquires Propolis Unify and correlate frontend and backend data with retention filters Scale compliance across global frameworks with Datadog Cloud Security Monitor Arista VeloCloud SD-WAN performance with Datadog Building reliable dashboard agents with Datadog LLM Observability Simplify log collection and aggregation for MSSPs with Datadog Observability Pipelines Mitigation for Node.js denial-of-service vulnerability affecting Datadog APM Automate flaky test fixes with the Bits AI Dev Agent and Test Optimization How we built an AI SRE agent that investigates like a team of engineers Datadog integrations 2025 recap: Observability for AI, security, and hybrid cloud Design effective executive dashboards with Datadog Implement dbt data quality checks with dbt-expectations Bring faster visibility into AWS Lambda functions with remote instrumentation Troubleshoot faster with the GitLab Source Code integration in Datadog How Cambia Health Solutions saved $30,000 monthly with Cloud Cost Management and the Datadog Resource Catalog Normalize any logs for Cloud SIEM with Datadog's OCSF processor Optimizing Datadog at scale: Cost-efficient observability at Zendesk Detect, diagnose, and resolve network issues easily with CNM Network Health Connect engineering errors to user impact in early-stage products Cilium configuration for Kubernetes operations at scale Designing feedback loops for progressive delivery Ship features faster and safer with Datadog Feature Flags Choosing the right OpenTelemetry Collector distribution Route your monitor alerts with Datadog monitor notification rules Automate Cloud SIEM investigations with Bits AI Security Analyst Cloud threat detection: How to identify risky activity across control and data planes Collecting Kafka performance metrics Monitoring Kafka with Datadog Monitoring Kafka performance metrics
Monitor your Google Gemini apps with Datadog LLM Observability
Siddarth Dwivedi, Tom Sobolik · 2025-01-06 · via Datadog | The Monitor blog

Google’s comprehensive AI offering includes Vertex AI, a cloud-based platform for building and deploying AI applications, AI Studio, a web platform for quickly prototyping and testing AI applications, and Gemini, their multimodal model. Gemini offers advanced capabilities in image, code, and text generation and can be used to implement chatbot assistants, perform complex data analysis, generate design assets, and more. Large language models (LLMs) like Gemini are powerful, but reliance on them in your applications via APIs can make it difficult to understand your AI applications’ behavior and debug issues.

We are pleased to announce that Datadog LLM Observability natively integrates with Google Gemini, allowing you to monitor, troubleshoot, improve, and secure your Google Gemini LLM applications. LLM Observability also supports auto-instrumentation for Vertex AI applications. In this post, we’ll explore how you can use LLM Observability with your Gemini applications to:

Swiftly troubleshoot Gemini applications with end-to-end tracing

Datadog LLM Observability now offers automatic instrumentation for Google Gemini, detecting and tracing API calls made by your application without the need for manual annotations. This simplifies the setup process, allowing you to focus on instrumenting other parts of your LLM applications. You can use LLM Observability to view detailed, end-to-end traces of your LLM chain executions and quickly spot performance bottlenecks and errors. When tracing a Gemini API call, you can carefully inspect the input prompt and observe each of the steps your application took to form the final response—such as retrieval-augmented generation (RAG) requests or moderation filters. By looking at these intermediate steps, you can quickly discover the root cause of unexpected responses.

View traces for Google Gemini LLM application requests

The trace side panel also provides key operational metrics—latency and token usage—for each call your application makes to Google Gemini. This way, you can quickly determine whether a call was unusually slow or expensive and then dive into the spans to determine the root cause of any issues.

Improve response quality and safety of Gemini applications

Due to their non-deterministic nature, LLM applications can unpredictably hallucinate), generating inaccurate, irrelevant, or potentially toxic responses. User-facing LLM applications like chatbots are particularly susceptible to jailbreaking, posing significant risk of exposing sensitive information. Datadog LLM Observability provides out-of-the-box quality and safety evaluations that help you continuously evaluate every user interaction to track and mitigate these risks.

The trace side panel allows you to access these quality evaluations, which include checks like “Failure to Answer”, “Topic Relevancy”, and “Negative Sentiment”. Additionally, safety checks for “Input Toxicity” and “Prompt Injection” ensure that you are able to identify inappropriate and malicious interaction with your application and take necessary action.

View out-of-the-box quality and safety evaluations for Gemini applications

LLM Observability also uses Sensitive Data Scanner to actively redact personally identifiable information (PII) from prompt traces. This helps you refrain from storing this data in your monitoring system to maintain compliance with regulations like GDPR, and also makes it easier to detect when customer PII was inadvertently passed in an LLM call or shown to the user.

Get complete visibility and control over Gemini usage

Cost efficiency and performance are key concerns for today’s LLM applications. As AI teams rapidly expand their use of Gemini APIs to address increasingly complex scenarios, it is essential to effectively monitor requests, latencies, and token consumption.

In many use cases, LLM token consumption can be highly variable from prompt to prompt. Whether you’re using Gemini 1.5 Pro, 1.0 Pro, or 1.5 Flash, your costs will vary significantly depending on how your usage interacts with your model’s pricing structure. LLM Observability helps you track both token usage and estimated costs at the application, trace, and span level, so you can easily characterize the cost of your AI services and break down the most significant sources of token usage.

View Google Gemini cost insights in Datadog

To get a comprehensive view of your Gemini application’s performance and usage patterns, you can use the out-of-the-box Operational Insights dashboard. This dashboard collates key usage metrics, such as request volume, usage per model, tokens per call, call response time, and span error rate. By getting all these detailed signals in a unified view, you can quickly focus your investigation into sources of errors, latency, and unexpected usage patterns.

View Gemini metrics and monitors in Datadog with the out-of-the-box dashboard

Monitor Google Gemini with Datadog

LLM-based applications are incredibly powerful and unlock many new product opportunities, and Gemini is a highly capable solution for implementing LLMs in your apps. However, there remains a pressing need for granular visibility into model quality, safety, performance, and cost. By monitoring your Gemini applications using Datadog LLM Observability, you can form actionable insights about their behavior from a unified interface. LLM Observability is now generally available for all Datadog customers—see our documentation for more information about how to get started.

If you’re brand new to Datadog, sign up for a free trial.