惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

酷 壳 – CoolShell
酷 壳 – CoolShell
H
Hacker News: Front Page
P
Palo Alto Networks Blog
T
ThreatConnect
Apple Machine Learning Research
Apple Machine Learning Research
博客园_首页
T
True Tiger Recordings
P
Privacy & Cybersecurity Law Blog
B
Blog
IT之家
IT之家
Last Week in AI
Last Week in AI
F
Full Disclosure
Hacker News: Ask HN
Hacker News: Ask HN
C
Comments on: Blog
Microsoft Azure Blog
Microsoft Azure Blog
C
Cybersecurity and Infrastructure Security Agency CISA
Microsoft Security Blog
Microsoft Security Blog
博客园 - 【当耐特】
N
News and Events Feed by Topic
NISL@THU
NISL@THU
腾讯CDC
雷峰网
雷峰网
Security Latest
Security Latest
李成银的技术随笔
M
Microsoft Research Blog - Microsoft Research
L
LangChain Blog
L
Lohrmann on Cybersecurity
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
C
Check Point Blog
Y
Y Combinator Blog
Recent Announcements
Recent Announcements
博客园 - Franky
N
News | PayPal Newsroom
V
V2EX
A
About on SuperTechFans
The Register - Security
The Register - Security
月光博客
月光博客
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
Google Online Security Blog
Google Online Security Blog
MyScale Blog
MyScale Blog
Cisco Talos Blog
Cisco Talos Blog
Vercel News
Vercel News
WordPress大学
WordPress大学
C
Cyber Attacks, Cyber Crime and Cyber Security
The Hacker News
The Hacker News
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
爱范儿
爱范儿
A
Arctic Wolf
L
LINUX DO - 最新话题
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More

Datadog | The Monitor blog

Reduce CVE noise with OpenVEX assessments in Datadog How we made a SQL query optimization agent 59% more accurate using autoresearch and LLM Observability How to audit and clean up monitors effectively Diagnose slow PostgreSQL queries faster with explain plan correlation Explore Datadog metrics with Natural Language Queries Toto 2.0: Time series forecasting enters the scaling era Simplify micro-frontend observability with Datadog RUM Attribute AI costs across providers with Datadog Cloud Cost Management Diagnose and resolve database performance issues faster with Database Investigator Datadog for Government achieves FedRAMP® High certification Analyze cloud costs with flexible spreadsheets in Datadog Sheets Inside Datadog’s AI Research Lab: Meet two PhD candidates behind Toto Connect triage and investigation in a single workflow with Datadog Cloud SIEM This Month in Datadog - April 2026 Monitor and optimize Supabase query performance with Datadog Database Monitoring Add dynamically updating context to logs with Reference Tables and Observability Pipelines Introducing ARFBench: A time series question-answering benchmark based on real incidents Test network paths with TCP, UDP, and ICMP in Datadog The product signal latency gap slowing your growth How to investigate cloud credential compromise with Bits AI Security Analyst Evaluate, optimize, and secure your Google Cloud AI stack with Datadog Turn developer feedback into operational insight with Datadog Forms and Sheets Identify and fix code issues faster with Datadog’s Azure DevOps Source Code integration Steganography at scale: Embedding share URLs in Datadog widget screenshots Bringing observability data hosting to the UK on AWS Centralize observability management with Datadog Governance Console Every team should be A/B testing Manage service tracing across hosts with Single Step Instrumentation rules Route OTel data from AI apps to ClickHouse and Datadog using Observability Pipelines Spotting CI/CD misconfigurations before the bots do: Securing GitHub Actions with Datadog IaC Security Detect runtime threats in Python Lambda functions with Datadog AAP Offline evaluation for AI agents: Best practices Introducing our open source AI-native SAST Instrument and monitor Boomi integration flows with OpenTelemetry and Datadog Not all index scans are equal: How we cut query latency by over 99% Platform engineering metrics: What to measure and what to ignore Integrate Recorded Future threat intelligence with Datadog Cloud SIEM CI/CD security: threat modeling using a MITRE-style threat matrix CI/CD security: How to secure your GitHub ecosystem Ingress NGINX is EOL: A practical guide for migrating to Kubernetes Gateway API How we built a real-world evaluation platform for autonomous SRE agents at scale Operating agentic AI with Amazon Bedrock AgentCore and Datadog LLM Observability: Lessons from NTT DATA Introducing the Datadog Code Security MCP Capture and analyze custom heatmaps in Session Replay Understand session replays faster with AI summaries and smart chapters Monitor ClickHouse query performance with Datadog Database Monitoring How we designed empathetic alert sounds for on-call engineers Search and act across Datadog to resolve issues faster with Bits Assistant Measure the business impact of every product change with Datadog Experiments Analyzing round trip query latency Configuring JavaScript caches for better performance Introducing Bits AI Dev Agent for Code Security Datadog achieves ISO 42001 certification for responsible AI Monitor Nutanix clusters, hosts, and VMs with Datadog Monitor Juniper Mist in Datadog A new Host Map for modern infrastructure When upserts don't update but still write: Debugging Postgres performance at scale Annotate traces to improve LLM quality with Datadog LLM Observability What's new in Cloud SIEM: AI-powered investigations, enhanced threat intelligence, and scalable security operations Explore Kubernetes with native OpenTelemetry data Monitor Oracle Fusion Cloud Applications with Datadog Announcing the Datadog Terraform provider v4.0.0 Scaling Kubernetes workloads on custom metrics How to design cloud environments for AI-powered threat analysis Monitor Aruba Central in Datadog How we centralize and remediate risks with Datadog Case Management Accelerate incident response with Datadog and ServiceNow Monitor your application and network load balancer logs Understanding Karpenter architecture for Kubernetes autoscaling Tools for collecting metrics and logs from Karpenter Monitor Karpenter with Datadog What your product data is actually saying Key metrics for monitoring Karpenter Securing Datadog's platform in the AI age: The role of observability data Closing the verification loop: Observability-driven harnesses for building with agents When an AI agent came knocking: Catching malicious contributions in Datadog’s open source repos Closing the verification loop, Part 2: Fully autonomous optimization Four ways engineering teams use the Datadog MCP Server to power AI agents Approaching your observability migration with the right mindset Meet the new Bits AI SRE: Deeper reasoning, twice as fast Designing MCP tools for agents: Lessons from building Datadog's MCP server Key learnings from the 2026 State of DevSecOps study Use plain English to query your multi-cloud infrastructure in Resource Catalog Simplifying troubleshooting across the user journey with Datadog Synthetic Monitoring Protect your OCI resources with Datadog Cloud Security This Month in Datadog - February 2026 Fine-tune Toto for turbocharged forecasts Amazon EC2 security: How misconfigured and public AMIs expand your cloud attack surface Enable end-to-end visibility into your Java apps with a single command Measure and improve mobile app startup performance with Datadog RUM Evaluating our AI Guard application to improve quality and control cost Identify untested code across every level of your codebase Make use of guardrail metrics and stop babysitting your releases Monitor Versa Networks SD-WAN performance in Datadog How we reduced the size of our Agent Go binaries by up to 77% Improve performance and reliability with APM Recommendations Remediate transitive vulnerabilities faster with Datadog Software Composition Analysis Generate audit-ready vulnerability and compliance reports with Datadog Sheets Monitor Fortinet FortiManager performance in Datadog Improve test coverage across codebases with Datadog Code Coverage
How Cambia Health Solutions saved $30,000 monthly with Cloud Cost Management and the Datadog Resource Catalog
2025-12-30 · via Datadog | The Monitor blog

This guest blog post is authored by Kevin Francis, a Platform Engineering Manager at Cambia Health Solutions

As part of the platform engineering team at Cambia Health Solutions, my team and I work to provide the modern, reliable infrastructure that enables application development teams to create and scale personalized health care services. In 2024, we supported our organization as it merged two internal divisions to reduce costs and simplify operations.

In this post, we’ll tell the story of how Datadog helped us successfully merge these divisions and significantly optimize their cloud costs. We’ll show you how Cloud Cost Management (CCM) and the Resource Catalog helped us uncover and implement cost-savings opportunities. We applied these insights to standardize our Amazon Relational Database Service (RDS) infrastructure and lay the groundwork for an improved Reserved Instance (RI) purchasing strategy, leading to nearly $30,000 in monthly RDS savings.

Unifying observability

Before the merger, the divisions used separate observability platforms, resulting in fragmented data across teams and AWS accounts. Infrastructure metrics, logs, cloud usage, cost information, and resource metadata were not visible in a centralized location. This fragmentation hindered operational awareness, and we struggled to find and mitigate issues such as unencrypted S3 buckets, outdated Python runtimes, and cost inefficiencies across our cloud databases.

Unified observability was a critical goal of the merger, both to simplify operations and to enable cost optimization. Without a consolidated view of resources and spending, we could not identify viable, impactful cost reduction opportunities.

Identifying high-impact RDS savings with CCM

By standardizing on Datadog across the newly integrated teams, we gained a complete, cross-account view of metrics, cost data, and resource inventory. CCM provided insight into historical usage and spend, while Resource Catalog supplied a unified inventory of cloud resources across all AWS accounts. With visibility into our full infrastructure footprint, CCM identified nearly $30,000 per month in potential savings through RI purchases for RDS, which had long been one of our most significant sources of cloud costs.

This recommendation represented a particularly high-impact opportunity—an RI purchasing strategy that would cover instances across multiple teams and AWS accounts, and that targeted one of the largest contributors to our cloud spending. It highlighted the breadth of RDS instance classes we used. With workloads distributed across many classes, any new RI purchases risked locking us into long-term commitments with low utilization. Resource Catalog reinforced this insight by showing that our environment included underutilized instances. To improve our RI coverage and realize the cost-savings potential of the merger, the teams needed to narrow their RDS footprint to a small number of standard instance classes.

Standardizing the RDS fleet to improve RI coverage

Acting on the recommendation required more than simply purchasing RIs. We first needed to ensure our RDS infrastructure was unified enough for those reservations to apply effectively. We focused on modernizing and consolidating our RDS fleet. Resource Catalog played a critical role by providing a unified, detailed view of all RDS instances, grouped by instance class, generation, engine type, and size.

This analysis showed us that the majority of our RDS usage relied on memory-optimized instance types such as db.r6i and db.r6g. We chose to standardize on the db.r6g instance class, which, due to its Graviton processor, offers excellent price-performance.

Migrating to db.r6g required coordination with workload owners to validate application performance under the new instance type and confirm migration readiness. Once the migrations were complete, we had a fully standardized RDS fleet across accounts—an essential prerequisite to making RI purchases that would span the entire organization.

Centralizing RI purchasing to maximize cost savings

Standardizing our RDS compute on a smaller set of instance types enabled us to simplify our RI purchasing and governance by covering more workloads with fewer reservations. As an even greater benefit, it helped us improve RI utilization by sharing discounts across accounts and using flexible instance sizes within each reservation.

Discount sharing

We also benefited from discount sharing, which distributes RI discounts across all accounts in an AWS Organization. Before the merger, each division purchased RIs independently, which contributed to inconsistent cost governance and underutilized reservations. This decentralized approach also led to gaps in RI coverage, which meant paying undiscounted costs for some of our RDS compute. Post-merger, CCM’s cross-account insight enabled us to create a centralized RI purchasing strategy that applied usage from multiple accounts, improving our RI utilization and cutting our RDS costs.

Size flexibility

Size flexibility allows a single RI to apply to any instance size within the same instance family, which helps us maintain high utilization even as workloads evolve. Because usage from all instance sizes in that family counts toward the same reservation, teams can choose the RDS instance sizes that best meet their needs without jeopardizing RI coverage. For example, both db.r6g.xlarge and db.r6g.large instances apply to the same RI, ensuring that we receive discounted pricing regardless of size differences. Standardizing on a smaller number of instance classes amplified this benefit by concentrating usage in a smaller set of families, making RI utilization both predictable and consistently high.

Monitoring savings with Datadog dashboards

To track the impact of our new reservations, we used Datadog dashboards to visualize month-over-month cost changes and share AWS cost data across the organization. Teams can explore costs by division or workload, see actual RI utilization and savings, compare results against CCM projections, and quickly identify new optimization opportunities.

Datadog dashboard showing month-over-month RDS cost changes, RI utilization, and cost savings opportunities.

Preventing drift with Resource Catalog

Resource Catalog remains a key part of our optimization workflow even after the RI purchase, giving teams a way to confirm that instances stay properly configured. This ongoing visibility helps prevent drift back into a fragmented mix of instance classes that would undermine our RI utilization.

Datadog Resource Catalog view showing current RDS instance configurations.

Achieving results and applying the framework forward

Our initiative to unify observability and cloud governance after the merger created the visibility needed to uncover significant RDS savings. Guided by CCM’s recommendations, we standardized our RDS infrastructure, modernized to Graviton-powered instances, and centralized our RI purchasing strategy, saving close to $30,000 per month on RDS costs.

This combination of recommendations, modernization, and centralized purchasing has created a repeatable approach for optimizing cloud costs, and we’re now applying this methodology to our DynamoDB and OpenSearch resources.

To learn more about how you can achieve similar visibility and savings, see the documentation on Cloud Cost Management and the Resource Catalog. If you haven’t yet started using Datadog, sign up for a 14-day free trial.