惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

酷 壳 – CoolShell
酷 壳 – CoolShell
H
Hacker News: Front Page
P
Palo Alto Networks Blog
T
ThreatConnect
Apple Machine Learning Research
Apple Machine Learning Research
博客园_首页
T
True Tiger Recordings
P
Privacy & Cybersecurity Law Blog
B
Blog
IT之家
IT之家
Last Week in AI
Last Week in AI
F
Full Disclosure
Hacker News: Ask HN
Hacker News: Ask HN
C
Comments on: Blog
Microsoft Azure Blog
Microsoft Azure Blog
C
Cybersecurity and Infrastructure Security Agency CISA
Microsoft Security Blog
Microsoft Security Blog
博客园 - 【当耐特】
N
News and Events Feed by Topic
NISL@THU
NISL@THU
腾讯CDC
雷峰网
雷峰网
Security Latest
Security Latest
李成银的技术随笔
M
Microsoft Research Blog - Microsoft Research
L
LangChain Blog
L
Lohrmann on Cybersecurity
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
C
Check Point Blog
Y
Y Combinator Blog
Recent Announcements
Recent Announcements
博客园 - Franky
N
News | PayPal Newsroom
V
V2EX
A
About on SuperTechFans
The Register - Security
The Register - Security
月光博客
月光博客
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
Google Online Security Blog
Google Online Security Blog
MyScale Blog
MyScale Blog
Cisco Talos Blog
Cisco Talos Blog
Vercel News
Vercel News
WordPress大学
WordPress大学
C
Cyber Attacks, Cyber Crime and Cyber Security
The Hacker News
The Hacker News
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
爱范儿
爱范儿
A
Arctic Wolf
L
LINUX DO - 最新话题
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More

Datadog | The Monitor blog

Reduce CVE noise with OpenVEX assessments in Datadog How we made a SQL query optimization agent 59% more accurate using autoresearch and LLM Observability How to audit and clean up monitors effectively Diagnose slow PostgreSQL queries faster with explain plan correlation Explore Datadog metrics with Natural Language Queries Toto 2.0: Time series forecasting enters the scaling era Simplify micro-frontend observability with Datadog RUM Attribute AI costs across providers with Datadog Cloud Cost Management Diagnose and resolve database performance issues faster with Database Investigator Datadog for Government achieves FedRAMP® High certification Analyze cloud costs with flexible spreadsheets in Datadog Sheets Inside Datadog’s AI Research Lab: Meet two PhD candidates behind Toto Connect triage and investigation in a single workflow with Datadog Cloud SIEM This Month in Datadog - April 2026 Monitor and optimize Supabase query performance with Datadog Database Monitoring Add dynamically updating context to logs with Reference Tables and Observability Pipelines Introducing ARFBench: A time series question-answering benchmark based on real incidents The product signal latency gap slowing your growth Test network paths with TCP, UDP, and ICMP in Datadog Turn developer feedback into operational insight with Datadog Forms and Sheets How to investigate cloud credential compromise with Bits AI Security Analyst Evaluate, optimize, and secure your Google Cloud AI stack with Datadog Bringing observability data hosting to the UK on AWS Identify and fix code issues faster with Datadog’s Azure DevOps Source Code integration Steganography at scale: Embedding share URLs in Datadog widget screenshots Every team should be A/B testing Centralize observability management with Datadog Governance Console Manage service tracing across hosts with Single Step Instrumentation rules Route OTel data from AI apps to ClickHouse and Datadog using Observability Pipelines Spotting CI/CD misconfigurations before the bots do: Securing GitHub Actions with Datadog IaC Security Offline evaluation for AI agents: Best practices Detect runtime threats in Python Lambda functions with Datadog AAP Introducing our open source AI-native SAST Instrument and monitor Boomi integration flows with OpenTelemetry and Datadog Not all index scans are equal: How we cut query latency by over 99% Platform engineering metrics: What to measure and what to ignore Integrate Recorded Future threat intelligence with Datadog Cloud SIEM CI/CD security: threat modeling using a MITRE-style threat matrix CI/CD security: How to secure your GitHub ecosystem Ingress NGINX is EOL: A practical guide for migrating to Kubernetes Gateway API How we built a real-world evaluation platform for autonomous SRE agents at scale Operating agentic AI with Amazon Bedrock AgentCore and Datadog LLM Observability: Lessons from NTT DATA Introducing the Datadog Code Security MCP Capture and analyze custom heatmaps in Session Replay Understand session replays faster with AI summaries and smart chapters Monitor ClickHouse query performance with Datadog Database Monitoring How we designed empathetic alert sounds for on-call engineers Search and act across Datadog to resolve issues faster with Bits Assistant Measure the business impact of every product change with Datadog Experiments Analyzing round trip query latency Configuring JavaScript caches for better performance Introducing Bits AI Dev Agent for Code Security Datadog achieves ISO 42001 certification for responsible AI Monitor Nutanix clusters, hosts, and VMs with Datadog Monitor Juniper Mist in Datadog A new Host Map for modern infrastructure When upserts don't update but still write: Debugging Postgres performance at scale Annotate traces to improve LLM quality with Datadog LLM Observability What's new in Cloud SIEM: AI-powered investigations, enhanced threat intelligence, and scalable security operations Explore Kubernetes with native OpenTelemetry data Monitor Oracle Fusion Cloud Applications with Datadog Announcing the Datadog Terraform provider v4.0.0 Scaling Kubernetes workloads on custom metrics How to design cloud environments for AI-powered threat analysis Monitor Aruba Central in Datadog How we centralize and remediate risks with Datadog Case Management Accelerate incident response with Datadog and ServiceNow Monitor your application and network load balancer logs Understanding Karpenter architecture for Kubernetes autoscaling Tools for collecting metrics and logs from Karpenter Monitor Karpenter with Datadog What your product data is actually saying Key metrics for monitoring Karpenter Securing Datadog's platform in the AI age: The role of observability data Closing the verification loop: Observability-driven harnesses for building with agents When an AI agent came knocking: Catching malicious contributions in Datadog’s open source repos Closing the verification loop, Part 2: Fully autonomous optimization Four ways engineering teams use the Datadog MCP Server to power AI agents Approaching your observability migration with the right mindset Meet the new Bits AI SRE: Deeper reasoning, twice as fast Designing MCP tools for agents: Lessons from building Datadog's MCP server Key learnings from the 2026 State of DevSecOps study Use plain English to query your multi-cloud infrastructure in Resource Catalog Simplifying troubleshooting across the user journey with Datadog Synthetic Monitoring Protect your OCI resources with Datadog Cloud Security This Month in Datadog - February 2026 Fine-tune Toto for turbocharged forecasts Amazon EC2 security: How misconfigured and public AMIs expand your cloud attack surface Enable end-to-end visibility into your Java apps with a single command Measure and improve mobile app startup performance with Datadog RUM Evaluating our AI Guard application to improve quality and control cost Identify untested code across every level of your codebase Make use of guardrail metrics and stop babysitting your releases Monitor Versa Networks SD-WAN performance in Datadog How we reduced the size of our Agent Go binaries by up to 77% Improve performance and reliability with APM Recommendations Remediate transitive vulnerabilities faster with Datadog Software Composition Analysis Generate audit-ready vulnerability and compliance reports with Datadog Sheets Monitor Fortinet FortiManager performance in Datadog Improve test coverage across codebases with Datadog Code Coverage
How microservice architectures have shaped the usage of database technologies
2025-12-22 · via Datadog | The Monitor blog
Bowen Chen

Bowen Chen

Scott Gerring

Scott Gerring

In the late 2000s, the big question in database design was SQL or NoSQL. While relational databases had long held their ground, document and key-value stores were emerging as serious alternatives. Many predicted a zero-sum, winner-take-all outcome. But when we look at how organizations are using database technologies today, no single tool or category has dominated the landscape. In fact, many organizations have adopted multiple databases to handle different sets of challenges, reflecting the industry’s broader shift towards microservice architectures.

The rise of microservices has quietly redefined the way companies use databases. Databases are no longer monolithic backbones. Instead, they’re fragmented across hundreds of services that share or maintain their own data storage. For monolithic architectures, the question of SQL versus NoSQL seemed more critical. A single shared database governed all application components, and selecting the database that best fit every feature or component could significantly improve cost-efficiency and performance. However, in microservice architectures, each team has the flexibility to adopt the solution that best addresses the needs unique to its service. While this makes the strategic selection of a single database irrelevant, it presents its own challenges that organizations are dealing with today.

With visibility into tens of thousands of customer stacks, we’re able to not only tell the story of this shift but also show you the data to back it up. For this blog, we analyzed Datadog APM service and integration data from over 2.5 million services in the past year. We’ll discuss how microservices have changed the way organizations use and think about databases and how these changes are reflected in their technology stacks.

To understand how microservices have influenced the adoption of different database technologies, we first need to understand monolithic architectures, their history, and their challenges.

Around the time that non-relational databases (NoSQL) began to gain popularity, monoliths were the dominant application architecture. These applications often shared a common database that acted as an integration point between monoliths and their components. The following diagram shows how this works:

Diagram showing an example of a monolithic database architecture.

Typically, the business user or the end user of your application sends queries to a load balancer, which distributes requests between your different application servers. All of these business applications query either dedicated or shared schemas within a single shared database. As a result, the shared database becomes an integration point for applications. Rather than relying on API calls to communicate, applications reference shared tables, and when data changes or new entries are added, triggers and stored procedures built into the database automatically perform follow-up actions. This model also guarantees strong data consistency. Transactions across tables either succeed or are rolled back to their pre-existing state, and all operations that threaten referential integrity are either rejected or handled via cascading deletes.

Having a single source of truth that is always correct and up-to-date is a strong motivation for the shared database model. However, allowing applications to rely on shared schemas also adds coupling complexity. Each new schema update becomes a coordinated effort across the engineering organization to ensure compatibility for all the dependent applications, and this can greatly slow down development velocity as your number of applications and teams grows.

The benefits of microservice architectures

Microservice architectures emerged as a technical solution to this organizational problem. By decreasing the size of the deployable artifact (the service itself) and reducing the number of engineers working on each component, organizations can continue scaling their engineering teams and create faster release cycles without the risk of breaking another team’s code in a shared codebase.

Diagram that shows an example of how microservices enable smaller teams and deployables.

As the industry has embraced microservice architectures, the decision of which database to use can now be made repeatedly, once for every service rather than once at the organizational level. Under this approach, services and their owners have the flexibility to choose data stores independent of the schemas and models of their organization as a whole.

Diagram that shows an example of how microservices enable independent data store selection.

While this offers teams more versatile development options, it doesn’t mean each team is always required to adopt their own database solutions. Organizations that have platform engineering teams often operate fully managed database platforms that offer golden paths to developers and help them adapt their use case to a standard schema within a shared database cluster.

Database adoption patterns in today’s landscape

We can see some of these trends reflected in our customer data. Organizations have begun to embrace the idea of using different databases for different tasks, with more than half of all customer organizations adopting three or more database technologies in their stack. What’s even more interesting is that a quarter of customers are now using five or more databases, nearly matching the share of organizations that still rely on a single database.

Share of organizations by number of database technologies used, %

The data paints a clearer picture when we group database adoption by company size. Of those that have adopted five or more database technologies, we see a mix of predominantly those that are medium and large in size. Many medium organizations that exist today were built from the ground up using microservices. On the other hand, large organizations often began with monolithic, on-premises architectures and have since migrated their workloads to the cloud. This places them on the far right of the spectrum in terms of technical maturity. To further optimize their workloads, they’ve adopted different databases for different application use cases.

This shift toward a wider range of database tools has enabled organizations to adopt a “right tool for the job” approach. Rather than using commercial relational databases like Microsoft SQL everywhere, they are starting to mix in other classes of databases. More customers now use open source SQL databases such as PostgreSQL and MySQL, which are often more cost-effective and flexible than traditional commercial SQL solutions. We also see strong adoption of NoSQL databases such as MongoDB, Couchbase, DynamoDB, and Aerospike.

Share of organizations using database technologies, %

Additionally, the use of SQL and NoSQL is no longer mutually exclusive. While it was once uncommon for an organization to use both SQL and NoSQL databases due to their differences, we now see this behavior from nearly half of our customers.

Workloads that require strong transactional consistency and structured data, such as inventory management or banking, are still best fit for SQL. Meanwhile, NoSQL is best suited for handling high-speed workloads and diverse datasets, such as real-time internet personalization and product catalogs.

Share of organizations using database technologies, %

A new set of challenges

Using multiple databases across one organization still comes with challenges. Many of the reasons this model was once considered unworkable—such as the lack of a single source of truth, differing schemas, and weaker service integration—did not just disappear as organizations shifted to microservices. In this section, we’ll discuss these obstacles and the tools we see customers use to address them.

More databases = more schemas = more service-level joins

By embracing microservices, organizations fragment one global schema into hundreds (or even thousands) of micro-schemas. Rather than acting as a shared artifact that both analysts and developers rely on, database schemas become a private implementation detail of the service they’re tied to. The main issue that arises from this shift is how to join data that spans across different services and schemas.

One solution we’ve seen address this problem is the usage of a data integration layer such as GraphQL. This approach fetches data from multiple data stores and APIs, combining their schemas under a unified GraphQL API schema. As services rely on GraphQL to aggregate data from an expanding pool of sources, much of the relational and join logic once handled inside the monolithic database is pulled upward into the data integration layer—specifically into how GraphQL resolvers are instrumented to fetch the requested data.

Our data shows that out of all customer services making GraphQL queries, 55% of these GraphQL executions contain more than 10 child resolve spans, with several services handling over 100 resolves per execution. For services that require large volumes of resolve operations, configuring the GraphQL server with optimizations such as batching or caching can significantly improve performance. For example, when querying data for 100 users, a naive GraphQL execution may trigger 100 separate downstream calls. Using a batching utility such as DataLoader combines these separate calls into a single backend query, reducing redundant I/O and keeping resolvers efficient even as the schema grows more intricate.

Share of services by ratio of graphql.resolve per graphql.execute, %

This added complexity within GraphQL executions also introduces a new structural cost related to query depth. GraphQL executes requests breadth-first, resolving each level of nested fields before moving to the next. While fields within a single level can run in parallel, deeper nesting increases the number of sequential steps required to complete a query. In our data, the median GraphQL execution (graphql.execute) takes around 200 ms, while individual field resolutions (graphql.resolve) complete in roughly 40 ms. The gap reflects how GraphQL efficiently parallelizes work within each layer but still incurs a cost for traversing additional layers of schema depth. Minimizing unnecessary nesting helps keep the integration layer responsive without overcomplicating the mapping logic that has replaced traditional database joins.

Duration of graphql.execute vs graphql.resolve, milliseconds

Analytics is now more difficult

The shared database model allows you to treat analytics as a separate role that accesses the database cluster. Storing all of your data in one place simplifies a lot of analytics problems. Under microservice architectures, however, organizations often use multiple databases simultaneously, each with its own schema and format. To perform analytics, data from these distributed systems must be transformed into a consistent schema and consolidated in a central data store—typically a data lake, a data warehouse, or a hybrid solution. This kind of database system is commonly referred to as Online Analytical Processing (OLAP), and it’s dedicated to handling analytical queries. This separates it from transactional databases built to be highly concurrent, Online Transaction Processing (OLTP) systems, which are better at handling day-to-day business operations.

To establish OLAP systems, organizations have adopted cloud data platforms such as Snowflake and Redshift to serve as large-scale data warehouses. By looking at a shortlist of nine data platforms1 Datadog integrates with, we see 44% of organizations using at least one of these analytics solutions. Unlike the shared singular databases used in monolithic architectures, these systems are not meant to power live applications. Instead, they provide cost-effective storage and highly parallelized, read-only performance for analytics workloads.

Service integration just got harder

Another byproduct of microservice architecture is increased network complexity. Microservices need a way to communicate with each other, but they can no longer do so within a singular application boundary using a shared database or in-memory method calls. Part of the solution has been an increase in the number of synchronous calls between services—for instance, using HTTP, REST, or gRPC—but relying on synchronous communication also creates fragile systems. If services are synchronously coupled, when one service fails to respond, it can create a cascading reaction that affects all upstream services.

To address this issue, we see nearly 70% of customers adopting message queues. These queues enable services to decouple from one another and communicate asynchronously. Services can publish events without waiting for a response from downstream consumers. This greatly increases system resiliency. During traffic spikes, messages are held in a queue and consumed once downstream services recover. When failures occur, this prevents them from cascading to other services, allowing responders to focus their recovery efforts.

Among message queue technologies, RabbitMQ, Kafka, and AWS SQS show the highest adoption. This indicates that customers are still largely using message queues to decouple their microservices using AMQP protocol but are also moving toward event-driven architectures that use data streaming platforms to perform analytics in data warehouses.

Share of organizations using message queues, %

Gain end-to-end visibility across all of your database technologies

Microservices changed the way we build software, and as a result, organizations have had to change how they use databases as well as which databases to use. Based on our data, we see customers today move past the SQL versus NoSQL debate to run both categories of databases side by side. As organizations now adopt more database technologies to handle different use cases, the fragmentation of schemas and data has encouraged the growth of more efficient data integration layers and event-driven architectures.

Datadog Database Monitoring helps you identify issues and optimization opportunities in your databases including PostgreSQL, MySQL, SQL Server, Oracle, and MongoDB. Datadog Data Streams Monitoring helps achieve end-to-end visibility across streaming technologies, integrating with Kafka, RabbitMQ, and more. If you’d like to read more research-based content featuring insights based on our customer stacks, check out our research reports.

If you don’t already have a Datadog account, sign up for a free 14-day trial today.

Footnotes

  1. Amazon Redshift, product analytics Redshift, Snowflake, Snowflake web, Databricks, Google BigQuery, Azure Data Lake Storage, Vertica, ClickHouse.