惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

WordPress大学
WordPress大学
T
Threat Research - Cisco Blogs
D
DataBreaches.Net
Microsoft Azure Blog
Microsoft Azure Blog
D
Docker
P
Proofpoint News Feed
小众软件
小众软件
博客园 - 聂微东
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
人人都是产品经理
人人都是产品经理
J
Java Code Geeks
Martin Fowler
Martin Fowler
L
LangChain Blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
李成银的技术随笔
MongoDB | Blog
MongoDB | Blog
M
MIT News - Artificial intelligence
阮一峰的网络日志
阮一峰的网络日志
Hacker News: Ask HN
Hacker News: Ask HN
C
CERT Recently Published Vulnerability Notes
H
Help Net Security
The GitHub Blog
The GitHub Blog
S
Security Archives - TechRepublic
AWS News Blog
AWS News Blog
Project Zero
Project Zero
Security Latest
Security Latest
P
Privacy International News Feed
T
Troy Hunt's Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
Intezer
酷 壳 – CoolShell
酷 壳 – CoolShell
The Hacker News
The Hacker News
I
InfoQ
P
Proofpoint News Feed
C
Cisco Blogs
aimingoo的专栏
aimingoo的专栏
T
ThreatConnect
Recorded Future
Recorded Future
P
Palo Alto Networks Blog
Hacker News - Newest:
Hacker News - Newest: "LLM"
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
V
V2EX
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
G
GRAHAM CLULEY
F
Future of Privacy Forum
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
N
News and Events Feed by Topic
Engineering at Meta
Engineering at Meta

DEV Community

Spot instances as GitHub Actions runners Agents Need Receipts, Not Just Better Prompts When AI Reads Blueprints: The Hidden Attack Surface of Multimodal Engineering Intelligence Simplicity scales — complexity kills side projects AI does exactly what you ask — that's the problem How a model upgrade silently broke our extraction prompt (and how we caught it) The Best Form Backend for Static Sites in 2026 # ⛽ I Built a Cross-Platform Fuel Finder with React & Supabase: The Indie Dev Journey The 11 Major Cloud Service Providers in 2025 Membangun Karya Visual: Mengintip Fasilitas Multimedia dan Studio Kreatif Amikom Visualizing Database Design: From Interactive Canvas to Drizzle, Prisma, and SQL in Real-time A tool to make your GitHub README impossible to ignore 🚀 Zero-Downtime Blue-Green and IP-Based Canary Deployments on ECS Fargate I reproduced a Claude Code RCE. The bug pattern is everywhere. We Replaced Our RAG Pipeline With Persistent KV Cache. Here's What We Found. Jenkins CI/CD Pipeline for a Dockerized Node.js Application: Manual Trigger vs Automatic Trigger Using GitHub Webhooks How to Stream Live Forex Rates to Google Sheets API: A Complete Guide Small Models Will Beat Giant Models (And Most People Haven’t Realized Why Yet) How I Built 5 Linux Automation Scripts on AWS EC2 I built TokenPatch to measure AI coding cost per applied patch I built a Chrome extension to stop squinting at the web Producer audit clean, six tests red Conversa — A Multi-Agent AI Platform Powered by Gemma 4 Build a Real Agent in 15 Minutes with Gemini's New Managed Agents API What I Actually Build: AI Systems That Ship, Not Demos That Impress The Box Ticked While You Read This: LinkedIn, AI Training, and the Switch You Did Not Flip Investasi Masa Depan: Mengintip Fasilitas Laboratorium Komputer Kelas Dunia di Yogyakarta I Cancelled My $20 Claude Cowork Plan After a Week With OpenWork Stop Reviewing Every Line of AI Code - Build the Trust Stack Instead How To Build an Image Cropper in Browser (Simple Steps) I built a macOS disk cleaner for developers and just launched it would love feedback Membangun Kompetensi dan Relasi: Mengapa Ekosistem Kampus Itu Penting I Built an AI That Decides Which AI to Talk To — Running 24/7 From My Living Room Codex Team Usage SOP How to Actually Become a Programmer: The Hard Part Nobody Wants to Explain Building a Production-Style Multi-Tool AI Agent with Python, Flask, React & Gemini AI The Caretaker Sandbox: An Offline-First Visual Playground & Template Engine powered by Gemma 4 # Building Instagram OSINT Projects with HikerAPI Your AI can read. Gemma 4 can see The Battle of the Senior Dev: Why AI Gives You Wings But Only If You're Ready to Pilot HiDream Raw Output Failed Tried Dev-2604 VRAM Math Killed It Won with a Prompt Enhancer Instead I Finally Finished a Project I Abandoned — And GitHub Copilot Helped Me Ship It SafeSMS: On-Device Threat Detection with Gemma 4 E4B, no internet required I Built OpenKap — A Loom Alternative for Small Teams Who Just Want to Ship Gemma 4 is Here: The Dawn of Local Multimodal Reasoning Offline-First Flutter: How We Built a CRM That Manages 100K+ Leads With No Internet Memory for Agents: When Vectors Meet Graphs, Bugs Drop 4 The Rise of Production-Grade AI Infrastructure I ran my idea-validation product through its own validator. The verdict was PIVOT. We Built an Agent Commerce API. Google I/O 2026 Changed Our 3-Month Roadmap in 24 Hours. "My Partner's Memory Was Full. I Didn't Know — Until We Tried to Talk." I’m a Front End Web Developer Learning Machine Learning From Scratch Laravel Waiting Request I Built a Chrome Extension to Track How Long You Actually Spend on Each Tab Why Google Can't See Your React Breadcrumbs (And the 4-Line Fix) AI Travel Assistant Powered by Gemma 4; With Streaming, Image Input, and Visual Recommendation Cards Microsoft tried to kill the printer driver. Healthcare said no. The Blueprint Beneath the Blueprint: Designing Data Model and Choosing Its Database REST APIs vs Webhooks in Telecom Billing - Which One Actually Makes Sense? Accounting Made Simple: AI-Powered Financial Insights of Japanese Companies with Gemma 4 The append-only AST trick that makes Flutter AI chat actually smooth Designing the Future of Payments — Why XML Still Matters in the Age of APIs From Legacy to Live — Reviving XMLPayments with GitHub Copilot Two Weeks Into Learning Solana XMLPayments — The Hidden Backbone of Modern Financial Orchestration AI Agents in Practice — Read from the beginning Reviving My Gemma Agentic Framework: From Prototype to Polished Repo Smart Contracts Demand Better Infrastructure: Building on contract.dev Self-Hosted LLM Tool Calling: Forge and the Build-vs-Buy Decision ORA-00072 오류 원인과 해결 방법 완벽 가이드 OpenWA for CTOs: Self-Hosted WhatsApp Gateway Trade-Offs NotebookLM Automation With notebooklm-py: Useful, But Classify Data First Docker v29.5.x Operator Upgrade Checklist Coding-Agent Instruction Design: The CLAUDE.md File That Prevents Rework When I Finally Realized My Runtime Was Holding Me Back GnokeOps: Host Your Own AI House Party The Death of Static Rate Limiters: Why Your Java Virtual Threads Need BBR-Style Adaptive Concurrency AI Agents in Practice — Part 2: What Makes Something an Agent Stop scattering LLM SDK/API calls across your codebase. Here is the 2-file rule that fixed mine Beyond Prompts: Structuring AI Workflows for Real Frontend Engineering From an Abandoned Hackathon Project to an AI Study Workspace 🚀 Terraform with AI: Build AWS Infra (Cursor + MCP) What If AI Didn’t Need the Internet? 750,000 Chips, 140 Trillion Tokens: The Math Behind DeepSeek's Permanent Price Cut You're Renting Someone Else's Compute — And It's Costing You More Than You Think CSS :has() Selector: The Layout Trick I Wish I Knew 5 Years Ago Five Clusters. Five Lessons. One Production System. Synaptic: A Local-First AI Dev Companion That Remembers How You Think Revolutionizing Edge MedTech: Building a Sovereign Sleep Apnea Companion ("XiHan Snore Coach") with Gemma 4 HDD Eksternal Tiba-Tiba Tidak Bisa Diakses di Windows? Ini Tiga Lapis Fix-nya DMARC p=none vs p=quarantine vs p=reject: what to use and when DSA Application in Real Life: How Git Diff Works: LCS Intuition, Myers Algorithm, and Real Code Changes I solo-built a reputation layer for AI agents on NEAR — and here's what I learned I built an AI faceless video generator in 2 months — here's the stack Diffusion Language Models: How NVIDIA Nemotron-Labs Diffusion Shatters the Autoregressive Speed Ceiling llm-nano-vm v0.8.0 — deterministic FSM runtime for LLM pipelines, now with output validation and per-step timeouts From the Renaissance to the Quantum Dawn: AI, Computation, and the Next Paradigm Shift How I Built a Review Site with 800+ Articles Using AI I Built a Smart Kitchen AI with Gemma 4 That Turns Fridge Photos Into Recipes Why your vulnerability dashboard is lying to you (and how to fix it)
What Is IOPS?
Khushi Dubey · 2026-05-23 · via DEV Community

If your application ever feels slow for no obvious reason, storage is often the quiet culprit. The CPU looks fine. Memory looks fine. Yet requests crawl. Nine times out of ten, the bottleneck turns out to be IOPS.

IOPS is one of those terms that gets thrown around in cloud and infrastructure conversations, usually without a clear definition. People mix it up with speed, with bandwidth, with throughput. Getting it right matters because IOPS affects both how fast your systems run and how much you pay for storage.

This guide explains what IOPS actually is, how it is measured, how it differs from throughput and latency, and how it plays out on cloud platforms like AWS and Azure. By the end, you will know how to size storage for your workload without overpaying for performance you never use.

What Is IOPS?

IOPS stands for Input/Output Operations Per Second. It is a measure of how many read and write operations a storage device or volume can complete in one second.

In plain terms, IOPS tells you how busy your storage can get. Every time an application reads a file, writes a log line, or updates a database row, that counts as an input/output operation. IOPS simply counts how many of those operations a disk or volume can handle each second.

A higher IOPS number means the storage can serve more simultaneous requests. A traditional hard drive might manage a couple of hundred IOPS. A modern NVMe solid-state drive can deliver hundreds of thousands. That huge gap is exactly why IOPS matters so much for databases, virtual machines, and any latency-sensitive workload.

Here is the short answer if you only need one line. IOPS is the speed limit for how many small read and write requests your storage can process per second, and it is one of the three numbers that decide whether your storage feels fast or painfully slow.

What Affects Your IOPS?

IOPS is not a single fixed number stamped on a disk. The same volume can deliver very different IOPS depending on how it is used. Several factors shape the result:

I/O size, also called block size. Smaller operations, such as 4 KB, allow more IOPS. Larger operations move more data per request but lower the count.

Random vs sequential access. Random reads and writes scattered across the disk are harder to serve than sequential ones, so they usually produce lower IOPS.

Read vs write mix. Many systems handle reads and writes at different speeds, so the ratio between them changes the effective number.

Queue depth. This is how many requests are in flight at once. Higher concurrency can raise IOPS, up to the limits of the hardware.

Storage media. Spinning disks, SATA SSDs, and NVMe drives sit in completely different performance classes.

There is also a simple relationship worth memorizing. Throughput equals IOPS multiplied by I/O size. So a workload running 3,000 IOPS at a 4 KB block size moves roughly 12 MB per second. This is why you cannot talk about IOPS sensibly without also knowing the block size behind it.

IOPS vs Throughput vs Latency

IOPS rarely travels alone. Storage performance is really a story told by three metrics together, and confusing them is the most common mistake people make.

IOPS
What It Measures: Number of read/write operations per second
Unit: Operations per second
Simple Analogy: How many cars pass per minute

Throughput
What It Measures: Volume of data moved per second
Unit: MB/s or GB/s
Simple Analogy: How wide the highway is

Latency
What It Measures: Delay to complete a single operation
Unit: Milliseconds or microseconds
Simple Analogy: How long each car waits at the toll

Here is how to think about it. IOPS counts the operations. Throughput measures the data those operations carry. Latency tells you how quickly each one finishes. A database needs high IOPS and low latency. A video streaming or backup workload cares far more about throughput. Match the metric to the job and the storage decision becomes much easier.

IOPS by Storage Type

Different storage media live in different performance worlds. The numbers below are general ranges, not exact specs, but they show the scale of the differences.

HDD (spinning disk)
Typical IOPS Range: 55 to 180 IOPS
Best For: Archives, backups, cold and bulk data

SATA SSD
Typical IOPS Range: 7,500 to 20,000 IOPS
Best For: General-purpose servers and apps

Enterprise SAS SSD
Typical IOPS Range: Tens of thousands of IOPS
Best For: Busy databases and virtualized hosts

NVMe SSD
Typical IOPS Range: Hundreds of thousands to 1M+ IOPS
Best For: High-performance databases and analytics

How IOPS Works in the Cloud

In the cloud, you do not buy physical disks. You choose a volume type, and that choice sets your IOPS ceiling. This is where IOPS stops being a hardware spec and becomes a budgeting decision.

IOPS on AWS

Amazon Elastic Block Store, or EBS, is the most common example. According to the official AWS EBS documentation, each volume type offers a different IOPS profile:

gp3 (General Purpose SSD)
Max IOPS per Volume: Up to 80,000
Best For: Most workloads, boot volumes, mid-size databases

io2 Block Express (Provisioned IOPS)
Max IOPS per Volume: Up to 256,000
Best For: Mission-critical, I/O-intensive databases

st1 (Throughput Optimized HDD)
Max IOPS per Volume: Lower IOPS, high throughput
Best For: Big data, logs, streaming workloads

sc1 (Cold HDD)
Max IOPS per Volume: Lowest IOPS
Best For: Infrequently accessed, cost-sensitive data

A useful detail: every gp3 volume includes a baseline of 3,000 IOPS and 125 MB/s of throughput at no extra cost, and you only pay more when you provision above that. At the top end, io2 Block Express is built for sub-millisecond latency and 99.999 percent durability, which is why it shows up under demanding databases like SAP HANA and Oracle.

IOPS on Azure

Microsoft Azure follows the same idea with its managed disks. As covered in the Azure managed disk documentation, tiers like Premium SSD v2 and Ultra Disk let you set IOPS independently of disk size, scaling well into the hundreds of thousands of IOPS for the most demanding workloads.

One catch that trips up many teams: your virtual machine or instance has its own IOPS limit, separate from the disk. You can attach a very fast volume and still be capped by the instance. Always check both numbers.

How to Calculate the IOPS You Actually Need

Guessing your IOPS requirement is how budgets get wasted. A quick, structured estimate is far better. Here is a simple approach.

Measure your current workload. Use monitoring tools to capture real read and write operations per second during normal and peak hours.

Separate reads from writes. Note the ratio, because some systems and RAID setups treat writes more expensively than reads.

Find your true peak, not the average. Storage must survive the busy moments, so size against a realistic peak rather than a calm daily mean.

Add a sensible buffer. A headroom of 20 to 30 percent absorbs growth and spikes without forcing constant re-tuning.

Match a volume type to the result. Pick the cheapest volume tier that comfortably covers your peak plus buffer, and no more.

This five-step habit replaces the two failure modes most teams fall into: provisioning for an imagined worst case, or under-provisioning and discovering it during an outage.

IOPS and Cloud Cost: Where the Money Leaks

Here is the part most performance guides skip. In the cloud, IOPS is not free, and provisioned IOPS is one of the easiest line items to overspend on.

On AWS gp3, IOPS above the free 3,000 baseline carries an additional per-IOPS monthly charge, and extra throughput is billed separately too. Provisioned IOPS volumes like io2 add an even higher per-IOPS cost. None of this is expensive on its own. The problem is scale. A few hundred over-provisioned volumes quietly turn into a serious monthly number.

In our experience, over-provisioned IOPS is one of the most common storage cost leaks, and it usually hides because the volume still works fine. Nothing breaks, so nobody looks. Treating storage performance as part of your wider cloud cost optimization effort, rather than a pure engineering setting, is what surfaces this kind of waste.

EXPERT INSIGHT

A pattern we see often: a team provisions io2 with high IOPS for a database launch, traffic never reaches the forecast, and the volume runs for months at a fraction of its provisioned performance. The fix is rarely dramatic. It is usually a switch to gp3, or simply dialing the provisioned IOPS down to match real demand. The savings are real, and the application does not notice the change at all.

Common IOPS Mistakes to Avoid

Most IOPS problems are not exotic. They come from the same handful of mistakes, on the performance side and the cost side alike.

Confusing IOPS with throughput. Provisioning high IOPS for a workload that actually needs throughput, or the reverse, wastes money and still feels slow.

Sizing for an imagined peak. Provisioning for a worst case that never arrives is the single biggest source of storage overspend.

Ignoring instance-level limits. Attaching a fast volume to an instance that caps IOPS lower than the volume. This is one of several common cloud cost mistakes that quietly inflate an AWS bill while performance still looks acceptable.

Relying on burst credits in production. Older burst-based volumes can fall off a performance cliff once credits run out, causing sudden, confusing slowdowns.

Never monitoring actual usage. If you do not track real read and write operations, you cannot tell whether you are over-provisioned or under-provisioned.

How to Optimize IOPS and Spend

Good IOPS management is a balance. You want enough performance for the busy moments and not a dollar more. A few practical habits get you there.

Monitor before you provision. Base every IOPS decision on measured data, not on a guess or a vendor default.

Right-size regularly. Workloads change. Review volume performance on a schedule and adjust provisioned IOPS down when demand drops.

Prefer modern volume types. On AWS, gp3 lets you tune IOPS and throughput independently and usually beats older types on price for performance.

Match the volume to the workload. Use HDD-backed storage for throughput-heavy or cold data and save SSD IOPS for transactional work.

Treat performance and cost as one decision. Smart storage tuning lowers spend and improves reliability at the same time, an idea explored well in this guide on turning performance into real cloud savings.

None of these steps are difficult. They simply require treating storage as something you measure and revisit, not something you set once and forget.

How Opslyft Helps Businesses Manage Storage and IOPS Costs

Understanding IOPS is the first step. Keeping storage performance and storage spend in balance, across hundreds of volumes, is the harder ongoing job. That is where Opslyft helps.

Opslyft is a FinOps platform that brings visibility and accountability to cloud spend across AWS, Azure, GCP, and Kubernetes, including the storage layer where IOPS costs live. Instead of finding over-provisioned volumes by accident, teams see them clearly.

In practice, Opslyft supports storage and IOPS cost management in a few concrete ways:

Integration that connects to your cloud accounts and surfaces storage and provisioned-IOPS spend alongside the rest of your bill.

Visibility and allocation that attributes storage cost to the right team, environment, or product so no volume is orphaned.

Optimization that flags over-provisioned IOPS, idle volumes, and storage that belongs on a cheaper tier.

Anomaly detection that catches sudden storage cost spikes before they become an invoice surprise.

Consulting and support for right-sizing, governance, and building a sustainable FinOps practice around infrastructure spend.

The goal is simple. It turns storage performance from a setting nobody revisits into a cost you actively manage.

Conclusion

IOPS is one of the most important storage metrics, yet one of the most misunderstood. It measures how many operations your storage can handle, and it works hand in hand with throughput and latency to decide whether your systems feel fast.

Size IOPS to your real workload, watch how it differs from throughput, and review it regularly. Get that right and you gain something rare in the cloud: strong performance and a storage bill you can actually predict.