惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

DEV Community

Spec-Driven Development Without an IDE: I Generated NestJS, Go, Spring Boot, Laravel, and Rust Apps From a Single PRD File Components are states Edge SEO y Middleware: Cómo Interceptar a Googlebot y LLMs antes de llegar a tu Servidor Context window exceeded at turn 23. Here's how I track token usage without a tokenizer. My Hermes agent spent $3 before I noticed. Now it can't. My Hermes agent's stop condition was a 40-line if/elif chain. I replaced it with 3 lines. My agent kept hitting context limits. This one function fixed it. Your Hermes agent's audit log is leaking customer emails. Here's a 100-line lib that fixes that. Create and configure Azure Firewall My agent kept forgetting what it was doing. A scratchpad fixed it. I replaced 200 lines of ad-hoc state management in my Hermes agent with one object. Per-Key Rate Limiting for Agent Tool Calls: Stop One User From Breaking Everything Composable Output Guardrails: Filter Agent Responses Before They Reach Users Sanitize Your LLM Message Lists Before Every API Call Thread a Run ID Through Every Agent Call So You Can Debug Anything Normalize Provider Error JSON So Your Agent Can Actually Handle Failures Priority Queue for Agent Sub-Tasks: Stop Processing Low-Priority Work First Static Lint Rules for Your LLM Prompts (Before They Hit Production) tool-call-budgets: Stop Runaway Agent Loops Before They Hit Your Invoice Step Through Your Agent's Failures Like a Debugger The Simplest Stop Condition: A Hard Cap on Agent Loop Iterations Score Your Agent's Responses With a 0.0-1.0 Rubric (No LLM Judge Required) Fix Bad Structured Output by Feeding the Error Back to the Model Building an effective Storyblok Tool Plugin with SvelteKit How to Get Your Renault / Dacia Radio Code for Free RAG 시스템 실전 구축 (v39) Retraction — scrml’s Living Compiler I built a fitness app where the AI roasts you for eating pizza (and hypes you when you PR) The Top SaaS Founder Communities on Discord (Beyond the AI Hype) I Built a Production-Grade Async Job Queue from Scratch — Here's Everything That Actually Happened How to watch SMS from multiple Android phones in one iOS app We Didn’t Want Another AI Wrapper — So We Explored a High-Speed Hermes Orchestrator for Engineering Crews Multi-tenant além do TenantId: problemas reais e aprendizados em sistemas .NET After failing 23 times, I am sharing How I Actually Prepare for a Tech Interview Every Single Time Now. I built an app that works like a nutritionist for your brain. Here's what happened in 7 days. GoBadge Dynamic: From Module Stats to Universal Badges LangGraph 워크플로우 템플릿 (v39) The git Commands You Forgot Exist (And Why AI Workflows Make Them Relevant Again) Six Levels of MCP Servers One container to replace Grafana + Loki + Tempo + Prometheus The Request/Response Cycle, HTTP, Auth, JWT, OAuth & Sessions — Explained Properly Python Week 3: We Stopped Repeating Ourselves (Loops!) Creating a Custom Grid Editor tool in Unreal Engine 我做了个付费 Telegram bot。Telegram Stars 实际给开发者多少钱,我算了一笔账。 I Got 96% Recall on LLM Hallucination Detection With No ML Model – Just 50 Lines of Python A practitioner's guide to getting more value out of AI coding: agent quality & token optimization How to Handle Telegram Albums in Telegraf I Built a Multilingual Spam Detection Dataset with 149K+ Messages Across 23 Languages How to Handle Telegram Albums in grammY RAG 시스템 실전 구축 (v38) Beyond Pip Install: Why Your AI Agent Needs a "Hermetic" Life-Support System to Survive Resume Building using HTML & CSS SpecFlow: Multi-Agent SDD in Cursor (4 phases, /approve, single code writer) Running ASR for smart homes in the NPU of Intel processors "Building a CI/CD Pipeline From Scratch: A Practical Guide for Developers (with GitHub Actions)" SpecFlow: SDD multi-agente en Cursor (4 fases, /approve, un solo escritor de código) How to Extract Your Full Team Hierarchy from HubSpot (the API doesn't expose it) Adobe Commerce Cloud now costs $40k/year. We migrated from Adobe Commerce to Magento Open Source — here's the honest breakdown .klickd v4.0.0 — Portable AI memory with constraints, strict schemas, and test vectors We Trust Third Party Code, It’s Time to Trust AI Generated Code LangGraph 워크플로우 템플릿 (v38) Sustainable AI Starts with Efficient AI Find Remove duplicated files in Google Drive How to Detect GPU Waste in a Kubernetes Cluster The Privacy Bug in My First Chrome Extension (And How to Avoid It) Serverless Mental Models: What They Don't Tell You Before You Build Preventing GPT hallucination in automated content pipelines: how I structure Make.com flows with data injection Hmm, where were we? AI Visibility Tools, Math Proofs, and Stripped Guardrails Shape Developer Landscape How AI and Electronics Are Changing Healthcare Devices: The Future of Smart Healthcare Author: Shivam Wakade | Founder, PrivSR Making Claude Sound Like Optimus Prime Understanding Reinforcement Learning with Human Feedback Part 5: Training the Reward Model with Loss Functions Learning Progress Pt.20 How Secure LoRa Communication Devices Work: Building the Future of Private and Long-Range Connectivity Author: Shivam Wakade | Founder, PrivSR How I Rebuilt an RPG Map Editor with Rust, React, and WASM Building a System That Automates YouTube Post-Production Building a 100% Serverless Digital Asset Packager in the Browser Game Recommended AI What is Human-In-The-Loop (HITL)? Deep Dive: React Server Components in TanStack Start Migrating off Google Analytics: Umami vs Plausible vs Fathom Building a Portfolio That Actually Demonstrates Software Engineering Async/Await in JavaScript: From Callbacks to Clean Code (2026) Benchmarking LLM Structured Outputs Angular 21 Multiselect Dropdown: A Migration-Friendly Component with Live Functional Tests ShareBox v5 — GPU transcoding, Netflix-style grid, and why I don't need Plex anymore TOML Schema is live Handling Duplicate Shopify Webhook Events (And Why You Must) Original Kubernetes Dashboard — retired upstream, upgraded to Angular 21. لماذا أسست ترينافو للتجار العرب الذين تتجاهلهم المنصات الغربية Construyendo un recomendador de películas en Python: de los datos al modelo When APIs Lie: A Lesson in Defensive Debugging Pope Leo XIV's AI Encyclical: What Builders Must Know (2026) Donna v0.3.0 HTB — MonitorsFour | Writeup The Free Tool You Trust Is the One You Should Fear the Most HTB — MonitorsFour | Writeup Fr 97. Embeddings and Vector Search: Semantic Search That Works Deep Dive: Building "Gravity Paint" - A Tactile Physics Instrument with React, Matter.js, and p5.js ABAP Unit Testing with Test Doubles and Mocking Frameworks: A Senior Architects Guide to Isolating Dependencies in SAP S/4HANA
The Linux Commands You Forgot Exist (And Why AI Workflows Make Them Relevant Again)
Don Johnson · 2026-05-26 · via DEV Community

These weren't in your bootcamp. They're not in most tutorials. They've been quietly available on every Linux box since before "AI workflow" was a phrase — and they're more useful now than they've ever been.

Try it yourself: clone linux-archaeology-lab, run bash setup.sh, and every command in this article has a working exercise waiting for you.


watch — monitor anything without a single line of code

watch runs a command on a repeating interval and fills your terminal with the refreshing output. That's it. No loop, no sleep, no script.

Why it's back: AI inference runs take time. watch -n1 nvidia-smi is the fastest way to see GPU memory climb and fall without touching the model process at all. watch -n2 'ls outputs/ | wc -l' tells you how far a batch job has gotten. One flag, zero instrumentation.


tee — two destinations, one stream

tee reads stdin and writes it to both stdout and a file simultaneously. Not sequentially — simultaneously, as data flows.

The pattern that comes up constantly in AI work:

agent-command 2>&1 | ts '[%H:%M:%S]' | tee run-$(date +%Y%m%d-%H%M%S).log

Enter fullscreen mode Exit fullscreen mode

You see it live. It's in a timestamped log. Stderr is captured. One pipeline, three things handled.


pv — a progress bar for any pipeline

pv is a transparent pipe segment. Data passes through it unchanged; it prints throughput, elapsed time, and a progress bar to stderr.

You don't modify the commands on either side. You just insert pv into the middle:

cat data.jsonl | pv | python3 process.py

Enter fullscreen mode Exit fullscreen mode

A blinking cursor becomes a progress bar with an ETA. For long inference batches — thousands of rows, slow API calls, large embeddings — pv turns a black box into something you can actually reason about.


ts — timestamp every line of output

ts prepends a timestamp to every line it receives on stdin. Nothing else.

The power is in the relative mode:

agent-command | ts -s '%.s'

Enter fullscreen mode Exit fullscreen mode

Each line is prefixed with the time since the previous line — so you can see exactly where an agent spent 4 seconds between steps. No profiler. No code changes.

ts is from moreutils. Install once: sudo apt install moreutils.


sponge — safe in-place pipeline transforms

This command exists to solve one specific problem, and it solves it perfectly.

The shell opens output files for writing before the pipeline starts — which truncates the file before it's been read. sponge soaks up all of stdin into memory first, then writes when it gets EOF. The file is safe.

sort file.txt | sponge file.txt        # safe
python3 -m json.tool cfg.json | sponge cfg.json   # safe
grep -v DEBUG app.log | sponge app.log            # safe

Enter fullscreen mode Exit fullscreen mode

Also from moreutils.


column — readable tables without Python

column formats delimited input into aligned columns. One flag for the delimiter, one flag for table mode.

Before:

model   provider    params  context_k
llama-3.1-8b    Meta    8B  128
mistral-7b  Mistral AI  7B  32

Enter fullscreen mode Exit fullscreen mode

After column -t -s $'\t':

model          provider      params  context_k
llama-3.1-8b   Meta          8B      128
mistral-7b     Mistral AI    7B      32

Enter fullscreen mode Exit fullscreen mode

For any command that emits structured text — tool call logs, benchmark results, model comparisons — column makes it scannable in one pipeline stage. No pandas. No formatting code.


comm — surgical set operations on text files

comm compares two sorted files and gives you three columns: lines only in file A, lines only in file B, lines in both. Suppress any column you don't need.

The comm -12 (intersection) and comm -23 (A minus B) patterns are the correct answer to "what's consistent across these two model runs?" and "what did run B drop that run A had?" — in one command, no Python, no diff | grep.

Process substitution makes it flexible:

comm -23 <(sort run-a.txt) <(sort run-b.txt)

Enter fullscreen mode Exit fullscreen mode


tac — read any file from the bottom

tac is cat spelled backwards. It reverses line order.

The killer use case:

tac agent.log | grep -m1 'ERROR'

Enter fullscreen mode Exit fullscreen mode

Find the most recent error in a log without reading the whole file. -m1 stops at the first match — which, in a reversed file, is the last occurrence. No tail, no awk, no Python.

Pair with head for newest-N-lines: tac logfile | head -20.


vidir — batch rename in your text editor

vidir opens a directory listing in $EDITOR. You rename files by editing text. You delete files by deleting lines.

1   outputs/output-1.txt
2   outputs/output-2.txt
3   outputs/output-3.txt

Enter fullscreen mode Exit fullscreen mode

Run :%s/output-/summary-/g, save, quit. All three files renamed. Your editor's full power — regex, macros, multicursor — applied to filesystem operations.

Replaces rename 's/pattern/replacement/' * (Perl regex you have to look up) and for f in *; do mv ...; done (quoting hell).

Also from moreutils.


parallel — concurrent tasks without threading code

GNU parallel is xargs -P with readable syntax, job control, retries, and output you can actually parse.

The batched inference pattern:

cat prompts.jsonl | parallel -j4 --pipe --block 10k inference-tool

Enter fullscreen mode Exit fullscreen mode

Four workers, each receiving a 10K block of JSONL. No threading code. No async boilerplate. Output is ordered and labeled with --tag. Failed jobs retry with --retries 3.

For AI workloads — running the same prompt against multiple models, calling an embedding API for each document in a dataset, processing output files — parallel turns a sequential loop into concurrent execution in one command.


Load the reasoning skill into Claude Code

Knowing the commands is one thing. Knowing which one to reach for is another.

The lab repo ships .claude/skills/linux-archaeology.md — a Claude Code skill that maps natural-language descriptions to the right command. Describe your problem and it reasons through the answer:

"I need a progress bar for this pipeline"pv
"How do I timestamp my agent logs?"ts

"I want to rename a batch of files without writing a script"vidir

Install in any project:

mkdir -p .claude/skills
curl -sL https://raw.githubusercontent.com/copyleftdev/linux-archaeology-lab/main/.claude/skills/linux-archaeology.md \
  > .claude/skills/linux-archaeology.md

Enter fullscreen mode Exit fullscreen mode


The thread

watch, tee, pv, ts, sponge, column, comm, tac, vidir, parallel — none of these are new. They were built for the terminal long before AI workflows existed. But AI workflows surfaced the exact problems they solve: long-running processes with no visibility, streams that need to go two places, logs that need timestamps, files that need in-place transforms, tasks that need to run in parallel.

The tools were there. The problems caught up.

Run every command in this article against real data:
linux-archaeology-lab — clone it, bash setup.sh, open exercises/.

Which one did you not know about? Drop it in the comments.


Tags: linux productivity devtools ai bash


Sister article: The git Commands You Forgot Exist