惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

博客园 - 司徒正美
aimingoo的专栏
aimingoo的专栏
MongoDB | Blog
MongoDB | Blog
云风的 BLOG
云风的 BLOG
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
酷 壳 – CoolShell
酷 壳 – CoolShell
博客园 - 聂微东
Y
Y Combinator Blog
T
Tailwind CSS Blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
S
SegmentFault 最新的问题
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
博客园 - 【当耐特】
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
J
Java Code Geeks
美团技术团队
Google DeepMind News
Google DeepMind News
博客园_首页
Apple Machine Learning Research
Apple Machine Learning Research
T
The Blog of Author Tim Ferriss

DEV Community

🤫 Firebase Is Quietly Preparing for an Offline-First AI Future Should Angular Apps Still Rely on RxJS in 2025? AI Workflow Automation Needs More Than Another Script Reviving Cineverse: From Local Storage to Firebase 🚀 Approaches to Streaming Data into Apache Iceberg Tables How to Add Rounded Corners to an Image Online The subtle impact of AI (&amp; IT) on jobs Made a Rust based AI agent What Clicked for Me After Building on Solana for a Few Days WhatsApp's Encryption Stack: What It Covers, What It Doesn't, and What a Federal Agent Spent 10 Months Investigating Building CogniPlan: A Local-First Task Planning System Using Apache Iceberg with Python and MPP Query Engines How I Built AegisDesk: A Zero-Token Semantic IT Agent with <5ms Latency I built CodeArchy: an open-source that turns any codebase into a visual, explainable architectural experience, powered by Gemma 4. The Day Our Bot Ran Out of Money How we're using Gemini Embeddings to build a smarter, community-driven feed on DEV The Speculative Decoding Pattern The PKCE "Gotcha" in Expo’s exchangeCodeAsync TharVA : Keeping India's Desert Heritage Alive with Offline AI (Gemma4) n8n for Healthcare: 5 Automations for Clinics, Practices, and Health Tech Teams (Free Workflow JSON) How I Built an OWASP Memory Guard for AI Agents (ASI06) Condition-Based vs Time-Based Maintenance: Making the Switch I Tested Spam Protection on Formspree vs Formgrid. The Results Were Surprising. May 27 - Video Understanding Workshop Beyond Keywords: How Google's 2026 Algorithms are Redefining SEO From Click to Cart: Ensuring an Accessible Customer Journey in WooCommerce Your company won't replace you with good AI. They'll replace you with bad AI. How to Use an SVG Icon Search Engine as a Claude Custom Connector O fim do “modelo que faz tudo”? Conheça o Conductor, a IA que orquestra outras IAs 10 First-Principles Strategies to Learn Any Programming Language Deeply 10 First-Principles Strategies to Learn Any Programming Language Deeply Understanding Embeddings easily. The Hidden Cost of “Move Fast and Break Things” Why Your Logs Are Useless Without Traces DressCode: Your AI Stylist for Tomorrow The Documented Shortcoming of Our Production Treasure Hunt Engine I'm 16, and I Built an AI Tool That Audits Your Technical Debt Without Ever Touching code Building Your Own Crypto Poker Bot: A Developer's Guide to Blockchain Gaming Logic Apache Iceberg Metadata Tables: Querying the Internals Hermes, The Self-Improving Agent You Can Actually Run Yourself Unity vs Unreal: 5 Things I Had to Relearn the Hard Way Building Agentic Commerce Infrastructure: Overcoming SQLite Concurrency for Autonomous Procurement Agents Solana Accounts vs Databases HTML Table Borders I built a skill that makes AI-generated AWS diagrams actually usable My first post! I'm kinda excited The Page Root Was the Wrong Unit How to audit what your IDE extension actually sends to the cloud I Migrated 23 Make.com Scenarios to n8n and Cut My Bill by 60% — Complete Migration Guide (2026) Solving a Logistics Problem Using Genetic Algorithms Claude Code Skills Explained: What They Are & When to Use Them (2026) Maintaining Apache Iceberg Tables: Compaction, Expiry, and Cleanup Zero-Idle Local LLMs: Running Llama 3 in AWS Lambda Containers We scanned 8 B2B SaaS companies across 5 categories. ChatGPT named the same 12 brands in every answer. How To "Market" Yourself As A Tech Pro We scanned 500 MCP servers on Smithery. Here is what we found. HTML Basics for Beginners – Markup Language, Elements and Types of CSS DiffWhisperer: How I Turned Cryptic Git Diffs into Architectural Stories with Gemma 4 I built a version manager for llama.cpp using nothing but vibe coding. Unit Testing vs System Testing: Key Differences, Use Cases, and Best Practices for 2026 A game design textbook explains why products with fewer features win How to Build a Raydium Launchpad Bonding Curve in 5 Minutes with forgekit How to turn an AI prototype into a production system How Data Lake Table Storage Degrades Over Time Partition and Sort Keys on DynamoDB: Modeling data for batch-and-stream convergence Auto-Generate Optimized GitHub Actions Workflows For Any Stack With This New CLI Tool Unchaining the African Creator Economy The Treasure Hunt Engine Gotcha - A Lesson in Constrained Performance great_cto v2.17 - no more tambourine dance When Catalogs Are Embedded in Storage SafeMind AI: Instant Health & Safety Intelligence What Is PKCE, How It Works & Flow Examples AI Agent Failure Modes Beyond Hallucination Fastest Way to Understand Stryker Solana Accounts Explained to a Web2 Developer TV Yayın Akışı Sitesi Geliştirirken Öğrendiğim Teknik Dersler $500 Challenge Drop My First Look at Google's Gemma 4: A Quick Introduction How I use an LLM as a translation judge Best Calendar and Scheduling API for Developers — 2026 Comparison Agentic AI in Travel: Why UCP Isn't Travel-Ready Yet — and What We Measured I Finished Machine Learning. And Then Changed The Plan. The Five-Thousand-Line File The AI Whirlwind: Why Your Local Agent Matters More Than Ever I Built an Oracle DBA That Lives in Telegram. It Cut a 500K-Row Scan to 5 - After Asking Permission. The Day 2 Reality of Running a Kubernetes Lab on Your Mac: Stop/Start, CKS Scenarios, and What I Learned Building It. n8n for Airtable Power Users: 5 Automations That Take Your Base to the Next Level Validating Gemma 4 for Industrial IoT: A Governance Pattern VS Code Now Credits Copilot on Every Commit by Default Astro and Islands Architecture: Why Your Portfolio Doesn't Need React for Everything Booting from FAT12: How I added file reading to my x86 kernel Unity’s AI agent went public: the developers of a static analysis tool on what that means for code quality Anna's Archive publica un llms.txt para los LLMs que rastrean su catálogo CRDTs for Offline-First Mobile Sync Why I Built Mneme HQ: Preventing AI Agent Architectural Drift Google Antigravity 2.0 Is the I/O 2026 Announcement You Should Actually Care About I Built a Pay-Per-Call Crypto Signal API with x402 — Heres the Architecture JWT Token Refresh Patterns in React 19: Avoiding the Silent Auth Death Spiral 🚀 “From Prompts to Autonomous Agents: What Google I/O 2026 Changed” The Power of Distributed Consensus in Autonomous SOCs
Your AI is not bad, your instructions are
Jenuel Oras · 2026-05-23 · via DEV Community

Your AI is not bad, your instructions are

Most AI failures are not model failures. They are instruction failures. This guide shows how to write clearer prompts, give better context, use examples, set boundaries, and turn AI from a clever guesser into a reliable collaborator.

Most people judge an AI model too quickly. They type one vague request, get a shallow answer, and conclude the tool is bad.

Sometimes the model really is the problem. It may hallucinate, misunderstand a niche domain, or fail at a task that needs fresh data or exact execution. But a lot of the time, the AI did exactly what it was asked to do. The problem is that the request was incomplete, fuzzy, or missing the information a human coworker would normally ask for before starting.

AI is not magic. It is a prediction system that responds to the shape of your instruction. If the instruction is lazy, the answer usually feels lazy. If the instruction is specific, grounded, and testable, the answer gets much better.

That does not mean you need to write a novel every time you use ChatGPT, Claude, Gemini, Copilot, or any other assistant. It means you need to stop treating prompts like search queries and start treating them like task briefs.

The prompt is the interface

With normal software, the interface gives you buttons, menus, validation, and guardrails. If you fill out a form wrong, the app tells you. With AI, the interface is language. That makes it powerful, but it also makes it easy to be sloppy.

When you ask, "Write me a blog about AI," the model has to guess almost everything: audience, tone, length, point of view, structure, level of technical depth, examples, sources, and what you actually believe. It will usually choose the safest average answer. That is why so much AI writing sounds like a conference brochure.

A better request sounds more like this:

Write a 1,200-word blog for software developers and creators. The argument is that many bad AI outputs come from weak instructions, not weak models. Use a direct, practical tone. Include examples of bad prompts and better prompts. Avoid hype. Include references to prompt engineering research and documentation. End with a reusable checklist.

Same tool. Different interface. Better result.

AI needs context more than clever wording

People often think prompt engineering is about secret phrases. It is not. Most of the value comes from context.

If you hired a designer, developer, editor, or assistant, you would not say, "Make this better" and walk away. You would explain who the work is for, what good looks like, what constraints matter, what has already failed, and what should be avoided.

AI needs the same thing. It needs the background that lives in your head.

  • Who is the audience?
  • What is the goal?
  • What should the output be used for?
  • What format do you need?
  • What should it avoid?
  • What examples should it follow?
  • What sources or facts should it rely on?

Without that, the model fills in the blanks. Sometimes it guesses well. Sometimes it invents a business-school version of what you meant.

Bad prompts hide the standard of success

The most common instruction mistake is asking for an output without defining success.

"Make this landing page better" is not enough. Better for whom? Better at conversion? Better at clarity? Better for SEO? Better for trust? Better for mobile users? Better for a skeptical buyer who has never heard of you?

Try this instead:

Review this landing page for a first-time visitor. The goal is to make them understand the product in less than 10 seconds and click the download button. Identify confusing copy, missing trust signals, weak calls to action, and anything that may hurt mobile readability. Give the answer as a prioritized list with suggested replacement copy.

That prompt gives the model a job, a user, a goal, evaluation criteria, and an output format. The answer will almost always be more useful because the AI no longer has to invent the target.

Examples beat explanations

If you want a specific style, show it. If you want a certain format, show it. If you want a category of answer, show one good example and one bad example.

This is one reason few-shot prompting works so well. Research and practice both show that models respond strongly to examples in the prompt. You are not just describing the task. You are giving the model a pattern to continue.

For example, instead of saying "write in my style," paste two or three paragraphs you actually wrote and say what you want preserved: short sentences, plain words, personal opinion, no corporate tone. If you want a bug report, show the exact structure. If you want sermon notes, show the rhythm and sections. If you want a product changelog, show a previous changelog that worked.

Examples reduce guessing. That is the whole game.

Give the AI boundaries before it makes a mess

AI assistants are eager. That eagerness is useful when you want momentum, but dangerous when the task has rules.

If the assistant should not invent statistics, say so. If it should ask questions before making assumptions, say so. If it should cite sources, say so. If it should keep the answer under 500 words, say so. If it should not change code outside a certain folder, say so.

Good constraints do not make the model less creative. They keep it pointed in the right direction.

Useful constraints sound like this:

  • Do not invent facts. If a fact is missing, write "needs source" instead.
  • Use only the information pasted below.
  • Ask up to three clarification questions if the task is ambiguous.
  • Return the answer as JSON matching this schema.
  • Do not edit files outside the src/components folder.
  • Prefer simple language over marketing language.

These boundaries are boring. They also save hours.

Ask for the process when the task is complex

For simple tasks, direct prompting is fine. For complex tasks, ask the model to slow down.

Chain-of-thought research showed that large language models often perform better on reasoning tasks when prompted to work through intermediate steps. You do not always need to see a hidden internal reasoning trace, and many modern tools handle reasoning differently now, but the practical lesson still matters: complex work improves when you separate thinking from final output.

Instead of asking, "Build me a marketing plan," ask for stages:

  • First, identify the audience and assumptions.
  • Then list possible angles.
  • Then choose the strongest angle and explain why.
  • Then draft the plan.
  • Finally, critique the plan for weak points.

This works because the model is less likely to collapse everything into a polished but shallow answer. It has to make its intermediate choices visible enough for you to correct them.

Treat AI output like a first draft, not a final authority

Even with great instructions, AI can be wrong. It can cite the wrong source, misread your intent, or produce code that looks clean but breaks an edge case.

The point of better instructions is not blind trust. The point is better raw material.

For writing, that means you still edit. For code, you still run tests. For research, you still check sources. For business decisions, you still apply judgment. AI can move the work forward, but it should not quietly replace verification.

A good workflow is simple:

  • Brief the AI clearly.
  • Review the output critically.
  • Ask it to revise against specific feedback.
  • Verify facts, links, code, and assumptions.
  • Save the prompt pattern if it works.

A reusable instruction template

When a prompt matters, use this structure:

Role: What perspective should the AI take?

Task: What exactly should it produce?

Context: What background does it need?

Audience: Who is this for?

Constraints: What rules, limits, or preferences matter?

Examples: What should it imitate or avoid?

Output format: Markdown, JSON, bullet list, table, code patch, email, article, or something else?

Verification: How should the result be checked?

Here is a practical version:

You are an experienced technical editor. Rewrite the article below for independent developers who use AI tools. Keep the tone direct and human. Remove hype, vague claims, and filler. Preserve the main argument. Add concrete examples where the draft is abstract. Do not invent sources. Return the result in Markdown with H2 headings, short paragraphs, and a final checklist.

That prompt will not solve every problem. But it gives the model a real assignment instead of a wish.

The better question

When AI gives you a bad answer, do not only ask, "Which model should I use instead?" Sometimes the better question is, "What did I fail to tell it?"

Did you define the audience? Did you include context? Did you show examples? Did you explain what good looks like? Did you set boundaries? Did you ask for sources? Did you verify the result?

Better models help. Better instructions help more often than people want to admit.

Your AI might not be bad. It might just be under-briefed.

References

Originally published at https://blog.jenuel.dev/blog/your-ai-is-not-bad-your-instructions-are