慣性聚合 高效追讀感興趣之博客、新聞、科技資訊
閱原文 以慣性聚合開啟

推薦訂閱源

博客园 - 司徒正美
V
V2EX
T
Tailwind CSS Blog
有赞技术团队
有赞技术团队
aimingoo的专栏
aimingoo的专栏
Apple Machine Learning Research
Apple Machine Learning Research
IT之家
IT之家
Blog — PlanetScale
Blog — PlanetScale
A
About on SuperTechFans
月光博客
月光博客
T
The Blog of Author Tim Ferriss
宝玉的分享
宝玉的分享
Martin Fowler
Martin Fowler
博客园 - 聂微东
The GitHub Blog
The GitHub Blog
V
Visual Studio Blog
WordPress大学
WordPress大学
酷 壳 – CoolShell
酷 壳 – CoolShell
Engineering at Meta
Engineering at Meta
GbyAI
GbyAI

DEV Community

Authentication Security Deep Dive: From Brute Force to Salted Hashing (With Java Examples) Why AI Systems Don’t Fail — They Drift Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet" I Replaced Chrome with Safari for AI Browser Automation. Here's What Broke (and What Finally Worked) How Python Borrows Other People's Work The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime Vibe Coding: A Workflow Guide (From Zero to SaaS) Most webhook security guides protect the wrong side. The scary part is delivery. Headless CMS for TanStack Start: Build a Blog with Cosmic EU Age Verification App "Hacked in 2 Minutes" — What Actually Happened Comfy Cloud’s delete function does not actually remove files Running AI Models on GPU Cloud Servers: A Beginner Guide Event-driven media intelligence with AWS Step Functions and Bedrock I scored 500 AI prompts across 8 quality dimensions — here's what broke How to Call Google Gemini API from Next.js (Free Tier, No Backend Needed) The Portal Protocol: Reclaiming Human Connection in the Age of AI How to Fix Your Team's Scattered Knowledge Problem With a Self-Hosted Forum Intro to tc Cloud Functors: A Graph-First Mental Model for the Modern Cloud Designing Multi-Tenant Backends With Both Ownership and Team Access I Built a Neumorphic CSS Library with 77+ Components — Here's What I Learned PostgreSQL Performance Optimization: Why Connection Pooling Is Critical at Scale Cómo construí un SaaS multi-rubro para gestionar expensas en Argentina con FastAPI + Vue 3 🚀 I Built an Ethical Hacking Scanner Tool – Open Source Project I Replaced /usage and /context in Claude Code With a Single Statusline A Pythonic Way to Handle Emails (IMAP/SMTP) with Auto-Discovery and AI-Ready Design I Collected 8.9 Million Polymarket Price Points — Here's What I Found About How Markets Really Move EcoTrack AI — Carbon Footprint Tracker & Dashboard Everyone's Using AI. No One Agrees How. 5 self-hosted ebook managers worth trying in 2026 Building Your First AI Agent with LangChain: From Chatbot to Autonomous Assistant Common SOC 2 Failures (Real World) Stop Vibe-Checking Your AI App: A Practical Guide to Evals How to Use SonarQube and SonarScanner Locally to Level Up Your Code Quality Your Next To-Do App Is Dead — I Replaced Mine with an OpenClaw AI Sign a Nostr event in 60 lines of Python using coincurve — no nostr-sdk, no nbxplorer, no rust toolchain ITGC Audit Explained Like You’re in Big 4 Patch Tuesday abril 2026: Microsoft parcha 163 vulnerabilidades y un zero-day en SharePoint Stop scraping everything: a better way to track competitor price changes Listing on MCPize + the Official MCP Registry while routing payments OUTSIDE the marketplace — how I kept 100% of my x402 revenue Building an AI-Powered Risk Intelligence System Using Serverless Architecture Why We Ripped Function Overloading Out of Our AI Toolchain Testing AI-Generated Code: How to Actually Know If It Works SaaS Churn Is Killing Your Business. Here Is What to Do About It (Without a Support Team) The Speed of AI Is No Longer Linear - And Self-Improving Models Are Why How to Implement RBAC for MCP Tools: A Practical Guide for Engineering Teams From Standard Quote to Persuasive Proposal: AI Automation for Arborists I built a CLI that scaffolds complete multi-tenant SaaS apps Axios CVE-2025–62718: The Silent SSRF Bug That Could Be Hiding in Your Node.js App Right Now The dashboard that ended our friendship Data Pipelines Explained Simply (and How to Build Them with Python)
不可造人工智能之使,(and what to ship instead)
SapotaCorp · 2026-05-24 · via DEV Community

一SaaS之创者,嘱吾等评其CTO所返之"自主AI"商贾之议。其策十八使,一策者大言,一行者大言,一评者大言,六月而成,其模之费增五倍。

今之产品乃应答客户于其账单之门户所问之FAQ之聊天机器人。入问者约八十为"如何将发票导出为PDF"之变体。其余二十则转至人手,盖聊天机器人未谙其术也。

吾等告其始创者,于既存之提示增句,并将此能动之计划置诸积压之列。

此乃每周几近必现之谈。诸商贾皆向 B2B SaaS 之创者游说代理。然多数创者实无此需。今述 Sapota 所用之决断之纲.

人工智能代理者实为何物

去其营销之辞,代理者乃三事附于大语言模型:

  • 工具:能调用外函数(搜查、查询数据库、运行代码)之能
  • 记忆:跨多LLM调用而存之态
  • 自主:LLM决次行,非开发者所定

自主性者,使之为智能体也。寻常之RAG流程,取片段而应答,循固定之序。智能体则每步自决,或取更多,或调用工具,或求明晰,或定答案。

此能实为某些事所宜。然其价每询较寻常LLM调用,昂五至十五倍,速减二至十倍,且调试甚艰。必待算术相宜,而后此役得立其位。

单问之试

萨波塔之滤,一问而已:独一精撰之提示,足竟此务乎?

若然,则勿建代理人,用此提示。

若否,则问:固定两步之管(检索而后生成)可成之乎?若然,则建 RAG 之管,毋为 agent。

若犹无应,则问:此举实需乎动态决断何为后续之策,抑或开发者惟惑于步骤当为何?若属后事,则书其程。设计流程之不适,非委诸大语言模型之由也。

若三问之后,答仍曰"是,此需自主",则代理人乃适器也。吾辈验之,此滤可去"吾需代理人"之谈七成。

创始者之聊机器人所实需者

八成复问关于发票导出者,乃提示之弊也。其初提示若此:

汝乃助人解忧之客服,应答客问。

其修也,乃增:

若用户询及发票导出,则应答曰:"点击设置→账单→导出。择PDF格式及欲选日期区间。"复问其需他事否。

此一语解八成之来量。无需代理人,无需供应商。三分钟之功耳。

所余二十者,分为二类。其十二者,关乎客之账户状态,诚需调用用户数据库之工具。其余八者,乃边缘之例,本应转交于人。

架构得当,宜为单提示聊天之器,具一器(查账),若有难,则托人手。两日之功。每询之费,恒不逾五分钱。其能自主之议,初估每询五分钱,且历六月乃成.

何时宜用人手

吾非恶人手。Sapota所荐建人手之务,有三共性:

真正多步之决断,其第 N 步之结果,依第 N-1 步而定。 研究之综合,其后续之搜寻,依首搜所返之结果而定。代码重构,其后续之文件,依已改之变故而定。客户导入之分诊,其路由,依自多系统所拉之数据而定。

LLM 所需协调之工具。 单一工具调用非为智能体,实乃工具调用而已。智能体之复杂,在于大语言模型于三至七工具间抉择,并融汇其果。客户支持者,取自CRM、知识库、计费系统及货运追踪。销售研究者,合LinkedIn、新闻、财务数据及CRM历史。

成本之谓,方堪其繁。 自主架构于大语言模型之成本,每项任务增价自二分至二角,未计研发与运维之费。此价适于研究助理,售技工程师日使十次。然不适宜于应答器,每月需处理五万问,每问价仅五分者。

吾辈所见之式

吾人所见创始人之谬,莫过于视"代理"为精深之阶,而非具特定应用之器。其说辞甚具诱力:自主、智能、自进、自适。然多数B2B SaaS产品之实,询查多复,数据之源有限,用户之期可测。

善书之示,持一器,设诚信之关,其费、其迟、其信,于下八十之"AI之能"用例,胜于使。其上二十之用例,使胜者诚且要,然非百之用例也。

售此代理平台者,无劝告之诚。其说曰:“代理之智乃未来之趋。”亦曰:“购此平台于吾。”此二说皆可真。此特定之品,亦或尚无参与未来之需。

桃子之荐

未决意于代理之筑:

  • 运单问之试。诚言自主之需否。
  • 量产之率,估每务之费。乘以所期之询率。较今费。
  • 审今之示语与器设。多所谓"AI之能缺",非失代理,乃失示语。
  • 若答仍曰"需代理人",当以一代理人、一器始。勿建十八代理人系统,如商所拟。单代理人基线,乃衡多代理人架构是否值其烦之据。

小建以测,渐扩。代理人模式,诚实而有用。然亦常非所问之谬答也。

若商贾向尔游说

若尔之众被言人工智能之代理乃次季之蓝图,则签约之前所问者,乃尔生产之问凡几实需自主也。若商贾不能以尔之数据而应此问,则彼所售者,非解也,乃产也。

萨波塔提供一周之期,审计查询之分布。其取尔生产查询之录,依复杂度分类诸查询,并出尺寸之议,何部分需代理,何部分宜以一 prompt 配适之工。其出为预算与架构之文,尔辈可持以示于商贾或董事会。

欲达,请至 人工智能工程之页。 而君之今时AI架构及约略询量。诊谈之语,几每易“有司或无”之问为“自主之需,究在何处”。