InertiaRSS Track and read blogs, news, and tech you care about
Read Original Open in InertiaRSS

Recommended Feeds

博客园 - 司徒正美
V
V2EX
T
Tailwind CSS Blog
有赞技术团队
有赞技术团队
aimingoo的专栏
aimingoo的专栏
Apple Machine Learning Research
Apple Machine Learning Research
IT之家
IT之家
Blog — PlanetScale
Blog — PlanetScale
A
About on SuperTechFans
月光博客
月光博客
T
The Blog of Author Tim Ferriss
宝玉的分享
宝玉的分享
Martin Fowler
Martin Fowler
博客园 - 聂微东
The GitHub Blog
The GitHub Blog
V
Visual Studio Blog
WordPress大学
WordPress大学
酷 壳 – CoolShell
酷 壳 – CoolShell
Engineering at Meta
Engineering at Meta
GbyAI
GbyAI

阮一峰的网络日志

科技爱好者周刊(第 396 期):互联网通信的替代方案 科技爱好者周刊(第 396 期):互联网通信的替代方案 - 阮一峰的网络日志 科技爱好者周刊(第 395 期):软件开发的第三种方式 科技爱好者周刊(第 395 期):软件开发的第三种方式 - 阮一峰的网络日志 科技爱好者周刊(第 393 期):脑腐状态 科技爱好者周刊(第 392 期):axios 投毒与好莱坞式骗术 科技爱好者周刊(第 391 期):AI 的贫富分化 科技爱好者周刊(第 390 期):没有语料,大模型就是智障 套壳中国大模型撑起500亿美元估值?扒一扒 Cursor 的"套壳"疑云 科技爱好者周刊(第 389 期):未来如何招聘程序员 科技爱好者周刊(第 388 期):测试是新的护城河 零安装的"云养虾":ArkClaw 使用指南 科技爱好者周刊(第 387 期):你是领先的 科技爱好者周刊(第 386 期):当外卖员接入 AI 字节全家桶 Seed 2.0 + TRAE 玩转 Skill 科技爱好者周刊(第 385 期):马斯克害怕中国车企吗? 智谱旗舰 GLM-5 实测:对比 Opus 4.6 和 GPT-5.3-Codex 科技爱好者周刊(第 384 期):为什么软件股下跌 科技爱好者周刊(第 383 期):你是第几级 AI 编程 Kimi 的一体化,Manus 的分层 科技爱好者周刊(第 382 期):独立软件的黄昏 科技爱好者周刊(第 381 期):中国 AI 大模型领导者在想什么 科技爱好者周刊(第 380 期):为什么人们拥抱"不对称收益" 科技爱好者周刊(第 379 期):《硅谷钢铁侠》摘录 我如何用 AI 处理历史遗留代码:MiniMax M2.1 升级体验 科技爱好者周刊(第 378 期):预测是新的互联网热点 科技爱好者周刊(第 377 期):14万美元的贫困线 科技爱好者周刊(第 376 期):太空数据中心的争议 科技爱好者周刊(第 375 期):一扇门的 Bug 终于有人做了 Subagent,TRAE 国内版 SOLO 模式来了 科技爱好者周刊(第 374 期):6GHz 的问题 VS Code 使用国产大模型 MiniMax M2 教程 科技爱好者周刊(第 373 期):数据模型是新产品的核心 国产大模型接入 Claude Code 教程:以 Doubao-Seed-Code 为例 科技爱好者周刊(第 372 期):软件界面如何设计 大模型比拼:MiniMax M2 vs GLM 4.6 vs Claude Sonnet 4.5 科技爱好者周刊(第 371 期):一个乐观主义者的专访 科技爱好者周刊(第 370 期):正确的代码高亮 错误处理:异常好于状态码 科技爱好者周刊(第 369 期):Tim 与罗永浩的对谈 科技爱好者周刊(第 368 期):不要这样管理软件团队 一天之内,智谱和 Anthropic 都发了最强编程模型 科技爱好者周刊(第 367 期):Nano Banana 的几个妙用 科技爱好者周刊(第 366 期):旧金山疯狂的 AI 广告 科技爱好者周刊(第 365 期):流量变现正在崩塌 科技爱好者周刊(第 364 期):最难还原的魔方 科技爱好者周刊(第 363 期):最好懂的神经网络解释 科技爱好者周刊(第 362 期):GitHub 工程师谈系统设计 科技爱好者周刊(第 361 期):暗网 Tor 安全吗? 科技爱好者周刊(第 360 期):Dan Wang 的新书
AI native Workspace might be the next stage for agents
阮一峰 · 2026-01-22 · via 阮一峰的网络日志

I. The Form of Intelligent Agents

I ask everyone a question, what is the product form of AI?

Large models are just underlying processing engines; you always need an application-level product to meet user needs. This application layer of AI is referred to as an "intelligent agent" (agent).

So, the question becomes, what should an "intelligent agent" look like?

Early intelligent agents were just conversational applications (as shown in the image above), later incorporating reasoning to think through complex problems.

Later, they evolved towards specialized fields, giving rise to programming agents (coding agents), image agents, video agents, and more, or by integrating with MCP, gaining external application operation capabilities, such as generating Office documents or controlling browsers.

These forms are basically mature, and many companies are now exploring what the next generation of intelligent agents will look like?

I recently started using the newly released AI native Workspace (AI Native Workspace), and happily feel that this might be the answer.

II. Cowork and Skill

This new product also incorporates two new concepts recently proposed by Anthropic: Cowork and Skill.

Cowork, simply put, is a "computer operation assistant." It is essentially the graphical interface version of a programming intelligence agent, allowing users who don't understand programming to express their needs in natural language, then have AI generate underlying code and execute it to automatically operate the local computer and complete tasks.

Skill is even simpler. It is a preset prompt, equivalent to a "user manual," which describes in detail to the AI how to complete a specific task. You can understand it this way: each Skill is an expert, giving AI specific skills in a particular field.

These two things, one is an operation assistant, and the other is expert mode. The former uses AI to operate the computer, and the latter enables AI to have specialized skills.

What will happen when they are combined?

MiniMax AI native Workspace is such a product, exploratively combining Cowork and Skill, while possessing both capabilities in a completely new product form.

Its desktop version (desktop) provides Cowork capabilities, while expert mode (experts) provides Skill capabilities.

III. Desktop Operation Assistant

Next, I will demonstrate where its differences lie from traditional intelligent agents.

Its desktop client is positioned as an "AI native workspace," with the following capabilities.

  • Directly access local files: able to read and write, as well as automatically upload or download files.
  • Automated workflows: capable of breaking down tasks and running web automation.
  • Deliver professional results: After execution, high-quality deliverables can be generated, such as Excel spreadsheets, PowerPoint presentations, and formatted documents.
  • Long-running tasks: For complex tasks, they can run for a long time without being affected by dialogue timeouts or context limitations.

Note that since it can operate computers and communicate with the internet, you must specify a directory before execution to prevent reading or writing to directories that should not be accessed, and you must have backups to prevent original files from being deleted or modified.

First, go to the official website to downloadDesktop clientAvailable for Windows/Mac versions, new registered users can currently try it for free for 3 days.

After installation, run it directly to enter the task interface, which is a traditional dialog box.

At this point, specifying the run directory enters the "workbench" mode, allowing operations on that directory. The software will pop up a warning message indicating the risk.

At this point, you can let it perform various tasks. For example, I had it organize PDF invoices from various electronic services and then generate a summary Excel document.

At this point, it will automatically install a Python virtual environment in the current directory, then generate and execute a Python script.

Excel file will be ready soon.

Similarly, various file organization tasks can be handed over to it, such as sorting photos, renaming files, and more.

It can also perform web automation, such as automatically browsing a webpage and extracting information, summarizing content.

IV. Expert Systems

Above, its workbench functions are demonstrated, capable of acting as a "digital employee," and below, let's take a look at its "expert system."

So-called "expert systems" involve injecting specific prompt files to expand the skills of an intelligent agent, equivalent to injecting deep knowledge and capabilities. Users can also upload private knowledge bases.

Everyone can open itWeb-basedClick "Explore Experts" on the left sidebar.

The system comes with some "preset experts" that can be used directly.

I chose a "Icon Maker" provided by the system, which is a skill for making logos, to see how it works.

I request to create a "Panda Eating Ice Cream" logo, and the system prompts to select a design style.

Finally, two files (sitting posture and standing posture) were generated for selection, and the results are quite good.

5. Create New Skill

In addition to the preset experts, the system also allows you to create "My Experts," which are custom skills of a certain type.

You need to enter the capability description and instructions, and you can also add corresponding MCP, SubAgent, environment variables, Supabase database, and so on.

I directly input the Skill fileprovided by Anthropic to see the effect.

I selected the frontend-design (Frontend Design) skill and after entering it, it can be seen on the "My Experts" page.

Note that the system currently only supports inputting skill description files and does not support uploading static resource files (assets). It is hoped that this can be added later.

After selecting this expert, I request to generate an algorithm visualization page.

"Create a sorting algorithm visualization website that lists animations of common sorting algorithms. Selecting a particular algorithm will display its animation effect."

The generation process took about ten minutes, and the result was obtained. The system generated animations for ten sorting algorithms and was directly deployed online.

I adjusted the animation colors again later, everyone can goThis websiteCheck out the effect, it's still cool.

VI. Summary

AI native WorkspaceThe AI intelligent agent has been introduced to the local computer, enabling automated operations, while also incorporating a skill interface that allows for the injection of external knowledge and capabilities. Moreover, all operations can be completed through natural language dialogue, requiring low demands from the user.

This instantly opens up the imagination space for AI agents. The tasks they can accomplish will no longer be limited by the capabilities of the model, but only by our imagination.

I believe that this product represents the direction of AI agent development in the next stage, it will open up many brand new possibilities for us to explore.

(End)