The Mystery of Backend in DeepAgents: Enabling AI Agents to Have File Operation Capabilities

Recommended Feeds

博

博客园 - 司徒正美

V2EX

Tailwind CSS Blog

有赞技术团队

aimingoo的专栏

Apple Machine Learning Research

IT之家

Blog — PlanetScale

About on SuperTechFans

月光博客

The Blog of Author Tim Ferriss

博客园 - work hard work smart

5 分钟上手：为 Cline 配置一个免费的 MCP 天气服务 Neo4j 图数据库安装与 Spring Boot 集成实战指南 LangFuse 实战指南：用 @observe 三行代码给 LLM 应用加上全链路追踪 Function Call 深度解析：让大模型从"嘴炮"到"实干"的技术革命 Spring AI 提示词模板实战：告别硬编码，实现提示词工程化管理 LangChain4j 实战指南：用 Java 轻松构建 AI 应用 - work hard work smart Spring AI 对话短期记忆实战：让大模型拥有"记忆力" Spring AI 提示词工程实战：让大模型更懂你的意图 Spring AI ChatClient 深度解析：优雅构建大模型应用的利器 Spring AI Alibaba DashScopeChatModel 实战 Spring 中 SSE 流式输出的多种实现方式详解 OpenSandbox 实战指南：为 AI Agent 构建安全的代码执行沙箱在本机启动 LangGraph 开发服务器：完整指南为什么选择 Go 开发 Web 接口？从入门到实践智能搜索DeepAgent笔记 RAG学习笔记2--系统查询流程 RAG学习笔记1--系统文件导入流程百炼 WebSearch 快速入门指南 Python 连接 MongoDB 完整指南：从连接配置到增删改查实战一行命令搞定 MongoDB 开发环境：Docker Compose 部署 + 可视化管理使用 Attu 可视化管理 Milvus 向量数据库在 Windows Docker 中快速安装 Milvus 2.5.6 minio使用 Spark 集群搭建 hadoop集群安装 Spring AI Alibaba 入门实战 Windows 安装 OpenClaw 实战指南 MyBatis 核心流程和原理 Idea中安装Claude code插件 Spark 编程使用Matplotlib 绘制直方图 Flink安装部署 Flume安装查找导致cpu过高的代码方法 JVisualVM监控远程Java进程 jmap jacoco多模块生成java单元测试报告实践 arthas 使用demo LockSupport Exchanger CyclicBarrier CountDownLatch 手把手教你用python开始第一个机器学习项目

The Mystery of Backend in DeepAgents: Enabling AI Agents to Have File Operation Capabilities

work hard wo · 2026-05-24 · via 博客园 - work hard work smart

When building intelligent agents (AI Agents), we often need to enable the Agent to have the ability to read, write, search, and edit files. The DeepAgents framework achieves this byBackend (Backend)The mechanism achieves this function, providing an Agent with a secure and controllable file operation environment. This paper will delve into the concept and usage of Backend, demonstrating its powerful capabilities through four progressive examples.

What is Backend?

The Backend is a core abstraction in the DeepAgents framework, defining how Agents interact with the file system. Through the Backend, Agents can:

List files and directories()ls_info)
Read file content()read）
write to new file （write）
search file content （grep_raw）
wildcard match files （glob_info）
edit existing files （edit）

The core value of Backend lies in security and flexibility：It can restrict Agent's file operations within specific directories, preventing unauthorized access, while supporting multiple storage media (local file systems, memory, cloud storage, etc.).

Usage Scenarios for __JHSNS_SEG_c369f8fe_28__Backend

Code Generation and Editing：Allow Agents to write and modify code in a sandbox environment
Data Analysis：Agents read, process, and generate data files
Document Processing：Automate document creation and content updates
Configuration Management：Dynamically generate and modify configuration files

Types of Backend Supported by DeepAgents

The DeepAgents framework provides multiple Backend implementations to meet the needs of different scenarios:

1. FilesystemBackend - Local filesystem

Function : Directly operate on local disk files
Features : Persistent storage, supports virtual_mode for secure isolation
Applicable : Scenarios requiring file persistence and sharing with other systems

2. LocalShellBackend - Local Shell execution

Function : File operations + Execution of local Shell commands
Features: Supports running scripts, calling system tools (git, docker, etc.)
Applicable to : Scenarios requiring code compilation, running scripts, and executing complex commands

3. SandboxBackend - Remote Sandbox Environment

Features : Executes commands and operates files in an isolated sandbox environment
Characteristics : Fully isolated execution environment, supports cloud sandbox (e.g., OpenSandbox, Daytona)
Subclass Implementations :
- OpenSandboxBackend: Open-source sandbox solution based on OpenSandbox
- DaytonaBackend: Cloud sandbox solution based on Daytona
is applicable to : scenarios requiring high-security isolation, running untrusted code, and cloud execution.

4. Custom Backend - Fully Flexible

Function : Customize storage logic by implementing the BackendProtocol interface.
Features : Can integrate with any storage medium (memory, database, cloud storage, etc.).
Examples : DictBackend (in-memory storage), RedisBackend (Redis storage), etc.
is applicable to : special requirements, testing environments, custom storage logic.

Comparison of Backend Types

Backend Types	File Operations	Command Execution	Persistence	Security Isolation	Typical Scenarios
FilesystemBackend	✅	❌	✅	Medium (virtual_mode)	Local File Processing
LocalShellBackend	✅	✅(Local)	✅	Medium(virtual_mode)	Local script execution
SandboxBackend	✅	✅(Sandbox)	Optional	High(Full isolation)	Cloud secure execution
Custom Backend	Custom	Custom	Custom	Custom	Special requirements

Four progressive examples

Example 1: Agents without a Backend

The simplest Agent configuration does not use a Backend and relies solely on tool calls to complete tasks:

from deepagents import create_deep_agent
from langgraph.checkpoint.memory import InMemorySaver

# 创建内存检查点保存器
checkpointer = InMemorySaver()

# 创建基础Agent
agent = create_deep_agent(
    model=llm,
    tools=[web_search],
    #checkpointer=checkpointer,
    system_prompt='你是一个助手，请根据用户输入的指令，进行相应的操作。'
)

Features:

Suitable for scenarios that only require calling external APIs (such as web search)
Unable to perform file operations
Simple to configure, quick to start

Example 2: FilesystemBackend - Filesystem Sandbox

When an Agent needs to operate on files,FilesystemBackendThe most common choice:

import os
from deepagents import create_deep_agent
from deepagents.backends import FilesystemBackend
from langgraph.checkpoint.memory import InMemorySaver

checkpointer = InMemorySaver()

# 创建Agent工作目录
temp_workspace = "./agent_workspace"
os.makedirs(temp_workspace, exist_ok=True)

agent = create_deep_agent(
    model=llm,
    tools=[web_search],
    checkpointer=checkpointer,
    backend=FilesystemBackend(
        root_dir=temp_workspace,
        virtual_mode=True,  # 防止Agent通过`../../`跳出根目录
    ),
    system_prompt='你是一个助手，请根据用户输入的指令，进行相应的操作。'
)

Key parameters:

root_dir：Specify the working root directory for the Agent
virtual_mode=True：Enable virtual mode to prevent path traversal attacks (such as ../../etc/passwd)

：Applicable scenarios：

needs persistent storage for files
Files generated by the Agent need to be retained on disk
Need to share files with other systems

Example 3: LocalShellBackend - Supports command execution

If the Agent not only needs to operate on files but also execute shell commands, LocalShellBackend is a better choice:

import os
import sys
from deepagents import create_deep_agent
from deepagents.backends import FilesystemBackend, LocalShellBackend
from langgraph.checkpoint.memory import InMemorySaver

checkpointer = InMemorySaver()

temp_workspace = "./agent_workspace"
os.makedirs(temp_workspace, exist_ok=True)

agent = create_deep_agent(
    model=llm,
    tools=[web_search],
    checkpointer=checkpointer,
    backend=LocalShellBackend(
        root_dir=temp_workspace,
        virtual_mode=True,
        timeout=30,  # 命令执行超时时间（秒）
        max_output_bytes=50000,  # 命令输出最大字节数
        env={
            "PATH": f"{os.path.dirname(sys.executable)};{os.environ.get('PATH', '')}",
        },
    ),
    system_prompt='你是一个助手，请根据用户输入的指令，进行相应的操作。'
)

New features：

supports executing shell commands (such as python script.py, ls -la).
can be configured with timeout and output limits to prevent malicious or inefficient commands.
custom environment variables to ensure commands run in the correct environment.

Applicable scenarios :

need to run scripts or compile code
need to call system tools (such as git, docker)
need to execute complex data processing pipelines

Example four: Custom Backend - DictBackend

The power of DeepAgents lies in its extensibility. By implementingBackendProtocol, we can create fully customizable Backend:

from deepagents.backends.protocol import BackendProtocol, WriteResult, EditResult
from deepagents.backends.utils import FileInfo, GrepMatch
from datetime import datetime
import re

class DictBackend(BackendProtocol):
    """将文件存储在内存字典中的自定义后端"""
    
    def __init__(self):
        self.files = {}  # 路径 -> 内容
        self.metadata = {}  # 路径 -> 元数据
    
    def ls_info(self, path: str) -> list[FileInfo]:
        """列出文件和目录信息"""
        result = []
        seen_dirs = set()
        for file_path in self.files.keys():
            if file_path.startswith(path):
                remaining = file_path[len(path):]
                if '/' in remaining:
                    dir_name = path + remaining.split('/')[0] + '/'
                    if dir_name not in seen_dirs:
                        seen_dirs.add(dir_name)
                        result.append(FileInfo(path=dir_name, is_dir=True))
                else:
                    meta = self.metadata.get(file_path, {})
                    result.append(FileInfo(
                        path=file_path,
                        is_dir=False,
                        size=meta.get('size', 0),
                        modified_at=meta.get('modified_at')
                    ))
        return result
    
    def read(self, file_path: str, offset: int = 0, limit: int = 2000) -> str:
        """读取文件内容，支持分页"""
        if file_path not in self.files:
            return f"Error: File '{file_path}' not found"
        content = self.files[file_path]
        lines = content.splitlines(keepends=True)
        start_line = offset
        end_line = start_line + limit if limit > 0 else len(lines)
        selected_lines = lines[start_line:end_line]
        result = ''.join(f"{start_line + i + 1}: {line}" for i, line in enumerate(selected_lines))
        return result if result else "(end of file)"
    
    def write(self, file_path: str, content: str) -> WriteResult:
        """写入新文件（仅创建，不覆盖）"""
        if file_path in self.files:
            return WriteResult(error=f"File '{file_path}' already exists (create-only).")
        self.files[file_path] = content
        self.metadata[file_path] = {
            'size': len(content),
            'modified_at': datetime.now().isoformat()
        }
        return WriteResult(path=file_path, files_update=None)
    
    def grep_raw(self, pattern: str, path: str | None = None, glob: str | None = None) -> list[GrepMatch] | str:
        """正则表达式搜索文件内容"""
        try:
            re.compile(pattern)
        except re.error as e:
            return f"Invalid regex pattern: {e}"
        matches = []
        for file_path, content in self.files.items():
            for i, line in enumerate(content.splitlines()):
                if re.search(pattern, line):
                    matches.append(GrepMatch(path=file_path, line=i+1, text=line))
        return matches
    
    def glob_info(self, pattern: str, path: str = "/") -> list[FileInfo]:
        """通配符匹配文件"""
        import fnmatch
        all_files = [
            FileInfo(path=p, is_dir=False, size=self.metadata[p]['size'], 
                    modified_at=self.metadata[p]['modified_at']) 
            for p in self.files.keys() if p.startswith(path)
        ]
        matched = [fi for fi in all_files if fnmatch.fnmatch(fi.path, path.rstrip('/') + '/' + pattern)]
        return matched
    
    def edit(self, file_path: str, old_string: str, new_string: str, replace_all: bool = False) -> EditResult:
        """编辑文件内容"""
        if file_path not in self.files:
            return EditResult(error=f"File '{file_path}' not found")
        content = self.files[file_path]
        if replace_all:
            new_content = content.replace(old_string, new_string)
            occurrences = content.count(old_string)
        else:
            if content.count(old_string) != 1:
                return EditResult(error=f"Found {content.count(old_string)} occurrences. For safety, edit requires exactly one match unless replace_all=True.")
            new_content = content.replace(old_string, new_string, 1)
            occurrences = 1
        if new_content == content:
            return EditResult(error=f"String '{old_string}' not found.")
        self.files[file_path] = new_content
        self.metadata[file_path]['size'] = len(new_content)
        self.metadata[file_path]['modified_at'] = datetime.now().isoformat()
        return EditResult(path=file_path, files_update=None, occurrences=occurrences)

using custom Backend :

from deepagents import create_deep_agent
from deepagents.backends import FilesystemBackend, LocalShellBackend
from langgraph.checkpoint.memory import InMemorySaver

from my_backend import DictBackend

checkpointer = InMemorySaver()

agent = create_deep_agent(
    model=llm,
    tools=[web_search],
    checkpointer=checkpointer,
    backend=DictBackend(),
    system_prompt='你是一个助手，请根据用户输入的指令，进行相应的操作。'
)

Advantages of custom Backend :

Full control : Can define any storage logic (database, cloud storage, encrypted storage, etc.)
Security isolation : Agent operations are entirely in memory, not affecting the real file system
Testability : Facilitates unit testing and debugging
Performance optimization : Optimizing read/write performance for specific scenarios

Backend Selection Guide

Scenario	Recommended Backend	Reason
Only API calls, no file operations	No Backend	Simple and efficient
Need persistent file storage	FilesystemBackend	Direct disk file operations
Need to execute shell commands	LocalShellBackend	Support command execution + file operations
Need full isolation or custom storage	Custom Backend	Flexible and controllable, suitable for special needs

Best practices

Always enable virtual_mode: Prevent path traversal attacks, ensure Agent operates within the sandbox
Set reasonable timeouts and limits: Avoid malicious or inefficient commands consuming excessive resources
Use checkpointer to save state: Support conversation history recovery and resume from breakpoints
Implement the complete interface when customizing Backend: Ensure all methods are correctly implemented to avoid runtime errors
Log Backend operations：For debugging and auditing Agent behavior

Summary

Backend is the bridge connecting AI Agent with the file system in the DeepAgents framework. By flexibly selecting or customizing Backend, we can create a secure, efficient, and controllable file operation environment for the Agent. From simple no Backend configuration to powerful LocalShellBackend, to fully customizable DictBackend, DeepAgents provides rich options to meet various application scenarios.

Mastering the use of Backend will help you build more powerful and practical AI Agents, enabling smart agents to truly have "hands-on" capabilities!

This content is automatically aggregated by InertiaRSS (RSS Reader) for reading reference only. Original from — Copyright belongs to the original author.