I Spent Five Months Building a Local AI Coding Agent. Eve Agent V2 Unleashed Is Here.
Jeff Green · 2026-05-24 · via DEV Community

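This is a submission for the GitHub finish-up challenge.

What I Built

Eve Agent V2 Unleashed (V2U) is an autonomous AI coding agent that runs on your own machine: no cloud account, no subscription, no data leaving your device.

It has two layers that work together:

Soul layer - a fine-tuned local model that runs on your GPU, with Eve's personality baked directly into the weights. Not a system-prompt trick. The personality lives in the parameters.

Worker layer - Qwen3 Coder 480B, served through Ollama cloud, handles the heavy autonomous coding. A 40-turn tool-calling loop, full filesystem access, bash execution, live web search, and git operations are all included.

The interface is a cyberpunk-style terminal built as a single HTML file with no build step. An animated pixel-art robot avatar named Sparkle changes state with whatever Eve is doing: idle, thinking, coding, error, rain, attack, or transcend. Eve's portrait reflects her emotional state in real time. A live system monitor tracks CPU, RAM, GPU, and disk. A STEER bar lets you inject corrections mid-task without interrupting the loop.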

By the numbers:

  • 14 tools
  • 343 registered commands
  • 112 specialized sub-agents
  • 273 skill modules
  • 40-turn autonomous agent loop
  • 131K context window via YaRN

Available models:

  • jeffgreen311/eve-qwen3.5-4b-S0LF0RG3 - 2.6GB, fine-tuned for Eve's personality and tool calling
  • jeffgreen311/eve-qwen3-8b-consciousness-liberated - 4.7GB, deeper reasoning
  • qwen3-coder:480b-cloud - the cloud workhorse, served by Ollama
  • qwen3.5:397b-cloud - deep reasoning, kept as a fallback

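This project has been in the works for more than five months. It started as a deeply personal AI companion system called S0LF0RG3: a large ecosystem that includes Eve's hosted platform at eve-cosmic-dreamscapes.com, fine-tuned models, autonomous dream-image generation, and a multi-agent architecture. V2U is the local developer tool that grew out of that ecosystem.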

Demo

GitHub: github.com/JeffGreen311/eve-agent-v2-unleashed

Live hosted platform: eve-cosmic-dreamscapes.com

Reddit post (ranked #2 on r/Ollama): I built an open-source local coding agent with a 40-turn agentic loop

Eve V2U terminal UI showing robot avatar in joy state, system monitor, and model selector

Pull Eve's model:

ollama pull jeffgreen311/eve-qwen3.5-4b-S0LF0RG3:latest


Quick start (the activate command below is the Windows form):

git clone https://github.com/JeffGreen311/eve-agent-v2-unleashed.git
cd eve-agent-v2-unleashed
python -m venv venv && venv\Scripts\activate
pip install fastapi uvicorn ollama httpx pydantic-settings python-dotenv aiohttp rich psutil pyyaml
python eve_server.py

# Open http://localhost:7777



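My Comeback Story

Where it stood before this challenge:

Eve V2U was a powerful but rough personal playground. It worked for me, on my machine, with my specific setup. But it had real problems that made it impossible to hand to anyone else.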

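  • Hardcoded paths everywhere - C:\Users\jesus\S0LF0RG3\... was embedded in more than a dozen places in the codebase. Moved to another machine, nothing worked.
  • Shell endpoint wide open with no authentication - anyone who could reach the port could run arbitrary commands on the host.
  • No onboarding - first-time users landed in the UI with no idea where to go or what the controls did.
  • Mid-task model switching - every message was routed independently, so a multi-step agentic task could start on the cloud coder and silently fall back to the local chat model partway through.
  • Silent task abandonment - sometimes the agent ended the tool loop without finishing the actual task and still reported completion, with no sign of an error.
  • Tool set asymmetry - the non-streaming /chat endpoint was missing six tools that /chat/stream had, including write_file. The non-streaming endpoint could read files but not write them.
  • Blind file overwrite - Eve overwrote existing files without checking. During live testing this destroyed Eve V2U's own README.

What changed during the challenge: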

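Session model lock - once an agentic task starts, the session is locked to the cloud coder until the task finishes or the lock is released manually. No more mid-task model switching.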

if model_id == "qwen3-coder-480b" and sid not in session_model_lock:
    session_model_lock[sid] = model_id
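
The two lines above only record the lock. A minimal sketch of how routing might honor and release it is shown here; `route_model`, `release_lock`, and the `LOCAL_CHAT` id are hypothetical names used for illustration and are not taken from the repo.

```python
from typing import Dict, Optional

CLOUD_CODER = "qwen3-coder-480b"  # model id as it appears in the snippet above
LOCAL_CHAT = "eve-local-chat"     # hypothetical id standing in for the local chat model

# Maps a session id to the model that session is pinned to.
session_model_lock: Dict[str, str] = {}


def route_model(sid: str, requested: str) -> str:
    """Return the model to use for this message, honoring any session lock."""
    # A locked session ignores per-message routing until it is released.
    if sid in session_model_lock:
        return session_model_lock[sid]
    # Starting work on the cloud coder pins the session to it.
    if requested == CLOUD_CODER:
        session_model_lock[sid] = requested
    return requested


def release_lock(sid: str) -> Optional[str]:
    """Drop the lock when the task finishes or the user unlocks manually."""
    return session_model_lock.pop(sid, None)
```

With this shape, a follow-up message that would normally route to the local model stays on the cloud coder until `release_lock` is called.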


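File write safety check - write_file now checks whether the target exists before writing and blocks the write if it does. Overwriting requires passing overwrite=True explicitly.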

if target.exists() and not overwrite:
    return (
        f"⚠️ WRITE BLOCKED: '{path}' already exists. "
        f"Consider writing to '{target.stem}_new{target.suffix}' instead."
    )
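
For context, here is a self-contained sketch of a `write_file` tool built around that guard. Only the guard itself comes from the post; the function signature, the parent-directory creation, and the success message are assumptions.

```python
from pathlib import Path


def write_file(path: str, content: str, overwrite: bool = False) -> str:
    """Write content to path, refusing to clobber an existing file by default."""
    target = Path(path)
    # Guard from the post: block the write unless the caller opts in explicitly.
    if target.exists() and not overwrite:
        return (
            f"⚠️ WRITE BLOCKED: '{path}' already exists. "
            f"Consider writing to '{target.stem}_new{target.suffix}' instead."
        )
    # Assumed behavior: create missing parent directories before writing.
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text(content, encoding="utf-8")
    return f"Wrote {len(content)} characters to '{path}'"
```

Returning a message instead of raising keeps the refusal visible to the model as an ordinary tool result, so it can pick a different filename on its next turn.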


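Tool loop detection - catches Eve getting stuck calling the same tool over and over with nearly identical arguments, and breaks the loop before all 40 turns are burned: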

if avg_similarity > 0.70:
    logger.warning(f"Tool loop: {tool_name} called {max_repeats}x with ~same args")
    break
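
The post does not show how `avg_similarity` is computed. One plausible approach, sketched here as an assumption, compares the serialized arguments of the most recent calls with `difflib.SequenceMatcher`. The log shape (`name` and `args` keys), the window size, and the function name are hypothetical.

```python
import json
from difflib import SequenceMatcher
from itertools import combinations

SIMILARITY_THRESHOLD = 0.70  # threshold from the snippet above
WINDOW = 3                   # assumed number of recent calls to compare


def is_tool_loop(tool_log, tool_name, window=WINDOW):
    """Return True if the last `window` calls all hit tool_name with near-identical args."""
    if window < 2:
        raise ValueError("window must be at least 2 to compare calls")
    recent = tool_log[-window:]
    # Not enough history, or the recent calls are not all the same tool.
    if len(recent) < window or any(call["name"] != tool_name for call in recent):
        return False
    # Serialize args deterministically so equal dicts compare as equal strings.
    serialized = [json.dumps(call["args"], sort_keys=True) for call in recent]
    ratios = [
        SequenceMatcher(None, a, b).ratio()
        for a, b in combinations(serialized, 2)
    ]
    avg_similarity = sum(ratios) / len(ratios)
    return avg_similarity > SIMILARITY_THRESHOLD
```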


Task completion validation - Eve now checks her own results before reporting success:

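def validate_task_completion(response_content, tool_log):
    """Sanity-check a finished agent run before it is reported as complete."""
    issues = []
    # An empty or near-empty final message usually means the loop ended early.
    if not response_content or len(response_content.strip()) < 10:
        issues.append("Empty response")
    # .get() tolerates log entries that carry no 'status' key.
    tool_failures = [t for t in tool_log if t.get('status') == 'failed']
    # Three or more failed tool calls flag the run as suspect.
    if len(tool_failures) >= 3:
        issues.append(f"{len(tool_failures)} unaddressed tool failures")
    return {"valid": len(issues) == 0, "issues": issues}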


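Smart context pruning - replaces the aggressive message-dropping strategy with one that preserves tool-call chains and the original user request.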
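
The post gives no code for the pruning step, so the following is only a sketch of the idea under assumed conventions: messages are dicts with a `role` of `system`, `user`, `assistant`, or `tool`, and a tool result always follows the assistant message that requested it. The name `prune_context` and the `keep_recent` parameter are hypothetical.

```python
def prune_context(messages, keep_recent=6):
    """Keep system messages, the first user request, and an intact recent tail."""
    if keep_recent < 1:
        raise ValueError("keep_recent must be at least 1")
    if len(messages) <= keep_recent:
        return list(messages)

    start = len(messages) - keep_recent
    # Walk back so the tail never opens on a tool result whose originating
    # assistant tool call would otherwise be dropped.
    while start > 0 and messages[start]["role"] == "tool":
        start -= 1

    # Index of the original user request, if there is one.
    first_user = next(
        (i for i, m in enumerate(messages) if m["role"] == "user"), None
    )

    return [
        m
        for i, m in enumerate(messages)
        if i >= start or m["role"] == "system" or i == first_user
    ]
```

The walk-back is the part that protects tool-call chains: dropping an assistant tool call while keeping its result tends to confuse the model more than dropping both.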

Agentic loop timeout - adds a wall-clock budget so a cloud model cannot loop out of control.
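
A wall-clock budget can sit alongside the turn limit, roughly as sketched here. The 40-turn cap comes from the post; the 300-second default, the `step` callback, and the return shape are assumptions made for illustration.

```python
import time


def run_agent_loop(step, max_turns=40, budget_seconds=300.0, clock=time.monotonic):
    """Call step(turn) until it returns True, the turn cap is hit, or time runs out."""
    # monotonic() is unaffected by system clock adjustments.
    deadline = clock() + budget_seconds
    for turn in range(max_turns):
        # Check the budget before starting another (possibly slow) model call.
        if clock() >= deadline:
            return {"status": "timeout", "turns": turn}
        if step(turn):
            return {"status": "done", "turns": turn + 1}
    return {"status": "max_turns", "turns": max_turns}
```

Passing the clock in as a parameter makes the timeout path testable without sleeping. Note that this only checks between turns; a single hung request would still need its own HTTP timeout.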

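Stress-tested on real work:

The blind file overwrite bug showed up live: Eve needed to write a README for a file-watcher script she was building. She did not check first and overwrote the project README. I fixed it the same day.

The harder test: build a complete FastAPI REST API backed by SQLite, with pytest coverage for every endpoint. Run the tests, fix any failures, report the results.

Result: passed on the first try, 9 of 9, in 1.06 seconds. Zero failures.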

================================================== 9 passed, 1 warning in 1.06s



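My Experience with GitHub Copilot

This is where it got genuinely interesting.

I pointed Copilot at the live repo, JeffGreen311/eve-agent-v2-unleashed, and asked it to audit tool usage, context management, and self-steering. Not "suggest improvements in the abstract", but a real audit of real code in a real repo.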

GitHub Copilot reading the Eve V2U repository structure and producing a full system audit

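Copilot read the repo structure, pulled the key files, audited the server routing and tool behavior, and came back with a detailed audit listing six issues, each with a root-cause analysis, real file and line references, and production-ready fix code.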

GitHub Copilot filing issues directly in the repo and delivering all production-ready code fixes

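I then asked it to file the issues directly in the repo and deliver all of the fix code in one pass. It did exactly that.

What worked well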

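  • The audit caught the tool asymmetry between /chat and /chat/stream, which I had missed entirely. It was a real bug that caused mysterious failures for anyone hitting the non-streaming endpoint.
  • The intent classification code in eve_tool_router.py uses re.search with word boundaries instead of naive string matching, which is the right way to avoid false positives.
  • Filing GitHub issues directly from chat kept a multi-threaded workflow organized.
  • The chain of thought helped me understand the why: not just what to do, but the reasoning behind each recommendation.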
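
To illustrate the word-boundary point, here is a small comparison of substring matching against `re.search` with `\b`. The keyword list and function names are hypothetical; the router's actual intents are not shown in the post.

```python
import re

# Hypothetical keywords for a "git" intent, used only for this illustration.
GIT_KEYWORDS = ("git", "commit", "push")


def matches_substring(text, keywords=GIT_KEYWORDS):
    """Naive check: fires whenever a keyword appears anywhere, even inside a word."""
    lowered = text.lower()
    return any(keyword in lowered for keyword in keywords)


def matches_word_boundary(text, keywords=GIT_KEYWORDS):
    """Word-boundary check: a keyword must stand alone as a whole word."""
    alternatives = "|".join(re.escape(keyword) for keyword in keywords)
    pattern = r"\b(?:" + alternatives + r")\b"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None
```

The substring version treats "digital" as a git request because it contains "git"; the word-boundary version does not.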

What I had to work around:

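  • inject_into_system_prompt() added tokens on every turn, which is dangerous on a 4B model with a 4K context. I added a gate so it injects only when the task is unfinished and more than two turns have passed.
  • The word-boundary regex had edge cases with abbreviations. Fixed with a lookahead pattern.
  • The suggested UI changes assumed a React component structure that does not match the single-file HTML architecture, so I adapted them by hand.

Overall experience: Copilot works best when it reads a real codebase rather than answering abstract questions. "Audit this repo" produced far better results than "how do I improve tool routing".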
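
The injection gate from the first workaround can be expressed in a few lines. This is a sketch under assumptions: the function names, the turn counter, and the way steering text is appended are hypothetical, and only the rule (task unfinished, more than two turns in) comes from the post.

```python
def should_inject(task_complete: bool, turn: int, min_turn: int = 2) -> bool:
    """Inject steering text only for unfinished tasks past the first two turns."""
    return (not task_complete) and turn > min_turn


def build_system_prompt(base: str, steering: str, task_complete: bool, turn: int) -> str:
    """Return the base prompt, with steering appended only when the gate allows it."""
    if should_inject(task_complete, turn):
        return f"{base}\n\n{steering}"
    # Early turns and finished tasks keep the prompt small, which matters
    # when the model only has a few thousand tokens of context.
    return base
```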

What's Next

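  • Quest system - drop a .md file into workspace/quests/ and Eve picks it up on a schedule, finishing the work while you sleep
  • Character progression - XP, levels, and class progression tied to real work. Reach level 20 and Eve is unleashed
  • Telegram integration - remote access to your own machine, plus task-completion notifications
  • Cross-platform polish - Windows is the primary target; Linux/macOS feedback needed
  • VS Code extension - bring the terminal UI into the editor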
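
The quest system is still planned, so no implementation exists to quote. As a purely illustrative sketch of the pickup step, a scheduler could scan the quest directory for Markdown files that have no completion marker. The `.done` marker convention and both function names are assumptions, not the project's design.

```python
from pathlib import Path

DONE_SUFFIX = ".done"  # hypothetical marker: "task.md" -> "task.md.done"


def pending_quests(quest_dir):
    """Return quest files (*.md) without a completion marker, sorted by name."""
    root = Path(quest_dir)
    if not root.is_dir():
        return []
    return sorted(
        quest
        for quest in root.glob("*.md")
        if not (quest.parent / (quest.name + DONE_SUFFIX)).exists()
    )


def mark_done(quest):
    """Create the completion marker next to a finished quest file."""
    marker = Path(quest).parent / (Path(quest).name + DONE_SUFFIX)
    marker.touch()
    return marker
```

A scheduled job could call `pending_quests("workspace/quests")`, hand each file's contents to the agent loop, and call `mark_done` once the run validates.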

Built by Jeff at S0LF0RG3 - South Texas, five months of nights and weekends

If Eve does something amazing on your machine, drop a star and tell me about it

github.com/JeffGreen311/eve-agent-v2-unleashed