惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

L
Lohrmann on Cybersecurity
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
Recorded Future
Recorded Future
S
Schneier on Security
I
Intezer
Latest news
Latest news
N
News and Events Feed by Topic
Scott Helme
Scott Helme
T
Threat Research - Cisco Blogs
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
U
Unit 42
量子位
博客园 - 【当耐特】
S
Security @ Cisco Blogs
Google Online Security Blog
Google Online Security Blog
博客园 - 叶小钗
酷 壳 – CoolShell
酷 壳 – CoolShell
NISL@THU
NISL@THU
The Cloudflare Blog
李成银的技术随笔
T
ThreatConnect
L
LINUX DO - 最新话题
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
有赞技术团队
有赞技术团队
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Jina AI
Jina AI
T
Tor Project blog
The Hacker News
The Hacker News
人人都是产品经理
人人都是产品经理
小众软件
小众软件
S
Security Archives - TechRepublic
美团技术团队
博客园 - Franky
Security Latest
Security Latest
J
Java Code Geeks
P
Proofpoint News Feed
V
V2EX
The GitHub Blog
The GitHub Blog
WordPress大学
WordPress大学
Application and Cybersecurity Blog
Application and Cybersecurity Blog
H
Help Net Security
PCI Perspectives
PCI Perspectives
Cyberwarzone
Cyberwarzone
Hugging Face - Blog
Hugging Face - Blog
N
Netflix TechBlog - Medium
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
SecWiki News
SecWiki News
腾讯CDC
爱范儿
爱范儿
D
Docker

DEV Community

I wrapped Claude Code in a zsh function. Here's every decision I almost got wrong. Mobile Game Optimization: A Unity Developer's Checklist Git: Best Practices for Beginners Three days I lost chasing a ghost that was already dead on disk Why Too Many Parts Hurt ClickHouse Performance Guardrails for Agent Output: Pluggable Validation Before and After LLM Calls From Half‑dead Prototype to Local‑Only AI Medical Assistant: Rewiring MedClinic with GitHub Copilot Runninig a forkbomb in Jenkins What’s Actually Happening When You Use Git Preventing Recursive Tool Loops in LangChain Agents Building a Rock-Paper-Scissors CLI with TypeScript — Union Types, Conditionals, and Jest Your AI Coding Agent Wastes 80% of Its Context. Fixed That with Graph Theory. Why Flutter Has Become the Go-To Framework for Fintech App Development We built a scripting language just for AI agents. Here's why. Stop building AI inboxes. Build decision layers instead. Meme Monday Why I Built @editora/ui-react? Are AI tools the next level of abstraction in software development? Identity on Solana: Your Wallet Is Your Account One API Call Changed Everything The Internet Career Nobody Talks About Enough: What Is DevRel? Solar Panel Wiring Diagram: Series vs Parallel Hello everyone! Glad to join the dev.to community I Built an AI Agent That Tailors My Resume - Here's How Agents Actually Work I Built a WhatsApp OTP + AI Chatbot Platform for African Businesses MTP Explained — And Why It Matters for Android on Mac Most Beginners Learn Full-Stack Development Backwards GitHub Glow-Up: Open Source, READMEs, Badges, Streaks, Git and gh CLI System Design Cheat Sheet: Concepts Every Developer Should Know Are Junior Developer Roles Actually Dying? A Fresher's Honest Take Using DigitalOcean Droplets as Ephemeral Sandboxes for AI Agents I built a VSCode extension that visualises your code navigation as a call tree — made for legacy codebase pain Vite predev/prebuild: chaining scripts without losing your mind A website to save you from messy browser tabs Dear Web2 Developer... Solana is here calling Postgres JSONB indexes: GIN vs BTREE on the same column The $5 AI That Remembers Everything What are your goals for the week? #180 Zettelkasten for Developers: A Practical Method That Works OpenClaw vs Hermes Agent: Stars, Downloads & Usage 2026 `act` vs. `waitFor` Global Teams Don’t Struggle With Time Zones. They Struggle With Context Python as a JavaScript Dev $5.4 Billion in Damage. 8.5 Million Machines Down. Three YAML Controls Would Have Prevented It. Here's the Structural Analysis. 🚫 Stop Using PN532 V1 for Your NFC Projects (Real Debugging Experience) Probabilistic Graph Neural Inference for smart agriculture microgrid orchestration for extreme data sparsity scenarios Inference Is Becoming the New Steady-State Cost Center Why AI-Generated Code Is Always Good Enough — And Never Great I built a dark admin dashboard template in HTML — no React, no npm, just pure HTML What is the Difference Between Lattice-Based and Hash-Based Signatures? Next.js App Router caching: revalidate, dynamic, and no-store without the folklore Next.js App Router caching: revalidate, dynamic y no-store sin folklore I built Stashly — a full-stack content manager with a rich text editor published: false tags: react, node, mongodb, typescript Why I Started Building React Projects Instead of Just Watching Tutorials ? Every Tool Eventually Becomes Tuesday Nobody Warns You That Real Software Engineering Feels Chaotic Tích hợp VNPay, Stripe trong Odoo 19 BeautifulSoup and Requests for Web Scraping With Python: When Simple Still Works I Was Stuck Debugging React — Then Developer Tools Changed It Buck Converter Ripple: Sizing the Inductor and Capacitor With Confidence AWS Just Made Its MCP Server Generally Available. Here's What It Actually Gives AI Agents. RAMPART Tests Your AI Agents in Dev. What Catches Malicious Tool Calls in Production? Vibe Team Software Engineering: What a Real AI Human Dev Team Workflow Actually Looks Like An npm Package for AI Agent Orchestration Just Shipped With Its Front Door Unlocked. Here's What the CVE Actually Reveals. Microsoft Foundry Just Added CI/CD for AI Agents. Here's What That Actually Changes. The Best Career Insurance Is a Tech Event You Don't Want to Attend Your GitHub Profile Already Tells Recruiters More Than Your Resume. Most Devs Just Don't Surface It. How to Add Execution Budgets to OpenAI Agents SDK Binary Tree Interview Problems: 6 Traversal Patterns, 15 Problems We trained a personal voice DoRA on Qwen3-8B for $1.50 — beat stock model 100% in blind A/B Stop Leaking API Keys: Why I Built a Local-First Vault for Developers 🔐 RAG Explained: How Retrieval-Augmented Generation Actually Works I Built a Fast Async JioSaavn API Wrapper in Python 🎧 chown & chgrp Deploying Your First App on Kubernetes: A Beginner's Guide (Minikube & Kind) Logs in code It's called a PR "review" for a reason DePIN GPU Market: The Failed Job Receipt Developers Should Demand Why Your AI Agent Monitoring is Wrong (And How to Fix It) Lock Down Your Cloud Shares: A Beginner’s Guide to Azure Files Security. Building a Multi-Channel Content Syndication Pipeline with EmDash Plugins Turn Your Phone Into Voice Input for Any React Text Field Which package is bloating your Docker image? Putting Claude Code Under Version Control: Configs Since July, Memory Since April What I Thought DevRel Was vs. What It Actually Is (A Mentee's Honest Take) What I Thought DevRel Was vs. What It Actually Is (A Mentee's Honest Take) 400 Million Tokens Burned Overnight Reviving My Linux Mastery Game from a Merge Conflict — A Finish-Up-A-Thon Comeback Don’t let AI break your collective thinking: a practical guide for engineering teams First Gemma 4 ExecuTorch Deployment on Raspberry Pi 5 — and Why It's 7.7 Slower Than llama.cpp Per-Turn Evaluation: Dynamic Governance for AI Agents The AI Triforce of seed4j: Power, Wisdom, and Courage for Your Dev Agent Your AI agent reports 80% task completion. It fabricated it. Pourquoi les overlays d'accessibilité ne tiennent pas leurs promesses (et ce que la FTC vient d'acter) AI May Break Product-Market Fit in Enterprise Software I’m Building Around the Gap Between AI Output and Repo Truth How to Build a Stripe Customer Portal in Next.js SaaS On-Demand Pricing Feels Safe - Until You See the Bill Building an Internal Developer Portal with Backstage A Production Deployment Guide After the Last Song
Gemma Forge: Local AI Without the Setup Wall
thiscrazylif · 2026-05-25 · via DEV Community

title: Gemma Forge: Local AI Without the Setup Wall
published: false
description: A local Gemma 4 work harness that turns local AI into a guided, inspectable project workspace.
tags: devchallenge, gemmachallenge, gemma, localai
cover_image: https://raw.githubusercontent.com/TheRefreshCNFT/gemma-forge/main/docs/submission-media/screenshots/current/01-forge-harness-overview.png

This is a submission for the Gemma 4 Challenge: Build with Gemma 4

Gemma Forge harness overview

What I Built

Gemma Forge is a local Gemma 4 work harness that makes local AI useful before the user has to understand the machinery.

The goal is simple: everyone should be able to use local AI. Gemma Forge opens that door by putting Gemma 4 behind a guided workbench instead of a setup wall.

On first launch, Gemma Forge scans the user's machine, checks Ollama and model readiness, installs or verifies the local toolchain, stages bundled skills, and opens a project-focused workspace. From there, the user can describe what they want done in plain language. Gemma Forge turns that request into protocol cards for context, planning, execution, code intelligence, verification, and handoff.

Most consumers do not need to manage elaborate memory systems. They need useful work completed. Most businesses do not need extra ceremony either. They need planning, execution, testing, evaluation, delivery, and a clean handoff. Gemma Forge lets the user give the local model a direction, add the skills the project needs, and let the harness keep the work scoped, observable, and verifiable.

Gemma Forge comes pre-fueled with bundled skills for planning, code writing, UI/UX, scraping, PDF work, MCP servers, codebase search, structural analysis, and handoff discipline. Need more fire? Drop in a skill. If the user does not know how to create one, Gemma Forge maintenance mode can help scaffold, stage, and verify a new skill through a controlled allowlist flow.

The project is built in the open-source spirit of Gemma 4: local, inspectable, extensible, and practical for people who want AI they can run and adapt on their own machine.

Demo

Demo media and screenshots are available in the repository:

https://github.com/TheRefreshCNFT/gemma-forge/tree/main/docs/submission-media

https://github.com/TheRefreshCNFT/gemma-forge/releases/download/contest-video-20260524-232825/gemma-forge-contest-demo-20260524-232825.mov

Here is the product flow I would show in the walkthrough:

  1. Launch Gemma Forge and show the workspace scan.
  2. Show Forge Engine readiness and the selected Forge Brain.
  3. Start a no-directory project from a plain-language goal.
  4. Run Full Forge with Human Verify off.
  5. Watch Project Context, GSD Planning, Project Execution, Verification, and Handoff move through the protocol cards.
  6. Open the generated artifact.
  7. Show Settings with model route proof and local error visibility.

The main harness screen keeps local readiness, model selection, and workspace status visible in one place.

Forge Harness readiness view

New work starts with a plain-language project seed, then moves through protocol cards instead of disappearing into a loose chat thread.

Project intake and protocol cards

Forge Station shows the live evidence stream: card starts, skill selection, staged skills, browser fetches, status codes, character counts, screenshot captures, and other work events.

Forge Station evidence stream

Gemma Forge also leaves receipts on disk. This workspace screenshot shows the output of a real session: project context, GSD planning, research, execution notes, extra reviews, verification, handoff files, screenshots from browsed sources, and the generated index.html artifact.

Workspace artifacts and generated project files

Settings keeps model management local and explicit. Users can import installed Ollama models, search Hugging Face, name an Ollama model, provision supported repos, and confirm which model the harness actually called last.

Settings model provisioning

Code

Repository:

https://github.com/TheRefreshCNFT/gemma-forge

Quick start on macOS:

git clone https://github.com/TheRefreshCNFT/gemma-forge.git
cd gemma-forge
./launch_forge.command

Enter fullscreen mode Exit fullscreen mode

Then open:

http://127.0.0.1:5005/

Enter fullscreen mode Exit fullscreen mode

The macOS launcher is the recommended first-run path. It installs or verifies the local toolchain, starts Ollama, pulls the default source model gemma4:e4b, creates the local Forge Brain alias gemma-4-e4b-it, stages bundled skills, prepares support tools, and starts the harness.

The first-run install default is fixed so setup is predictable. It does not lock users in. After setup, users can import installed Ollama models, search Hugging Face, provision other compatible repos, and choose a different active local model from Settings.

Manual/development launch:

python3 -m venv .venv
source .venv/bin/activate
pip install -e .
gemma-forge

Enter fullscreen mode Exit fullscreen mode

How I Used Gemma 4

Gemma 4 is the planning and orchestration brain inside Gemma Forge.

I chose the E4B / 4B-class lane as the default because Gemma Forge needs more reasoning headroom than a tiny model for project planning, skill routing, repair loops, and verification, while still staying realistic for local hardware. The one-command installer pulls gemma4:e4b and aliases it locally as gemma-4-e4b-it.

The harness presents the expected local footprint before provisioning: about 10 GB of disk budget and 8 GB RAM for readiness, with the current quantized Ollama artifact around 5 GB on disk.

Gemma Forge uses Gemma 4 to:

  • Convert a raw user request into a structured project contract.
  • Decide which protocol cards and bundled skills are relevant.
  • Write project-scoped context and GSD-style phase plans.
  • Generate or repair deliverables through the Project Execution card.
  • Evaluate outputs against acceptance criteria and deterministic validation.
  • Explain next steps when the user does not know which control to use.
  • Produce handoff notes so work can resume cleanly.

Gemma Forge also records model-route proof: Forge Brain selection to Flask harness to Ollama /api/chat. That proof matters because the authenticity rule is strict. A valid result means the selected local Gemma model actually did the work through the harness workflow. Deterministic scripts, screenshots, code intelligence, and validators can verify or package the result, but they do not replace Gemma 4 doing the task.

Gemma Forge…Where local agents want to work