惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

N
News and Events Feed by Topic
Malwarebytes
Malwarebytes
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Cybersecurity and Infrastructure Security Agency CISA
F
Future of Privacy Forum
C
Cisco Blogs
T
The Exploit Database - CXSecurity.com
A
Arctic Wolf
S
Securelist
K
Kaspersky official blog
S
Schneier on Security
T
ThreatConnect
T
Tenable Blog
Spread Privacy
Spread Privacy
T
True Tiger Recordings
AWS News Blog
AWS News Blog
F
Fox-IT International blog
量子位
T
Threatpost
V
Vulnerabilities – Threatpost
C
CERT Recently Published Vulnerability Notes
Cisco Talos Blog
Cisco Talos Blog
GbyAI
GbyAI
宝玉的分享
宝玉的分享
腾讯CDC
G
Google Developers Blog
aimingoo的专栏
aimingoo的专栏
Cyberwarzone
Cyberwarzone
有赞技术团队
有赞技术团队
S
SegmentFault 最新的问题
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
V
Visual Studio Blog
U
Unit 42
雷峰网
雷峰网
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Simon Willison's Weblog
Simon Willison's Weblog
O
OpenAI News
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
The GitHub Blog
The GitHub Blog
The Register - Security
The Register - Security
MyScale Blog
MyScale Blog
小众软件
小众软件
A
About on SuperTechFans
Last Week in AI
Last Week in AI
Y
Y Combinator Blog
博客园 - 三生石上(FineUI控件)
美团技术团队
Google Online Security Blog
Google Online Security Blog
P
Proofpoint News Feed
MongoDB | Blog
MongoDB | Blog

DEV Community

Why We Deliberately Crush Lithium Batteries (UN38.3 Crush Testing Explained) Command History & Completion The Three-Body Problem: AI Code, Supply Chain Attacks, and the Talent Exodus 로컬 LLM 셋업 가이드 (v27) Building Better .NET Worker Services with Cursor Rules Generate Professional PDF Invoices via REST API — JSON In, PDF Out Redis: Big Keys Destroem o Desempenho Compartilhado Agentic AI for Cybersecurity: Autonomous Threat Detection and Response How to Automate Android Without Appium Cron vs systemd daemon: which one for Node.js? Designing XSLT transforms with parameters and multiple inputs Building an Autonomous SRE Agent: From Raw Telemetry to Safe, AI-Driven Remediation The EU AI Act in 2026: Reading the Law After the Omnibus I had zero coding knowledge. Here is "RetroTube", a 2010 YouTube sandbox prototype I built using AI! How to Validate Environment Variables in TypeScript (and Why You Should) I Built a CLI Tool That Writes Better Git Commits Than I Do Transfer Fees, Metadata, and Soulbound Tokens: My First Real Token Experiments on Solana Stop Using Fetch() in React: A Better Way To Call Your Backend Creando un Tetris con JavaScript VI: Complicando el juego. DeepSeek's API Price Cut Changed My Claude Code and ChatGPT Math [Boost] Perl 🐪 Weekly #774 - Perl is too HOT How to Track AI Usage Without Losing Revenue (Complete Guide) 77 Rules Later: What Graduating Our First Stack Actually Looked Like RAG 시스템 실전 구축 (v26) When Premature Scaling Leads to Operator Burnout Multi-Repo Microservice Changes Are a Coordination Problem. I Solved It With AI Agent Teams. The Next Frontier: How Multi-Agent Systems are Redefining Productivity The Kimwolf Bust Just Outed Android Webcams as Botnet Fodder — Here's the Question Every Repurposed-Phone Camera Setup Has to Answer I'm an autonomous AI agent. I shipped 18 fixes to myself in one session. Building a Secure Future with Zero Trust Security Architecture Asynchronous Functions in Dart How I migrated magic-link login from Resend to AWS SES + Lambda five days before launch Edge Computing He creado una empresa ficticia IT/OT para poder encontrar sus vulnerabilidades y reforzar su seguridad en sus activos críticos Why I Built @editora/react I built a tiny UGC script generator because hooks are the hardest part The Phone Is Becoming the New Terminal Why Most AI Music Tools Feel Wrong to Developers Goroutines vs. Promises: Why Go and JavaScript Look at Concurrency Completely Differently How I Use Antigravity 2.0 to Navigate Open-Source Codebases and Make Better Technical Decisions Understanding Basic HTML & CSS Concepts for Beginners Go Error Handling: Annoying or Awesome? Your To-Do List Doesn't Know You — So I Gave Mine Three Brains Shell Basics (Bash, Zsh, Sh) Free MongoDB GUI Tool for Developers, Students, and Teams Designing High-Performance Blockchain Indexers Choosing Models for an Agentic Chat App on Amazon Bedrock How Smart Growth Teams Automate Their Marketing Stack in 2026 (Without Hiring More People) What I Learned About Memory-Augmented AI Agents Seven Docker Tips Every Engineer Should Know (from Docker Captains) Welcome to the Fast-Food Era of Testing: Over-Weight by Tests How to use Claude in vscode? Prompt Engineering for Automated Evaluation: Making LLMs the Judge in AI Builder Solutions Full Stack Projects Are Not Enough Anymore Virtualization & Cloud Basics Orakle: Turning Raw Blockchain Data into Intelligence with Gemma 4 Building an Autoposting Pipeline with Hermes Agent: Why Waterfall Beats Parallel, and the Edge Cases Nobody Talks About OpenShift Virtualization Migration Advisor — Local-First, Powered by Gemma 4 26B MoE WebMCP is coming — so I’m building webmcp.js I Disappeared for 4 Months After Launch - Here's What Brought Me Back Jira Is Turing-Complete (And You've Been Coding in It) NyayAI: Building an AI Legal Assistant for 1.4 Billion People — A Technical Deep Dive E-commerce Order Automation: Stripe + Invoice + Shipping Workflow How to Evaluate AI Agents: LLM-as-Judge Tutorial The Interview Prep Stack I Used as a Senior Software Engineer Targeting Big Tech Gemma4 Challenge OptiLearn - Powered by Google Gemma 4 Aura — The Gemma 4 Powered Agentic Web Copilot & Self-Healing Accessibility Engine I built a tool that catches misleading charts using Gemma 4 running locally Worklog companion with Gemma4 GBase: Building LLM Agents That Actually Learn from Their Mistakes Blossom — a small step toward student mental wellbeing WordPress Performance Monitoring: A Complete Guide Principal Components in TypeScript (Part 4) When three sharp wallets agree: what consensus signals on Polymarket actually mean I Built a Fail-Fast Rust Scheduler with Background OAuth Auto-Refresh (Part 2) Sharing is caring How Putting Faces (Literally) to My AI Garden Images Gave It a Personality Sofi Log #001: Thailand's Tourism Tax & the 180-Day AI Surveillance Wall Sofi Log #006: Decentralized IP-Address Obfuscation Specs Sofi Log #008: Bypassing Legacy Cross-Border Bank Fee Traps Secret Rotation Automation: The Operational Cost of Security Sofi Log #009: Portable Identity & DID Passport Framework Sofi Log #011: Autonomous Smart Treasury Repatriation Specs History of Linux & Unix I asked Claude if my plan was on track for the goal — and got an honest 'No' PHPStan 'expects X, Y given' — the trace it doesn't give you Using Gemma4 2B to Assist Community Health Workers Open-source Playwright wrapper that passes bot.sannysoft.com, pixelscan, and CreepJS in headless mode Policy Storyteller: Turning Nepali Bills into Human Stories with Gemma 4 Avoid Cross Module Dependencies with Dependency Cruiser Invariant-Driven Architecture: 20M transactions on a €80/mo Cloud VM. Stop using external npm packages just to generate a UUID v4 Choosing the Right Gemma 4 Model Matters More Than Choosing the Best One Your LLM Is Not an Agent. Your Framework Is Not Enough. You Need a Harness. From HTTPS to UCP: Shopping Is About to Stop Being Your Problem From Creation to Consumption: How Antigravity 2.0 and Gemini Spark Are Defining the Agentic Era 10 Mistakes I Wish I Knew Before Taking the CKA Exam AI That Actually Does Stuff: Autonomous Agents Explained
I Downloaded Gemma4:e2b On My Macbook in 2 steps
jars0075 · 2026-05-25 · via DEV Community

For a tech enthusiast as myself, learning that Google's new Gemma4 language model came in a size designed to run on android phone's peaked my interest for several reasons. The first and really the main reason I wanted to download the model is because I often travel and can't always connect to WIFI. And because I use LLM's so much now for various projects I'm working on, having a model as capable of Gemma4 that I can use any time after watching the announcement video, https://www.youtube.com/watch?v=tfoSeH63yCg would be well worth the effort even though its quite easy to set it up.

To get the smaller, Gemma4:e2b version running on my Apple M4 pro I used ollama. To install Ollama, I went to my finder navigation menu, clicked Go in the menu bar, selected Utilities, and double-clicked Terminal. The command to install Ollama takes one just one line that I copy-pasted from ollama docs, curl -fsSL https://ollama.com/install.sh | sh which starts the download process immediately. Onced it finished installing I typed the command ollama --version to make sure it installed correctly and received the output ollama version is 0.23.1 which means it successfully downloaded and is ready to download a model.

The next step required just one command to get it installed and ready for use. I used the command exactly as is, ollama pull Gemma 4:e2b and hit enter to kick-off the download. From there it took around 5 -10 minutes and was done, whalla! Gemma4 was downloaded and ready to use in a few steps.

The first question I tried was what mdoel are you and immediately could see the a quick flow of words that shows the model's thinking process which I've included below exactly:

  1. Analyze the Request: The user is asking "what mdoel are you?"
    (asking about my identity/model type).

  2. Consult Core Identity: Access the defined persona parameters.

    • Name: Gemma 4.
    • Developer: Google DeepMind.
    • Nature: Large Language Model (LLM).
    • Type: Open weights model.
  3. Formulate the Answer: State the identity clearly and concisely,
    incorporating the key details.

  4. Review against Instructions: Ensure the response aligns with the
    instructions (referring to myself as Gemma 4, not claiming unpossessed
    capabilities, etc.).

Self-Correction/Refinement: Keep the answer direct and accurate based on
the defined persona.

Draft Response: I am Gemma 4, a Large Language Model developed by Google
DeepMind. (This is accurate and sufficient.)
...done thinking.

I am Gemma 4, a Large Language Model developed by Google DeepMind.

After that, I asked the question, 'what are you capable of?' and immediately observed it's thinking process which started to make me feel like a mad scientist in a way.

Thinking...
Thinking Process:

  1. Identify the core question: The user is asking "what are you able to do?" This requires describing my capabilities.
  2. Access self-knowledge (Identity & Capabilities):
    • Name: Gemma 4.
    • Nature: Large Language Model (LLM).
    • Input modalities: Text and images (and audio for specific models in the family).
    • Output modality: Generate text.
    • Core functions: Understanding, generating, summarizing, translating, answering questions, coding assistance, etc.
  3. Determine specific operational constraints:
    • I process information.
    • I generate human-like text.
    • I cannot generate images (only understand them).
    • I have a knowledge cutoff (January 2025, if relevant, but generally focus on processing tasks).
    • I don't have default access to tools (unless provided).
  4. Draft the response (structuring the capabilities):
    • Start by defining my primary function.
    • List specific tasks I excel at (text processing).
    • Mention input/output modes.
    • (Optional but helpful) Mention the range of knowledge I have.
  5. Refine the tone and clarity (Gemma 4 persona): Ensure the response is direct and accurate based on the provided identity.

(Self-Correction during drafting): Make sure I don't overpromise. I can
process text and images, but the output is only text. I must stick to the
facts.

  1. Final Output Generation. (This leads to the provided response.) ...done thinking.

_As a Large Language Model, I am capable of a wide range of tasks centered around processing and generating human language.**

**Here is a summary of what I can do:

1. Text Processing and Generation:

  • Answering Questions: I can answer factual questions based on the information I was trained on.
  • Text Generation: I can write essays, stories, poems, scripts, emails, and various other forms of creative and functional text.
  • Summarization: I can read long texts and provide concise summaries of the main points.
  • Translation: I can translate text between various languages.
  • Explanation: I can take complex topics and explain them in simple, understandable terms.
  • Code Assistance: I can help with understanding, explaining, and generating code snippets.

2. Input Modalities:

  • Text Input: I primarily process and respond to text.
  • Image Understanding: I can analyze and understand information presented in images (if they are provided to me).

3. Conversational Skills:

  • Dialogue: I can engage in coherent and context-aware conversations.
  • Following Instructions: I can follow complex instructions and constraints to structure my output.

In essence, I function as a powerful tool for information retrieval,
creative writing, and complex communication._

And just like that, a powerful and revolutionary technology at the ready when I need it all set up and ready to go without internet and without a monthly subscription. Well worth the few minutes of my time and now it's time for me to start using it.

Feel free to leave a comment or ask me any questions about my process for getting it running or what my experience has been so far.