For a tech enthusiast as myself, learning that Google's new Gemma4 language model came in a size designed to run on android phone's peaked my interest for several reasons. The first and really the main reason I wanted to download the model is because I often travel and can't always connect to WIFI. And because I use LLM's so much now for various projects I'm working on, having a model as capable of Gemma4 that I can use any time after watching the announcement video, https://www.youtube.com/watch?v=tfoSeH63yCg would be well worth the effort even though its quite easy to set it up.
To get the smaller, Gemma4:e2b version running on my Apple M4 pro I used ollama. To install Ollama, I went to my finder navigation menu, clicked Go in the menu bar, selected Utilities, and double-clicked Terminal. The command to install Ollama takes one just one line that I copy-pasted from ollama docs, curl -fsSL https://ollama.com/install.sh | sh which starts the download process immediately. Onced it finished installing I typed the command ollama --version to make sure it installed correctly and received the output ollama version is 0.23.1 which means it successfully downloaded and is ready to download a model.
The next step required just one command to get it installed and ready for use. I used the command exactly as is, ollama pull Gemma 4:e2b and hit enter to kick-off the download. From there it took around 5 -10 minutes and was done, whalla! Gemma4 was downloaded and ready to use in a few steps.
The first question I tried was what mdoel are you and immediately could see the a quick flow of words that shows the model's thinking process which I've included below exactly:
Analyze the Request: The user is asking "what mdoel are you?"
(asking about my identity/model type).-
Consult Core Identity: Access the defined persona parameters.
- Name: Gemma 4.
- Developer: Google DeepMind.
- Nature: Large Language Model (LLM).
- Type: Open weights model.
Formulate the Answer: State the identity clearly and concisely,
incorporating the key details.Review against Instructions: Ensure the response aligns with the
instructions (referring to myself as Gemma 4, not claiming unpossessed
capabilities, etc.).
Self-Correction/Refinement: Keep the answer direct and accurate based on
the defined persona.
Draft Response: I am Gemma 4, a Large Language Model developed by Google
DeepMind. (This is accurate and sufficient.)
...done thinking.
I am Gemma 4, a Large Language Model developed by Google DeepMind.
After that, I asked the question, 'what are you capable of?' and immediately observed it's thinking process which started to make me feel like a mad scientist in a way.
Thinking...
Thinking Process:
- Identify the core question: The user is asking "what are you able to do?" This requires describing my capabilities.
- Access self-knowledge (Identity & Capabilities):
- Name: Gemma 4.
- Nature: Large Language Model (LLM).
- Input modalities: Text and images (and audio for specific models in the family).
- Output modality: Generate text.
- Core functions: Understanding, generating, summarizing, translating, answering questions, coding assistance, etc.
- Determine specific operational constraints:
- I process information.
- I generate human-like text.
- I cannot generate images (only understand them).
- I have a knowledge cutoff (January 2025, if relevant, but generally focus on processing tasks).
- I don't have default access to tools (unless provided).
- Draft the response (structuring the capabilities):
- Start by defining my primary function.
- List specific tasks I excel at (text processing).
- Mention input/output modes.
- (Optional but helpful) Mention the range of knowledge I have.
- Refine the tone and clarity (Gemma 4 persona): Ensure the response is direct and accurate based on the provided identity.
(Self-Correction during drafting): Make sure I don't overpromise. I can
process text and images, but the output is only text. I must stick to the
facts.
- Final Output Generation. (This leads to the provided response.) ...done thinking.
_As a Large Language Model, I am capable of a wide range of tasks centered around processing and generating human language.**
**Here is a summary of what I can do:
1. Text Processing and Generation:
- Answering Questions: I can answer factual questions based on the information I was trained on.
- Text Generation: I can write essays, stories, poems, scripts, emails, and various other forms of creative and functional text.
- Summarization: I can read long texts and provide concise summaries of the main points.
- Translation: I can translate text between various languages.
- Explanation: I can take complex topics and explain them in simple, understandable terms.
- Code Assistance: I can help with understanding, explaining, and generating code snippets.
2. Input Modalities:
- Text Input: I primarily process and respond to text.
- Image Understanding: I can analyze and understand information presented in images (if they are provided to me).
3. Conversational Skills:
- Dialogue: I can engage in coherent and context-aware conversations.
- Following Instructions: I can follow complex instructions and constraints to structure my output.
In essence, I function as a powerful tool for information retrieval,
creative writing, and complex communication._
And just like that, a powerful and revolutionary technology at the ready when I need it all set up and ready to go without internet and without a monthly subscription. Well worth the few minutes of my time and now it's time for me to start using it.
Feel free to leave a comment or ask me any questions about my process for getting it running or what my experience has been so far.




















