๐ What I Built
I built Hermes Agent Assistant, a lightweight agentic AI system designed to demonstrate how modern AI agents can be structured using a modular architecture instead of a simple, single-prompt response model.
The system takes an abstract user task, breaks it down into structured steps using a dedicated planner, executes those steps sequentially via an execution engine, utilizes targeted tools, and stores the interaction context in a persistent memory system.
โ๏ธ Why I Built This
Most AI applications today are simple wrappers around LLMs that rely on a single input-output loop. I wanted to understand and demonstrate how production-grade, autonomous agent systems operate internally. Specifically, I wanted to explore how:
- Planning can be decoupled from execution to allow for complex error handling and multi-step reasoning.
- Tools can be dynamically integrated into an agent's reasoning loop.
- State and memory can persist across tasks to enable true contextual continuity.
Hermes Agent is my architecture simulation built to solve this problem in a highly accessible, lightweight, and scalable format.
๐ง System Architecture & Workflow
The codebase is split cleanly into four autonomous components that mirror real-world AI agent meshes:
User Request (e.g., /run?task=...)
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ PLANNER โ โ Slices abstract goals into
โโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโ structured, sequential steps.
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ EXECUTOR โ โ Orchestrates task completion
โโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโ by processing each step.
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ TOOLS LAYER โ โ Provides functional utilities
โโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโ (simulated web search, logic, maths).
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ MEMORY SYSTEM โ โ Persists execution logs statefully
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโ into local JSON storage.
๐ก Production Showcases & Links
- ๐ Live Production Demo: hermes-agent-tanush.onrender.com
- ๐ป Open Source Repository: https://github.com/tanush326k/hermes-agent-assistant.git
-
๐ Cloud Deployment Service ID:
srv-d88revegvqtc73bdj380(Render Infrastructure Node)
๐ก What Makes It Different
Unlike traditional, rigid APIs or simple conversational chatbots, Hermes Agent:
- Thinks in Workflows: It establishes an internal chain-of-thought lifecycle before executing anything.
- Separates Reasoning from Action: Slicing the Planner from the Executor prevents cascading generation failures.
- Is Highly Extensible: New tools and custom utility logic can be dropped into the system without breaking core routing.
- Maintains Context Persistence: The custom memory module ensures state history is preserved between network calls.
๐๏ธ API Interaction Example
Request
POST /run?task=search AI agents HTTP/1.1
Host: hermes-agent-tanush.onrender.com
Response
{
"task": "search AI agents",
"plan": [
"analyze request parameters",
"query tool registry for search utilities",
"summarize agent data structural output"
],
"result": "final structured output successfully generated and written to persistent storage."
}
๐งฐ Tech Stack
- Core Language: Python 3.10+
- Web Framework: FastAPI (Asynchronous Server Gateway Interface)
- Production Server: Uvicorn
- Memory Layer: Volatile-to-Persistent JSON state manager
- Architecture Pattern: Modular Agentic Workflow Design
๐ฎ Future Improvements & Roadmap
- ๐ค Real Foundation LLM Integration: Swapping out simulated logic for live OpenAI, Anthropic, or local open-source Ollama completion hooks.
- ๐๏ธ Vector Database Memory Upgrade: Transitioning flatfile storage over to a proper semantic vector indexing framework (FAISS / ChromaDB) for semantic chunk lookups.
- ๐ค Multi-Agent Orchestration: Upgrading the workflow to host distinct
Planner,Executor, andCriticagents working collaboratively with separate system prompts. - โก Live Server-Sent Events (SSE): Integrating real-time execution streaming so client frontends can observe the agent's thought process step-by-step.
























