
Nous Research · Self-Hosted Autonomous Agent with Persistent Memory
An open-source autonomous agent from Nous Research that runs on your own server, accumulates memory across every session, and creates reusable skills from experience. Built for developers and technical teams who want a persistent, private, model-agnostic AI agent they fully control.
Best for
Developers and technical power users who want a persistent, self-hosted AI agent with compounding memory and full control over their infrastructure and model choice
Not ideal for
Non-technical users or anyone who wants a ready-to-use cloud AI assistant without server setup and API key management
Who it's for
Developers and technical teams who want a self-hosted autonomous agent with persistent memory and no vendor lock-in
Hermes Agent's real value is persistence — not memory that can be toggled on, but an agent that accumulates knowledge across every session and stores it as readable markdown files on your own machine. The compounding effect is genuine: the longer you run it, the more contextual its behavior becomes because it builds up your workflows, preferences, and knowledge structures over time. The cost is equally real: you manage your own server, your own API keys, and all the operational overhead of a self-hosted system. Users who want a cloud chatbot with a memory toggle will find this overkill. Users who want their AI agent to learn their business deeply over months, run scheduled automations, and keep all data on their own infrastructure will find this is closer to what they actually needed.
Who should use it
Developers and technical teams running long-horizon projects who want an AI agent that improves with use and keeps all data and memory on their own infrastructure.
Who should skip it
Non-technical users, or anyone who wants a working AI assistant without managing server infrastructure, API keys, and ongoing maintenance overhead.
Hermes Agent
Free
With local models (Ollama)
$0 API cost
With cloud LLMs
Pay-per-token (varies)
Note: The agent itself is always free. Your only cost is the LLM inference you choose to connect. Local models via Ollama have no API cost. Cloud API pricing is set by your chosen provider and billed directly to you.
Modalities
Text, Image (vision), Image generation, Voice (TTS), Web browsing, Browser automation
Available models
A personal knowledge base that grows with use
Memory accumulates as readable markdown files on your machine — Hermes builds a persistent understanding of your projects, preferences, and working patterns that carries into every session.
Recurring automations defined in plain language
The built-in cron scheduler lets you set up daily reports, reminders, or data pulls without writing scripts. You describe the task in plain language; Hermes runs it on schedule.
Multi-platform AI presence from one server
One Hermes installation operates across Telegram, Discord, Slack, WhatsApp, and a CLI simultaneously — useful for small teams spread across different communication tools.
Privacy-first AI workflows
All conversation history, memory, and skills stay on your own server. The only external calls are to the LLM API you choose — and with Ollama, even those stay local.
Hermes Agent vs. ChatGPT (with memory)
ChatGPT's memory stores summaries in OpenAI's cloud and works within platform limits. Hermes Agent stores complete, layered memory as local files on your own machine with no platform-side retention. ChatGPT is more accessible; Hermes gives data ownership and true cross-session persistence.
Hermes Agent vs. Claude
Claude has strong instruction following and up to 1M-token context windows but resets between sessions. Hermes can wrap Claude (or any model) with persistent cross-session memory on your own infrastructure. Claude is better for individual high-quality tasks; Hermes is better for ongoing, evolving workflows that compound over time.
Hermes Agent vs. n8n / Make
n8n and Make are visual workflow automation platforms. Hermes is an autonomous agent with adaptive memory and skill-building. Hermes can trigger automations via cron, but if you need structured multi-step automation with clear logic flows and no server management, n8n or Make are more appropriate.
Is Hermes Agent free?
Yes. The agent itself is free and open source under the MIT license. You only pay for the LLM API you connect to — or use local models via Ollama at no API cost.
Can I use Hermes Agent on Windows?
Windows support exists but is described as early beta in the official documentation. Linux and macOS are the primary supported environments; WSL2 is the recommended path for Windows users.
What makes Hermes Agent different from ChatGPT with memory?
ChatGPT's memory is stored on OpenAI's servers and works within platform limits. Hermes Agent stores all memory as readable markdown files on your own machine — no cloud retention, no platform dependency. It also creates and refines its own skills from experience, which ChatGPT does not do.
Does Hermes Agent require a specific LLM?
No. It is fully model-agnostic. Connect any LLM via OpenRouter, use direct provider APIs from Anthropic, OpenAI, Google, and others, or run local models via Ollama or vLLM at no API cost.
How technical is the setup?
Setup requires a curl install and configuring your preferred LLM provider API keys. A basic VPS or local Linux or macOS machine is sufficient. Comfort with the terminal and server management is expected — this is not a zero-friction product for non-technical users.
Anthropic
Developers and writers who need the most accurate code generation, precise instruction following, and the ability to reason across very large documents
FreemiumOpenAI
Knowledge workers who need one AI subscription covering writing, research, coding, image/video generation, and team collaboration
FreemiumHermes Agent runs persistently on infrastructure you own — a laptop, VPS, or GPU cluster — and stores layered memory as readable markdown files on your machine. Unlike session-based AI assistants that reset after each conversation, Hermes accumulates knowledge, creates new skills from its own experience, and deepens its model of your work over time. It is model-agnostic: connect any LLM via OpenRouter, local models via Ollama, or direct provider APIs from Anthropic, OpenAI, Google, and others. It operates across 20+ messaging platforms including Telegram, Discord, Slack, WhatsApp, and a CLI. Best suited for developers who want full infrastructure ownership, data privacy, and long-running autonomous workflows — not a suitable choice if you want a zero-setup cloud assistant.
Are you the founder? Claim this listing →