Stop babysitting AI assistants. Start running a self-evolving agent.

What you build

Chat assistants don't accumulate

You spent forty minutes teaching your assistant the codebase — the conventions, the constraints, the reasons behind them. Then the session ended, and tomorrow you teach it all again from zero. No persistent memory, no skill library, no improvement loop. Every session starts at the same baseline, and you are the only component in the system that learns. You're not running an agent; you're doing the agent's job for it, one paste at a time.

The leverage you forfeit by staying assistant-driven

Six months from now you're still re-explaining your stack to a stateless chat window while your backlog compounds. Meanwhile the engineers who built their own agents early are compounding: their agent remembers every architectural decision it has seen, wrote itself a skill for the deploy pipeline last sprint and reused it this one, and delegates to specialist sub-agents overnight. Same models you have access to — different architecture around them. Memory, skills, and self-evolution compound like interest.

The six stages of Hermes mastery (H6)

Hermes is a self-evolving AI agent: an architecture you build once that then improves itself. H6 is the six-stage path from first principles to a multi-agent workforce, and each stage ships a working layer of your own instance. H1 Architecture stands up the agent loop, tool layer, and runtime. H2 Soul defines the fixed identity that frames every decision. H3 Memory gives the agent a three-tier stack so context survives every session. H4 Skills grows a versioned library the agent writes, tests, and reuses. H5 Evolution wires the reflection loop and the GEPA optimizer. H6 Workforce scales from one agent to a managed team of delegated specialists.

Built for engineers graduating from assistants to agents

Junior-to-mid engineers learn agent architecture, memory systems, and tooling fundamentals most engineers won't touch for years. Mid-to-senior engineers who have hit the assistant ceiling get compounding leverage and the systems-design depth that separates senior engineers from power prompters. Senior-to-staff engineers get the architecture — how memory, self-evolution, and multi-agent orchestration compose — and a workforce design they can run for a whole team and defend in a design review.

A self-evolving agent you own — running by Monday

Thirty video lessons across six modules, six hands-on build sessions that ship your own Hermes instance (not toy code), and eighteen brand-locked reference infographics. The kit includes four lead-magnet PDFs — the H6 cheatsheet, architecture quickstart, memory audit, and skills-evolution playbook — plus the Hermes starter scaffold, self-evolution loop templates, skill-library conventions, and multi-agent delegation and approval-gate configs. Lifetime access; every future update included.

Hermes crossed 90,000 GitHub stars within two months of release — one of the fastest-growing open-source agent frameworks on record.
Hermes ships a 687-skill Hub across 18 categories, so an agent's capability library starts large and grows through both community taps and skills it authors itself.

Frequently asked questions

What exactly is Hermes?

Hermes is an open-source, self-evolving AI agent framework built around a single AIAgent class that unifies execution, routing, and learning. Unlike a chatbot, it keeps persistent memory across sessions, writes and reuses its own skills, and improves itself between tasks — all running on your own hardware.

How is this different from an AI assistant like Copilot or Cursor?

Assistants are stateless and session-bound. Hermes accumulates: it remembers context across sessions in a three-tier memory stack, grows a versioned skill library, and runs a self-evolution loop. Assistants restart at the same baseline every time; Hermes compounds.

What will I actually build by the end?

A working Hermes instance on your own infrastructure: a running ReAct agent loop (H1), a defined SOUL.md identity (H2), a three-tier memory stack (H3), a self-evolving skill library (H4), an evolution loop with GEPA and safety rails (H5), and a multi-agent workforce with delegation (H6) — produced across six hands-on build sessions.

What are the H6 stages?

H1 Architecture, H2 Soul, H3 Memory, H4 Skills, H5 Evolution, and H6 Workforce. Each stage ships a working layer of your agent and ends with a hands-on build session.

What is SOUL.md and why does it matter?

SOUL.md is a static, hand-authored identity file loaded in slot #1 of the system prompt, before memory or skills. It defines persona, tone, communication style, and hard limits, and acts as the fixed frame through which all of the agent's memory and skill evolution is filtered.

How does Hermes remember things across sessions?

Through a three-tier memory system: Tier 1 is two capped Markdown files (MEMORY.md and USER.md) frozen into the prompt; Tier 2 is a SQLite database of every past conversation searchable on demand; Tier 3 is optional external memory providers that prefetch relevant context before each turn.

What are skills, and can the agent really write its own?

Skills are Markdown files with YAML frontmatter that encode procedures. The agent uses the skill_manage tool to author them autonomously after complex tasks or successful error recovery, then reuses them later. Progressive disclosure keeps even a 687-skill catalog token-efficient.

What is GEPA?

GEPA (Genetic-Pareto Prompt Evolution) is an offline optimization pipeline that improves skills using real execution traces rather than the agent's self-assessment. It runs an evolutionary search, scores candidates with an LLM-as-judge, enforces 100% test-pass and size gates, and ships the winner as a human-reviewed Pull Request — for about $2–$10 per run with no GPU.

How does Hermes keep self-improvement safe?

The agent evolves capabilities, never constraints. SOUL.md and hard limits stay fixed, the Curator archives rather than deletes (with tar.gz snapshots before every pass and one-command rollback), and GEPA changes require human approval via Pull Requests.

Can I run more than one agent?

Yes. Profiles let you clone fully isolated specialists — programmer, designer, researcher — each with its own config, memory, skills, and SOUL.md. You then wire a delegation protocol (planner, builders, reviewer) and manage them as a workforce, with the 90-turn budget shared across the team.

What do I need to run it?

Linux, macOS, or WSL2 with Python 3.11+. State lives under ~/.hermes/. Hermes also runs on Docker or a cloud GPU server, and works with Claude, GPT, Gemini, or local Ollama models via a two-line provider change.

Do I need an ML background?

No. This is systems engineering — APIs, storage, loops, and guardrails — not model training. Engineers from junior to staff complete the course with the same materials.

What's included and is access lifetime?

Thirty video lessons across six modules, six hands-on build-session recordings, eighteen reference infographics, and four lead-magnet PDFs (cheatsheet, architecture quickstart, memory audit, skills-evolution playbook). Access is lifetime, with every future update included.

I already use Cursor, Claude Code, or Copilot. Isn't that enough?

Those are assistants: stateless, session-bound, waiting on your next prompt. Hermes is an agent you own — it keeps memory across sessions, grows its own skill library, and improves itself between tasks. The course teaches you to build and run it; your assistants become tools it uses, including delegating coding execution to the Claude Code CLI.

Stop babysitting AI assistants. Start running a self-evolving agent.

See inside “Memory Systems That Compound: The Three-Tier Stack”

What you build

Chat assistants don't accumulate

The leverage you forfeit by staying assistant-driven

The six stages of Hermes mastery (H6)

Built for engineers graduating from assistants to agents

A self-evolving agent you own — running by Monday

The curriculum

Frequently asked questions

Stop babysitting AI assistants. Start running a self-evolving agent.