Hermes logo

Best cheap models for Hermes

Cheap

Hermes runs all day, so the model price is your real subscription fee. These are the best cheap models for Hermes right now: everything costs at most about $0.75 per million input tokens, ranked by how much agent capability survives the price cut.

The ranking
Updated June 2026
  • 1DeepSeek
    DeepSeek V4DeepSeekThe best cheap default: near-frontier quality, 1M context, $0.435 per million.
    Context1.049M
    Input$0.435/M
  • 2Gemini
    Gemini 3 FlashGoogle DeepMindThe fastest cheap pick for high-volume runs, $0.50 per million.
    Context1.049M
    Input$0.5/M
  • 3OpenAI
    GPT-5.4 MiniOpenAIThe cheapest GPT that still handles real agent work. Codex subscription or API.
    Context400K
    Input$0.75/M
  • 4MoonshotAI
    Kimi K2.5Moonshot AIOpen-weight agentic coder at $0.40 per million input tokens.
    Context262K
    Input$0.4/M
  • 5Minimax
    MiniMax-M3MiniMaxNear-frontier intelligence at $0.30 per million input tokens.
    Context205K
    Input$0.3/M
  • 6DeepSeek
    DeepSeek V4 FlashDeepSeekA 1M context window at $0.098 per million, for always-on background work.
    Context1.049M
    Input$0.098/M
  • 7Zhipu
    GLM 4.7 FlashZ.AIThe floor: $0.06 per million for sub-agents and routine steps.
    Context203K
    Input$0.06/M
What models users actually use for Hermes?

Source: OpenRouter (openrouter.ai/apps), as of 2026-06-10.

Intelligence vs. price

Each model's Artificial Analysis Intelligence Index score against its blended price per 1M tokens. Toward the top right is more intelligence per dollar.

Full interactive leaderboard on our Intelligence Index page.

Share:
Details:
  • Agent


    Hermes logoHermes
  • Models


    7
  • Filter


    Cheap
  • Updated


    June 2026
Ad
Website favicon