Hermes flexibility comes from supporting many LLM providers. Here is which keys you need, where to get them, and how to configure each.
You Need One (or More)
You need exactly one LLM provider configured. Choose based on your budget, privacy needs, and capability requirements.
Provider Options
1. OpenRouter (Recommended for Flexibility)
- 200+ models, single API key
- Pay-per-use
- Get key: openrouter.ai
hermes model
# Select OpenRouter
# Enter key when prompted
2. Nous Portal (Recommended for Beginners)
- OAuth login, zero config
- Subscription-based
- Run:
hermes modelthen select Nous Portal
3. Anthropic (Claude)
- Direct to Anthropic API
- Requires account at anthropic.com
- Pricing: $3-15/M tokens
hermes model
# Select Anthropic
# Enter key from dashboard
4. OpenAI (GPT-4)
- Direct to OpenAI
- More expensive
- Get key: platform.openai.com
5. Ollama (Local, Free)
- No API key needed
- Runs local models
- Install Ollama separately, configure in Hermes as custom endpoint
6. MiniMax ($10/mo flat)
- Fixed monthly, predictable cost
- Has dedicated Hermes setup page
- $10/month unlimited on M2.7
7. Kimi/Moonshot (Cheap, Popular)
- Very affordable
- Community favorite for cost-performance
- Get credits at platform.moonshot.cn
8. DeepSeek ($0.30/M)
- Among the cheapest
- 90% cache discount
- ~$2/month typical use
Setting Keys
hermes model
Select provider, enter key. Keys stored in config, encrypted.
Switching Providers
hermes model
Change anytime. No code changes needed.
Fallback Chain
v0.6.0+ supports multiple providers as fallback:
model:
provider: openrouter
fallback_providers:
- anthropic
- minimax
If primary fails, automatically tries next.
Cost Estimates by Provider
| Provider | Typical Monthly |
|---|---|
| Ollama | Hardware only |
| DeepSeek | ~$2 |
| Kimi | ~$3-5 |
| MiniMax | $10 flat |
| OpenRouter (mixed) | $5-25 |
| Anthropic | $3-15 |
| OpenAI | $5-20 |
Model agnostic BYOK — 200+ providers
FAQ
Can I use multiple providers? Yes — configure fallback chain or switch manually.
Which provider is best? Kimi or MiniMax for cost. Claude or GPT-4 for capability. DeepSeek for budget.