AI agents

Not one agent. A team of them.

Helmies-agent runs operations + marketing. Hermes takes phone calls + voice. OpenClaw ships code from your repos. All three share the same memory and tool registry, RLS-scoped to your tenant. Bring your own via MCP.

helmies-agent · supervised

Sonnet 4.6 · 142 tokens · €0.011

supervised

W

Wael · just now

Plan next 2 weeks of social posts. Spring colours, no buzzwords.

helmies-agent

brand.read…

trends.scan…

content.draft…

social.queue…

The roster

One memory. Three specialists.

Each agent has its own personality and tool set, but they share long-term memory and your tenant. Tasks can hand-off between them — Helmies-agent for ops, Hermes for the call, OpenClaw for the code.

Helmies-agent

Operations · marketing · finance

The default. Reads every row of your business. Picks Claude / GPT / Gemini per task. Three autonomy modes.

customers.listinvoice.draftmeta_ads.pausecontent.draftemail.send

Hermes

Voice · phone calls · autonomous

Phone-first. Answers calls, books meetings, summarises conversations into the CRM. Runs scheduled prompts via cron.

call.answertranscribesummariseschedule.queuesms.send

OpenClaw

Code · repos · pull requests

Connect a GitHub repo. OpenClaw reads, plans, writes the code, runs your tests, opens a PR against your branch.

repo.readbranch.createcode.writetests.runpr.open

MCP

Bring-your-own agents

Any MCP server you trust. Tools auto-exposed to the registry. Tenant-scoped permissions.

mcp.notionmcp.figmamcp.linearmcp.githubmcp.your-server

Try it

Pick a scenario. Watch it execute.

A static demo of the agent surface. Inside Helmies every call is logged, every action is reversible, and every tool can be scoped per role.

Autonomy

Scenarios

helmies-agentmode: supervised connected to 18 tools

W

Wael

Pause Meta campaigns where CTR < 0.4% and reallocate budget to the top 2.

helmies-agent · thinking

01meta_ads.listok

02meta_ads.pauseok

03meta_ads.budget_setok

04alerts.createok

summary

Paused 3 campaigns burning €420/day at 0.21–0.38% CTR. Moved €240/day into Northern Lights Pro and €180/day into Spring Bundle (top performers, CTR 1.6%+). Set a guardrail to alert if combined ROAS dips below 2.1.

⌘↵

Multi-model router

Right model. Right job.

We benchmark prompt → quality → cost continuously and switch the agent to the best model per tool. You see the margin per model in your usage dashboard.

claude · 1M ctx$$

Opus 4.7

Hard reasoning, code, ambiguous prompts

claude$

Sonnet 4.6

Default for writing, summaries, sequences

claude¢

Haiku 4.5

Batch ops, classification, cheap+fast

openai$$

GPT-5.5

Long-context, RAG over big docs

google$

Gemini 2.5

Multimodal, image understanding

groq¢

Llama 3.3 70B

Ultra-fast cheap fallback

fal.ai¢

flux-dev

Image generation for social

replicate¢

SDXL

Image generation, ControlNet

Model router

Auto-pick by tool · token meter visible

anthropic

Opus 4.7 (1M)

Hard reasoning, code, long context

$$

28%

on

anthropic

Sonnet 4.6

Default writer + sequence drafts

$

55%

on

anthropic

Haiku 4.5

Batch classify, triage, low-stakes

¢

13%

on

openai

GPT-5.5

Backup for RAG over big docs

$$

5%

on

google

Gemini 2.5 Pro

Multimodal · image understanding

$

4%

on

fal.ai

flux-dev

Image generation for social

¢

3%

on

groq

Llama 3.3 70B

Ultra-fast cheap fallback

¢

1%

off

Tool registry

The agent has 80+ tools and counting.

Auto-derived from the code. Every tool is documented in /admin/api-docs the moment it ships. Tools are scoped per role and rate-limited per tenant.

Customers

customers.list
customers.create
customers.update
customers.merge
customers.history

Projects

projects.list
projects.create
projects.tasks
projects.milestones
projects.profitability

Money

invoice.draft
invoice.send
invoice.list
quote.create
quote.accept
payroll.run
taxes.file

Marketing

leads.search
leads.enrich
sequence.create
ads.list
ads.pause
content.draft
social.queue
trends.scan

Comms

email.compose
email.send
email.list
slack.notify
calendar.add
calendar.find_slot

Files & data

files.upload
files.search
files.share
sql.read
sql.export

Code (OpenClaw)

repo.read
branch.create
code.write
tests.run
pr.open
pr.review

Voice (Hermes)

call.answer
call.dial
voice.transcribe
voice.summarise
sms.send

Hand-off

When the agent needs a shell. It opens one.

For code-ish tasks, the agent can drop into the in-app terminal, SSH to your server, and run Claude Code there. You see the session live, can stop it any time, and it's logged.

Read-only by default
Write requires explicit consent.
Sandboxed
Per-tenant SSH key vault, no cross-tenant access.
Audited
Every command + diff stored, 90 days.
Recoverable
Stop at any time; revert via git or backups.

Terminal · ssh helmies@deploy

ttyd + tmux · pane 1 · Claude Code

connected

›

Not one agent. A team of them.

helmies-agent · supervised

One memory. Three specialists.

Pick a scenario. Watch it execute.

Right model. Right job.

Model router

The agent has 80+ tools and counting.

When the agent needs a shell. It opens one.

Terminal · ssh helmies@deploy

Stop tab-switching. Start delegating.