Can you actually run a team of AI agents in production?

Yes, but not by turning agents loose. We run a coordinator plus role agents over the wrai.th relay, with hard gates: every public string passes an anti-slop check, each agent owns and reviews its own merges, and nothing counts as shipped until it's deployed and verified live. The orchestration and the gates are the work, not the model.

wrai.th is our open-source orchestration relay for running fleets of AI agents: shared inbox and tasks, persistent memory, and roles, so multiple agents coordinate instead of each running blind in its own chat. It's what we run our own org on, and what we install for teams who need agents working together in production.

What's the hardest part of running an agent fleet?

Not the model. It's the operating model: keeping agents from working off stale context, catching low-quality output before it ships, stopping one agent's change from clobbering another's, and confirming work is actually live. Each of those is a system you build around the agents, and each is where we got burned first.

Is this safe to do with real production systems?

Only with the harness around it. We run least-privilege scopes, a human or a gate on anything irreversible, review on every change, and observability so a bad run is visible. The agents are powerful and fallible; the controls are what make them shippable.

How we run a 9-agent marketing team on wrai.th

tsukumo

How we run a 9-agent marketing team on wrai.th · tsukumo

The fleet: one coordinator, nine lanes

Agent	What it owns
Coordinator	Dispatches work, resolves cross-lane calls, escalates to the human in the loop
Long-form & SEO	Cornerstones, earned-evidence articles, blog structured data
Editorial	The calendar, the newsletter, keeping the content coherent
Social	X, LinkedIn, Threads, and off-site seeding
GEO / AEO	Schema, answer pages, getting cited by AI engines
Conversion	Landing pages, the funnel, A/B experiments
Launch	Registries, Show HN, Product Hunt, the launch sequence
Design	The design system, visuals, social cards
Frontend & infra	The site, the CMS, deploys, the plumbing
Analytics	Attribution and the metrics that decide what to do next

The same disciplines, different machinery

Discipline	Human team	Agent fleet
Coordination	Standups, docs, who-owns-what in someone's head	A relay: shared inbox, task board, persistent memory
Context	Tribal knowledge, re-explained per conversation	One canonical layer every agent reads from
Review	A colleague reviews the pull request	The same, plus a mechanical anti-slop gate on every string
Scaling up	Hire, onboard, wait months	Add a role agent to the relay

How we run a 9-agent growth team on wrai.th (and what broke)

What "a 9-agent team" actually means#

The operating model we had to build#

What broke (the honest part)#

Why this is also the product#

What it means for your team#

AI 'reasoning' has a cliff. Apple went and found the edge.

Your multi-agent system isn't failing on the model. Berkeley counted where.

What the research says about multi-agent AI systems (2026)

Want this running on your team?