Parallel specialist lanes let one Gateway route different chats or rooms to different agents, while keeping the user experience fast. The trick is to treat parallelism as a scarce-resource design problem, not just as “more agents”.Documentation Index
Fetch the complete documentation index at: https://docs.openclaw.ai/llms.txt
Use this file to discover all available pages before exploring further.
First principles
A specialist lane only improves throughput when it reduces contention for the real bottlenecks:- Session locks: only one run should mutate a given session at a time.
- Global model capacity: all visible chat runs still share provider limits.
- Tool capacity: shell, browser, network, and repository work can be slower than the model turn itself.
- Context budget: long transcripts make every future turn slower and less focused.
- Ownership ambiguity: duplicate agents doing the same job waste capacity.
Recommended rollout
Phase 1: lane contracts + background heavy work
Give every lane a written contract in its workspace and system prompt:- Purpose: the work this lane owns.
- Non-goals: work it should hand off instead of attempting.
- Chat budget: quick answers stay in chat; long tasks should acknowledge briefly, then run in a background sub-agent or task.
- Handoff rule: when another lane owns the work, say where it should go and provide a compact handoff summary.
- Tool-risk rule: prefer the smallest tool surface that can do the job.
Phase 2: priority and concurrency controls
Tune queue and model capacity around the business value of each lane:Phase 3: coordinator / traffic controller
Add a small coordinator pattern once multiple lanes are active:- Track active lane tasks and owners.
- Detect duplicate requests across groups.
- Route handoff summaries between lanes.
- Surface only blockers, completed results, and decisions the human must make.