Abstract
We tested state-of-the-art LLMs under clinical-scale workloads using two designs: a single agent handling all tasks and a multi-agent orchestrator assigning each task to a dedicated worker. Across retrieval, extraction, and dosing tasks, batch sizes ranged from 5–80. Multi-agent accuracy remained high (90.6% at 5 tasks; 65.3% at 80), while single-agent accuracy collapsed (73.1% to 16.6%; p < 0.01). Multi-agent runs used up to 65-fold fewer tokens and limited latency growth. These findings show that lightweight orchestration preserves accuracy and efficiency under mixed-task clinical loads.
| Original language | English |
|---|---|
| Article number | 23 |
| Journal | npj Health Systems |
| Volume | 3 |
| Issue number | 1 |
| DOIs | |
| State | Published - Dec 2026 |
Fingerprint
Dive into the research topics of 'Orchestrated multi agents sustain accuracy under clinical-scale workloads compared to a single agent'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver