Skip to main navigation Skip to search Skip to main content

Abstract

We tested state-of-the-art LLMs under clinical-scale workloads using two designs: a single agent handling all tasks and a multi-agent orchestrator assigning each task to a dedicated worker. Across retrieval, extraction, and dosing tasks, batch sizes ranged from 5–80. Multi-agent accuracy remained high (90.6% at 5 tasks; 65.3% at 80), while single-agent accuracy collapsed (73.1% to 16.6%; p < 0.01). Multi-agent runs used up to 65-fold fewer tokens and limited latency growth. These findings show that lightweight orchestration preserves accuracy and efficiency under mixed-task clinical loads.

Original languageEnglish
Article number23
Journalnpj Health Systems
Volume3
Issue number1
DOIs
StatePublished - Dec 2026

Fingerprint

Dive into the research topics of 'Orchestrated multi agents sustain accuracy under clinical-scale workloads compared to a single agent'. Together they form a unique fingerprint.

Cite this