Skip to main content
GPUBeat Archive

/Tag: actava

OpenAI — ai-agents — OpenAI, Anthropic
Frontier Models 17h

New Benchmark Reveals AI Healthcare Agents’ Struggles with Workflows

A new benchmark shows top AI healthcare agents from OpenAI and Anthropic fail 72% of clinical workflows, raising concerns about their readiness for real-world applications.

More from this archive