
AI Agents Show Distinct Disparities in Browser Exploitation Capabilities
Carnegie Mellon researchers unveil a benchmark highlighting the stark performance gap between Anthropic's Claude Mythos and OpenAI's GPT-5.5 in exploiting browser vulnerabilities.
More from this archive