Frontier Models May 20 ago

Andrej Karpathy Joins Anthropic to Advance AI Pre-Training Research

Andrej Karpathy, formerly of OpenAI, has joined Anthropic to focus on pre-training research, signaling a pivotal moment in large language model development.

GPUBeat Desk

Desk · GPUBeat Media

Published

May 20 · 15:51 ET

Reading

1 min · 302 words

OpenAI — ai-infrastructure — OpenAI, Anthropic — Andrej Karpathy Joins Anthropic to Advance AI Pre-Training Research Source: GPUBeat

Andrej Karpathy, a key figure in AI and one of OpenAI’s original co-founders, has joined Anthropic to lead pre-training research initiatives. His move signifies a shift as he aims to apply his expertise to improve the training capabilities of large language models.

This week, Karpathy announced on X that he is returning to hands-on research and development after a diverse career that included a role at Tesla as head of AI for Autopilot. He expressed excitement about re-engaging with cutting-edge AI work, noting that the next few years in large language model development are expected to be important.

At Anthropic, Karpathy will report to Nick Joseph and concentrate on pre-training, a foundational stage in developing large language models where they learn from extensive data. This phase is among the most resource-intensive aspects of creating foundation models, with improvements here directly impacting model performance and scalability.

Anthropic has stated that Karpathy will not only contribute to research but will also help build a dedicated team focused on using their Claude model to accelerate pre-training research and experimentation. Advancements in this area could significantly boost the efficiency and capabilities of future AI models.

https://x.com/karpathy/status/2056753169888334312

Karpathy’s background in AI is impressive; after his early work at OpenAI, he joined Tesla in 2017 and left in 2022 to return briefly to OpenAI. He later founded Eureka Labs, an AI education startup, reflecting his dedication to education alongside his research pursuits. He holds a PhD in computer science from Stanford University, further enhancing his credentials in the AI field.

This development highlights Karpathy’s influence in AI and underscores Anthropic’s strategic emphasis on improving pre-training processes. As the AI sector evolves, Karpathy's contributions could significantly shape the future capabilities of large language models, driving innovation at a pivotal moment in AI.

GPUBeat Desk

Desk · joined 2026

GPUBeat Desk covers AI infrastructure — chips, foundation models, inference economics, datacenter buildouts, and the geopolitics of compute.

2033 stories

GPUBeat Desk

More on frontier models

Infratil CEO Highlights Untapped Data Center Potential in ANZ

Anthropic’s Olah Calls for Broader Oversight in AI Development

SK Telecom Partners with Defense Ministry to Advance AI in Military