Skip to main content
GPUBeat Frontier Models Andrej Karpathy Joins Anthropic to Lead…

Andrej Karpathy Joins Anthropic to Lead Pretraining Research Team

Andrej Karpathy has joined Anthropic's pretraining team, aiming to advance large language model research. His move reflects a significant shift in AI development strategies.

OpenAI — ai-infrastructure — OpenAI, Anthropic
Andrej Karpathy Joins Anthropic to Lead Pretraining Research Team Source: GPUBeat

Andrej Karpathy, a key figure in AI development and co-founder of OpenAI, has joined Anthropic's pretraining team, marking a strategic move to enhance the capabilities of large language models (LLMs). In a post on X (formerly Twitter), Karpathy shared his excitement for the future of LLMs, saying, "I think the next few years at the frontier of LLMs will be especially formative." His transition comes as Anthropic aims to strengthen its research efforts in a competitive environment.

Karpathy, who started his role earlier this week, will collaborate with a team led by Nicholas Joseph, another former OpenAI employee. This team is tasked with the large-scale training runs that support the capabilities of Anthropic's Claude models. Karpathy will focus on using Claude to improve pretraining research, a vital step in the development of advanced AI systems. Joseph welcomed Karpathy, stating, "I can't think of anyone better suited to do it."

Before this position, Karpathy played a significant role at OpenAI, contributing to foundational work in deep learning and computer vision. After departing OpenAI in 2017 for Tesla, he led the company's Full Self-Driving and Autopilot initiatives until he left in 2022. Karpathy briefly returned to OpenAI before launching his venture, Eureka Labs, which applies AI innovations in education. He is also recognized for creating educational content, including the course "Neural Networks: Zero to Hero," and for coining the term "vibe coding."

Karpathy's move holds significance beyond his personal career. It arrives as Anthropic reportedly pursues a $30 billion fundraising round, which could value the company at $900 billion, surpassing OpenAI's latest valuation of $852 billion. This fundraising initiative highlights the intensifying competition among AI firms to secure resources for developing advanced technologies.

Karpathy's role in pretraining is particularly important, as this phase is essential for providing AI models with foundational knowledge before fine-tuning and instruction training. The pretraining process requires substantial computational power and resources, making it one of the most demanding aspects of AI model development. With Karpathy leading this effort, Anthropic seeks to boost its ability to accelerate research and innovation in this area, further establishing its position in the AI sector.

See also  Demis Hassabis Backs Anthropic: A $14 Billion Trend in AI Startups

As the AI industry evolves rapidly, the addition of experienced leaders like Karpathy to companies such as Anthropic indicates a shift toward more sophisticated and resource-intensive approaches to AI development. The next few years may bring notable advancements, especially in LLM capabilities, as teams like Karpathy's work to expand the potential of these models. The implications for the broader AI ecosystem are significant, with potential effects across various sectors, including education and autonomous systems.

GD

GPUBeat Desk

Desk · joined 2026

GPUBeat Desk covers AI infrastructure — chips, foundation models, inference economics, datacenter buildouts, and the geopolitics of compute.