Andrej Karpathy joins Anthropic’s pre-training team

Andrej Karpathy, an OpenAI founding member and former Tesla AI leader, has joined Anthropic to work on Claude’s pre-training team. He announced the move Tuesday, saying the next few years at the frontier of large language models will be especially formative.

Reuters reported that Karpathy started this week and will report into the group led by Nick Joseph, Anthropic’s head of pre-training. TechCrunch reported an extra detail: Karpathy is expected to start a team focused on using Claude to accelerate pre-training research.

The job is the signal

This is not just another senior AI hire. Pre-training is the stage where a frontier model absorbs its broad knowledge and core capabilities through large-scale training runs. It is expensive, infrastructure-heavy, and central to whether a model family can stay competitive.

Karpathy’s background fits that layer of the stack. He helped start OpenAI, later led AI work at Tesla, and has become one of the most recognizable educators in machine learning through his public courses and explanations. Bringing him into pre-training gives Anthropic both technical credibility and a public-facing researcher who understands how to explain hard systems clearly.

The more interesting part is the reported mandate to use Claude to speed up pre-training research. That suggests Anthropic is looking beyond the simple formula of more GPUs, more data, and bigger runs. It wants models to help researchers improve the process of building models.

Why Anthropic would care now

Frontier AI is increasingly a race between three bottlenecks: compute, data, and research throughput. Anthropic has already been aggressive on infrastructure, including major cloud and chip commitments. But every leading lab is chasing the same scarce capacity.

If a lab can make researchers faster — by using agents to inspect experiments, generate hypotheses, compare training runs, debug data pipelines, or automate evaluation work — that becomes a different kind of advantage. It may not replace compute, but it can make each training cycle more useful.

That is where Karpathy’s role could matter. He has experience with large-scale AI systems, but also with teaching and tooling. The best version of this hire is not simply Karpathy writing better training code. It is Anthropic building a tighter loop between Claude as a product and Claude as a research assistant for the team making the next Claude.

A talent win in a moving market

The move also continues a broader reshuffling of high-profile AI talent. Reuters noted that OpenAI co-founder John Schulman previously left for Anthropic, while other OpenAI leaders such as Ilya Sutskever and Mira Murati have moved on to different projects.

That does not mean talent is flowing in only one direction. The frontier labs are all competing for a small group of people who have seen large model training up close. But Anthropic landing Karpathy gives it a visible win at a moment when investors, enterprise customers, and developers are watching which labs can keep improving their models without losing trust.

What to watch

The practical question is whether this becomes visible in Claude’s pace of improvement. If Anthropic can use Claude internally to shorten research cycles, the effect should show up as more frequent model improvements, better reliability, or faster progress in areas like coding, reasoning, and tool use.

There is also a risk of overreading one hire. Frontier AI progress depends on teams, infrastructure, product direction, and safety decisions, not one famous researcher. Still, the placement matters. Anthropic did not just hire Karpathy for an advisory role or a broad strategy title. It put him near the core machinery that makes Claude work.

That makes this a small but meaningful marker for the next phase of the AI race: labs are not only building AI products for users. They are trying to make AI a serious part of their own research engine.

Karpathy joins Anthropic as the AI lab shifts more attention to pre-training

The job is the signal

Why Anthropic would care now

A talent win in a moving market

What to watch

Related reviews & takes

ASML Says High-NA Chips Are Months Away

Google and Blackstone are turning TPUs into an infrastructure bet

OpenAI’s MRC spec makes AI networking a platform fight