The Race Nobody Wants to Be In, and Nobody Can Leave

A game-theoretic look at why the labs keep pushing forward even though most of their senior people will tell you, off the record, that the pace is insane.

Scott Alexander wrote an essay in 2014 called "Meditations on Moloch." It was long and strange and it traveled farther than most blog posts do. The argument, simplified: Moloch is a metaphor for coordination failures, situations where every individual actor follows their rational self-interest and the collective result is catastrophic for everyone, including the individuals. The tragedy of the commons, nuclear arms races, regulatory capture. Moloch wins not because anyone wants the bad outcome but because the incentive structure won't let anyone choose the good one.

When you map that framework onto the AI race, it fits with uncomfortable precision.

The Structure of the Trap

There are currently five or six organizations in the world that are competitive at the frontier of AI capability: OpenAI, Anthropic, Google DeepMind, Meta AI, xAI, and arguably Microsoft in virtue of its deep integration with OpenAI. Behind them, slightly less capable but catching up, are labs in China—primarily Baidu, Zhipu AI, and an increasingly capable set of state-backed research institutes.

Each of these organizations employs people who understand the risks. Anthropic was founded by people who left OpenAI over safety concerns. Demis Hassabis at DeepMind has been explicit about existential risk from AI for over a decade. Sam Altman has said in public interviews that he thinks there's a non-trivial chance his company is building one of the most dangerous things in human history.

They're all still building as fast as they can.

Call it Moloch rather than hypocrisy. Each organization's reasoning goes roughly like this: if we pause and competitors don't, we lose the race. If we lose the race, we have no influence over how the technology is deployed, and the outcome might be worse than if we'd stayed in and shaped it. Therefore, continue.

Replace "we" with any competitor's name and the same logic applies to them. Everyone in the race has a version of this argument, and each version is coherent on its own. The collective outcome is everyone pushing as hard as possible toward a technology they publicly acknowledge might be existentially dangerous.

Why "We'll Do It Safely" Doesn't Break the Trap

Safety commitments are real but structurally weak against competitive pressure. OpenAI has safety teams, safety papers, a Preparedness Framework. Anthropic publishes its Responsible Scaling Policy. DeepMind runs extensive internal safety evaluations. These are genuine investments.

But safety investment has a ceiling imposed by competitive dynamics. If additional testing delays a model release by six months, and a competitor releases in that window, the safer company loses market position, investor confidence, and potentially the engineering talent who go where the excitement is. Safety constraints are a competitive cost, and under Molochian dynamics, costs get competed away.

The pause letters were explicitly an attempt to break this dynamic through collective action. The most famous, the Future of Life Institute's open letter in March 2023, was signed by Elon Musk, Stuart Russell, Yoshua Bengio, and others. The logic was that if everyone paused simultaneously, no one would lose position. It failed. The argument for a pause was never formally refuted; it was ignored. The major labs declined to sign or quietly continued, and one lab announced a major model release the week after the letter published.

The Geopolitical Layer

Domestic coordination failures are hard, and international ones are nearly impossible. The AI race runs between companies in San Francisco and London, but it also runs between countries, with everything that implies about adversarial relationships, information asymmetry, and the absence of enforcement mechanisms.

The "if we don't, China will" argument is ubiquitous in conversations with people in the US AI industry, where it functions as a trump card against safety-based slowdowns. It carries real weight. China has substantial AI investment, is catching up on frontier capabilities, and operates under different regulatory constraints. But the argument proves less than it claims. Chinese AI development has different characteristics than US development, and it's far from clear that a US slowdown straightforwardly produces a less safe world.

What it does do is make coordination even harder. Every successful AI governance framework depends on parties with reasons not to cooperate agreeing to cooperate anyway. The international track record on that (nuclear nonproliferation, climate agreements, pandemic preparedness) is mixed at best.

The People Inside the Trap

Talk to people at the major labs and what's striking is how many of them will describe the situation in terms that could have come from an AI risk essay. They know about Moloch and race dynamics; they use the vocabulary fluently, and some of them helped develop it.

They continue anyway. The institutional logic is stronger than any individual's concern. Their personal impact on the outcome seems small relative to the scale of the forces involved. And someone has to build it, so they'd rather it be people who at least understand the problem.

That is what makes the trap so durable. It needs no villains, only normal people in an incentive structure that points toward catastrophe. Moloch runs on otherwise decent people following their rational interests in a game that has no good equilibrium.

Is There an Exit?

The theoretical exits are clear: binding international agreements, domestic regulation with genuine teeth, a coordinated industry pause with third-party verification. None of these currently exist. The regulatory responses so far (the EU AI Act, executive orders in the US, voluntary commitments from labs) are real but insufficient. They haven't slowed the pace of capability development or imposed safety requirements that bite at the frontier. Chip export controls are the closest thing to a structural brake, and they operate at the level of national security policy rather than safety policy.

We don't know whether there's an exit from the trap before the trap closes. Given current institutional arrangements, the game theory suggests there might not be. That describes the problem that has to be solved, and measures how far we are from solving it. The timelines are compressing faster than the solutions are arriving.

The Race Nobody Wants to Be In, and Nobody Can Leave

The Structure of the Trap

Why "We'll Do It Safely" Doesn't Break the Trap

The Geopolitical Layer

The People Inside the Trap

Is There an Exit?

Continue the briefing

The People Building AGI Just Warned Congress It Can Help Make Bioweapons

The New AI Safety Order Is Voluntary. The Labs Can Just Say No.

Pause AI: The Argument That Refuses to Die