The recursive self-improvement debate has been running for about thirty years, and both sides are still there, still arguing, with positions that have barely moved despite an enormous amount of new evidence. It's one of the few genuinely unresolved empirical questions in the field—not because the experiments haven't been done, but because people interpret the same experiments so differently.
The basic question: if an AI system can improve its own intelligence, how fast does that improvement happen? Fast enough to matter catastrophically? Or slow enough that human institutions can adapt?
The Hard-Takeoff Case
Eliezer Yudkowsky has been making the hard-takeoff argument since the late 1990s, when he was writing long essays on a mailing list that most people had never heard of. The core intuition: intelligence improvement is not linear. Small improvements in intelligence generate larger improvements in the ability to make further improvements. The feedback loop compounds. What starts as a modest advance becomes, within hours or days, something qualitatively different from anything that's existed before.
He coined the term "FOOM" for this scenario—a hard takeoff so fast it looks like an explosion. The word captures something important: the idea that there wouldn't be a gradual transition, a period of warning, a chance to adjust. The system crosses some threshold and then it's done. The world after is not recognizably continuous with the world before.
The argument isn't crazy. Compound growth does produce nonlinear outcomes. Small edges in capability can translate to large edges in output. There's no obvious physical reason why intelligence improvement couldn't be self-reinforcing to a degree that produces rapid, discontinuous change.
The Soft-Takeoff Case
Robin Hanson, the economist, has been the most persistent skeptic of the FOOM hypothesis. His counterargument is essentially empirical: we don't observe this pattern in the historical record. Other important technological developments—steam power, computing, communications—have shown fast progress but not FOOM-fast progress. They hit diminishing returns. They encountered physical bottlenecks. They required complementary inputs that took time to develop.
The specific bottleneck he emphasizes is that intelligence improvement is not bottlenecked only by intelligence. A more intelligent AI can design better algorithms. But it still needs training data, compute, energy, and engineering work to implement those algorithms. Those things don't scale with intelligence alone. So the compound feedback loop has friction—possibly a lot of friction.
Yudkowsky's response is essentially that Hanson is underestimating the magnitude of the intelligence advantage once it crosses a certain threshold. A system that's smarter than all humans combined could potentially work around most of those bottlenecks in ways we can't anticipate. The bottlenecks we can see are the ones that exist at human intelligence levels. A sufficiently superhuman system might simply solve them.
What Current Scaling Has and Hasn't Shown
GPT-4 to GPT-5. Claude 2 to Claude 3 to Claude 4. Each generation is demonstrably better than the last on benchmarks. Capabilities that seemed years away appeared within months of each other. The rate of progress over 2020-2025 was genuinely faster than most people expected, even people paid to track it.
Does this vindicate the hard-takeoff case? Not really—the progress, while fast, has been gradual and required enormous engineering effort and capital. The companies spending the most are also spending on things that don't scale with model capability: data centers, energy infrastructure, human researchers. If self-improvement were truly exponential and unconstrained, you'd expect to see the pace accelerating much faster than it is.
But there's a counterpoint to the counterpoint. Current AI systems can't actually improve themselves—they're trained by humans on human-curated data using human-designed architectures. The recursive self-improvement scenario requires something that doesn't yet exist: a system capable of meaningfully improving its own training process, not just its outputs. We don't know what the scaling curves look like once that threshold is crossed, because we haven't crossed it.
The Honest Position
I think the honest answer is that nobody knows. The hard-takeoff scenario is possible and the arguments for it haven't been refuted, they've been contested. The soft-takeoff scenario is also possible and the empirical evidence tilts in its direction, for now, for current systems.
What seems clear is that the uncertainty is asymmetric. If soft takeoff people are wrong, the world has time to adapt. If hard takeoff people are wrong, we wasted some caution. If hard takeoff people are right and the soft takeoff crowd bet against them, we don't get to be wrong twice.
Yudkowsky has made this asymmetry argument for decades. It's not obviously wrong. The problem is that acting on it implies constraints on AI development that no government or company has shown willingness to accept—and the argument for those constraints requires levels of certainty that the evidence doesn't yet provide.
Why the Recursive Self-Improvement Debate Matters
The speed of takeoff matters enormously for what kinds of safety measures are feasible. Slow takeoff gives you time for governance, for iteration, for interpretability research to catch up. Hard takeoff means none of that works—you either solve alignment before the first powerful recursive self-improver, or you don't solve it at all.
If Yudkowsky is right, almost everything the mainstream AI safety community is doing is running the wrong race. If Hanson is right, the slower pace means that something like normal human institutions might actually work.
We'll find out. Probably sooner than either side predicted.