Dramatic advances in artificial intelligence over the past decade (for narrow-purpose AI) and the last several years (for general-purpose AI) have transformed AI from a niche academic field to the core business strategy of many of the world’s largest companies, with hundreds of billions of dollars in annual investment in the techniques and technologies for advancing AI’s capabilities.
We now come to a critical juncture. As the capabilities of new AI systems begin to match and exceed those of humans across many cognitive domains, humanity must decide: how far do we go, and in what direction?
AI, like every technology, began as an effort to improve life for its creators. But our current trajectory, and implicit choice, is an unchecked race toward ever-more powerful systems, driven by the economic incentives of a few huge technology companies seeking to automate large swathes of current economic activity and human labor. If this race continues much longer, there is an inevitable winner: AI itself – a faster, smarter, cheaper alternative to people in our economy, our thinking, our decisions, and eventually in control of our civilization.
But we can make another choice: via our governments, we can take control of the AI development process to impose clear limits, lines we won’t cross, and things we simply won’t do – as we have for nuclear technologies, weapons of mass destruction, space weapons, environmentally destructive processes, the bioengineering of humans, and eugenics. Most importantly, we can ensure that AI remains a tool to empower humans, rather than a new species that replaces and eventually supplants us.
This essay argues that we should keep the future human by closing the “gates” to smarter-than-human, autonomous, general-purpose AI – sometimes called “AGI” – and especially to the highly-superhuman version sometimes called “superintelligence.” Instead, we should focus on powerful, trustworthy AI tools that can empower individuals and transformatively improve human societies’ abilities to do what they do best. The structure of this argument follows in brief.
AI systems are fundamentally different from other technologies. While traditional software follows precise instructions, AI systems learn how to achieve goals without being explicitly told how. This makes them powerful: if we can cleanly define the goal or a metric of success, in most cases an AI system can learn to achieve it. But it also makes them inherently unpredictable: we cannot reliably determine what actions they will take to achieve their objectives.
They are also largely unexplainable: although they are partly code, they are mostly an enormous set of inscrutable numbers – neural network “weights” – that cannot be parsed; we are not much better at understanding their inner workings than at discerning thoughts by peering inside a biological brain.
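To make this concrete, here is a deliberately miniature sketch (a toy example, far removed from frontier systems): a tiny neural network is given only a goal, reducing its prediction error on example data, and what it ends up “knowing” is a set of numbers that reveal nothing about how the task is solved.

```python
# Toy sketch for illustration only: a tiny neural network "learns" a function
# from examples. We specify the goal (reduce prediction error) but never the
# method; afterward, what the system "knows" is just arrays of numbers.
import numpy as np

rng = np.random.default_rng(0)

# Goal: approximate y = sin(x) from 200 example points, with no sine code anywhere.
x = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(x)

# All learned "knowledge" will end up in these weight matrices.
w1, b1 = rng.normal(0, 1.0, (1, 32)), np.zeros((1, 32))
w2, b2 = rng.normal(0, 0.1, (32, 1)), np.zeros((1, 1))

lr = 0.1
for step in range(3000):
    h = np.tanh(x @ w1 + b1)               # forward pass
    pred = h @ w2 + b2
    grad_pred = 2 * (pred - y) / len(x)    # gradient of mean squared error
    grad_w2 = h.T @ grad_pred
    grad_b2 = grad_pred.sum(axis=0, keepdims=True)
    grad_h = (grad_pred @ w2.T) * (1 - h ** 2)
    grad_w1 = x.T @ grad_h
    grad_b1 = grad_h.sum(axis=0, keepdims=True)
    # Nudge every weight to reduce the error; no rule says *how* to compute sine.
    w1 -= lr * grad_w1; b1 -= lr * grad_b1
    w2 -= lr * grad_w2; b2 -= lr * grad_b2

# The finished "model" is inscrutable: useful predictions, opaque numbers.
print("mean abs error:", float(np.mean(np.abs(np.tanh(x @ w1 + b1) @ w2 + b2 - y))))
print("a few learned weights:", np.round(w1.ravel()[:5], 3))
```

Frontier systems differ from this toy mainly in scale: the learned numbers run to many billions rather than a few dozen, which is why their inner workings resist inspection.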
This core approach to training digital neural networks is being pursued at rapidly increasing scale and complexity. The most powerful AI systems are created through massive computational experiments, using specialized hardware to train neural networks on enormous datasets, which are then augmented with software tools and superstructure.
This has led to the creation of very powerful tools for creating and processing text and images, performing mathematical and scientific reasoning, aggregating information, and interactively querying a vast store of human knowledge.
Unfortunately, while developing more powerful and more trustworthy technological tools is what we should be doing, and what nearly everybody wants and says they want, it is not the trajectory we are actually on.
Since the dawn of the field, AI research has instead focused on a different goal: Artificial General Intelligence. That goal has now become the central focus of the titanic companies leading AI development.
What is AGI? It is often vaguely defined as “human-level AI,” but this is problematic: which humans, and human-level at which capabilities? And what about the superhuman capabilities AI already has? A more useful way to understand AGI is through the intersection of three key properties: high Autonomy (independence of action), high Generality (broad scope and adaptability), and high Intelligence (competence at cognitive tasks). Current AI systems may be highly capable but narrow, or general but requiring constant human oversight, or autonomous but limited in scope.
Full A-G-I would combine all three properties at levels matching or exceeding top human capability. Critically, it is this combination that makes humans so effective and so different from current software; it is also what would enable digital systems to replace people wholesale.
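As a rough illustration of this triple intersection, here is a minimal sketch (a toy scoring scheme assumed for illustration, not a formal metric from this essay) in which a system counts as full A-G-I only when all three properties reach top-human level:

```python
# Toy sketch: AGI as the intersection of three properties, each scored on an
# assumed 0-1 scale where 1.0 means "matches or exceeds top human capability."
from dataclasses import dataclass

@dataclass
class AISystem:
    autonomy: float      # independence of action
    generality: float    # breadth of scope and adaptability
    intelligence: float  # competence at cognitive tasks

def is_full_agi(s: AISystem, human_level: float = 1.0) -> bool:
    """Full A-G-I requires ALL three properties at or above top human level."""
    return (s.autonomy >= human_level
            and s.generality >= human_level
            and s.intelligence >= human_level)

# Hypothetical scores: today's systems tend to be strong on one or two axes, not all three.
chess_engine = AISystem(autonomy=0.2, generality=0.1, intelligence=1.0)  # capable but narrow
chatbot = AISystem(autonomy=0.3, generality=0.9, intelligence=0.8)       # general but overseen
print(is_full_agi(chess_engine), is_full_agi(chatbot))  # False False
```

The point is not the particular numbers but the conjunction: remove any one of the three properties and what remains is a tool; combine all three at human level and the result is something that can replace human workers and decision-makers wholesale.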
While human intelligence is special, it is by no means a limit. Artificial “superintelligent” systems could operate hundreds of times faster, parse vastly more data and hold enormous quantities “in mind” at once, and form aggregates that are much larger and more effective than collections of humans. They could supplant not merely individuals but companies, nations, or our civilization as a whole.
There is a strong scientific consensus that AGI is possible. AI already surpasses human performance in many general tests of intellectual capability, including, most recently, high-level reasoning and problem-solving. Lagging capabilities – such as continual learning, planning, self-awareness, and originality – all exist at some level in present AI systems, and known techniques exist that are likely to improve all of them.
While until a few years ago many researchers saw AGI as decades away, the evidence now points strongly to short timelines.
The idea that smarter-than-human AGI is decades away or more is simply no longer tenable to the vast majority of experts in the field. Disagreements now are about how many months or years it will take if we stay on this course. The core question we face is: should we?
The race toward AGI is being driven by multiple forces, each making the situation more dangerous. Major technology companies see AGI as the ultimate automation technology – not just augmenting human workers but replacing them largely or entirely. For companies, the prize is enormous: the opportunity to capture a significant fraction of the world’s $100 trillion annual economic output by automating away human labor costs.
Nations feel compelled to join this race, publicly citing economic and scientific leadership, but privately viewing AGI as a potential revolution in military affairs comparable to nuclear weapons. Fear that rivals might gain a decisive strategic advantage creates a classic arms race dynamic.
Those pursuing superintelligence often cite grand visions: curing all diseases, reversing aging, achieving breakthroughs in energy and space travel, or creating superhuman planning capabilities.
Less charitably, what drives the race is power. Each participant – whether company or country – believes that intelligence equals power, and that they will be the best steward of that power.
I argue that these motivations are real but fundamentally misguided: AGI will absorb and seek power rather than grant it; AI-created technologies will also be strongly double-edged, and where beneficial can be created with AI tools and without AGI; and even insofar as AGI and its outputs remain under control, these racing dynamics – both corporate and geopolitical – make large-scale risks to our society nearly inevitable unless decisively interrupted.
Despite their allure, AGI and superintelligence pose dramatic threats to civilization through multiple reinforcing pathways:
Power concentration: superhuman AI could disempower the vast majority of humanity by absorbing huge swathes of social and economic activity into AI systems run by a handful of giant companies (which may in turn either be taken over by, or effectively take over, governments).
Massive disruption: bulk automation of most cognitive jobs, replacement of our current epistemic systems, and rollout of vast numbers of active nonhuman agents would upend most of our current civilizational systems in a relatively short period of time.
Catastrophes: by proliferating the ability – potentially above human level – to create new military and destructive technologies, while decoupling that ability from the social and legal systems that ground responsibility, AGI would make physical catastrophes from weapons of mass destruction dramatically more likely.
Geopolitics and war: major world powers will not sit idly by if they feel that a technology that could supply a “decisive strategic advantage” is being developed by their adversaries.
Runaway and loss of control: Unless it is specifically prevented, superhuman AI will have every incentive to further improve itself and could far outstrip humans in speed, data processing, and sophistication of thinking. There is no meaningful way in which we can be in control of such a system. Such AI will not grant power to humans; we will grant power to it, or it will take it.
Many of these risks remain even if the technical “alignment” problem – ensuring that advanced AI reliably does what humans want it to do – is solved. Advanced AI presents an enormous management challenge, and many aspects of that management become incredibly difficult or intractable once AI surpasses human intelligence.
Most fundamentally, the type of superhuman general-purpose AI currently being pursued would, by its very nature, have goals, agency, and capabilities exceeding our own. It would be inherently uncontrollable – how can we control something that we can neither understand nor predict? It would not be a technological tool for human use, but a second species of intelligence on Earth alongside ours. If allowed to progress further, it would constitute not just a second species but a replacement species.
Perhaps it would treat us well, perhaps not. But the future would belong to it, not us. The human era would be over.
The creation of superhuman AGI is far from inevitable. We can prevent it through a coordinated set of governance measures:
First, we need robust accounting and oversight of AI computation (“compute”), which is a fundamental enabler of, and lever to govern, large-scale AI systems. This in turn requires standardized measurement and reporting of the total compute used in training AI models and running them, and technical methods of tallying, certifying, and verifying computation used.
Second, we should implement hard caps on AI computation, both for training and for operation; these prevent AI systems both from becoming too powerful and from operating too quickly (a toy sketch of such accounting and cap-checking appears after the fourth measure below). These caps can be implemented through both legal requirements and hardware-based security measures built into AI-specialized chips, analogous to security features in modern phones. Because specialized AI hardware is made by only a handful of companies, verification and enforcement are feasible through the existing supply chain.
Third, we need enhanced liability for the most dangerous AI systems. Those developing AI that combines high autonomy, broad generality, and superior intelligence should face strict liability for harms, while safe harbors from this liability would encourage development of more limited and controllable systems.
Fourth, we need tiered regulation based on risk levels. The most capable and dangerous systems would require extensive safety and controllability guarantees before development and deployment, while less powerful or more specialized systems would face proportionate oversight. This regulatory framework should eventually operate at both national and international levels.
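To make the first two of these measures concrete, here is a minimal numerical sketch, assuming the commonly used estimate of roughly 6 × parameters × training tokens for training compute and purely hypothetical cap values, of how a developer or auditor might tally a proposed training run and deployment against such limits:

```python
# Toy sketch of compute accounting and cap-checking. The 6*N*D training-FLOP
# heuristic is a common rough estimate; the cap values below are hypothetical
# placeholders, not the thresholds proposed in the full document.

TRAINING_FLOP_CAP = 1e27        # hypothetical: total FLOP allowed for one training run
INFERENCE_FLOP_RATE_CAP = 1e20  # hypothetical: sustained FLOP/s allowed in operation

def estimate_training_flop(n_parameters: float, n_training_tokens: float) -> float:
    """Rough training-compute estimate via the ~6 * parameters * tokens rule of thumb."""
    return 6.0 * n_parameters * n_training_tokens

def audit(n_parameters: float, n_training_tokens: float,
          inference_flop_per_second: float) -> dict:
    """Report whether a proposed training run and deployment fall under the caps."""
    training_flop = estimate_training_flop(n_parameters, n_training_tokens)
    return {
        "estimated_training_flop": training_flop,
        "under_training_cap": training_flop <= TRAINING_FLOP_CAP,
        "under_operation_cap": inference_flop_per_second <= INFERENCE_FLOP_RATE_CAP,
    }

# Example: a 1-trillion-parameter model trained on 20 trillion tokens,
# then served at an aggregate 1e19 FLOP/s.
print(audit(1e12, 2e13, 1e19))
# {'estimated_training_flop': 1.2e+26, 'under_training_cap': True, 'under_operation_cap': True}
```

In practice, such tallies would be certified and verified through the technical methods and hardware-level security measures on AI-specialized chips described above, rather than relying on self-reporting alone.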
This approach – with detailed specification given in the full document – is practical: while international coordination will be needed, verification and enforcement can work through the small number of companies controlling the specialized hardware supply chain. It is also flexible: companies can still innovate and profit from AI development, just with clear limits on the most dangerous systems.
Longer-term containment of AI power and risk would require international agreements grounded in both self-interest and common interest, just as controlling the proliferation of nuclear weapons does now. But we can start immediately with enhanced oversight and liability, while building toward more comprehensive governance.
The key missing ingredient is political and social will to take control of the AI development process. The source of that will, if it comes in time, will be reality itself – that is, widespread realization of the real implications of what we are doing.
Rather than pursuing uncontrollable AGI, we can develop powerful “Tool AI” that enhances human capability while remaining under meaningful human control. Tool AI systems can be extremely capable while avoiding the dangerous triple-intersection of high autonomy, broad generality, and superhuman intelligence, as long as we engineer them to be controllable at a level commensurate with their capability. They can also be combined into sophisticated systems that maintain human oversight while delivering transformative benefits.
Tool AI can revolutionize medicine, accelerate scientific discovery, enhance education, and improve democratic processes. When properly governed, it can make human experts and institutions more effective rather than replacing them. While such systems will still be highly disruptive and require careful management, the risks they pose are fundamentally different from AGI: they are risks we can govern, like those of other powerful technologies, not existential threats to human agency and civilization. And crucially, when wisely developed, AI tools can help people govern powerful AI and manage its effects.
This approach requires rethinking both how AI is developed and how its benefits are distributed. New models of public and non-profit AI development, robust regulatory frameworks, and mechanisms to distribute economic benefits more broadly can help ensure AI empowers humanity as a whole rather than concentrating power in a few hands. AI itself can help build better social and governance institutions, enabling new forms of coordination and discourse that strengthen rather than undermine human society. National security establishments can leverage their expertise to make AI tool systems genuinely secure and trustworthy, and a true source of defense as well as national power.
We may eventually choose to develop yet more powerful and more sovereign systems that are less like tools and – we can hope – more like wise and powerful benefactors. But we should take that step only after we have developed the scientific understanding and governance capacity to do so safely. Such a momentous and irreversible decision should be made deliberately by humanity as a whole, not by default in a race between tech companies and nations.
People want the good that comes from AI: useful tools that empower them, supercharge economic opportunities and growth, and promise breakthroughs in science, technology, and education. Why wouldn’t they? But when asked, overwhelming majorities of the general public want slower and more careful AI development, and do not want smarter-than-human AI that will replace them in their jobs and elsewhere, fill their culture and information commons with non-human content, concentrate power in a tiny set of corporations, pose extreme large-scale global risks, and eventually threaten to disempower or replace their species. Why would they?
We can have one without the other. It starts by deciding that our destiny lies not in the supposed inevitability of some technology, nor in the hands of a few CEOs in Silicon Valley, but in the hands of the rest of us, if we take hold of it. Let’s close the Gates, and keep the future human.