Researchers from Google DeepMind and EPFL have successfully used deep reinforcement learning to control the magnetic fields of a nuclear fusion reactor. The AI agent managed the plasma within the Variable Configuration Tokamak, demonstrating the ability to maintain complex shapes and stability in real-time.
TLDR: An international collaboration has demonstrated that artificial intelligence can autonomously control the volatile plasma inside a nuclear fusion reactor. By using reinforcement learning, the system managed magnetic coils to maintain stability, solving a major engineering hurdle for the future of clean, limitless energy.
The pursuit of nuclear fusion is often likened to capturing the power of a star within a bottle. For decades, the primary obstacle to achieving this clean, virtually limitless energy source has not been the underlying physics, but the immense engineering challenge of containment. To facilitate fusion, hydrogen isotopes must be heated to temperatures exceeding 100 million degrees Celsius—hotter than the core of the sun. At these temperatures, matter enters a state known as plasma, a turbulent and highly conductive gas that is notoriously difficult to stabilize. In a tokamak reactor, this plasma is confined by powerful magnetic fields to prevent it from touching the vessel walls, which would instantly cool the reaction and damage the machine.
In a landmark collaboration, researchers from Google DeepMind and the Swiss Plasma Center at the École Polytechnique Fédérale de Lausanne (EPFL) have demonstrated a transformative solution to this control problem. They successfully applied deep reinforcement learning (RL) to manage the magnetic confinement systems of a tokamak in real-time. This breakthrough represents a fundamental shift from traditional control methods, which rely on rigid, manually programmed algorithms, toward an autonomous system capable of navigating the complex, non-linear dynamics of fusion plasma with unprecedented agility.
Traditional control systems for tokamaks are incredibly labor-intensive. Physicists must develop intricate mathematical models for every specific plasma configuration. In the case of the Variable Configuration Tokamak (TCV) at Lausanne, this involves coordinating 19 separate magnetic coils. Each coil’s magnetic field influences the plasma’s shape, position, and stability, creating a high-dimensional puzzle that evolves every millisecond. Historically, designing a controller for a new plasma shape could take months of simulation and testing. If a researcher wanted to experiment with a different geometry, the entire control architecture often had to be rebuilt from scratch.
The DeepMind and EPFL team bypassed this bottleneck by training an artificial intelligence agent in a high-fidelity simulator. Using reinforcement learning, the AI was not given specific instructions on how to move the magnets. Instead, it was given a set of goals—such as maintaining a specific shape or density—and allowed to discover the optimal voltage adjustments through millions of simulated trials. By adjusting the “reward function,” the researchers could encourage the AI to prioritize different physical properties, such as minimizing heat exhaust or maximizing stability. This trial-and-error process in a digital environment allowed the AI to master the physics of the TCV before ever touching the physical hardware.
The transition from the digital simulator to the physical TCV hardware was the ultimate test of the system’s robustness. When the AI agent was given control of the reactor, it successfully managed the plasma in real-time, making adjustments 10,000 times per second. The AI demonstrated remarkable versatility, maintaining standard “D-shaped” plasmas as well as more exotic configurations like the “snowflake” plasma, which helps distribute heat more effectively across the reactor walls. Most impressively, the AI managed to sustain two separate “droplets” of plasma within the chamber simultaneously, a feat that is exceptionally difficult to achieve with conventional controllers.
This capability allows scientists to explore a wider range of plasma geometries without the massive overhead of traditional software engineering. This flexibility is vital for the development of the International Thermonuclear Experimental Reactor (ITER) in France. ITER will require unprecedented levels of precision to achieve a net energy gain, and the ability of AI to adapt to unforeseen plasma behaviors could be the key to its success.
The success of this collaboration suggests that AI will be a cornerstone of future energy infrastructure. Beyond magnet control, AI could eventually predict “disruptions”—sudden losses of confinement—before they occur, allowing for corrective actions in microseconds. As the world seeks carbon-free energy, the marriage of machine learning and nuclear physics provides a clear pathway toward stable, commercial-scale fusion power.

