AI Designs Quantum Physics Experiments beyond What Any Human Has Conceived

Originally built to speed up calculations, a machine-learning system is now making shocking progress at the frontiers of experimental quantum physics

By Anil Ananthaswamy

Quantum physicist Mario Krenn remembers sitting in a café in Vienna in early 2016, poring over computer printouts, trying to make sense of what MELVIN had found. MELVIN was a machine-learning algorithm Krenn had built, a kind of artificial intelligence. Its job was to mix and match the building blocks of standard quantum experiments and find solutions to new problems. And it did find many interesting ones. But there was one that made no sense. “The first thing I thought was, ‘My program has a bug because the solution cannot exist,’” Krenn says.

MELVIN had seemingly solved the problem of creating highly complex entangled states involving multiple photons (entangled states being those that once made Albert Einstein invoke the specter of “spooky action at a distance”). Krenn, Anton Zeilinger of the University of Vienna and their colleagues had not explicitly provided MELVIN the rules needed to generate such complex states, yet it had found a way. Eventually Krenn realized that the algorithm had rediscovered a type of experimental arrangement that had been devised in the early 1990s. But those experiments had been much simpler. MELVIN had cracked a far more complex puzzle. “When we understood what was going on, we were immediately able to generalize [the solution],” says Krenn, who is now at the University of Toronto.

Since then, other teams have started performing the experiments identified by MELVIN, allowing them to test the conceptual underpinnings of quantum mechanics in new ways. Meanwhile Krenn, working with colleagues in Toronto, has refined their machine-learning algorithms. Their latest effort, an AI called THESEUS, has upped the ante: it is orders of magnitude faster than MELVIN, and humans can readily parse its output. While it would take Krenn and his colleagues days or even weeks to understand MELVIN’s meanderings, they can almost immediately figure out what THESEUS is saying. “It is amazing work,” says theoretical quantum physicist Renato Renner of the Institute for Theoretical Physics at the Swiss Federal Institute of Technology Zurich, who reviewed a 2020 study about THESEUS but was not directly involved in these efforts.

Krenn stumbled on this entire research program somewhat by accident when he and his colleagues were trying to figure out how to experimentally create quantum states of photons entangled in a very particular manner. When two photons interact, they become entangled, and both can be mathematically described only using a single shared quantum state. If you measure the state of one photon, the measurement instantly fixes the state of the other even if the two are kilometers apart (hence Einstein’s derisive comments on entanglement being “spooky”).

In 1989 three physicists—Daniel Greenberger, the late Michael Horne and Zeilinger—described an entangled state that came to be known as GHZ (after their initials). It involved four photons, each of which could be in a quantum superposition of, say, two states, 0 and 1 (a quantum state called a qubit). In their paper, the GHZ state involved entangling four qubits such that the entire system was in a two-dimensional quantum superposition of states 0000 and 1111. If you measured one of the photons and found it in state 0, the superposition would collapse, and the other photons would also be in state 0. The same went for state 1. In the late 1990s Zeilinger and his colleagues experimentally observed GHZ states using three qubits for the first time.

Krenn and his colleagues were aiming for GHZ states of higher dimensions. They wanted to work with three photons, where each photon had a dimensionality of three, meaning it could be in a superposition of three states: 0, 1 and 2. This quantum state is called a qutrit. The entanglement the team was after was a three-dimensional GHZ state that was a superposition of states 000, 111 and 222. Such states are important ingredients for secure quantum communications and faster quantum computing. In late 2013 the researchers spent weeks designing experiments on blackboards and doing the calculations to see if their setups could generate the required quantum states. But each time they failed. “I thought, ‘This is absolutely insane. Why can’t we come up with a setup?’” Krenn says.
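
To make the target state concrete, here is a minimal Python sketch of the three-photon, three-dimensional GHZ state described above. It is an illustration only: the dictionary representation and the names `ghz_state` and `measure_first` are assumptions made for this example and are not taken from the researchers' software.

```python
import math

def ghz_state(n_photons, dim):
    """Return a GHZ state as {basis string: amplitude}.

    The state is an equal superposition of 00...0, 11...1, and so on up to
    level dim - 1. Levels are written as single digits, so dim must be <= 10.
    """
    amp = 1 / math.sqrt(dim)
    return {str(level) * n_photons: amp for level in range(dim)}

def measure_first(state, outcome):
    """Collapse the state given that the first photon was found in `outcome`.

    Returns the renormalized state of the remaining photons.
    """
    kept = {basis[1:]: amp for basis, amp in state.items()
            if basis[0] == str(outcome)}
    norm = math.sqrt(sum(abs(a) ** 2 for a in kept.values()))
    if norm == 0:
        raise ValueError("outcome has zero probability")
    return {basis: amp / norm for basis, amp in kept.items()}

ghz = ghz_state(3, 3)          # superposition of 000, 111 and 222
after = measure_first(ghz, 2)  # the other two photons are left in state 22
```

Finding the first photon in state 2 leaves only the `22` term for the other two photons, which is the all-or-nothing correlation that defines a GHZ state.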

To speed up the process, Krenn first wrote a computer program that took an experimental setup and calculated the output. Then he upgraded the program to allow it to incorporate in its calculations the same building blocks that experimenters use to create and manipulate photons on an optical bench: lasers, nonlinear crystals, beam splitters, phase shifters, holograms, and the like. The program searched through a large space of configurations by randomly mixing and matching the building blocks, performed the calculations and spat out the result. MELVIN was born. “Within a few hours the program found a solution that we scientists—three experimentalists and one theorist—could not come up with for months,” Krenn says. “That was a crazy day. I could not believe that it happened.” Then he gave MELVIN more smarts. Anytime it found a setup that did something useful, MELVIN added that setup to its toolbox. “The algorithm remembers that and tries to reuse it for more complex solutions,” Krenn says.
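
The search loop described above can be caricatured in a few lines of Python. This is a toy sketch, not MELVIN itself: the three "elements" merely shuffle or relabel a tuple of numbers, and every name here (`TOOLBOX`, `run_setup`, `random_search`, `remember`) is hypothetical. What it does show is the pattern of randomly assembling building blocks, checking the output and storing a working setup for reuse.

```python
import random

# Toy "optical elements": each maps a tuple of three labels to a new tuple.
# They are stand-ins for illustration, not models of real optics.
TOOLBOX = {
    "swap_ab": lambda s: (s[1], s[0], s[2]),
    "swap_bc": lambda s: (s[0], s[2], s[1]),
    "shift_a": lambda s: ((s[0] + 1) % 3, s[1], s[2]),
}

def run_setup(setup, toolbox, start):
    """Apply the named elements of a setup in order and return the output."""
    state = start
    for name in setup:
        state = toolbox[name](state)
    return state

def random_search(toolbox, start, target, max_len=6, tries=10000, seed=0):
    """Randomly assemble setups until one maps `start` to `target`."""
    rng = random.Random(seed)
    names = sorted(toolbox)
    for _ in range(tries):
        setup = [rng.choice(names) for _ in range(rng.randint(1, max_len))]
        if run_setup(setup, toolbox, start) == target:
            return setup
    return None  # nothing found within the allotted tries

def remember(toolbox, name, setup):
    """Store a working setup as a single reusable element (the memory step)."""
    parts = list(setup)
    base = dict(toolbox)  # snapshot, so the new element cannot call itself
    toolbox[name] = lambda s: run_setup(parts, base, s)

start = (0, 1, 2)
first = random_search(TOOLBOX, start, (1, 0, 2))   # an easy sub-goal
remember(TOOLBOX, "found_1", first)
second = random_search(TOOLBOX, start, (2, 1, 0))  # may reuse "found_1"
```

The real program evaluated quantum states rather than tuples, and its criteria for a "useful" setup were far richer, but the control flow of generate, test and remember is the same idea.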

It was this more evolved MELVIN that left Krenn scratching his head in a Viennese café. He had set it running with an experimental toolbox that contained two crystals, each capable of generating a pair of photons entangled in three dimensions. Krenn’s naive expectation was that MELVIN would find configurations that combined these pairs of photons to create entangled states of at most nine dimensions. But “it actually found one solution, an extremely rare case, that has much higher entanglement than the rest of the states,” Krenn says.

Eventually he figured out that MELVIN had used a technique that multiple teams had developed nearly three decades ago. In 1991 Xin Yu Zou, Li Jun Wang and Leonard Mandel, all then at the University of Rochester, designed one method. And in 1994 Zeilinger, then at the University of Innsbruck in Austria, and his colleagues came up with another. Conceptually these experiments attempted something similar, but the configuration that Zeilinger and his colleagues devised is simpler to understand. It starts with one crystal that generates a pair of photons (A and B). The paths of these photons go right through another crystal, which can also generate two photons (C and D). The paths of photon A from the first crystal and of photon C from the second overlap exactly and lead to the same detector. If that detector clicks, it is impossible to tell whether the photon originated from the first or the second crystal. The same goes for photons B and D.

A phase shifter is a device that effectively lengthens the path a photon travels by some fraction of its wavelength. If you were to introduce a phase shifter in one of the paths between the crystals and keep changing the amount of phase shift, you could cause constructive and destructive interference at the detectors. For example, each of the crystals, operating on its own, could be generating, say, 1,000 pairs of photons per second. With constructive interference, the detectors would register 4,000 pairs of photons per second, not 2,000, because it is the quantum amplitudes rather than the rates that add. And with destructive interference, they would detect none: the system as a whole would not create any photons even though each crystal on its own would be generating 1,000 pairs a second. “That is actually quite crazy, when you think about it,” Krenn says.
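
The arithmetic behind those numbers can be checked with a short Python sketch. It assumes an idealized case: two sources of equal strength that are perfectly indistinguishable, with the phase shifter setting their relative phase. The function name `detected_pair_rate` is made up for this example.

```python
import cmath
import math

def detected_pair_rate(rate_per_crystal, phase):
    """Pair rate at the detectors for two indistinguishable sources.

    Each crystal alone would give `rate_per_crystal` pairs per second.
    Because the sources cannot be told apart, their amplitudes (not their
    rates) add, with a relative phase set by the phase shifter.
    """
    amplitude = math.sqrt(rate_per_crystal)
    total = amplitude + amplitude * cmath.exp(1j * phase)
    return abs(total) ** 2

constructive = detected_pair_rate(1000, 0.0)     # about 4,000 pairs per second
destructive = detected_pair_rate(1000, math.pi)  # about 0 pairs per second
halfway = detected_pair_rate(1000, math.pi / 2)  # about 2,000 pairs per second
```

Doubling the amplitude quadruples the rate, which is why constructive interference yields 4,000 pairs per second rather than 2,000, and why a half-wavelength shift cancels the output entirely.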

MELVIN’s funky solution involved such overlapping paths. What had flummoxed Krenn was that the algorithm had only two crystals in its toolbox. And instead of using those crystals at the beginning of the experimental setup, it had wedged them inside an interferometer (a device that splits the path of, say, a photon into two and then recombines them). After much effort, he realized that the setup MELVIN had found was equivalent to one involving more than two crystals, each generating pairs of photons, such that their paths to the detectors overlapped. The configuration could be used to generate high-dimensional entangled states.

Quantum physicist Nora Tischler, who was a Ph.D. student working with Zeilinger on an unrelated topic when MELVIN was being put through its paces, was paying attention to these developments. “It was kind of clear from the beginning [that such an] experiment wouldn’t exist if it hadn’t been discovered by an algorithm,” she says.

Besides generating complex entangled states, the setup using more than two crystals with overlapping paths can be employed to perform a generalized form of Zeilinger’s 1994 quantum interference experiments with two crystals. Aephraim Steinberg, an experimentalist who is a Toronto colleague of Krenn’s but has not worked on these projects, is impressed by what the AI found. “This is a generalization that (to my knowledge) no human dreamed up in the intervening decades and might never have done,” he says. “It’s a gorgeous first example of the kind of new explorations these thinking machines can take us on.”

In one such generalized configuration with four crystals, each generating a pair of photons, and overlapping paths leading to four detectors, quantum interference can create situations where either all four detectors click (constructive interference) or none of them does (destructive interference). Until recently, carrying out such an experiment had remained a distant dream. Then, in a March preprint paper, a team led by Lan-Tian Feng of the University of Science and Technology of China, in collaboration with Krenn, reported that they had fabricated the entire setup on a single photonic chip and performed the experiment. The researchers collected data for more than 16 hours, a feat made possible by the photonic chip’s exceptional optical stability and one that would have been impossible to achieve in a larger-scale tabletop experiment. For starters, the tabletop setup would require a square meter’s worth of optical elements precisely aligned on an optical bench, Steinberg says. Besides, “a single optical element jittering or drifting by a thousandth of the diameter of a human hair during those 16 hours could be enough to wash out the effect,” he says.

During their early attempts to simplify and generalize what MELVIN had found, Krenn and his colleagues realized that the solution resembled abstract mathematical forms called graphs, which contain vertices and edges and are used to depict pairwise relations between objects. For these quantum experiments, every path a photon takes is represented by a vertex. And a crystal, for example, is represented by an edge connecting two vertices. MELVIN first produced such a graph and then performed a mathematical operation on it. The operation, called perfect matching, involves generating an equivalent graph in which each vertex is connected to exactly one edge. This process makes calculating the final quantum state much easier, although the resulting graphs are still hard for humans to understand.
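
For readers unfamiliar with the term, the following Python sketch enumerates the perfect matchings of a small graph, meaning every selection of edges in which each vertex is touched by exactly one edge. The four-vertex graph and the function name `perfect_matchings` are invented for illustration and do not correspond to any particular experiment.

```python
def perfect_matchings(vertices, edges):
    """Return every list of edges that touches each vertex exactly once.

    `edges` is a list of 2-tuples whose endpoints all appear in `vertices`.
    """
    vertices = sorted(vertices)
    if not vertices:
        return [[]]  # nothing left to cover: one (empty) matching
    first = vertices[0]
    results = []
    for edge in edges:
        if first not in edge:
            continue
        u, v = edge
        partner = v if u == first else u
        if partner == first:
            continue  # ignore self-loops
        # Remove both endpoints, then match whatever is left.
        rest_vertices = [x for x in vertices if x not in (first, partner)]
        rest_edges = [e for e in edges
                      if first not in e and partner not in e]
        for rest in perfect_matchings(rest_vertices, rest_edges):
            results.append([edge] + rest)
    return results

# Four photon paths (vertices) and five hypothetical crystals (edges).
paths = ["a", "b", "c", "d"]
crystals = [("a", "b"), ("c", "d"), ("a", "c"), ("b", "d"), ("a", "d")]
matchings = perfect_matchings(paths, crystals)
# Two matchings exist: a-b with c-d, and a-c with b-d.
```

The edge a-d appears in no matching because nothing is left to pair b with c. This brute-force enumeration is fine for a handful of vertices but grows quickly with graph size.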

That changed with MELVIN’s successor THESEUS, which generates much simpler graphs by winnowing the first complex graph representing a solution that it finds down to the bare minimum number of edges and vertices (such that any further deletion destroys the setup’s ability to generate the desired quantum states). Such graphs are simpler than MELVIN’s perfect-matching graphs, so it is much easier to make sense of any AI-generated solution. Renner is particularly impressed by THESEUS’s human-interpretable outputs. “The solution is designed in such a way that the number of connections in the graph is minimized,” he says. “And that’s naturally a solution we can better understand than if you had a very complex graph.”
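
The winnowing idea can be sketched in Python as a greedy deletion loop. This is not THESEUS's actual procedure, which the article does not spell out. The names `prune` and `covers_all` are hypothetical, and `covers_all` is a deliberately simple placeholder for the real test of whether a setup still produces the desired quantum state.

```python
def prune(edges, still_works):
    """Greedily delete edges while `still_works(edges)` stays true.

    For a predicate that can only break as edges are removed, the result
    is minimal: deleting any one remaining edge makes `still_works` fail.
    """
    kept = list(edges)
    i = 0
    while i < len(kept):
        trial = kept[:i] + kept[i + 1:]
        if still_works(trial):
            kept = trial  # edge i was unnecessary; recheck the same index
        else:
            i += 1        # edge i is needed; move on
    return kept

def covers_all(edges, vertices=("a", "b", "c", "d")):
    """Placeholder check: every vertex is touched by at least one edge."""
    touched = {v for edge in edges for v in edge}
    return all(v in touched for v in vertices)

graph = [("a", "b"), ("a", "c"), ("b", "d"), ("c", "d"), ("a", "d")]
pruned = prune(graph, covers_all)
# pruned is [("b", "d"), ("c", "d"), ("a", "d")]
```

A greedy pass like this guarantees only that no single further deletion is possible, not that the graph is the smallest one overall. Here it keeps three edges, although the two edges a-b and c-d would also satisfy the placeholder check.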

Eric Cavalcanti of Griffith University in Australia is both impressed by the work and circumspect about it. “These machine-learning techniques represent an interesting development. For a human scientist looking at the data and interpreting it, some of the solutions may look like ‘creative’ new solutions. But at this stage, these algorithms are still far from a level where it could be said that they are having truly new ideas or coming up with new concepts,” he says. “On the other hand, I do think that one day they will get there. So these are baby steps—but we have to start somewhere.” Steinberg agrees. “For now they are just amazing tools,” he says. “And like all the best tools, they’re already enabling us to do some things we probably wouldn’t have done without them.”