A challenge for physicists and mathematicians
The challenge in designing ONNs is that they have very different design restrictions than their digital counterparts. Unlike digital neural networks, optical networks can’t stack many layers of processing steps. “An optical ‘layer’ is a physical object,” explains ARCNL group leader Lyuba Amitonova, “and as light behaves mostly linearly, stacking more optical layers without strong nonlinear elements between them does not add more intelligence.” And because an ONN designed on a computer must eventually be built in the real world, even tiny imperfections can reduce its theoretical performance. An appealing alternative is to train the ONN physically, using real light rather than simulations. But this is difficult. Optical systems cannot easily implement the ‘backpropagation algorithm’ which is the method digital networks use to learn from their mistakes and thus the corner stone of Machine Learning. And adding switching optical elements to create step-by-step processing introduces even more loss and complexity. Because of this, the answer had to be found in much simpler neural networks, with only a single layer.
A pilot project?
This brings us to the joint project of CWI, ARCNL, and Photosynthetic.
The pilot project brought together the mathematics and computation knowledge of Tristan van Leeuwen and Vladyslav Andriiashen at CWI; the optical physics expertise of Lyuba Amitonova and Jakub Kraciuk at ARCNL; and the microfabrication capabilities of Photosynthetic’s Alexander Kostenko. As CWI postdoctoral researcher Vlad said, while he pointed at the screen: “Sometimes all you need are 4 dots.” The outcome of this three-month rollercoaster project was a physical proof-of-principle ONN. The team first trained a one-layer CNN digitally in a simulation of the optical setup, then built its optical counterpart using a spatial light modulator (SLM). After carefully tuning, the setup was able to classify digits. The 4 dots on the screen showed that the optical “lens” could reliably distinguish the number 3 from other numbers. Dots were shown at the top and bottom if the input was a ‘3’ and dots at the left and right if it was another number.
Much more valuable than the 4 dots, however, the project gave a glimpse of the future, where the boundaries between hardware and computation, and physics and mathematics are blurred… like 4 dots that paint one coherent picture. Imagine what large-scale ONNs in the future could do: data centers running AI faster and with a fraction of the electricity, smart devices performing tasks without power-hungry chips and autonomous vehicles and drones making instant decisions using light-speed computation. To realize this vision, though, more research is needed. And it will require a team of experts from all four fields: mathematics, physics, computer science, and manufacturing.