Abstract
We develop an autonomous experimentation platform to accelerate interpretable scientific discovery in ultrafast nanophotonics, targeting a novel method to steer spontaneous emission from reconfigurable semiconductor metasurfaces. Despite the potential of reconfigurable semiconductor metasurfaces with embedded sources for spatiotemporal control, achieving arbitrary far-field control remains challenging. Here, we present a self-driving lab (SDL) platform that addresses this challenge by discovering the governing equations for predicting the far-field emission profile from light-emitting metasurfaces. We discover that both the spatial gradient (grating-like) and the curvature (lens-like) of the local refractive index are key factors in steering spontaneous emission. The SDL employs a machine-learning framework comprising: (1) a variational autoencoder for generating complex spatial refractive index profiles, (2) an active learning agent for guiding experiments with real-time closed-loop feedback, and (3) a neural network-based equation learner to uncover structure-property relationships. The SDL demonstrates up to a four-fold enhancement in peak emission directivity (up to 77%) over a 74° field of view within ~300 experiments. Our findings reveal that combinations of positive gratings and lenses are as effective as negative lenses and gratings for all emission angles, offering a novel strategy for controlling spontaneous emission beyond conventional Fourier optics.
Introduction
Self-driving labs (SDLs) represent a transformative approach to scientific discovery, employing machine-learning (ML) models to autonomously conduct experiments1,2,3,4,5,6,7,8,9,10. Current experiments within SDLs focus on low-dimensional problems due to the challenges in interpreting high-dimensional (>10³ degrees of freedom) data structures9,11,12,13,14,15,16,17. Hence, SDLs have primarily concentrated on optimization tasks within low-dimensional or well-understood search spaces, accelerating discovery in material science and chemistry18,19,20,21,22,23,24,25. However, realizing interpretable scientific discovery26 for high-dimensional problems presents a significant challenge, as it involves navigating unknown high-dimensional spaces to establish new verifiable facts or concepts. In addition, tackling high-dimensional problems requires high-throughput (closed-loop) automation of experiments, which further limits SDLs27,28. ML models excel at learning correlations in high-dimensional spaces but struggle with extrapolation and interpretation, especially in the physical sciences, where they often act as “black-boxes”, failing to learn the underlying physical principles29,30,31,32. This inherently limits the generalizability of ML models since researchers cannot explain ‘why’ a particular discovery makes sense, for instance, in the form of an equation representing the process33,34. Since the advancement of scientific research over the past century has been successful in realizing interpretable solutions by following the scientific method35,36,37, we hypothesize that an ML framework implementing the scientific method can realize interpretable discovery. We therefore envision an SDL to generate high-dimensional experiments, select optimal hypothesis-driven experiments for testing, identify features of relevant optimal experiments, and uncover the relationship between these features and experimental results in an interpretable form.
Here, we address two objectives: a) develop a machine-learning framework for autonomous scientific discovery, and b) apply this framework to discover a novel approach to steer spontaneous emission.
To achieve these objectives, we develop an ML framework for autonomous scientific discovery in three steps - specifically to address the needs and limitations of current SDLs:
a) High dimensionality of inputs: We leverage the manifold hypothesis38,39 within the physical sciences, which suggests that a complex physical system requiring a large number of parameters at first glance, can be described with far fewer independent parameters. We employ generative models, specifically variational autoencoders (VAEs)40, to generate high-dimensional experiments beyond state-of-the-art, from a low-dimensional continuous latent space.
b) Cost of Experiments: Active learning (AL)11 then selects optimal experiments from the VAE’s (low-dimensional) latent space, to develop an efficient design of experiments overcoming limitations in exploring high-dimensional spaces. Specifically, active learning predicts the next experiment to be conducted, balancing exploration and exploitation of the input space with appropriate acquisition functions.
c) Interpretability of Results: Understanding experimental results is crucial, yet generative and active learning models often lack human interpretability. To bridge this gap, we develop neural network-based equation learners (nn-EQLs) that uncover interpretable equations26,41. Our approach combines the expressive power of neural networks with physics-driven intuition to learn interpretable structure-property relationships through closed-loop experimental feedback.
We apply our ML framework to the problem of steering spontaneous emission, a challenging task with significant potential for clean energy solutions. Spontaneous light emission, as seen in light-emitting diodes (LEDs) and thermal lamps, lacks spatio-temporal control, but achieving such control could revolutionize fields like remote sensing and holographic displays42,43,44,45,46,47,48. Traditional methods for controlling coherent light (e.g., phased-array optics49 for lasers) are not compatible with spontaneous emission. Light-emitting metasurfaces, composed of sub-wavelength periodic arrays of optical resonators with embedded emitters, offer a novel way to control spontaneous emission, and have demonstrated reconfigurable control of spontaneous emission through spatially periodic refractive index modulation50,51,52,53,54,55,56. However, predicting and controlling emission patterns from aperiodic refractive index modulation remains challenging due to a lack of suitable models and simulation tools (see Supplementary Information Section S1). Our approach involves utilizing the ultrafast (<1 ps) reconfigurability (through refractive index modulation with optical free-carrier injection) of the metasurface to realize nearly arbitrary phased-array optical elements, mimicking arbitrary high-dimensional spatial index profiles. Our SDL leverages the degrees of freedom enabled by the reconfigurability of the metasurfaces to discover the relationship between spatial refractive index profiles and emission patterns through closed-loop, noisy experimental feedback. Based on the exploration results of the AL agent within the latent space of the VAE, we improved the peak directivity by an order of magnitude, realizing up to 67% over a wide field of view (80°) when compared with state-of-the-art devices56, and realized human-interpretable equations describing the steering process (Fig. 1a).
The efficacy of combining generative models with active learning to tackle high-dimensional spaces demonstrates the acceleration of autonomous experimentation platforms towards scientific discovery. Using the results from our self-driving lab, we discovered a novel approach to steer incoherent emission—going beyond conventional Fourier optical principles that are based solely on momentum matching of light. We demonstrate that a spatial index profile that is a combination of a positive (convex) lens and a positive (sawtooth) grating, when imposed on the metasurface, can steer light emission across a 74° field of view. Similarly, a negative (concave) lens and a negative (sawtooth) grating also steer emission across the same field of view, while other spatial index combinations (e.g., positive (negative) lens with a negative (positive) grating) do not steer the emission from our reconfigurable light-emitting metasurface. Through our approach, we provide a potential pathway for augmenting scientific intuition using neural network equation learners, taking a step beyond scientific discovery to understanding the principles governing spontaneous emission.
a Self-driving lab framework: an active learning agent drives a generative model (variational autoencoder, VAE) and the ultrafast optical experiment. A 40 fs pulsed pump laser at 800 nm images a spatial light modulator onto a reconfigurable metasurface (scanning electron microscope image at the top right). This setup measures the directivity of photoluminescence (PL) from the metasurface with closed-loop feedback. The latent space discovered by the active learning surrogate model is recast as a human-interpretable equation using a neural network-based equation learner. b Far-field directivity emission from the metasurface under saw-tooth shaped (uniform spatial momentum) pump patterns applied on the SLM, with the grating order varying from −100 to +100. The directivity (D) of emission is defined as the ratio of the signal towards a given angle (f(θ_i)) to the sum of signals over all angles (Σ_jf(θ_j)).
Results and discussion
To rapidly measure spontaneous emission, we develop an automated, closed-loop, ultrafast momentum-resolved photoluminescence (PL) measurement setup featuring a reconfigurable GaAs metasurface with embedded, light-emitting, InAs quantum dots, grown epitaxially on a reflective distributed Bragg reflector. The local intensity of the ultrafast optical pump (800 nm, 40 fs pulse-width laser, 2–3 mJ/cm² at 1 kHz) dynamically changes the local refractive index of the GaAs metasurface resonators through free-carrier refraction (Fig. 1a). An active learning (AL) agent drives the experiments, sampling the smooth lower-dimensional latent space of a variational autoencoder (VAE) to generate new optical pump patterns. These patterns are projected onto the metasurface as intensity patterns using a spatial light modulator (SLM) to realize the spatial refractive index profile on the metasurface. The far-field directivity of the PL is captured using a lock-in detector scanning the back focal plane of the metasurface emitter (see Supplementary Information S1). For this SDL exemplar, the optical pump pattern that sets up the spatial index profile on the metasurface forms the input image (generated by the VAE) for the experiment, and the far-field intensity distribution of the metasurface emission forms the output, which is used by the AL agent to generate the next experiment.
The GaAs metasurface is designed to achieve nearly a 0–2π phase shift in reflection as a function of local optical pump intensity and is fabricated using electron beam lithography and dry etching as described previously in other work51,56. The metasurface shows overlapping peaks in reflection and PL spectra (Fig. S1), indicating that optical resonances (peaks in spectra) measured in reflection can enhance the far-field emission from the metasurface. We demonstrate that the metasurface steers light emission (Fig. 1b) over an 80° field of view under one-dimensional uniform momentum profiles created by saw-tooth patterns with different spatial frequencies on the optical pump structured using the SLM. Far-field emission directivity measurements (\(D=\frac{f({\theta }_{i})}{{\sum }_{j}f({\theta }_{j})}\), where \(f({\theta }_{i})\) is the steered signal towards angle \({\theta }_{i}\)) reveal a band of emission between ±14° (due to the distributed Bragg reflector substrate) for all applied grating orders, with some emission steered to off-normal angles responding to the pump pattern’s spatial momentum. This result (Fig. 1b) indicates that only part of the metasurface emission follows the known momentum matching principles of spontaneous light emission steering50,51,56,57,58,59,60,61,62,63. Therefore, ML has the potential to discover better solutions beyond momentum matching principles, solving the inverse problem of predicting the optimal spatial refractive index profile to maximize directivity towards a given angle. In our work, the ML framework controls the closed-loop ultrafast optical experiment to maximize the metasurface emission directivity towards a desired angle.
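As a small illustration of the directivity definition used here, the following sketch computes D for one target angle from a sampled far-field scan. The angle grid and signal values are made-up numbers for illustration only, and the function name `directivity` is hypothetical rather than taken from the authors' code.

```python
def directivity(signal, target_index):
    """Return D = f(theta_i) / sum_j f(theta_j) for one target angle."""
    total = sum(signal)
    if total <= 0:
        raise ValueError("total detected signal must be positive")
    return signal[target_index] / total


# Hypothetical far-field scan: detection angles (degrees) and detected signal.
angles = [-28.0, -14.0, 0.0, 14.0, 28.0]
signal = [1.0, 2.0, 4.0, 2.0, 1.0]

d_normal = directivity(signal, angles.index(0.0))
all_d = [directivity(signal, i) for i in range(len(angles))]
```

By construction, the directivities over all sampled angles sum to one, so raising D towards one angle necessarily lowers the share emitted elsewhere.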
We first benchmark the components of our ML framework to ensure that the VAE can generate a wide variety of patterns beyond human intuition40 and that the AL agent can search over known spaces to rediscover known results. We quantify the generative capability of the VAE by visualizing the local-slope distribution of the optical pump patterns generated by the VAE, demonstrating that the generated optical pump patterns exceed the state-of-the-art (training set) by two orders of magnitude (See “Methods” and Supplementary Information S2). We also illustrate that the traditional grating order-based patterns and polynomial patterns lie on a subset of the space of patterns generated by sampling the VAE’s latent space (See Supplementary Information S2). This analysis confirms the validity of the manifold hypothesis in our problem, such that sampling this low-dimensional space is sufficient to discover governing equations relating features of high-dimensional pump patterns to the observed directivity. The AL agent begins by using a limited initial training set (optical pump patterns and their associated directivity) to predict directivity across various optical pump patterns. An acquisition function then identifies patterns that could maximize directivity. The ultrafast, automated PL experiment is then performed by projecting this pattern on the metasurface, and the measured result (and associated noise, statistics) is added to the training set of the AL agent. This loop continues until an optimum is reached, or a pre-set experimental budget is reached. Here, the active learning agent is limited by our experimental budget, and we thus note that maxima in directivity (figure of merit) achieved represent local maxima, and not a global maximum of the optimization problem. 
We find that the AL agent, when searching over the space of possible grating orders, re-discovers the optimal grating order for steering into a given direction with an order of magnitude fewer experiments than brute-force sampling (See “Methods” and Supplementary Information S2). Given the success of the AL agent in finding pump patterns described using a single parameter (grating order), we now use AL to find pump patterns described using a complex set of low-dimensional features, i.e., the VAE latent space.
In Fig. 2a, we show the results of the AL agent maximizing the steered signal towards two different emission angles without prior knowledge (i.e., the AL agent has no prior information about the VAE, the experimental noise, or past results). The AL agent samples the VAE’s latent space, aiming to maximize directivity while minimizing the number of experiments. Each experiment (dot in Fig. 2a) starts with the AL agent predicting a point in the VAE’s latent space representing a potentially high-directivity curve. This point generates a one-dimensional curve (YVAE of length 3840) through the VAE decoder, which is transformed into an optical pump pattern in two steps: i) phase wrapping and normalizing the curve to the 8-bit resolution of the SLM to generate YSLM = (YVAE % 2π)/2π; and ii) repeating YSLM along the orthogonal axis (2160 times) to form a two-dimensional image (3840 × 2160 pixels) projected onto the metasurface. While the VAE could generate patterns with high spatial frequencies beyond the diffraction limit of the optical system, we note that the experimental setup imaging the pump pattern from the SLM to the metasurface will naturally smooth out these high-frequency elements. The noise in the far-field emission has four independent sources: the ultrafast-pulsed laser, SLM, infrared detector, and lock-in amplifier. The directivity of PL is estimated with 10 repeats of the experiment to derive the mean (output) and its standard deviation (noise, see Supplementary Information S1). Combining the VAE’s enhanced generative capability with the AL agent’s efficiency, we maximized the directivity of spontaneous emission from the GaAs metasurface.
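The two-step transformation from a decoder curve to an SLM image can be sketched as follows. This is a minimal illustration in plain Python, not the authors' code: the function name, the floor-based rounding to 8-bit gray levels, and the short demo curve are all assumptions, and a real 3840 × 2160 image would more naturally be built with an array library.

```python
import math


def vae_curve_to_slm_image(y_vae, n_rows=2160, levels=256):
    """Turn a 1-D decoder curve into a 2-D gray-level image for the SLM."""
    two_pi = 2.0 * math.pi
    # Step i: wrap the phase into [0, 2*pi) and normalize to [0, 1].
    y_slm = [(y % two_pi) / two_pi for y in y_vae]
    # Quantize to 8-bit gray levels 0..255 (floor rounding is an assumption;
    # the clamp guards against a wrapped value that rounds up to exactly 1.0).
    row = [min(levels - 1, int(v * levels)) for v in y_slm]
    # Step ii: repeat the same row along the orthogonal axis.
    return [list(row) for _ in range(n_rows)]


# Tiny demo curve (a real decoder output would have 3840 samples).
demo_curve = [0.0, 1.0, 2.0 * math.pi + 1.0, -1.0]
image = vae_curve_to_slm_image(demo_curve, n_rows=3)
```

Values that differ by a multiple of 2π map to the same gray level, which is why the second and third demo samples produce identical pixels.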
a Top (bottom) panel shows the active learning result for the emission angle at +(−) 26.6°. Blue dots represent training points (Sobol sampling) in the VAE latent space, and black points represent improved steering signals learned by the active learning agent. The blue dashed line indicates the average of the training points, while the black line indicates the average of the learning points. Red vertical stripes show measurement errors averaged over 10 repeats. b Relative improvement in peak directivity by the active learning agent across multiple far-field emission angles through closed-loop experimental feedback. The top panel shows the peak directivity of emission optimized for each far-field angle, with the active learning agent’s values in blue and the saw-tooth grating values in red. The bottom panel shows the relative improvement in peak directivity for different emission angles enabled by the active learning agent. c Left panel shows the optimal VAE output (normalized) learned by the active learning agent, and the right panel shows the normalized 1–D pump pattern transforming the VAE output for loading into the SLM: YSLM = (YVAE % 2π)/2π for different emission angles.
We find that the AL agent sampling the VAE’s latent space improves peak directivity across a 74° field of view within 300 closed-loop iterations without prior knowledge (Fig. 2a). Compared to the state-of-the-art saw-tooth grating patterns56, the absolute directivity of emission increases by an average of 2.2x (Fig. 2b), with a peak improvement of 3.77x at 14.4°. Here, we note that the AL agent utilizes the real-time noise (red error bars in Fig. 2a) in each experiment performed by the SDL (see “Methods”). Notably, the absolute directivity peak of 67% is one of the highest reported for classical static LEDs, which typically require bulky reflector lenses and collimators to achieve similar directivity64. We demonstrate that a dynamically reconfigurable light-emitting metasurface can be designed to increase emission directivity based on an aperiodic spatial refractive index pattern imposed on the emitter. The patterns discovered by the AL agent, resulting in high directivity, represent spatial index profiles beyond conventional optical elements, which are typically defined based on symmetries governed by the spatial phase profile of the device. Additionally, we note that the patterns discovered by the AL agent do not have any high-spatial-frequency features beyond the diffraction limit of the system. Thus, the AL agent, leveraging the VAE’s generative potential, provides a novel approach to maximize emission directivity, surpassing state-of-the-art methods to achieve comparable performance to commercial LEDs64 without additional packaging or bulky optics, while retaining the ability to steer emission over 74°.
Remarkably, we discover that the optimal VAE outputs for different emission angles, as identified by the AL agent, can be described as a combination of the spatial phase profiles of a lens and a grating (Fig. 2c). The spatial (x, pixels on the SLM) phase (y) profile of an optical lens (used to focus light) is a parabola: y = ax² (where a represents the lens curvature), while the saw-tooth gratings (used to deflect light) have a linear phase profile: y = bx (where b defines the deflection angle, with b = grating order/3840 pixels). Thus, the VAE patterns discovered by the AL agent can be described as YVAE = ax² + bx, and the final optical pump pattern on the SLM becomes YSLM = ((ax² + bx) % 2π)/2π. This finding, enabled by the AL agent exploiting the generative capability of the VAE, surpasses human intuition, which typically relies on momentum matching (or Fourier transform-based) principles. Current methods for steering light depend solely on grating orders (bx) or the linear spatial gradient established by the refractive index or size profile of the metasurface resonators. Here, for the first time, we discover a fundamentally new way to steer light from light-emitting metasurfaces with high directivity through the AL agent.
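To make the lens-plus-grating description concrete, the sketch below builds the wrapped one-dimensional pump pattern from a curvature a and a gradient b. It is illustrative only: measuring x from the center of the SLM and the specific values of a and b are assumptions, since the text does not state the coordinate origin or typical parameter magnitudes.

```python
import math


def lens_grating_pattern(a, b, n_pixels=3840):
    """Return Y_SLM = ((a*x^2 + b*x) mod 2*pi) / (2*pi) for every SLM pixel.

    Assumption: x is the pixel index measured from the SLM center.
    """
    two_pi = 2.0 * math.pi
    center = n_pixels // 2
    pattern = []
    for i in range(n_pixels):
        x = i - center
        phase = a * x * x + b * x  # lens (quadratic) plus grating (linear) term
        pattern.append((phase % two_pi) / two_pi)
    return pattern


# Illustrative parameter values, not taken from the experiments.
pure_lens = lens_grating_pattern(a=1e-5, b=0.0)
pure_grating = lens_grating_pattern(a=0.0, b=0.05)
combined = lens_grating_pattern(a=1e-5, b=0.05)
```

With b = 0 the pattern is mirror-symmetric about the center, as expected for a lens; adding a nonzero b breaks that symmetry, which is one way to see why the combined profile is aperiodic rather than a simple saw-tooth.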
To formalize our visual insights, we quantify the statistical correlation between the VAE’s latent space (z1–4) and the physical properties of the SLM pump pattern (YSLM): a, the spatial curvature (∂²YVAE/∂x²); b, the spatial gradient (∂YVAE/∂x); A, the average pump intensity (<YSLM>); and ω, the largest spatial frequency (|FourierTransform{YSLM}|max). These features are commonly used to describe the spatial refractive index profile of the optical pump pattern. Spearman correlation coefficients65, averaged over 10,000 VAE-generated curves (using Sobol sampling), reveal: a) z4 weakly correlates with the spatial curvature, a; b) z3 negatively correlates with the average pump intensity, A; and c) the VAE’s latent space dimensions are orthogonal (See “Methods”). The rest of the latent space shows no or weak correlation with other physical properties of the pump pattern. While Spearman correlations indicate isolated correlations, Sobol’ sensitivity indices66,67,68 help us understand the combined correlations of multiple latent space variables. These indices show that no single latent space dimension correlates strongly with a, but combinations of dimensions do. For spatial gradients, only z4 correlates weakly in isolation, but a combination of z1–3 and z4 correlates strongly with b. This indicates that the AL agent discovered a correlated sub-space of patterns with high performance across all angles, which are interpretable and tied to physically relevant quantities. We experimentally validate the AL agent’s discovery using a parameter sweep on the optical pump pattern (Fig. 3c–e, Supplementary Information S3), finding that specific combinations of lens and grating pump patterns result in high directivity. Combining lens and grating pump patterns creates an aperiodic spatial refractive index profile, dynamically reconfiguring the metasurface to achieve high directivity.
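The rank-correlation step can be sketched in a few lines. This is a toy illustration under stated assumptions: `toy_decoder` is a hypothetical stand-in for the trained VAE decoder (it ties curvature to the fourth latent coordinate by construction), the five latent points are hand-picked rather than Sobol-sampled, and the curvature feature is estimated with a simple second finite difference.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)


def spearman(x, y):
    """Spearman coefficient: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


def mean_curvature(curve):
    """Mean second finite difference, a proxy for the spatial curvature a."""
    diffs = [curve[i + 1] - 2.0 * curve[i] + curve[i - 1]
             for i in range(1, len(curve) - 1)]
    return sum(diffs) / len(diffs)


def toy_decoder(z, n_pixels=64):
    # Hypothetical decoder: z[3] sets the curvature, z[0] the slope.
    return [z[3] * x * x + z[0] * x for x in range(n_pixels)]


latents = [(0.3, 0.0, 0.0, -1.0), (-0.2, 0.0, 0.0, -0.5), (0.1, 0.0, 0.0, 0.2),
           (0.4, 0.0, 0.0, 0.9), (-0.5, 0.0, 0.0, 1.5)]
curvatures = [mean_curvature(toy_decoder(z)) for z in latents]
rho_z4_a = spearman([z[3] for z in latents], curvatures)
```

Because the toy decoder makes curvature a monotonic function of the fourth coordinate, the coefficient is exactly one here; with a trained VAE the same computation would return the weaker correlations reported in the text.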
Our work thus reveals a new structure-property relationship governing spontaneous emission steering at the nanoscale, relating aperiodic spatial refractive index (momentum) profiles and directivity beyond current momentum matching principles.
a Spearman correlation coefficient among latent dimensions learnt by the VAE, a,b-spatial curvature and gradient of the pump, A,ω-average intensity and the highest spatial frequency of the SLM pump profile. b Sobol correlations between latent space dimensions of the VAE with the spatial curvature (left panel), and the gradient (right panel) of the optical pump profile. The blue bars indicate correlation with respect to an individual variable (zi) while the red bars highlight the correlation in the presence of other latent space variables. The black stripes on top of the bars indicate the possible error bar in the correlation statistic. c–e Measured directivity of emission from the metasurface as a function of spatial curvature, a, and gradient, b, for steering towards −14°, 0°, and +14°, respectively.
We translate the structure-property relationship discovered by the AL agent and the VAE to a human-interpretable equation describing the directivity (D) of emission as a function of the latent space (a, b) using a neural network-based equation learner (nn-EQL)26,41. The nn-EQL is a two-layer neural network with non-linear activation functions (e.g., addition, t2, sin(t), cos(t), multiplication). We train the nn-EQL to minimize mean-squared error through backpropagation, while using iterative pruning (>90% sparsity)69, to obtain Eq. (1), see Fig. 4. Details on the nn-EQL setup, and the pruning process are described in “Methods”.
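The following sketch illustrates the general idea of an equation-learner layer and of pruning, in a deliberately simplified form. It is not the authors' nn-EQL: it uses a single hidden layer, prunes by output-weight magnitude as a stand-in for "lowest contribution", and all weights are hand-set for illustration. The two retained terms reuse the sin and squared forms quoted later in the text, but the unit coefficient on the sine term is an assumption and the result should not be read as Eq. (1).

```python
import math

# Unary activation units named in the text; "mul" multiplies two linear maps.
UNARY = {"id": lambda t: t, "square": lambda t: t * t,
         "sin": math.sin, "cos": math.cos}


def linear(weights, inputs):
    return sum(w * x for w, x in zip(weights, inputs))


def eql_forward(inputs, units, out_weights):
    """Evaluate one hidden layer of physics-style units plus a linear readout."""
    hidden = []
    for kind, w in units:
        if kind == "mul":
            hidden.append(linear(w[0], inputs) * linear(w[1], inputs))
        else:
            hidden.append(UNARY[kind](linear(w, inputs)))
    return linear(out_weights, hidden)


def prune_smallest(out_weights, sparsity):
    """Zero the smallest-magnitude readout weights to reach the given sparsity."""
    n_zero = int(sparsity * len(out_weights))
    order = sorted(range(len(out_weights)), key=lambda i: abs(out_weights[i]))
    pruned = list(out_weights)
    for i in order[:n_zero]:
        pruned[i] = 0.0
    return pruned


# Hand-set illustrative network acting on inputs (a, b).
units = [("sin", (3.52, -0.04)), ("square", (1.0, -0.033)),
         ("cos", (1.0, 1.0)), ("id", (1.0, 0.0))]
out_weights = [1.0, 0.18, 0.001, -0.002]

pruned = prune_smallest(out_weights, sparsity=0.5)
d_pred = eql_forward((0.5, 2.0), units, pruned)
```

After pruning, only the units with nonzero readout weights remain, so the network can be read off directly as a short closed-form expression in a and b.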
a Our equation learner framework uses a customized neural network with physics-based activation functions defined for each neuron. Stage 1 first performs an initial fit to the dataset, establishing an acceptable error level on the training and validation set (green). Stage 2 then iteratively prunes this network, removing neurons that have the lowest contribution in each layer, which is repeated until the highest level of sparsity (e.g., 90%) is reached before the loss terms increase. Finally, in Stage 3, we write an equation using the neural network’s weights and activation functions and simplify it using sympy75. b The training process (logarithm of the mean squared error of the network vs the training epochs) for generating equations represents the three stages described in Fig. 4a. The dashed horizontal line represents the loss threshold for initiating the pruning of the least contributing neurons in the network. The color bar represents the sparsity progression during the process.
The pruned network is read out as an equation and further simplified using Python packages (e.g., sympy). The nn-EQL distilled equation captures the oscillatory behavior observed in the dataset with a ‘sin(3.52a–0.04b)’ term, while ‘0.18(a–0.033b)²’ describes the dependence on spatial curvature (see Fig. 4a and “Methods”). This equation is a subset of a master equation describing spontaneous emission steering towards 0° as a function of the spatial gradient and curvature of the refractive index profile. Repeating this process for other steering orders allows us to develop a generalizable phenomenological model describing light emission from the metasurface. Examples of such additional equations are provided in the Supplementary Information Section S3. Analyzing these equations, we observe a consistent trend wherein the optimal a/b ratio for maximizing light emission gradually increases as the redirection angle changes from −36° to +36°.
The spatial refractive index profile, representing a combination of a lens and a grating, presents an intriguing perspective from Fourier optics. Current approaches to steer spontaneous emission rely on momentum-matching principles, but our results (Fig. 3) and the obtained equation (Fig. 4) suggest a new operational principle. This principle considers not only momentum matching (using grating orders, b) but also the gradient of momentum (spatial curvature of the index, a), akin to lens-like characteristics. For instance, in Fig. 3c, combining positive lens (a) and positive grating (b) values steers light effectively (high directivity) in the first quadrant. However, combining a positive lens with a negative grating (fourth quadrant) shows no steering. In classical optics, a positive lens collimates an LED source, followed by a grating to steer the emission. The ML framework enables combining the lens and grating into a single optic on the light-emitter itself. Negative grating orders combined with positive lens characteristics (fourth quadrant) completely remove the capacity to steer light for all angles, contrary to classical optics. The second and third quadrants of Fig. 3c present an even more intriguing scenario where a negative (concave) lens with embedded emitters can steer light in the far-field with an additional negative grating order. A negative lens typically increases the divergence of incident light, counterintuitive to collimating a diverging source unless the source was initially converging. Multiple sets of light emission pathways trapped in the substrate may be out-coupled into free space along the measurement direction, but further modeling of the system is necessary to verify this possibility. 
The equivalence in performance between positive and negative lenses (with their corresponding gratings) for steering light emission in the far-field suggests that observed spontaneous light-matter interactions surpass classical Fourier optics’ understanding based only on momentum matching.
In summary, we demonstrate a self-driving nanophotonics lab capable of uncovering and elucidating novel structure-property relationships in nanoscale light-matter interactions. Our approach integrates a generative model (VAE), an active learning agent, and an equation learner network, constituting the core elements of a self-driving lab. Leveraging the VAE’s capacity to generate patterns beyond human intuition from a condensed design space representation, the active learning agent efficiently optimizes for the metasurface’s operational property (directivity of emission) using closed-loop feedback to minimize experimental iterations. The patterns unearthed by the active learning agent across multiple emission angles revealed new latent space variables (such as spatial curvature of the refractive index), enhancing control over spontaneous emission directivity. The nn-EQL distills the active learning agent’s discovery into concise equations, offering much-needed human interpretability to machine-learning models. The active learning agent identified optical pump patterns from the VAE, steering spontaneous emission from resonant metasurfaces 2.2x more effectively than human-intuition-driven saw-tooth pump patterns over a 74° field of view, peaking at 67% at normal incidence. We discovered that a spatial index profile formed by combining a lens and a grating outperforms a grating profile alone in steering spontaneous light emission. Specifically, the agent unveiled that these optical components (lens and grating) may be amalgamated into a single phase-space optical pump pattern on a metasurface, transcending current understanding based on Fourier optics. Moreover, we discover a concise structure-property relationship linking the spatial refractive index profile to emission directivity as an equation, facilitating the realization of energy-efficient spontaneous light sources (such as LEDs, thermal lamps, etc.). 
We anticipate that the methodologies and outcomes demonstrated herein to establish a self-driving ultrafast nanophotonics lab could be generalizable to other physical sciences. This approach has the potential to overcome limitations of human intuition and theoretical frameworks, leveraging generative machine-learning frameworks to drive new scientific discoveries.
Methods
Variational autoencoder (VAE)
We train a VAE to generate arbitrary pump patterns, represented as one-dimensional curves of length 3840 pixels. Given a pump pattern X, a VAE encodes this pump pattern into a low-dimensional latent space representation Z, using an encoder network that learns the distribution Q(Z∣X). In this work, we use a latent space representation of size four, making the latent space a four-dimensional Gaussian distribution. During training, the accuracy of the learnt latent representation Z is determined in part by the ability of the VAE to reconstruct the input pump pattern X from the latent space representation Z. The VAE achieves this reconstruction by using a decoder network that learns P(X∣Z). In this work, we use six fully connected layers for the encoder and the decoder. The objective of the VAE during training is to minimize the evidence lower bound (ELBO) loss:
Here, the first term is the L2-norm between the set of reconstructed pump patterns \(\hat{X}\), and the set of ground truth patterns \(X\). The second term is a KL-divergence that measures the difference between the encoder-learned distribution \(Q({Z|X})\), and the prior distribution \(p(Z)\), assumed here to be \(N\left(0,{I}\right)\). Once the VAE is trained, new pump patterns are generated by sampling the learned latent space of the VAE. The generated pump patterns are then transformed into two-dimensional images by repeating the intensity of the pump pattern along the y-axis. These images are projected onto the SLM for evaluation of spontaneous emission steering. The training set for the VAE encompasses both grating-order-based periodic patterns and aperiodic patterns comprising multiple frequencies and linear/quadratic curves. In total, 50,000 one-dimensional pump patterns are used for VAE training, using the PyTorch package70 and the Adam optimizer71. See Supplementary Information Section S2 for benchmarking of the generative capability of our trained VAE.
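The loss expression itself does not appear in this text. Based only on the two terms described above, it presumably takes a form like the one below; the squared norm, the equal weighting of the two terms, and the diagonal-Gaussian encoder with mean \(\mu_k\) and variance \(\sigma_k^2\) in each of the four latent dimensions are assumptions made for illustration.

\[
\mathcal{L}(X) \;=\; {\left\Vert X-\hat{X}\right\Vert }_{2}^{2} \;+\; {D}_{{\rm{KL}}}\left(Q(Z|X)\,\Vert \,p(Z)\right),\qquad p(Z)=N\left(0,I\right)
\]

\[
{D}_{{\rm{KL}}}\left(Q(Z|X)\,\Vert \,N(0,I)\right) \;=\; \frac{1}{2}\sum_{k=1}^{4}\left({\mu }_{k}^{2}+{\sigma }_{k}^{2}-\ln {\sigma }_{k}^{2}-1\right)
\]

The second expression is the standard closed form for the KL-divergence between a diagonal Gaussian and a unit Gaussian; it vanishes when \(\mu_k = 0\) and \(\sigma_k = 1\) for every latent dimension, that is, when the encoder output matches the prior.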
Active learning on the latent space of the generative model
We search for optimal pump patterns by navigating the latent space of the trained VAE using active learning. Each point in the latent space represents a pump pattern (obtained using the VAE’s decoder network), and active learning efficiently searches the latent space (i.e., the space of pump patterns that could be generated by the VAE) to find optimal pump patterns. We define optimal pump patterns as patterns with high directivity \(D=\frac{f\left({\theta }_{i}\right)}{{\sum }_{j}\left(f\left({\theta }_{j}\right)\right)}\), i.e., patterns that steer emission maximally in a desired direction \({\theta }_{i}\), while minimally steering emission to other angles. Active learning begins with an initial dataset of pump patterns and associated directivity (\(D\)) values. This initial set is obtained by selecting points in the latent space using Sobol’ sampling and passing them through the VAE’s decoder to obtain pump patterns. Directivity is measured using the automated experimental setup (see Supplementary Information), along with its uncertainty. We assume that the directivity measurement contains errors from the pump power (combining modulation and laser measurement) \(\Delta P\), and the thermal noise of the detector \(\Delta S\). The uncertainty in the directivity is thus calculated as \(\frac{\Delta D}{D}=\sqrt{{\left(\frac{\Delta P}{P}\right)}^{2}+{\left(\frac{\Delta S}{S}\right)}^{2}}\). Using this initial dataset, a Gaussian process model GP predicts directivity, with uncertainty, across the latent space, i.e., \(D\left(z\right)={GP}\left(\mu \left(z\right),{K}\left(z,{z}^{\prime} \right)\right)\), where \(z\) is a point in the latent space of the VAE, \(\mu \left(z\right)\) is the mean directivity for the pump pattern represented in the latent space as \(z\), and \(K\left(z,{z}^{\prime} \right)\) is a kernel function representing covariance in directivity between two pump patterns represented as latent space points \(z\) and \(z^{\prime}\). 
See ref. 72 for more details on Gaussian process models. The directivity prediction from the Gaussian process model, along with its uncertainty, is used to determine the next experiment to be conducted. The next experiment \({z}^{*}\) is determined by an acquisition function, such as Expected Improvement (EI), with the intention of balancing exploration and exploitation in the latent space. Specifically, the next experiment with the EI acquisition function is chosen as: \({z}^{*}={{\mathrm{argmax}}}_{z}\,E\left[\max \left({GP}\left(z\right)-{GP}\left({z}^{{curr}}\right),\,0\right)\right]\), where \({z}^{{curr}}\) is the point in the latent space with the best directivity so far, and \(z\) is any point in the latent space. The EI acquisition function thus chooses the next experiment at a latent space point \({z}^{*}\) where the Gaussian process model predicts the highest expected improvement in directivity compared to the best point so far. In this work, we use the Ax package73 for the active learning, using 100 points in the latent space as the initial dataset (sampled using Sobol’ sampling74). Subsequently, 1000 experiments are conducted using the EI acquisition function to find optimal pump patterns.
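The quantities used in this section can be made concrete with a short sketch in plain Python. It is an illustration under stated assumptions rather than the Ax-based implementation used in this work: the emission profile, the relative errors, and the Gaussian process predictions (`mean`, `std`) for each candidate are made-up numbers, the function names are hypothetical, and the expected improvement is evaluated with the standard closed form for a Gaussian predictive distribution.

```python
import math

def directivity(emission, target_index):
    """D = f(theta_i) / sum_j f(theta_j) for a sampled far-field profile."""
    total = sum(emission)
    if total <= 0:
        raise ValueError("emission profile must have positive total intensity")
    return emission[target_index] / total

def directivity_uncertainty(d, rel_pump_err, rel_detector_err):
    """Absolute uncertainty: relative errors dP/P and dS/S added in quadrature."""
    return d * math.sqrt(rel_pump_err ** 2 + rel_detector_err ** 2)

def expected_improvement(mean, std, best):
    """Closed-form E[max(D - best, 0)] for D ~ N(mean, std^2)."""
    if std <= 0.0:
        return max(mean - best, 0.0)
    u = (mean - best) / std
    pdf = math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)
    cdf = 0.5 * (1.0 + math.erf(u / math.sqrt(2.0)))
    return (mean - best) * cdf + std * pdf

def next_experiment(candidates, best):
    """Return the candidate latent point with the largest expected improvement."""
    return max(candidates,
               key=lambda c: expected_improvement(c["mean"], c["std"], best))

# Hypothetical intensities at four emission angles; the target is index 2.
profile = [1.0, 2.0, 5.0, 2.0]
d = directivity(profile, target_index=2)                       # 5 / 10
delta_d = directivity_uncertainty(d, rel_pump_err=0.03, rel_detector_err=0.04)

# Hypothetical GP predictions at three points of the 4D latent space.
candidates = [
    {"z": (0.1, 0.0, 0.0, 0.0), "mean": 0.50, "std": 0.01},  # well explored
    {"z": (0.9, 0.2, 0.0, 0.0), "mean": 0.45, "std": 0.20},  # uncertain
    {"z": (0.3, 0.3, 0.3, 0.3), "mean": 0.30, "std": 0.02},  # confidently poor
]
chosen = next_experiment(candidates, best=d)
```

With these numbers the second candidate is selected even though its predicted mean lies below the current best, because its large predictive uncertainty leaves room for improvement. This is the exploration side of the exploration-exploitation balance described above.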
Analyzing correlations in latent space
Correlations in latent space are analyzed using Spearman correlations and Sobol’ sensitivity indices. The Spearman correlation is defined as: \({\rho }_{a,{b}}=\frac{{\mathrm{cov}}\left({r}_{a},\,{r}_{b}\right)}{\sqrt{{\mathrm{var}}\left({r}_{a}\right){\mathrm{var}}\left({r}_{b}\right)}}\) where \({r}_{i}\) is the rank of variable \(i\) (highest value is rank 1). Spearman correlations range between −1 and 1 and indicate monotonic correlations between pairs of variables. Moving beyond pairs of variables in isolation, Sobol’ sensitivity indices indicate the effect of sets of variables on a quantity of interest. Sobol’ sensitivity indices are of two types: first-order sensitivity indices and total-order sensitivity indices. First-order sensitivity indices are defined as: \({S}_{1,i}=\frac{{{Var}}_{{X}_{i}}\left({E}_{{X}_{ \sim i}}\left({Y|}{X}_{i}\right)\right)}{{Var}(Y)}\), where \({X}_{ \sim i}\) denotes all variables except \({X}_{i}\), and measure the variance in the quantity of interest \(Y\) when varying one variable \({X}_{i}\) and averaging over all other variables. Total-order sensitivity indices are defined as: \({S}_{T,i}=\frac{{E}_{{X}_{ \sim i}}\left({{Var}}_{{X}_{i}}\left({Y|}{X}_{ \sim i}\right)\right)}{{Var}(Y)}\) and measure the variance in the quantity of interest attributable to \({X}_{i}\) together with all of its interactions with the other variables.
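As a worked example of the Spearman definition above, the sketch below ranks two variables (highest value is rank 1, ties sharing their average rank) and computes the Pearson correlation of the ranks. The data are hypothetical values standing in for one latent coordinate and the measured directivity, and the function names are illustrative only. The sketch does not handle a constant input, for which the rank variance is zero.

```python
def ranks(values):
    """Rank values so that the highest gets rank 1; ties share the average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    result = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        average_rank = (start + end) / 2.0 + 1.0
        for k in range(start, end + 1):
            result[order[k]] = average_rank
        start = end + 1
    return result

def spearman(a, b):
    """cov(r_a, r_b) / sqrt(var(r_a) * var(r_b)) computed on the rank vectors."""
    ra, rb = ranks(a), ranks(b)
    n = len(ra)
    mean_a, mean_b = sum(ra) / n, sum(rb) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(ra, rb))
    var_a = sum((x - mean_a) ** 2 for x in ra)
    var_b = sum((y - mean_b) ** 2 for y in rb)
    return cov / (var_a * var_b) ** 0.5

# Hypothetical data: one latent coordinate and a monotonically related directivity.
latent_coordinate = [-1.2, -0.4, 0.1, 0.8, 1.5]
measured_directivity = [0.10, 0.15, 0.22, 0.40, 0.77]
rho = spearman(latent_coordinate, measured_directivity)
```

Because the relationship in this toy data is monotonic, the rank correlation equals 1 even though the dependence is clearly nonlinear, which is the property that makes Spearman correlations suitable for screening latent dimensions.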
Equation learner network
To distill active learning experiments into an interpretable form, we employ a custom equation learner network (EQL). The EQL is formulated as a dense feed-forward neural network (implemented in PyTorch70) with custom activation functions applied to each neuron. These custom activation functions are inspired by terms present in equations in the physical sciences (sin, cos, exp, product, etc.). The EQL is trained in three stages: (1) A two-layer network is trained to achieve an accurate fit to the data (without overfitting). (2) The trained network is pruned to a smaller network that achieves similar accuracy to the network trained in stage 1. We prune the EQL by removing the weights with the least ‘contribution’ in each layer, followed by re-training for a few epochs to allow the remaining weights to adjust to the removal. Contribution here is defined as the product of the weight value and the value of the neuron activation from the previous layer. We use contribution as a metric instead of conventional magnitude-based pruning approaches69 to account for custom activation functions that are not monotonic with weight values (e.g., cos). (3) The pruned network is read out as a human-readable equation using packages such as SymPy75.
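The contribution-based pruning criterion of stage 2 can be sketched as follows. This is a minimal illustration in plain Python and not the code used in this work: the layer is a hypothetical 2×2 weight matrix, `prev_activations` is assumed to hold one representative activation per input neuron (for example a batch average, which is not specified here), and the re-training step that follows each pruning round is omitted.

```python
def prune_by_contribution(weights, prev_activations, n_prune):
    """Zero the n_prune weights with the smallest |weight * input activation|.

    weights[j][i] connects input neuron i to output neuron j;
    prev_activations[i] is a representative activation of input neuron i.
    """
    scored = [
        (abs(w * prev_activations[i]), j, i)
        for j, row in enumerate(weights)
        for i, w in enumerate(row)
        if w != 0.0                      # skip weights that are already pruned
    ]
    scored.sort()                        # smallest contribution first
    pruned = [list(row) for row in weights]   # leave the input untouched
    for _, j, i in scored[:n_prune]:
        pruned[j][i] = 0.0
    return pruned

# Hypothetical layer: input neuron 0 is almost silent, input neuron 1 is active.
weights = [[0.9, -0.05],
           [0.1, 2.0]]
prev_activations = [0.01, 1.0]
pruned = prune_by_contribution(weights, prev_activations, n_prune=2)
```

In this example both weights attached to the nearly silent input are removed, including the relatively large weight 0.9, whereas a purely magnitude-based criterion would instead have removed −0.05 and 0.1.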
Coupling experiments with powerful computing platforms
Our self-driving lab is driven by a laboratory computer capable of instrument control via a Python API. To overcome computational limitations, active learning and neural network equation learning are performed on a local machine with four Tesla V100 GPUs, while experimental instruments are controlled via the laboratory computer.
Data availability
The experimental beam steering data generated in this study have been deposited in the Zenodo database [https://doi.org/10.5281/zenodo.17253431] under the Creative Commons Attribution 4.0 International license. Also included is a Jupyter notebook to visualize the raw data and simple properties of the pump patterns.
Code availability
The code is available at https://github.com/saakethdesai/autoscilab.
References
Flores-Leonar, M. M. et al. Materials acceleration platforms: on the way to autonomous experimentation. Curr. Opin. Green Sustain. Chem. 25, 100370 (2020).
Hickman, R. J., Bannigan, P., Bao, Z., Aspuru-Guzik, A. & Allen, C. Self-driving laboratories: a paradigm shift in nanomedicine development. Matter 6, 1071–1081 (2023).
Lo, S. et al. Review of low-cost self-driving laboratories in chemistry and materials science: the "frugal twin" concept. Dig. Discov. 3, 842–868 (2024).
MacLeod, B. P. et al. A self-driving laboratory advances the Pareto front for material properties. Nat. Commun. 13, 995 (2022).
MacLeod, B. P. et al. Self-driving laboratory for accelerated discovery of thin-film materials. Sci. Adv. 6, eaaz8867 (2020).
Häse, F., Roch, L. M. & Aspuru-Guzik, A. Next-generation experimentation with self-driving laboratories. Trends Chem. 1, 282–291 (2019).
MacLeod, B. P., Parlane, F. G. L., Brown, A. K., Hein, J. E. & Berlinguette, C. P. Flexible automation accelerates materials discovery. Nat. Mater. 21, 722–726 (2022).
Abolhasani, M. & Kumacheva, E. The rise of self-driving labs in chemical and materials sciences. Nat. Synth. 2, 483–492 (2023).
Seifrid, M. Autonomous chemical experiments: challenges and perspectives on establishing a self-driving lab. Acc. Chem. Res. 55, 2454–2466 (2022).
Tabor, D. Accelerating the discovery of materials for clean energy in the era of smart automation. Nat. Rev. Mater. 3, 5–20 (2018).
Gongora, A. A Bayesian experimental autonomous researcher for mechanical design. Sci. Adv. 6, eaaz1708 (2020).
Deneault, J. R. et al. Toward autonomous additive manufacturing: Bayesian optimization on a 3D printer. MRS Bull. 46, 566–575 (2021).
Szymanski, N. J. et al. An autonomous laboratory for the accelerated synthesis of novel materials. Nature 624, 86–91 (2023).
Bateni, F. Autonomous nanocrystal doping by self-driving fluidic micro-processors. Adv. Intell. Syst. 4, 2200017 (2022).
Porwol, L. An autonomous chemical robot discovers the rules of inorganic coordination chemistry without prior knowledge. Angew. Chem. Int. Ed. 59, 11256–11261 (2020).
Abdel-Latif, K. et al. Self-driven multistep quantum dot synthesis enabled by autonomous robotic experimentation in flow. Adv. Intell. Syst. https://doi.org/10.1002/aisy.202170022 (2021).
Sagmeister, P. Autonomous multi-step and multi-objective optimization facilitated by real-time process analytics. Adv. Sci. 9, 2105547 (2022).
Nikolaev, P. Autonomy in materials research: a case study in carbon nanotube growth. npj Comput. Mater. 2, 16031 (2016).
Christensen, M. et al. Data-science-driven autonomous process optimization. Commun. Chem. 4, 112 (2021).
Volk, A. A. & Abolhasani, M. Autonomous flow reactors for discovery and invention. Trends Chem. 3, 519–522 (2021).
Shimizu, R., Kobayashi, S., Watanabe, Y., Ando, Y. & Hitosugi, T. Autonomous materials synthesis by machine learning and robotics. APL Mater. 8, 111110 (2020).
Gao, W., Raghavan, P. & Coley, C. W. Autonomous platforms for data-driven organic synthesis. Nat. Commun. 13, 1075 (2022).
Manzano, J. S. et al. An autonomous portable platform for universal chemical synthesis. Nat. Chem. 14, 1311–1318 (2022).
Stach, E. et al. Autonomous experimentation systems for materials development: a community perspective. Matter 4, 2702–2726 (2021).
Volk, A. A. & Abolhasani, M. Performance metrics to unleash the power of self-driving labs in chemistry and materials science. Nat. Commun. 15, 1378 (2024).
Desai, S. & Strachan, A. Parsimonious neural networks learn interpretable physical laws. Sci. Rep. 11, 12761 (2021).
Kusne, A. G. et al. On-the-fly closed-loop materials discovery via Bayesian active learning. Nat. Commun. 11, 5966 (2020).
Angello, N. H. et al. Closed-loop optimization of general reaction conditions for heteroaryl Suzuki-Miyaura coupling. Science 378, 399–405 (2022).
Linardatos, P., Papastefanopoulos, V. & Kotsiantis, S. Explainable AI: a review of machine learning interpretability methods. Entropy 23, 18 (2020).
Doshi-Velez, F. & Kim, B. Towards a rigorous science of interpretable machine learning. Preprint at arXiv https://doi.org/10.48550/arXiv.1702.08608 (2017).
Gilpin, L. H. et al. Explaining explanations: An overview of interpretability of machine learning. Proceedings - 2018 IEEE 5th International Conference on Data Science and Advanced Analytics, DSAA 2018 80–89 https://doi.org/10.1109/DSAA.2018.00018 (2018).
Goldstein, A., Kapelner, A., Bleich, J. & Pitkin, E. Peeking inside the black box: visualizing statistical learning with plots of individual conditional expectation. J. Comput. Graph. Stat. 24, 44–65 (2015).
Du, M., Liu, N. & Hu, X. Techniques for interpretable machine learning. Commun. ACM 63, 68–77 (2020).
Rudin, C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat. Mach. Intell. 1, 206–215 (2019).
Tsao, J. Y. et al. AI for technoscientific discovery: a human-inspired architecture. J. Creativity 34, 100077 (2024).
Narayanamurti, V. & Tsao, J. Y. How technoscientific knowledge advances: a Bell-Labs-inspired architecture. Res Policy 53, 104983 (2024).
Narayanamurti, V. & Tsao, J. Y. The Genesis of Technoscientific Revolutions: Rethinking the Nature and Nurture of Research (Harvard Univ. Press, 2021).
Narayanan, H. & Mitter, S. Sample complexity of testing the manifold hypothesis. Adv. Neural Inf. Process. Syst. 23, (2010).
Fefferman, C., Mitter, S. & Narayanan, H. Testing the manifold hypothesis. J. Am. Math. Soc. 29, 983–1049 (2016).
Mahmud, M. S., Huang, J. Z. & Fu, X. Variational autoencoder-based dimensionality reduction for high-dimensional small-sample data classification. Int. J. Comput. Intell. Appl. https://doi.org/10.1142/S1469026820500029 (2020).
Sahoo, S. S., Lampert, C. H. & Martius, G. Learning equations for extrapolation and control. 35th Int. Conf. Mach. Learn., ICML 2018 10, 7053–7061 (2018).
Greffet, J.-J. et al. Coherent emission of light by thermal sources. Nature 416, 61–64 (2002).
Guo, Y., Cortes, C. L., Molesky, S. & Jacob, Z. Broadband super-Planckian thermal emission from hyperbolic metamaterials. Appl Phys. Lett. 101, 131106 (2012).
Liberal, I. & Engheta, N. Tailoring thermal emission with epsilon-near-zero media augmented with dielectric rods. in Conference on Lasers and Electro-Optics FM4G.3 https://doi.org/10.1364/CLEO_QELS.2017.FM4G.3 (OSA, Washington, D.C., 2017).
Dyachenko, P. N. et al. Controlling thermal emission with refractory epsilon-near-zero metamaterials via topological transitions. Nat. Commun. 7, 11809 (2016).
Song, W. et al. 3D holographic displays: large-scale Huygens’ metasurfaces for holographic 3D near-eye displays (Laser Photonics Rev. 15(9)/2021). Laser Photonics Rev. 15, 2170047 (2021).
David, A., Benisty, H. & Weisbuch, C. Optimization of light-diffracting photonic-crystals for high extraction efficiency LEDs. J. Disp. Technol. 3, 133–148 (2007).
Krames, M. R. et al. Status and future of high-power light-emitting diodes for solid-state lighting. J. Disp. Technol. 3, 160–175 (2007).
Park, J. et al. All-solid-state spatial light modulator with independent phase and amplitude control for three-dimensional LiDAR applications. Nat. Nanotechnol. 16, 69–76 (2020).
Iyer, P. Unidirectional luminescence from InGaN/GaN quantum-well metasurfaces. Nat. Photon. 14, 543–548 (2020).
Iyer, P. P., Desai, S., Addamane, S., Dingreville, R. & Brener, I. Learning incoherent light emission steering from metasurfaces using generative models. In Proc. IEEE/CVF Winter Conference on Applications of Computer Vision https://doi.org/10.1109/WACV56688.2023.00376 (2023).
Langguth, L., Schokker, A. H., Guo, K. & Koenderink, A. F. Plasmonic phase-gradient metasurface for spontaneous emission control. Phys. Rev. B 92, 205401 (2015).
Liu, S. et al. Light-emitting metasurfaces: simultaneous control of spontaneous emission and far-field radiation. Nano Lett. 18, 6906–6914 (2018).
Lodahl, P. et al. Controlling the dynamics of spontaneous emission from quantum dots by photonic crystals. Nature 430, 654–657 (2004).
Mohtashami, Y. et al. Light-emitting metalenses and meta-axicons for focusing and beaming of spontaneous emission. Nat. Commun. 12, 1–7 (2021).
Iyer, P. P. et al. Sub-picosecond steering of ultrafast incoherent emission from semiconductor metasurfaces. Nat. Photonics 17, 588–593 (2023).
Heki, L. et al. Designing highly directional luminescent phased-array metasurfaces with reciprocity-based simulations. ACS Omega 7, 22477–22483 (2022).
Heki, L. K. et al. Optimizing polarization selective unidirectional photoluminescence from phased-array metasurfaces. Adv. Opt. Mater. 12, 2303186 (2024).
Mohtashami, Y. et al. Metasurface light-emitting diodes with directional and focused emission. Nano Lett. 23, 10505–10511 (2023).
Svilainis, L. LEDs for large displays. Nitride Semiconductor Light-Emitting Diodes (LEDs): Materials, Technologies, and Applications: Second Edition 659–736 https://doi.org/10.1016/B978-0-08-101942-9.00021-6 (2018).
Svilainis, L. & Dumbrava, V. Numerical comparison of LED directivity approximation functions for video displays. Displays 31, 196–204 (2010).
Svilainis, L. LED directivity measurement in situ. Measurement 41, 647–654 (2008).
Chen, Q., Hu, R. & Luo, X. A statistical study to identify the effects of packaging structures on lumen reliability of LEDs. Microelectron. Reliab. 71, 51–55 (2017).
Tsao, J. Y. et al. Toward smart and ultra-efficient solid-state lighting. Adv. Optical Mater. 2, 809–836 (2014).
Spearman, C. The proof and measurement of association between two things. Am. J. Psychol. 15, 72 (1904).
Homma, T. & Saltelli, A. Importance measures in global sensitivity analysis of nonlinear models. Reliab. Eng. Syst. Saf. 52, 1–17 (1996).
Sobol, I. M. On sensitivity estimation for nonlinear mathematical models. Math. Model. Comput. Exp 407, (1993).
Sobol, I. M. Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Math. Comput Simul. 55, 271–280 (2001).
Frankle, J. & Carbin, M. The lottery ticket hypothesis: finding sparse, trainable neural networks. 7th International Conference on Learning Representations, ICLR 2019. (2019).
Paszke, A. et al. PyTorch: An imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 32, https://doi.org/10.48550/arXiv.1912.01703 (2019).
Kingma, D. P. & Ba, J. L. Adam: a method for stochastic optimization. 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings (2014).
Rasmussen, C. E. Gaussian processes in machine learning. Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinforma.) 3176, 63–71 (2004).
Bakshy, E. et al. AE: a domain-agnostic platform for adaptive experimentation. In Conference on Neural Information Processing Systems (2018).
Sobol’, I. M. On the distribution of points in a cube and the approximate evaluation of integrals. USSR Comput. Math. Math. Phys. 7, 86–112 (1967).
Meurer, A. et al. SymPy: symbolic computing in Python. PeerJ Comput. Sci. 3, e103 (2017).
Acknowledgments
This work was supported by the US Department of Energy (DOE), Office of Basic Energy Sciences, Division of Materials Sciences and Engineering and performed, in part, at the Center for Integrated Nanotechnologies, an Office of Science User Facility operated for the US DOE Office of Science. Sandia National Laboratories is a multi-mission laboratory managed and operated by National Technology and Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International, Inc., for the US DOE’s National Nuclear Security Administration under contract no. DE-NA0003525. This Article describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the paper do not necessarily represent the views of the US DOE or the United States Government. This study was funded by the US DOE Basic Energy Science Program (BES20017574) and the Materials Science Research Foundation LDRD program (230710).
Author information
Contributions
S.D., J.Y.T. and P.P.I. conceptualized the idea. P.P.I. designed and fabricated the metasurface. S.A. grew the epitaxial sample used to fabricate the device. P.P.I. and I.B. developed and built the ultrafast optical test setup. S.D., R.D. and P.P.I. developed the machine-learning framework. All authors contributed to the data analysis, writing and review process of the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Yandong Li and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Desai, S., Addamane, S., Tsao, J.Y. et al. Self-driving lab discovers principles for steering spontaneous emission beyond conventional Fourier optics. Nat Commun 17, 204 (2026). https://doi.org/10.1038/s41467-025-66916-0
DOI: https://doi.org/10.1038/s41467-025-66916-0






