Abstract
The integration of artificial intelligence into microscopy systems significantly enhances performance, optimizing both the image acquisition and analysis phases. The development of artificial intelligence-assisted super-resolution microscopy is often limited by access to large biological datasets, as well as by the difficulty of benchmarking and comparing approaches on heterogeneous samples. We demonstrate the benefits of a realistic stimulated emission depletion microscopy simulation platform, pySTED, for the development and deployment of artificial intelligence strategies for super-resolution microscopy. pySTED integrates theoretically and empirically validated models for photobleaching and point spread function generation in stimulated emission depletion microscopy, simulates realistic point-scanning dynamics and uses a deep learning model to replicate the underlying structures of real images. This simulation environment can be used for data augmentation to train deep neural networks, for the development of online optimization strategies and to train reinforcement learning models. Using pySTED as a training environment allows the reinforcement learning models to bridge the gap between simulation and reality, as showcased by their successful deployment on a real microscope system without fine tuning.
Main
Super-resolution microscopy has played a pivotal role in life sciences by allowing the investigation of the nano-organization of biological samples down to a few tens of nanometres1. Stimulated emission depletion (STED)2, a point-scanning super-resolution fluorescence microscopy modality, routinely allows resolutions of 30–80 nm to be reached in fixed and live samples1. One drawback of STED microscopy is the photobleaching of fluorophores associated with increased light exposure of the sample1,3,4. Photobleaching results in a decrease in fluorescence, limiting the ability to capture multiple consecutive images of a particular area, and may also increase phototoxicity in living samples4,5. In an imaging experiment, photobleaching and phototoxicity need to be minimized by the careful modulation of imaging parameters5,6 or by adopting smart-scanning schemes7,8,9. The integration of artificial intelligence (AI)-assisted smart modules into bioimaging acquisition protocols has been proposed to guide and control microscopy experiments6,7,10,11. However, machine learning (ML) and deep learning (DL) algorithms generally require a large amount of annotated data to be trained, which can be difficult to obtain when working with biological samples. Diversity in curated training datasets also enhances a model’s robustness12,13. Although large annotated datasets of diffraction-limited optical microscopy have been published in recent years14,15, access to such datasets for super-resolution microscopy is still limited, in part due to the complexity of data acquisition and annotation as well as limited access to imaging resources. Similarly, the development of reinforcement learning (RL) methods adapted to the control of complex systems on a wide variety of tasks, in games, robotics or even microscopy imaging, is strongly dependent on the availability of large training datasets and generally relies on accessible, realistic and modular simulation environments11,16,17,18,19,20.
To circumvent this limitation, simulation strategies have been used for high-end microscopy techniques. For instance, in fluorescence lifetime imaging microscopy, it is common practice to use simulation software to generate synthetic measurements to train ML/DL models21. The models can be trained completely in simulation or with few real measurements. Researchers in single-molecule localization microscopy have also adopted simulation tools in their image analysis pipelines to benchmark their algorithms22,23,24. In an earlier work25, a DL model was trained with simulated ground-truth detections and a few experimental images, and was then deployed on real images. Simulation tools are also available for STED microscopy. However, they are limited to theoretical models of the point spread function (PSF)26,27 or effective PSF (E-PSF)8,28, without reproducing the realistic experimental settings that influence the design of STED acquisitions (for example, photobleaching, structures of interest and scanning schemes). This limits the generation of simulated STED datasets and the associated training of ML/DL models for smart STED microscopy modules.
We created a simulation platform, pySTED, that emulates an in silico STED microscope with the aim of assisting in the development of AI methods. pySTED is founded on theoretically and empirically validated models that encompass the generation of the E-PSF in STED microscopy, as well as a photobleaching model3,19,26,29. Additionally, it implements realistic point-scanning dynamics in the simulation process, allowing adaptive scanning schemes and non-uniform photobleaching effects to be mimicked. Realistic samples are simulated in pySTED by using a DL model that predicts the underlying structure (data maps) of real images.
pySTED can benefit the STED and ML communities by facilitating the development and deployment of AI-assisted super-resolution microscopy approaches (Extended Data Fig. 1). It is implemented in a CoLaboratory notebook to help trainees develop their intuition regarding STED microscopy on a simulated system (Extended Data Fig. 1(i)). We demonstrate how the performance of a DL model trained on a semantic segmentation task of nanostructures can be increased using synthetic images from pySTED (Extended Data Fig. 1(ii)). A second experiment shows how our simulation environment can be leveraged to thoroughly validate the development of AI methods and challenge their robustness before deploying them in a real-life scenario (Extended Data Fig. 1(iii)). Last, we show that pySTED enables the training of an RL agent that can learn by interacting with the realistic STED environment, which would not be possible on a real system due to data constraints30. The resulting trained agent can be deployed in real experimental conditions to resolve nanostructures and recover biologically relevant features by bridging the reality gap (Extended Data Fig. 1(iv)).
STED simulation with pySTED
We have built a realistic, open-source STED simulation platform within the Python31 environment, namely, pySTED. pySTED breaks down a STED acquisition into its main constituents: wavelength-dependent focusing properties of the objective lens, fluorophore excitation and depletion, and fluorescence detection. Each step of the acquisition process corresponds to an independent component of the pipeline and is created with its own parameters (Supplementary Tables 1–4) that users can modify according to their experimental requirements (Fig. 1a)26. Generating a synthetic image with the pySTED simulator requires a map of the emitters in the field of view and a specification of the photophysical properties of the fluorophore (Fig. 1a and Supplementary Table 5). The map of fluorophores, referred to as a data map, can consist of automatically generated simple patterns (for example, beads and fibres) or more complex structures generated from real images (Methods). The emission and photobleaching properties of the fluorophores that are implemented in pySTED are inspired by previous theoretical and experimental models3,29. As in a real experiment, the data map is continuously updated during the simulation process to realistically simulate point-scanning acquisition schemes (Fig. 1a–e and Methods).
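As a minimal illustration of this pipeline, the following sketch assembles the components of a simulated microscope and performs a single acquisition. It is modelled on the examples distributed with the open-source pySTED package; the exact class names, argument lists and return values are assumptions that may vary between versions and should be checked against the pySTED documentation.

```python
import numpy as np
from pysted import base  # open-source pySTED package

# Optical components (parameters abbreviated; see Supplementary Tables 1-4).
laser_ex = base.GaussianBeam(635e-9)         # excitation laser
laser_sted = base.DonutBeam(750e-9)          # depletion (STED) laser
detector = base.Detector(noise=True)         # photon detector with noise
objective = base.Objective()                 # objective lens
fluo = base.Fluorescence(690e-9)             # fluorophore photophysics
                                             # (full set in Supplementary Table 5)
microscope = base.Microscope(laser_ex, laser_sted, detector, objective, fluo)

# The data map stores the number of emitters at each position.
molecules = np.zeros((64, 64), dtype=int)
molecules[32, 32] = 50                       # a single bright 'bead'
datamap = base.Datamap(molecules, 20e-9)     # 20 nm pixel size (assumption)

# One point-scanned acquisition: returns the image (photon counts) and the
# photobleached data map, given the pixel dwell time and laser powers.
image, bleached, _ = microscope.get_signal_and_bleach(
    datamap, datamap.pixelsize, 10e-6, 5e-6, 100e-3, bleach=True)
```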
a, Schematic of the pySTED microscopy simulation platform. The user specifies the fluorophore properties (for example, brightness and photobleaching) and the positions of emitters in the data map. A simulation is built from several components (excitation and depletion lasers, detector and objective lens) that can be configured by the user. A low-resolution (Conf) or high-resolution (STED) image of a data map is simulated using the provided imaging parameters. The number of fluorophores at each location in the data map is updated according to their photophysical properties and associated photobleaching effects. b, Modulating the excitation with the depletion beam impacts the effective PSF (E-PSF) of the microscope. c, Time-gating module is implemented in pySTED, which affects the lasers and detection unit. The time-gating parameters of the simulation (gating delay, Tdel; gating time, Tg) as well as the repetition rate of the lasers (τrep) are presented. A grey box is used to indicate when a component is active. d, Two-state Jablonski diagram (ground state, S0; excited state, S1) presents the transitions that are included in the fluorescence (spontaneous decay, \({k}_{{\rm{S}}_{1}}\); stimulated emission decay, kSTED) and photobleaching dynamics (photobleaching rate, kb; photobleached state, β) of pySTED. The vibrational relaxation rate (1/τvib) affects the effective saturation factor in STED. e, Image acquisition is simulated as a two-step process at each location. Acquire (i): convolution of the E-PSF with the number of emitters in the data map (Data map: emitters) is calculated to obtain the signal intensity (Image: photons). Photobleaching (ii): number of emitters at each position in the data map is updated according to the photobleaching probability (line profile from kb, compare the top and bottom lines). The same colour maps as in a are used. f, Realistic data maps are generated from real images. A U-Net model is trained to predict the underlying structure from a real STED image. Convolving the predicted data map with the approximated PSF results in a realistic synthetic image. During training, the mean squared error loss (MSELoss) is calculated between the real and synthetic image. Once trained, the convolution step can be replaced by pySTED. Objective lens in panel a created with BioRender.com.
Realistic data map generation: data maps that can reproduce diverse biological structures of interest are required for the development of a simulation platform that enables the generation of realistic synthetic STED images. Combining primary object shapes such as points, fibres or polygonal structures is efficient and simple for some use cases, but is not sufficient to represent more complex and diverse structures that can be found in real biological samples22,23,24,32. It is essential to reduce the gap between simulation and reality for microscopist trainees or to train AI models on synthetic samples before deployment on real tasks33,34.
We sought to generate realistic data maps by training a DL model to predict the underlying structures from real STED images, which can then be used in synthetic pySTED acquisition. We chose the U-Net architecture, U-Netdata map, as it has been shown to perform well on various microscopy datasets of limited size35,36 (Fig. 1f). We adapted a previously established approach in which a low-resolution image is mapped onto a resolution-enhanced image37,38. Once convolved with an equivalent optical transfer function, the resolution-enhanced synthetic image is compared with the original image.
Here we trained U-Netdata map on STED images of proteins in cultured hippocampal neurons (Methods, Supplementary Fig. 1 and Supplementary Table 6). During the training process, the model aims at predicting the underlying structure (data map) such that its convolution with the approximated PSF of the STED microscope (full-width at half-maximum of ~50 nm, as measured from real STED images) minimizes the mean squared error with the real image (Fig. 1f). After training, given a real image, U-Netdata map generates the underlying structure (Supplementary Fig. 1). From this data map, a synthetic pySTED image can be simulated with different imaging parameters (low or high resolution). Qualitative comparison of the synthetic images acquired in pySTED with the real STED images (Supplementary Fig. 1) shows similar super-resolved structures for different neuronal proteins, confirming the capability of U-Netdata map to predict a realistic data map. We also evaluated the quality of images resulting from data maps generated with U-Netdata map or with a conventional Richardson–Lucy deconvolution (Methods and Supplementary Fig. 2a). As highlighted in Supplementary Fig. 2b,c, the use of U-Netdata map instead of Richardson–Lucy deconvolution to generate data maps in pySTED results in improved synthetic images.
To validate pySTED against a real STED microscope, we characterized its capacity to simulate realistic fluorophore properties by comparing synthetic pySTED images with real STED microscopy acquisitions (Supplementary Table 7). We acquired STED images of the protein bassoon, which had been immunostained with the fluorophore ATTO-647N. We compared the effect of varying the imaging parameters on the pySTED simulation environment and on the real microscope (Supplementary Figs. 3–5). For pySTED, we used the photophysical properties of the fluorophore ATTO-647N from the literature (Supplementary Table 5)3,39. The photobleaching constants (k1 and b) were estimated from the experimental data by using a least-squares fitting method (Methods). Synthetic data maps were generated with U-Netdata map to facilitate the comparison between simulation and reality.
We first compared how the imaging parameters on the real microscope and in pySTED simulations (pixel dwell time, excitation power and depletion power) influenced the image properties by measuring the resolution40 and signal ratio6 (Methods and Supplementary Fig. 3a). As expected, modulating the STED laser power influences the spatial resolution in real experiments and in pySTED simulations. Examples of the acquired and synthetic images are displayed in Supplementary Fig. 3b for visual comparison with different parameter combinations (Supplementary Fig. 3a). The impact of the imaging parameters on the resolution and signal ratio metrics in pySTED agrees with the measurements that were performed on a real microscope. The small deviations can be explained by the variability that is typically observed in the determination of absolute values of fluorophore properties41.
Next, we validated the photobleaching model that is implemented within pySTED. We calculated the photobleaching by comparing the fluorescence signal in a low-resolution image acquired before (CONF1) and after (CONF2) the high-resolution acquisition6 (Methods). For the pixel dwell time, excitation power and other parameters (gating delay, gating time and line repetitions), we measured similar trends between real and synthetic image acquisitions (Supplementary Figs. 4a and 5). For a confocal acquisition, the photobleaching in pySTED is assumed to be 0 (Supplementary Fig. 4a), as it is generally negligible in a real confocal acquisition. Considering the flexibility of pySTED, different photobleaching dynamics specifically tailored for any particular experiment can be implemented and added to the simulation platform. Examples of sequential acquisition (ten images) are presented in Supplementary Fig. 4b, demonstrating the effect of imaging parameters on the photobleaching of the sample. pySTED also integrates background effects that can influence the quality of the acquired images as in real experiments42,43 (Supplementary Fig. 4c,d).
pySTED as a development platform for AI-assisted microscopy
Dataset augmentation for training DL models
DL models are powerful tools to rapidly and efficiently analyse large databanks of images and perform various tasks such as cell segmentation36,44. When no pretrained models are readily available online to solve the task45, fine tuning or training a DL model from scratch requires the tedious process of annotating a dataset. Here we aim to reduce the required number of distinct images for training by using pySTED as an additional data augmentation step. As a benchmark, we used the F-actin segmentation task from ref. 46, where the goal is to segment dendritic F-actin fibres or rings using a small dataset (42 images) of STED images (Fig. 2a and Methods). pySTED was used first as a form of data augmentation to increase the number of images in the training dataset without requiring new annotations. Using U-Netdata map, we generated F-actin data maps and a series of synthetic images in pySTED with various simulation parameters (Fig. 2b and Supplementary Table 8).
a, Segmentation task used in ref. 46 is used, in which the annotations comprise polygonal bounding boxes around F-actin fibres (magenta) and rings (green). b, pySTED is used to augment the training dataset by generating synthetic versions of a STED image. c, AP of the model for the segmentation of F-actin fibres (magenta) and rings (green). The model was trained on the original dataset from ref. 46 (O), and on the same dataset with updated normalization (N) and additional synthetic images (N + S). No significant changes in AP are measured for F-actin fibres, but a significant increase is measured for N + S over O and N for F-actin rings (Supplementary Fig. 6 shows the P values). d, Images were progressively removed from the dataset (100%, 42 images; 75%, 31 images; 50%, 21 images; 25%, 10 images; and 10%, 4 images). Removing more than 50% of the dataset for fibres negatively impacts the models, whereas removing 25% of the dataset negatively impacts the segmentation of rings (N; Supplementary Fig. 6 shows the P values). Adding synthetic images from pySTED during training allows 75% of the original training dataset to be removed without affecting the performance for both structures (N + S; Supplementary Fig. 6 shows the P values). Only the significant changes from the complete dataset are highlighted. The complete statistical analysis is provided in Supplementary Fig. 6. All the box plots show the distribution of five model training scenarios. The box extends from the first to the third quartile of the data, with a line at the median. The whiskers extend from the box to the farthest data point lying within 1.5× the interquartile range (IQR) from the box.
We compared the segmentation performance by using the average precision (AP; Methods) of a DL model trained on the original dataset (O, ref. 46) or with different image normalization and increased data augmentation (N). The segmentation performance was not impacted by increasing the amount of data augmentation (O versus N; Fig. 2c). Adding synthetic images from pySTED (N + S) into the training dataset to improve the diversity of the dataset significantly increases the performance of F-actin rings segmentation compared with O and N, and maintains the performance for the F-actin fibres segmentation (Fig. 2c). In biological experiments, where each image is costly to acquire, reducing the size of the training dataset results in a higher number of images for the post hoc analysis. Hence, we sought to measure the impact of reducing the number of real images in the training dataset by training on subsets of images that are augmented using pySTED (Supplementary Fig. 7). We measured a significant decrease in the AP for F-actin fibres when the model was trained on less than 50% of the images. Removing 25% of the dataset negatively impacts the segmentation performance of F-actin rings (Fig. 2d; Supplementary Fig. 6 shows the P values). However, adding synthetic images from pySTED during training allows the segmentation performance of the model to be maintained by training with only 25% of the original dataset (Fig. 2d; Supplementary Fig. 6 shows the P values).
Validation of AI methods
Benchmarking AI models for automating microscopy tasks on biological samples is challenging due to biological variability and the difficulty of comparing imaging strategies on the same region of interest6,22,47. Assessing and comparing AI models requires multiple attempts in similar but different experimental conditions to limit the impact of biological variability. This inevitably increases the number of biological samples and the time required to develop robust AI-assisted adaptive microscopy strategies that can be deployed on a variety of samples and imaging conditions. pySTED allows the simulation of multiple versions of the same images as if the structure had been imaged with different experimental settings. Here we showcase the capability of pySTED in thoroughly validating ML approaches for the optimization of STED imaging parameters in a simulated controlled environment, enabling more robust performance assessments and comparisons.
We first demonstrate how pySTED can be used to characterize the performance of a multi-armed bandit optimization framework that uses Thompson sampling (TS) for exploration, that is, Kernel-TS. The application of Kernel-TS for the optimization of STED imaging parameters was demonstrated previously, but comparison between different experiments was challenging due to local variations in the size, brightness and photostability of the fluorescently tagged neuronal structures6. Using synthetic images generated with pySTED allows the performance of Kernel-TS to be evaluated on the same image sequence (50 repetitions; Methods) and with controlled photophysical properties of fluorophores (Extended Data Fig. 2 and Supplementary Tables 9 and 10). For experimental settings such as multichannel imaging or adaptive scanning, Kernel-TS is limited by the number of parameters that can be simultaneously optimized (~4) in an online setting6. We, thus, turned to a neural network implementation of TS, which was recently developed to solve the multi-armed bandit framework, LinTSDiag48.
Using pySTED, we could characterize the performance of LinTSDiag on a microscopy optimization task on synthetic images without requiring real biological samples. As described above, LinTSDiag was trained on the same sequence (50 repetitions; Methods) using two different fluorophores (Fig. 3a and Supplementary Table 9). In a simple three-parameter optimization setting, LinTSDiag allows a robust optimization of the signal ratio, photobleaching and spatial resolution for fluorophores with distinct photophysical properties (Fig. 3b). We evaluate the performance of LinTSDiag using the preference score, which is obtained from a network that was trained to predict the preferences of an expert in the imaging optimization objective space (PrefNet; Methods)6. The convergence of the agent in the imaging optimization objective space is supported by the smaller standard deviation measured in the last iterations of the imaging session (Fig. 3c, red lines). pySTED enables a comparison of the optimized parameters for different fluorophores on the same data map. This experiment confirms that optimal parameters vary depending on the photophysical properties (Fig. 3d).
a, pySTED is used to confirm the robustness of a model to the random initialization by repeatedly optimizing (50 repetitions) the imaging parameters on the same sequence of data maps (200 images). Two fluorophores are considered for demonstration purposes (Supplementary Table 10). b, Resulting imaging optimization objectives from LinTSDiag at three different time steps (10, cyan; 100, grey; 190, red) for 50 independent models, which are presented for increasing signal ratio (top to bottom). With time, LinTSDiag acquires images that have a higher preference score for both fluorophores (purple contour lines) and converges into a similar imaging optimization objective space (red points). c, Standard deviation (s.d.) of the imaging optimization objectives and of the preference scores decreases during optimization (cyan to red), supporting the convergence of LinTSDiag in a specific region of the imaging optimization objective space for both fluorophores. The dashed line separates the imaging optimization objectives (R, resolution; P, photobleaching; S, signal ratio) from the preference network (PN). d, Typical pySTED simulations on two different fluorophores (top/bottom) using the optimized parameters on fluorophore A (left) or B (right). Parameters that were optimized for fluorophore A (top left) result in higher photobleaching and maintain a similar resolution and signal ratio on fluorophore B (bottom left) compared with parameters that were optimized for fluorophore B (bottom right). Supplementary Table 10 lists the imaging parameters. e, Example LinTSDiag acquisition of tubulin in kidney epithelial cells (Vero cells) stained with STAR RED at the beginning (left) and end (right) of the optimization. f, Over time, LinTSDiag manages to increase both the resolution and signal ratio of the acquired images (35 images, cyan to red). g, LinTSDiag allows multicolour imaging due to its high-dimensional parameter space capability. LinTSDiag optimizes the averaged resolution and signal ratio from both channels in dual-colour images of the Golgi (STAR ORANGE) and NPC (STAR RED) in Vero cells. h, LinTSDiag can maximize the signal ratio in the images and maintain the resolution of the images (35 images, cyan to red).
LinTSDiag was then deployed on a real microscopy system to simultaneously optimize four parameters (excitation power, STED power, pixel dwell time and line steps) for the imaging of tubulin stained with STAR RED in kidney epithelial cells (Vero cell line). The model was able to optimize the imaging optimization objectives, improve the resolution and signal ratio, and maintain a low level of photobleaching over the course of optimization (Fig. 3e,f and Supplementary Table 9). Then, we sought to increase the number of parameters by tackling a dual-colour imaging scheme (six parameters: excitation power, STED power and line steps for both channels) for the STED imaging of Golgi stained with STAR ORANGE and nuclear pore complex (NPC) stained with STAR RED in Vero cells (Fig. 3g,h and Supplementary Table 9). The optimization framework allows four imaging optimization objectives to be simultaneously optimized (for example, resolution and signal ratio for both colours). As the visual selection of the trade-off in a four-dimensional space is challenging for the user in an online setting, we decided to optimize the combined resolution and signal ratio of both fluorophores (average of the imaging optimization objectives), allowing the users to indicate their preference in a two-dimensional optimization objective space. Online six-parameter optimization of LinTSDiag increases the signal ratio and maintains good image resolution for both imaging channels (Fig. 3h), enabling the resolution of both structures with sub-100-nm resolution.
Next, we developed a model that leverages prior information (context) to solve a task with a high-dimensional action space. This is the case for dynamic intensity minimum (DyMIN) microscopy, which requires the parameter selection, particularly the multiple illumination thresholds, to be adapted to the current region of interest8 (Fig. 4a,b). We previously showed that contextual bandit algorithms can use the confocal image as a context to improve the DyMIN threshold optimization in a two-parameter setting49. In this work, we aim to increase the number of parameters (seven parameters) that can be simultaneously optimized and validate the robustness of LinTSDiag48 (Fig. 4b). We repeatedly trained LinTSDiag on the same data map sequence using the confocal image as prior information (50 repetitions). The parameter selection was compared by measuring whether the action selection correlated over time between the models (Fig. 4c, Supplementary Fig. 8, Supplementary Table 9 and Methods). For instance, the correlation matrix from the last ten images shows clusters of similar parameters that are better defined than for the first ten images (Fig. 4c). This is confirmed by the difference between the 90th and 10th quantiles of the correlation matrix, which rapidly increases with time (Fig. 4d). As expected with clustered policies, the average standard deviation of the action selection for each cluster reduces over time, implying similar parameter selection by the models (Fig. 4e). We also assessed whether the models would adapt their policies to different fluorophores (Fig. 4c,f, light/dark purple). As shown in Fig. 4f, there are specific policies for each fluorophore (for example, fluorophore A, 0 and 3; fluorophore B, 5), demonstrating the capability of the models in adapting their parameter selections to the experimental condition. Although the policies of the models differ, the measured imaging optimization objectives are similar for all the clusters (Fig. 4g), which suggests that different policies can solve this task, unveiling the delicate intricacies of DyMIN microscopy. More importantly, this shows that the model can learn one of the many possible solutions to optimize the imaging task.
a, DyMIN microscopy minimizes the light dose by using thresholds to turn off the high-intensity STED laser when no structure is within the doughnut beam (white regions). b, DyMIN uses a three-step process at each pixel: two thresholds (Thresholds 1 and 2) and two decision times (Decision Times 1 and 2) to turn off the lasers when unnecessary, followed by a normal STED acquisition (Step 3). c, pySTED characterizes LinTSDiag models optimizing seven parameters (STED power, excitation power, pixel dwell time, thresholds 1 and 2, and decision times 1 and 2) using prior task information (confocal image). Convergence of models is evaluated by measuring correlation in action selection (50 models) over time (Supplementary Fig. 8). Clustering of the correlation matrix reveals clusters of policies that are better defined later in the optimization process (right dendrogram, colour coded). Shades of purple on the left represent two fluorophores (light, A; dark, B). d, Difference between the 90th and 10th quantiles of the correlation matrix increases with time, implying better-defined clusters of policies. e, Intracluster s.d. of parameter selection decreases, showing policy convergence in all the clusters. f, Proportion of models per cluster for fluorophore A or B (light and dark, respectively) shows different modes of attraction in the parameter space for fluorophores with distinct photophysical properties (colour code from c). g, Although models converged in different regions of parameter space, the measured imaging optimization objectives (A, artefact) are similar for each cluster (colour code from c). h, Example of real acquisition with LinTSDiag optimization for DyMIN3D of the synaptic protein PSD95 in cultured hippocampal neurons shows confocal (left) and DyMIN (right) (size, 2.88 μm × 2.88 μm × 2 μm). i, Parameter selection convergence in the seven-parameter space is observed (cyan to red; STED, STED power; Exc., excitation power; Pdt., pixel dwell time; Th1–2, DyMIN thresholds; and T1–2, DyMIN decision times). j, LinTSDiag optimization reduces the variability in imaging optimization objectives during the optimization (50 images). The box plot shows distribution in bins of 10 images for 50 repetitions, with boxes extending from the first to the third quartile and whiskers extending to the farthest data point within 1.5× IQR.
The LinTSDiag optimization strategy was deployed in a real-life experiment for the seven-parameter optimization of DyMIN3D imaging of the post-synaptic protein PSD95 in dissociated primary hippocampal neurons stained with STAR 635P. Early during the optimization, the selected parameters produced images with poor resolution or missing structures (artefacts) (Fig. 4h and Supplementary Table 9). The final images were of higher quality (Fig. 4h, right) with fewer artefacts and high resolution. The parameter selection of the model converged in a region of parameter space that could improve all the imaging optimization objectives over the course of optimization (Fig. 4i,j). Parameters optimized with LinTSDiag allowed a significant improvement in the DyMIN3D imaging of PSD95 compared with conventional three-dimensional STED imaging (Supplementary Fig. 9). pySTED allowed us to validate the robustness of the model in a simulated environment before its deployment in a real experimental setting. This should benefit the ML community by allowing the validation of new online ML optimization algorithms on realistic tasks.
Learning through interactions with the system
Online optimization strategies such as Kernel-TS and LinTSDiag were trained from scratch on a new sample, implying a learning phase in which only a fraction of the images meet the minimal image quality requirements. For costly biological samples, there is a need to deploy algorithms that can make decisions based on the environment with a reduced initial exploration phase. Control tasks and sequential planning are particularly well suited for an RL framework where an agent (for example, replacing the microscopist) learns to make decisions by interacting with the environment (for example, select imaging parameters on a microscope) with the aim of maximizing a reward signal (for example, light exposure, signal ratio and resolution) over the course of an episode (for example, imaging session)50. Deep RL agents are (unfortunately) famously data intensive, sometimes requiring millions of examples to learn a single task17,30. This makes them less attractive to be trained on real-world tasks in which each sample can be laborious to obtain (for example, biological samples) or when unsuitable actions can lead to permanent damage (for example, overexposure of the photon detector). Simulation platforms are, thus, essential in RL to provide environments in which an agent can be trained at low cost for subsequent deployment in a real-life scenario51, which is referred to as simulation to reality (Sim2Real) in robotics. Although Sim2Real is widely studied in robotics and autonomous driving, its success for new fields of application is generally dependent on the gap between simulation and reality52.
Here pySTED is used as simulation software to train RL agents. We implemented pySTED in an OpenAI Gym environment (gym-STED) to facilitate the deployment and development of RL strategies for STED microscopy19,53. To highlight the potential of gym-STED to train an RL agent, we crafted the task of resolving nanostructures in simulated data maps of various neuronal structures (Fig. 5a). In gym-STED, an episode unfolds as follows. At each time step, the agent observes the state of the sample: a visual input (image) and the current history (Methods and Fig. 5b). The agent then performs an action (adjusting pixel dwell time, excitation power and depletion power), receives a reward based on the imaging optimization objectives and transitions into the next state. A single-value reward is calculated using a preference network that was trained to rank the imaging optimization objectives (resolution, photobleaching and signal ratio) according to expert preferences6 (Methods). A negative reward is obtained when the selected parameters lead to a high photon count that would be detrimental to the detector in real experimental settings (for example, nonlinear detection of photon counts). This sequence is repeated until the end of the episode, that is, 30 time steps. In each episode, the goal of the agent is to balance detecting the current sample configuration with acquiring high-quality images to maximize its reward (Fig. 5a). We trained a proximal policy optimization (PPO)54 agent and evaluated its performance on diverse fluorophores (Methods). Domain randomization is heavily used within the simulation platform to cover a wide variety of fluorophores and structures and thus increase the generalization properties of the agent55. In Fig. 5c–f (Supplementary Table 11), we report the performance of the agent on a fluorophore with simulated photophysical properties that would result in high brightness (high signal ratio) and high photostability (low photobleaching) in real experiments. The results of the agent on other simulated fluorophore properties are also reported (Supplementary Table 12 and Supplementary Fig. 10). Over the course of training, the agent adapts its policy to optimize the imaging optimization objectives (100k and 12M training steps; Fig. 5c). As expected from RL training, the reward of an agent during an episode is greater at the end of training compared with the beginning (Fig. 5d, red versus cyan). When evaluated on a new sequence, the agent trained over 12M steps rapidly adapts its parameter selection during the episode to acquire images with high resolution and signal ratio and minimize photobleaching (Fig. 5e,f). The agent shows a similar behaviour for various simulated fluorophores (Supplementary Fig. 10). We compared the number of good images acquired by the RL agent with that of bandit optimization for the first 30 images of the optimization. In similar experimental conditions, with the same fluorophore and parameter search space, the average number of good images was 18 ± 3 and 5 ± 3 for the RL agent and bandit, respectively (50 repetitions). This almost fourfold increase in the number of high-quality images highlights the improved efficiency of the RL agent at suggesting optimal imaging parameters.
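The structure of a gym-STED episode described above can be sketched as follows with the standard OpenAI Gym interface. The environment identifier is illustrative and a random policy stands in for the trained PPO agent; the registered environment names and observation details should be checked in the gym-STED repository.

```python
import gym
import gym_sted  # registers the STED environments (pip install gym-sted)

# Illustrative environment ID; see the gym-STED repository for registered names.
env = gym.make("gym_sted:STED-v0")

obs = env.reset()
done, episode_return = False, 0.0
while not done:  # one episode lasts 30 time steps
    # A trained PPO policy would map the observation (images and history)
    # to imaging parameters; random sampling stands in for it here.
    action = env.action_space.sample()  # pixel dwell time, excitation and depletion powers
    obs, reward, done, info = env.step(action)
    episode_return += reward            # preference-network reward (Methods)
print(f"Episode return over 30 acquisitions: {episode_return:.2f}")
```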
a, RL training loop: each episode starts by sampling the photophysical properties of a fluorophore (1) and selecting a structural protein from the databank (2). At each time step, a region of interest (ROI) is selected: a data map is created and a confocal image is generated with pySTED (3). This image is used in the state of the agent (4) to select the subsequent imaging parameters (5). A STED image and a second confocal image are generated in pySTED (6). The imaging optimization objectives and the reward are calculated (Reward & Opt. Obj.) (7). b, State of the agent includes a visual input (current confocal (CONFt) and the previous confocal/STED images (CONFt−1 and STEDt−1)), and incorporates the laser excitation power of the confocal image (c), history of actions (at) and imaging optimization objectives (Ot). The history is zero padded to a fixed length (0). Visual information is encoded using a CNN, and the history, using a fully connected LN. Both encodings are concatenated and fed to an LN model to predict the next action. c, Evolution of the policy (left) and imaging optimization objectives (right) for a fluorophore with high-signal, low-photobleaching properties at the start (cyan; 100k time steps) and end (red; 12M time steps) of the training. A box plot shows the distribution of the average value from the last 10 images of an episode (30 repetitions), with boxes extending from the first to the third quartile and whiskers extending to the farthest data point within 1.5× IQR. d, Evolution of the reward during an episode at the beginning (cyan; 100k time steps) and end (red; 12M time steps) of training. e, Evolution of the policy (left) and imaging optimization objectives (right) after training (12M time steps) during an episode. f, Typical images acquired during an episode (top right, image index; top left, imaging optimization objectives). The STED and second confocal (CONF2) image are normalized to their first confocal (CONF1) images. For the data in c–f, the same fluorophore is used.
Given the capability of the agent in acquiring images for a wide variety of synthetic imaging sequences, we evaluated whether the agent could be deployed in a real experimental setting. The experimental conditions chosen for the simulations were based on the parameter range available on the real microscope. Dissociated primary hippocampal neurons were stained for various neuronal proteins (Fig. 6, Extended Data Fig. 3 and Supplementary Table 11) and imaged on a STED microscope with the assistance of the RL agent for parameter selection. First, we evaluated the performance of our approach for Sim2Real on in-distribution images of F-actin and CaMKII-β in fixed neurons. Although simulated images of both structures were available within the training environment, we wanted to evaluate whether the agent could adapt to real-life imaging settings (Supplementary Fig. 11). As shown in Fig. 6a and Extended Data Fig. 3, the agent resolves the nano-organization of both proteins (Supplementary Fig. 11). We sought to confirm whether the quality of the images was sufficient to extract biologically relevant features (Methods). For both proteins, the measured quantitative features matched values previously reported in the literature, enabling the resolution of the 190 nm periodicity of the F-actin lattice in axons and the size distribution of CaMKII-β nanoclusters56,57,58 (Fig. 6a and Extended Data Fig. 3). Next, we wanted to validate that the agent would adapt its parameter selection to structures, fluorophore properties or imaging conditions that were not included in the training set. We first observed that the agent could adapt to a very bright fluorescent signal and adjust the parameters to limit the photon counts on the detector (Extended Data Fig. 3). The morphology of the imaged PSD95 nanoclusters was in agreement with the values reported in another work59 (Extended Data Fig. 3). We deployed the RL-based optimization scheme for the imaging of the mitochondrial protein TOM20 to evaluate the ability of the agent to adapt to out-of-distribution structures (Fig. 6b). The nano-organization and morphology described previously60 for TOM20 in punctate structures is revealed using the provided imaging parameters in all the acquired images (Fig. 6b and Supplementary Fig. 11). Next, we evaluated the generalizability of the approach to a new imaging context, namely live-cell imaging. We used the optimization strategy for the imaging of the F-actin periodic lattice in living neurons (Fig. 6c). The quality of the acquired images is confirmed by the quantitative measurement of the periodicity, which matches the previously reported value of 190 nm from the literature56,57. Finally, we verified the generalizability of our approach by deploying our RL-assisted strategy on a new microscope and samples (Fig. 6d,e, Extended Data Fig. 4 and Supplementary Table 11). We evaluated the performance of the RL agent for imaging, in fixed Vero cells, of tubulin stained with STAR RED and actin stained with STAR GREEN. The agent successfully adapted to the new imaging conditions, rapidly acquiring high-quality images, even in challenging photobleaching conditions such as STED microscopy of the green-emitting fluorophore STAR GREEN. Using the pySTED simulation environment, we could successfully train RL agents that can be deployed in a variety of real experimental settings to tackle STED imaging parameter optimization tasks.
To the best of our knowledge, this is the first application of RL agents to an online image acquisition task in optical microscopy.
For real microscopy experiments, the agent was trained for over 12M steps in simulation. It was then deployed on a real STED microscope to image diverse proteins in dissociated neuronal cultures and cultured Vero cells. a, Top: simulated images of F-actin in fixed neurons were used during training. Deploying the RL agent to acquire an image of this in-distribution structure in a real experiment allows the periodic lattice of F-actin tagged with phalloidin-STAR635 to be resolved. Bottom: structural parameters extracted from the images (Methods; the dashed vertical line represents the median of the distribution) were compared with the literature value (solid vertical line), showing that the agent adjusted the imaging parameters to resolve the 190 nm periodicity of F-actin56,57. b, Top: the trained agent is tested on the protein TOM20, unseen during training (out of distribution). The nano-organization of TOM20 is revealed in all the acquired images. Bottom: the measured average cluster diameter of TOM20 agrees with the average values reported in ref. 60. c, Top: live-cell imaging of SiR-actin shows the model’s adaptability to different experimental conditions (out of distribution). Bottom: the periodicity of the F-actin lattice is measured and compared with the literature. In a–c, STED images are normalized to their respective confocal image (CONF1). The second confocal image (CONF2) uses the same colour scale as CONF1 to reveal the photobleaching effects. d,e, Images acquired by the RL agent on a different microscope of tubulin (d; STAR RED) and actin (e; STAR GREEN) in fixed Vero cells. Image sequences from top left to bottom right show confocal images before (CONF1) and after (CONF2) photobleaching, with CONF2 normalized to the CONF1 image. The STED images are normalized to the 99th percentile of the intensity of the CONF1 image. Images are 5.12 μm × 5.12 μm. The evolution of the parameter selection (left) and imaging optimization objectives (right) showed that optimal parameters and optimized objectives differed for STAR RED (d) and STAR GREEN (e).
Discussion
We built pySTED, an in silico super-resolution STED environment, which can be used to develop and benchmark AI-assisted STED microscopy. Throughout the synthetic and real experiments, we have demonstrated that it can be used for the development and benchmarking of AI approaches in optical microscopy. The CoLaboratory notebook that was created as part of this work can be used by microscopist trainees to develop their skills and intuition for STED microscopy before using the microscope for the first time. The optimal set of parameters defined in pySTED for a specific fluorophore can guide the parameter choice on a real microscope, but should not replace optimization in real experimental settings to account for environmental effects and biological variability.
The simulation platform was built to be versatile and modular. This allows the users to create and test the efficiency of AI strategies and adaptive imaging schemes before deploying them on a real microscope. For instance, both DyMIN8 and RESCue61 microscopy techniques are readily available to the users. Additionally, the community can contribute open-source modules that would meet their experimental settings.
Smart microscopy requires the development of tools and modules to increase the capabilities of microscopes10,62, which can be challenging when working on a real microscopy system. The development of simulation software is one way to mitigate the difficulty of building an AI-assisted microscopy setup. We mainly focused on the selection of imaging parameters, which is one branch of AI-assisted microscopy, but also showed that pySTED can be successfully applied to data augmentation in supervised learning settings. A recent trend in microscopy focuses on the implementation of data-driven microscopy systems. For example, systems are built to automatically select informative regions or improve the quality of the acquired images63,64. The development and validation of such data-driven systems could be achieved with pySTED. An interesting avenue to pursue for data-driven systems could rely on generative models to create diverse data maps on the fly instead of relying on existing databanks of STED microscopy images, which could be integrated into the modular structure of the pySTED simulation environment. Online ML optimization strategies tested in the pySTED environment showed similar performances when transferred to the real microscopy environment, opening new possibilities to characterize and benchmark innovative data-driven microscopy approaches in pySTED before their deployment on real biological samples.
We also tackle the training of an RL agent, the first for optical microscopy, which would be impossible without access to a large databank of simulated data. The RL agent enables the full automation of the imaging parameter selection on a real system when deployed from gym-STED, an OpenAI Gym environment built around pySTED53. Domain randomization was heavily used within the simulation platform55, which resulted in an RL agent that could adapt its parameter selection to a wide variety of experimental conditions, even in living samples. Such strategies could be transformative in democratizing STED microscopy to a larger diversity of experimental settings and allow non-expert users to acquire high-quality images on a new sample without previous optimization sessions.
Although RL agents can represent a powerful tool to automate microscopy setups, they must be trained on a very large number of examples (for example, 12M steps in this work)17,30, which would be infeasible on a real microscopy setup. The pySTED simulation environment allowed the RL agent to bridge the gap between simulation and reality without requiring any fine tuning. This makes pySTED an appealing platform for RL development as it is particularly well suited for complex control tasks requiring temporally distinct trade-offs to be made20. In this work, the model relied on a constant preference function to convert the multi-objective optimization into a single-reward function. This preference function is ultimately user dependent. It could be complemented in the future by incorporating RL from human feedback into the training of the RL model65,66. In future work, temporal dynamics could also be implemented in pySTED to open new possibilities for fully automating the selection of informative regions and imaging parameters in an evolving environment.
Methods
pySTED simulation platform
Two main software implementations are incorporated within the pySTED simulation platform: (1) PSF calculation and (2) emitter–light interactions.
PSF calculation
PSF calculation in pySTED is inspired by previous theoretical work26,29 (Fig. 1b). As in ref. 26, we calculate the excitation and depletion PSFs by using the electric field (Fig. 1b). The E-PSF is calculated by combining the excitation, depletion and detection PSFs using
where R is the radius of the imaged aperture67 and ζ is the saturation factor of the depletion defined as \(\zeta =\frac{{I}_{{\rm{STED}}}}{{I}_{\rm{s}}}\), with Is being the saturation intensity29. The left side of equation (1) represents the probability that an emitter at position \(\overrightarrow{r}\) contributes to the signal68 and is calculated in pySTED using ηpexc with
where qfl is the quantum yield of the fluorophore, σabs is the absorption cross-section, Φexc is the photon flux from the excitation laser and τSTED is the period of the STED laser. The parameter η allows the excitation probability to be modulated with the depletion laser or allows time gating to be considered during the acquisition29,69. Time gating consists of activating the detector for a small window of time (Tg; typically 8 ns) after a delay (Tdel; typically 750 ps) following the excitation pulse, to prominently detect photons coming from spontaneous emission. The simulations performed with pySTED follow the scheme of pulsed STED microscopy, in which time gating mostly reduces the correlated background69. Following the derivation from ref. 29 and assuming that Tg ≥ τSTED, the emission probability of a fluorophore is described as
where kS1 is the spontaneous decay rate, γ is the effective saturation factor (\(\gamma =\frac{\zeta {k}_{{\rm{vib}}}}{\zeta {k}_{{{\rm{S}}}_{1}}+{k}_{{\rm{vib}}}}\); kvib is the vibrational relaxation rate of the S0′ state) and tSTED is the STED pulse width (Fig. 1c,d). In the confocal case (ISTED = 0), the emission probability simply reduces to
where T is the period between each STED pulse. This allows the probability of spontaneous decay η to be calculated using F(γ)/F(0). The calculated E-PSF is convolved with the data map to simulate the photons that are emitted and the ones measured by the detector.
In real experiments, the number of detected photons is affected by several factors (for example, photon detection and collection efficiency of the detector, detection PSF, fluorophore brightness and so on), which were also integrated in the pySTED simulation environment (Supplementary Tables 1–5). We also included the possibility to add typical sources of noise that occur in a targeted microscopy experiment such as shot noise, dark noise and background noise, all of which are modelled by Poisson processes (Supplementary Table 3).
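A minimal NumPy sketch of this detection step is given below, assuming a Gaussian-approximated E-PSF and a single overall detection-efficiency factor (both assumptions for illustration): the E-PSF is convolved with the data map and the expected photon counts are corrupted by Poisson shot noise, as described above.

```python
import numpy as np
from scipy.signal import fftconvolve

rng = np.random.default_rng(seed=0)

def simulate_detection(datamap, epsf, pdt, efficiency=0.1):
    """Expected photon count per pixel: E-PSF convolved with the emitter
    map, scaled by the pixel dwell time (pdt) and a detection-efficiency
    factor, then sampled from a Poisson distribution (shot noise)."""
    expected = fftconvolve(datamap, epsf, mode="same") * pdt * efficiency
    return rng.poisson(np.clip(expected, 0, None))

# Example: a single emitter cluster imaged with a Gaussian E-PSF (arbitrary units).
yy, xx = np.mgrid[-15:16, -15:16]
epsf = np.exp(-(xx**2 + yy**2) / (2 * 2.0**2))
datamap = np.zeros((64, 64))
datamap[32, 32] = 100.0
image = simulate_detection(datamap, epsf, pdt=10.0)
```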
Emitter–light interactions
In a real microscopy experiment, the emitters can be degraded as they interact with the excitation or depletion light. Photobleaching is the process by which an emitter becomes inactive following light exposure3. In STED microscopy, this process is mainly caused by the combination of the excitation and depletion laser beams3. Photobleaching is an optimization objective that the microscopist must minimize during an imaging session to preserve the sample health and a sufficient imaging contrast. Hence, we implemented a realistic photobleaching model within the pySTED simulation software. The photobleaching model is based on the derivations from ref. 3, which were validated on real samples. Fig. 1d presents the energy states, decay rates and photobleached state β that are used within the photobleaching model.
As in ref. 3, we define the photobleaching rate as
where k0, k1 and b are dependent on the fluorophore and have to be determined experimentally. In the default parameters of pySTED, we assume that the linear photobleaching term is null (k0 = 0) and that photobleaching occurs only from S1 during the STED pulse. Other photobleaching parameters could be easily integrated considering the modular structure of pySTED. We define the effective photobleaching rate kb as the number of emitters transitioning from the S1 state to the photobleached state (Pβ) over the course of a laser period T:
with
In pySTED, the number of emitters N in a pixel is updated by sampling from a binomial distribution with survival probability p = exp(–kbt) for a given dwell time t (Fig. 1e). Although most parameters can be obtained from the literature for a specific fluorophore, some parameters such as k1 and b need to be determined experimentally3. Given some experimental data (or a priori knowledge about the expected photobleaching of a sample), we can estimate the photobleaching properties (k1 and b) of a fluorophore with
by using nonlinear least-squares methods. We can also apply a similar process to estimate the absorption cross-section (σabs) of a fluorophore to optimize the confocal signal intensity to an expected value3.
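A minimal sketch of the binomial survival update described above, assuming for simplicity a single photobleaching rate kb for the whole field of view (in pySTED, kb varies from pixel to pixel with the local laser intensities):

```python
import numpy as np

rng = np.random.default_rng(seed=0)

def photobleach(datamap, k_b, dwell_time):
    """Each emitter survives a pixel dwell independently with probability
    p = exp(-k_b * t), so the surviving count per pixel follows a
    binomial distribution (Methods)."""
    p_survival = np.exp(-k_b * dwell_time)
    return rng.binomial(datamap.astype(int), p_survival)

# Example: ten consecutive dwells over a uniform patch of 100 emitters,
# with an illustrative rate giving ~2% loss per 10 us dwell.
datamap = np.full((32, 32), 100)
for _ in range(10):
    datamap = photobleach(datamap, k_b=2e3, dwell_time=10e-6)
```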
Realistic data maps
A realistic data map, which can be used in pySTED, is generated by predicting the position of emitters in a real super-resolved image. A U-Net model (U-Netdata map, implemented in PyTorch70) is trained to predict the underlying structure of a super-resolved image (Supplementary Table 13). A single U-Netdata map was trained in this Article with images of different subcellular structures (F-actin, tubulin, PSD95, αCaMKII and LifeAct) and was used to generate all the data maps to train and validate the ML, DL and RL models presented in this study. U-Netdata map has a depth of 4 with 64 filters in the first double convolution layer. Padding was used for each convolution layer to maintain the same image size. As in the seminal implementation of U-Net35, maxpool with a kernel and stride of 2 was used. The number of filters in the double convolution layers doubled at each depth. In the encoder part of the model, each convolution is followed by batch normalization and a rectified linear unit (ReLU). Upsampling is performed using transposed convolution. The decoder part of the model uses double convolution layers as in the encoder part of the model. At each depth of the model, features from the encoder are propagated using skip connections and concatenated with the features obtained following the upsampling layer. A last convolution layer is used to obtain a single image, followed by a sigmoid layer.
As previously mentioned, the goal of U-Netdata map is to predict the underlying structure of super-resolved images. Training U-Netdata map in a fully supervised manner would require a training dataset of super-resolved images paired with their underlying structures. However, such a dataset does not currently exist. Mathematically, a microscopy image is obtained from the convolution of the microscope E-PSF with the positions of the fluorophores in the sample. In the images from ref. 6, the E-PSF of the microscope can be approximated by a Gaussian function with a full-width at half-maximum of ~50 nm. Hence, U-Netdata map can be trained to predict a data map that, once convolved with the E-PSF, is similar to the input image (Fig. 1f). The L2 error between the Gaussian-convolved data map and the original input image is used as the loss function to minimize.
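A minimal PyTorch sketch of this training objective is shown below, assuming a Gaussian-approximated E-PSF and an illustrative pixel size of 20 nm (the FWHM-to-pixel conversion is an assumption); the actual U-Netdata map architecture and data pipeline are described in the surrounding text.

```python
import torch
import torch.nn.functional as F

def gaussian_psf(fwhm_px, size=21):
    """Gaussian approximation of the E-PSF with the given FWHM in pixels."""
    sigma = fwhm_px / 2.355                   # FWHM = 2 * sqrt(2 ln 2) * sigma
    ax = torch.arange(size, dtype=torch.float32) - size // 2
    g = torch.exp(-ax**2 / (2 * sigma**2))
    psf = torch.outer(g, g)
    return (psf / psf.sum()).view(1, 1, size, size)

def datamap_loss(predicted_datamap, input_image, psf):
    """The predicted data map, convolved with the approximated PSF, should
    reproduce the real STED image under a mean squared (L2) error."""
    synthetic = F.conv2d(predicted_datamap, psf, padding=psf.shape[-1] // 2)
    return F.mse_loss(synthetic, input_image)

# Example shapes: 224 x 224 images, as in the training set; a 50 nm FWHM
# expressed in 20 nm pixels (assumption).
psf = gaussian_psf(fwhm_px=50 / 20)
pred = torch.rand(1, 1, 224, 224, requires_grad=True)   # stand-in for U-Net output
target = torch.rand(1, 1, 224, 224)                     # stand-in for a STED image
loss = datamap_loss(pred, target, psf)
loss.backward()
```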
To train the model, we used good-quality STED images of diverse neuronal proteins from an existing dataset6 (quality, >0.7). In ref. 6, the quality score of an image was obtained by asking an expert to rate the image on a scale from 0 to 1 based on a qualitative assessment of the resolution of the structures of interest and the signal-to-noise ratio. The quality scores from the original dataset were used to train a DL model to automatically rate the quality of an image. Supplementary Table 6 presents the proteins imaged and the number of images that were used for training. Each 224 × 224 pixel2 image is augmented with three 90° rotations. The Adam optimizer was used with default parameters and a learning rate of 1 × 10−4. The model was trained for 1,000 epochs with a batch size of 32. We selected the model with the best generalization properties on the validation set, obtained from the mean squared error between the input image and the Gaussian-convolved data map.
By default, the predicted data map reconstructs the background noise of the input image; filtering can be applied to the predicted data map to reduce the impact of this noise. The predicted data map can be scaled to the number of emitters expected in a given experimental context and is then converted into integer values. Although U-Netdata map was trained with 224 × 224 pixel2 images, images of arbitrary size can be processed at inference time.
Data maps from deconvolution
Data maps were also generated using the Richardson–Lucy deconvolution implementation from scikit-image71. The E-PSF of the input image was approximated by a Gaussian function with a full-width at half-maximum of ~50 nm. Thirty iterations were used for the deconvolution algorithm.
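A minimal sketch using the scikit-image implementation; the 20 nm pixel size is an assumption, the input image is a random stand-in, and the iteration argument is named num_iter in recent scikit-image versions (iterations in older ones):

```python
import numpy as np
from skimage.restoration import richardson_lucy

def gaussian_psf(fwhm_nm=50.0, pixel_nm=20.0, size=21):
    # Gaussian approximation of the E-PSF, normalized to unit sum.
    sigma = (fwhm_nm / pixel_nm) / 2.355
    ax = np.arange(size) - size // 2
    g = np.exp(-ax**2 / (2 * sigma**2))
    psf = np.outer(g, g)
    return psf / psf.sum()

rng = np.random.default_rng(0)
image = rng.poisson(10, size=(224, 224)).astype(float)  # stand-in image
image /= image.max()  # richardson_lucy expects values in [0, 1] by default

data_map = richardson_lucy(image, gaussian_psf(), num_iter=30)
```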
Imaging optimization objectives
Resolution
We calculated the resolution of the images using the parameter-free image resolution estimation based on decorrelation analysis developed in another work40. The decorrelation analysis was chosen because it transfers readily from simulation to real imaging conditions.
Photobleaching
In all the experiments involving photobleaching as one of the imaging optimization objectives, we measured the loss of fluorescence signal between low-resolution images acquired before (Confocal1) and after (Confocal2) the high-resolution (STED) acquisition6. Photobleaching is defined as

\[ \mathrm{Photobleaching}=1-\frac{\overline{{\mathrm{Confocal2}}_{\mathrm{fg}}}}{\overline{{\mathrm{Confocal1}}_{\mathrm{fg}}}}, \]

where \(\overline{{{\rm{ConfocalX}}}_{{\rm{fg}}}}\) is the average signal of confocal image X within the foreground mask. The foreground mask is determined using an Otsu threshold on the first confocal image (Confocal1).
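This metric can be computed as in the following sketch (function name hypothetical):

```python
import numpy as np
from skimage.filters import threshold_otsu

def photobleaching(confocal1, confocal2):
    """Fractional loss of foreground signal between the confocal images
    acquired before (confocal1) and after (confocal2) the STED image."""
    fg = confocal1 > threshold_otsu(confocal1)  # mask from Confocal1
    return 1.0 - confocal2[fg].mean() / confocal1[fg].mean()
```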
Signal ratio
We calculate the signal ratio as the ratio between the foreground intensity of the high-resolution (STED) image and that of the corresponding confocal image:

\[ \mathrm{Signal\ ratio}=\frac{{\mathrm{STED}}_{\mathrm{fg}}-\overline{{\mathrm{STED}}_{\mathrm{bckg}}}}{{\mathrm{Confocal1}}_{\mathrm{fg}}}. \]

The foreground masks of the STED and confocal images are determined using the Otsu method. The foreground signal within a mask (STEDfg or Confocal1fg) is calculated as the 75th percentile of the corresponding image (STED or Confocal1). \(\overline{{{\rm{STED}}}_{{\rm{bckg}}}}\) represents the mean background signal of the STED image.
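A sketch of this computation, assuming the STED background is the complement of the STED foreground mask (function name hypothetical):

```python
import numpy as np
from skimage.filters import threshold_otsu

def signal_ratio(sted, confocal1):
    """Background-corrected STED foreground signal divided by the confocal
    foreground signal; foreground values are 75th percentiles in the masks."""
    sted_fg = sted > threshold_otsu(sted)
    conf_fg = confocal1 > threshold_otsu(confocal1)
    sted_signal = np.percentile(sted[sted_fg], 75) - sted[~sted_fg].mean()
    return sted_signal / np.percentile(confocal1[conf_fg], 75)
```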
Artefact
We measured the imaging artefacts with a metric inspired by the SQUIRREL72 and MASK-SSIM73 approaches. Specifically, we map the super-resolution image to a low-resolution image using a procedure similar to SQUIRREL, but compare the structures only within a foreground mask. This foreground mask is obtained using the Otsu method on Confocal1. The average structural similarity index (SSIM) on the foreground between the low-resolution image and the mapped optimized super-resolution image is reported as the metric. The value of the artefact metric reported in this Article is

\[ \mathrm{Artefact}=\overline{{\mathrm{SSIM}}_{\mathrm{fg}}\left({\mathrm{Confocal1}},\,\widehat{{\mathrm{STED}}}\right)}, \]

where \(\widehat{{\mathrm{STED}}}\) is the optimized super-resolution image mapped to low resolution.
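Under this reading, the metric can be sketched as follows (the mapping of the super-resolution image to low resolution is assumed to have been performed beforehand; function name hypothetical):

```python
from skimage.filters import threshold_otsu
from skimage.metrics import structural_similarity

def artefact_metric(confocal1, mapped_sted):
    """Mean SSIM between Confocal1 and the low-resolution-mapped STED
    image, restricted to the Confocal1 foreground."""
    fg = confocal1 > threshold_otsu(confocal1)
    _, ssim_map = structural_similarity(
        confocal1, mapped_sted, full=True,
        data_range=float(confocal1.max() - confocal1.min()))
    return ssim_map[fg].mean()
```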
Comparison of pySTED simulations with real acquisitions
We compared pySTED-simulated images with images acquired on a real STED microscope with the same imaging parameters. To evaluate the reliability of the simulations, we acquired ten images, each with a different combination of parameters. Each imaging parameter was varied over a range that is commonly used for routine STED experiments and that would not damage the microscope (for example, the detectors; see the parameters listed in Supplementary Table 7). We used a sample of cultured hippocampal neurons immunostained for the neuronal protein bassoon with the fluorophore ATTO-647N. The small clusters formed by bassoon are well suited for resolution measurements. The same parameter combinations were used in pySTED and on the microscope for a fair comparison.
We optimized the photobleaching constants (k1 and b) and the STED cross-section (σSTED) of the fluorophore to match the measured photobleaching and resolution values using a least-squares method (Supplementary Fig. 4a, right). The procedure is iterative: photobleaching and resolution are optimized sequentially, and the sequence is repeated 15 times, since the optimization of σSTED also affects photobleaching.
For each acquired real STED image, a data map is predicted with U-Netdata map. The number of emitters per pixel is obtained by multiplying the data map by a correction factor f to match the fluorescence signal of the real images. This correction factor f is obtained by fitting the synthetic intensity at pixel (x, y) to the measured intensity of the acquired confocal image, ICONF(x, y). The intensity of the synthetic image is approximated as the convolution of the E-PSF with the data map D, scaled by f:

\[ {I}_{\mathrm{CONF}}(x,y)\approx f\cdot \left(\mathrm{E}\hbox{-}\mathrm{PSF}* D\right)(x,y). \]
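Because f is the only free parameter, the least-squares fit admits a closed form, as in this sketch:

```python
import numpy as np
from scipy.signal import fftconvolve

def correction_factor(conf_image, data_map, epsf):
    """Least-squares scale f such that f * (data_map convolved with the
    E-PSF) best matches the measured confocal intensities I_CONF(x, y)."""
    synthetic = fftconvolve(data_map, epsf, mode="same")
    return np.sum(conf_image * synthetic) / np.sum(synthetic**2)
```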
Weakly supervised learning for the segmentation of F-actin nanostructures
We compared three training schemes (five random initializations per scheme) for a U-Net model segmenting two F-actin nanostructures (fibres and rings): (1) the original model from ref. 46 (O); (2) a model that uses a quantile normalization of the large image (min–max normalization using the 1st and 99th percentiles) and increased data augmentation during training (N; see below); and (3) a model trained as in (2) with additional synthetic images (N + S). In all the conditions, the same architecture, training procedure and dataset were used, as per ref. 46. A model is trained for 700 epochs, and the best model on the validation dataset is kept for testing. To compare only the impact of training, the validation dataset is kept constant in all the training instances.
In all the training schemes, an augmentation has 50% probability of being selected. For the training scheme O (ref. 46), the augmentations consisted of horizontal/vertical flips, intensity scale and gamma adaptation. For the approaches using an increased data augmentation scheme (N and N + S), the augmentations from O are combined with random 90° rotations, crop normalization (1st and 99th percentile) and more intensity scale and gamma adaptation operations.
Synthetic F-actin images
The U-Netdata map model (Fig. 2) was used to extract the data maps of all the valid crops in the training dataset (crops containing >10% dendrite, 256 × 256 pixel2, 25% overlap). Five synthetic images with different resolution and noise properties were simulated for each crop with pySTED, using a parameter combination that would minimally allow the F-actin nanostructures to be resolved (Fig. 2b and Supplementary Table 8).
Generation of subsets
Models (with constant parameter initialization) were trained on subsets of the original dataset to evaluate whether pySTED can help reduce the number of original images required in the training dataset. Five independent subsets with ratios of 0.75, 0.5, 0.25, 0.1, 0.05 and 0.025 were used for training. The ratio of 0.025 corresponds to training on a single image (the full training dataset contains 42 images). When an image is discarded from a subset, its corresponding crops (synthetic included) are also removed from training (Supplementary Fig. 7).
Performance evaluation
AP is used for performance evaluation. The AP is obtained from the precision and recall of the predicted segmentation compared with the ground-truth manual annotations, and corresponds to the area under the interpolated precision–recall curve p*(r). The interpolated precision p*(r) is the maximum precision attained at any recall ri greater than or equal to r, that is,

\[ {p}^{* }(r)=\mathop{\max }\limits_{{r}_{i}\ge r}\,p({r}_{i}). \]

The AP is calculated as

\[ \mathrm{AP}=\int\nolimits_{0}^{1}{p}^{* }(r)\,{\mathrm{d}}r. \]
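A sketch of this computation from paired precision and recall arrays:

```python
import numpy as np

def average_precision(precision, recall):
    """AP as the area under the interpolated precision-recall curve."""
    order = np.argsort(recall)
    r, p = np.asarray(recall)[order], np.asarray(precision)[order]
    # p*(r_i): maximum precision attained at any recall >= r_i.
    p_star = np.maximum.accumulate(p[::-1])[::-1]
    # Step-wise integration of p*(r) over the recall axis.
    widths = np.diff(np.concatenate(([0.0], r)))
    return float(np.sum(widths * p_star))
```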
Multi-objective bandit optimization
The multi-objective bandit optimization aims at finding a set of imaging parameters that simultaneously optimizes all the imaging optimization objectives. Such a multi-objective problem is ill-defined, as there exists a set of Pareto-optimal trade-offs between the objectives, any of which could be used to solve the task. Hence, an external input (for example, a microscopist) is required to make the necessary trade-offs over the course of the optimization session.
Algorithms
The goal of the algorithm is to learn the mapping between the imaging parameters (for example, laser power and pixel dwell time) and the imaging optimization objectives (for example, resolution, photobleaching, artefact or signal ratio) by exploring the parameter space while exploiting its current knowledge of the parameter space to acquire high-quality images. A single model is built for each optimization objective, as in ref. 6. The exploration/exploitation trade-off is achieved via Thompson sampling (TS)74. At each optimization step, a function is sampled from the posterior of each model and the expected imaging optimization objectives associated with each imaging parameter option are combined. The preferred combination is selected and an image is acquired with the associated parameters. The imaging optimization objectives are then calculated from the resulting image and used to update each model.
The range of imaging parameters was normalized in [−1, 1] using a min–max normalization. The min–max values were given from the predefined range of a parameter. All the models are trained from scratch. At the start of each optimization session, three images are acquired with parameter combinations obtained from expert knowledge, allowing the models to gain insights into the imaging task. For further implementations, these parameter combinations could be obtained from (1) a previous imaging session, (2) different fluorophores or (3) publications from the field.
Kernel-TS: implemented following the procedure of ref. 6. The regression model that maps the imaging parameters to the imaging optimization objectives is a non-parametric Gaussian process. All the parameters of the method (for example, kernel bandwidth and bounds on noise) were based on the recommendations of ref. 6. Kernel-TS works on a discrete parameter space of ten points per optimized parameter. The values of the imaging optimization objectives are rescaled using a whitening transformation.
LinTSDiag: a neural network implementation of TS48. LinTSDiag was previously implemented to solve a two-parameter DyMIN task49 (Supplementary Table 14). In this implementation, the neural network is a fully connected network with two layers of hidden size 32. Each layer is followed by a ReLU activation and a dropout layer (probability of 0.2). The last layer of the model projects to a single imaging optimization objective value. The model is implemented in PyTorch70 and relies on the seminal implementation of ref. 48. The loss of the model is the mean squared error, optimized using stochastic gradient descent with a learning rate of 1 × 10−3. After each acquisition, the weights of the model are updated until the error is <1 × 10−3 or 1,000 updates have been performed. During training, the imaging optimization objectives are rescaled to the [0, 1] range.
Two parameters (ν and λ (ref. 48)) control the exploration of the model. Increasing their values results in more exploration. In all the experiments using LinTSDiag, λ = 0.1 is used. The parameter ν varied depending on the task: (1) in simulation, ν = 0.01 (Fig. 3a–c); (2) in four-parameter optimization, ν = 0.1 (Fig. 3e,f); and (3) in six-parameter optimization, ν = 0.25 (Fig. 3g,h).
LinTSDiag handles a continuous parameter space; hence, it is not possible to display all the possible trade-offs. To reduce the number of possibilities, only the Pareto-optimal combinations of optimization objectives are displayed (Pareto front). The Pareto-optimal options are extracted using NSGA-II (ref. 75) with the implementation from the DEAP Python library76. Since computing the Pareto front is computationally expensive, a stopping criterion is used to reduce the calculations77. The stopping criterion is based on the rolling standard deviation of the maximum crowding distance (window size of 10) during the NSGA-II search. The search is stopped when the standard deviation is lower than \(\sqrt{2\times 1{0}^{-4}\times n}\), where n is the number of optimization objectives77. In theory, the NSGA-II search should restart from scratch after each acquisition. However, given the high dimensionality of the parameter space, this may lead to high variability in the proposed parameter combinations. To reduce this variability, a fraction of the previous options is retained as a warm start of the NSGA-II search78. In this work, 30% of the previous options are randomly sampled and used as starting points for the next NSGA-II search. The resulting Pareto front of imaging optimization objectives is given as input to the preference articulation method.
Contextual LinTSDiag: the contextual version of LinTSDiag heavily relies on the implementation of LinTSDiag described above (Supplementary Table 15). In this work, the contextual information was used to solve a DyMIN microscopy task. As previously mentioned, the confocal image serves as the contextual information, but any other contextual information pertinent to the task could be provided to the model. The confocal image is encoded with a two-layer CNN. The first convolution layer (8 filters, kernel size of 3, padding of 1) is followed by a batch normalization layer, a maxpooling layer (size of 4, stride of 4) and a ReLU activation. The second convolution layer (16 filters) is followed by a batch normalization layer. Global average pooling is used to generate a vector embedding, followed by a ReLU activation and a dropout layer with a probability of 0.2. The embedding is projected to 32 features using a fully connected layer, again followed by a ReLU activation and a dropout layer with a probability of 0.2. The contextual features are concatenated with the parameter features (described in LinTSDiag). A single-layer fully connected model with a hidden size of 64 and a ReLU activation at the hidden layer is used to predict the imaging optimization objectives. A single contextual encoder is created and shared between the imaging optimization objectives. The same training procedure and NSGA-II search as in LinTSDiag are used.
The exploration parameters λ = 0.1 and ν = 0.25 were used in simulation (Fig. 4c–g) and in three-dimensional DyMIN optimization (Fig. 4h–j).
Preference articulation
The optimization algorithms output possible trade-offs between the imaging optimization objectives. The preference articulation step consists of selecting the trade-off that is the most relevant for the task. Two preference articulation methods were used in bandit optimization: manual selection and automatic selection6.
Manual selection: this method requests a manual input from the microscopist at each image acquisition. The microscopist is asked to select the trade-off that is in line with their own preferences from the available options (point cloud). This method was used in all the experiments on the real microscope using the bandit optimization scheme (Figs. 3e–h and 4h–j).
Automatic selection: this method aims at reducing the number of interventions from the microscopist in the optimization loop by learning their preferences before the optimization session. In ref. 6, the neural network implementation PrefNet was used to learn the preferences from an expert. In the current work, two PrefNet models were trained from the preferences of an expert. The same model architecture and training procedure were used as that in ref. 6. The first model is trained for STED optimization to select from the imaging optimization objectives including resolution, photobleaching and signal ratio. The second model is trained for the DyMIN optimization to select the trade-off between resolution, photobleaching and artefact. The PrefNet model is used to repeatedly make the trade-offs in multiple optimizations in the simulation environment (Figs. 3b–d and 4c–g).
RL experiments
An RL agent interacts with an environment by sequentially making decisions based on its observations. The goal of the agent is to maximize its reward signal over the course of an episode.
RL formulation: the general RL problem is formalized as a discrete-time stochastic control process, that is, a Markov decision process. An agent starts in a given state \({s}_{t}\in {\mathbf{S}}\) and gathers some partial observations \({o}_{t}\in {\mathbf{O}}\). In a Markov decision process, the state is fully observable, that is, the agent has access to a complete observation of the state st. At each time step t, the agent performs an action \({a}_{t}\in {\mathbf{A}}\) according to its internal policy π and transitions to a state \({s}_{t+1}\in {\mathbf{S}}\) with state transition function \({\mathbf{T}}({s}_{t+1}| {s}_{t},{a}_{t})\). Following the state transition, a reward signal rt = R(st, at, st+1) is provided to the agent as feedback. The goal of the agent is to maximize the cumulative reward over the trajectory τ = (st, at, st+1, at+1, …). Formally, the cumulative reward may be written as

\[ R(\tau )=\mathop{\sum }\limits_{t=0}^{\infty }{\gamma }^{t}{r}_{t}, \]
where γ is a discount factor in the range [0, 1] to temporally weight the reward. Intuitively, using a discount factor close to 1 implies that the credit assignment of the current action is important for future reward, which is the case for a long planning horizon, whereas a discount factor close to 0 reduces the impact of temporally distant rewards50.
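For reference, a minimal computation of this discounted return over one episode:

```python
def discounted_return(rewards, gamma=0.99):
    """Cumulative discounted reward R(tau) = sum_t gamma**t * r_t."""
    return sum(gamma**t * r for t, r in enumerate(rewards))

# Example with gamma = 0.9: 1 + 0.9 * 2 + 0.81 * 3 = 5.23
assert abs(discounted_return([1.0, 2.0, 3.0], gamma=0.9) - 5.23) < 1e-9
```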
Reward function: the optimization of super-resolution STED microscopy is a multi-objective problem (for example, resolution, signal ratio and photobleaching). However, conventional RL settings and algorithms assume access to a single-valued reward function, that is, single-objective optimization50. Several methods have been introduced to solve the multi-objective RL setting, for instance, by simultaneously learning multiple policies or by using a scalarization function (ref. 79 provides a comprehensive review). The scalarization function is simple to implement and allows all the algorithms developed for single-objective RL to be used, but assumes that the preference of the user is known a priori. In this work, the multi-objective RL setting was transformed into a single scalar reward by using the neural network model PrefNet6 developed in the bandit experiments. The PrefNet model was trained to reproduce the trade-offs that an expert is willing to make in the imaging optimization objective space by assigning a value to each combination of imaging optimization objectives. The values predicted by the model are arbitrary, but their ranking is accurate; hence, the values from the PrefNet model are treated as proportional to image quality. The reward of the agent can then be defined using equation (16). As a safety precaution when deploying the agent on a real microscopy system, the agent incurs a reward of −10 when the frequency of photons on the detector is higher than 20 MHz.
Although a negative reward can be used to limit the selection of actions that could damage the microscope, it is not required. For instance, the results shown in Fig. 6d,e and Extended Data Fig. 4 used a version of the reward function that did not include the negative reward. It is worth noting that in these cases, the range of parameters should be carefully selected to avoid damage to the microscope.
Agent: the PPO model54 was used for all the RL experiments. PPO is considered state of the art for many control tasks and is widely used in robotics51. PPO allows a continuous action space, making it suitable for the task of microscopy parameter tuning. It is an on-policy algorithm, meaning that the same policy is used during the data collection and updating phases. The model uses a deep neural network to map the state to the actions. Since PPO is an actor–critic method, it simultaneously learns a policy function and a value function that measures the quality of a selected action (Supplementary Tables 16 and 17). Both functions use the same model architecture. A convolutional neural network (CNN) extracts information from the visual inputs and a linear neural network (LN) extracts information from the history of the episode. The CNN encoder is similar to the one used in ref. 30. The encoder is composed of three convolution layers, each followed by a leaky ReLU activation. The kernel sizes of the layers are 8, 4 and 3, with strides of 4, 2 and 1, respectively, which reduces the spatial size of the state space. The LN model contains two linear layers projecting to sizes of 16 and 4. The outputs of both encoders are concatenated and mapped to the action space using a linear layer.
During training, the Adam optimizer is used with default parameters and a learning rate of 1 × 10−4. The batch size is set to 64. After every 512 steps in the environment, the model is trained for 10 batches randomly sampled from the previous 512 steps. Gradients are clipped to a maximal norm of 1.0 during backpropagation to stabilize training.
Synthetic data maps: a bank of data maps was generated using U-Netdata map. Supplementary Table 18 presents the number of images per structure that were available during training. Data maps were randomly cropped to 96 × 96 pixel2, with a higher probability of sampling crops within the foreground of the data map. Random data augmentations are performed online, each with a 50% probability: a random number (1–3) of 90° rotations, up–down flips and left–right flips. The resulting cropped data map is multiplied by a value sampled from \({\mathcal{N}}(\mu =40,\sigma =4)\) and converted into an integer array using the floor operation.
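A simplified sketch of this augmentation pipeline; the foreground-biased crop sampling is replaced by uniform sampling for brevity:

```python
import numpy as np

def augment_data_map(data_map, rng, crop=96, mu=40.0, sigma=4.0):
    """Random crop, flips/rotations (each with 50% probability) and
    emitter scaling of a synthetic data map."""
    y = rng.integers(0, data_map.shape[0] - crop + 1)
    x = rng.integers(0, data_map.shape[1] - crop + 1)
    patch = data_map[y:y + crop, x:x + crop]
    if rng.random() < 0.5:
        patch = np.rot90(patch, k=rng.integers(1, 4))  # 1-3 quarter turns
    if rng.random() < 0.5:
        patch = np.flipud(patch)
    if rng.random() < 0.5:
        patch = np.fliplr(patch)
    scale = rng.normal(mu, sigma)  # number-of-emitters scaling, N(40, 4)
    return np.floor(patch * scale).astype(int)

rng = np.random.default_rng(0)
crop = augment_data_map(rng.random((224, 224)), rng)
```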
Synthetic fluorophores: synthetic fluorophore properties are generated on the fly during training by uniform sampling. Supplementary Table 19 displays the range of possible fluorophore properties. The parameters k1, b and σabs are optimized using the procedure described in the ‘Comparison of pySTED simulations with real acquisitions’ section. During the optimization, it is assumed that the maximum number of emitters is 40. A scaling factor that depends on the type of structure is used during the optimization (Supplementary Table 20). At each iteration, the fluorophore parameters from the previous iteration are used as the starting point. The initial values of the parameters are k1 = 2.9 × 10−16, b = 1.66 and σabs = 3.2 × 10−21 m2.
STED microscopy experiments
STED imaging
Super-resolution imaging of neuronal proteins was performed on an Expert Line STED system (Abberior Instruments) equipped with a ×100 1.4-numerical-aperture oil-immersion objective lens (Olympus, UPLSAPO100XO), a motorized stage and an auto-focus unit. Far-red dyes were imaged using a 640 nm pulsed diode laser (40 MHz), a 775 nm depletion laser (40 MHz) and an ET685/70 (Chroma) fluorescence filter. Fluorescence was detected on avalanche photodiode detectors with a detection pinhole of approximately 1 Airy unit. Images were processed using Fiji (ImageJ, 1.54f). One- and two-channel imaging of tubulin, NPC and Golgi in Vero cells was performed on an Infinity Line microscope (Abberior Instruments) using the imaging settings described elsewhere8.
Before the optimization, the excitation power for the confocal acquisition needed to be set such that <200 photons were acquired in a 10 μs dwell time. To do so, the excitation power was first set to 10 μW and halved until this criterion was met. This value is used by the model to incorporate knowledge about the brightness of the sample.
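This calibration can be sketched as a simple halving loop, with measure_photons standing in for the (hypothetical) photon-count readout of the microscope:

```python
def set_confocal_excitation(measure_photons, power=10e-6, max_photons=200):
    """Halve the excitation power (starting at 10 uW) until fewer than
    `max_photons` photons are collected in the 10 us dwell time."""
    while measure_photons(power) >= max_photons:
        power /= 2.0
    return power
```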
Kidney epithelial cell culture
Vero B4 cells were obtained from the DSMZ—German Collection of Microorganisms and Cell Cultures. They were maintained in Dulbecco’s modified Eagle’s medium (Gibco) supplemented with GlutaMAX (Thermo Fisher Scientific), 10% foetal bovine serum (Sigma-Aldrich), 1 mM sodium pyruvate (Thermo Fisher Scientific) and penicillin–streptomycin (100 U ml–1 and 0.1 mg ml–1; Sigma-Aldrich) at 37 °C with 5% CO2.
Sample preparation and staining procedures: for indirect immunostaining, cells were fixed in 8% paraformaldehyde in phosphate-buffered saline (PBS) and permeabilized with 0.5% Triton X-100/PBS for NPC (Mab414, 1:200; abcam, code ab24609) and Golgi (giantin, 1:200; abcam, code ab80864) staining. Methanol was used as a fixative for tubulin staining (1:500; abcam, code ab18251).
After blocking with 2% bovine serum albumin/0.1% TWEEN 20/PBS, cells were incubated with the primary antibody for 1 h at the specified dilutions. Detection of primary antibodies was achieved using secondary STAR RED goat anti-mouse IgG (1:200, abberior, code STRED-1001-500UG) and STAR ORANGE goat anti-rabbit IgG (1:200, abberior, code STORANGE-1002-500UG) for the double staining of NPC and Golgi. Tubulin was labelled with STAR RED goat anti-rabbit IgG (1:200, abberior, code STRED-1002-500UG). Secondary antibodies were also incubated for 1 h.
After stringent washing with PBS, cells were mounted in MOUNT SOLID ANTIFADE (abberior, code MM-2013-2X15ML). The protocol was adapted from ref. 80.
Neuronal cell culture
Neuronal cultures were obtained from the hippocampi of neonatal (P0–P1) Sprague Dawley rats, which were sacrificed by decapitation before dissection of the hippocampi, following procedures approved by the Animal Care Committee of Université Laval. The cells were seeded onto 12 mm and 18 mm coverslips coated with poly-d-lysine and laminin for fixed-cell (12 mm, 40,000 cells per coverslip) and live-cell (18 mm, 100,000 cells per coverslip) STED imaging, respectively. Neurons were cultivated in a growth medium composed of Neurobasal and B27 (50:1 ratio), enriched with penicillin–streptomycin (25 U ml–1/25 μg ml–1) and 0.5 mM l-GlutaMAX (Invitrogen). Ara-C (5 μM; Sigma-Aldrich) was added to the medium after five days to limit the proliferation of non-neuronal cells. Twice a week, ~50% of the growth medium was replaced with serum- and Ara-C-free medium. Cells were used between 12 and 16 days in vitro for the experiments.
Sample preparation and staining procedures: fixation was performed for 10 min in a 4% paraformaldehyde solution (paraformaldehyde, 4%; sucrose, 4%; phosphate buffer, 100 mM; Na-EGTA, 2 mM). Neurons were permeabilized with 0.1% Triton X-100, and nonspecific binding sites were blocked for 30 min with 2% goat serum in 20 mM PBS. Primary and secondary antibodies were successively incubated for 2 and 1 h, respectively. Phalloidin was incubated for 1 h. All the incubations were done at room temperature in the blocking solution. Immunostained coverslips were mounted in Mowiol–DABCO for imaging. F-actin was stained with phalloidin-STAR635 (Sigma-Aldrich, cat. no. 30972-20μg, 1:50 dilution). All the antibodies used in this study and the associated concentrations are listed in Supplementary Tables 21 and 22.
For the live-cell experiment (Fig. 6d), the neurons were incubated for 8 min with SiR-actin (0.8 μM, cytoskeleton, cat. no. CY-SC001) diluted in HEPES-buffered artificial cerebrospinal fluid (NaCl, 98; KCl, 5; HEPES, 10; glucose, 10; CaCl2, 0.6; MgCl2, 5; all values are in mM). For live-cell STED microscopy, coverslips were mounted on a quick-release chamber (Wagner Instruments, cat. no. 61-1944) and imaged in HEPES-buffered artificial cerebrospinal fluid comprising 5 mM Mg2+/0.6 mM Ca2+ using a gravity perfusion system.
Quantification of biological structures
F-actin: line profiles of ~1 μm were manually extracted from each image, using a linewidth of 3 pixels to average the profile values. The autocorrelation function (statsmodels library81) was calculated from the intensity profile, and the periodicity of the signal was determined from the position of the first peak maximum.
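A sketch of this analysis; the 20 nm pixel size is an assumption for illustration:

```python
import numpy as np
from scipy.signal import find_peaks
from statsmodels.tsa.stattools import acf

def periodicity_nm(profile, pixel_nm=20.0):
    """Periodicity of an intensity line profile, taken as the lag of the
    first local maximum of its autocorrelation function."""
    autocorr = acf(profile, nlags=len(profile) - 1)
    peaks, _ = find_peaks(autocorr)
    return peaks[0] * pixel_nm if len(peaks) else np.nan
```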
CaMKII-β and PSD95: clusters were segmented using the wavelet segmentation approach from a previous implementation82. The scales used were (1, 2) for STED and (3, 4) for confocal segmentation, with a threshold of 200. Small segmented objects (<3 pixels) were removed and small holes (<6 pixels) were filled. In the STED image segmentation, only the objects within the confocal foreground were considered, and watershed was used to split merged segmented objects, with the local intensity maxima used as initial seeds. Small objects (<3 pixels) resulting from the watershed split were filtered out. The properties of each segmented object were extracted using regionprops from the scikit-image Python library71.
TOM20: a similar approach to that of ref. 60 was used. Briefly, the confocal foreground of each mitochondrion was extracted using the same wavelet segmentation procedure as for CaMKII-β and PSD95. The two-dimensional autocorrelation was calculated on 320 nm × 320 nm square crops centred on each mitochondrion. The diameter of a TOM20 cluster is defined as the standard deviation obtained from a two-dimensional Gaussian fit of the autocorrelation profile.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
All the datasets used to train the models and the corresponding acquired images are available for download at https://s3.valeria.science/flclab-pysted/index.html. Source data are provided with this paper.
Code availability
All code used in this paper is open source. Code for the STED simulation platform, pySTED, is available via GitHub at https://github.com/FLClab/pySTED (ref. 83). The bandit algorithms and training procedures are available via GitHub at https://github.com/FLClab/optim-sted (ref. 84). The gym-STED environment and training routines are available via GitHub at https://github.com/FLClab/gym-sted (ref. 85) and https://github.com/FLClab/gym-sted-pfrl (ref. 86), respectively.
References
Schermelleh, L. et al. Super-resolution microscopy demystified. Nat. Cell Biol. 21, 72–84 (2019).
Hell, S. W. & Wichmann, J. Breaking the diffraction resolution limit by stimulated emission: stimulated-emission-depletion fluorescence microscopy. Opt. Lett. 19, 780–782 (1994).
Oracz, J., Westphal, V., Radzewicz, C., Sahl, S. J. & Hell, S. W. Photobleaching in STED nanoscopy and its dependence on the photon flux applied for reversible silencing of the fluorophore. Sci. Rep. 7, 11354 (2017).
Tosheva, K. L., Yuan, Y., Pereira, P. M., Culley, S. & Henriques, R. Between life and death: strategies to reduce phototoxicity in super-resolution microscopy. J. Phys. D Appl. Phys. 53, 163001 (2020).
Boudreau, C. et al. Excitation light dose engineering to reduce photo-bleaching and photo-toxicity. Sci. Rep. 6, 30892 (2016).
Durand, A. et al. A machine learning approach for online automated optimization of super-resolution optical microscopy. Nat. Commun. 9, 5247 (2018).
Dreier, J. et al. Smart scanning for low-illumination and fast RESOLFT nanoscopy in vivo. Nat. Commun. 10, 556 (2019).
Heine, J. et al. Adaptive-illumination STED nanoscopy. Proc. Natl Acad. Sci. USA 114, 9797–9802 (2017).
Vinçon, B., Geisler, C. & Egner, A. Pixel hopping enables fast STED nanoscopy at low light dose. Opt. Express 28, 4516–4528 (2020).
Scherf, N. & Huisken, J. The smart and gentle microscope. Nat. Biotechnol. 33, 815–818 (2015).
Krull, A., Hirsch, P., Rother, C., Schiffrin, A. & Krull, C. Artificial-intelligence-driven scanning probe microscopy. Commun. Phys. 3, 54 (2020).
Moen, E. et al. Deep learning for cellular image analysis. Nat. Methods 16, 1233–1246 (2019).
von Chamier, L. et al. Democratising deep learning for microscopy with ZeroCostDL4Mic. Nat. Commun. 12, 2276 (2021).
Maška, M. et al. The Cell Tracking Challenge: 10 years of objective benchmarking. Nat. Methods 20, 1010–1020 (2023).
Ljosa, V., Sokolnicki, K. L. & Carpenter, A. E. Annotated high-throughput microscopy image sets for validation. Nat. Methods 9, 637–637 (2012).
Vinyals, O. et al. Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature 575, 350–354 (2019).
Levine, S., Pastor, P., Krizhevsky, A. & Quillen, D. Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection. Int. J. Rob. Res. 37, 421–436 (2018).
Chaware, A., Cooke, C. L., Kim, K. & Horstmeyer, R. Towards an intelligent microscope: adaptively learned illumination for optimal sample classification. In ICASSP 2020—2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 9284–9288 (IEEE, 2020).
Turcotte, B., Bilodeau, A., Lavoie-Cardinal, F. & Durand, A. pySTED: a STED microscopy simulation tool for machine learning training. In Association for the Advancement of Artificial Intelligence—AAAI Workshop on AI to Accelerate Science and Engineering (2022).
Kalinin, S. V. et al. Probe microscopy is all you need. Mach. Learn.: Sci. Technol. 4, 023001 (2023).
Adhikari, M., Houhou, R., Hniopek, J. & Bocklitz, T. Review of fluorescence lifetime imaging microscopy (FLIM) data analysis using machine learning. J. Exp. Theor. Anal. 1, 44–63 (2023).
Sage, D. et al. Super-resolution fight club: assessment of 2D and 3D single-molecule localization microscopy software. Nat. Methods 16, 387–395 (2019).
Lagardère, M., Chamma, I., Bouilhol, E., Nikolski, M. & Thoumine, O. FluoSim: simulator of single molecule dynamics for fluorescence live-cell and super-resolution imaging of membrane proteins. Sci. Rep. 10, 19954 (2020).
Novák, T., Gajdos, T., Sinkó, J., Szabó, G. & Erdélyi, M. TestSTORM: versatile simulator software for multimodal super-resolution localization fluorescence microscopy. Sci. Rep. 7, 951 (2017).
Nehme, E., Weiss, L. E., Michaeli, T. & Shechtman, Y. Deep-STORM: super-resolution single-molecule microscopy by deep learning. Optica 5, 458–464 (2018).
Xie, H., Liu, Y., Jin, D., Santangelo, P. J. & Xi, P. Analytical description of high-aperture STED resolution with 0-2π vortex phase modulation. J. Opt. Soc. Am. A 30, 1640–1645 (2013).
Caprile, F., Masullo, L. A. & Stefani, F. D. PyFocus—a Python package for vectorial calculations of focused optical fields under realistic conditions. Application to toroidal foci. Comput. Phys. Commun. 275, 108315 (2022).
Sarmento, M. J. et al. Exploiting the tunability of stimulated emission depletion microscopy for super-resolution imaging of nuclear structures. Nat. Commun. 9, 3415 (2018).
Leutenegger, M., Eggeling, C. & Hell, S. W. Analytical description of STED microscopy performance. Opt. Express 18, 26417–26429 (2010).
Mnih, V. et al. Human-level control through deep reinforcement learning. Nature 518, 529–533 (2015).
Van Rossum, G. & Drake, F. L. Python 3 Reference Manual (Create Space, 2009).
Sekh, A. A. et al. Physics-based machine learning for subcellular segmentation in living cells. Nat. Mach. Intell. 3, 1071–1080 (2021).
Bolaños, L. A. et al. A three-dimensional virtual mouse generates synthetic training data for behavioral analysis. Nat. Methods 18, 378–381 (2021).
Zhang, Y. et al. Rapid detection of neurons in widefield calcium imaging datasets after training with synthetic data. Nat. Methods 20, 747–754 (2023).
Ronneberger, O., Fischer, P. & Brox, T. U-Net: convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015 Vol. 9351, 234–241 (Springer, 2015).
Falk, T. et al. U-Net: deep learning for cell counting, detection, and morphometry. Nat. Methods 16, 67–70 (2019).
He, Y. et al. Surpassing the resolution limitation of structured illumination microscopy by an untrained neural network. Biomed. Opt. Express 14, 106–117 (2023).
Qiao, C. et al. Rationalized deep learning super-resolution microscopy for sustained live imaging of rapid subcellular processes. Nat. Biotechnol. 41, 367–377 (2022).
ATTO 647N; https://www.atto-tec.com/ATTO-647N.html?language=en
Descloux, A., Grußmayer, K. S. & Radenovic, A. Parameter-free image resolution estimation based on decorrelation analysis. Nat. Methods 16, 918–924 (2019).
Drobizhev, M., Makarov, N. S., Tillo, S. E., Hughes, T. E. & Rebane, A. Two-photon absorption properties of fluorescent proteins. Nat. Methods 8, 393–399 (2011).
Faklaris, O. et al. Quality assessment in light microscopy for routine use through simple tools and robust metrics. J. Cell Biol. 221, e202107093 (2022).
Jahr, W., Velicky, P. & Danzl, J. G. Strategies to maximize performance in stimulated emission depletion (STED) nanoscopy of biological specimens. Methods 174, 27–41 (2019).
Stringer, C., Wang, T., Michaelos, M. & Pachitariu, M. Cellpose: a generalist algorithm for cellular segmentation. Nat. Methods 18, 100–106 (2021).
Ouyang, W. et al. BioImage model zoo: a community-driven resource for accessible deep learning in bioImage analysis. Preprint at bioRxiv https://doi.org/10.1101/2022.06.07.495102 (2022).
Lavoie-Cardinal, F. et al. Neuronal activity remodels the F-actin based submembrane lattice in dendrites but not axons of hippocampal neurons. Sci. Rep. 10, 11960 (2020).
Lamiable, A. et al. Revealing invisible cell phenotypes with conditional generative modeling. Nat. Commun. 14, 6386 (2023).
Zhang, W., Zhou, D., Li, L. & Gu, Q. Neural Thompson sampling. In International Conference on Learning Representations https://doi.org/10.48550/arXiv.2010.00827 (2021).
Bilodeau, A., Bernatchez, R., Michaud-Gagnon, A., Lavoie-Cardinal, F. & Durand, A. Contextual bandit optimization of super-resolution microscopy. In Proc. Canadian Conference on Artificial Intelligence https://doi.org/10.21428/594757db.9d610b29 (2022).
Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction 2nd edn (The MIT Press, 2015).
Zhao, W., Queralta, J. P. & Westerlund, T. Sim-to-real transfer in deep reinforcement learning for robotics: a survey. In 2020 IEEE Symposium Series on Computational Intelligence (SSCI) 737–744 (IEEE, 2020).
Miao, Q., Lv, Y., Huang, M., Wang, X. & Wang, F.-Y. Parallel learning: overview and perspective for computational learning across Syn2Real and Sim2Real. IEEE CAA J. Autom. Sin. 10, 603–631 (2023).
Brockman, G. et al. OpenAI Gym. Preprint at https://arxiv.org/abs/1606.01540 (2016).
Schulman, J., Wolski, F., Dhariwal, P., Radford, A. & Klimov, O. Proximal policy optimization algorithms. Preprint at https://arxiv.org/abs/1707.06347 (2017).
Risi, S. & Togelius, J. Increasing generality in machine learning through procedural content generation. Nat. Mach. Intell. 2, 428–436 (2020).
Xu, K., Zhong, G. & Zhuang, X. Actin, spectrin, and associated proteins form a periodic cytoskeletal structure in axons. Science 339, 452–456 (2013).
D’Este, E., Kamin, D., Göttfert, F., El-Hady, A. & Hell, S. W. STED nanoscopy reveals the ubiquity of subcortical cytoskeleton periodicity in living neurons. Cell Rep. 10, 1246–1251 (2015).
Ferreira, J. S. et al. Distance-dependent regulation of NMDAR nanoscale organization along hippocampal neuron dendrites. Proc. Natl Acad. Sci. USA 117, 24526–24533 (2020).
Nair, D. et al. Super-resolution imaging reveals that AMPA receptors inside synapses are dynamically organized in nanodomains regulated by PSD95. J. Neurosci. 33, 13204–13224 (2013).
Wurm, C. A. et al. Nanoscale distribution of mitochondrial import receptor Tom20 is adjusted to cellular conditions and exhibits an inner-cellular gradient. Proc. Natl Acad. Sci. USA 108, 13546–13551 (2011).
Staudt, T. M. Strategies to Reduce Photobleaching, Dark State Transitions and Phototoxicity in Subdiffraction Optical Microscopy. PhD thesis, Univ. Heidelberg (2009).
Shroff, H., Testa, I., Jug, F. & Manley, S. Live-cell imaging powered by computation. Nat. Rev. Mol. Cell Biol. 25, 443–463 (2024).
Bouchard, C. et al. Resolution enhancement with a task-assisted GAN to guide optical nanoscopy image analysis and acquisition. Nat. Mach. Intell. 5, 830–844 (2023).
Alvelid, J., Damenti, M., Sgattoni, C. & Testa, I. Event-triggered STED imaging. Nat. Methods 19, 1268–1275 (2022).
Christiano, P. F. et al. Deep reinforcement learning from human preferences. In Advances in Neural Information Processing Systems 30 (Curran Associates, 2017).
Rafailov, R. et al. Direct preference optimization: your language model is secretly a reward model. Adv. Neural Inf. Process. Syst. 36, 53728–53741 (2023).
Willig, K. I., Keller, J., Bossi, M. & Hell, S. W. STED microscopy resolves nanoparticle assemblies. New J. Phys. 8, 106–106 (2006).
Hoeller, M. Advanced Fluorescence Fluctuation Spectroscopy with Pulsed Interleaved Excitation. PhD thesis, Ludwig-Maximilians-Univ. München (2011).
Vicidomini, G. et al. STED nanoscopy with time-gated detection: theoretical and experimental aspects. PLoS ONE 8, e54421 (2013).
Paszke, A. et al. PyTorch: an imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems 32 (Curran Associates, 2019).
van der Walt, S. et al. Scikit-image: image processing in Python. PeerJ 2, e453 (2014).
Culley, S. et al. Quantitative mapping and minimization of super-resolution optical imaging artifacts. Nat. Methods 15, 263–266 (2018).
Gao, S. et al. DETECTOR: structural information guided artifact detection for super-resolution fluorescence microscopy image. Biomed. Opt. Express 12, 5751–5769 (2021).
Thompson, W. R. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25, 285–294 (1933).
Deb, K., Pratap, A., Agarwal, S. & Meyarivan, T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6, 182–197 (2002).
Fortin, F.-A., Rainville, F.-M. D., Gardner, M.-A., Parizeau, M. & Gagné, C. DEAP: evolutionary algorithms made easy. J. Mach. Learn. Res. 13, 2171–2175 (2012).
Roudenko, O. & Schoenauer, M. A steady performance stopping criterion for Pareto-based evolutionary algorithms. In 6th International Multi-Objective Programming and Goal Programming Conference (2004).
Deb, K., Rao N., U. B. & Karthik, S. Dynamic multi-objective optimization and decision-making using modified NSGA-II: a case study on hydro-thermal power scheduling. In Evolutionary Multi-Criterion Optimization Vol. 4403 (Springer, 2007).
Hayes, C. F. et al. A practical guide to multi-objective reinforcement learning and planning. Auton. Agents Multi-Agent Syst. 36, 26 (2022).
Wurm, C. A., Neumann, D., Schmidt, R., Egner, A. & Jakobs, S. Sample preparation for STED microscopy. In Live Cell Imaging: Methods and Protocols Vol. 591 (Humana Press, 2010).
Seabold, S. & Perktold, J. Statsmodels: econometric and statistical modeling with Python. In Proc. Python in Science Conference 92–96 https://doi.org/10.25080/Majora-92bf1922-011 (2010).
Wiesner, T. et al. Activity-dependent remodelling of synaptic protein organization revealed by high throughput analysis of STED nanoscopy images. Front. Neural Circuits 14, 57 (2020).
Bilodeau, A., Turcotte, B. & Michaud-Gagnon, A. FLClab/pySTED: New release! v0.0.2. Zenodo https://doi.org/10.5281/zenodo.13174019 (2024).
Bilodeau, A. FLClab/optim-sted: First release! Zenodo https://doi.org/10.5281/zenodo.13175942 (2024).
Bilodeau, A. & Turcotte, B. FLClab/gym-sted: First release! v0.1.1. Zenodo https://doi.org/10.5281/zenodo.13174021 (2024).
Bilodeau, A. & Turcotte, B. FLClab/gym-sted-pfrl: New release! v0.1.0. Zenodo https://doi.org/10.5281/zenodo.13174024 (2024).
Acknowledgements
We thank S. Pensivy and T. Roy from the Neuronal Cell Culture Platform of the CERVO Brain Research Center for the preparation of the dissociated hippocampal cultures, V. Clavet-Fournier for immunostainings of neuronal proteins and M. Bilodeau for help with the pySTED logo. Funding was provided by grants from the Natural Sciences and Engineering Research Council of Canada (NSERC) (RGPIN-06704-2019 to F.L.-C.), Fonds de Recherche Nature et Technologie (FRQNT) Team Grant (2021-PR-284335 to F.L.-C. and A.D.), the Canadian Institute for Health Research (CIHR) (F.L.-C.) and the Neuronex Initiative (National Science Foundation 2014862, Fonds de recherche du Québec–Santé 295824) (F.L.-C.). A.D. is a CIFAR Canada AI Chair, and F.L.-C. is a Canada Research Chair Tier II. A.B. was supported by a scholarship from NSERC. A.B. and A.M.-G. were awarded an excellence scholarship from the FRQNT strategic cluster UNIQUE.
Author information
Authors and Affiliations
Contributions
A.B., A.D. and F.L.-C. designed the study. A.B., A.M.-G., B.T., A.D. and F.L.-C. developed the pySTED simulation platform. A.B. performed the DL and RL experiments and evaluated their performance. A.B. and A.M.-G. evaluated the performance of the pySTED platform and designed the ML experiments. A.B., J.C. and J.H. performed the live-cell and two-colour experiments. A.B. evaluated the deployment of ML and RL strategies in real experimental settings. A.B. and F.L.-C. wrote the paper.
Corresponding author
Ethics declarations
Competing interests
CoLaboratory is a trademark of Google LLC, and this paper is not endorsed by or affiliated with Google in any way. J.H. is an employee of Abberior Instruments, a company manufacturing STED microscopes.
Peer review
Peer review information
Nature Machine Intelligence thanks Pierre Jouchet, Hector Santos and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 pySTED can benefit common microscopy tasks, that is, image analysis and acquisition.
i) A CoLaboratory notebook implementing pySTED was created for trainees to develop their knowledge of and intuition about STED microscopy. ii) pySTED can be leveraged in DL-related microscopy tasks to artificially augment training datasets. iii) pySTED can be used to develop and thoroughly validate AI methods by limiting the impact of biological variability on the measurements and reducing the biological footprint. iv) pySTED reduces the gap between simulation and reality by training RL models that learn through interactions with the simulated system; the trained models are then deployed in a wide range of real experimental conditions.
Extended Data Fig. 2 Optimization of imaging parameters using Kernel-TS.
a) Resulting imaging optimization objectives from Kernel-TS at three different time steps (10, cyan; 100, grey; 190, red) for 50 independent models are presented for increasing signal ratio (top to bottom). With time, Kernel-TS acquires images that have a higher preference score for both fluorophores (purple contour lines) and converges to a similar imaging optimization objective space (red points). b) The standard deviation (STD) of the imaging optimization objectives and of the preference scores decreases during the optimization (cyan to red), supporting the convergence of Kernel-TS to a specific region of the imaging optimization objective space for both fluorophores. The dashed line separates the imaging optimization objectives (R, resolution; P, photobleaching; S, signal ratio) from the preference network (PN). c) Typical pySTED simulations on two different fluorophores (top/bottom) using the parameters optimized on fluorophore A (top) or B (bottom). See Supplementary Table 10 for the imaging parameters.
Extended Data Fig. 3 The agent was deployed on a real STED microscope for the imaging of diverse neuronal proteins in dissociated hippocampal cultures.
a) In distribution: (left) the clusters of CaMKII-β are revealed in all acquired images. (right) The size of the clusters is extracted from the acquired images (dashed vertical line, median of the distribution) and compared with the values previously reported in ref. 58 (solid vertical line). b) Out of distribution: a bright fluorophore associated with PSD95 is simulated experimentally (Methods). The STED images reveal the presence of nanoclusters. The minor (blue) and major (orange) axis lengths of the nanoclusters are measured and compared with the values reported in ref. 59.
Extended Data Fig. 4 Images acquired by the RL agent in a real experiment on a different microscope.
NPC was stained with the STAR RED fluorophore. The sequence of acquired images goes from top left to bottom right. The confocal images before (CONF1) and after (CONF2) the STED acquisitions are presented for photobleaching comparison. The CONF1 image is normalized to the CONF2 image. The STED images are normalized to the 99th percentile of the intensity of the CONF1 image. Images are 5.12 μm × 5.12 μm. The evolution of the parameter selection (left; STED, STED power; Exc., excitation power; Pdt., pixel dwell time) and of the imaging optimization objectives (right; R, resolution; P, photobleaching; S, signal ratio) is presented.
Supplementary information
Supplementary Information
Supplementary Tables 1–22 and Figs. 1–12.
Supplementary Data 1
Statistical source data for the Supplementary Figures.
Source data
Source Data Fig. 2
Statistical source data.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Bilodeau, A., Michaud-Gagnon, A., Chabbert, J. et al. Development of AI-assisted microscopy frameworks through realistic simulation with pySTED. Nat Mach Intell 6, 1197–1215 (2024). https://doi.org/10.1038/s42256-024-00903-w
DOI: https://doi.org/10.1038/s42256-024-00903-w
This article is cited by
- Video microscopy: an old story with a bright biological future. BioMedical Engineering OnLine (2025)