Denoising-autoencoder-facilitated MEMS computational spectrometer with enhanced resolution on a silicon photonic chip

Zhou, Jing; Zhang, Hui; Qiao, Qifeng; Chen, Heng; Huang, Qian; Wang, Hanxing; Ren, Qinghua; Wang, Nan; Ma, Yiming; Lee, Chengkuo

doi:10.1038/s41467-024-54704-1

Download PDF

Article
Open access
Published: 26 November 2024

Denoising-autoencoder-facilitated MEMS computational spectrometer with enhanced resolution on a silicon photonic chip

Jing Zhou^1,2^na1,
Hui Zhang ORCID: orcid.org/0000-0003-2989-7547^3,4,5,6^na1,
Qifeng Qiao⁷,
Heng Chen^1,2,
Qian Huang^1,2,
Hanxing Wang^1,2,
Qinghua Ren^1,2,
Nan Wang^1,2,
Yiming Ma ORCID: orcid.org/0000-0002-5730-3956^1,2 &
…
Chengkuo Lee ORCID: orcid.org/0000-0002-8886-3649^8,9,10

Nature Communications volume 15, Article number: 10260 (2024) Cite this article

5898 Accesses
11 Citations
Metrics details

Subjects

Abstract

Silicon photonics enables the construction of chip-scale spectrometers, in which those using a single tunable interferometer provide a simple and cost-effective solution. Among various tuning mechanisms, electrostatic MEMS reconfiguration stands out as an ideal candidate, given its high tuning efficiency and ultra-low power consumption. Nonetheless, MEMS devices face significant noise challenges arising from their susceptible minuscule components, adversely impacting spectral resolution. Here, we propose a distinct paradigm of spectrometers through synergizing an easily-fabricated MEMS-reconfigurable low-loss waveguide coupler on a silicon photonic chip and a convolutional autoencoder denoising (CAED) mechanism. The spectrometer offers a 300 nm bandwidth and a reconstruction resolution of 0.3 nm in a noise-free condition. In a noisy environment with a signal-to-noise ratio as low as 30 dB, the reconstruction resolution of the interferograms processed by the CAED exhibits an enhancement from 1.2 to 0.4 nm, approaching the noise-free value. Our technology is envisaged to provide a powerful and cost-effective solution for applications requiring accurate, broadband, and energy-efficient spectral analysis.

Integrated silicon photonic MEMS

Article Open access 20 March 2023

Nanoscale imaging of super-high-frequency microelectromechanical resonators with femtometer sensitivity

Article Open access 02 March 2023

Exceptional points enhance sensing in silicon micromechanical resonators

Article Open access 19 January 2024

Introduction

Optical spectrometry is a highly effective analytical tool employed in both academic and industrial areas^1,2. Its applications encompass material analysis, medical diagnostics, and environmental monitoring^3,4,5. To cater to the demands of portable, handheld, and wearable applications, miniature spectrometers are rapidly advancing^6,7. Chip-scale spectrometers based on silicon (Si) photonic integrated circuits (PICs) boast several advantages, including CMOS compatibility and high integration level, making them an appealing option for developing high-performance miniature spectrometers^8,9. Currently, most on-chip spectrometers utilize planar dispersive optics, narrowband filters, and Fourier transform (FT) interferometers^{10,11,12,13,14,15}. Computational spectrometry has recently emerged as a new paradigm, utilizing computational methods to approximate or reconstruct the incident spectrum from pre-calibrated spectral response information¹⁶. Computational spectrometers usually comprise arrays of photonic structures such as photonic crystal slabs¹⁷, photonic crystal nanobeam cavities¹⁸, and stratified waveguide filters⁷. The photonic structure arrays and corresponding detector arrays significantly increase the complexity, footprint, and cost of the PICs. Over the past few years, several spectrometers have been developed that make use of solely a single tunable filter or interferometer paired with a single detector^19,20,21. These devices offer a simpler, smaller, and more cost-effective alternative for computational spectrometry.

The tunability in Si PICs is typically realized by thermo-optic modulation and free carrier injection, both relying on the change of the Si refractive index^22,23. However, because of the weak perturbation of the Si refractive index, these methods frequently result in high power consumption²⁴. In comparison, microelectromechanical systems (MEMS) attain modulation by spatially displacing photonic components, consequently improving the modulation efficiency and reducing the power consumption^25,26. Among a variety of MEMS actuation mechanisms, electrostatic actuation stands out due to its ultra-low standby power and reconfiguration energy consumption²⁷. Therefore, reconfiguration using electrostatic MEMS actuation offers a simple, effective, and energy-efficient approach for the construction of on-chip spectrometers.

Nonetheless, the presence of noise detrimentally affects the quality of output generated by actuators used for converting information to physical, chemical, or biological effects²⁸. MEMS actuators are particularly susceptible to noise issues due to the movable structures and the small sizes of their electronic, mechanical, and other components^28,29. The spectral resolution of the spectrometer depends on both the reconstruction algorithm and the measurement noise³⁰. In very noisy environments, conventional algorithms frequently produce significant distortion of the reconstructed spectrum³¹. Therefore, it is important to remove noise effects, especially for MEMS-enabled spectrometers. However, it is challenging due to intricate noise mechanisms. The application of deep learning technologies is nowadays considered as a potentially promising solution for this problem in spectrum reconstruction^32,33,34. Autoencoder is a deep learning technology that can adaptively learn the structure of data and represent data efficiently^35,36. Autoencoders have demonstrated markable benefits for molecular property prediction³⁷, image segmentation³⁸, and quantum systems³⁹. Furthermore, they have been proven to be effective in reducing noise in single-cell RNA sequencing and ultrasonic signals^40,41. Denoising autoencoders, because of their weak constraints from noise generation mechanisms, show potential for reducing MEMS noise³⁵.

In this paper, we present a paradigm of computational spectrometers based on the synergy between electrostatic MEMS modulation and convolutional autoencoder denoising (CAED) mechanism. The device features a waveguide coupler reconfigured by an integrated MEMS cantilever actuator. Through a strategic reduction of the MEMS tuning range by revealing its counterintuitive relationship with the reconstruction performance, the device yields high fabrication efficiency and optimum reconstruction resolution. On top of the ultra-low power consumption enabled by the electrostatic MEMS tuning, a CAED strategy is proposed and utilized to minimize the side effects of the associated MEMS noise on the reconstruction performance. The autoencoder is trained on a diverse dataset of chip-collected interferograms, achieving optimal noise reduction with a resolution approaching the noise-free level. Spectrum reconstruction results demonstrate the effectiveness of CAED in mitigating noise effects with a low signal-to-noise ratio (SNR) of 30 dB, resulting in the improvement of the resolution from 1.2 to 0.4 nm. The proposed CAED-facilitated MEMS spectrometer presents a promising solution for broadband high-resolution spectral analysis in applications demanding precision and power efficiency. The utilization of advanced deep learning techniques of denoising autoencoders not only improves the performance of MEMS spectrometers but also presents a universal solution for mitigating noise-related challenges in computational spectrometers with calibration matrices.

Results

Design and architecture

Our proposed denoising-autoencoder-facilitated MEMS computational spectrometer consists of a MEMS-enabled computational spectrometer (MECS) and a CAED mechanism. The concept of the computational spectrometer here is analogous to FT spectrometers and centers around the generation of interferograms⁴². The interferograms at the output port are functions of received signal intensity over time and are converted to a wavelength-dependent spectrum via computational algorithms. The MECS is designed as a cantilever-tunable waveguide coupler, consisting of a straight waveguide and a cantilever waveguide (Fig. 1a). Both waveguides in the coupling region are supported by a single-sided structure, while the straight waveguide outside the coupling region is supported by a two-sided structure, thus defining the movable and stationary parts. When applying a bias voltage V, the cantilever waveguide can be electrostatically pulled down while the straight waveguide remains immobile. A vertical coupling gap h will be induced between the two waveguides, subsequently resulting in a change in the effective index difference between the symmetric mode (SM0) and the asymmetric mode (SM1) of the waveguide coupler. The effective index difference $\Delta n$ is a function of both the voltage V and the wavelength λ, thus can be represented by $\Delta n(\lambda,V)$. According to the coupled-mode theory, the output power of the straight waveguide can be described as⁴³:

$${P}_{{{{\rm{o}}}}}\left(\lambda,V\right)=A\left(\lambda \right){\cos }^{2}\left(\frac{\pi L}{\lambda }\Delta n(\lambda,V)\right)$$

(1)

where A(λ) is the spectrum of input light, L is the coupling length. When we apply time-variant bias voltage and thus time-domain modulation of the vertical gap h, an interferogram ${P}_{{{{\rm{o}}}}}(\lambda,V)$ will be generated at the output port for each wavelength, as depicted in Fig. 1b. Subsequently, we apply spectrum reconstruction algorithms to the interferogram data, and the reconstructed spectrum is shown in Fig. 1c. The existence of noise in the interferograms poses a challenge in reconstructing the spectrum with closely adjacent peaks, thereby limiting the reconstruction resolution.

**Fig. 1: Conceptual illustration of the spectrometer.**

To address this problem, we propose a CAED mechanism. The architecture of the CAED is illustrated in Fig. 1d, which consists of an encoder ${{{\mathcal{E}}}}$ for compression and a decoder ${{{\mathcal{D}}}}$ for reconstruction. The autoencoder enables denoising by learning a meaningful representation of input data through the compression and reconstruction process. The encoder ${{{\mathcal{E}}}}$ learns a compact representation of input interferogram data, filtering out the irrelevant information from the input, forcing the model to retain only the essential features for reconstruction, and then the decoder ${{{\mathcal{D}}}}$ reconstructs the clean input. The process can be expressed as:

$${P}_{{{{\rm{d}}}}}\left(\lambda,V\right)=\left({{{\mathcal{D}}}} \circ {{{\mathcal{E}}}}\right){P}_{{{{\rm{o}}}}}\left(\lambda,V\right)$$

(2)

where $\circ$ denotes the sequential application of functions. Convolutional autoencoders, utilizing convolutional layers, prove exceptional proficiency in capturing spatial structures and are particularly effective for denoising tasks related to images or spatiotemporal data^44,45. The interferogram after denoising is shown in Fig. 1e, and the incident spectrum can be accurately reconstructed from it, even when the two peaks are closely adjacent (Fig. 1f). In other words, the spectrum reconstruction performed to ${P}_{{{{\rm{d}}}}}$ improves the noise robustness of the MECS.

MEMS spectrometer

We first work on the principle of the proposed MECS. As shown in Fig. 2a–c, the device is fabricated on a silicon-on-insulator (SOI) wafer that consists of a 0.22 μm thick silicon device layer and a 2 μm thick buried oxide (BOX) layer. Both the straight and cantilever waveguides are 0.35 μm wide for single transverse-electric (TE) mode propagation. The waveguide coupler is designed with an initial coupling gap of 200 nm and a coupling length of 2030 μm. When a bias voltage is applied between the cantilever and the silicon substrate, electrostatic attraction induces downward displacement of the cantilever waveguide, while the straight waveguide remains stationary due to insulation grooves, as shown in Fig. 2d.

**Fig. 2: MECS and spectrum reconstruction.**

As illustrated in Fig. 2e, the coupling between the cantilever and straight waveguides can be understood by the interference of two supermodes (SM0 and SM1) formed in the waveguide coupler. The vertical coupling gap h, in conjunction with the wavelength λ, defines an effective index difference $\Delta n(\lambda,h)$ between SM0 and SM1, i.e., $\Delta n(\lambda,h)={n}_{1}(\lambda,h)-{n}_{2}(\lambda,h)$. Specifically, $\Delta n(\lambda,h)$ can be approximated by a polynomial function:

$$\Delta n(\lambda,h)\approx \left({a}_{1}+{a}_{2}\lambda+{a}_{3}{\lambda }^{2}\right)\left({b}_{1}+{b}_{2}h+{b}_{3}{h}^{2}+{b}_{4}{h}^{3}\right){=f}_{1}(\lambda )\cdot {f}_{2}(h)$$

(3)

This approximation is validated using numerical calculations (see Supplementary Note 1). The polynomial approximation can be fitted with a 99.72% R-squared value. h is a function of the applied bias voltage V and can be approximated as:

$$h\left(V\right)\approx {c}_{1}+{c}_{2}V+{c}_{3}{V}^{2}+{c}_{4}{V}^{3}$$

(4)

We also validate this approximation using numerical calculations (see Supplementary Note 2) and achieve a good fitting with a 99.95% R-squared value. By combining Eqs. (1), (3), and (4), the output power of the proposed spectrometer can be given as:

$${P}_{{{{\rm{o}}}}}\left(\lambda,V\right)=A\left(\lambda \right){\cos }^{2}\left(\frac{\pi L{f}_{1}(\lambda )\cdot {f}_{2}(h(V))}{\lambda }\right)$$

(5)

For our designed waveguide coupler with a certain coupling length L, an interferogram can be obtained at the output port by applying a time-variant bias voltage, using a light beam with a wavelength of λ.

We investigate the relationship between the device tuning range and the spectral reconstruction performance using correlation analysis (see Supplementary Note 3 and Fig. S3). We find that the fully decoupled condition beyond a certain range leads to a larger self-correlation width, indicating impaired reconstruction resolution. Therefore, we adopt a moderately decoupled condition, which not only guarantees satisfactory spectral resolution but also reduces the required tuning range to a level achievable by the release of the BOX layer⁴⁶. This approach significantly simplifies the device configuration and fabrication process. Other attempts, such as employing a trapezoidal supporting structure instead of subwavelength grating for the suspended waveguides, possess lower fabrication restrictions and improved wavelength scalability (see Fig. S4). Using a straight waveguide as the bus waveguide, instead of the traditional directional coupler, reduces propagation loss and improves the SNR. Additionally, edge couplers are utilized to enlarge the device bandwidth (see Fig. S5). The static transmission spectrum and the frequency response are provided in Supplementary Note 4. All these attempts contribute to an enhanced, easy-to-fabricate, and large-bandwidth MEMS spectrometer.

Spectrum reconstruction

Based on the interferograms obtained from the MECS, the spectrum can be reconstructed as follows. We first collect a matrix Y that indicates the spectral response of the device at each wavelength and each bias voltage. Interferograms for wavelengths from 1.3 to 1.6 μm (at a step of 0.1 nm) are acquired by applying a sequential bias voltage ranging from 0 to 29.9 V. The MEMS cantilever, measured 33 μm in length, possesses an estimated electrostatic pull-in voltage of 34.3 V (see Supplementary Note 2). Due to the small displacement of the cantilever at low voltage levels, which results in a limited optical response, a higher voltage increment is chosen at lower bias voltages, with 64 steps in total. Thus, the m-by-n matrix Y now has dimensions m = 64, n = 3001, where m is the number of bias voltages, and n is the number of wavelengths. As the laser intensity, edge coupler efficiency, and detector responsivity vary at different wavelengths, the 3001 elements in each row of the matrix Y need to be normalized to cancel out these wavelength-dependent testing system features. The normalization vector ${{{\bf{W}}}}={\left[{w}_{1},{w}_{2},\cdot \cdot \cdot,{w}_{n}\right]}^{{{{\rm{T}}}}}$ is the transmission spectrum of a reference straight waveguide on the same chip and with the same design as that of the MECS. Therefore, the calibration matrix P of our spectrometer can be given as:

$${{{\bf{P}}}}={{{\bf{Y}}}}\cdot {{{\rm{diag}}}}\left({w}_{1}^{-1},\,{w}_{2}^{-1},\,\cdot \cdot \cdot,{w}_{n}^{-1}\right)$$

(6)

where diag represents the diagonal matrix form. Here, each column of P (as shown in Fig. 2f) represents the interferogram of the corresponding wavelength.

The performance of a spectrometer is expected to achieve two properties: (i) The spectral response at each sampling channel has diverse features, so that the correlation length in the wavelength span can be small to provide high spectral resolution; (ii) The transmission spectra for any two sampling channels should be very different, i.e., orthogonal, to provide a transmission sampling matrix with a large rank⁷. Spectral self- and cross-correlations are calculated from the calibrated matrix P and shown in Fig. 2g, h (see more details in Supplementary Note 5). The self-correlation width, δλ, is read as 0.28 nm, providing an estimation of the spectral resolution. The low cross-correlation approaching almost 0 indicates that the spectra of these sampling channels contain very diverse features, which proves the effectiveness of our designed time-domain modulation channels.

Any output interferogram I, corresponding to a polychromatic signal represented by a column vector R, can be expressed as:

$${{{\bf{I}}}}={{{\bf{P}}}}\cdot {{{\bf{R}}}}$$

(7)

Thereby, the incident spectrum R can be determined from the interferogram I by solving the regularized regression problem, which involves using inadequate constraints (i.e., the I with a size of 64) to infer the R in a size of 3001. To specify a unique solution, the underconstrained system can be solved by using the equation below:

$${\min }_{{{\bf{R}}}}\{{\Vert {{{\bf{I}}}}\,-\,{{{\bf{P}}}}\cdot ({{{{\bf{R}}}}}_{1}+{{{{\bf{R}}}}}_{2})\Vert }_{2}^{2}+\alpha {\Vert {{{{\bf{R}}}}}_{1}\Vert }_{1}+\beta {\Vert {{{{\bf{R}}}}}_{2}\Vert }_{2}^{2}\}$$

(8)

where ${{{{\bf{R}}}}}_{1}$ and ${{{{\bf{R}}}}}_{2}$ denote the discrete and continuous component of R, respectively. α and β denote the regularization parameters that embody the intrinsic characteristics of the spectrometer. The optimal values of α and β are determined via cross-validation analysis²⁰. Using Eq. (8), it is feasible to reconstruct a spectrum of arbitrary shape without specific knowledge of spectral contents (see more discussions on the reconstruction method in Supplementary Note 6). Our model can accurately reconstruct an input single-wavelength spectrum with an accuracy of ±0.1 nm over the entire working wavelength range (300 nm bandwidth). Figure 2i presents the reconstruction of several single-wavelength spectra across the whole bandwidth.

We characterize the device tuning energies at different applied voltages using the method described in Ref. ²⁷. The average tuning power is derived as the tuning energy divided by the response time and plotted in Fig. 2j. Even at the maximum applied voltage of 29.9 V, the average tuning power is less than 70 μW. Additionally, the capacitor nature of the electrostatic MEMS actuator allows nearly zero standby power consumption, as measured and shown in Fig. 2k.

Noise-free reconstruction and noise analysis

Before performing the denoising of MECS, we first determine the noise-free reconstruction resolution for dual-wavelength spectra as a reference. With noise present, an interferogram I consists of two components, i.e., the actual interferogram ${\bf {I}}_{a}$ and the noise interference $\bf e$:

$${{{\bf{I}}}}={{{{\bf{I}}}}}_{a}+{{{\bf{e}}}}$$

(9)

As a result of the stochastic nature of noise, the interferogram of an identical light beam varies with each passage through the same waveguide coupler. Compared to the interferogram ${{{{\bf{I}}}}}_{1}={{{{\bf{I}}}}}_{a}+{{{{\bf{e}}}}}_{1}$ recorded during the calibration, the same interferogram recorded during the spectrum reconstruction can be expressed as ${{{{\bf{I}}}}}_{2}={{{{\bf{I}}}}}_{a}+{{{{\bf{e}}}}}_{2}={{{{\bf{I}}}}}_{1}-{{{{\bf{e}}}}}_{1}+{{{{\bf{e}}}}}_{2}$. Since the absolute value of noise is hard to be quantitated, we take the interferogram recorded during the calibration as a reference. The noise in the interferogram during the subsequent spectrum reconstruction is considered as the relative value ($\Delta {{{\bf{e}}}}={{{{\bf{e}}}}}_{2}-{{{{\bf{e}}}}}_{1}$) with respect to this reference. The interferogram recorded during the calibration is referred to as noise-free ($\Delta {{{\bf{e}}}}={0}$) in the following content.

As verified by our previous work, a dual-wavelength interferogram can be considered as the linear superposition of two single-wavelength ones²⁰. Therefore, the noise-free dual-wavelength interferogram can be obtained by weighted summation of any two column vectors (i.e., single-wavelength interferograms with a wavelength interval of $\Delta \lambda$) in the calibration matrix P, where the weights account for the realistic nonideality of different amplitudes of two laser sources. The spectrum reconstruction results for the synthesized dual-wavelength interferograms are shown in Fig. 3a. Figure 3b, c provide a zoom-in view of the reconstructed spectrum when $\Delta \lambda$ is 0.2 and 0.3 nm, respectively. The reconstruction resolution is defined by the minimum resolvable spacing between the two wavelengths, observed when the reconstructed spectrum closely matches the input spectrum. In order to quantify the reconstruction accuracy, here we utilize the widely adopted metric named relative error ε, which is defined as¹⁹:

$$\varepsilon=\frac{||{{{\bf{R}}}}-\widehat{{{{\bf{R}}}}}|{|}_{2}}{||{{{\bf{R}}}}|{|}_{2}}$$

(10)

where R and $\widehat{{{{\bf{R}}}}}$ are the input and reconstructed spectrum, respectively. The calculated ε in Fig. 3b, c indicates that distinguishing between two peaks with a separation of less than 0.2 nm is challenging, and the noise-free reconstruction resolution of the spectrometer is ~0.3 nm.

**Fig. 3: Noise-free spectrum reconstruction and noise analysis.**

However, in practical scenarios, the measured interferograms are inevitably influenced by noise, thus worsening the reconstruction resolution. In fact, due to their tiny and movable components, sources of noise in MEMS devices are quite diverse, including thermal noise, shot noise, 1/f noise, and others²⁸. In Supplementary Note 7, we analyse the significant effects of thermal and shot noises on cantilever displacement. Figure 3d illustrates the noise-induced floating displacements of the cantilever waveguide, which impose noise on the measured interferograms. After the calibration matrix collection, the interferograms are measured by the MECS again for 1000 wavelengths within the bandwidth. Then, we calculate their relative SNR, which is defined as $10{{\mathrm{lg}}}\frac{{{{\rm{||}}}}{{{{\bf{I}}}}}_{1}{{{{\rm{||}}}}}^{2}}{{{{\rm{||}}}}{{{{\bf{I}}}}}_{2}-{{{{\bf{I}}}}}_{1}{{{{\rm{||}}}}}^{2}}$ The resulting SNR amplitude distribution ranges from 30 to 55 dB, as shown in Fig. 3e. To investigate the impact of noise on the reconstruction resolution, we apply white noise with the highest noise level corresponding to 30 dB SNR to 20 randomly selected dual-wavelength interferograms. The spectrum reconstruction resolution falls in the range of 0.8–1.2 nm, which is significantly worse than the noise-free value of 0.3 nm, as shown in Fig. 3f. According to the Rayleigh criterion (refer to Supplementary Note 8), the length of the waveguide coupler needs to be extended 4 times to achieve the same 0.3 nm resolution. Therefore, it is imperative to mitigate noise effects.

To minimize the impact of noise on reconstruction resolution, an intuitive strategy is to limit the noise during device design, which, however, needs to be delicate while having limited effects. A noise reduction algorithm for interferograms would be a more effective and cost-efficient solution. Nevertheless, the interferograms formed by scatter plots are irregular and lack discernible frequencies, making traditional denoising techniques deficient⁴⁷. Considering that all column vectors in the calibration matrix P are linearly independent eigenvectors, any dual-wavelength interferogram can be regarded as a linear combination of two single-wavelength interferograms. This represents a form of feature space combination that is well-suited for autoencoder purification⁴⁸. Hereby, we propose a CAED mechanism to acquire a compact representation of input data and eliminate irrelevant information from the input.

Convolutional autoencoder denoising

Denoising autoencoder aims to learn a representation robust to noise added to the original data. Typically, training a denoising autoencoder aims to reconstruct the original data with minimal error. However, if the original data is complicated, the training process may be time-consuming and may lead to underfitting. Additionally, if the autoencoder is overly specialized for a certain type of input, it may lose generalizability to other patterns, necessitating different models for different input spectra. Hereby, we employ a different, noise-oriented training strategy: instead of training the autoencoder to recover the input pattern, we recover the noise pattern and then subtract it from the initial input data (see details in Supplementary Note 9)⁴⁹. To be specific, consider a noisy observation I, which consists of the original data ${{{{\bf{I}}}}}_{a}$ and the noise ${{{\bf{e}}}}$, i.e., ${{{\bf{I}}}}={{{{\bf{I}}}}}_{a}+{{{\bf{e}}}}$. Since ${{{\bf{e}}}}$ is simpler and has a more consistent pattern, we train the autoencoder by learning ${{{\bf{e}}}}$ and subtracting it from I, which is more effective than learning ${{{{\bf{I}}}}}_{a}$ directly. The schematics of the training and testing phases are depicted in Fig. 4a. The parameters of the autoencoder (i.e., encoder ${f}_{\theta }$ and decoder ${g}_{{\theta }^{{\prime} }}$) are optimized as follows:

$${\theta } ^{\,*},{\theta }^{\,{\prime} {*} }={{\arg}\,{\min}}_{\theta,{\theta }^{{\prime} }}\frac{1}{M}{\sum }_{i=1}^{M}{{{\mathcal{L}}}}\left({{{{\bf{e}}}}}^{(i)},{g}_{{\theta }^{{\prime} }}\left({f}_{\theta }\left({{{{\bf{I}}}}}^{\left(i\right)}\right)\right)\right)$$

(11)

where ${{{\mathcal{L}}}}$ is a loss function of mean squared error (MSE) between two inputs. During training phase, the ${{{{\bf{e}}}}}^{(i)}$ is derived by subtracting the ground truth ${{{{\bf{I}}}}}_{a}^{(i)}$ from the input sample ${{{{\bf{I}}}}}^{\left(i\right)}$. In test phase, we employ the trained autoencoder to predict ${\widetilde{{{{\bf{e}}}}}}^{(j)}$ and subtract it from the input sample to derive the regenerated data ${\widetilde{{{{\bf{I}}}}}}_{a}^{\,(\;j)}$, which can be represented as follows for all $j\in \left\{1,\ldots,L\right\}$:

$${\widetilde{{{{\bf{I}}}}}}_{a}^{\,(\;j)}={{{{{\bf{I}}}}}^{\left(j\right)}-g}_{{\theta }^{{\prime}*}}\left({f}_{{\theta }^{*}}\left({{{{\bf{I}}}}}^{\left(j\right)}\right)\right)$$

(12)

**Fig. 4: Construction of convolutional denoising autoencoder.**

While it is not necessary to include all possible patterns that will appear in the testing phase in the training set, maintaining diversity is crucial to ensure that the autoencoder learns the noise pattern rather than the pattern of any specific input type. Therefore, we construct a dataset comprising various input interferogram patterns, by sampling from the calibration matrix P (Fig. 4b). Taking dual-wavelength interferogram as an example, to ensure that each column feature of matrix P has the same probability of being sampled in the synthesized dataset, we reshape the transpose of P (${{{{\bf{P}}}}}^{\prime}$ size of 3001 × 64) by concatenating its first 100 rows to its end, forming a new matrix Q with a size of 3101 × 64. Index pairs (i, j) are randomly generated, where $0 \, < \, j-i\le 100$ and $1\le i\le 3001$, and the ith row of matrix Q is added with the jth row of Q to form dual-wavelength interferograms. Similarly, index triplets (i, j, k) and index quadruplets (i, j, k, l) are randomly generated to construct triple-wavelength and quadruple-wavelength data. The three types of data, each with 10,000 samples, together construct the mixed dataset (matrix M) of 30,000 samples. Gaussian white noises corresponding to SNRs of 30 and 36 dB are then added to each row of matrix M, forming an interferogram dataset N consisting of 60,000 noisy interferograms. Noises of different levels are added to enhance the diversity of the dataset, so as to improve the generalizability of the trained model⁵⁰.

The architecture of the convolutional autoencoder is depicted in Fig. 4c, which consists of an encoder for compression and a decoder for reconstruction. The encoder comprises convolutional layers, maximum pooling layers, and residual blocks, while the decoder includes transpose convolution layers and convolutional layers. The performance of CAED is optimized by employing the residual block and fine-tuning the convolution kernel size and the number of convolutional layers. According to Fig. 4d–f, the CAED model showing the optimal noise reduction performance contains 5 convolution kernels, 2 residual blocks, and 6 convolutional layers. During the training process, MSE is used as the loss function, and the resolution of spectrum reconstruction is evaluated on the test set (Fig. 4g). The MSE decreases rapidly during training, while the resolution gradually improves and eventually stabilizes. The trained autoencoder demonstrates great generalizability in denoising a variety of input interferogram patterns without the need for retraining for each specific kind of pattern (see details in Supplementary Note 9).

Figure 5a–c shows the non-denoised, denoised, and noise-free reconstruction resolutions of 20 sets of dual-wavelength interferograms under different SNR conditions of 35, 30, and 25 dB, respectively. The results show that CAED improves the reconstruction resolution in all three noise scenarios (from 0.4–0.8 to 0.3–0.4 nm for 35 dB SNR, from 0.8–1.2 to 0.3–0.4 nm for 30 dB SNR, and from 1.2–1.7 to 0.5–0.8 nm for 25 dB SNR). The resolution can be improved to nearly the noise-free value for SNR of 30 dB and above. The denoising performance of CAED at lower SNR levels of 20, 15, and 8 dB is presented in Supplementary Note 10, where resolution improvement is also observed. For these lower SNR levels, a corresponding noise training dataset may achieve better denoising results. Therefore, our CAED mechanism can effectively work across the entire SNR range in real-life applications.

**Fig. 5: Test of CAED effectiveness for spectrum reconstruction.**

Using the trained model, we first assess the effectiveness of CAED on dual-wavelength interferograms by combining two tunable laser sources via a 50/50 optical coupler. Figure 5d–g depict the reconstruction results of the dual-wavelength interferograms with different wavelength spacings. At a wavelength spacing of 1.2 nm (Fig. 5d) or greater, the input spectrum can be accurately reconstructed regardless of whether the interferogram is denoised or not. Wavelength spacings between 0.4 and 1.1 nm (Fig. 5e, f) are where reconstruction without denoising is not feasible, whereas denoising the interferogram enables the spectrum reconstruction. At a wavelength spacing of 0.3 nm or less (Fig. 5g), the incident spectrum cannot be reconstructed even when the interferogram is denoised. Therefore, the reconstruction resolution is successfully improved from 1.2 to 0.4 nm by CAED, almost approaching the noise-free value of 0.3 nm. In the synthesized dataset used to train the model, the maximum wavelength spacing of dual-wavelength spectra is 10 nm. To verify the effectiveness of CAED when the spacing between two wavelengths exceeds 10 nm, we also studied three scenarios with wavelength spacings of 20, 30, and 50 nm in Supplementary Note 11. Consistent reconstruction performance is observed, further illustrating that the denoising is independent of the input spectrum pattern. Our spectrometer also demonstrates robustness to temperature fluctuation of ±8 °C, which can be further extended to 10-70 °C as long as the calibration matrix at each temperature is pre-recorded (see Supplementary Note 12), covering the reasonable operating temperature range for practical applications. In Supplementary Note 13, we further analyse the tolerance of our spectrometer to fabrication errors.

Beyond the dual-wavelength experiment, a more challenging triple-wavelength testing is conducted. The result presented in Fig. 5h illustrates the successful reconstruction of three laser peaks and a spectral spacing of 0.4 nm between the two nearest peaks with a relative error ε of 0.126. In addition, the reconstruction of a broadband spectrum is demonstrated using an amplified spontaneous emission (ASE) source as the input. As shown in Fig. 5i, the spectral features are well recovered with a low relative error ε of 0.044. Furthermore, a mixed spectrum is examined, which combines a broadband signal (the ASE source) with a narrowband signal (a laser source) via a 50/50 optical coupler. Figure 5j presents the resolved mixed spectrum with an ε of 0.095, showing that a high reconstruction accuracy can still be attained. In Supplementary Note 14, we perform the reconstruction of a more broadband spectrum by simulation, providing additional evidence of the 300 nm bandwidth. Our spectrometer can find numerous real-life applications, for example, spectroscopic sensing of various molecules, such as N-methylaniline with a well-defined absorption fingerprint near 1.5 µm⁵¹.

Discussion

To benchmark our spectrometer, we conduct a comprehensive comparison with on-chip spectrometers that have been previously reported (see details in Supplementary Note 15)^{7,10,11,12,13,14,15,16,18,19,30,42,52,53,54,55,56,57,58,59,60,61}. In most reported spectrometers, a distinct trade-off exists between resolution and bandwidth, as shown in Fig. 6a. Specifically, when the resolution surpasses 1 nm, the bandwidth tends to narrow down to less than 200 nm. Our proposed MEMS spectrometer demonstrates state-of-the-art performance in terms of bandwidth, and further, with the assistance of a denoising autoencoder, breaks through the trade-off limitation between bandwidth and reconstruction resolution, achieving a bandwidth of 300 nm and a reconstruction resolution of 0.4 nm. In addition to improving reconstruction resolution and robustness, the autoencoder denoising, executed using the relative error between the calibration matrix and measurements, has the potential to enhance the tolerance of spectrometers to manufacturing imperfections in high-volume production. Some recent demonstrations, leveraging narrowband filtering and computational reconstruction, have improved the bandwidth-to-resolution ratios to several thousands, but often come with the drawback of requiring extended sampling times. Additionally, they commonly employed thermal tuning, necessitating meticulous temperature control and high power consumption of over 30 mW. In comparison, our device features ultra-low power consumption of less than 70 μW thanks to the electrostatic MEMS reconfiguration, which is three orders of magnitude lower, as illustrated in Fig. 6b.

**Fig. 6: Comparison of reported on-chip spectrometers.**

Recently, physically multi-stage structures have become popular practices for designing high-performance spectrometers by creating abundant sampling channels^60,61. As a pioneer in MEMS spectrometers, our device, although limited to the simplest case, i.e., a single physical stage, can see significant performance improvements by further leveraging a multi-stage structure as illustrated in Fig. 7a. In this simulation demonstration, we implement a 3-stage design with 8 voltage states per stage (i.e., 512 sampling channels in total). The corresponding calibration matrix is shown in Fig. 7b. Thanks to the improved channel decorrelation and increased channel number, the reconstruction resolution is improved by one order of magnitude (see Supplementary Note 16 and Fig. S19). Meanwhile, despite the estimated noise cumulated to 19 dB SNR in the 3-stage structure, our current denoising autoencoder still achieves over twofold improvement of resolution to 40 pm, as shown in Fig. 7c–f, which could be further enhanced by accordingly optimizing the design of the autoencoder network. The multi-stage structure and the CAED mechanism also work well in the reconstruction of triple-wavelength, broadband, and mixed broadband/narrowband spectra (see Fig. S20). Regarding footprint, our device is significantly smaller than those physically multi-stage spectrometers, while less compact compared to some of the narrowband-filter-based spectrometers. Nonetheless, narrowband filters usually suffer from an inherent compromise between the SNR and spectral resolution⁶². To further shrink the footprint, we can reduce the coupling gap of the waveguide coupler by replacing the cantilever actuator with a comb-drive actuator, changing the current out-of-plane reconfiguration scheme to an in-plane one (see Supplementary Note 17). The driving voltage can also be reduced through optimizing the structural parameters of the MEMS actuator (see Supplementary Note 18).

**Fig. 7: Investigation of multi-stage design.**

In conclusion, our study represents a significant advancement in the domain of on-chip optical spectrometry, specifically focusing on MEMS-enabled devices. Our work highlights the inherent limitations of existing on-chip spectrometers, emphasizing the advantages offered by Si PICs and the efficacy of electrostatic MEMS modulation. Recognizing the susceptibility of MEMS actuators to noise, we develop CAED - a deep learning technology - to effectively mitigate noise effects and elevate spectrum reconstruction resolution. The results underscore the tangible potential of the proposed CAED-facilitated MEMS spectrometer. With the noise reduction capability at 30 dB SNR, the reconstruction resolution of the spectrometer is improved from 1.2 to 0.4 nm, approaching the noise-free value of 0.3 nm. Our approach lays a solid foundation for broadband high-resolution spectral analysis, particularly in applications demanding precision, power efficiency, and noise resilience. Moreover, it is worth highlighting that the presented CAED mechanism would have broad applicability in computational spectrometers using calibration matrices, due to its weak restrictions from noise generation mechanisms. Beyond its immediate implications, the denoising autoencoders are ready to provide a strategic solution with far-reaching impacts on the ongoing evolution of miniaturized optical devices^36,39,63,64.

Methods

Device fabrication

The MECS is fabricated on an 8-inch SOI wafer using 193 nm deep ultraviolet (DUV) photolithography. The directional coupler and the MEMS actuator are formed by reactive ion etching (RIE). An aluminium (Al) thin film is then deposited and patterned for electrical connection to power the MEMS actuator. At last, hydrofluoric acid (HF) vapor etching is used to locally remove the BOX layer and release the directional coupler and the MEMS cantilever.

Device characterization

For single-, dual- and multi-wavelength characterization, a set of tunable lasers (Santec TSL-510, 550, and 710) are adopted as the input, which are also used to measure the calibration matrix. For broadband characterization, a C + L band ASE broadband light source (Amonics ALS-CL-13-B-FA) is used as the input. A polarization controller is used to ensure that only TE-polarized light is injected into the on-chip spectrometer. The spectrometer chip is mounted on an XYZ stage for fiber-chip alignment, with the temperature controlled by a temperature controller. The light is coupled in and out of the chip through two on-chip adiabatically tapered edge couplers for broadband operation. The output light from the MECS is collected by a photodetector (Thorlabs PDA-10CS-EC). Input spectra are also recorded using an optical spectrum analyzer (OSA, Yokogawa AQ6370D) as references. The recorded spectra from OSA have a fine resolution of 20 pm. The reference input spectra are created by resampling the raw data into a 3001-point sequence with a coarser resolution of 100 pm. A semiconductor characterization system (Keithley 4200-SCS) is employed for time-sequenced bias voltage supply to implement time-domain modulation of the MECS. The sampling time grid is ~0.1 s, resulting in a total sampling time of ~6.4 s given 64 sampling steps. The sampling process can be accelerated by synchronizing the electrical voltage scanning and optical power detection with a shared trigger signal.

Reconstruction implementation

Spectrum reconstruction is implemented using a MATLAB package of iterative regularization methods and test problems for linear inverse problems (IR Tools). This method can be used to reconstruct all types of spectra, including discrete, continuous, and mixed spectra. The CAED implementation in Python 3.6 uses Keras and its TensorFlow backend. Adam is used for optimization with a learning rate of 0.002. The learning rate is multiplied by 0.5 if the loss does not improve for 30 epochs. ReLU is used as the activation function for all layers except the last output layer. The final output layer utilizes LeakyReLU as the activation function. Our training of autoencoder is deployed on 4 pieces of NVIDIA TITAN Xp GPU. Training stops after 360 epochs. A batch size of 128 is used for all datasets. For the dataset volume of 30,000, each training epoch takes 1 s, the time taken for the total 360 epochs is ~6 min. After training, the model can be applied to random input samples, with each sample taking 0.15 μs for prediction.

Data availability

The data that support the findings of this study are included in the article and its Supplementary Information. Other data are available from the corresponding authors upon request.

Code availability

The codes in support of the results of this study are available from the corresponding authors upon request.

References

Yang, Z., Albrow-Owen, T., Cai, W. & Hasan, T. Miniaturization of optical spectrometers. Science 371, eabe0722 (2021).
Article CAS PubMed Google Scholar
Xia, L., Liu, Y., Chen, R. T., Weng, B. & Zou, Y. Advancements in miniaturized infrared spectroscopic-based volatile organic compound sensors: a systematic review. Appl. Phys. Rev. 11, 031306 (2024).
Article CAS Google Scholar
Manley, M. Near-infrared spectroscopy and hyperspectral imaging: non-destructive analysis of biological materials. Chem. Soc. Rev. 43, 8200–8214 (2014).
Article CAS PubMed Google Scholar
Ralbovsky, N. M. & Lednev, I. K. Towards development of a novel universal medical diagnostic method: Raman spectroscopy and machine learning. Chem. Soc. Rev. 49, 7428–7453 (2020).
Article CAS PubMed Google Scholar
Zhou, H. et al. Metal–organic framework‐surface‐enhanced infrared absorption platform enables simultaneous on‐chip sensing of greenhouse gases. Adv. Sci. 7, 2001173 (2020).
Article CAS Google Scholar
Yuan, S., Naveh, D., Watanabe, K., Taniguchi, T. & Xia, F. A wavelength-scale black phosphorus spectrometer. Nat. Photonics 15, 601–607 (2021).
Article ADS CAS Google Scholar
Li, A. & Fainman, Y. On-chip spectrometers using stratified waveguide filters. Nat. Commun. 12, 2704 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, A. et al. Advances in cost-effective integrated spectrometers. Light Sci. Appl. 11, 174 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, L. et al. Research progress on on‐chip Fourier transform spectrometer. Laser Photon. Rev. 15, 2100016 (2021).
Article ADS CAS Google Scholar
Souza, M. C. M. M., Grieco, A., Frateschi, N. C. & Fainman, Y. Fourier transform spectrometer on silicon with thermo-optic non-linearity and dispersion correction. Nat. Commun. 9, 665 (2018).
Article ADS PubMed PubMed Central Google Scholar
le Coarer, E. et al. Wavelength-scale stationary-wave integrated Fourier-transform spectrometry. Nat. Photonics 1, 473–478 (2007).
Article ADS Google Scholar
Xu, H., Qin, Y., Hu, G. & Tsang, H. K. Scalable integrated two-dimensional Fourier-transform spectrometry. Nat. Commun. 15, 436 (2024).
Article ADS CAS PubMed PubMed Central Google Scholar
Hartmann, W. et al. Waveguide-integrated broadband spectrometer based on tailored disorder. Adv. Opt. Mater. 8, 1901602 (2020).
Article CAS Google Scholar
Hadibrata, W., Noh, H., Wei, H., Krishnaswamy, S. & Aydin, K. Compact, high‐resolution inverse‐designed on‐chip spectrometer based on tailored disorder modes. Laser Photon. Rev. 15, 2000556 (2021).
Article ADS CAS Google Scholar
Sun, C. et al. Broadband and high-resolution integrated spectrometer based on a tunable FSR-free optical filter array. ACS Photonics 9, 2973–2980 (2022).
Article CAS Google Scholar
Cheng, Z. et al. Generalized modular spectrometers combining a compact nanobeam microcavity and computational reconstruction. ACS Photonics 9, 74–81 (2022).
Article CAS Google Scholar
Wang, Z. et al. Single-shot on-chip spectral sensors based on photonic crystal slabs. Nat. Commun. 10, 1020 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, J., Cheng, Z., Dong, J. & Zhang, X. Cascaded nanobeam spectrometer with high resolution and scalability. Optica 9, 517–521 (2022).
Article ADS Google Scholar
Xu, H., Qin, Y., Hu, G. & Tsang, H. K. Breaking the resolution-bandwidth limit of chip-scale spectrometry by harnessing a dispersion-engineered photonic molecule. Light Sci. Appl. 12, 64 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Qiao, Q. et al. MEMS-enabled on-chip computational mid-infrared spectrometer using silicon photonics. ACS Photonics 9, 2367–2377 (2022).
Article CAS Google Scholar
Chang, Y. et al. Development of triboelectric-enabled tunable Fabry-Pérot photonic-crystal-slab filter towards wearable mid-infrared computational spectrometer. Nano Energy 89, 106446 (2021).
Article CAS Google Scholar
Ma, Y., Dong, B., Li, B., Ang, K.-W. & Lee, C. Dispersion engineering and thermo-optic tuning in mid-infrared photonic crystal slow light waveguides on silicon-on-insulator. Opt. Lett. 43, 5504–5507 (2018).
Article ADS CAS PubMed Google Scholar
Nedeljkovic, M. et al. Silicon-on-insulator free-carrier injection modulators for the mid-infrared. Opt. Lett. 44, 915–918 (2019).
Article ADS CAS PubMed Google Scholar
Nedeljkovic, M. et al. Mid-infrared thermo-optic modulators in soI. IEEE Photonics Technol. Lett. 26, 1352–1355 (2014).
Article CAS Google Scholar
Quack, N. et al. Integrated silicon photonic MEMS. Microsyst. Nanoeng. 9, 27 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Errando-Herranz, C. et al. MEMS for photonic integrated circuits. IEEE J. Sel. Top. Quantum Electron. 26, 8200916 (2020).
Article CAS Google Scholar
Kim, D. U. et al. Programmable photonic arrays based on microelectromechanical elements with femtowatt-level standby power consumption. Nat. Photonics 17, 1089–1096 (2023).
Article ADS CAS Google Scholar
Mohd-Yasin, F., Nagel, D. J. & Korman, C. E. Noise in MEMS. Meas. Sci. Technol. 21, 012001 (2010).
Article ADS Google Scholar
Talghader, J. J. Thermal and mechanical phenomena in micromechanical optics. J. Phys. D Appl. Phys. 37, R109–R122 (2004).
Article CAS Google Scholar
Redding, B., Liew, S. F., Sarma, R. & Cao, H. Compact spectrometer based on a disordered photonic chip. Nat. Photonics 7, 746–751 (2013).
Article ADS CAS Google Scholar
Gao, L., Qu, Y., Wang, L. & Yu, Z. Computational spectrometers enabled by nanophotonics and deep learning. Nanophotonics 11, 2507–2529 (2022).
Article CAS Google Scholar
Zhang, J., Zhu, X. & Bao, J. Solver-informed neural networks for spectrum reconstruction of colloidal quantum dot spectrometers. Opt. Express 28, 33656–33673 (2020).
Article ADS PubMed Google Scholar
Zhang, J., Zhu, X. & Bao, J. Denoising autoencoder aided spectrum reconstruction for colloidal quantum dot spectrometers. IEEE Sens. J. 21, 6450–6458 (2021).
Article ADS Google Scholar
Brown, C. et al. Neural network-based on-chip spectroscopy using a scalable plasmonic encoder. ACS Nano 15, 6305–6315 (2021).
Article CAS PubMed Google Scholar
Meng, L., Ding, S. & Xue, Y. Research on denoising sparse autoencoder. Int. J. Mach. Learn. Cybern. 8, 1719–1729 (2017).
Article Google Scholar
Yuan, S. et al. Geometric deep optical sensing. Science 379, eade1220 (2023).
Article ADS CAS PubMed Google Scholar
Zhang, H. et al. Molecular property prediction with photonic chip‐based machine learning. Laser Photon. Rev. 17, 2200698 (2023).
Article ADS Google Scholar
Saranyaraj, D. & Manikandan, M. Early prediction of breast cancer based on the classification of HER‐2 and ER biomarkers using deep neural network. Expert Syst. 40, e13366 (2023).
Article Google Scholar
Zhang, H. et al. Resource-efficient high-dimensional subspace teleportation with a quantum autoencoder. Sci. Adv. 8, eabn9783 (2022).
Article ADS PubMed PubMed Central Google Scholar
Simon, L. M., Mueller, N. S. & Theis, F. J. Single-cell RNA-seq denoising using a deep count autoencoder. Nat. Commun. 10, 390 (2019).
Article ADS PubMed PubMed Central Google Scholar
Gao, F. et al. Ultrasonic signal denoising based on autoencoder. Rev. Sci. Instrum. 91, 045104 (2020).
Article ADS CAS PubMed Google Scholar
Li, L. et al. Design of an on-chip Fourier transform spectrometer using waveguide directional couplers and NEMS. Opt. Express 26, 30362–30370 (2018).
Article ADS PubMed Google Scholar
Yariv, A. Coupled-mode theory for guided-wave optics. IEEE J. Quantum Electron. 9, 919–933 (1973).
Article ADS CAS Google Scholar
Chiang, H.-T. et al. Noise reduction in ECG signals using fully convolutional denoising autoencoders. IEEE Access 7, 60806–60813 (2019).
Article Google Scholar
Fang, Z. et al. Laser stripe image denoising using convolutional autoencoder. Results Phys. 11, 96–104 (2018).
Article ADS Google Scholar
O’Brien, G., Monk, D. J. & Lin, L. MEMS cantilever beam electrostatic pull-in model. Proc. SPIE 4593, 31–41 (2001).
Article ADS Google Scholar
Liao, M. et al. Scattering imaging as a noise removal in digital holography by using deep learning. N. J. Phys. 24, 083014 (2022).
Article Google Scholar
Wang, X., Wang, Z., Zhang, Y., Jiang, X. & Cai, Z. Latent representation learning based autoencoder for unsupervised feature selection in hyperspectral imagery. Multimed. Tools Appl. 81, 12061–12075 (2022).
Article Google Scholar
Lee, W. H., Ozger, M., Challita, U. & Sung, K. W. Noise learning-based denoising autoencoder. IEEE Commun. Lett. 25, 2983–2987 (2021).
Article Google Scholar
Wu, C. et al. Harnessing optoelectronic noises in a photonic generative network. Sci. Adv. 8, eabm2956 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Hu, J. et al. Fabrication and testing of planar chalcogenide waveguide integrated microfluidic sensor. Opt. Express 15, 2307–2314 (2007).
Article ADS CAS PubMed Google Scholar
Zheng, S. N. et al. Microring resonator-assisted Fourier transform spectrometer with enhanced resolution and large bandwidth in single chip solution. Nat. Commun. 10, 2349 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Kita, D. M. et al. High-performance and scalable on-chip digital Fourier transform spectroscopy. Nat. Commun. 9, 4405 (2018).
Article ADS PubMed PubMed Central Google Scholar
Momeni, B., Askari, M., Shah Hosseini, E., Atabaki, A. & Adibi, A. An on-chip silicon grating spectrometer using a photonic crystal reflector. J. Opt. 12, 035501 (2010).
Article ADS Google Scholar
Zheng, Z., Zhu, S., Chen, Y., Chen, H. & Chen, J. Towards integrated mode-division demultiplexing spectrometer by deep learning. Opto-Electron. Sci. 1, 220012 (2022).
Article Google Scholar
Xie, S., Meng, Y., Bland-Hawthorn, J., Veilleux, S. & Dagenais, M. Silicon nitride/silicon dioxide echelle grating spectrometer for operation near 1.55 μm. IEEE Photonics J. 10, 4502207 (2018).
Article CAS Google Scholar
Zhang, L. et al. Ultrahigh-resolution on-chip spectrometer with silicon photonic resonators. Opto-Electron. Adv. 5, 210100 (2022).
Article CAS Google Scholar
Xia, Z. et al. High resolution on-chip spectroscopy based on miniaturized microdonut resonators. Opt. Express 19, 12356–12364 (2011).
Article ADS CAS PubMed Google Scholar
Sun, C. et al. Integrated microring spectrometer with in‐hardware compressed sensing to break the resolution‐bandwidth limit for general continuous spectrum analysis. Laser Photon. Rev. 17, 2300291 (2023).
Article ADS CAS Google Scholar
Yao, C. et al. Integrated reconstructive spectrometer with programmable photonic circuits. Nat. Commun. 14, 6376 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Yao, C. et al. Broadband picometer-scale resolution on-chip spectrometer with reconfigurable photonics. Light Sci. Appl. 12, 156 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Xu, H., Qin, Y., Hu, G. & Tsang, H. K. Cavity-enhanced scalable integrated temporal random-speckle spectrometry. Optica 10, 1177–1188 (2023).
Article ADS CAS Google Scholar
Huang, C.-J. et al. Realization of a quantum autoencoder for lossless compression of quantum data. Phys. Rev. A 102, 032412 (2020).
Article ADS CAS Google Scholar
Chen, Y. et al. Photonic unsupervised learning variational autoencoder for high-throughput and low-latency image transmission. Sci. Adv. 9, eadf8437 (2023).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by National Natural Science Foundation of China (NSFC) Grant (62405173 to Y.M.), Shanghai Pujiang Program (23PJ1413700 to H.Z.), Fundamental Research Funds for the Central Universities (22120240566 to H.Z.), Agency for Science, Technology and Research (A*STAR) RIE Advanced Manufacturing and Engineering (AME) Programmatic Grant (A18A4b0055 to C.L.), Ministry of Education (MOE) Singapore Academic Research Fund Tier 2 (MOE-T2EP50220-0014 to C.L.), and National Research Foundation (NRF) Singapore Mid-Sized Centre Grant through the National Centre for Advanced Integrated Photonics (NRF-MSG-2023-0002 to C.L.).

Author information

These authors contributed equally: Jing Zhou, Hui Zhang.

Authors and Affiliations

School of Microelectronics, Shanghai University, Shanghai, China
Jing Zhou, Heng Chen, Qian Huang, Hanxing Wang, Qinghua Ren, Nan Wang & Yiming Ma
Shanghai Collaborative Innovation Center of Intelligent Sensing Chip Technology, Shanghai University, Shanghai, China
Jing Zhou, Heng Chen, Qian Huang, Hanxing Wang, Qinghua Ren, Nan Wang & Yiming Ma
Institute of Precision Optical Engineering, School of Physics Science and Engineering, Tongji University, Shanghai, China
Hui Zhang
MOE Key Laboratory of Advanced Micro-Structured Materials, Shanghai, China
Hui Zhang
Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai, China
Hui Zhang
Shanghai Frontiers Science Center of Digital Optics, Shanghai, China
Hui Zhang
Shanghai Industrial μTechnology Research Institute (SITRI), Shanghai, China
Qifeng Qiao
Department of Electrical and Computer Engineering, National University of Singapore, Singapore, Singapore
Chengkuo Lee
Center for Intelligent Sensors and MEMS (CISM), National University of Singapore, Singapore, Singapore
Chengkuo Lee
National Centre for Advanced Integrated Photonics (NCAIP), Singapore, Singapore
Chengkuo Lee

Authors

Jing Zhou
View author publications
Search author on:PubMed Google Scholar
Hui Zhang
View author publications
Search author on:PubMed Google Scholar
Qifeng Qiao
View author publications
Search author on:PubMed Google Scholar
Heng Chen
View author publications
Search author on:PubMed Google Scholar
Qian Huang
View author publications
Search author on:PubMed Google Scholar
Hanxing Wang
View author publications
Search author on:PubMed Google Scholar
Qinghua Ren
View author publications
Search author on:PubMed Google Scholar
Nan Wang
View author publications
Search author on:PubMed Google Scholar
Yiming Ma
View author publications
Search author on:PubMed Google Scholar
Chengkuo Lee
View author publications
Search author on:PubMed Google Scholar

Contributions

Y.M., J.Z., and H.Z. conceived the idea. J.Z. and H.Z. performed the device design, fabrication, and characterization with assistance from Y.M., Q.Q., H.C., Q.H., and H.W. H.Z. and J.Z. wrote the convolutional autoencoder algorithms and control programs for demonstration. The results were discussed by all authors. Y.M. and H.Z. wrote the manuscript with comments from J.Z., N.W., and Q.R. Y.M. and C.L. supervised the project.

Corresponding authors

Correspondence to Yiming Ma or Chengkuo Lee.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Ang Li, Darshan Parmar and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Transparent Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhou, J., Zhang, H., Qiao, Q. et al. Denoising-autoencoder-facilitated MEMS computational spectrometer with enhanced resolution on a silicon photonic chip. Nat Commun 15, 10260 (2024). https://doi.org/10.1038/s41467-024-54704-1

Download citation

Received: 06 February 2024
Accepted: 19 November 2024
Published: 26 November 2024
DOI: https://doi.org/10.1038/s41467-024-54704-1

This article is cited by

Dispersive optical activity for spectro-polarimetric imaging
- Zhijie Cao
- Siwei Sun
- Yong Liu
Light: Science & Applications (2025)
Near-Sensor Edge Computing System Enabled by a CMOS Compatible Photonic Integrated Circuit Platform Using Bilayer AlN/Si Waveguides
- Zhihao Ren
- Zixuan Zhang
- Chengkuo Lee
Nano-Micro Letters (2025)