Introduction

Optical spectrometers play a crucial role in a wide range of applications such as laser and material characterization, process monitoring, biomedical research, forensic analysis, and space applications1,2,3,4,5,6. Conventional spectrometers based on diffractive elements face a trade-off between resolution and optical throughput7. Fourier transform spectrometers (FTS) overcome this limit by eliminating the need for slits8 (Jacquinot’s advantage). Common FTS instruments consist of a moving mirror as part of a Michelson interferometer and reconstruct the spectrum in the Fourier domain by capturing the interferometer response with varying frequencies over wavelength9,10. Typical stationary spatial heterodyne FTS designs split light and combine after reflection from tilted surfaces, to simultaneously image the response of multiple interferometers onto an array of photon detectors11,12,13,14,15. Both approaches benefit from noise reduction thanks to multiple measurements16 (Fellgett’s advantage).

Raman spectroscopy is a crucial tool for material characterization, and its application in FTS, known as FT-Raman17,18,19, enables the detection of weak signals that are limited by detector noise. The weak power and broad angular range of the Raman signal can restrict its applications due to constraints in optical throughput. Additionally, when using a high-power multimode laser, the excitation spot may be large, or the power may be intentionally spread across a broader area to avoid damaging the substance. FT-Raman emerged as a solution to address these challenges, offering a large throughput. However, they were relatively bulky as Michelson interferometers with a moving mirror were employed.

A novel waveguide-based spatial heterodyne FTS was presented as a path to miniaturize optical spectrometers in 2007 by Florjańczyk et al. 20. This structure consisted of a number (\(N\)) of asymmetric Mach-Zehnder Interferometers (MZI) with progressively increasing path length differences (\(\triangle L\)) between the arms. The spectral resolution (\({\delta }_{\lambda }\)) is determined by the free spectral range (FSR) of the MZI with the longest path length, while the operational range is defined by the FSR of the entire array as \(N{\delta }_{\lambda }/2\). A great deal of research21,22,23 has been devoted to enhancing the range and resolution24,25,26,27,28,29,30,31,32,33,34 and decreasing the footprint primarily using actively controlled interferometers and resonators27,28,29,30,31,32,33,34. However, a limited amount of research has been performed on increasing the optical throughput20,35,36,37,38,39, which is vital for detecting a weak and widespread signal such as Raman scattering.

Combining waveguide-based FTS and Raman spectroscopy would bring a valuable and affordable tool for next-generation material characterization. Both the angular range and the excitation area determine the optical throughput as \(G\approx {n}^{2}S\Omega =\pi S.{{NA}}^{2}\), where n is the refractive index, \(S\) is the area, \(\Omega\) is the solid angle, and NA is the numerical aperture. Commercial Raman spectrometers typically offer comparable or an order of magnitude less throughput than a multimode fiber (MMF) with 105 µm core diameter and NA = 0.22. In contrast, a single-mode waveguide operating at 850 nm wavelength has about 1800 times less throughput (\(G\approx {\lambda }^{2}\)). Multi-aperture waveguide-based FTS, where the throughput scales linearly with the number of apertures, can provide high optical throughput and spectral resolution by using multiple interferometers with independent inputs.

The choice of waveguide material is crucial for combining Raman spectroscopy with waveguide-based FTS, as Raman spectroscopy usually operates in the visible and near-infrared (NIR) range of the spectrum. For instance, with a common Raman excitation source at 785 nm, the most relevant spectral region spans 800–950 nm. Although most waveguide-based FTS was demonstrated on Si photonics, Si is opaque at these wavelengths. Silicon’s high refractive index contrast enables dense stacking of a large number of waveguides without any significant crosstalk. Visible transparent platforms such as silica or polymer waveguides require tens of micrometers spacing between waveguides, limiting the number of interferometers to thousands per reticle. SiN has emerged as a matured waveguide material with transparency in the visible-NIR spectrum and a relatively high refractive index40 (\(n \sim 2\)). SiN allows waveguide separation of a few micrometers, enabling reticles with tens of thousands of interferometers.

It should be noted that the estimation of the number of interferometers per reticle assumes that each interferometer is as wide as a waveguide itself. However, most interferometers, such as asymmetric MZIs, require 90° bends which substantially increases their footprint. While compact spectrometers based on stationary-wave integrated FTS (SWIFTS) have been reported41,42,43,44,45, they require high-resolution imaging, active control, or a rigorous fabrication process, adding to the integration complexity.

In this work, we report the Raman spectroscopy detection of four different substances using a SiN waveguide-based FTS, fabricated on a 200 mm wafer using Complementary Metal-Oxide-Semiconductor (CMOS) fabrication compatible techniques, with 160 edge-coupled input apertures, a spectral range of 40 nm, and a resolution of 0.5 nm. The waveguide-based FTS incorporates multiple edge-couple input apertures to enhance optical throughput, which are connected to an array of novel and compact interferometers, without any branching and need for high-resolution imaging. We found that Raman spectrum reconstruction using LASSO regression improves the quality of the reconstructed spectrum by suppressing noise and artificial peaks, making it useful for scenarios with low optical throughput.

Results

Concept of the miniature Raman spectrometer and spectrum reconstruction

The work presented in this article encompasses three main stages: light collection, interferometry, and spectrum reconstruction. Figure 1 illustrates the operating principles of our miniature spectrometer. The Raman signal, filtered to match the FTS’s spectral range, was guided into individual interferometers at the edge of the chip using a cylindrical lens. The interferometers exert harmonic amplitude modulation over the spectrum, where the frequency of modulation gradually increases from the first to the last interferometer. Each interferometer consists of a single line of waveguide, making it very compact compared to the alternatives with on-chip branching.

Fig. 1
figure 1

Working principle of the miniature Raman spectrometer. The multi-aperture waveguide-based FTS (\(A\)) collects an exemplary Raman spectrum with high optical throughput. The output power from each interferometer (y) is measured, and the Raman spectrum (\(x\)) is reconstructed by solving the linear matrix equation (\({Ax}=y\)), where \(A\) represents the transform matrix of the FTS

The interferometer structure is similar to a SWIFTS in the Lippmann configuration45 as depicted in Fig. 1; however, the measurement is relatively simple and does not require high-resolution imaging. Unlike SWIFTS, which images a standing wave pattern, the principle of our spectrometer relies on imaging the output power of each interferometer which is wavelength-dependent. A vertical grating coupler (VGC) is followed by a waveguide, which is terminated by an aluminum mirror. This structure allows forward and backward propagating waves to interfere, generating a beating pattern over the VGC. The position of this pattern shifts depending on the wavelength due to the cavity’s phase accumulation, making the output power wavelength-dependent. The output power of each interferometer (y) was captured using a microscope camera. Subsequently, the Raman signal spectrum (x) was reconstructed by solving the linear matrix equation \({Ax}=y\) for \(x\), where \(A\) represents the pre-characterized transform matrix (T-matrix) of the FTS.

Using the Raman spectrum of Ibuprofen as a sample input signal, we numerically simulated the output power of the interferometer array and reconstructed the spectrum, as shown in Fig. 1. We found that the reconstructed spectrum’s features are highly sensitive to the regression method used. The transform matrix was generated from a numerical model that represents our spectrometer’s behavior. Notably, the output power variation between neighboring interferometers decreases rapidly for those with shorter FSRs. This makes high-resolution contributions of the FTS more prone to noise, creating a need for a robust regression method to enhance the accuracy of the reconstructed spectrum. Given the inherently weak Raman signal, a regression method that enhances the major Raman peaks and lowers the noise is essential.

We compared two regression techniques for the spectrum reconstruction in our numerical simulation as illustrated in Fig. 1: the Penrose-Moore pseudoinverse and the least absolute shrinkage and selection operator (LASSO). While LASSO regression successfully provided a solution with major peaks, it suppressed minor peaks along with the noise. Moreover, LASSO fragmented the major peaks into multiple narrower peaks with small gaps between them, a profile previously reported in the literature as a poor match32. To mitigate this, we applied Savitzky-Golay smoothing after LASSO regression, which improved the matching with the expected Raman spectrum.

Compact Fourier transform spectrometer design

The initial step in designing the waveguide-based FTS was determining the spectrometer’s center wavelength and range. We targeted a center wavelength of 860 nm with a spectral range of 40 nm, as the substances under testing exhibit distinct Raman spectra within this range for a 785 nm excitation wavelength. Given the desired resolution of 0.5 nm, we designed a spectrometer consisting of 160 interferometers.

Figure 2a presents a simplified layout of the waveguide-based FTS (see Supplementary Information S1 for further details). Each interferometer consists of an edge coupler (\({w}_{E}=15.0\,{\rm{\mu }}m\)), a single-mode waveguide (\({w}_{{WG}}=466\,{nm}\)) to filter out higher-order modes, a fully etched VGC, and an aluminum mirror on an expanded waveguide. We used a \({t}_{{SiN}}=150{nm}\) thick low-pressure chemical vapor deposition (LPCVD) SiN platform to mitigate both edge-coupling and propagation losses. The VGC and cavity width were set as \(8\mu m\) to control light divergence, allowing efficient power collection by the microscope system with NA = 0.14.

Fig. 2
figure 2

The multi-aperture waveguide-based Fourier transform spectrometer. a Schematic top-view of edge-coupled multi-aperture waveguide-based Fourier transform spectrometer (FTS), illustrating the operation of the ultra-compact interferometers. b Cross-sectional schematic of an interferometer, showing its back-reflector enhanced vertical grating coupler (VGC) and termination mirror. c Pictures of the 200 mm SiN photonics wafer and one of the FTS chips, with a Raman spectrum in the background. d Scanning electron micrograph of the cross-section of an interferometer, near its aluminum mirror termination

The interferometer design began by determining the longest cavity length and cavity length difference. The longest cavity length was set to \(384.565\mu m\), based on the required resolution (\({\delta }_{\lambda }=0.5{nm}\)). Consequently, the cavity length difference between neighboring waveguides (\(\Delta L=2.390\mu m\)) was determined using the Littrow wavelengths of 800 nm and 880 nm, where all interferometers give an identical response, making the spectral range of the FTS between 840 nm to 880 nm. It was assumed that the Littrow wavelength at 880 nm would blue-shift due to longer propagation in the VGC, falling within the T-matrix characterization range.

The next step involved designing the VGC. The cross-section of the interferometer cavity is depicted in Fig. 2b. The grating parameters, including the number of grating lines, pitch (\(p=567{nm}\)) and etch width (\(e=318{nm}\)), were optimized through a series of finite-difference time-domain (FDTD) simulations, using the shortest interferometer (\({L}_{0}=4.555\mu m)\) as the basis (Supplementary Figs. S1 and S2). The optimization aimed to maximize both the output power and the interference contrast. Of these parameters, the number of grating lines has the greatest impact on output power, while all parameters influence the interference contrast.

Forward and backward propagating waves form a standing wave over the VGC where spatial matching of the constructive interference with a perturbation (grating etch) maximizes the outcoupling. For optimal interference contrast, all perturbation positions must align in phase with the standing wave, implying that the VGC must couple at the normal angle, effectively functioning as a distributed Bragg reflector (DBR) as discussed in Supplementary Information S1. Back-reflection bandwidth is limited as the pitch corresponds to a second-order DBR. To optimize the output power, the back-reflected power must be minimized, which was achieved by adjusting the etch width.

To further enhance the output power, we placed an aluminum reflector beneath the VGC. The reflector distance (\(h=325{nm}\)) was designed for the normal angle, to provide a sufficient bandwidth for the intended spectral range. This distance was kept as small as possible, limited by the propagation loss, to minimize performance variations due to fabrication tolerances. The top-oxide thickness (\({t}_{{TOX}}=3325{nm}\)) was determined to optimize edge-coupler efficiency and output power. The bottom oxide thickness (\({t}_{{BOX}}=4525{nm}\)) was kept large enough to isolate the silicon substrate from the waveguide.

To improve Raman signal collection and to use TM polarization for light sheet amplitude calibration, we integrated a polarization rotating beam splitter (PRBS) into the layout. However, due to fabrication challenges, the performance for TM0 to TE0 coupling was suboptimal, while the TE0 to TE0 port remained unaffected and TM0 was largely eliminated from the propagation towards the FTS (details provided in Supplementary Fig. S3).

A 200 mm wafer based on this design, along with characterization structures, was fabricated at the United Microelectronics Center (CUMEC). Figure 2c shows an image of the fabricated wafer and a spectrometer chip with dimensions of 1.6 mm x 3.2 mm. The fabrication process was verified using multiple scanning electron micrographs (SEM). Figure 2d represents an SEM image of the cross-section of one of the interferometers near its mirror. More SEM images can be found in Supplementary Fig. S4. The measured waveguide thickness was close to the target of 150 nm, although this measurement’s accuracy highly depends on the cleaving angle of the chip. Moreover, we observed that the mirror was inclined by \({6}^{^\circ }\), which could introduce extra losses upon reflection.

Using the stack parameters obtained from the SEM images, we simulated the output response of Interferometer Number 10 and found that up to 75% of the power (1.2 dB loss), with an average of 49% of the power over the spectrum, was guided from the injected TE0 mode to the output via VGC, as shown in Supplementary Fig. S5. The interference contrast was observed to range from 3.0 dB to 9.8 dB across the spectrum.

We experimentally characterized the performance of single-mode waveguide propagation loss, PRBS, VGC, and the mirror, and observed that the results generally matched our expectations, except for the PRBS, as mentioned earlier. The structures used for characterization and the experimental results are shown in Supplementary Fig. S6. The single-mode waveguide propagation loss was measured at 1.18 dB/cm for 860 nm wavelength. PRBS measurements indicated a high loss of 6 dB for TM0 to TE0 conversion efficiency, while TE0 to TE0 transmission remained unaffected. Additionally, a loss of over 21 dB was measured for TM0 propagating straight through the PRBS, showing no contribution from TM0 mode is expected at the FTS output. The discrepancy between the simulation and measurements was attributed to variations in the SiN waveguide slab thickness. VGC diffraction and transmission efficiency ranged from -4 dB to -6 dB, in line with our expectations from FDTD simulations. Significant variations were observed due to multi-reflections between the top oxide-air interface and the VGC reflector, which are influenced by variations in the top oxide thickness. Mirror reflection efficiency was measured to have a loss ranging from 1.6 dB to 1.9 dB, due to the mirror being tilted at approximately 6° and potential surface roughness.

Characterization of the Fourier transform spectrometer

Our FTS was characterized for its wavelength response by a tunable laser source (with line-width 0.06 nm) from 805 nm to 880 nm with 0.05 nm steps, using the setup shown in Supplementary Fig. S7a. The laser light was focused into a light sheet, with a full-width half-maximum (FWHM) width of 600 µm and length of 2.9 mm, using a series of cylindrical lenses, and it was carefully aligned to the edge of the chip while monitoring the output from the VGC using a camera. Figure 3a shows a camera image of the output signal, illustrating the power variation across the interferometers for an 850 nm wavelength input. As expected, the signal was heavily modulated over the interferometers at this wavelength. A slowly varying profile, consistent with the shape of the light sheet, was observed.

Fig. 3
figure 3

Characterization results of the multi-aperture waveguide-based Fourier transform spectrometer. a Camera image of the vertical grating coupler’s outputs and the corresponding output power of the interferometers. b Measured and numerically calculated interferometer responses versus wavelength, both normalized by their respective maximum values between 805 nm and 880 nm. c Free spectral range of each interferometer, averaged over the wavelength range. d Transform matrix of the Fourier transform spectrometer, showing the response of all interferometers within the characterized wavelength range

We recorded images at each wavelength within the test range and normalized the output power from each image using a Gaussian fit and the corresponding laser power. Figure 3b displays the response of four interferometers with varying lengths, showing their respective wavelength responses. We observe interference contrast ranging from 2.5 dB to 9.5 dB. The experimental results showed good agreement with FDTD simulations and our numerical model (presented in Supplementary Information S1), where we used a genetic algorithm to match by varying the effective refractive index (\({n}_{{eff}}\)), \(h\), \({t}_{{TOX}}\) and interferometer length. A correction factor (\({L}_{{err}}\)) was applied to account for potential deviations in interferometer length due to mask misalignments. The optimized parameters were: \(h=299{nm}\), \({t}_{{TOX}}=3038{nm}\) and \({L}_{{err}}=-175{nm}\), indicating a slight shift of the mirror towards the VGC.

The mean FSR for each interferometer was calculated by applying a Fourier transform to their respective output power signal. As shown in Fig. 3c, the FSR exhibits a clear trend of decreasing from 37.53 nm to 0.51 nm (which defines \({\delta }_{\lambda }\)). The low variation from the trend indicates low phase noise.

After normalizing the output powers of each interferometer for both the laser power and the light sheet, we compiled the data into a transform matrix (\(A\)), as illustrated in Fig. 3d. The matrix was then used to reconstruct the laser and Raman spectra. We observed a Littrow wavelength at \(871.85{nm}\) and a wavelength where the consecutive interferometer responses phase shift by π (called a-Littrow for convenience) at \(832.10{nm}\), corresponding to a spectral range of \(39.75{nm}\).

The slight discrepancies between the measured range and resolution compared to the initial design specifications can be attributed to the actual SiN waveguide thickness of \({t}_{{SiN}}= \sim 136{nm}\), rather than the intended \(150{nm}\). The thickness was estimated through simulations to match the calculated (using \(\triangle L\)) effective refractive indices (\({n}_{{eff}}\)) for the measured Littrow and a-Littrow wavelengths, yielding values of 1.642 and 1.654, respectively. In our numerical model, illustrated in Fig. 3b, the \({n}_{{eff}}\) was varied around these values for the corresponding wavelength.

Laser spectrum reconstruction

We evaluated the spectrum reconstruction performance of the FTS by repeating the T-matrix measurement. Figure 4a shows the results obtained using both Penrose-Moore pseudoinverse and LASSO regression methods. For the pseudoinverse, each peak generated a mirrored peak symmetrical around the Littrow and a-Littrow wavelengths. However, these mirrored peaks exhibited smaller amplitudes due to the asymmetry of the interferometer responses. In contrast, LASSO regression effectively suppressed these mirrored peaks and narrowed the main peak, providing a broader operational range and higher resolution than expected. This is due to LASSO regression optimizing the sparsity and enhancing the peak with the best matching solution while minimizing the rest of the spectral coefficients, thus prohibiting the appearance of any artificial peaks.

Fig. 4
figure 4

Measurement results of the spectrum reconstruction of a tunable laser source. a Reconstructed spectra of the laser as it was tuned from 810 nm to 870 nm in 10 nm increments, using Penrose-Moore pseudoinverse and LASSO regression methods. b Amplitude variation of the reconstructed peak compared to the power meter read-outs. c Reconstructed spectra of the sum of output powers from two laser measurements, centered at 850 nm and separated by 0.5 nm to 0.1 nm, with 0.1 nm steps (left to right columns). The upper row shows reconstructions by Penrose-Moore pseudoinverse, while the lower row shows reconstruction using LASSO regression

To characterize the resolution, we summed the interferometer output powers for two closely spaced wavelengths with separations varying from \(0.5{nm}\) down to \(0.1{nm}\), \(0.1{nm}\) increments, and then reconstructed the spectrum of the summed response. As shown in Fig. 4c, the Penrose-Moore solution provided a resolution close to \(0.5{nm}\), consistent with the design specifications. However, the LASSO regression improved the resolution close to \(0.2{nm}\), demonstrating its ability to enhance spectral details thanks to its sparse nature25,31,32,33.

Accurate amplitude recovery across the spectrum is crucial when comparing complex spectra. Figure 4b illustrates that the amplitude of the laser peak reconstructed by the Penrose-Moore pseudoinverse varied significantly from the power meter’s readings. The LASSO regression did not show a clear nonlinear trend, although it exhibited higher noise. As expected, both regression methods failed to estimate the amplitude correctly close to the Littrow and a-Littrow wavelengths. Nonetheless, LASSO regression showed promise for a more robust reconstruction, particularly in handling complex but sparse spectra like those encountered in Raman spectroscopy (Supplementary Information S4).

Raman spectrum reconstruction

We successfully demonstrated experimental Raman spectrum reconstruction and material identification of four different substances: Paracetamol, Ibuprofen, isopropyl alcohol (IPA), and pure glucose. The experimental setup was similar to the T-matrix measurement configuration, with an additional excitation light at 785 nm, a focusing lens to illuminate the substances and collect the Raman signal, and filters to block the excitation light (see Supplementary Fig. S7d). The interferometer responses were recorded for all substances as shown in Supplementary Fig. S8.

We evaluated Raman signal reconstruction using Penrose-Moore pseudoinverse, LASSO, Ridge, and Elastic-Net regression combined with Savitzky-Golay filtering, both numerically and experimentally (Supplementary Figs. S9 and S10). All methods outperformed the pseudoinverse method, consistent with the reports in the literature25,31,32,33. Under low SNR conditions, LASSO and Elastic-Net regression performed similarly, while Ridge produced artificial peaks due to its smoothing effect, making it less suitable for material identification. Since Elastic-Net requires optimizing two hyperparameters versus LASSO’s one, we focused on LASSO regression in this article.

Figure 5 represents the Raman spectra obtained from a conventional spectrometer and those reconstructed using our FTS, using LASSO regression with Savitzky-Golay filtering. To quantify the similarity between the conventional and FTS-reconstructed spectra, we used the Pearson correlation coefficient (\(r\)), a standard technique for assessing spectral similarity46. A value of \(r > 0.7\) indicates a good match, while \(r > 0.9\) signifies a very high correlation. The similarity between the spectra was influenced by factors such as signal strength and spectral complexity47. The difference in Raman signal strength between the substances can be because of their Raman cross-section as well as their scattering efficiency i.e., solution, fine particles, or rough surfaces. IPA and Paracetamol showed the highest match, due to the spectral simplicity and higher Raman signal, respectively. Pure glucose exhibited the lowest match, which can be attributed to its rich spectrum with multiple broad peaks and low Raman signal.

Fig. 5
figure 5

Comparison of Raman spectra from a conventional spectrometer and the FTS reconstruction for four substances. a Paracetamol. b Ibuprofen. c 100% isopropyl alcohol solution. d Pure glucose powder. The Pearson correlation coefficients, indicating the similarity between each reconstructed spectrum and the conventional spectrometer measurements, are shown for each substance

The spectrum reconstruction in Fig. 5 used a set of 100 images, each captured with 10 s of integration time, for each substance to keep consistency between experiments. For Paracetamol and IPA, a single image was sufficient to reconstruct a spectrum with \(r > 0.9\) (Supplementary Fig. S11), showing a prospect for the practical use of this FTS for material identification.

In general, lower \(r\) were caused by several factors, including the appearance of artificial peaks due to spectral complexity and low optical power, and the disappearance of the baseline because of LASSO regression. Additionally, slight mismatches in peak positions and relative amplitudes were observed. The peak position discrepancies were likely influenced by the smoothing process, while variations in relative peak amplitudes may be due to noise introduced by the regression as shown earlier in Fig. 4b, and the selection of the hyperparameter, as illustrated in Supplementary Fig. S12.

It should be noted that the Savitzky-Golay filtering decreases the effective resolution, and when multiple complex spectra overlap, a need for higher spectral resolution arises. An FTS design with higher inherent resolution and a T-matrix measurement with smaller steps can be employed to boost the resolution. Alternatively, more complex spectrum reconstruction techniques, such as Elastic-Net, relying on multiple L1-norm, L2-norm, and first derivative minimization coefficients can be used25,31,32,33.

We explored concentration detection for IPA solutions as discussed in Supplementary Information S5 and shown in Supplementary Fig. S13, and observed a linear relationship with approximately 90% accuracy down to 25% IPA concentration, though the LASSO regression hyperparameter had a noticeable impact on the results. This suggests a need for further research to improve the signal-to-noise ratio.

The collection efficiency of the FTS must be further improved for an accurate concentration measurement. The current FTS collects a maximum of 3% of the Raman signal in an ideal scenario where the Raman signal is confined within the excitation spot of our laser (see Supplementary Information S4) i.e., confocal arrangement and non-scattering opaque medium.

One way of efficiency improvement involves increasing the optical throughput of the optical system. 160 apertures support a throughput that corresponds to the collection of Raman signal from a spot with a length of approximately 79 µm with an NA of 0.55 in one direction (see Supplementary Information S2). The current optical system collects the signal from an excitation spot diameter of 29 µm with NA = 0.55 considering a non-scattering opaque substance, filling only part of the throughput supported by the chip. In this case, increasing the numerical aperture of the optical system would increase the efficiency by collecting more Raman signals. However, Raman signal is not confined within the excitation spot for a diffusive medium, and it can extend several millimeters from the excitation spot48, showing that the collection optics may not be the limiting factor.

In addition to improvement in the optical system, the optical throughput of our current array is limited by only one single-mode waveguide in the perpendicular direction, allowing collection from a spot with a width of only 0.5 µm. This small collection area significantly limits the efficiency of our spectrometer. We compared the collection efficiency of the FTS presented in this article with that of a diffraction-type spectrometer (25 µm slit width, f/1.3) based on optical throughput, and estimated the FTS collection efficiency to be approximately 5.6% of that of the conventional spectrometer, as detailed in Supplementary Information S2. It was assumed that the Raman signal has lower optical throughput than the FTS along the axis where the interferometers are parallel to each other. This efficiency is highly dependent on factors such as the excitation laser spot size, sample scattering, and the optical throughput of the collection optics.

We analyzed the overall efficiency of the Raman setup from the Paracetamol substance to the camera and estimated a loss of 29.8 dB, as detailed in Supplementary Information S6. The primary source of this loss is the optical throughput mismatch between the Raman signal and the FTS, contributing 19.3 dB, highlighting the need for improvements in the FTS optical throughput. Of the losses originating from the FTS, aside from the optical throughput mismatch, the average output power over the spectral range is 49% of the power in the waveguide, leading to an additional 3.1 dB loss. Propagation loss is negligible compared to the other FTS-related losses. Additionally, only TE-polarized light propagates through the FTS, contributing 3.0 dB to the loss. Assuming that free-space optical component losses are similar between the FTS and the aforementioned diffraction-type spectrometer, the complete FTS system is approximately 1.3% as efficient as the diffraction-type spectrometer.

The advantage of the waveguide-based FTS is that it allows optical throughput to be expanded in both axes without compromising resolution. To make the optical throughput more symmetrical, one could stack multiple layers of waveguides or employ a two-dimensional array of vertical couplers such as grating couplers. Another possibility is to use a complex optical component consisting of a multiplexed prism array that transfers the throughput between axes for coupling into the current structure49.

To reconstruct a spectrum with high SNR, comparable to conventional spectrometers, the optical throughput of the FTS must be increased substantially. Number of apertures in the range of \({10}^{2}-{10}^{3}\) provides a throughput similar to a typical Raman spectrometer as aforementioned. However, the entire spectrum splitting into multiple channels degrades the system SNR due to the shot and thermal noise from multiple channels. Ignoring the photonics component losses, the splitting of the signal creates a need for \({10}^{2}-{10}^{3}\) more optical throughput47, thus requiring \({10}^{4}-{10}^{6}\) input apertures to have an SNR comparable to a conventional spectrometer.

Further improvements in reconstructed spectrum quality could also be achieved by optimizing the interferometer and VGC design. Apodized gratings, for instance, could provide better phase matching with the standing wave enhancing the interference contrast and expanding the working range of the interferometers.

In addition to improving the SNR, using a pre-modeled T-matrix can save time and reduce noise compared to measuring the T-matrix for each device. However, complex random variations exist between interferometers, such as VGC response, which is very sensitive to changes in top oxide thickness due to multiple reflections (see Supplementary Fig. S6g). The top oxide thickness varies over the FTS due to possibly being relatively thick and rough. These multiple reflections could be eliminated in the future with the use of anti-reflection coatings.

Discussion

In this work, we presented an experimental demonstration of Raman spectroscopy for various substances using a highly scalable 150 nm thick SiN waveguide platform with a novel, compact, edge-coupled FTS with 160 input apertures and interferometers. The spectral range and resolution of the FTS were characterized close to the design values as 39.75 nm and 0.51 nm, respectively. We showed that LASSO regression can enhance the resolution down to 0.2 nm while providing a relatively accurate reconstructed peak amplitude compared to the Penrose-Moore pseudoinverse.

The Pearson correlation coefficient reached up to \(r=0.95\), indicating an outstanding match with the Raman spectrum from a conventional spectrometer. We observed that a spectrum with high correlation can be reconstructed down to 10 seconds of integration time using LASSO regression combined with Savitzky-Golay filtering.

It is important to note that this work focuses on demonstrating the Raman spectroscopy capabilities of the interferometer array rather than presenting a fully miniaturized Raman spectrometer device incorporating a waveguide-based FTS. The current complete system remains bulky (see Supplementary Fig. 7d) and requires further miniaturization, including the integration of a laser, sample excitation and collection optics, excitation light filtering, an imaging system for signal analysis, and associated electronics. Future work aims to improve optical throughput by incorporating orders of magnitude more input apertures using vertical couplers, directly integrating the waveguide-based interferometers onto an imager, depositing CMOS-compatible long-pass filters over the input apertures, and developing on-chip laser integration50.

Notable high-performance miniaturized Raman spectrometers using conventional optics have been reported in the literature51,52. Ilchenko et. al.51 introduced a compact and cost-effective dispersive transmission grating-based Raman spectrometer, which is highly efficient for confocal Raman spectroscopy. However, applications such as accurate blood glucose detection53 require signal collection over a wide area, necessitating high optical throughput. Additionally, efforts have been made to miniaturize FT-Raman spectrometers, particularly for space applications52. The on-chip FTS demonstrated in this work can be integrated with such compact optical systems to further reduce the overall footprint.

To the best of our knowledge, this work represents one of the first demonstrations of an integrated photonics chip spectrometer for Raman spectroscopy. The wafer-scale CMOS-compatible fabrication of the chip allows highly scalable integration. Additionally, this structure does not require any high-resolution imaging, indicating a possibility of direct integration to a photon detector chip and paving the path for compact Raman spectrometers. We believe this research will contribute significantly to the development of wearable devices for non-invasive biomarker detection, affordable hand-held spectrometers, and lightweight instruments for space exploration.

Materials and methods

Device simulation

The output response of the vertical grating coupler (VGC) and the interferometer with the shortest cavity (\({L}_{0}=4.555\mu m\)) were simulated using the finite-difference time-domain (FDTD) method with Lumerical FDTD software. These simulations also provided the effective refractive index (\({n}_{{eff}})\). The interferometer and VGC structure were simulated in 2D and 3D for comparison. After confirming similar results through a mesh convergence test, all subsequent optimization simulations were performed in 2D, using a uniform mesh of \(5{nm}\times 5{nm}\) around the VGC region and \(28{nm}\times 5{nm}\) at the cavity region.

Simulations were conducted under four conditions: (1) with a top-oxide air interface and no VGC reflector, (2) without a top-oxide air interface and with VGC reflector, (3) with both a top-oxide air interface and VGC reflector, (4) without a top-oxide air interface and no VGC reflector. Pre-fabrication simulations assumed a waveguide thickness of \({t}_{{SiN}}=150{nm}\) and a mirror inclination angle of \({0}^{^\circ }\), while post-fabrication simulations were adjusted to match the estimated waveguide thickness and mirror inclination, \({t}_{{SiN}}=136{nm}\) and \({6}^{^\circ }\). For longer interferometers, due to the extended simulation times, a numerical model was developed instead of relying on direct FDTD simulations.

Interferometer numerical model

A detailed description of the interferometer’s working principle together with a numerical model is provided in Supplementary Information S1.

Matching the numerical model with measurement results

The effective refractive index (\({n}_{{eff}}\)) for the Littrow wavelength and the center wavelength between two Littrow wavelengths, a-Littrow, (both denoted as \({\lambda }_{l}\)) was first calculated from the measurement using the equation \({n}_{{eff}}^{l}=\frac{l.{\lambda }_{l}}{2\triangle L}\) where \(l\) is a positive integer, and \({n}_{{eff}}^{l}\) is the effective refractive index at those wavelengths. Since the T-matrix measurement covered a Littrow and an a-Littrow wavelength, we assumed a linear relationship between \({n}_{{eff}}\) and wavelength, modeled as \({n}_{{eff}}={m}^{l}.\lambda +{n}_{g}^{l}\), where the slope \({m}^{l}\) and group index \({n}_{g}^{l}\) were estimated based on the effective refractive index at the Littrow and a-Littrow wavelengths. The resulting \({n}_{{eff}}\) profile showed good agreement with that of a waveguide with a thickness of \({t}_{{SiN}}= \sim 136{nm}\), as determined by the Lumerical FDTD solver.

The mirror inclination of 6° was accounted for by retrieving the reflection amplitude coefficient through FDTD simulations. Several fabrication-induced parameter variations were considered, including the cavity length (due to mask misalignment), effective refractive index (where small variations can cause significant mismatches in longer interferometers), top-oxide thickness \({t}_{{TOX}}\), and VGC reflector distance \(h\). At each step, the Pearson correlation coefficient between the measurement results and the numerical model for a set of cavity lengths was used as the figure of merit for matching.

The matching procedure began with a sweep of the misalignment correction factor, ranging from -200 nm to 200 nm. Since the shortest cavity responses are most sensitive to length variations, the Pearson correlation coefficients for the five shortest cavities were averaged to find the best match. After identifying the optimal misalignment correction length, the effective refractive index was further optimized for longer interferometers with cavity lengths of 384.565 µm, 288.965 µm, and 193.365 µm. The optimization was performed using a genetic algorithm (from PyGad toolbox54) with 50 solutions per population, 10 parents per generation, 5 survivors, and 1000 generations with a saturation stop condition of 50 generations. The parameter search space was defined as \(m=(1\pm 0.001){m}^{l}\) and \({n}_{g}=(1\pm 0.001){n}_{g}^{l}\).

Once the effective refractive index was optimized, we performed a genetic algorithm search for the top-oxide thickness and VGC reflector distance using the same genetic algorithm settings. The Pearson correlation coefficient was averaged over interferometers with cavity lengths of 384.565 µm, 193.365 µm, 26.065 µm, and 4.555 µm. The search range for the top-oxide thickness (\({t}_{{TOX}}\)) was from 2992.5 nm to 3657.5 nm, and for the VGC reflector distance (\(h\)), it ranged from 292.5 nm to 357.5 nm.

Wafer fabrication

A 200 mm SiN PIC wafer was fabricated in United Microelectronics Center (CUMEC). Initially, a SiO2 layer was deposited on a 725 µm thick Si wafer using plasma-enhanced chemical vapor deposition (PECVD), then thinned to 2 µm by chemical mechanical polishing (CMP). A 150 nm thick SiN waveguide layer was deposited via low-pressure chemical vapor deposition (LPCVD) and patterned using two masks. The waveguide layer was dry-etched fully and to the depth of 50 nm to define the designed structures.

Next, a 325 nm SiO2 layer was deposited (PECVD followed by CMP), followed by the deposition of a 200 nm Al layer to form the VGC reflector. A further 3 µm of SiO2 (PECVD and CMP) was deposited. The device wafer was then flipped and bonded to a carrier wafer with 1.0 µm of SiO2 (PECVD) using oxide-oxide fusion bonding. The Si substrate of the device wafer was completely removed by grinding and CMP, leaving 1.5 µm of SiO2 on top.

The interferometer mirror was fabricated by dry-etching 4.1 µm of SiO2 and 150 nm of SiN, followed by the deposition of 500 nm Al layer. Finally, an additional SiO2 layer was deposited to bring the total top oxide thickness to 3.325 µm.

Transform matrix measurement

The optical setup used for T-matrix measurements is illustrated in Supplementary Fig. S7a, with a list of components provided in Supplementary Information S4. A tunable laser with laser power varying from 21.3 mW to 24.8 mW, depending on the wavelength, covering 805-880 nm range, was used to scan the interferometer responses with 0.05 nm steps. To prevent speckles in the spatial profile, the laser light was directed into an integrating sphere and then guided into free space through a short section of a multimode fiber with a 600 µm core diameter. The light was quasi-collimated by an aspherical lens (L1, f = 10 mm, NA = 0.55) and split by a beam splitter.

One beam path was imaged onto a camera via two mirrors and a lens (L5, f = 40 mm) to monitor the alignment of the fiber input. The second beam path included two cylindrical lenses (CL1, fy = 50 mm, and CL4, fy = 40 mm) arranged in a 2 f configuration, which focused and collimated the beam along the y-axis (perpendicular to the chip surface) to relay the beam and enhance the Raman spectroscopy measurements. Finally, the beam was focused along the y-axis using another cylindrical lens (CL5, fy = 10 mm).

In the axis parallel to the chip edge, two cylindrical lenses (CL2, fx = 50 mm, and CL3, fx = 10 mm) in a 2 f configuration demagnified the beam by 5x. This created a light sheet with FWHM of 0.6 mm (width) and 2.9 mm (length) for edge coupling into the chip. The chip’s output was captured by a 5x objective (O1, NA = 0.14) and imaged onto a Hamamatsu camera using a tube lens (L2, f = 200 mm) and an additional lens (L3, f = 150 mm). The camera recorded a 16-bit image for each wavelength with an integration time of 10 ms, and the same procedure was followed for the laser spectrum reconstruction measurements. The output power of each interferometer was obtained by summing the pixel values over a 5×5 pixel area. To mitigate the high-frequency noise in the interferometer responses, a low-pass filter, with a cut-off at the edge of the response peaks in the Fourier domain, was applied.

Raman spectroscopy measurements

For the Raman spectroscopy measurements, the portion of the T-matrix optical setup that included five cylindrical lenses for forming a light sheet at the chip edge was retained as shown in Supplementary Fig. S7d. A 785 nm laser beam with 537 mW power, delivered through a multimode fiber (core diameter 105 µm, NA = 0.22), was quasi-collimated by a lens (L4, f = 30 mm) and filtered with a laser clean-up filter (F1, OD > 5). The collimated beam was then directed via two mirrors and a dichroic mirror and focused onto the substance using a lens (L1, f = 10 mm, NA = 0.55), creating a spot with a 14.7 µm FWHM radius. The laser power on the substance was measured as 331 mW and was kept constant for all Raman spectrum acquisitions.

The scattered light, including the Raman signal, was collected and collimated by the same lens. A long-pass edge filter (F1, 785 nm cut off, OD > 6) was used to block the excitation light. The remaining beam was further filtered by a band-pass filter (F2, 850 nm center wavelength, 40 nm full-width half-maximum) to match the Fourier transform spectrometer’s range and eliminate any remaining excitation light.

To measure the Raman spectrum using a conventional Raman spectrometer (Wasatch Photonics), an additional beam splitter before the cylindrical lenses directed half of the Raman signal through a lens (L5, f = 40 mm) and into a multimode fiber (core diameter 105 µm, NA = 0.22). All conventional Raman spectrum measurements used a single acquisition with varying integration times for different substances. 100 ms integration time was used for Ibuprofen and Paracetamol, while 1 s and 500 ms integration times were used for glucose and IPA, respectively. When the IPA solutions with varying concentrations were measured, the integration time was kept constant at 500 ms.

The beamsplitter was removed during the chip spectrometer measurements to maximize signal power. The Raman signal was focused on the chip edge via the cylindrical lenses, and the output was recorded using a Hamamatsu camera. For each chip spectrometer measurement, 100 16-bit images with 10 s of integration times were recorded. Before each Raman measurement, background images were captured by turning the laser off. For processing, the sum of background images was subtracted from the sum of the Raman measurement images. The interferometer output power was then determined by summing the pixel values within a 5 × 5 pixels region.

All substances were placed in a glass cuvette. Both the Ibuprofen in a sustained capsule (Fenbid®) and Paracetamol (Jinzhou Jiutai Pharmaceutical) pill were crushed into fine particles. Isopropyl alcohol (2-Propanol) and Glucose (D-(+)-Glucose, ≥99.5% (GC)) were sourced from Sigma Aldrich.

Spectrum reconstruction

The interferometer values were first normalized by a Gaussian profile to account for the light sheet’s intensity distribution. Laser and Raman spectra were reconstructed from the FTS response using different regression techniques, including Penrose-Moore pseudoinverse, LASSO regression, Ridge regression, and Elastic-Net regression55. A detailed discussion of these methods and results can be found in Supplementary Information S4.

Data processing was performed in Python 3.11. Penrose-Moore pseudoinverse was calculated using the Numpy library, while the other regression techniques were implemented via scikit-learn56. Savitzky-Golay smoothing was applied using the Scipy library to improve the quality of the reconstructed spectrum.

Each regression method aimed to solve the equation \({Ax}=y\) for the unknown spectrum vector \(x\), where \(A\) is the 2D transform matrix (varying by interferometer and wavelength), and \(y\) represents the measured output power from the FTS. The challenge of finding an optimum solution arises because \(A\) is a wide rectangular matrix (\(160\times 1501\)), with a rank equal to the number of rows, making the system underconstrained with infinite possible solutions57.

The Least squares solution (\(\bar{x}={A}^{+}y\)) was calculated using the Penrose-Moore pseudoinverse, \({A}^{+}\), obtained through singular value decomposition (SVD) of \(A=U\varSigma {V}^{T}\), where \(U\) and \(V\) are unitary matrices, and \(\varSigma\) contains the square roots of the eigenvalues.

LASSO regression was used to impose sparsity and limit the coefficient magnitude, especially useful in low SNR conditions, by minimizing the sum of squared errors with an additional L1-norm penalty:

$$\mathop{\min }\limits_{x}\left\{{{\rm{||}}y-A\bar{x}{\rm{||}}}_{2}^{2}+{\lambda }^{{LASSO}}{{\rm{||}}\bar{x}{\rm{||}}}_{1}\right\}$$
(1)

where the first term represents the least square regression term and the second term is an extra penalty on the L1-norm scaled with a hyperparameter of \({\lambda }^{{LASSO}}\).

Ridge regression (Tikhonov regularization) minimizes the spectral coefficients without driving them to zero by applying a penalty on the L2-norm with a hyperparameter \({\lambda }^{{RIDGE}}\):

$$\mathop{\min }\limits_{x}\left\{{{||y}-A\bar{x}{||}}_{2}^{2}+{\lambda }^{{RIDGE}}{{||}\bar{x}{||}}_{2}\right\}$$
(2)

Elastic-Net regression combines both L1 and L2 penalties for smoothness and coefficient reduction:

$$\mathop{\min }\limits_{x}\left\{{{||y}-A\bar{x}{||}}_{2}^{2}+{\lambda }_{1}^{{EN}}{{||}\bar{x}{||}}_{2}+{\lambda }_{2}^{{EN}}{{||}\bar{x}{||}}_{2}\right\}$$
(3)

All regression methods (except the Penrose-Moore pseudoinverse) were influenced by their hyperparameters and the quality of spectrum reconstruction was further enhanced through Savitzky-Golay smoothing, whose window length and polynomial order were also optimized.

Optimization of all parameters, including regression hyperparameters and smoothing factors, was performed using a genetic algorithm (PyGad library54). The objective was to maximize the Pearson correlation coefficient between the spectrum obtained from a conventional spectrometer and the FTS-reconstructed spectrum. Each genetic algorithm ran for 100 generations with 50 members per generation and 25 parents, keeping one survivor per generation. The optimum hyperparameters \({\lambda }^{{LASSO}}\), \({\lambda }^{{RIDGE}}\) and \({\lambda }_{1}^{{EN}}\) were searched in a logarithmic range from \({10}^{-12}\) to \({10}^{8}\), due to the varying signal strength of the substances. The L1 to L2 hyperparameter ratio \({\lambda }_{1}^{{EN}}/{\lambda }_{2}^{{EN}}\) of Elastic-Net regression was optimized between 0 and 1, while the Savitzky-Golay smoothing window length was searched between 10 to 150 and polynomial orders between 3 and 8.