Compilation and scaling strategies for a silicon quantum processor with sparse two-dimensional connectivity

Crawford, O.; Cruise, J. R.; Mertig, N.; Gonzalez-Zalba, M. F.

doi:10.1038/s41534-023-00679-8

Download PDF

Article
Open access
Published: 22 February 2023

Compilation and scaling strategies for a silicon quantum processor with sparse two-dimensional connectivity

npj Quantum Information volume 9, Article number: 13 (2023) Cite this article

4527 Accesses
19 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Inspired by the challenge of scaling-up existing silicon quantum hardware, we propose a 2d spin-qubit architecture with low compilation overhead. The architecture is based on silicon nanowire split-gate transistors which form 1d chains of spin-qubits and allow the execution of two-qubit operations among neighbors. We introduce a silicon junction which can couple four nanowires into 2d arrangements via spin shuttling and Swap operations. We then propose a modular sparse 2d spin-qubit architecture with unit cells of diagonally-oriented squares with nanowires along the edges and junctions on the corners. Targeting noisy intermediate-scale quantum (NISQ) demonstrators, we show that the proposed architecture allows for compilation strategies which outperform methods for 1d chains, and exhibits favorable scaling properties which enable trading-off compilation overhead and colocation of control electronics within each square by adjusting the nanowire length. An appealing feature of the proposed architecture is its manufacturability using complementary-metal-oxide-semiconductor (CMOS) fabrication processes.

Quantum tomography of an entangled three-qubit state in silicon

Article 07 June 2021

The SpinBus architecture for scaling spin qubits with electron shuttling

Article Open access 11 June 2024

Scaling silicon-based quantum computing using CMOS technology

Article 20 December 2021

Introduction

Qubit connectivity is a primary feature of any quantum computing technology. It represents the architectural arrangement of the qubits within the quantum processor and indicates the number of qubits with which any other qubit can interact. Highly-connected structures are favorable since two-qubit gates between arbitrary qubits require a smaller gate count and hence more complex problems can be solved with lower circuit depth^{1,2,3,4,5,6,7,8,9}. On the other hand, high qubit connectivity comes at the expense of technological complexity. Therefore, scaling while maintaining high connectivity is a challenge being faced by many quantum computing technologies. All-to-all connectivity has been demonstrated by photonic¹⁰ as well as ion-trap^11,12 qubits, but the distributed nature of these technologies puts serious challenges on the path to scaling. On the other hand, solid-state systems, such as superconducting¹³ and quantum dot spin qubits¹⁴, can exhibit 2d hardware topologies, which could be compactly integrated on a chip. This allows for colocating classical support electronics and offers an alternative path to scaling.

Implementations of 2d grids with nearest neighbor connectivity are desirable as this would allow for fault-tolerant quantum computing via the surface code¹⁵ without additional gate overhead. However, scaling 2d nearest neighbor grids in solid-state systems poses substantial contact routing challenges and complicates the integration of classical support electronics in the qubit plane^16,17. Quantum-classical integration would require levels of 3d integration unseen to date¹⁸ although important progress has been made in that direction¹⁹. Therefore, solid-state platforms explore scaling with sparse 2d connectivity^20,21,22 and several works have explored optimized methods for running quantum algorithms on sparsely-connected hardware graphs^23,24,25.

Particularly for silicon spin qubits, progress has been made in the last few years demonstrating high-fidelity single- and two-qubit gates in SiGe^{26,27,28,29,30,31} and silicon metal-oxide-semiconductor (MOS) structures^32,33, as well as high-fidelity initialization and readout in SiMOS^34,35,36,37. Further, the first few-qubit processors^38,39 and architecture proposals^{18,20,40,41,42} are beginning to emerge. Next steps will be focused towards scale-up, for which the use of industrial complementary metal-oxide-semiconductor (CMOS) processes is expected to play an important role¹⁷. In particular, modules containing industry-manufactured bilinear arrays of quantum dots (QDs) using split-gate nanowire transistor technology are readily manufactured and their qubit properties are widely being tested in experiments^43,44,45,46. These modules should provide a platform to demonstrate a 1d quantum processor in silicon. One-dimensional processors allow running early noisy intermediate-scale quantum (NISQ) algorithms^23,47 or demonstrating logical qubits⁴⁸. How best to scale this technology to allow for optimized operation of quantum algorithms with minimal compilation overhead is therefore an important question.

Here, we present thoughts on how to scale QD chains of 1d split-gate transistors by combining them into 2d arrays with sparse connectivity. For this purpose, we introduce a hardware junction which enables coupling between horizontally and vertically oriented arrays of split-gate transistors via spin shuttling⁴⁹. Given this junction element, we consider possible 1d and 2d architectures and investigate their suitability for implementing NISQ algorithms with minimal compilation overhead of two-qubit operations. As a result of our consideration, we propose a modular 2d spin-qubit architecture with unit cells consisting of diagonally-oriented squares where nanowires form the edges and junctions the corners of a square. We then investigate compilation strategies for the proposed 2d architecture and demonstrate a square-root scaling of the compilation overhead, which outperforms the linear overhead of 1d devices, not only asymptotically, but also down to its smallest building block which is likely to be investigated first in future experiments. The compilation results further allow for balancing the trade-off between the compilation overhead and creating space in between the qubit modules which can be beneficial for colocation of classical control and readout electronics^20,50,51 and alleviating contact routing issues^18,41,52,53. Finally, while reducing compilation overhead is mainly useful for NISQ applications, we believe the proposed architecture should be compatible with recent proposals^42,48 to operate similar architectures at fault-tolerance. Overall, this should provide a compelling path to scaling, which could be manufactured with industrial CMOS processes.

Results

Topology of the proposed architecture

We start by introducing the hardware topology of the proposed architecture, before describing the detailed embodiment in silicon and discussing compilation strategies in subsequent sections. The core element of the architecture is a bilinear array of QDs using CMOS split-gate nanowire transistors. This gives rise to a 1d arrangement of m qubits along a line (each with an ancilla qubit for readout as will be explained later), in which neighboring qubits can be involved in two-qubit operations as visualized in Fig. 1a, where the m = 4 dots indicate qubits and the black lines indicate possible two-qubit operations among neighbors. An additional building block is provided by a junction element which can join up to four linear segments in a perpendicular manner, as depicted in Fig. 1b. In principle, a two-qubit operation can be executed between any two qubits involved in the junction as indicated by an orange line connecting the qubits. As explained in more detail in the following section, each two-qubit operation across the junction requires 6 additional spin shuttling steps. We consider the regime in which these coherent shuttling steps are fast, invoking negligible overhead as compared to the two-qubit operation. We note that each qubit can only be involved in a single two-qubit operation at a time.

The unit cell of the proposed architecture is then provided by squares, with linear segments along the edges and junctions at the corners of each square, as in Fig. 1c. To form the complete architecture, we join d_x squares in one direction and d_y in the perpendicular direction. For convenience, we define the two directions as x and y. The complete architecture and x and y directions are show in Fig. 1d. Since each square contains 4m qubits and the device consists of d_xd_y squares, each device contains

$$N=4m{d}_{{{{\rm{x}}}}}{d}_{{{{\rm{y}}}}}$$

(1)

qubits. We note that a crucial feature of the proposed architecture is the tilted orientation of squares at an angle of 45^∘ in each unit cell, which minimizes the compilation overhead as will be explained later on.

Architecture embodiment in silicon

In the following, we introduce the embodiment of the proposed architecture in silicon and the corresponding control schemes in more detail. We focus on an implementation based on electron spins hosted in gate-defined QDs, but note that a similar proposal can be described for hole spin qubits. Readers interested only in the compilation techniques are advised to skip ahead.

Architecture embodiment in silicon: split-gate submodules

The principal building block of our proposal is the split-gate transistor^45,54,55,56. It consists of an undoped silicon-on-insulator nanowire of height h and width w (typically 10 nm and 60 nm, respectively). The central part of the nanowire is gated by two metallic surface electrodes of length l_g (typically 40–60 nm) which are isolated from the channel by the gate oxide (SiO₂) of thickness t (typically 6 nm); see Fig. 2a, b. The gate stack is typically formed by 5 nm of TiN followed by 50 nm of polycrystalline silicon. The rectangular cross-section of the channel, as seen in Fig. 2b, is covered by the pair of split gates which are separated by a face-to-face distance S_gg ( ≈ 30 nm) and enable local electrostatic control of the nanowire. At deep cryogenic temperatures, when positive (negative) voltages are applied to the gates, few-electron (hole) QDs form in the top-most corners of the device due to the corner effect^57,58,59, as shown in Fig. 2b. Charges can be drawn into the QDs from charge reservoirs formed of highly doped silicon located at each side of the split gate (source and drain in a). The qubit that we consider here is the spin of a single electron (or hole) confined to one of the two corner dots whose quantization axis is defined by placing the structure in a magnetic field. The other spin is used as an ancilla for readout as described later on. Hence a split-gate defines one qubit with a readout ancilla. The architecture can be scaled up by fabricating a series of split-gate transistors placed along the axis of the silicon nanowire, as depicted in Fig. 2c. The tunnel coupling along and across the nanowire is engineered by selecting the physical dimensions of the structure. In particular, w primarily determines the coupling across the nanowire whereas S_vv determines the coupling along the nanowire. These values can be tuned globally by applying a potential to the silicon handle wafer (or back gate)⁶⁰. Fine tuning could be achieved with more advanced gate layouts including a dedicated tunnel coupling gate module above the space between QD gates. The split-gate edge-to-edge separation S_vv ( ≈ 40–60 m) is set to enable sizable exchange coupling between spins, which we use to generate two-qubit interactions. Overall, the module results in a bilinear array of QDs. Given that the two QDs of each split-gate transistor encode one qubit—one dot contains a qubit spin and the other an ancilla spin for readout—the structure embodies a one-dimensional chain of silicon spin qubits of length m, as described in the previous section.

Next, we explain control, readout and initialization of the 1d modules in detail. We base our explanation on the energy spectrum of the coupled two-spin system in the single spin basis as a function of the QD energy detuning, ϵ, and a finite magnetic field; see Fig. 2d. At large positive detuning with respect to the (11)-(20) charge hybridization point, the ground state of the system corresponds to the intradot singlet, $\left\vert S(20)\right\rangle$. Here the (nm) notation refers to the charge distribution among the two QDs, i.e., dots are occupied with n and m charges, respectively. At negative detuning, the ground state of the system is the (11) charge configuration whose spin degeneracy is broken by the external magnetic field. We consider the QDs to have a tunnel coupling energy t_s and present different g-factors due to the variability of the Si/SiO₂ interface⁶¹, which further breaks the degeneracy of the $\left\vert \uparrow \downarrow \right\rangle$ and $\left\vert \downarrow \uparrow \right\rangle$ states. Here, the first spin refers to the spin with the lower g-factor (see the Hamiltonian in Supplementary Note 1). Although not strictly necessary for the operation of the processor, we consider that larger Zeeman energy differences exist across the nanowire than along the nanowire, which could be achieved by creating an asymmetry in the gate structure. Using magnetic materials on the gate stack on one side of the split or placing the splits offset with respect to the nanowire axis⁵⁹ may produce a difference in the Meissner effect at the location of the QDs. We use the energy spectrum to describe processes involving QDs within a split-gate and QDs on the same corner of the nanowire in different splits.

Architecture embodiment in silicon: initialization

We start with the 1d module loaded with one charge in each QD. Charges can be drawn in from reservoirs at the periphery following established methods⁶². To initialize to a known spin state, QDs in each split gate are positively detuned until the system relaxes to the $\left\vert S(20)\right\rangle$ state (Fig. 2e). Then, the system is pulsed towards negative detuning adiabatically with respect to the $\left\vert \uparrow \downarrow \right\rangle$-$\left\vert S(20)\right\rangle$ coupling and non-adiabatically with respect to the $\left\vert \downarrow \downarrow \right\rangle$-$\left\vert S(20)\right\rangle$ coupling (not shown in d), to initialize the QDs in the splits in the $\left\vert \uparrow \downarrow \right\rangle$ state (Fig. 2f).

Architecture embodiment in silicon: single-qubit operations

In the case of electron spins in isotopically purified silicon QDs, spins present long coherence times (${T}_{2}^{* }\, > \,100\,\mu$s and T₂ = 28 ms)⁶³. This feature enables high control fidelity via electron spin resonance (ESR) techniques^64,65 with typical Rabi frequencies of 1 MHz and fidelity in excess of 99%³². Manipulation occurs for QDs on the same side of the nanowire which are now initialized to the $\left\vert \uparrow \downarrow \right\rangle$ state.

The ESR implementation for the proposed architecture is inspired by Ref. ⁶⁶. It requires placing the structure inside a broadband 3D microwave cavity designed to deliver an oscillatory magnetic field perpendicular to the external magnetic field at the location of the chip ^67,68. The cavity is designed to have a quality factor Q ≤ 100 to cover the range of potential qubit frequencies associated with the typical variations of the g-factor of electrons in MOS devices, Δg/g ≈ 10⁻² (Ref. ⁶⁹.) The variability, associated with spin-orbit coupling, can be minimized by applying the external magnetic field along the [100] axis⁷⁰. To reduce the number of necessary frequencies to control the system and minimize cross-talk, we make use of the Stark shift. A g-factor modulation of δg/g ≈ 10⁻³ (Ref. ⁶¹) can be obtained by gate voltage tuning and allows lumping the qubit frequencies into N = Δg/δg bins. Then N-1 frequencies are sufficient to cover the full qubit frequency range. Selected qubits are tuned in and out of resonance with the relevant frequency (f_ESRi) using gate voltage pulses at the qubit gate⁷¹. ESR allows two-axis control (X and Y gates) by controlling the duration of the gate voltage pulse and the phase of the microwave excitation.

In the case of hole spin qubits, the larger spin-orbit coupling when compared to electron spins allows for all-electrical control via individual microwave voltage signals applied directly to the relevant QD gates^72,73. The spin coherence is relatively lower, with ${T}_{2}^{* }$ values of up to 7.1 μs reported⁷⁴, but the faster Rabi frequencies, of up to 150 MHz, have enabled X and Y rotation with fidelity in excess of 99%^73,75.

Architecture embodiment in silicon: two-qubit operations

We propose engineering two-qubit gates between spins on the same corner of the nanowire by means of the spin–spin exchange interaction (Fig. 2h). The exchange interaction can be modulated electrostatically by applying a differential mode voltage on the two relevant neighboring gates that brings the system close to ϵ = 0; see Fig. 2d, h. In the limit where the differential mode voltage pulse increases the exchange coupling to a value much larger than the Zeeman energy difference between spins, a $\sqrt{{\rm{S}}{\scriptstyle{\rm{WAP}}}}$ or Swap gate can be implemented by timing the duration and depth of the interaction pulse^76,77—see Supplementary Note 2. Alternatively, when the size of the modulation is smaller, such that the exchange coupling strength remains below the Zeeman energy difference, a CPhase gate could be implemented⁶⁵ with fidelity above fault-tolerant thresholds^29,30,31. In this article, we focus on the former gate but note that the Swap gate can be synthesized from a combination of CPhase and single qubit gates.

Architecture embodiment in silicon: readout

Spin readout is based on spin-dependent tunneling from the (11) to the (20) charge configurations, i.e., Pauli spin blockade⁷⁸. More particularly, the state $\left\vert \uparrow \downarrow \right\rangle$ is allowed to tunnel to the $\left\vert S(20)\right\rangle$ state, whereas all remaining two particle spins states are blocked^18,79; see Fig. 2d, i. To detect this tunneling process, we suggest using dispersive readout⁸⁰. In each split, the gate of the ancilla is connected to a lumped-element electrical resonator which is driven at its natural frequency, f₀. At ϵ = 0, the readout point, cyclic tunneling between $\left\vert \uparrow \downarrow \right\rangle$ and $\left\vert S(20)\right\rangle$, driven by the oscillatory resonator voltage, manifests in an additional quantum capacitance that loads the resonator producing a spin-dependent frequency shift that can be readily detected with standard methods^{81,82,83,84,85}. On the other hand, at the readout point, the $\left\vert \downarrow \downarrow \right\rangle$ state does not tunnel under the oscillatory voltage of the resonator giving no signal. This difference in polarizability allows discerning the two computational states ($\left\vert \uparrow \downarrow \right\rangle$ and $\left\vert \downarrow \downarrow \right\rangle$). We note that enhanced spin-orbit coupling can promote fast relaxation between the $\left\vert \downarrow \uparrow \right\rangle$ state and the singlet^38,86 but the state is outside our computational basis. Further, enhanced spin-orbit coupling can couple the $\left\vert \downarrow \downarrow \right\rangle$ and the singlet states leading to a measurable signal at that additional anticrossing^87,88. By operating at $\bar{g}{\mu }_{{{{\rm{B}}}}}B\, > \,{t}_{{{{\rm{s}}}}}$, even in these systems readout can be performed. In this proposal, we have chosen to have a readout ancilla per qubit such as each qubit can be read with one tunnel step only. However, other arrangements may be possible in which ancillas may be used as computational qubits. To reduce the number of resonators required for readout, time-division multiplexing techniques could be implemented in conjunction with floating gate arrangements as previously described⁵² and laid out in Supplementary note 3.

Architecture embodiment in silicon: junctions

Coupling of silicon nanowire qubits in the second dimension has been proposed via floating gates⁸⁹ and microwave resonators⁹⁰. Here, we propose a hardware junction which allows for coupling 1d submodules of split-gate transistors in 2d. The junction consists of etched silicon-on-insulator in quadrangular form, as can be seen in the central region of Fig. 3a. The exact shape and gate layout of the junction can vary to accommodate different levels of interconnection, reduce cross talk, and shuttling bottlenecks (see Supplementary Note 4). Here, we present a square junction that enables connecting up to four submodules at an angle of 90^∘, 180^∘, or 270^∘. On top of the junction, we place a series of metallic gates with the same gate stack as the split-gate transistors. We propose a five-gate structure arranged in cross geometry—one central square gate flanked by four square gates of the same footprint. The characteristic dimensions of the junction are indicated in Fig. 3b, with submodule-to-edge-gate separation in ${x}^{{\prime} }$(${y}^{{\prime} }$) directions L_q (≈40 nm), the gate length in the ${x}^{{\prime} }$(${y}^{{\prime} }$) directions L_l (≈80 nm), and a junction gate-to-gate separation L_d (≈40 nm). The gate length and pitch are in line with the fabrication requirements for the 1d sub-modules. For the junction, we suggest gate contact over active, a feature available in 10 nm node technology. The junction is invariant under rotations by 90^∘.

**Fig. 3: Junction and shuttling sequence.**

The purpose of the junction is to create a shuttling path for electrons^49,91 at the edges of the 1d submodules to be moved around the junction in the ${x}^{{\prime} }$ and ${y}^{{\prime} }$ directions on demand using the appropriate gate voltages sequence. These electrons can be moved to be in exchange coupling proximity with the electrons at the edge of another submodule where a two-qubit gate will be performed. We illustrate the operation of the junction using an ${x}^{{\prime} }{y}^{{\prime} }$ coupling example in Fig. 3c: (i) Shuttle the electron under the top-rightmost gate in the left module ${x}^{{\prime} }$, to the left gate in the junction by applying a differential voltage between the two gates⁴⁹. (ii) Shuttle from the left gate to the central gate of the junction. (iii) Shuttle from the central gate to the bottom gate of the junction. (iv) Implement a two-qubit gate between the electron under the bottom gate in the junction and the electron under the left-topmost gate in the ${y}^{{\prime} }$ module, as described above. (v–vii) Shuttle the electron back following the reverse process. The corresponding abstracted hardware topology and the same shuttling sequence are presented in Fig. 3d. Although in this work we consider two-qubit interactions between spins confined to different types of QDs (a corner QD in a submodule and a planar QD in the junction), the shuttling process above can be modified to produce two-qubit interactions between planar QDs which have been demonstrated^29,30,65,76.

As discussed in Supplementary Note 5, we consider the regime in which spin shuttling operations are performed coherently and much faster than the two-qubit operation in step (iv). Any two qubits of a junction can be involved in a two-qubit operation between them and, since shuttle operations are assumed to have negligible overhead, two two-qubit operations can be run in parallel. Overall, this reasoning allows for abstracting the junction element into the simplified hardware graph depicted in Fig. 3e.

Regarding the quality of the shuttling process, it has been argued that conveyor mode shuttling could lead to a phase error of less than 10⁻³ for typical values of electron spins in MOS QDs (a correlation length on the variation of the spin splitting of 20 nm and homogeneous coherence times, ${T}_{2}^{* }\ge 20\, \mu$s) as long as the transfer occurs adiabatically with a velocity not less than 0.1 ms⁻¹ (the transfer length here is ≈400 nm)⁹².

Compilation methods

We now present compilation methods for deploying NISQ-type quantum algorithms on the proposed architecture, following standard approaches. We assume that algorithms are given by a quantum circuit which consists of initialization, followed by a sequence of one- and two-qubit operations, and readout, suitable for execution on a fully-connected device. While most of these operations are readily available on the proposed architecture, the sparse connectivity will prohibit execution of two-qubit gates between arbitrary pairs of qubits that are not directly connected by an edge of the hardware graph. To address this issue, compilation methods re-express a given quantum circuit through an equivalent circuit readily amenable to the sparse hardware topology. More specifically, before executing a given two-qubit gate, one generally applies a sequence of Swap gates to shuttle the relevant qubits along the hardware graph until they reach neighboring positions. Once neighboring positions are reached, the two-qubit gate is executed and the sequence of Swap gates is reverted to promote the relevant qubits back to their original position. In applying this compilation strategy, the resulting quantum circuit accumulates a compilation overhead characterized by the number of additional Swap gates, N_SWAP, and the increased circuit depth, N_d. Both quantities represent important quality metrics for the compilation which should be minimized as is possible to avoid additional decoherence and infidelities introduced through the additional Swap gates on NISQ devices. We note that the proposed architecture uses two $\sqrt{{\rm{S}}{\scriptstyle{\rm{WAP}}}}$ gates to implement the Swap operation.

In what follows, we focus on two major compilation scenarios: (i) The case of moving two arbitrary qubits together along the hardware graph. This covers the general case where each two-qubit operation is addressed individually and usually gives reasonable estimates for the compilation overhead. (ii) The case of rearranging qubits in an arbitrary permutation. This compilation method could also be, for example, useful for variational algorithms⁹³ from chemistry^94,95 and finance⁹⁶ which repeatedly execute some number of up to ⌊N/2⌋ two-qubit operations in parallel. For such algorithms, the compilation must efficiently permute qubits into configurations which allow for executing the relevant two-qubit operations in parallel, even on the sparse hardware topology. To implement such permutations efficiently, one repeatedly applies a layer of Swap operations during which up to ⌊N/2⌋ Swap gates are executed in parallel. Ultimately, this can result in significantly lower circuit depth than simply addressing each two-qubit operation individually. We note that being able to apply such a layer of two-qubit operations is less general than arbitrary rearrangement and thus further reductions may be possible for this specific case.

Finally, we compare our compilation methods to two important limiting cases—a 1d device which could be fabricated by joining nanowire submodules along one dimension, as depicted in Fig. 4a, and a 2d rectangular device consisting of l_x by l_y qubits with nearest-neighbor connectivity, as depicted in Fig. 4b.

Compilation methods: case I – moving two qubits together along the shortest path

The compilation method for case I (as in, e.g., Ref. ⁹⁷) requires iterating over all two-qubit operations of a given input circuit and, for each pair of qubits involved in a two-qubit operation, (a) determining the shortest path connecting the two qubits and subsequently (b) executing Swap gates to connect the qubits along the shortest path, executing the two-qubit operation and shuttling qubits back to their original position. Finding the shortest path, i.e, the distance, between a pair of qubits can efficiently be implemented in polynomial time using, e.g., Dijkstra’s algorithm⁹⁸.

The compilation overhead of this algorithm is determined by the length of the shortest path, l, connecting the pair of qubits involved in each two-qubit operation along the hardware graph. In particular, the overhead of Swap gates accumulated per two-qubit gate is given by N_SWAP = l − 1 while the increased circuit depth per two-qubit operation will be given by ${N}_{{{{\rm{d}}}}}=\lceil\frac{l-1}{2}\rceil$, assuming that Swap gates on different pairs of qubits can be applied simultaneously. To compare further the compilation overhead for different devices, we compute the average ($\bar{l}$) and maximal (${l}_{\max }$) distances between qubit pairs in a given topology. For the linear layout with N qubits, we have

$${\bar{l}}_{{{{\rm{lin}}}}}=\frac{1}{3}(N+1),\quad {l}_{{{{\rm{lin}}}},\max }=N-1.$$

(2)

For a rectangular grid of l_x by l_y qubits, we find that ${\bar{l}}_{{{{\rm{rec}}}}}=\frac{1}{3}({l}_{{{{\rm{x}}}}}+{l}_{{{{\rm{y}}}}})$ and ${l}_{{{{\rm{rec}}}},\max }={l}_{{{{\rm{x}}}}}+{l}_{{{{\rm{y}}}}}-2$ and specifically for the square grid with ${l}_{{{{\rm{x}}}}}={l}_{{{{\rm{y}}}}}=\sqrt{N}$ we have

$${\bar{l}}_{{{{\rm{rec}}}}}=\frac{2}{3}\sqrt{N},\quad {l}_{{{{\rm{rec}}}},\max }=2(\sqrt{N}-1).$$

(3)

Finally, considering the proposed architecture for d_x = d_y, the average and maximal qubit distances are

$$\begin{array}{l}{\bar{l}}_{2{{{\rm{d}}}}}=\frac{1}{45(N-1)}\sqrt{\frac{m}{N}}\left[21{N}^{2}+5\left(m+\frac{2}{m}\right)N \vphantom{\left(\sqrt{{m}^{3}}+\frac{2}{\sqrt{m}}\right)}\right.\\ \quad\quad\quad\left.-15\left(\sqrt{{m}^{3}}+\frac{2}{\sqrt{m}}\right)\sqrt{N}+34{m}^{2}+20\right],\\ \qquad\quad \simeq \frac{7}{15}\sqrt{mN},\\ \qquad\quad {\bar{l}}_{2{{{\rm{d}}}},\max }=\sqrt{mN}.\end{array}$$

(4)

For details of the derivation, see Supplementary Note 6. For the limiting case $m=\frac{N}{4}$, we see that

$${\bar{l}}_{2{{{\rm{d}}}}}=\frac{{N}^{2}}{4(N-1)},\quad {\bar{l}}_{2{{{\rm{d}}}},\max }=\frac{N}{2},$$

(5)

and thus, as expected, the device exhibits linear scaling behaviour. However, its performance is still better than that of the linear device due to the ring-shaped arrangement of qubits.

We visualize the aforementioned compilation overheads expressed via mean and maximal qubit distances in Fig. 5a, b as a function of increasing qubit number. These figures illustrate several important properties, which we comment on in the following. The linear configuration (labeled 1D) clearly exhibits a linear scaling of the compilation overhead, while the rectangular device with nearest neighbor connectivity (labeled 2D) exhibits a square-root scaling. The compilation overhead of the proposed architecture inherits the favorable square-root scaling of 2d hardware topologies, making the proposed architecture favorable over 1d devices. Interestingly, the compilation overhead of the proposed device never exceeds the compilation overhead of the linear architecture, even for the smallest devices d_x = d_y = 1, 2, 3, …. This is useful as first experimental realizations of the proposed architecture would start from small prototypes. Finally, for increasing sparsity m = 4, 8, 16, …, the compilation overhead of the proposed architecture does increase; however, it never exceeds the overhead of the linear device. This allows for balancing the trade-off between compilation overhead and creating space in between the qubit modules which can be beneficial for colocation of classical support electronics (see Supplementary Note 3).

**Fig. 5: Compilation overhead of Case I.**

Compilation methods: case II—rearranging qubits in arbitrary permutations

We now consider the cost of permuting all qubits at once, making use of sorting networks^1,2. We begin by recalling qubit permutations on linear and rectangular devices^3,4 as these underlie the compilation method for the proposed sparse architecture.

We first consider permutations on 1d linear devices with N qubits using parallel neighbor sort⁹⁹, as depicted in Fig. 6a. First, each node of the hardware graph is assigned an index v = 1, …, N in ascending order, and each qubit is labeled by its final position. Consecutive layers of Swap operations are then applied in up to N steps. For odd steps 1, 3, …, qubits on node pairs (v, u) ∈ {(1, 2), (3, 4), …} are compared and a Swap operation is applied if the final destination of qubit v is larger than the final destination of qubit u. For even steps 2, 4, …, the same process occurs for qubits in node pairs (v, u) ∈ {(2, 3), (4, 5), …}. With this method, any permutation can be implemented in a maximum of N layers leading to an increase in the circuit depth of

$${N}_{{{{\rm{d}}}}}=N.$$

(6)

The maximum number of Swap operations needed is

$${N}_{{{{\rm{SWAP}}}}}=\frac{N(N-1)}{2}.$$

(7)

It is clear that it is possible for one or two qubits to be required to move N − 1 places and thus the method here produces a result close to the worst-case minimum. An alternative is to use bubble sort or insertion sort, but the maximum depth increases to 2N − 3^1,2.

Next, we consider permutations on a rectangular device of N = l_xl_y qubits with nearest neighbor connectivity. We follow the algorithm of Ref. ⁴, which consists of the following three steps: (i) Rearrange the qubits in each column using parallel neighbor sort such that each row contains exactly one qubit with final destination in each column 1, 2, 3, …, l_x. (ii) Rearrange the qubits in each row using parallel neighbor sort such that all qubits are in the correct column. (iii) Rearrange the qubits in each column using parallel neighbor sort such that each qubit is in the correct final location. We note that column and row can also be interchanged in the above steps. An example visualizing the method is shown in Fig. 6b.

Once step (i) provides an arrangement such that each row contains exactly one qubit with final destination in column 1, 2, 3, …, l_x, an implementation of steps (ii) and (iii) using parallel neighbor sort is simple. The challenge is to see that an efficient implementation of step (i) is always possible. This was shown in Refs. ^3,4 using Hall’s matching theorem¹⁰⁰ and will be discussed in more detail in its adaption to the proposed architecture with sparse 2d connectivity below. The overhead of the discussed method originates from consecutively using parallel neighbor sort on (i) columns (ii) rows and (iii) columns. This results in a maximum of

$${N}_{{{{\rm{d}}}}}=2{l}_{{{{\rm{y}}}}}+{l}_{{{{\rm{x}}}}}$$

(8)

layers of Swap gates and

$${N}_{{{{\rm{SWAP}}}}}=\frac{1}{2}{l}_{{{{\rm{x}}}}}{l}_{{{{\rm{y}}}}}({l}_{{{{\rm{x}}}}}+2{l}_{{{{\rm{y}}}}}-3)$$

(9)

total Swap gates. Specifically, for the square ${l}_{{{{\rm{y}}}}}={l}_{{{{\rm{x}}}}}=\sqrt{N}$, this gives

$${N}_{{{{\rm{d}}}}}=3\sqrt{N}$$

(10)

and

$${N}_{{{{\rm{SWAP}}}}}=\frac{3}{2}N(\sqrt{N}-1).$$

(11)

We now extend the method of Ref. ⁴ to the proposed architecture with sparse 2d connectivity. The method is based on the definition of generalized rows and columns as depicted in Fig. 7a, b, respectively. In essence, this results in 2d_y rows and 2d_x columns with 2md_x and 2md_y qubits respectively, where any given row and column share m qubits. Having defined these generalized rows and columns, we note that every qubit can conveniently be labeled by a combination of its column index, a_x = 1, …, 2d_x, and its vertical position, y. Equivalently, we could choose to label a qubit by its row index, a_y and its horizontal position, x. The coordinates x and y can be seen in Fig. 1d. With generalized rows and columns in place, we describe the corresponding compilation algorithm for implementing qubit permutations. Each qubit, in its initial location, carries a label (a_x, y), indicating its final position. An arbitrary qubit permutation can then be implemented using the following three steps: (i) Rearrange the qubits in each generalized column using parallel neighbor sort such that each set of qubits with fixed coordinate y contains exactly one qubit with final destination in each column a_x = 1, 2, 3, …, 2d_x. (ii) Rearrange the qubits in each generalized row using parallel neighbor sort such that all qubits are moved into the correct column according to their column index a_x. (iii) Rearrange the qubits in each generalized column using parallel neighbor sort such that each qubit is in the correct y location along the columns. An example visualizing the method is given in Fig. 8. We note again that, once step (i) provides an arrangement such that each set of qubits with vertical position y contains exactly one qubit with final destination in column a_x = 1, 2, 3, …, 2d_x, an implementation of steps (ii) and (iii) using parallel neighbor sort is straightforward. The challenge is again to see that an efficient implementation of step (i) is always possible. The procedure to achieve this is illustrated in Fig. 8e–g and shall now be explained in more detail. To begin with, a bipartite graph of 4d_x nodes is constructed, l ∈ 1, …, 2d_x on the left and 2d_x nodes r ∈ 1, …, 2d_x on the right. Nodes on the left indicate columns in which a qubit is located initially. Nodes on the right indicate columns to which a qubit should be routed. To build the graph, one adds an edge (l, r) to the bipartite graph for each qubit initially located in column l and having final destination in column r. An example is given in Fig. 8e. We note that, since we have 2md_y qubits located in each column initially and since we will have 2md_y qubits with final destination located in each column, the bipartite graph has 2md_y edges incident to each node. Since some qubits may originate and end up in the same column, the bipartite graph can have multiple edges connecting the same nodes.

**Fig. 7: Labeling of the proposed architecture.**

**Fig. 8: Qubit permutation on proposed architecture with d_x = 2, d_y = 1, and m = 2.**

Next, to determine how qubits should be arranged along columns in step (i), one extracts 2md_y perfect matchings ${{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}$ with y = 1, …, 2md_y from the bipartite graph. A perfect matching ${{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}$ is a set of edges such that each node of the bipartite graph is connected to exactly one edge of the matching. See Fig. 8g for examples showing one possible set of perfect matchings for the bipartite graph in Fig. 8e. Finding 2md_y perfect matchings for the given type of bipartite graph is always possible due to Hall’s matching theorem¹⁰⁰. In the bipartite graphs that arise due to our compilation problem, each node has the same number of incident edges, a sufficient condition for Hall’s matching theorem to hold. Matchings can efficiently be found using the Ford-Fulkerson algorithm⁹⁸. To this end, one attaches virtual nodes s and t to all nodes on the left and right of the bipartite graph, respectively, and determines a minimal network flow from s to t; see Fig. 8f. Finding a minimal flow configuration reveals one matching at a time. Successively removing edges of a matching from the bipartite graph and repeatedly running the Ford-Fulkerson algorithm will reveal all matchings ${{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}$ with y = 1, …, 2md_y, as visualized in Fig. 8g. Finally, to route qubits in step (i), one selects qubits which move to vertical position y by iterating over the edges of the yth matching $(l,r)\in {{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}$. Here, each edge (l, r) signifies that in the generalized column l a qubit destined for the generalized column r should be moved to vertical position y. Since edges in each set of matchings ${{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}$ point to exactly one final destination per vertical position y, the fact that 2md_y matchings exist ensures that step (i) arranges the qubits in each column so that each row contains exactly m qubits with final destination in column 1, 2, 3, …, 2md_x.

We close this section by evaluating the compilation overhead. Considering the succession of parallel neighbor sorts along (i) generalized columns of length 2md_y, (ii) generalized rows of length 2md_x, and (iii) again generalized columns of length 2md_y, the maximum number of required layers of Swap gates is given by

$${N}_{{{{\rm{d}}}}}=4m{d}_{{{{\rm{y}}}}}+2m{d}_{{{{\rm{x}}}}}$$

(12)

and the maximum total number of Swap gates by

$${N}_{{{{\rm{SWAP}}}}}=2m{d}_{{{{\rm{x}}}}}{d}_{{{{\rm{y}}}}}(2m{d}_{{{{\rm{x}}}}}+4m{d}_{{{{\rm{y}}}}}-3).$$

(13)

For the specific case d_x = d_y = d, and using Eq. (1), we have

$${N}_{{{{\rm{d}}}}}=6md=3\sqrt{mN}$$

(14)

and

$${N}_{{{{\rm{SWAP}}}}}=6m{d}^{2}(2md-1)=\frac{3}{2}N(\sqrt{mN}-1).$$

(15)

We compare the compilation overhead of the proposed architecture with sparse 2d connectivity to the linear and rectangular hardware graphs in Fig. 9. Again, the linear configuration (labeled 1D) exhibits a linear scaling of the compilation overhead, while the rectangular device with nearest neighbor connectivity (labeled 2D) exhibits a square-root scaling. The compilation overhead of the proposed architecture inherits the favorable square-root scaling of 2d hardware topologies, making the proposed architecture favorable over 1d devices. Interestingly, the compilation overhead of the proposed device never exceeds the compilation overhead of the linear architecture for d_x = d_y ≥ 2. For device structures with d_x = d_y = 1, the compilation overhead of the proposed architecture can be reduced by recognizing that the rearrangement of the sparse device can always be handled with a single parallel neighbor sort, by arranging the qubits along a single line. This ensures that the compilation overhead for the proposed architecture never exceeds the compilation overhead of the 1d architecture even for the smallest devices. This is useful as first experimental realizations of the proposed architecture would start from small prototypes. Finally, for increasing sparsity m = 4, 8, 16, …, the compilation overhead of the proposed architecture does increase; however, it again never exceeds the overhead of the linear device. This allows for balancing between compilation overhead and creating space in between the qubit modules which can be beneficial for colocation of classical support electronics.

Fig. 9: Compilation overhead for arbitrary rearrangement of qubits given by maximum number of layers of Swap gates as a function of the number of qubits for (1d) linear qubit chains, (2d) rectangular nearest-neighbor grids and the proposed architecture for d_x = d_y = d with m = 4, 8 and 16.

Discussion

Inspired by the challenge of scaling up existing silicon quantum hardware, we investigate compilation strategies for sparsely-connected 2d qubit arrangements and propose a spin-qubit architecture with low compilation overhead. Our considerations are inspired by silicon nanowire split-gate transistors which form finite 1d chains of spin-qubits, allowing for the execution of two-qubit operations such as Swap gates among neighbors. Adding to this, we describe a silicon junction which can couple up to four nanowires at one end into 1d or 2d arrangements via spin shuttling and Swap operations. Given these hardware elements, we propose a modular 2d spin-qubit architecture with unit cells consisting of diagonally-oriented squares with nanowires along the edges and junctions at the corners.

The junction geometry opens space between modules to route the gate lines and/or to place cryogenic classical electronics in the quantum processor plane^20,50. Fabricating the qubits and the classical control layer using the same technology is appealing because it will facilitate the integration process, improving feedback speeds in error-correction protocols, and offer potential solutions to wiring and layout challenges^{16,101,102,103,104,105}. Integrating classical and quantum devices monolithically, using CMOS processes, enables the quantum processor to profit from the most mature industrial technology for the fabrication of large-scale circuits⁵¹. We show that this architecture allows for compilation strategies which inherit the favorable square-root scaling of compilation overhead in 2d structure and outperform the best in class compilation strategy of 1d chains, not only asymptotically, but also down to the minimal structure of a single square. This result shows that scaling silicon nanowires into 2d structures will have benefits early on, even in experimental demonstrations of the smallest prototypes, thus encouraging building the proposed junction element and expanding silicon architectures into 2d arrangements.

Going beyond the proposed architecture for spin qubits, the proposed method of generalized rows and columns can be applied to many other 2d architectures. In some cases, where rows and columns are not obvious, this could be made to work by introducing branches along a column and row, as illustrated in Fig. 10. As branches lead to three-way junctions within a row and/or column the overhead may, of course, be different. Overall, this shows that the method of generalized rows and columns is a useful compilation strategy not only for spin qubits but also many other architectures such as, for example, IBM’s heavy-hex for superconducting qubits.

**Fig. 10: Labeling of an alternative sparsely-connected device.**

We wished to present compilation methods for the device for two quite general situations for which we could obtain an upper bound on the number of layers of SWAP gates to allow comparison to other architectures. The compilation strategies presented here act to demonstrate the square-root scaling in overhead due to routing for the sparsely-connected device, and the advantage of using this device over one with a 1d structure. Many other architecture-aware compilation methods exist and show good results [e.g.,^5,6,7,8,106]. Circuit re-synthesis⁹ provides another option. Alternatively, it may be beneficial to consider a compilation method designed with the desired algorithm in mind [e.g.,^25,107]. However, the distance between qubits will clearly affect the overhead introduced through such compilation methods, and thus the results presented here provide some indication of their likely performance.

In exploring strategies to reduce the compilation overhead we focus on the utility of the proposed architecture for the NISQ era. In that, future investigations should be directed to exploring the utility of the proposed architecture for fault-tolerant quantum computation. Positive indications towards this long-term goal are provided by recent work considering error correction along a line⁴⁸ as well as the possibility of adapting the ideas of the spider-web array⁴² and operating each nanowire with a single-qubit connected in an angled rectangular lattice with highly coherent shuttle operations.

Finally, while our work provides thoughts on how to scale one-dimensional QDs into a sparse two-dimensional array, it leaves fundamental questions of feasibility to future research. Firstly, if operated for near-term applications, how many SWAP and shuttling operations could the architecture tolerate, before the results become inaccurate? Recent work investigating the feasibility of VQE algorithms in the presence of noise indicates the need for two-qubit gate fidelity well above the fault-tolerance threshold¹⁰⁸. As such, the proposed architecture, which adds overhead through SWAP and shuttling operations, will likely require even higher two-qubit fidelities to become a scalable solution for NISQ algorithms. Similarly, if operated in a manner like Ref. ⁴², what fidelity would the individual shuttling operations need to achieve for overall fidelities to be above the fault-tolerance threshold? Recent work investigating surface codes on sparser grid-like architectures¹⁰⁹ indicates that this will likely require gate and shuttling fidelities higher than those of the standard surface code, but it remains subject of future investigations to determine these fidelities. Finally, is it feasible to operate the one-dimensional lines of quantum dots as one perpetual qubit in the manner of Ref. ⁴⁸? If so, would it then be feasible to exchange information between such qubits via the tunneling junctions? All these questions remain the subject of future work.

Data availability

No datasets were generated or analyzed during the current study.

Code availability

No code was developed to obtain the results in this paper.

References

Hirata, Y., Nakanishi, M., Yamashita, S. & Nakashima, Y. An efficient conversion of quantum circuits to a linear nearest neighbor architecture. Quantum Inform. Comp. 11, 142–166 (2011).
MathSciNet MATH Google Scholar
Beals, R. et al. Efficient distributed quantum computing. Proc. R. Soc. A: Math., Phys.Eng. Sci. 469, 20120686 (2013).
Article ADS MathSciNet MATH Google Scholar
Brierley, S. Efficient implementation of quantum circuits with limited qubit interactions. Quantum Info. Comput. 17, 1096–1104 (2017).
MathSciNet Google Scholar
Steiger, D. S., Häner, T. & Troyer, M. Advantages of a modular high-level quantum programming framework. Microprocess. Microsy. 66, 81–89 (2019).
Article Google Scholar
Zulehner, A., Paler, A. & Wille, R. An efficient methodology for mapping quantum circuits to the IBM QX architectures. IEEE T. Comput. Aid. D. 38, 1226–1236 (2018).
Article Google Scholar
Childs, A. M., Schoute, E. & Unsal, C. M. Circuit Transformations for Quantum Architectures. In van Dam, W. & Mancinska, L. (eds.) 14th Conference on the Theory of Quantum Computation, Communication and Cryptography (TQC 2019), vol. 135 of Leibniz International Proceedings in Informatics (LIPIcs), 3:1–3:24 (Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany, 2019).
Cowtan, A. et al. On the Qubit Routing Problem. In van Dam, W. & Mancinska, L. (eds.) 14th Conference on the Theory of Quantum Computation, Communication and Cryptography (TQC 2019), vol. 135 of Leibniz International Proceedings in Informatics (LIPIcs), 5:1–5:32 (Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany, 2019).
Lao, L., van Someren, H., Ashraf, I. & Almudever, C. G. Timing and resource-aware mapping of quantum circuits to superconducting processors. IEEE T. Comput. Aid. D. 41, 359–371 (2021).
Gheorghiu, V., Huang, J., Li, S. M., Mosca, M. & Mukhopadhyay, P. Reducing the CNOT count for Clifford+ T circuits on NISQ architectures. IEEE T. Comput. Aid. D. https://doi.org/10.1109/TCAD.2022.3213210 (2022).
Zhong, H.-S. et al. Quantum computational advantage using photons. Science 370, 1460–1463 (2020).
Article ADS Google Scholar
Linke, N. M. et al. Experimental comparison of two quantum computing architectures. Proc. Natl Acad. Sci. USA 114, 3305–3310 (2017).
Article ADS Google Scholar
Pino, J. M. et al. Demonstration of the trapped-ion quantum CCD computer architecture. Nature 592, 209–213 (2021).
Article ADS Google Scholar
Arute, F. et al. Quantum supremacy using a programmable superconducting processor. Nature 574, 505–510 (2019).
Article ADS Google Scholar
Hendrickx, N. W. et al. A four-qubit germanium quantum processor. Nature 591, 580 – 585 (2021).
Article Google Scholar
Fowler, A. G., Mariantoni, M., Martinis, J. M. & Cleland, A. N. Surface codes: Towards practical large-scale quantum computation. Phys. Rev. A 86, 32324 (2012).
Article ADS Google Scholar
Charbon, E. et al. Cryo-CMOS for quantum computing. In 2016 IEEE International Electron Devices Meeting (IEDM), 13.5.1–13.5.4 (2016).
Gonzalez-Zalba, M. F. et al. Scaling silicon-based quantum computing using cmos technology. Nat. Electron. 4, 872–884 (2021).
Article Google Scholar
Veldhorst, M., Eenink, H. G. J., Yang, C. H. & Dzurak, A. S. Silicon CMOS architecture for a spin-based quantum computer. Nat. Commun. 8, 1766 (2017).
Article ADS Google Scholar
Ibm unveils breakthrough 127-qubit quantum processor. https://newsroom.ibm.com/2021-11-16-IBM-Unveils-Breakthrough-127-Qubit-Quantum-Processor.
Vandersypen, L. M. K. et al. Interfacing spin qubits in quantum dots and donors–hot, dense, and coherent. NPJ Quantum Inf. 3, 34 (2017).
Article ADS Google Scholar
Buonacorsi, B. et al. Network architecture for a topological quantum computer in silicon. Quantum Sci. Technol. 4, 025003 (2019).
Article ADS Google Scholar
Nation, P., Paik, H., Cross, A. & Nazario, Z. The IBM Quantum heavy hex lattice. https://research.ibm.com/blog/heavy-hex-lattice (2021).
Kivlichan, I. D. et al. Quantum simulation of electronic structure with linear depth and connectivity. Phys. Rev. Lett. 120, 110501 (2018).
Article ADS MathSciNet Google Scholar
Guerreschi, G. G. & Park, J. Two-step approach to scheduling quantum circuits. Quantum Sci. Technol. 3, 045003 (2018).
Article ADS Google Scholar
Holmes, A., Johri, S., Guerreschi, G. G., Clarke, J. S. & Matsuura, A. Y. Impact of qubit connectivity on quantum algorithm performance. Quantum Sci. Technol. 5, 025009 (2020).
Article ADS Google Scholar
Yoneda, J. et al. A quantum-dot spin qubit with coherence limited by charge noise and fidelity higher than 99.9%. Nat. Nanotechnol. 13, 102–106 (2017).
Article ADS Google Scholar
Zajac, D. M. et al. Resonantly driven CNOT gate for electron spins. Science 359, 439–442 (2017).
Article ADS MathSciNet MATH Google Scholar
Watson, T. F. et al. A programmable two-qubit quantum processor in silicon. Nature 555, 633–637 (2018).
Article ADS Google Scholar
Xue, X. et al. Quantum logic with spin qubits crossing the surface code threshold. Nature 601, 343–347 (2022).
Article ADS Google Scholar
Noiri, A. et al. Fast universal quantum gate above the fault-tolerance threshold in silicon. Nature 601, 338–342 (2022).
Article ADS Google Scholar
Mills, A. R. et al. Two-qubit silicon quantum processor with operation fidelity exceeding 99%. Sci. Adv. 8, eabn5130 (2022).
Article Google Scholar
Yang, C. H. et al. Silicon qubit fidelities approaching incoherent noise limits via pulse engineering. Nat. Electron. 2, 151–158 (2019).
Article Google Scholar
Huang, W. et al. Fidelity benchmarks for two-qubit gates in silicon. Nature 569, 532–536 (2019).
Article ADS Google Scholar
Harvey-Collard, P. et al. High-fidelity single-shot readout for a spin qubit via an enhanced latching mechanism. Phys. Rev. X 8, 021046 (2018).
Google Scholar
Urdampilleta, M. et al. Gate-based high fidelity spin readout in a CMOS device. Nat. Nanotechnol. 14, 737–741 (2019).
Article Google Scholar
Johnson, M. A. et al. Beating the thermal limit of qubit initialization with a bayesian maxwell’s demon. Phys. Rev. X 12, 041008 (2022).
Google Scholar
Oakes, G. et al. Fast high-fidelity single-shot readout of spins in silicon using a single-electron box. Preprint at https://arxiv.org/abs/2203.06608 (2022).
Philips, S. G. J. et al. Universal control of a six-qubit quantum processor in silicon. Nature 609, 919–924 (2022).
Article ADS Google Scholar
Takeda, K., Noiri, A., Nakajima, T., Kobayashi, T. & Tarucha, S. Quantum error correction with silicon spin qubits. Nature 608, 682–686 (2022).
Article ADS Google Scholar
Taylor, J. M. et al. Fault-tolerant architecture for quantum computation using electrically controlled semiconductor spins. Nat. Phys. 1, 177–183 (2005).
Article Google Scholar
Li, R. et al. A crossbar network for silicon quantum dot qubits. Sci. Adv. 4, eaar3960 (2018).
Article ADS Google Scholar
Boter, J. M. et al. Spiderweb array: a sparse spin-qubit array. Phys. Rev. Appl. 18, 024053 (2022).
Article ADS Google Scholar
Betz, A. et al. Reconfigurable quadruple quantum dots in a silicon nanowire transistor. Appl. Phys. Lett. 108, 203108 (2016).
Article ADS Google Scholar
Hutin, L. et al. Gate reflectometry for probing charge and spin states in linear Si MOS split-gate arrays. In 2019 IEEE International Electron Devices Meeting (IEDM), 37–7 (2019).
Ansaloni, F. et al. Single-electron operations in a foundry-fabricated array of quantum dots. Nat. Commun. 11, 6399 (2020).
Article ADS Google Scholar
Chanrion, E. et al. Charge detection in an array of CMOS quantum dots. Phys. Rev. Appl. 14, 024066 (2020).
Article ADS Google Scholar
Cai, Z. Resource estimation for quantum variational simulations of the hubbard model. Phys. Rev. Appl. 14, 014059 (2020).
Article ADS Google Scholar
Jones, C. et al. Logical qubit in a linear array of semiconductor quantum dots. Phys. Rev. X 8, 021058 (2018).
Google Scholar
Yoneda, J. et al. Coherent spin qubit transport in silicon. Nat. Commun. 12, 4114 (2021).
Article ADS Google Scholar
Boter, J. M. et al. A sparse spin qubit array with integrated control electronics. In 2019 IEEE International Electron Devices Meeting (IEDM), 31–4 (IEEE, 2019).
Ruffino, A. et al. A cryo-CMOS chip that integrates silicon quantum dots and multiplexed dispersive readout electronics. Nat. Electron. 5.1, 53–59 (2021).
Schaal, S. et al. A CMOS dynamic random access architecture for radio-frequency readout of quantum devices. Nat. Electron. 2, 236–242 (2019).
Article Google Scholar
Pauka, S. J. et al. A cryogenic CMOS chip for generating control signals for multiple qubits. Nat. Electron. 4, 64–70 (2021).
Article Google Scholar
Roche, B. et al. A tunable, dual mode field-effect or single electron transistor. Appl. Phys. Lett. 100, 32103–32107 (2012).
Article Google Scholar
Lundberg, T. et al. Spin quintet in a silicon double quantum dot: Spin blockade and relaxation. Phys. Rev. X 10, 041010 (2020).
Google Scholar
Ciriano-Tejel, V. N. et al. Spin readout of a CMOS quantum dot by gate reflectometry and spin-dependent tunneling. PRX Quantum 2, 010353 (2021).
Article Google Scholar
Voisin, B. et al. Few-electron edge-state quantum dots in a silicon nanowire field-effect transistor. Nano Lett. 14, 2094–2098 (2014).
Article ADS Google Scholar
Ibberson, D. J. et al. Electric-field tuning of the valley splitting in silicon corner dots. Appl. Phys. Lett. 113, 53104 (2018).
Article Google Scholar
Ibberson, D. J. et al. Large dispersive interaction between a CMOS double quantum dot and microwave photons. PRX Quantum 2, 020315 (2021).
Article ADS Google Scholar
Bohuslavskyi, H. et al. Reflectometry of charge transitions in a silicon quadruple dot. Preprint at https://arxiv.org/abs/2012.04791 (2020).
Huang, W., Veldhorst, M., Zimmerman, N. M., Dzurak, A. S. & Culcer, D. Electrically driven spin qubit based on valley mixing. Phys. Rev. B 95, 75403 (2017).
Article ADS Google Scholar
Volk, C. et al. Loading a quantum-dot based “Qubyte” register. NPJ Quantum Inf. 5, 29 (2019).
Article ADS Google Scholar
Veldhorst, M. et al. An addressable quantum dot qubit with fault-tolerant control-fidelity. Nat. Nano 9, 981–985 (2014).
Article Google Scholar
Pla, J. J. et al. High-fidelity readout and control of a nuclear spin qubit in silicon. Nature 496, 334–338 (2013).
Article ADS Google Scholar
Veldhorst, M. et al. A two-qubit logic gate in silicon. Nature 526, 410–414 (2015).
Article ADS Google Scholar
Fogarty, M. A. Silicon edge-dot architecture for quantum computing with global control and integrated trimming. Preprint at https://arxiv.org/abs/2208.09172 (2022).
Vahapoglu, E. et al. Single-electron spin resonance in a nanoelectronic device using a global field. Sci. Adv. 7, eabg9158 (2021).
Article ADS Google Scholar
Vahapoglu, E. et al. Coherent control of electron spin qubits in silicon using a global field. NPJ Quantum Inf. 8, 1–6 (2022).
Article Google Scholar
Ferdous, R. et al. Valley dependent anisotropic spin splitting in silicon quantum dots. NPJ Quantum Inf. 4, 26 (2018).
Article ADS Google Scholar
Tanttu, T. et al. Controlling spin-orbit interactions in silicon quantum dots using magnetic field direction. Phys. Rev. X 9, 021028 (2019).
Google Scholar
Laucht, A. et al. Electrically controlling single-spin qubits in a continuous microwave field. Sci. Adv. 1, e1500022 (2015).
Article ADS Google Scholar
Maurand, R. et al. A CMOS silicon spin qubit. Nat. Commun. 7, 1–6 (2016).
Article Google Scholar
Camenzind, L. C. et al. A hole spin qubit in a fin field-effect transistor above 4 kelvin. Nat. Electron. 5, 178–183 (2022).
Article Google Scholar
Piot, N. et al. A single hole spin with enhanced coherence in natural silicon. Nat. Nanotechnol. 17.10, 1072–1077 (2022).
Bosco, S., Hetényi, B. & Loss, D. Hole spin qubits in Si FinFETs with fully tunable spin-orbit coupling and sweet spots for charge noise. PRX Quantum 2, 010348 (2021).
Article Google Scholar
Maune, B. M. et al. Coherent singlet-triplet oscillations in a silicon-based double quantum dot. Nature 481, 344–347 (2012).
Article ADS Google Scholar
He, Y. et al. A two-qubit gate between phosphorus donor electrons in silicon. Nature 571, 371–375 (2019).
Article ADS Google Scholar
Ono, K., Austing, D. G., Tokura, Y. & Tarucha, S. Current rectification by pauli exclusion in a weakly coupled double quantum dot system. Science 297, 1313–1317 (2002).
Article ADS Google Scholar
Zhao, R. et al. Single-spin qubits in isotopically enriched silicon at low magnetic field. Nat. Commun. 10, 5500 (2019).
Article ADS Google Scholar
Gonzalez-Zalba, M. F., Barraud, S., Ferguson, A. J. & Betz, A. C. Probing the limits of gate-based charge sensing. Nat. Commun. 6, 6084 (2015).
Article ADS Google Scholar
Mizuta, R., Otxoa, R., Betz, A. & Gonzalez-Zalba, M. Quantum and tunneling capacitance in charge and spin qubits. Phys. Rev. B 95, 045414 (2017).
Article ADS Google Scholar
Pakkiam, P. et al. Single-shot single-gate rf spin readout in silicon. Phys. Rev. X 8, 41032 (2018).
Google Scholar
West, A. et al. Gate-based single-shot readout of spins in silicon. Nat. Nanotechnol. 14, 437–441 (2019).
Article ADS Google Scholar
Zheng, G. et al. Rapid gate-based spin read-out in silicon using an on-chip resonator. Nat. Nanotechnol. 14, 742–746 (2019).
Article ADS Google Scholar
Crippa, A. et al. Gate-reflectometry dispersive readout and coherent control of a spin qubit in silicon. Nat. Commun. 10, 2776 (2019).
Article ADS Google Scholar
Seedhouse, A. E. et al. Pauli blockade in silicon quantum dots with spin-orbit control. PRX Quantum 2, 010303 (2021).
Article Google Scholar
Ezzouch, R. et al. Dispersively probed microwave spectroscopy of a silicon hole double quantum dot. Phys. Rev. Appl. 16, 034031 (2021).
Article ADS Google Scholar
Lundberg, T. et al. Non-reciprocal Pauli spin blockade in a silicon double quantum dot. Preprint at https://arxiv.org/abs/2110.09842 (2021).
Duan, J. et al. Remote capacitive sensing in two-dimensional quantum-dot arrays. Nano Lett. 20, 7123–7128 (2020). PMID: 32946244.
Article ADS Google Scholar
Yu, C. X. et al. Strong coupling between a photon and a hole spin in silicon. Preprint at https://arxiv.org/abs/2206.14082 (2022).
Mills, A. R. et al. Shuttling a single charge across a one-dimensional array of silicon quantum dots. Nat. Commun. 10, 1063 (2019).
Article ADS Google Scholar
Langrock, V. et al. Blueprint of a scalable spin qubit shuttle device for coherent mid-range qubit transfer in disordered Si/SiGe/SiO₂. Preprint at https://arxiv.org/abs/2202.11793 (2022).
Cerezo, M. et al. Variational quantum algorithms. Nat. Rev. Phys. 3, 625–644 (2021).
Article Google Scholar
Peruzzo, A. et al. A variational eigenvalue solver on a photonic quantum processor. Nat. Commun. 5, 4213 (2014).
Article ADS Google Scholar
Fedorov, D. A., Peng, B., Govind, N. & Alexeev, Y. VQE method: a short survey and recent developments. Mater. Theory 6, 1–21 (2022).
Article ADS Google Scholar
Orús, R., Mugel, S. & Lizaso, E. Quantum computing for finance: overview and prospects. Rev. Phys. 4, 100028 (2019).
Article Google Scholar
Cross, A. W., Bishop, L. S., Sheldon, S., Nation, P. D. & Gambetta, J. M. Validating quantum computers using randomized model circuits. Phys. Rev. A 100, 032328 (2019).
Article ADS Google Scholar
Cormen, T. H., Leiserson, C. E., Rivest, R. L. & Stein, C. Introduction to Algorithms (MIT Press, 2009), 3rd edn.
Habermann, A. N. Parallel neighbor-sort (or the glory of the induction principle) (1972).
Hall, P. On representatives of subsets. J. London Math. Soc. 1, 26–30 (1935).
Article MathSciNet MATH Google Scholar
Franke, D. P., Clarke, J. S., Vandersypen, L. M. K. & Veldhorst, M. Rent’s rule and extensibility in quantum computing. Microprocess. Microsy. 67, 1–7 (2019).
Article Google Scholar
Xue, X. et al. CMOS-based cryogenic control of silicon quantum circuits. Nature 593, 205–210 (2021).
Article ADS Google Scholar
Park, J. S. et al. A fully integrated cryo-CMOS SoC for qubit control in quantum computers capable of state manipulation, readout and high-speed gate pulsing of spin qubits in Intel 22nm FFL FinFET technology. In 2021 IEEE International Solid- State Circuits Conference (ISSCC), vol. 64, 208–210 (2021).
Prabowo, B. et al. A 6-to-8GHz 0.17 mW/qubit cryo-CMOS receiver for multiple spin qubit readout in 40 nm CMOS technology. In 2021 IEEE International Solid- State Circuits Conference (ISSCC), vol. 64, 212–214 (2021).
Ruffino, A. et al. A fully-integrated 40-nm 5-6.5 GHz cryo-CMOS system-on-chip with I/Q receiver and frequency synthesizer for scalable multiplexed readout of quantum dots. In 2021 IEEE International Solid- State Circuits Conference (ISSCC), vol. 64, 210–212 (2021).
Li, G., Ding, Y. & Xie, Y. Tackling the qubit mapping problem for NISQ-era quantum devices. In Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 1001–1014 (2019).
Lao, L. & Browne, D. E. 2qan: A quantum compiler for 2-local qubit hamiltonian simulation algorithms. In Proceedings of the 49th Annual International Symposium on Computer Architecture, 351–365 (2022).
Dalton, K. et al. Variational quantum chemistry requires gate-error probabilities below the fault-tolerance threshold. Preprint at https://arxiv.org/abs/2211.04505 (2022).
Tomaru, T., Yoshimura, C. & Mizuno, H. Surface code for low-density qubit array. Sci. Rep. 12, 12946 (2022).
Article ADS Google Scholar

Download references

Acknowledgements

We thank Michael A. Fogarty for his careful read of the manuscript. This research has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant agreement no. 688539 (http://mos-quito.eu) as well as Grant agreement no. 951852. This research has further received funding through the UKRI Innovate UK grant 48482 (NISQ.OS). M.F.G.Z. acknowledges support from UKRI Future Leaders Fellowship [grant number MR/V023284/1].

Author information

M. F. Gonzalez-Zalba
Present address: Quantum Motion Technologies, London, UK

Authors and Affiliations

Riverlane, Cambridge, UK
O. Crawford & J. R. Cruise
Hitachi Cambridge Laboratory, Cambridge, UK
N. Mertig & M. F. Gonzalez-Zalba

Authors

O. Crawford
View author publications
Search author on:PubMed Google Scholar
J. R. Cruise
View author publications
Search author on:PubMed Google Scholar
N. Mertig
View author publications
Search author on:PubMed Google Scholar
M. F. Gonzalez-Zalba
View author publications
Search author on:PubMed Google Scholar

Contributions

M.F.G.Z. conceived the sparse architecture. O.C. and J.C. performed the compilation analysis. N.M. supervised the architecture-compilation integration. All authors contributed to the writing of the paper.

Corresponding authors

Correspondence to O. Crawford or M. F. Gonzalez-Zalba.

Ethics declarations

Competing interests

M. Fernando Gonzalez Zalba and Hitachi Ltd. are inventors in a relevant patent (EP3975072A1).

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Crawford, O., Cruise, J.R., Mertig, N. et al. Compilation and scaling strategies for a silicon quantum processor with sparse two-dimensional connectivity. npj Quantum Inf 9, 13 (2023). https://doi.org/10.1038/s41534-023-00679-8

Download citation

Received: 10 January 2022
Accepted: 20 January 2023
Published: 22 February 2023
Version of record: 22 February 2023
DOI: https://doi.org/10.1038/s41534-023-00679-8

This article is cited by

Combining multiplexed gate-based readout and isolated CMOS quantum dot arrays
- Pierre Hamonic
- Martin Nurizzo
- Matias Urdampilleta
Nature Communications (2025)
Pipeline quantum processor architecture for silicon spin qubits
- S. M. Patomäki
- M. F. Gonzalez-Zalba
- J. J. L. Morton
npj Quantum Information (2024)
Universal control of a six-qubit quantum processor in silicon
- Stephan G. J. Philips
- Mateusz T. Mądzik
- Lieven M. K. Vandersypen
Nature (2022)