Abstract
Inspired by the challenge of scaling-up existing silicon quantum hardware, we propose a 2d spin-qubit architecture with low compilation overhead. The architecture is based on silicon nanowire split-gate transistors which form 1d chains of spin-qubits and allow the execution of two-qubit operations among neighbors. We introduce a silicon junction which can couple four nanowires into 2d arrangements via spin shuttling and Swap operations. We then propose a modular sparse 2d spin-qubit architecture with unit cells of diagonally-oriented squares with nanowires along the edges and junctions on the corners. Targeting noisy intermediate-scale quantum (NISQ) demonstrators, we show that the proposed architecture allows for compilation strategies which outperform methods for 1d chains, and exhibits favorable scaling properties which enable trading-off compilation overhead and colocation of control electronics within each square by adjusting the nanowire length. An appealing feature of the proposed architecture is its manufacturability using complementary-metal-oxide-semiconductor (CMOS) fabrication processes.
Similar content being viewed by others
Introduction
Qubit connectivity is a primary feature of any quantum computing technology. It represents the architectural arrangement of the qubits within the quantum processor and indicates the number of qubits with which any other qubit can interact. Highly-connected structures are favorable since two-qubit gates between arbitrary qubits require a smaller gate count and hence more complex problems can be solved with lower circuit depth1,2,3,4,5,6,7,8,9. On the other hand, high qubit connectivity comes at the expense of technological complexity. Therefore, scaling while maintaining high connectivity is a challenge being faced by many quantum computing technologies. All-to-all connectivity has been demonstrated by photonic10 as well as ion-trap11,12 qubits, but the distributed nature of these technologies puts serious challenges on the path to scaling. On the other hand, solid-state systems, such as superconducting13 and quantum dot spin qubits14, can exhibit 2d hardware topologies, which could be compactly integrated on a chip. This allows for colocating classical support electronics and offers an alternative path to scaling.
Implementations of 2d grids with nearest neighbor connectivity are desirable as this would allow for fault-tolerant quantum computing via the surface code15 without additional gate overhead. However, scaling 2d nearest neighbor grids in solid-state systems poses substantial contact routing challenges and complicates the integration of classical support electronics in the qubit plane16,17. Quantum-classical integration would require levels of 3d integration unseen to date18 although important progress has been made in that direction19. Therefore, solid-state platforms explore scaling with sparse 2d connectivity20,21,22 and several works have explored optimized methods for running quantum algorithms on sparsely-connected hardware graphs23,24,25.
Particularly for silicon spin qubits, progress has been made in the last few years demonstrating high-fidelity single- and two-qubit gates in SiGe26,27,28,29,30,31 and silicon metal-oxide-semiconductor (MOS) structures32,33, as well as high-fidelity initialization and readout in SiMOS34,35,36,37. Further, the first few-qubit processors38,39 and architecture proposals18,20,40,41,42 are beginning to emerge. Next steps will be focused towards scale-up, for which the use of industrial complementary metal-oxide-semiconductor (CMOS) processes is expected to play an important role17. In particular, modules containing industry-manufactured bilinear arrays of quantum dots (QDs) using split-gate nanowire transistor technology are readily manufactured and their qubit properties are widely being tested in experiments43,44,45,46. These modules should provide a platform to demonstrate a 1d quantum processor in silicon. One-dimensional processors allow running early noisy intermediate-scale quantum (NISQ) algorithms23,47 or demonstrating logical qubits48. How best to scale this technology to allow for optimized operation of quantum algorithms with minimal compilation overhead is therefore an important question.
Here, we present thoughts on how to scale QD chains of 1d split-gate transistors by combining them into 2d arrays with sparse connectivity. For this purpose, we introduce a hardware junction which enables coupling between horizontally and vertically oriented arrays of split-gate transistors via spin shuttling49. Given this junction element, we consider possible 1d and 2d architectures and investigate their suitability for implementing NISQ algorithms with minimal compilation overhead of two-qubit operations. As a result of our consideration, we propose a modular 2d spin-qubit architecture with unit cells consisting of diagonally-oriented squares where nanowires form the edges and junctions the corners of a square. We then investigate compilation strategies for the proposed 2d architecture and demonstrate a square-root scaling of the compilation overhead, which outperforms the linear overhead of 1d devices, not only asymptotically, but also down to its smallest building block which is likely to be investigated first in future experiments. The compilation results further allow for balancing the trade-off between the compilation overhead and creating space in between the qubit modules which can be beneficial for colocation of classical control and readout electronics20,50,51 and alleviating contact routing issues18,41,52,53. Finally, while reducing compilation overhead is mainly useful for NISQ applications, we believe the proposed architecture should be compatible with recent proposals42,48 to operate similar architectures at fault-tolerance. Overall, this should provide a compelling path to scaling, which could be manufactured with industrial CMOS processes.
Results
Topology of the proposed architecture
We start by introducing the hardware topology of the proposed architecture, before describing the detailed embodiment in silicon and discussing compilation strategies in subsequent sections. The core element of the architecture is a bilinear array of QDs using CMOS split-gate nanowire transistors. This gives rise to a 1d arrangement of m qubits along a line (each with an ancilla qubit for readout as will be explained later), in which neighboring qubits can be involved in two-qubit operations as visualized in Fig. 1a, where the m = 4 dots indicate qubits and the black lines indicate possible two-qubit operations among neighbors. An additional building block is provided by a junction element which can join up to four linear segments in a perpendicular manner, as depicted in Fig. 1b. In principle, a two-qubit operation can be executed between any two qubits involved in the junction as indicated by an orange line connecting the qubits. As explained in more detail in the following section, each two-qubit operation across the junction requires 6 additional spin shuttling steps. We consider the regime in which these coherent shuttling steps are fast, invoking negligible overhead as compared to the two-qubit operation. We note that each qubit can only be involved in a single two-qubit operation at a time.
a Linear segment of m = 4 qubits (dots) connected by two-qubit operations (lines), representing split-gate transistors along a nanowire. b Hardware junction joining four linear qubit segments. Orange lines indicate two-qubit operations combined with spin shuttling. c Unit cell of the proposed architecture with m = 4 qubits in each segment. d Hardware graph of the proposed architecture with dx = 2 and dy = 3 unit cells in x and y directions, respectively.
The unit cell of the proposed architecture is then provided by squares, with linear segments along the edges and junctions at the corners of each square, as in Fig. 1c. To form the complete architecture, we join dx squares in one direction and dy in the perpendicular direction. For convenience, we define the two directions as x and y. The complete architecture and x and y directions are show in Fig. 1d. Since each square contains 4m qubits and the device consists of dxdy squares, each device contains
qubits. We note that a crucial feature of the proposed architecture is the tilted orientation of squares at an angle of 45∘ in each unit cell, which minimizes the compilation overhead as will be explained later on.
Architecture embodiment in silicon
In the following, we introduce the embodiment of the proposed architecture in silicon and the corresponding control schemes in more detail. We focus on an implementation based on electron spins hosted in gate-defined QDs, but note that a similar proposal can be described for hole spin qubits. Readers interested only in the compilation techniques are advised to skip ahead.
Architecture embodiment in silicon: split-gate submodules
The principal building block of our proposal is the split-gate transistor45,54,55,56. It consists of an undoped silicon-on-insulator nanowire of height h and width w (typically 10 nm and 60 nm, respectively). The central part of the nanowire is gated by two metallic surface electrodes of length lg (typically 40–60 nm) which are isolated from the channel by the gate oxide (SiO2) of thickness t (typically 6 nm); see Fig. 2a, b. The gate stack is typically formed by 5 nm of TiN followed by 50 nm of polycrystalline silicon. The rectangular cross-section of the channel, as seen in Fig. 2b, is covered by the pair of split gates which are separated by a face-to-face distance Sgg ( ≈ 30 nm) and enable local electrostatic control of the nanowire. At deep cryogenic temperatures, when positive (negative) voltages are applied to the gates, few-electron (hole) QDs form in the top-most corners of the device due to the corner effect57,58,59, as shown in Fig. 2b. Charges can be drawn into the QDs from charge reservoirs formed of highly doped silicon located at each side of the split gate (source and drain in a). The qubit that we consider here is the spin of a single electron (or hole) confined to one of the two corner dots whose quantization axis is defined by placing the structure in a magnetic field. The other spin is used as an ancilla for readout as described later on. Hence a split-gate defines one qubit with a readout ancilla. The architecture can be scaled up by fabricating a series of split-gate transistors placed along the axis of the silicon nanowire, as depicted in Fig. 2c. The tunnel coupling along and across the nanowire is engineered by selecting the physical dimensions of the structure. In particular, w primarily determines the coupling across the nanowire whereas Svv determines the coupling along the nanowire. These values can be tuned globally by applying a potential to the silicon handle wafer (or back gate)60. Fine tuning could be achieved with more advanced gate layouts including a dedicated tunnel coupling gate module above the space between QD gates. The split-gate edge-to-edge separation Svv ( ≈ 40–60 m) is set to enable sizable exchange coupling between spins, which we use to generate two-qubit interactions. Overall, the module results in a bilinear array of QDs. Given that the two QDs of each split-gate transistor encode one qubit—one dot contains a qubit spin and the other an ancilla spin for readout—the structure embodies a one-dimensional chain of silicon spin qubits of length m, as described in the previous section.
a Top view of a single split-gate nanowire transistor, with highly-doped source and drain ohmic contacts in blue, metallic split gates (G1 and G2) in orange, Si3N4 spacer in green and buried oxide in black. b Schematic cross section of the split-gate transistor along the dashed line in a. The gates and silicon nanowire are isolated by the gate oxide of thickness t. The metal gate stack TiN(5 nm)/Polycrystalline silicon (50 nm) is represented in pink. Sgg indicates the edge-to-edge separation of the split gates. The silicon nanowire height and width are h and w, respectively. The purple regions represent schematically the QD wavefunctions. BOX stands for buried oxide. The vertical black arrow represents the direction of the external magnetic field. c An 8 split-gate 1d submodule. Dashed lines highlight a one-qubit split gate subset. Svv indicates the edge-to-edge separation between splits and lg the gate length. d Energy spectrum versus energy detuning of the two-spin system in a tunnel-coupled DQD. Both energy and detuning are normalized by the \(\left\vert \uparrow \downarrow \right\rangle -\left\vert S(20)\right\rangle\) tunnel coupling, ts. The diagram is simulated using a 5% g-factor difference and an average Zeeman energy of three times ts. e–i Schematic of operating 3 qubits in a 2 × 3 QD array formed by 3 split gates as indicated by dashed lines. Initialization e, f, one-qubit gates g, two-qubit exchange interaction h, and readout via Pauli spin blockade i.
Next, we explain control, readout and initialization of the 1d modules in detail. We base our explanation on the energy spectrum of the coupled two-spin system in the single spin basis as a function of the QD energy detuning, ϵ, and a finite magnetic field; see Fig. 2d. At large positive detuning with respect to the (11)-(20) charge hybridization point, the ground state of the system corresponds to the intradot singlet, \(\left\vert S(20)\right\rangle\). Here the (nm) notation refers to the charge distribution among the two QDs, i.e., dots are occupied with n and m charges, respectively. At negative detuning, the ground state of the system is the (11) charge configuration whose spin degeneracy is broken by the external magnetic field. We consider the QDs to have a tunnel coupling energy ts and present different g-factors due to the variability of the Si/SiO2 interface61, which further breaks the degeneracy of the \(\left\vert \uparrow \downarrow \right\rangle\) and \(\left\vert \downarrow \uparrow \right\rangle\) states. Here, the first spin refers to the spin with the lower g-factor (see the Hamiltonian in Supplementary Note 1). Although not strictly necessary for the operation of the processor, we consider that larger Zeeman energy differences exist across the nanowire than along the nanowire, which could be achieved by creating an asymmetry in the gate structure. Using magnetic materials on the gate stack on one side of the split or placing the splits offset with respect to the nanowire axis59 may produce a difference in the Meissner effect at the location of the QDs. We use the energy spectrum to describe processes involving QDs within a split-gate and QDs on the same corner of the nanowire in different splits.
Architecture embodiment in silicon: initialization
We start with the 1d module loaded with one charge in each QD. Charges can be drawn in from reservoirs at the periphery following established methods62. To initialize to a known spin state, QDs in each split gate are positively detuned until the system relaxes to the \(\left\vert S(20)\right\rangle\) state (Fig. 2e). Then, the system is pulsed towards negative detuning adiabatically with respect to the \(\left\vert \uparrow \downarrow \right\rangle\)-\(\left\vert S(20)\right\rangle\) coupling and non-adiabatically with respect to the \(\left\vert \downarrow \downarrow \right\rangle\)-\(\left\vert S(20)\right\rangle\) coupling (not shown in d), to initialize the QDs in the splits in the \(\left\vert \uparrow \downarrow \right\rangle\) state (Fig. 2f).
Architecture embodiment in silicon: single-qubit operations
In the case of electron spins in isotopically purified silicon QDs, spins present long coherence times (\({T}_{2}^{* }\, > \,100\,\mu\)s and T2 = 28 ms)63. This feature enables high control fidelity via electron spin resonance (ESR) techniques64,65 with typical Rabi frequencies of 1 MHz and fidelity in excess of 99%32. Manipulation occurs for QDs on the same side of the nanowire which are now initialized to the \(\left\vert \uparrow \downarrow \right\rangle\) state.
The ESR implementation for the proposed architecture is inspired by Ref. 66. It requires placing the structure inside a broadband 3D microwave cavity designed to deliver an oscillatory magnetic field perpendicular to the external magnetic field at the location of the chip 67,68. The cavity is designed to have a quality factor Q ≤ 100 to cover the range of potential qubit frequencies associated with the typical variations of the g-factor of electrons in MOS devices, Δg/g ≈ 10−2 (Ref. 69.) The variability, associated with spin-orbit coupling, can be minimized by applying the external magnetic field along the [100] axis70. To reduce the number of necessary frequencies to control the system and minimize cross-talk, we make use of the Stark shift. A g-factor modulation of δg/g ≈ 10−3 (Ref. 61) can be obtained by gate voltage tuning and allows lumping the qubit frequencies into N = Δg/δg bins. Then N-1 frequencies are sufficient to cover the full qubit frequency range. Selected qubits are tuned in and out of resonance with the relevant frequency (fESRi) using gate voltage pulses at the qubit gate71. ESR allows two-axis control (X and Y gates) by controlling the duration of the gate voltage pulse and the phase of the microwave excitation.
In the case of hole spin qubits, the larger spin-orbit coupling when compared to electron spins allows for all-electrical control via individual microwave voltage signals applied directly to the relevant QD gates72,73. The spin coherence is relatively lower, with \({T}_{2}^{* }\) values of up to 7.1 μs reported74, but the faster Rabi frequencies, of up to 150 MHz, have enabled X and Y rotation with fidelity in excess of 99%73,75.
Architecture embodiment in silicon: two-qubit operations
We propose engineering two-qubit gates between spins on the same corner of the nanowire by means of the spin–spin exchange interaction (Fig. 2h). The exchange interaction can be modulated electrostatically by applying a differential mode voltage on the two relevant neighboring gates that brings the system close to ϵ = 0; see Fig. 2d, h. In the limit where the differential mode voltage pulse increases the exchange coupling to a value much larger than the Zeeman energy difference between spins, a \(\sqrt{{\rm{S}}{\scriptstyle{\rm{WAP}}}}\) or Swap gate can be implemented by timing the duration and depth of the interaction pulse76,77—see Supplementary Note 2. Alternatively, when the size of the modulation is smaller, such that the exchange coupling strength remains below the Zeeman energy difference, a CPhase gate could be implemented65 with fidelity above fault-tolerant thresholds29,30,31. In this article, we focus on the former gate but note that the Swap gate can be synthesized from a combination of CPhase and single qubit gates.
Architecture embodiment in silicon: readout
Spin readout is based on spin-dependent tunneling from the (11) to the (20) charge configurations, i.e., Pauli spin blockade78. More particularly, the state \(\left\vert \uparrow \downarrow \right\rangle\) is allowed to tunnel to the \(\left\vert S(20)\right\rangle\) state, whereas all remaining two particle spins states are blocked18,79; see Fig. 2d, i. To detect this tunneling process, we suggest using dispersive readout80. In each split, the gate of the ancilla is connected to a lumped-element electrical resonator which is driven at its natural frequency, f0. At ϵ = 0, the readout point, cyclic tunneling between \(\left\vert \uparrow \downarrow \right\rangle\) and \(\left\vert S(20)\right\rangle\), driven by the oscillatory resonator voltage, manifests in an additional quantum capacitance that loads the resonator producing a spin-dependent frequency shift that can be readily detected with standard methods81,82,83,84,85. On the other hand, at the readout point, the \(\left\vert \downarrow \downarrow \right\rangle\) state does not tunnel under the oscillatory voltage of the resonator giving no signal. This difference in polarizability allows discerning the two computational states (\(\left\vert \uparrow \downarrow \right\rangle\) and \(\left\vert \downarrow \downarrow \right\rangle\)). We note that enhanced spin-orbit coupling can promote fast relaxation between the \(\left\vert \downarrow \uparrow \right\rangle\) state and the singlet38,86 but the state is outside our computational basis. Further, enhanced spin-orbit coupling can couple the \(\left\vert \downarrow \downarrow \right\rangle\) and the singlet states leading to a measurable signal at that additional anticrossing87,88. By operating at \(\bar{g}{\mu }_{{{{\rm{B}}}}}B\, > \,{t}_{{{{\rm{s}}}}}\), even in these systems readout can be performed. In this proposal, we have chosen to have a readout ancilla per qubit such as each qubit can be read with one tunnel step only. However, other arrangements may be possible in which ancillas may be used as computational qubits. To reduce the number of resonators required for readout, time-division multiplexing techniques could be implemented in conjunction with floating gate arrangements as previously described52 and laid out in Supplementary note 3.
Architecture embodiment in silicon: junctions
Coupling of silicon nanowire qubits in the second dimension has been proposed via floating gates89 and microwave resonators90. Here, we propose a hardware junction which allows for coupling 1d submodules of split-gate transistors in 2d. The junction consists of etched silicon-on-insulator in quadrangular form, as can be seen in the central region of Fig. 3a. The exact shape and gate layout of the junction can vary to accommodate different levels of interconnection, reduce cross talk, and shuttling bottlenecks (see Supplementary Note 4). Here, we present a square junction that enables connecting up to four submodules at an angle of 90∘, 180∘, or 270∘. On top of the junction, we place a series of metallic gates with the same gate stack as the split-gate transistors. We propose a five-gate structure arranged in cross geometry—one central square gate flanked by four square gates of the same footprint. The characteristic dimensions of the junction are indicated in Fig. 3b, with submodule-to-edge-gate separation in \({x}^{{\prime} }\)(\({y}^{{\prime} }\)) directions Lq (≈40 nm), the gate length in the \({x}^{{\prime} }\)(\({y}^{{\prime} }\)) directions Ll (≈80 nm), and a junction gate-to-gate separation Ld (≈40 nm). The gate length and pitch are in line with the fabrication requirements for the 1d sub-modules. For the junction, we suggest gate contact over active, a feature available in 10 nm node technology. The junction is invariant under rotations by 90∘.
a Schematic diagonal view of the junction connecting two \({x}^{{\prime} }\) and two \({y}^{{\prime} }\) submodules. b Top view of the junction. c Shuttling sequence from \({x}^{{\prime} }\) submodule at the left to \({y}^{{\prime} }\) submodule at the bottom. Arrows indicate three consecutive spin shuttling operations (i–iii), followed by a Swap operation, and another three consecutive spin shuttling operations (v–vii). d Same as c, abstracted. e Hardware topology of the junction. Qubits (dots) of \({x}^{{\prime} }\) and \({y}^{{\prime} }\) submodules are connected via six shuttling and one Swap operations (lines).
The purpose of the junction is to create a shuttling path for electrons49,91 at the edges of the 1d submodules to be moved around the junction in the \({x}^{{\prime} }\) and \({y}^{{\prime} }\) directions on demand using the appropriate gate voltages sequence. These electrons can be moved to be in exchange coupling proximity with the electrons at the edge of another submodule where a two-qubit gate will be performed. We illustrate the operation of the junction using an \({x}^{{\prime} }{y}^{{\prime} }\) coupling example in Fig. 3c: (i) Shuttle the electron under the top-rightmost gate in the left module \({x}^{{\prime} }\), to the left gate in the junction by applying a differential voltage between the two gates49. (ii) Shuttle from the left gate to the central gate of the junction. (iii) Shuttle from the central gate to the bottom gate of the junction. (iv) Implement a two-qubit gate between the electron under the bottom gate in the junction and the electron under the left-topmost gate in the \({y}^{{\prime} }\) module, as described above. (v–vii) Shuttle the electron back following the reverse process. The corresponding abstracted hardware topology and the same shuttling sequence are presented in Fig. 3d. Although in this work we consider two-qubit interactions between spins confined to different types of QDs (a corner QD in a submodule and a planar QD in the junction), the shuttling process above can be modified to produce two-qubit interactions between planar QDs which have been demonstrated29,30,65,76.
As discussed in Supplementary Note 5, we consider the regime in which spin shuttling operations are performed coherently and much faster than the two-qubit operation in step (iv). Any two qubits of a junction can be involved in a two-qubit operation between them and, since shuttle operations are assumed to have negligible overhead, two two-qubit operations can be run in parallel. Overall, this reasoning allows for abstracting the junction element into the simplified hardware graph depicted in Fig. 3e.
Regarding the quality of the shuttling process, it has been argued that conveyor mode shuttling could lead to a phase error of less than 10−3 for typical values of electron spins in MOS QDs (a correlation length on the variation of the spin splitting of 20 nm and homogeneous coherence times, \({T}_{2}^{* }\ge 20\, \mu\)s) as long as the transfer occurs adiabatically with a velocity not less than 0.1 ms−1 (the transfer length here is ≈400 nm)92.
Compilation methods
We now present compilation methods for deploying NISQ-type quantum algorithms on the proposed architecture, following standard approaches. We assume that algorithms are given by a quantum circuit which consists of initialization, followed by a sequence of one- and two-qubit operations, and readout, suitable for execution on a fully-connected device. While most of these operations are readily available on the proposed architecture, the sparse connectivity will prohibit execution of two-qubit gates between arbitrary pairs of qubits that are not directly connected by an edge of the hardware graph. To address this issue, compilation methods re-express a given quantum circuit through an equivalent circuit readily amenable to the sparse hardware topology. More specifically, before executing a given two-qubit gate, one generally applies a sequence of Swap gates to shuttle the relevant qubits along the hardware graph until they reach neighboring positions. Once neighboring positions are reached, the two-qubit gate is executed and the sequence of Swap gates is reverted to promote the relevant qubits back to their original position. In applying this compilation strategy, the resulting quantum circuit accumulates a compilation overhead characterized by the number of additional Swap gates, NSWAP, and the increased circuit depth, Nd. Both quantities represent important quality metrics for the compilation which should be minimized as is possible to avoid additional decoherence and infidelities introduced through the additional Swap gates on NISQ devices. We note that the proposed architecture uses two \(\sqrt{{\rm{S}}{\scriptstyle{\rm{WAP}}}}\) gates to implement the Swap operation.
In what follows, we focus on two major compilation scenarios: (i) The case of moving two arbitrary qubits together along the hardware graph. This covers the general case where each two-qubit operation is addressed individually and usually gives reasonable estimates for the compilation overhead. (ii) The case of rearranging qubits in an arbitrary permutation. This compilation method could also be, for example, useful for variational algorithms93 from chemistry94,95 and finance96 which repeatedly execute some number of up to ⌊N/2⌋ two-qubit operations in parallel. For such algorithms, the compilation must efficiently permute qubits into configurations which allow for executing the relevant two-qubit operations in parallel, even on the sparse hardware topology. To implement such permutations efficiently, one repeatedly applies a layer of Swap operations during which up to ⌊N/2⌋ Swap gates are executed in parallel. Ultimately, this can result in significantly lower circuit depth than simply addressing each two-qubit operation individually. We note that being able to apply such a layer of two-qubit operations is less general than arbitrary rearrangement and thus further reductions may be possible for this specific case.
Finally, we compare our compilation methods to two important limiting cases—a 1d device which could be fabricated by joining nanowire submodules along one dimension, as depicted in Fig. 4a, and a 2d rectangular device consisting of lx by ly qubits with nearest-neighbor connectivity, as depicted in Fig. 4b.
Compilation methods: case I – moving two qubits together along the shortest path
The compilation method for case I (as in, e.g., Ref. 97) requires iterating over all two-qubit operations of a given input circuit and, for each pair of qubits involved in a two-qubit operation, (a) determining the shortest path connecting the two qubits and subsequently (b) executing Swap gates to connect the qubits along the shortest path, executing the two-qubit operation and shuttling qubits back to their original position. Finding the shortest path, i.e, the distance, between a pair of qubits can efficiently be implemented in polynomial time using, e.g., Dijkstra’s algorithm98.
The compilation overhead of this algorithm is determined by the length of the shortest path, l, connecting the pair of qubits involved in each two-qubit operation along the hardware graph. In particular, the overhead of Swap gates accumulated per two-qubit gate is given by NSWAP = l − 1 while the increased circuit depth per two-qubit operation will be given by \({N}_{{{{\rm{d}}}}}=\lceil\frac{l-1}{2}\rceil\), assuming that Swap gates on different pairs of qubits can be applied simultaneously. To compare further the compilation overhead for different devices, we compute the average (\(\bar{l}\)) and maximal (\({l}_{\max }\)) distances between qubit pairs in a given topology. For the linear layout with N qubits, we have
For a rectangular grid of lx by ly qubits, we find that \({\bar{l}}_{{{{\rm{rec}}}}}=\frac{1}{3}({l}_{{{{\rm{x}}}}}+{l}_{{{{\rm{y}}}}})\) and \({l}_{{{{\rm{rec}}}},\max }={l}_{{{{\rm{x}}}}}+{l}_{{{{\rm{y}}}}}-2\) and specifically for the square grid with \({l}_{{{{\rm{x}}}}}={l}_{{{{\rm{y}}}}}=\sqrt{N}\) we have
Finally, considering the proposed architecture for dx = dy, the average and maximal qubit distances are
For details of the derivation, see Supplementary Note 6. For the limiting case \(m=\frac{N}{4}\), we see that
and thus, as expected, the device exhibits linear scaling behaviour. However, its performance is still better than that of the linear device due to the ring-shaped arrangement of qubits.
We visualize the aforementioned compilation overheads expressed via mean and maximal qubit distances in Fig. 5a, b as a function of increasing qubit number. These figures illustrate several important properties, which we comment on in the following. The linear configuration (labeled 1D) clearly exhibits a linear scaling of the compilation overhead, while the rectangular device with nearest neighbor connectivity (labeled 2D) exhibits a square-root scaling. The compilation overhead of the proposed architecture inherits the favorable square-root scaling of 2d hardware topologies, making the proposed architecture favorable over 1d devices. Interestingly, the compilation overhead of the proposed device never exceeds the compilation overhead of the linear architecture, even for the smallest devices dx = dy = 1, 2, 3, …. This is useful as first experimental realizations of the proposed architecture would start from small prototypes. Finally, for increasing sparsity m = 4, 8, 16, …, the compilation overhead of the proposed architecture does increase; however, it never exceeds the overhead of the linear device. This allows for balancing the trade-off between compilation overhead and creating space in between the qubit modules which can be beneficial for colocation of classical support electronics (see Supplementary Note 3).
This is expressed via a average and b maximum shortest path as a function of the number of qubits for different layouts—a (1D) linear layout, a (2D) rectangular device with nearest neighbor connectivity with lx = ly and the proposed architecture in a square arrangement dx = dy with m = 4, 8, and 16. Lines indicate the scaling and dots indicate qubit numbers which can be realized by an actual device.
Compilation methods: case II—rearranging qubits in arbitrary permutations
We now consider the cost of permuting all qubits at once, making use of sorting networks1,2. We begin by recalling qubit permutations on linear and rectangular devices3,4 as these underlie the compilation method for the proposed sparse architecture.
We first consider permutations on 1d linear devices with N qubits using parallel neighbor sort99, as depicted in Fig. 6a. First, each node of the hardware graph is assigned an index v = 1, …, N in ascending order, and each qubit is labeled by its final position. Consecutive layers of Swap operations are then applied in up to N steps. For odd steps 1, 3, …, qubits on node pairs (v, u) ∈ {(1, 2), (3, 4), …} are compared and a Swap operation is applied if the final destination of qubit v is larger than the final destination of qubit u. For even steps 2, 4, …, the same process occurs for qubits in node pairs (v, u) ∈ {(2, 3), (4, 5), …}. With this method, any permutation can be implemented in a maximum of N layers leading to an increase in the circuit depth of
The maximum number of Swap operations needed is
It is clear that it is possible for one or two qubits to be required to move N − 1 places and thus the method here produces a result close to the worst-case minimum. An alternative is to use bubble sort or insertion sort, but the maximum depth increases to 2N − 31,2.
These are shown for a linear and b rectangular devices. a Parallel neighbor sort for a linear device with N = 4 qubits using 4 consecutive layers of Swaps. Qubits (circles) are labeled by final positions as marked by gray numbers. Orange lines indicate qubit pairs which are compared at a given iteration and swapped, if in the wrong order. b Permuting N = 9 qubits (circles) in a square layout. Graph positions are labeled in gray. Labels on qubit circles indicate the final row and column of each qubit. The permutation is implemented by consecutively using parallel neighbor sort along (i) columns, (ii) rows, and (iii) columns, as highlighted by orange lines.
Next, we consider permutations on a rectangular device of N = lxly qubits with nearest neighbor connectivity. We follow the algorithm of Ref. 4, which consists of the following three steps: (i) Rearrange the qubits in each column using parallel neighbor sort such that each row contains exactly one qubit with final destination in each column 1, 2, 3, …, lx. (ii) Rearrange the qubits in each row using parallel neighbor sort such that all qubits are in the correct column. (iii) Rearrange the qubits in each column using parallel neighbor sort such that each qubit is in the correct final location. We note that column and row can also be interchanged in the above steps. An example visualizing the method is shown in Fig. 6b.
Once step (i) provides an arrangement such that each row contains exactly one qubit with final destination in column 1, 2, 3, …, lx, an implementation of steps (ii) and (iii) using parallel neighbor sort is simple. The challenge is to see that an efficient implementation of step (i) is always possible. This was shown in Refs. 3,4 using Hall’s matching theorem100 and will be discussed in more detail in its adaption to the proposed architecture with sparse 2d connectivity below. The overhead of the discussed method originates from consecutively using parallel neighbor sort on (i) columns (ii) rows and (iii) columns. This results in a maximum of
layers of Swap gates and
total Swap gates. Specifically, for the square \({l}_{{{{\rm{y}}}}}={l}_{{{{\rm{x}}}}}=\sqrt{N}\), this gives
and
We now extend the method of Ref. 4 to the proposed architecture with sparse 2d connectivity. The method is based on the definition of generalized rows and columns as depicted in Fig. 7a, b, respectively. In essence, this results in 2dy rows and 2dx columns with 2mdx and 2mdy qubits respectively, where any given row and column share m qubits. Having defined these generalized rows and columns, we note that every qubit can conveniently be labeled by a combination of its column index, ax = 1, …, 2dx, and its vertical position, y. Equivalently, we could choose to label a qubit by its row index, ay and its horizontal position, x. The coordinates x and y can be seen in Fig. 1d. With generalized rows and columns in place, we describe the corresponding compilation algorithm for implementing qubit permutations. Each qubit, in its initial location, carries a label (ax, y), indicating its final position. An arbitrary qubit permutation can then be implemented using the following three steps: (i) Rearrange the qubits in each generalized column using parallel neighbor sort such that each set of qubits with fixed coordinate y contains exactly one qubit with final destination in each column ax = 1, 2, 3, …, 2dx. (ii) Rearrange the qubits in each generalized row using parallel neighbor sort such that all qubits are moved into the correct column according to their column index ax. (iii) Rearrange the qubits in each generalized column using parallel neighbor sort such that each qubit is in the correct y location along the columns. An example visualizing the method is given in Fig. 8. We note again that, once step (i) provides an arrangement such that each set of qubits with vertical position y contains exactly one qubit with final destination in column ax = 1, 2, 3, …, 2dx, an implementation of steps (ii) and (iii) using parallel neighbor sort is straightforward. The challenge is again to see that an efficient implementation of step (i) is always possible. The procedure to achieve this is illustrated in Fig. 8e–g and shall now be explained in more detail. To begin with, a bipartite graph of 4dx nodes is constructed, l ∈ 1, …, 2dx on the left and 2dx nodes r ∈ 1, …, 2dx on the right. Nodes on the left indicate columns in which a qubit is located initially. Nodes on the right indicate columns to which a qubit should be routed. To build the graph, one adds an edge (l, r) to the bipartite graph for each qubit initially located in column l and having final destination in column r. An example is given in Fig. 8e. We note that, since we have 2mdy qubits located in each column initially and since we will have 2mdy qubits with final destination located in each column, the bipartite graph has 2mdy edges incident to each node. Since some qubits may originate and end up in the same column, the bipartite graph can have multiple edges connecting the same nodes.
a–d Qubits (circles) labeled by their final destination (y, ax) in the block are permuted using parallel neighbor sort along generalized (i) columns, (ii) rows, and (iii) columns as indicated by black solid as opposed to gray dashed lines. Gray labels next to the qubits index current qubit positions. e Bipartite graph corresponding to step (i). Nodes on the left and right indicate qubits with origin and final destination in columns 1, 2, 3, and 4 respectively, while edges represent the corresponding qubit. f Same as e with virtual nodes s, t. g Matchings extracted from e. The yth matching determines qubits routed to position y. An edge (l, r) of yth matching indicates that a qubit of column l destined for column r is to be routed to position y.
Next, to determine how qubits should be arranged along columns in step (i), one extracts 2mdy perfect matchings \({{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}\) with y = 1, …, 2mdy from the bipartite graph. A perfect matching \({{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}\) is a set of edges such that each node of the bipartite graph is connected to exactly one edge of the matching. See Fig. 8g for examples showing one possible set of perfect matchings for the bipartite graph in Fig. 8e. Finding 2mdy perfect matchings for the given type of bipartite graph is always possible due to Hall’s matching theorem100. In the bipartite graphs that arise due to our compilation problem, each node has the same number of incident edges, a sufficient condition for Hall’s matching theorem to hold. Matchings can efficiently be found using the Ford-Fulkerson algorithm98. To this end, one attaches virtual nodes s and t to all nodes on the left and right of the bipartite graph, respectively, and determines a minimal network flow from s to t; see Fig. 8f. Finding a minimal flow configuration reveals one matching at a time. Successively removing edges of a matching from the bipartite graph and repeatedly running the Ford-Fulkerson algorithm will reveal all matchings \({{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}\) with y = 1, …, 2mdy, as visualized in Fig. 8g. Finally, to route qubits in step (i), one selects qubits which move to vertical position y by iterating over the edges of the yth matching \((l,r)\in {{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}\). Here, each edge (l, r) signifies that in the generalized column l a qubit destined for the generalized column r should be moved to vertical position y. Since edges in each set of matchings \({{{{\mathcal{M}}}}}_{{{{\rm{y}}}}}\) point to exactly one final destination per vertical position y, the fact that 2mdy matchings exist ensures that step (i) arranges the qubits in each column so that each row contains exactly m qubits with final destination in column 1, 2, 3, …, 2mdx.
We close this section by evaluating the compilation overhead. Considering the succession of parallel neighbor sorts along (i) generalized columns of length 2mdy, (ii) generalized rows of length 2mdx, and (iii) again generalized columns of length 2mdy, the maximum number of required layers of Swap gates is given by
and the maximum total number of Swap gates by
For the specific case dx = dy = d, and using Eq. (1), we have
and
We compare the compilation overhead of the proposed architecture with sparse 2d connectivity to the linear and rectangular hardware graphs in Fig. 9. Again, the linear configuration (labeled 1D) exhibits a linear scaling of the compilation overhead, while the rectangular device with nearest neighbor connectivity (labeled 2D) exhibits a square-root scaling. The compilation overhead of the proposed architecture inherits the favorable square-root scaling of 2d hardware topologies, making the proposed architecture favorable over 1d devices. Interestingly, the compilation overhead of the proposed device never exceeds the compilation overhead of the linear architecture for dx = dy ≥ 2. For device structures with dx = dy = 1, the compilation overhead of the proposed architecture can be reduced by recognizing that the rearrangement of the sparse device can always be handled with a single parallel neighbor sort, by arranging the qubits along a single line. This ensures that the compilation overhead for the proposed architecture never exceeds the compilation overhead of the 1d architecture even for the smallest devices. This is useful as first experimental realizations of the proposed architecture would start from small prototypes. Finally, for increasing sparsity m = 4, 8, 16, …, the compilation overhead of the proposed architecture does increase; however, it again never exceeds the overhead of the linear device. This allows for balancing between compilation overhead and creating space in between the qubit modules which can be beneficial for colocation of classical support electronics.
Lines indicate scaling and dots indicate qubit layout which exist. Crosses indicate the reduced compilation overhead by using a single parallel neighbor sort for devices with dx = dy = 1. Such devices consist of a square of qubits, as seen in Fig. 1c.
Discussion
Inspired by the challenge of scaling up existing silicon quantum hardware, we investigate compilation strategies for sparsely-connected 2d qubit arrangements and propose a spin-qubit architecture with low compilation overhead. Our considerations are inspired by silicon nanowire split-gate transistors which form finite 1d chains of spin-qubits, allowing for the execution of two-qubit operations such as Swap gates among neighbors. Adding to this, we describe a silicon junction which can couple up to four nanowires at one end into 1d or 2d arrangements via spin shuttling and Swap operations. Given these hardware elements, we propose a modular 2d spin-qubit architecture with unit cells consisting of diagonally-oriented squares with nanowires along the edges and junctions at the corners.
The junction geometry opens space between modules to route the gate lines and/or to place cryogenic classical electronics in the quantum processor plane20,50. Fabricating the qubits and the classical control layer using the same technology is appealing because it will facilitate the integration process, improving feedback speeds in error-correction protocols, and offer potential solutions to wiring and layout challenges16,101,102,103,104,105. Integrating classical and quantum devices monolithically, using CMOS processes, enables the quantum processor to profit from the most mature industrial technology for the fabrication of large-scale circuits51. We show that this architecture allows for compilation strategies which inherit the favorable square-root scaling of compilation overhead in 2d structure and outperform the best in class compilation strategy of 1d chains, not only asymptotically, but also down to the minimal structure of a single square. This result shows that scaling silicon nanowires into 2d structures will have benefits early on, even in experimental demonstrations of the smallest prototypes, thus encouraging building the proposed junction element and expanding silicon architectures into 2d arrangements.
Going beyond the proposed architecture for spin qubits, the proposed method of generalized rows and columns can be applied to many other 2d architectures. In some cases, where rows and columns are not obvious, this could be made to work by introducing branches along a column and row, as illustrated in Fig. 10. As branches lead to three-way junctions within a row and/or column the overhead may, of course, be different. Overall, this shows that the method of generalized rows and columns is a useful compilation strategy not only for spin qubits but also many other architectures such as, for example, IBM’s heavy-hex for superconducting qubits.
It is constructed from the linear segments and four-way junctions shown in Fig. 1a, b respectively. The generalized a rows and b columns are shown. Qubits are represented by dots and lines indicate between which qubits a two-qubit gate can be performed. Qubits of the same color are in the same row or column. For this device, the rows and columns have branches, which may increase the overhead of the Case II method.
We wished to present compilation methods for the device for two quite general situations for which we could obtain an upper bound on the number of layers of SWAP gates to allow comparison to other architectures. The compilation strategies presented here act to demonstrate the square-root scaling in overhead due to routing for the sparsely-connected device, and the advantage of using this device over one with a 1d structure. Many other architecture-aware compilation methods exist and show good results [e.g.,5,6,7,8,106]. Circuit re-synthesis9 provides another option. Alternatively, it may be beneficial to consider a compilation method designed with the desired algorithm in mind [e.g.,25,107]. However, the distance between qubits will clearly affect the overhead introduced through such compilation methods, and thus the results presented here provide some indication of their likely performance.
In exploring strategies to reduce the compilation overhead we focus on the utility of the proposed architecture for the NISQ era. In that, future investigations should be directed to exploring the utility of the proposed architecture for fault-tolerant quantum computation. Positive indications towards this long-term goal are provided by recent work considering error correction along a line48 as well as the possibility of adapting the ideas of the spider-web array42 and operating each nanowire with a single-qubit connected in an angled rectangular lattice with highly coherent shuttle operations.
Finally, while our work provides thoughts on how to scale one-dimensional QDs into a sparse two-dimensional array, it leaves fundamental questions of feasibility to future research. Firstly, if operated for near-term applications, how many SWAP and shuttling operations could the architecture tolerate, before the results become inaccurate? Recent work investigating the feasibility of VQE algorithms in the presence of noise indicates the need for two-qubit gate fidelity well above the fault-tolerance threshold108. As such, the proposed architecture, which adds overhead through SWAP and shuttling operations, will likely require even higher two-qubit fidelities to become a scalable solution for NISQ algorithms. Similarly, if operated in a manner like Ref. 42, what fidelity would the individual shuttling operations need to achieve for overall fidelities to be above the fault-tolerance threshold? Recent work investigating surface codes on sparser grid-like architectures109 indicates that this will likely require gate and shuttling fidelities higher than those of the standard surface code, but it remains subject of future investigations to determine these fidelities. Finally, is it feasible to operate the one-dimensional lines of quantum dots as one perpetual qubit in the manner of Ref. 48? If so, would it then be feasible to exchange information between such qubits via the tunneling junctions? All these questions remain the subject of future work.
Data availability
No datasets were generated or analyzed during the current study.
Code availability
No code was developed to obtain the results in this paper.
References
Hirata, Y., Nakanishi, M., Yamashita, S. & Nakashima, Y. An efficient conversion of quantum circuits to a linear nearest neighbor architecture. Quantum Inform. Comp. 11, 142–166 (2011).
Beals, R. et al. Efficient distributed quantum computing. Proc. R. Soc. A: Math., Phys.Eng. Sci. 469, 20120686 (2013).
Brierley, S. Efficient implementation of quantum circuits with limited qubit interactions. Quantum Info. Comput. 17, 1096–1104 (2017).
Steiger, D. S., Häner, T. & Troyer, M. Advantages of a modular high-level quantum programming framework. Microprocess. Microsy. 66, 81–89 (2019).
Zulehner, A., Paler, A. & Wille, R. An efficient methodology for mapping quantum circuits to the IBM QX architectures. IEEE T. Comput. Aid. D. 38, 1226–1236 (2018).
Childs, A. M., Schoute, E. & Unsal, C. M. Circuit Transformations for Quantum Architectures. In van Dam, W. & Mancinska, L. (eds.) 14th Conference on the Theory of Quantum Computation, Communication and Cryptography (TQC 2019), vol. 135 of Leibniz International Proceedings in Informatics (LIPIcs), 3:1–3:24 (Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany, 2019).
Cowtan, A. et al. On the Qubit Routing Problem. In van Dam, W. & Mancinska, L. (eds.) 14th Conference on the Theory of Quantum Computation, Communication and Cryptography (TQC 2019), vol. 135 of Leibniz International Proceedings in Informatics (LIPIcs), 5:1–5:32 (Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany, 2019).
Lao, L., van Someren, H., Ashraf, I. & Almudever, C. G. Timing and resource-aware mapping of quantum circuits to superconducting processors. IEEE T. Comput. Aid. D. 41, 359–371 (2021).
Gheorghiu, V., Huang, J., Li, S. M., Mosca, M. & Mukhopadhyay, P. Reducing the CNOT count for Clifford+ T circuits on NISQ architectures. IEEE T. Comput. Aid. D. https://doi.org/10.1109/TCAD.2022.3213210 (2022).
Zhong, H.-S. et al. Quantum computational advantage using photons. Science 370, 1460–1463 (2020).
Linke, N. M. et al. Experimental comparison of two quantum computing architectures. Proc. Natl Acad. Sci. USA 114, 3305–3310 (2017).
Pino, J. M. et al. Demonstration of the trapped-ion quantum CCD computer architecture. Nature 592, 209–213 (2021).
Arute, F. et al. Quantum supremacy using a programmable superconducting processor. Nature 574, 505–510 (2019).
Hendrickx, N. W. et al. A four-qubit germanium quantum processor. Nature 591, 580 – 585 (2021).
Fowler, A. G., Mariantoni, M., Martinis, J. M. & Cleland, A. N. Surface codes: Towards practical large-scale quantum computation. Phys. Rev. A 86, 32324 (2012).
Charbon, E. et al. Cryo-CMOS for quantum computing. In 2016 IEEE International Electron Devices Meeting (IEDM), 13.5.1–13.5.4 (2016).
Gonzalez-Zalba, M. F. et al. Scaling silicon-based quantum computing using cmos technology. Nat. Electron. 4, 872–884 (2021).
Veldhorst, M., Eenink, H. G. J., Yang, C. H. & Dzurak, A. S. Silicon CMOS architecture for a spin-based quantum computer. Nat. Commun. 8, 1766 (2017).
Ibm unveils breakthrough 127-qubit quantum processor. https://newsroom.ibm.com/2021-11-16-IBM-Unveils-Breakthrough-127-Qubit-Quantum-Processor.
Vandersypen, L. M. K. et al. Interfacing spin qubits in quantum dots and donors–hot, dense, and coherent. NPJ Quantum Inf. 3, 34 (2017).
Buonacorsi, B. et al. Network architecture for a topological quantum computer in silicon. Quantum Sci. Technol. 4, 025003 (2019).
Nation, P., Paik, H., Cross, A. & Nazario, Z. The IBM Quantum heavy hex lattice. https://research.ibm.com/blog/heavy-hex-lattice (2021).
Kivlichan, I. D. et al. Quantum simulation of electronic structure with linear depth and connectivity. Phys. Rev. Lett. 120, 110501 (2018).
Guerreschi, G. G. & Park, J. Two-step approach to scheduling quantum circuits. Quantum Sci. Technol. 3, 045003 (2018).
Holmes, A., Johri, S., Guerreschi, G. G., Clarke, J. S. & Matsuura, A. Y. Impact of qubit connectivity on quantum algorithm performance. Quantum Sci. Technol. 5, 025009 (2020).
Yoneda, J. et al. A quantum-dot spin qubit with coherence limited by charge noise and fidelity higher than 99.9%. Nat. Nanotechnol. 13, 102–106 (2017).
Zajac, D. M. et al. Resonantly driven CNOT gate for electron spins. Science 359, 439–442 (2017).
Watson, T. F. et al. A programmable two-qubit quantum processor in silicon. Nature 555, 633–637 (2018).
Xue, X. et al. Quantum logic with spin qubits crossing the surface code threshold. Nature 601, 343–347 (2022).
Noiri, A. et al. Fast universal quantum gate above the fault-tolerance threshold in silicon. Nature 601, 338–342 (2022).
Mills, A. R. et al. Two-qubit silicon quantum processor with operation fidelity exceeding 99%. Sci. Adv. 8, eabn5130 (2022).
Yang, C. H. et al. Silicon qubit fidelities approaching incoherent noise limits via pulse engineering. Nat. Electron. 2, 151–158 (2019).
Huang, W. et al. Fidelity benchmarks for two-qubit gates in silicon. Nature 569, 532–536 (2019).
Harvey-Collard, P. et al. High-fidelity single-shot readout for a spin qubit via an enhanced latching mechanism. Phys. Rev. X 8, 021046 (2018).
Urdampilleta, M. et al. Gate-based high fidelity spin readout in a CMOS device. Nat. Nanotechnol. 14, 737–741 (2019).
Johnson, M. A. et al. Beating the thermal limit of qubit initialization with a bayesian maxwell’s demon. Phys. Rev. X 12, 041008 (2022).
Oakes, G. et al. Fast high-fidelity single-shot readout of spins in silicon using a single-electron box. Preprint at https://arxiv.org/abs/2203.06608 (2022).
Philips, S. G. J. et al. Universal control of a six-qubit quantum processor in silicon. Nature 609, 919–924 (2022).
Takeda, K., Noiri, A., Nakajima, T., Kobayashi, T. & Tarucha, S. Quantum error correction with silicon spin qubits. Nature 608, 682–686 (2022).
Taylor, J. M. et al. Fault-tolerant architecture for quantum computation using electrically controlled semiconductor spins. Nat. Phys. 1, 177–183 (2005).
Li, R. et al. A crossbar network for silicon quantum dot qubits. Sci. Adv. 4, eaar3960 (2018).
Boter, J. M. et al. Spiderweb array: a sparse spin-qubit array. Phys. Rev. Appl. 18, 024053 (2022).
Betz, A. et al. Reconfigurable quadruple quantum dots in a silicon nanowire transistor. Appl. Phys. Lett. 108, 203108 (2016).
Hutin, L. et al. Gate reflectometry for probing charge and spin states in linear Si MOS split-gate arrays. In 2019 IEEE International Electron Devices Meeting (IEDM), 37–7 (2019).
Ansaloni, F. et al. Single-electron operations in a foundry-fabricated array of quantum dots. Nat. Commun. 11, 6399 (2020).
Chanrion, E. et al. Charge detection in an array of CMOS quantum dots. Phys. Rev. Appl. 14, 024066 (2020).
Cai, Z. Resource estimation for quantum variational simulations of the hubbard model. Phys. Rev. Appl. 14, 014059 (2020).
Jones, C. et al. Logical qubit in a linear array of semiconductor quantum dots. Phys. Rev. X 8, 021058 (2018).
Yoneda, J. et al. Coherent spin qubit transport in silicon. Nat. Commun. 12, 4114 (2021).
Boter, J. M. et al. A sparse spin qubit array with integrated control electronics. In 2019 IEEE International Electron Devices Meeting (IEDM), 31–4 (IEEE, 2019).
Ruffino, A. et al. A cryo-CMOS chip that integrates silicon quantum dots and multiplexed dispersive readout electronics. Nat. Electron. 5.1, 53–59 (2021).
Schaal, S. et al. A CMOS dynamic random access architecture for radio-frequency readout of quantum devices. Nat. Electron. 2, 236–242 (2019).
Pauka, S. J. et al. A cryogenic CMOS chip for generating control signals for multiple qubits. Nat. Electron. 4, 64–70 (2021).
Roche, B. et al. A tunable, dual mode field-effect or single electron transistor. Appl. Phys. Lett. 100, 32103–32107 (2012).
Lundberg, T. et al. Spin quintet in a silicon double quantum dot: Spin blockade and relaxation. Phys. Rev. X 10, 041010 (2020).
Ciriano-Tejel, V. N. et al. Spin readout of a CMOS quantum dot by gate reflectometry and spin-dependent tunneling. PRX Quantum 2, 010353 (2021).
Voisin, B. et al. Few-electron edge-state quantum dots in a silicon nanowire field-effect transistor. Nano Lett. 14, 2094–2098 (2014).
Ibberson, D. J. et al. Electric-field tuning of the valley splitting in silicon corner dots. Appl. Phys. Lett. 113, 53104 (2018).
Ibberson, D. J. et al. Large dispersive interaction between a CMOS double quantum dot and microwave photons. PRX Quantum 2, 020315 (2021).
Bohuslavskyi, H. et al. Reflectometry of charge transitions in a silicon quadruple dot. Preprint at https://arxiv.org/abs/2012.04791 (2020).
Huang, W., Veldhorst, M., Zimmerman, N. M., Dzurak, A. S. & Culcer, D. Electrically driven spin qubit based on valley mixing. Phys. Rev. B 95, 75403 (2017).
Volk, C. et al. Loading a quantum-dot based “Qubyte” register. NPJ Quantum Inf. 5, 29 (2019).
Veldhorst, M. et al. An addressable quantum dot qubit with fault-tolerant control-fidelity. Nat. Nano 9, 981–985 (2014).
Pla, J. J. et al. High-fidelity readout and control of a nuclear spin qubit in silicon. Nature 496, 334–338 (2013).
Veldhorst, M. et al. A two-qubit logic gate in silicon. Nature 526, 410–414 (2015).
Fogarty, M. A. Silicon edge-dot architecture for quantum computing with global control and integrated trimming. Preprint at https://arxiv.org/abs/2208.09172 (2022).
Vahapoglu, E. et al. Single-electron spin resonance in a nanoelectronic device using a global field. Sci. Adv. 7, eabg9158 (2021).
Vahapoglu, E. et al. Coherent control of electron spin qubits in silicon using a global field. NPJ Quantum Inf. 8, 1–6 (2022).
Ferdous, R. et al. Valley dependent anisotropic spin splitting in silicon quantum dots. NPJ Quantum Inf. 4, 26 (2018).
Tanttu, T. et al. Controlling spin-orbit interactions in silicon quantum dots using magnetic field direction. Phys. Rev. X 9, 021028 (2019).
Laucht, A. et al. Electrically controlling single-spin qubits in a continuous microwave field. Sci. Adv. 1, e1500022 (2015).
Maurand, R. et al. A CMOS silicon spin qubit. Nat. Commun. 7, 1–6 (2016).
Camenzind, L. C. et al. A hole spin qubit in a fin field-effect transistor above 4 kelvin. Nat. Electron. 5, 178–183 (2022).
Piot, N. et al. A single hole spin with enhanced coherence in natural silicon. Nat. Nanotechnol. 17.10, 1072–1077 (2022).
Bosco, S., Hetényi, B. & Loss, D. Hole spin qubits in Si FinFETs with fully tunable spin-orbit coupling and sweet spots for charge noise. PRX Quantum 2, 010348 (2021).
Maune, B. M. et al. Coherent singlet-triplet oscillations in a silicon-based double quantum dot. Nature 481, 344–347 (2012).
He, Y. et al. A two-qubit gate between phosphorus donor electrons in silicon. Nature 571, 371–375 (2019).
Ono, K., Austing, D. G., Tokura, Y. & Tarucha, S. Current rectification by pauli exclusion in a weakly coupled double quantum dot system. Science 297, 1313–1317 (2002).
Zhao, R. et al. Single-spin qubits in isotopically enriched silicon at low magnetic field. Nat. Commun. 10, 5500 (2019).
Gonzalez-Zalba, M. F., Barraud, S., Ferguson, A. J. & Betz, A. C. Probing the limits of gate-based charge sensing. Nat. Commun. 6, 6084 (2015).
Mizuta, R., Otxoa, R., Betz, A. & Gonzalez-Zalba, M. Quantum and tunneling capacitance in charge and spin qubits. Phys. Rev. B 95, 045414 (2017).
Pakkiam, P. et al. Single-shot single-gate rf spin readout in silicon. Phys. Rev. X 8, 41032 (2018).
West, A. et al. Gate-based single-shot readout of spins in silicon. Nat. Nanotechnol. 14, 437–441 (2019).
Zheng, G. et al. Rapid gate-based spin read-out in silicon using an on-chip resonator. Nat. Nanotechnol. 14, 742–746 (2019).
Crippa, A. et al. Gate-reflectometry dispersive readout and coherent control of a spin qubit in silicon. Nat. Commun. 10, 2776 (2019).
Seedhouse, A. E. et al. Pauli blockade in silicon quantum dots with spin-orbit control. PRX Quantum 2, 010303 (2021).
Ezzouch, R. et al. Dispersively probed microwave spectroscopy of a silicon hole double quantum dot. Phys. Rev. Appl. 16, 034031 (2021).
Lundberg, T. et al. Non-reciprocal Pauli spin blockade in a silicon double quantum dot. Preprint at https://arxiv.org/abs/2110.09842 (2021).
Duan, J. et al. Remote capacitive sensing in two-dimensional quantum-dot arrays. Nano Lett. 20, 7123–7128 (2020). PMID: 32946244.
Yu, C. X. et al. Strong coupling between a photon and a hole spin in silicon. Preprint at https://arxiv.org/abs/2206.14082 (2022).
Mills, A. R. et al. Shuttling a single charge across a one-dimensional array of silicon quantum dots. Nat. Commun. 10, 1063 (2019).
Langrock, V. et al. Blueprint of a scalable spin qubit shuttle device for coherent mid-range qubit transfer in disordered Si/SiGe/SiO2. Preprint at https://arxiv.org/abs/2202.11793 (2022).
Cerezo, M. et al. Variational quantum algorithms. Nat. Rev. Phys. 3, 625–644 (2021).
Peruzzo, A. et al. A variational eigenvalue solver on a photonic quantum processor. Nat. Commun. 5, 4213 (2014).
Fedorov, D. A., Peng, B., Govind, N. & Alexeev, Y. VQE method: a short survey and recent developments. Mater. Theory 6, 1–21 (2022).
Orús, R., Mugel, S. & Lizaso, E. Quantum computing for finance: overview and prospects. Rev. Phys. 4, 100028 (2019).
Cross, A. W., Bishop, L. S., Sheldon, S., Nation, P. D. & Gambetta, J. M. Validating quantum computers using randomized model circuits. Phys. Rev. A 100, 032328 (2019).
Cormen, T. H., Leiserson, C. E., Rivest, R. L. & Stein, C. Introduction to Algorithms (MIT Press, 2009), 3rd edn.
Habermann, A. N. Parallel neighbor-sort (or the glory of the induction principle) (1972).
Hall, P. On representatives of subsets. J. London Math. Soc. 1, 26–30 (1935).
Franke, D. P., Clarke, J. S., Vandersypen, L. M. K. & Veldhorst, M. Rent’s rule and extensibility in quantum computing. Microprocess. Microsy. 67, 1–7 (2019).
Xue, X. et al. CMOS-based cryogenic control of silicon quantum circuits. Nature 593, 205–210 (2021).
Park, J. S. et al. A fully integrated cryo-CMOS SoC for qubit control in quantum computers capable of state manipulation, readout and high-speed gate pulsing of spin qubits in Intel 22nm FFL FinFET technology. In 2021 IEEE International Solid- State Circuits Conference (ISSCC), vol. 64, 208–210 (2021).
Prabowo, B. et al. A 6-to-8GHz 0.17 mW/qubit cryo-CMOS receiver for multiple spin qubit readout in 40 nm CMOS technology. In 2021 IEEE International Solid- State Circuits Conference (ISSCC), vol. 64, 212–214 (2021).
Ruffino, A. et al. A fully-integrated 40-nm 5-6.5 GHz cryo-CMOS system-on-chip with I/Q receiver and frequency synthesizer for scalable multiplexed readout of quantum dots. In 2021 IEEE International Solid- State Circuits Conference (ISSCC), vol. 64, 210–212 (2021).
Li, G., Ding, Y. & Xie, Y. Tackling the qubit mapping problem for NISQ-era quantum devices. In Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 1001–1014 (2019).
Lao, L. & Browne, D. E. 2qan: A quantum compiler for 2-local qubit hamiltonian simulation algorithms. In Proceedings of the 49th Annual International Symposium on Computer Architecture, 351–365 (2022).
Dalton, K. et al. Variational quantum chemistry requires gate-error probabilities below the fault-tolerance threshold. Preprint at https://arxiv.org/abs/2211.04505 (2022).
Tomaru, T., Yoshimura, C. & Mizuno, H. Surface code for low-density qubit array. Sci. Rep. 12, 12946 (2022).
Acknowledgements
We thank Michael A. Fogarty for his careful read of the manuscript. This research has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant agreement no. 688539 (http://mos-quito.eu) as well as Grant agreement no. 951852. This research has further received funding through the UKRI Innovate UK grant 48482 (NISQ.OS). M.F.G.Z. acknowledges support from UKRI Future Leaders Fellowship [grant number MR/V023284/1].
Author information
Authors and Affiliations
Contributions
M.F.G.Z. conceived the sparse architecture. O.C. and J.C. performed the compilation analysis. N.M. supervised the architecture-compilation integration. All authors contributed to the writing of the paper.
Corresponding authors
Ethics declarations
Competing interests
M. Fernando Gonzalez Zalba and Hitachi Ltd. are inventors in a relevant patent (EP3975072A1).
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Crawford, O., Cruise, J.R., Mertig, N. et al. Compilation and scaling strategies for a silicon quantum processor with sparse two-dimensional connectivity. npj Quantum Inf 9, 13 (2023). https://doi.org/10.1038/s41534-023-00679-8
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41534-023-00679-8
This article is cited by
-
Combining multiplexed gate-based readout and isolated CMOS quantum dot arrays
Nature Communications (2025)
-
Pipeline quantum processor architecture for silicon spin qubits
npj Quantum Information (2024)
-
Universal control of a six-qubit quantum processor in silicon
Nature (2022)












