Measuring error rates of mid-circuit measurements

Hothem, Daniel; Hines, Jordan; Baldwin, Charles; Gresh, Dan; Blume-Kohout, Robin; Proctor, Timothy

doi:10.1038/s41467-025-60923-x

Download PDF

Article
Open access
Published: 01 July 2025

Measuring error rates of mid-circuit measurements

Nature Communications volume 16, Article number: 5761 (2025) Cite this article

1816 Accesses
Metrics details

Subjects

Abstract

High-fidelity mid-circuit measurements, which read out the state of specific qubits in a multiqubit processor without destroying them or disrupting their neighbors, are a critical component for useful quantum computing. They enable fault-tolerant quantum error correction, dynamic circuits, and other paths to solving classically intractable problems. But there are few methods to assess their performance comprehensively. In this work, we address this gap by introducing the first randomized benchmarking protocol that measures the rate at which mid-circuit measurements induce errors in many-qubit circuits. Using this protocol, we detect and eliminate previously undetected measurement-induced crosstalk in a 20-qubit trapped-ion quantum computer. Then, we use the same protocol to measure the rate of measurement-induced crosstalk error on a 27-qubit IBM Q processor, and quantify how much of that error is eliminated by dynamical decoupling.

Measurement-induced entanglement phase transition on a superconducting quantum processor with mid-circuit readout

Article 08 June 2023

Experimental error suppression in Cross-Resonance gates via multi-derivative pulse shaping

Article Open access 02 July 2024

Mechanically induced correlated errors on superconducting qubits with relaxation times exceeding 0.4 ms

Article Open access 10 May 2024

Introduction

Quantum computers promise to solve classically intractable problems^1,2,3,4,5. However, they are unlikely to do so by running quantum programs consisting solely of reversible unitary logic operations (“gates”)^2,6,7. Gates are noisy, and their errors accumulate and obscure the answer. All promising avenues to quantum computational advantage rely critically on mid-circuit measurements (MCMs) to reduce entropy during program execution. The best-known (and most trusted) path is fault-tolerant quantum computing using quantum error correction (QEC)^8,9,10,11,12, but other approaches like dynamic quantum circuits¹³ also rely on high-fidelity MCMs. But whereas errors in reversible gates can be measured and quantified using a wide range of well-tested protocols^{14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29}, relatively few methods for quantifying errors in MCMs are available⁵. Those proposed so far^{30,31,32,33,34,35,36,37,38,39} have limited scope.

Most previous work on characterizing measurement errors has focused on circuit-terminating measurements that can be described by positive operator-valued measures (POVMs)^{28,29,40,41,42,43}. Investigations of MCMs have emphasized tomographic reconstruction of quantum instruments^30,31,32, which enables detailed modeling of one- and two-qubit dynamics but not in situ characterization of errors in large processors. Another approach, certifying that a measurement is near-perfect or possesses certain properties^33,34, can rule out specific errors but does not quantify the experimental impact of errors. Recent papers have probed MCM-induced crosstalk^35,36, explored Pauli noise learning for MCMs^37,38, and assessed single-qubit feedforward operations³⁹, but do not attempt to detect or measure all the errors caused by MCMs in situ in many-qubit circuits.

RB is the most commonly used characterization protocol for measuring gate errors in situ in many-qubit circuits because it mixes all kinds of gate errors together to yield a single, simple error rate^{14,15,16,17,18,19,20,21,22}. But integrating MCMs into this framework is challenging. RB protocols are designed to measure the average fidelity of invertible quantum logic operations (i.e., gates), and most RB protocols execute motion-reversal circuits. A circuit (e.g., a sequence of d Clifford operations) is followed by a circuit that implements its inverse. The combined circuit’s fidelity with the identity operation can be measured easily, and this is the core of standard RB. However, MCMs are inherently irreversible, collapsing the quantum state of the measured qubit(s). Motion-reversal circuits are not possible with MCMs. A different approach is required.

To address this gap, we introduce and demonstrate the first fully scalable protocol for quantifying and measuring the combined rate of all errors in MCM operations on all qubits. Our protocol integrates MCMs into the canonical framework of randomized benchmarking (RB)^{14,15,16,17,18,19,20,21,22}, and models noisy measurement operations as general quantum instruments^30,44. MCM operations are placed naturally within random circuits, making it straightforward to measure the rate at which they cause errors in those circuits. We call the protocol quantum instrument randomized benchmarking (QIRB) because it captures general MCM errors described by quantum instruments and yields an overall rate of MCM errors that can be directly compared to RB-derived gate error rates found throughout the experimental quantum computing literature. In contrast to other recent work^37,38 that isolates and measures errors in specific circuit layers, QIRB reports the average error rate of all MCM+gate layers, as standard multiqubit RB does for gates.

In this work, we demonstrate QIRB’s validity and practical power by using it to probe, understand, and reduce MCM errors in two very different quantum processors. First, we show how we applied QIRB to study MCMs in a 20-qubit Quantinuum system (H1-1) that was previously used to demonstrate high-performance QEC⁴⁵. The results (Fig. 1) revealed unexpectedly high rates of MCM-induced crosstalk errors that other methods failed to detect, and allowed us to deduce their location and cause. After we implemented a targeted intervention—micromotion hiding³⁵ in the system’s auxiliary zones—we ran QIRB again, and found the errors had been eliminated. Second, we show how to use QIRB to demonstrate that dynamical decoupling⁴⁶ reduces MCM-induced errors in 15-qubit layers on IBM Q’s 27-transmon ibmq_algiers system (Fig. 2). These experiments required a protocol that was both scalable (i.e., could characterize MCMs in situ in many-qubit circuits) and generic (i.e., predictably sensitive to all kinds of MCM errors). These same properties enable measuring and predicting the impact of noisy MCMs on fault-tolerant circuits.

**Fig. 1: Quantifying measurement-induced crosstalk in a trapped-ion quantum computer using QIRB.**

**Fig. 2: Quantifying the impact of dynamical decoupling on measurement-induced errors on ibmq_algiers.**

Results

Quantum instrument randomized benchmarking

QIRB is based on a class of random Clifford circuits that contain MCMs (see Methods for full details). These circuits (Fig. 1a, b) integrate MCMs into the Ω-distributed random circuits used in many modern RB protocols^{19,20,21,22,23,24}. Like other modern RB protocols, QIRB is designed to measure the average error rate of a layer in an Ω-distributed random circuit.

QIRB does so by extending the Pauli-tracking techniques introduced in binary RB²² and first developed for direct fidelity estimation^25,47 to circuits with MCMs. Each n-qubit QIRB circuit C containing m MCMs is designed to mimic measuring a random (n + m)-qubit Pauli operator s_C, with the key innovation being that the processor’s state is ideally stabilized by every MCM. When averaged over all depth-d QIRB circuits, the average expectation value of these Pauli measurements decays exponentially in d (under reasonable assumptions). As shown in the Methods section, the decay rate is an average layer error rate (again under reasonable assumptions).

An n-qubit, depth-d QIRB circuit consists of (i) d circuit layers L₁, …, L_d, each sampled from a user-specified distribution Ω, and (ii) additional dressing layers of randomizing single-qubit gates before and after each Ω-distributed layer. Each layer L_i specifies the parallel application of MCMs and Clifford gates to the n qubits. QIRB only uses layers that (ideally) implement a computational basis measurement on k≤n qubits. Both reset-free (non-demolition) MCMs intended to leave a measured qubit in the state $\left\vert k\right\rangle$ conditional on observing k, and reset MCMs intended to re-initialize each measured qubit to $\left\vert 0\right\rangle$, can be used.

An n-qubit QIRB circuit C containing m MCMs produces m + n readout bits. We use Pauli-tracking techniques to divide the 2^m + n possible output strings into two subsets of equal size, called success and fail. Whenever a fail string is observed, it means that at least one error occurred in the circuit’s execution. We quantify the error in a QIRB circuit C with the metric F = (N_success − N_fail)/N²², where N is the number of repetitions of C, and N_success and N_fail are the number of times a bit string from C’s “success” and “fail” sets (respectively) was observed. We define ${\overline{F}}_{d}$ as the average value of F over all depth-d QIRB circuits. The QIRB error rate (r_Ω) is the decay rate of ${\overline{F}}_{d}$ with respect to d.

The QIRB protocol is as follows: For a range of depths d ≥ 0, sample k_d Ω-distributed QIRB circuits of depth d. Run each circuit N ≥ 1 times to compute its F. Average F for all of the sampled depth-d circuits to obtain an estimate of ${\overline{F}}_{d}$, and fit it to ${\overline{F}}_{d}=A{(1-{r}_{\Omega })}^{d}$, where A and r_Ω are fit parameters. A captures initial state preparation and final measurement error, and r_Ω is the estimated QIRB error rate.

Our theoretical analysis of QIRB (see Methods) shows that it satisfies all the expected properties of an RB protocol, and faithfully measures the rate at which MCMs cause errors in random circuits. We show that ${\overline{F}}_{d}$ decays exponentially in circuit depth d (the key property of an RB protocol) for many-qubit QIRB circuits and that the QIRB error rate r_Ω is bounded above and below by constant-factor multiples of the average layer infidelity (ε_Ω): $\frac{3}{4}{\varepsilon }_{\Omega }\le {r}_{\Omega }\le \frac{3}{2}{\varepsilon }_{\Omega }$, under certain assumptions (see the Methods and Supplementary Note 4 for more details). In most cases, r_Ω ≈ ε_Ω.

Trapped-ion experiments

We used QIRB to detect and quantify unexpected MCM crosstalk in Quantinuum’s H1-1, a 20-qubit trapped-ion quantum computer. H1-1’s qubits are encoded in the atomic hyperfine states of ¹⁷¹Yb⁺, with $\left\vert 0\right\rangle$ corresponding to a “dark” state and $\left\vert 1\right\rangle$ corresponding to a “bright” state that fluoresces when probed by a laser resonant with the ${}^{2}{{{{\rm{S}}}}}_{1/2}\left\vert {{{\rm{F}}}}=1\right\rangle {\leftarrow }^{2}{{{{\rm{P}}}}}_{1/2}\left\vert {{{\rm{F}}}}=0\right\rangle$ transition. Throughout the execution of a circuit, ions are stored in interleaved gate and auxiliary zones (Fig. 1g). When an ion is measured, it (and another possibly unmeasured ion) is moved into a gate zone where the detection laser beam is pulsed on. Stray light from the laser beam or the fluorescing ion can interact with the unmeasured ions to cause MCM-induced crosstalk errors.

Prior to this experiment, it was believed that the large physical separation between each gate zone and auxiliary zone would prevent MCM-induced crosstalk errors on the distant ions in the auxiliary zones, and that only the neighboring ion in a gate zone would be affected by stray light. As a result, Quantinuum’s detailed system emulator accounted for local MCM-induced crosstalk errors (within each gate zone), but no long-range crosstalk. However, our QIRB experiments showed this assumption to be false, revealing significant MCM-induced crosstalk errors in the auxiliary zones.

We ran 2-qubit and 6-qubit QIRB experiments on the first two and six qubits (respectively) of H1-1, and we simulated the exact same circuits on Quantinuum’s emulator. Each QIRB experiment comprised 75 circuits, 15 each at depths d = 0, 1, 4, 32, and 128. We used Ω-distributed layers consisting of parallel single-qubit Clifford gates, ${\mathsf{cnot}}$ gates, and MCMs. The Ω-distributed layers were sampled to contain a single MCM with probability ${p}_{{\mathsf{MCM}}}=0.35$ and a single ${\mathsf{cnot}}$ with probability ${p}_{{\mathsf{cnot}}}=0.2$. Other experiments with different ${p}_{{\mathsf{cnot}}}$ and ${p}_{{\mathsf{MCM}}}$ showed consistent results (see Supplementary Note 7) and were used to estimate the MCM error rate (see Methods). We ran (and simulated) N = 100 shots of each circuit to estimate F. All experimentally estimated quantities are reported with 1σ error bars computed via a bootstrap.

The 2-qubit QIRB experiments yielded results (r_Ω = 0.16 ± 0.02 %) consistent with emulator simulations (r_Ω = 0.18 ± 0.02 %). However, the 6-qubit experimental and emulated results showed a significant and unexpected discrepancy (Fig. 1d). The emulator predicted r_Ω = 0.17 ± 0.02 % for 6-qubit QIRB, consistent with the 2-qubit results, while the 6-qubit experiment revealed almost twice as much error (r_Ω = 0.35 ± 0.03 %). Because the QIRB error rate contains contributions from both gate errors and MCM errors, we used the analysis methods of ref. ⁴⁸ to isolate the MCM error rate (ε_MCM). We did so by fitting an error model with local depolarizing gate errors and bit-flip measurement errors to the data (see Supplementary Note 5 for more details). The discrepancy between the 6-qubit experimental and emulated ε_MCM was even larger (Fig. 1e). Consistency between experimental and emulated 2-qubit QIRB results, and between emulated 2-qubit and 6-qubit QIRB results, indicated that the emulator was accurately modeling interactions between neighboring qubits in gate zones. This suggested that the significant discrepancy in the 6-qubit results was due to long-range crosstalk errors, which are not modeled in the emulator.

We hypothesized that these errors were caused by stray light emanating from the detection beam used to measure ions or fluorescence from the measured ions in a circuit layer. To verify our hypothesis, we ran a bright-state depumping experiment³⁵ to independently quantify the strength of MCM-induced crosstalk errors within the auxiliary and gate zones (see Supplementary Note 7). These experiments confirmed our hypothesis (Fig. 1f).

To definitively confirm long-range MCM-induced crosstalk as the cause, we implemented micromotion hiding³⁵ on distant ions during mid-circuit measurements, and then ran 6-qubit QIRB again to measure its efficacy. Micromotion hiding reduces an unmeasured ion’s sensitivity to scattered light by displacing the ion from the RF null, causing micromotion oscillations that Doppler-broaden the resonant detection light. This partially protects the unmeasured ion from stray photons emitted by the detection beam or a fluorescing measured ion during the measurement process. In the initial experiments, micromotion hiding was only applied within each gate zone, not to ions in the auxiliary zones.

When micromotion hiding was applied to ions in the auxiliary zone, the experimental 6-qubit QIRB r_Ω dropped from 0.35 ± 0.03 % to 0.18 ± 0.03 % (Fig. 1d). This value is consistent with the 0.17 ± 0.02 % obtained from the emulator. The estimated ${\varepsilon }_{{\mathsf{MCM}}}$ dropped from 0.63 ± 0.09 % to 0.29 ± 0.06 % (Fig. 1e), which is consistent with the 0.23 ± 0.09 % MCM error rate obtained from the emulator. These 6-qubit experimental error rates are also consistent with those observed in 2-qubit experiments and simulations. Finally, we confirmed the physical mechanism of the reduced error rates by repeating the bright-state depumping experiment with the extra micromotion hiding in the auxiliary zones. We observed a substantial drop in the unmeasured ions’ sensitivity to scattered light (see Fig. 1f).

Quantifying MCM crosstalk mitigation on ibmq_algiers

Next, we used QIRB to probe the nature of MCM errors, and measure the positive impact of dynamical decoupling, on an IBM Q processor (ibmq_algiers). For this transmon-based processor, we had neither a high-fidelity emulator nor a detailed theoretical physics model for MCM errors, but its faster clock speed allowed for more detailed experimental exploration. We ran 5-, 10-, and 15-qubit QIRB experiments on ibmq_algiers, with various values of ${p}_{{\mathsf{cnot}}}$ and ${p}_{{\mathsf{MCM}}}$. We focus here on experiments with ${p}_{{\mathsf{cnot}}}=0.3$ and ${p}_{{\mathsf{MCM}}}=0.3$. Other experiment designs are described in the Methods, and experimental results are located in Supplementary Note 6.

To quantify the strength of crosstalk errors, we compared experimental QIRB results – the estimated r_Ω and error rates for one-qubit gates, two-qubit gates, and MCMs (ε_1Q, ε_2Q, and ${\varepsilon }_{{\mathsf{MCM}}}$, respectively)—to predictions made using IBM’s calibration data. IBM’s calibration data is gathered by running two-qubit RB, simultaneous one-qubit RB, and readout assignment error experiments. These protocols are not directly sensitive to MCM crosstalk errors, so discrepancies between observed and predicted quantities can be ascribed to MCM crosstalk errors.

We observed significant gate- and MCM-induced crosstalk errors in ibmq_algiers (Fig. 2a). The 5, 10, and 15-qubit QIRB experiments yielded an experimental r_Ω that was 4–5 × larger than simulations using the calibration data (see Fig. 2c). Comparing observed and predicted ε_MCM indicates that most of this discrepancy is due to measurement-induced crosstalk. The discrepancy between the observed and predicted ε_MCM increases with n (number of qubits), and for n = 15, ε_MCM is 26 × larger than predicted.

MCM-induced errors in transmons can have multiple causes. One mechanism is idling errors on unmeasured qubits. MCMs have a longer duration than one- and two-qubit gates, so unmeasured qubits have more time to decohere and dephase. This can be mitigated by performing dynamical decoupling (DD)⁴⁶ on the idling qubits, echoing away some of the idling errors. This approach is common and well-motivated, but its efficacy has not been systematically quantified in situ (i.e., in the context of many-qubit circuits).

We ran QIRB circuits into which we introduced XX dynamical decoupling sequences⁴⁹, and observed a significant reduction in the average layer error rate (Fig. 2c). The observed r_Ω decreased by ~22% at each circuit width, and the estimated ${\varepsilon }_{{\mathsf{MCM}}}$ decreased by as much as 38% (for n = 15 qubits). This suggests that a large fraction of MCM-induced crosstalk errors are caused by structured T2 idling errors on unmeasured qubits, and confirms that dynamical decoupling on idle qubits can reduce MCM-induced crosstalk significantly. However, the large residual discrepancy between predicted and observed error rates indicates that the majority of MCM-induced crosstalk errors are not mitigable in this way.

Fully characterizing the source of these errors is beyond QIRB; however, we can use the QIRB error rate to test hypotheses about the source of the remaining error. While XX dynamical decoupling sequences echo away structured T2 errors, they do not mitigate T1 decay and unstructured T2 errors. A simple error model based on the IBM calibration data, along with assuming only T1 decay errors on the idling qubits and bit-flip errors on the measured qubits, predicts QIRB error rates of 3.78, 7.17, and 10.2% for the 5-, 10-, and 15-qubit experiments, respectively. This suggests that, while some of the remaining MCM-induced errors are due to additional T1 idling errors, a substantial portion of the error remains uncharacterized.

Discussion

RB is ubiquitous precisely because it is generic. RB protocols mix together all the errors in all of a system’s gates to produce a single exponential decay and a single error rate that can be easily monitored, reported, and compared. But until recently, motion-reversal circuits—concatenating a random circuit with another circuit that inverts it—was the only way to achieve this simplicity. QIRB builds on the recent innovations in binary RB (which sidestepped motion reversal) to extend the simplicity and genericity of RB to MCMs. As our deployment here illustrates, QIRB scales seamlessly to many qubits, and can easily measure error rates below 10⁻³. It enables in situ characterization of MCMs in processors designed and optimized to implement quantum error correction (QEC), paving the way for full, simultaneous characterization of errors in all the key operations required for fault-tolerant QEC (one-qubit gates, two-qubit gates, and MCMs). We anticipate the techniques introduced here can be used to extend other RB protocols, especially those that measure leakage^50,51,52,53, to work with MCMs.

Methods

QIRB circuits

Conceptually, QIRB works by extending the Pauli-tracking approach underlying both binary RB and direct fidelity estimation to circuits with mid-circuit measurements (see Supplementary Note 1). An n-qubit QIRB circuit is designed so that, after each circuit layer, the n qubits are (ideally) always in the + 1 eigenspace of a particular n-qubit Pauli operator. To ensure this, each qubit that gets measured is immediately rotated to a + 1 eigenstate of a uniformly random single-qubit Pauli. Moreover, before each measurement, single-qubit gates are applied so that the measurement results can be compiled to determine if an error has occurred in the circuit. Thus, a QIRB circuit $\tilde{C}$ mimics preparing the eigenstate of an (n + m)-qubit Pauli operator and performing an (n + m)-qubit Pauli measurement ${s}_{\tilde{C}}$. The success metric for $\tilde{C}$ is the expectation of its associated Pauli measurement, $\langle {s}_{\tilde{C}}\rangle$.

The core component of a depth-d QIRB circuit is a sequence of d “dressed” n-qubit circuit layers, each composed of three sublayers, $\tilde{L}={l}_{3}{l}_{2}{l}_{1}$. The middle layer l₂ is an Ω-distributed layer and may contain MCMs. We often refer to l₂ as L. Both l₁ and l₃ are layers of parallel single-qubit gates and/or idles. The single-qubit gates in l₁ are chosen to rotate the processor’s state so that it is (ideally) an eigenstate of the n-qubit Pauli operator ${s}^{{{{\rm{pre}}}}}={s}_{{{{\rm{meas}}}}}^{{{{\rm{pre}}}}}\otimes {s}_{{{{\rm{unmeas}}}}}^{{{{\rm{pre}}}}}$, where (i) ${s}_{{{{\rm{unmeas}}}}}^{{{{\rm{pre}}}}}$ has support on the unmeasured qubits, (ii) ${s}_{{{{\rm{meas}}}}}^{{{{\rm{pre}}}}}$ has support on the measured qubits, and (iii) ${s}_{{{{\rm{meas}}}}}^{{{{\rm{pre}}}}}$ is the tensor-product of Pauli-Z or identity operators (i.e., a Z-type Pauli) (see Fig. 3). The gates in l₃ are selected so that the k qubits measured in l₂ end up in the +1-tensor-product eigenstate of a uniformly randomly sampled k-qubit Pauli operator (see Fig. 3). When “reset” measurements are used, the l₃ layer is constructed to achieve this result assuming the measured qubits are in ${\left\vert 0\right\rangle }^{\otimes k}$; when “reset-free” measurements are used, l₃ involves either X_π gates conditioned on the measurement results or a sign correction in post processing.

**Fig. 3: A “dressed” QIRB circuit layer.**

Formally, the construction of a depth-d QIRB circuit $\tilde{C}$ is a sequential process that begins by randomly sampling a depth-d Ω-distributed circuit C = L_d ⋯ L₁. Then, with m being the number of MCMs in C, we sample a uniformly random (n + m)-qubit Pauli operator

$$s=s(1)\otimes \cdots \otimes s(n+m)$$

(1)

and a uniform random + 1-tensor-product eigenstate of s

$$\left\vert \psi (s)\right\rangle=\left\vert \psi {(s)}_{1}\right\rangle \otimes \cdots \otimes \left\vert \psi {(s)}_{n+m}\right\rangle,$$

(2)

where $\left\vert \psi {(s)}_{i}\right\rangle$ is stabilized by s(i). The first layer ${\tilde{L}}_{0}$ in $\tilde{C}$ is a layer of single-qubit gates that locally prepares the first n-entries in $\left\vert \psi (s)\right\rangle$. Hence, in the ideal case, the state of the processor following ${\tilde{L}}_{0}$, denoted ${\left\vert \psi (s)\right\rangle }_{0}$ is stabilized by s₀: = s(1) ⊗ ⋯ ⊗ s(n).

The d dressed layers ${\{{\tilde{L}}_{i}\}}_{i=1}^{d}$ are constructed in an iterative fashion. In what follows, U(l) denotes the unitary map induced by the circuit layer l on any unmeasured qubits in l, and k_i denotes the number of measured qubits in the L_i. The first dressed layer ${\tilde{L}}_{1}$ is constructed as follows:

Set l_1,1 to be a layer of single-qubit gates such that $U({l}_{1,1}){s}_{0}U{({l}_{1,1})}^{-1}={s}_{0{{{\rm{,meas}}}}}^{{{{\rm{pre}}}}}\otimes {s}_{0{{{\rm{,unmeas}}}}}^{{{{\rm{pre}}}}}$ is a Z-type Pauli on the k₁ measured qubits in L₁.
Set l_1,2 = L₁, the “core” Ω-distributed circuit layer. Define ${s}_{0}^{{{{\rm{post}}}}}=U({l}_{1,2}){s}_{0,{{{\rm{unmeas}}}}}^{{{{\rm{pre}}}}}U{({l}_{1,2})}^{-1}\otimes {s}_{0,{{{\rm{meas}}}}}^{{{{\rm{post}}}}}$, where ${s}_{0,{{{\rm{meas}}}}}^{{{{\rm{post}}}}}={s}_{0,{{{\rm{meas}}}}}^{{{{\rm{pre}}}}}$.
Set l_1,3 to be a layer of single-qubit gates such that the measured qubits are re-prepared in a + 1 tensor-product eigenstate of s_1,meas = s(n + 1) ⊗ ⋯ ⊗ s(n + k₁).

In the ideal case, the state of the processor after ${\tilde{L}}_{1}$ should be stabilized by the Pauli,

$${s}_{1}:=U({\tilde{L}}_{1}){s}_{0,{{{\rm{unmeas}}}}}U{({\tilde{L}}_{1})}^{-1}\otimes s(n+1)\otimes \cdots s(n+{k}_{1}).$$

(3)

The remaining dressed layers are constructed similarly using s_i−1 in place of s₀, and the final layer ${\tilde{L}}_{d+1}$ is constructed to transform s_d into a Z-type Pauli operator, s_d+1. The above description leaves some freedom in the exact choice for each l_i,1 and l_i,3 and, in practice, we leverage this freedom to populate l_i,1 and l_i,3 with random single-qubit gates on any unmeasured qubits in order to implement randomized compilation for subsystem measurements⁵⁴ (see Supplementary Note 3).

Overall, $\tilde{C}$ is constructed to mimic measuring the Pauli operator

$${s}_{\tilde{C}}:={s}_{0{{{\rm{,meas}}}}}^{{{{\rm{pre}}}}}\otimes \cdots \otimes {s}_{d+1},$$

(4)

and we record the expectation of this Pauli operator $\langle {s}_{\tilde{C}}\rangle$. In practice, this is accomplished by concatenating the results from each MCM and each final computational basis state measurement into a bit string b = (b₁, …, b_n+m), and then returning 1 (“success”) if the corresponding computational basis state $\left\vert b\right\rangle$ is stabilized by ${s}_{\tilde{C}}$ and −1 (“failure”) otherwise.

QIRB theory

The QIRB protocol exhibits all of the expected properties of an RB protocol. Which is to say that, under reasonable assumptions about the noise, the average success rate of QIRB circuits (${\overline{F}}_{d}$) decays exponentially in circuit depth, at a decay rate that is directly related to the rate at which MCMs and gates cause errors in random circuits. Moreover, this decay rate—or the QIRB error rate (r_Ω)—is bounded above and below by constant-factor multiples of the average layer infidelity.

Outside of our description of uniform stochastic instruments, we will assume that n ≫ 1 and that s_i is uniformly random for each layer i = 1, …, d in a QIRB circuit. Under standard assumptions on the noise model (e.g., Pauli stochastic errors), this second assumption is approximately true as long as Ω contains layers with sufficiently many entangling gates (it breaks down, e.g., when n = 1 as non-trivial single-qubit Paulis cannot be rotated into the identity by single-qubit Clifford gates).

Uniform stochastic instruments

To analyze the QIRB protocol, we will model noisy circuit layers as uniform stochastic instruments⁵⁴. This is a specific ansatz generalizing the Pauli stochastic noise model to quantum instruments, and a processor’s native layers may display errors that are not consistent with it—but any layer L’s error can be transformed into uniform stochastic noise using randomized compiling (RC) for subsystem measurements⁵⁴. RC for subsystem measurements can be implemented within QIRB (see the section on QIRB circuits); it is not required for the protocol, but is required to guarantee the validity of our theory describing it. As uniform stochastic instruments describe “reset-free” measurements, we introduce a refinement of the general uniform stochastic instrument model to describe “reset” measurements (see Supplementary Note 4).

A uniform stochastic instrument describes errors, in an n-qubit, k-measurement layer, that are the composition of a pre-measurement Pauli X error X^⊗a on the measured qubits (specified by the k-bit string $a\in {{\mathbb{Z}}}_{2}^{k}$), with a post-measurement error that is the tensor-product of a generic Pauli error P on the unmeasured qubits and a bit-flip error X^⊗b on the measured qubits. Here, X^⊗a denotes the Pauli operator ${\otimes }_{i=1}^{k}{X}^{{a}_{i}}$.

We denote each possible error by its corresponding tuple (a, P, b) and the probability of (a, P, b) occurring in L by p_L(a, P, b). The process (entanglement) fidelity of a uniform stochastic instrument equals the probability of no error occurring⁵⁵:

$${{{\mathcal{F}}}}(L)=1-{\sum}_{(a,P,b)\ne (0,{\mathbb{I}},0)}{p}_{L}(a,P,b).$$

(5)

The Ω-averaged layer infidelity is defined as

$${\varepsilon }_{\Omega }:=1-{{\mathbb{E}}}_{L\in \Omega }[{{{\mathcal{F}}}}(L)].$$

(6)

In the absence of MCMs, ε_Ω reduces to the same quantity that is (approximately) measured by most modern RB protocols including direct RB¹⁹, mirror RB²⁰, binary RB²² and cross-entropy benchmarking⁵⁶.

${\overline{F}}_{d}$ decays exponentially

To show that ${\overline{F}}_{d}$ decays exponentially, we need to calculate the probability Pr_d(S) of recording + 1 (“success”) and the probability Pr_d(F) of recording −1 (“failure”) when executing a random depth-d QIRB circuit. Ignoring end-of-circuit measurement error, we can calculate Pr_d(S) and Pr_d(F) recursively:

$${\Pr }_{d}(S)={\Pr }_{d-1}(S)(1-{p}^{{{{\rm{trans}}}}})+{\Pr }_{d-1}(F){p}^{{{{\rm{trans}}}}}$$

(7)

$${\Pr }_{d}(F)={\Pr }_{d-1}(F)(1-{p}^{{{{\rm{trans}}}}})+{\Pr }_{d-1}(S){p}^{{{{\rm{trans}}}}},$$

(8)

where p^trans is the probability of errors in a random Ω-distributed layer flipping a stabilizer state of a pre-layer Pauli into an anti-stabilizer state of the post-measurement Pauli. Equivalently, we can compute Pr_d(S) and Pr_d(F) by considering a two-state, discrete-time stochastic process with states {S, F} and transition matrix

$$M=\left(\begin{array}{rc}1-{p}^{{{{\rm{trans}}}}}&{p}^{{{{\rm{trans}}}}}\\ {p}^{{{{\rm{trans}}}}}&1-{p}^{{{{\rm{trans}}}}}\end{array}\right).$$

(9)

If A is the probability of preparing in the S (“success”) state and B the probability of preparing in the F (“failure”) state, then

$${\overline{F}}_{d}: \! \!={\Pr }_{d}(S)-{\Pr }_{d}(F)$$

(10)

$$=(A-B){(1-2{p}^{{{{\rm{trans}}}}})}^{d}.$$

(11)

The equality of Eq. (10) and Eq. (11) follows from diagonalizing M and noting that its eigenvalues are 1 and 1 − 2p^trans.

So far, we have ignored end-of-circuit measurement errors. These errors contribute a depth-independent cofactor as they modify the rate at which the S and F states are recorded in a depth-independent fashion. Hence,

$${\overline{F}}_{d}\propto {(1-2{p}^{{{{\rm{trans}}}}})}^{d}.$$

(12)

This result shows that ${\overline{F}}_{d}$ decays exponentially and that r_Ω is well-defined, r_Ω = 2p^trans. Moreover, r_Ω represents twice the rate at which errors flip stabilizer states to anti-stabilizer states in a circuit layer, a generalization of the interpretation of entanglement fidelity as twice the rate at which errors in a circuit layer anti-commute with a random Pauli. This interpretation allows us to compute r_Ω for a given error model, and to relate r_Ω to the average layer infidelity of that error model.

Interpreting r _Ω

Because r_Ω is twice the rate at which errors flip stabilizer states to anti-stabilizer states in a circuit layer, it has the following form:

$${r}_{\Omega }={{\mathbb{E}}}_{L\in \Omega }\left[{\sum}_{a,P,b}{\lambda }_{L,a,P,b}\right],$$

(13)

where λ_L,a,P,b is twice the rate at which the error (a, P, b) occurring in a layer L causes a state transition. We now argue that λ_L,a,P,b depends upon: (i) the probability p_L(a, P, b) at which (a, P, b) occurs in L; (ii) the probability of X^⊗a anti-commuting with a random ${s}_{{{{\rm{meas}}}}}^{{{{\rm{pre}}}}}$; (iii) the probability of X^⊗b anti-commuting with a random ${s}_{{{{\rm{meas}}}}}^{{{{\rm{post}}}}}$; and (iv) whether $P={\mathbb{I}}$. Table 1 summarizes an error (a, P, b)’s contribution to r_Ω.

Table 1 Individual error contributions for QIRB

Full size table

An error (a, P, b) causes a state transition when one (but not both) of the following occurs: (I) the pre-measurement error causes the reported state to be anti-stabilized by the pre-measurement Pauli instead of stabilized and (II) we prepare a post-measurement state that is anti-stabilized by the post-measurement Pauli instead of stabilized. Event (I) occurs when X^⊗a anti-commutes with ${s}_{{{{\rm{meas}}}}}^{{{{\rm{pre}}}}}$. Event (II) occurs whenever X^⊗b anti-commutes with ${s}_{{{{\rm{meas}}}}}^{{{{\rm{post}}}}}$ (exclusive) or P anti-commutes with ${s}_{{{{\rm{unmeas}}}}}^{{{{\rm{pre}}}}}$. Averaging over all possible pre- and post-measurement Paulis, either event (I) or (II) occurs with probability 1/2 when $P={\mathbb{I}}$. Otherwise, they occur with probability p_anti(a) + p_anti(b) − 2p_anti(a)p_anti(b). The values in Table 1 come from multiplying these results by 2p_L(a, p, b).

Bounding r _Ω

We now prove the following bound on r_Ω,

$$\frac{3{\varepsilon }_{\Omega }}{4}\le {r}_{\Omega }\le \frac{3{\varepsilon }_{\Omega }}{2}.$$

(14)

This bound comes from analyzing the term

$${p}_{{{{\rm{anti}}}}}(a)+{p}_{{{{\rm{anti}}}}}(b)-2{p}_{{{{\rm{anti}}}}}(a){p}_{{{{\rm{anti}}}}}(b)$$

in the $P={\mathbb{I}}$ entry of Table 1 for all possible values of a and b. The possible values of this term form a discrete set on the two-dimensional surface in ${{\mathbb{R}}}^{3}$ defined by the equation x + y − 2xy. The grid points are determined by the Hamming weight of a and b. As shown in Supplementary Note 4, this set has a minimum value of 3/8 when the sum of the Hamming weights of a and b is 2, and a maximum value of 3/4 when the sum of the Hamming weights is 1. This completes the proof.

In practice, this bound is pessimistic. Closer analysis of p_anti(a) + p_anti(b) − 2p_anti(a)p_anti(b) shows that it rapidly approaches 1/2 as the Hamming weights of a and b increase. Thus, most errors contribute roughly p_L(a, P, b)/2 to p^trans and r_Ω ≈ ε_Ω.

Simulations

We used simulations to validate the predictions of our QIRB theory: ${\overline{F}}_{d}$ decays exponentially at a rate r_Ω predicted by our theory. To do so, we simulated QIRB on simulated noisy 2, 4, and 6-qubit quantum computers with native single-qubit Cliffords, ${\mathsf{cnot}}$ gates with all-to-all connectivity, and MCMs with post-measurement reset. We modeled gate errors as local depolarizing channels and noisy measurements as perfect computational basis measurements preceded by a bit-flip error channel. We used the same layer sampling rules as in our trapped-ion and IBMQ experiments, with ${p}_{{\mathsf{cnot}}}$ values of 0.2, 0.35, 0.5 and ${p}_{{\mathsf{MCM}}}$ values of 0.01, 0.02, 0.05, 0.10, 0.25, 0.5. See Supplementary Note 5 for additional details on the noise model, experiment designs, and full results.

Figure 4 summarizes the results from our simulations. The data shows close agreement with our theory: the decays are exponential, r_Ω is well predicted by our theory, and our extracted error rates agree with those used to generate the data. These results confirm QIRB’s validity in simulation.

**Fig. 4: Simulation of QIRB on few-qubit QPUs with a simple error model.**

Trapped-ion experimental details

We conducted QIRB experiments on Quantinuum’s H1-1, utilizing the first 2 and 6 qubits of the device, both with and without micromotion hiding (MMH) in the auxiliary zones. Using the layer rules described in the main text, we constructed QIRB experiment designs using ${p}_{{\mathsf{cnot}}}$ equal to 0.2 and 0.35, and ${p}_{{\mathsf{MCM}}}$ of 0.2 and 0.3. For each experiment, we generated a total of 15 circuits at the depths d = 0, 1, 4, 32, and 128, resulting in 75 circuits per experiment, with each circuit executed 100 times. We also simulated each experiment on Quantinuum’s emulator. We employed the fitting procedure outlined in Supplementary Note 5 to extract estimated one-qubit gate, two-qubit gate, and MCM error rates from both the experiments and simulations. Additionally, we generated 100 bootstrapped samples to assess the uncertainty of and 30 bootstrapped samples to estimate the uncertainty in the extracted error rates. Results from each experiment are detailed in Supplementary Note 7.

IBM Q demonstration details

We conducted QIRB demonstrations on the first 5, 10, and 15 qubits in ibmq_algiers. The layer rules applied in our ibmq_algiers demonstrations were consistent with those used in our trapped-ion experiments, with the exception that we limited the placement of ${\mathsf{cnot}}$ gates to the edges of ibmq_algiers’ connectivity graph. We used ${p}_{{\mathsf{cnot}}}$ values of 0.2, 0.35, and 0.5 and ${p}_{{\mathsf{MCM}}}$ values of 0.2, 0.3, and 0.4. Results from all of these experiments were used to estimate one-qubit, two-qubit, and MCM error rates using an error rate model (see Supplementary Note 5). The QIRB error rates for each demonstration can be found in Supplementary Table 1, along with bootstrapped standard deviations derived from 30 bootstrapped samples.

Data availability

The data generated in this study is available by request to the corresponding author. The data are available under restricted access due to corporate copyright policies; access can be obtained by emailing the corresponding author.

Code availability

Circuit sampling code is available in pyGSTi⁵⁷. All the code used to analyze our results will promptly be made available upon request, subject to Sandia National Laboratories' copyright policies. Circuit simulations were performed using LoQS⁵⁸.

References

Shor, P. W. Polynomial-time algorithms for prime factorization and discrete logarithms on a quantum computer. SIAM J. Comput. 26, 1484–1509 (1997).
Article MathSciNet Google Scholar
Rubin, N. C. et al. Quantum computation of stopping power for inertial fusion target design. Proc. Natl Acad. Sci. USA 121, e2317772121 (2024).
Article CAS PubMed PubMed Central Google Scholar
Harrow, A. W., Hassidim, A. & Lloyd, S. Quantum algorithm for linear systems of equations. Phys. Rev. Lett. 103, 150502 (2009).
Article ADS MathSciNet PubMed Google Scholar
Cao, Y. et al. Quantum chemistry in the age of quantum computing. Chem. Rev. 119, 10856–10915 (2019).
Article CAS PubMed Google Scholar
Proctor, T., Young, K., Baczewski, A. D. & Blume-Kohout, R. Benchmarking quantum computers. Nat. Rev. Phys. 7, 105–118 (2025).
Gidney, C. & Ekerå, M. How to factor 2048 bit RSA integers in 8 hours using 20 million noisy qubits. Quantum 5, 433 (2021).
Article Google Scholar
Lee, J. et al. Even more efficient quantum computations of chemistry through tensor hypercontraction. PRX Quantum 2, 030305 (2021).
Article ADS Google Scholar
Terhal, B. M. Quantum error correction for quantum memories. Rev. Mod. Phys. 87, 307–346 (2015).
Article ADS MathSciNet Google Scholar
Poulin, D. Stabilizer formalism for operator quantum error correction. Phys. Rev. Lett. 95, 230504 (2005).
Article ADS PubMed Google Scholar
Bluvstein, D. et al. Logical quantum processor based on reconfigurable atom arrays. Nature 626, 58–65 (2024).
Article ADS CAS PubMed Google Scholar
Google Quantum AI. Suppressing quantum errors by scaling a surface code. Nature 614, 676–681 (2023).
Article ADS CAS Google Scholar
Wang, Y. et al. Fault-tolerant one-bit addition with the smallest interesting color code. Sci. Adv. 10, eado9024 (2024).
Article CAS PubMed PubMed Central Google Scholar
DeCross, M., Chertkov, E., Kohagen, M. & Foss-Feig, M. Qubit-reuse compilation with mid-circuit measurement and reset. Phys. Rev. X 13, 041057 (2023).
CAS Google Scholar
Emerson, J., Alicki, R. & Życzkowski, K. Scalable noise estimation with random unitary operators. J. Opt. B Quantum Semiclass. Opt. 7, S347 (2005).
Article ADS MathSciNet Google Scholar
Emerson, J. et al. Symmetrized characterization of noisy quantum processes. Science 317, 1893–1896 (2007).
Article ADS CAS PubMed Google Scholar
Knill, E. et al. Randomized benchmarking of quantum gates. Phys. Rev. A 77, 012307 (2008).
Article ADS Google Scholar
Magesan, E., Gambetta, J. M. & Emerson, J. Scalable and robust randomized benchmarking of quantum processes. Phys. Rev. Lett. 106, 180504 (2011).
Article ADS PubMed Google Scholar
Magesan, E., Gambetta, J. M. & Emerson, J. Characterizing quantum gates via randomized benchmarking. Phys. Rev. A 85, 042311 (2012).
Article ADS Google Scholar
Proctor, T. J. et al. Direct randomized benchmarking for multiqubit devices. Phys. Rev. Lett. 123, 030503 (2019).
Article ADS CAS PubMed Google Scholar
Proctor, T. et al. Scalable randomized benchmarking of quantum computers using mirror circuits. Phys. Rev. Lett. 129, 150502 (2022).
Article ADS CAS PubMed Google Scholar
Hines, J. et al. Demonstrating scalable randomized benchmarking of universal gate sets. Phys. Rev. X 13, 041030 (2023).
CAS Google Scholar
Hines, J., Hothem, D., Blume-Kohout, R., Whaley, B. & Proctor, T. Fully scalable randomized benchmarking without motion reversal. PRX Quantum 5, 030334 (2024).
Article Google Scholar
Proctor, T., Rudinger, K., Young, K., Nielsen, E. & Blume-Kohout, R. Measuring the capabilities of quantum computers. Nat. Phys. 18, 75 (2021).
Article Google Scholar
Polloreno, A. M. et al. A theory of direct randomized benchmarking. Preprint at http://arxiv.org/abs/2302.13853 (2023).
Flammia, S. T. & Liu, Y.-K. Direct fidelity estimation from few pauli measurements. Phys. Rev. Lett. 106, 230501 (2011).
Article ADS PubMed Google Scholar
Erhard, A. et al. Characterizing large-scale quantum computers via cycle benchmarking. Nat. Commun. 10, 5347 (2019).
Article ADS PubMed PubMed Central Google Scholar
Harper, R., Flammia, S. T. & Wallman, J. J. Efficient learning of quantum noise. Nat. Phys. 16, 1184–1188 (2020).
Article CAS Google Scholar
Nielsen, E. et al. Gate set tomography. Quantum 5, 557 (2021).
Article Google Scholar
Blume-Kohout, R. et al. Demonstration of qubit operations below a rigorous fault tolerance threshold with gate set tomography. Nat. Commun. 8, 14485 (2017).
Article CAS PubMed PubMed Central Google Scholar
Rudinger, K. et al. Characterizing midcircuit measurements on a superconducting qubit using gate set tomography. Phys. Rev. Appl. 17, 014014 (2022).
Article ADS CAS Google Scholar
Stricker, R. et al. Characterizing quantum instruments: From nondemolition measurements to quantum error correction. PRX Quantum 3, 030318 (2022).
Article ADS Google Scholar
Pereira, L., García-Ripoll, J. J. & Ramos, T. Parallel tomography of quantum non-demolition measurements in multi-qubit devices. Npj Quantum Inf. 9, 1–9 (2023).
Article Google Scholar
Wagner, S., Bancal, J.-D., Sangouard, N. & Sekatski, P. Device-independent characterization of quantum instruments. Quantum 4, 243 (2020).
Article Google Scholar
Gómez, E. S. et al. Device-independent certification of a nonprojective qubit measurement. Phys. Rev. Lett. 117, 260401 (2016).
Article ADS MathSciNet PubMed Google Scholar
Gaebler, J. P. et al. Suppression of midcircuit measurement crosstalk errors with micromotion. Phys. Rev. A 104, 062440 (2021).
Article ADS CAS Google Scholar
Govia, L. C. G. et al. A randomized benchmarking suite for mid-circuit measurements. N. J. Phys. 25, 123016 (2023).
Article Google Scholar
Hines, J. & Proctor, T. Pauli noise learning for mid-circuit measurements. Phys. Rev. Lett. 134, 020602 (2025).
Article MathSciNet CAS PubMed Google Scholar
Zhang, Z., Chen, S., Liu, Y. & Jiang, L. Generalized cycle benchmarking algorithm for characterizing midcircuit measurements. PRX Quantum 6, 010310 (2025).
Article Google Scholar
Shirizly, L., Govia, L. C. G. & McKay, D. C. Randomized benchmarking protocol for dynamic circuits. Phys. Rev. A 111, 012611 (2025).
Article MathSciNet CAS Google Scholar
Bisio, A., Chiribella, G., D’Ariano, G. M., Facchini, S. & Perinotti, P. Optimal quantum tomography of states, measurements, and transformations. Phys. Rev. Lett. 102, 010404 (2009).
Article ADS CAS PubMed Google Scholar
Lundeen, J. S. et al. Tomography of quantum detectors. Nat. Phys. 5, 27–30 (2009).
Article CAS Google Scholar
Feito, A. et al. Measuring measurement: theory and practice. N. J. Phys. 11, 093038 (2009).
Article Google Scholar
Maciejewski, F. B., Zimborás, Z. & Oszmaniec, M. Mitigation of readout noise in near-term quantum devices by classical post-processing based on detector tomography. Quantum 4, 257 (2020).
Article Google Scholar
Davies, E. B. & Lewis, J. T. An operational approach to quantum probability. Commun. Math. Phys. 17, 239–260 (1970).
Article ADS MathSciNet Google Scholar
Ryan-Anderson, C. et al. Implementing fault-tolerant entangling gates on the five-qubit code and the color code. Preprint at http://arxiv.org/abs/2208.01863 (2022).
Viola, L., Knill, E. & Lloyd, S. Dynamical decoupling of open quantum systems. Phys. Rev. Lett. 82, 2417–2421 (1999).
Article ADS MathSciNet CAS Google Scholar
Moussa, O., da Silva, M. P., Ryan, C. A. & Laflamme, R. Practical experimental certification of computational quantum gates using a twirling procedure. Phys. Rev. Lett. 109, 070504 (2012).
Article ADS PubMed Google Scholar
Hothem, D., Hines, J., Nataraj, K., Blume-Kohout, R. & Proctor, T. Predictive models from quantum computer benchmarks. 2023 IEEE Int. Conf. Quantum Comput. Eng. 1, 709–714 (2023).
Article Google Scholar
IBM. DynamicalDecouplingOptions. https://docs.quantum.ibm.com/api/qiskit-ibm-runtime/qiskit_ibm_runtime.options.DynamicalDecouplingOptions. (2024).
Wood, C. J. & Gambetta, J. M. Quantification and characterization of leakage errors. Phys. Rev. A 97, 032306 (2018).
Article ADS CAS Google Scholar
Chen, Z. et al. Measuring and suppressing quantum state leakage in a superconducting qubit. Phys. Rev. Lett. 116, 020501 (2016).
Article ADS PubMed Google Scholar
Chasseur, T. & Wilhelm, F. K. Complete randomized benchmarking protocol accounting for leakage errors. Phys. Rev. A 92, 042333 (2015).
Article ADS Google Scholar
Wallman, J. J., Barnhill, M. & Emerson, J. Robust characterization of leakage errors. N. J. Phys. 18, 043021 (2016).
Article Google Scholar
Beale, S. J. & Wallman, J. J. Randomized compiling for subsystem measurements. Preprint at https://arxiv.org/abs/2304.06599 (2023).
McLaren, D., Graydon, M. A. & Wallman, J. J. Stochastic errors in quantum instruments. Preprint at https://arxiv.org/abs/2306.07418 (2023).
Boixo, S. et al. Characterizing quantum supremacy in near-term devices. Nat. Phys. 14, 595–600 (2018).
Article CAS Google Scholar
Nielsen, E. et al. pyGSTio/pyGSTi: version 0.9.10.1. Zenodo https://doi.org/10.5281/zenodo.6363115 (2022).
Rudinger, K. et al. Probing logical error models with gate set tomography. In Presented at APS March Meeting (2023).

Download references

Acknowledgements

This material was funded in part by the US Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, Quantum Testbed Pathfinder Program. This research was funded in part by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA). Sandia National Laboratories is a multi-program laboratory managed and operated by National Technology and Engineering Solutions of Sandia, LLC., a wholly owned subsidiary of Honeywell International, Inc., for the US Department of Energy’s National Nuclear Security Administration under contract DE-NA-0003525. This research used IBM Quantum resources of the Air Force Research Laboratory. All statements of fact, opinion or conclusions contained herein are those of the authors and should not be construed as representing the official views or policies of the US Department of Energy, IARPA, the ODNI, the US Government, IBM, or the IBM Quantum team.

Author information

Authors and Affiliations

Quantum Performance Laboratory, Sandia National Laboratories, Livermore, CA, USA
Daniel Hothem & Timothy Proctor
Department of Physics, University of California, Berkeley, CA, USA
Jordan Hines
Quantum Performance Laboratory, Sandia National Laboratories, Albuquerque, NM, USA
Jordan Hines & Robin Blume-Kohout
Quantinuum, 303 S. Technology Ct., Broomfield, CO, USA
Charles Baldwin & Dan Gresh

Authors

Daniel Hothem
View author publications
Search author on:PubMed Google Scholar
Jordan Hines
View author publications
Search author on:PubMed Google Scholar
Charles Baldwin
View author publications
Search author on:PubMed Google Scholar
Dan Gresh
View author publications
Search author on:PubMed Google Scholar
Robin Blume-Kohout
View author publications
Search author on:PubMed Google Scholar
Timothy Proctor
View author publications
Search author on:PubMed Google Scholar

Contributions

D.H., J.H., R.B.-K. and T.P. conceived of the protocol. D.H., J.H. and T.P. were responsible for the theory. D.H. was responsible for designing and analyzing the experiments. C.B. and D.G. were responsible for running the experiments on H1-1. D.H. performed the simulations and submitted the IBM Q jobs.

Corresponding author

Correspondence to Daniel Hothem.

Ethics declarations

Competing interests

C.B. and D.G. are employees of Quantinuum. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Yunchao Liu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Transparent Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Hothem, D., Hines, J., Baldwin, C. et al. Measuring error rates of mid-circuit measurements. Nat Commun 16, 5761 (2025). https://doi.org/10.1038/s41467-025-60923-x

Download citation

Received: 17 March 2025
Accepted: 09 June 2025
Published: 01 July 2025
DOI: https://doi.org/10.1038/s41467-025-60923-x