MNISQ: A Large-Scale Quantum Circuit Dataset for Machine Learning in the NISQ Era

Placidi, Leonardo; Hataya, Ryuichiro; Mori, Toshio; Aoyama, Koki; Morisaki, Hayata; Mitarai, Kosuke; Fujii, Keisuke

doi:10.1038/s41597-026-07493-9

Download PDF

Data Descriptor
Open access
Published: 26 May 2026

MNISQ: A Large-Scale Quantum Circuit Dataset for Machine Learning in the NISQ Era

Leonardo Placidi ORCID: orcid.org/0000-0001-7693-7631^1,2,
Ryuichiro Hataya³,
Toshio Mori^1,2,4,
Koki Aoyama ORCID: orcid.org/0000-0001-9137-3977¹,
Hayata Morisaki¹,
Kosuke Mitarai^1,2 &
…
Keisuke Fujii^1,2,4

Scientific Data volume 13, Article number: 810 (2026) Cite this article

1369 Accesses
Metrics details

Subjects

Abstract

We introduce MNISQ, the first large-scale dataset for both quantum and classical machine learning during the NISQ era, containing 4.95 million circuits of 10 qubits constructed with up to 100 two-qubit gates. MNISQ serves as a foundational resource for developing natural language processing (NLP) models for quantum computing and deep learning models. The dataset is derived from quantum-encoded classical data (e.g., MNIST) and is available in two formats: quantum circuits and classical descriptions (Quantum Assembly Language, QASM). We perform baseline experiments on circuit classification using both quantum and classical methods. Quantum Kernel methods achieve up to 97% accuracy in multiclass classification. We also explore the impact of noise in quantum machine learning, helping develop error-mitigation strategies for noisy hardware. In classical experiments, we use QASM files with NLP models: S4, Transformer, and LSTM. The S4 model reaches 77% accuracy (81% with data augmentation), demonstrating that modern machine learning models can effectively classify quantum circuits. The dataset is publicly available at https://doi.org/10.5281/zenodo.19656638 and related codes are available on GitHub.

Quantum machine learning beyond kernel methods

Article Open access 31 January 2023

Experimental quantum end-to-end learning on a superconducting processor

Article Open access 01 March 2023

A quantum leaky integrate-and-fire spiking neuron and network

Article Open access 02 December 2024

Background & Summary

Background

Machine Learning (ML) is a discipline intricately shaped by the availability and quality of data¹. Quantum computers, initially conceived for tackling complex physics problems due to their computational advantages², have given rise to the development of “quantum” machine learning (QML) techniques^3,4. However, the scarcity of quantum datasets remains a challenge, and many experiments within the community rely on custom data processing and evaluation methods rather than standardized benchmarks.

Contemporary quantum computers fall under the category of Noisy-Intermediate-Scale Quantum (NISQ) devices⁵. Ranging from a few qubits to a few hundred, these devices lack full error correction and operate within the domain of small-to-medium scale hardware prone to noise. Despite these present constraints, research avenues are diverse and oriented toward practical applications^3,6. Quantum computers, characterized by their BQP complexity class², hold the potential to excel beyond classical counterparts in various problem domains⁷. Consequently, they offer the promise of enhanced data processing capabilities and accelerated execution of machine learning tasks^4,8. The ongoing research landscape has yielded noteworthy progress with quantum algorithms in supervised^4,9, unsupervised^10,11 and reinforcement learning^12,13,14. However, the existing quantum devices mostly suffice for addressing small problems, preventing a direct comparison with cutting-edge classical machine learning techniques. Anticipated advancements in this decade project the ability of quantum devices to support considerably larger NISQ computations with thousands of qubits (e.g., the IBM Development Quantum Roadmap). Consequently, successful breakthroughs in quantum algorithms and machine learning methodologies could usher in transformative changes across artificial intelligence and all scientific domains. An important direction is the challenge to find scenarios where quantum methods present an advantage of the classical counterpart. Our work addresses the research of an advantageous framework with the task of quantum circuit classification.

The present work introduces MNISQ, a large scale quantum circuit dataset aimed at delivering a standard testing environment for the community. MNISQ can be easily employed for machine learning on quantum computers (e.g., quantum machine learning) and machine learning for quantum computers (as a support tool to perform some tasks without relying on quantum hardware). Along the introduction of the dataset, we also present quantum machine learning and classical machine learning baseline experiments to show the dataset learnability and compare the performance of the two approaches.

Related Works

Here follow a series of existing studies concerning data associated with quantum systems, referred to as quantum datasets.

NTangle¹⁵: NTangle shares common ground with our research in the sense that it emphasizes the importance of focusing on quantum-related datasets to gain an advantage in QML. However, it defines the quantum state itself as the dataset, which, unlike our quantum circuit dataset, cannot be directly applied to existing classical machine learning.

QDataSet¹⁶: QDataSet is a dataset defined from 52 types of data concerning quantum operations for single and two-qubit systems, encompassing both noise-inclusive and noise-free scenarios. It is constructed to apply existing machine learning frameworks for characterizing quantum systems and improving experimental control setups including classical postprocesssing, and hence QDataSet constitutes a dataset pertaining to the physical dynamics of one- or two-qubit systems. Each dataset consists of 10, 000 data points.

Quantum Federated Data¹⁷: a quantum federated dataset which can be used for distributed learning in QC networks and can serve as a baseline for future quantum federated learning implementations with quantum sensors.

Quantum datasets on PennyLane PennyLane is an open-source software library that provides tools for quantum machine learning and quantum computing. Quantum Datasets in PennyLane, specifically for quantum chemistry and quantum spin systems provide detailed problem systems descriptions, parameterized models and their solution for these problems, aiding in the study of quantum algorithms related to these molecular and spin systems. While these datasets serve as a valuable baseline for transparent research, how to apply quantum and classical machine learning for these data remain challenges, the size of the dataset is also too small to be applied for the state-of-the-art machine learning techniques. Besides that, our dataset is also already partially available on PennyLane datasets page at https://pennylane.ai/datasets/mnisq.

VQE-generated dataset¹⁸: The VQE-generated dataset is the most akin to our approach. It is a quantum circuit dataset labeled by the computational task it performs. However, a significant issue arises due to the dataset’s size, as there are only 300 quantum circuits for each label, which is too small to apply state-of-the-art machine learning techniques.

MNISQ contains 4.95 million data points, with circuits of 10 qubits up to 100 two-qubit gates. This scale is several orders of magnitude larger than existing quantum circuit datasets for machine learning, which typically contain from a few hundred to at most a few thousand samples per task. For example, the VQE-generated dataset provides approximately 300 circuits per class, and QDataSet consists of multiple datasets of 10,000 samples each for small (1-2 qubit) systems.

Motivation and approach

The motivation behind this study is to present a framework of quantum circuit classification. In contrast to classical data classification, the availability of an encoded dataset allows comparing quantum and classical methods. While it has been theoretically shown that QML can be advantageous for certain tasks, there is a need to explore practical and experimentally friendly settings to showcase the potential of QML. One approach is to work with quantum data obtained from quantum systems directly, without converting them to classical formats, as it has been shown to be more efficient for certain learning tasks⁷. However, this approach presents challenges in transferring quantum states obtained from physical experiments to quantum computers, making it experimentally challenging. Another approach is to define artificial machine learning tasks that are solvable only by quantum computers, such as the discrete logarithm problem¹⁹. While this demonstrates a rigorous advantage of QML, it is in a highly artificial setting and may not directly translate to advantages in practical problems. Recent work has also suggested that practical quantum advantage may emerge in physics-related machine learning tasks, for example in ghost imaging, where hybrid quantum-classical models achieved improved performance over comparable classical methods in identification and imaging tasks²⁰. To address these limitations and establish a more practical and experimentally friendly setting, it is crucial to create datasets where the advantage of QML can be expected and compare its performance against classical approaches. Our motivation is to propose a new machine learning framework that analyzes and learns from “quantum circuit data” given as input. Quantum computing is still in its infancy, but as quantum computing becomes more widely used, more quantum circuit data will be accumulated in the world. Just as ChatGPT²¹ and other software can already handle programming languages and generate optimized code, learning quantum circuit data, analyzing it, and generating optimized quantum circuits will be a very important field where we can apply machine learning. Many quantum circuits have already been run using AWS and IBM cloud systems, and quantum circuit data is stored on the servers, but unfortunately we do not have access to that data. What is needed to accelerate the framework of learning from such quantum circuit data, is to define a large amount of quantum circuit data sets and start machine learning research using them. We have not developed a method for MNIST classification, but rather we have generated quantum circuit datasets with AQCE²² using MNIST data as a seed and have launched a completely new framework for machine learning on quantum circuit datasets.

This dataset is provided in a dual form that can be utilized for both quantum and classical machine learning. The scale and structure of MNISQ enable several concrete research directions beyond existing quantum datasets. In particular, MNISQ supports the systematic investigation of the scalability of QML models with respect to data size and circuit complexity, as well as the study of sample complexity in hybrid quantum-classical learning settings. Furthermore, the availability of both quantum circuit representations and classical descriptions (QASM) provides a controlled testbed for developing and benchmarking “quantum-aware” classical models that learn from structured quantum data. These research directions are difficult to explore with existing small-scale datasets and position MNISQ as a benchmark for large-scale studies in quantum machine learning.

The workflow of this study is summarized in Fig. 1.

Contributions

We introduce a large-scale quantum dataset in the form of quantum circuits and QASM, facilitating research in quantum machine learning and classical machine learning. The dataset is easily accessible on Zenodo at https://doi.org/10.5281/zenodo.19656638²³, and, given its QASM formalism, easily used in Qulacs, Qiskit, PennyLane.
We introduced a new framework where quantum and classical machine learning learn (and compete) on quantum circuit data.
We present a quantum machine learning baseline, also investigating the effect and mitigation of noise.
We explore the classification of circuits using classical models (S4, Transformers, LSTM), revealing the solvability of the problem by classical machine learning. We highlight the remarkable capability (and the implications) of classical machine learning in comprehending quantum circuit computations without prior knowledge of quantum mechanics.
We introduced a task of quantum circuit classification where quantum machine learning methods are advantageous compared to classical methods.

Methods

MNISQ Dataset: Construction and Characteristics

To create a quantum circuit dataset, we start from classical data like MNIST. The transformation of the classical data into quantum states is a pivotal task and we utilize the Automatic Quantum Circuit Encoding (AQCE) approach²². Figure 2 illustrates the procedure.

The literature^4,9,24 explored many kinds of encodings such as Basis Encoding, Amplitude Encoding, Repeated Amplitude Encoding, Rotation Encoding, Coherent State Encoding. Amplitude Encoding. Encoding information is a crucial task and can highly affect trainability (such as barren plateau²⁴) for quantum machine learning models. AQCE is an algorithm to automatically generate an optimal circuit for the quantum encoding of information. It is a NISQ method, and it does not rely on parametric circuit optimization. AQCE is a NISQ algorithm and difficult to scale with the number of qubits, but our work resides in the NISQ dataset generation, so AQCE is a good choice for an encoding strategy.

Quantum Encoding of Classical Information (AQCE method)

Given classical data $\vec{x}$ as d-dimensional vector, AQCE generates a quantum circuit ${\mathcal{C}}$ that encodes $\vec{x}$ into the complex amplitudes of the output quantum state:

$$\begin{array}{r}\left|\Psi \,\right\rangle :=\sum _{i\in {\{0,1\}}^{n}}{\widetilde{x}}_{i}\left|i\right\rangle ,\end{array}$$

(1)

where the number of qubits n is chosen to be $n:=\lceil {\log }_{2}(d)\rceil $ and the complex amplitudes ${\widetilde{x}}_{i}$ are normalized appropriately so that ${\sum }_{i}| {\widetilde{x}}_{i}{| }^{2}=1$. If 2ⁿ > d, for those indices that do not have corresponding elements in $\vec{x}$, ${\widetilde{x}}_{i}$ is set to be zero. AQCE constructs such a quantum circuit ${\mathcal{C}}$ generating any given quantum state $\left|\Psi \right\rangle $ by optimizing the configuration and parameters of the quantum gates in ${\mathcal{C}}$ by maximizing the absolute value of the following overlap, which we call fidelity here:

$$F=| \left\langle 0\right|{{\mathcal{C}}}^{\dagger }\left|\Psi \right\rangle | .$$

(2)

The AQCE iteration ends when the fidelity exceeds the target fidelity F_target or the maximum number of gates ${M}_{\max }$ initially set for ${\mathcal{C}}$. The resulting quantum circuit generated by the AQCE procedure from classical data $\vec{x}$ will be referred to as the $AQCE(\vec{x})$ circuit.

AQCE is intended to be performed by simulating a quantum computation on a classical computer. Although this can only be done on the order of tens of qubits, we believe that the dimension of the classical data that can be amplitude embedded is sufficient, for example, 10⁹ for a classical simulation of 30 qubits. Also, the embedding operation with AQCE may cause a computational bottleneck, but once the quantum circuit data is generated in this format (like JPG format for images), it can be used efficiently thereafter. For a more detailed description of AQCE, please refer to the supplementary material paragraph on the AQCE method.

Dataset Construction

Using a cluster machine, we employed the AQCE method to encode the standard machine learning datasets MNIST²⁵, Fashion-MNIST²⁶, and Kuzushiji-MNIST²⁷. These datasets were obtained from OpenML²⁸. These datasets are publicly available and distributed under terms that allow reuse for research purposes.

The datasets are accessible via OpenML: MNIST, Fashion-MNIST, Kuzushiji-MNIST, and OpenML. Original dataset sources are also available at: MNIST, Fashion-MNIST, and Kuzushiji-MNIST. All source datasets used in this work are publicly available and distributed under permissive licenses that allow their use and redistribution for research purposes. The datasets were accessed via OpenML, which provides guidance on dataset usage and licensing.

For each dataset, we generated 60,000 training samples and 10,000 test samples. Additionally, to better train classical machine learning models, we augmented^29,30 our quantum dataset to include 480,000 samples. The dataset strictly follows the original train/test splits of the source datasets. Data augmentation is applied exclusively to the training set, and no augmented samples derived from training images are included in the test set. Each fidelity variant (e.g., 80%, 90%, 95%) is generated independently from the same underlying train/test partition. While these variants correspond to different quantum circuit realizations of the same classical inputs, the separation between training and test data is preserved within each variant, and no sample or its derivatives appear across partitions. Classical training data were augmented offline from the original 60,000 training images using a three-stage sequential pipeline. First, we added one rotated variant per image using a fixed rotation angle of +50 degrees. Second, we added one crop variant per image using RandomCrop(size=(28,28)). Third, we added one shifted/affine variant per image using RandomAffine with degrees=(0,30), translate=(0.1,0.3), and scale=(0.5,0.75). Because each stage appends one additional transformed sample per current sample, dataset size doubles at each stage: 60,000 → 120,000 → 240,000 → 480,000, yielding 480,000 training samples for each dataset.

We created three variations for each dataset. Variations correspond to different generation procedures with a minimum fidelity of 80%, 90%, and 95% for the generated quantum circuit dataset. Higher fidelity values require a larger number of quantum gates in the circuit.

Data Records

The MNISQ dataset is publicly available on Zenodo at https://doi.org/10.5281/zenodo.19656638²³.

As illustrated in Fig. 3, each archive stores one subdataset together with the information required to reconstruct and reuse the encoded samples. MNISQ is organized into subdatasets defined by three factors: source dataset, fidelity target, and representation format. The source datasets are MNIST, Fashion-MNIST, and Kuzushiji-MNIST. For each of them, three fidelity levels are provided (80%, 90%, and 95%). In addition, both original and augmented training sets are available, together with test sets and two QASM representations.

Overall, MNISQ contains 9 original subdatasets (three source datasets × three fidelity levels), each with 60,000 training samples and 10,000 test samples, together with 9 augmented training subdatasets containing 480,000 training samples each. The total size is therefore 4,950,000 records. QASM files are distributed in two formats: QASM with DenseMatrix formalism, intended for Qulacs, and base QASM formalism, where the proprietary DenseMatrix operator is decomposed into standard gates for compatibility with platforms such as Qiskit and PennyLane.

Each encoded sample is associated with the following stored records:

fidelity the fidelity between the AQCE-generated circuit output state and the target amplitude-encoded state;
label the class label inherited from the original image dataset;
qasm the quantum circuit description in QASM format;
state the target state vector corresponding to the encoded classical input.

File and archive names follow a structured convention of the form:

[data]_[type]_[fidelity].zip

where data identifies the split and representation, type identifies the source dataset, and fidelity identifies the AQCE fidelity target.

Details on dataset generation

This section summarizes the parameters and hardware used for generation of the MNISQ records.

Table 1 reports the AQCE generation parameters for each target fidelity. Here, M denotes the maximum number of two-qubit gates allowed in the generated circuit, while δ determines how many gates are added at each AQCE iteration.

Table 1 Parameters used to achieve the target fidelity in AQCE generation.

Full size table

The dataset was generated on a cluster machine with the following specifications:

CPU: Dual Intel Xeon Platinum 9242 (2.3 GHz, 48-core)
Memory: 384 GB (24 × 16 GB DDR4-2933 Reg. ECC)

Generating one MNIST sample at 90% fidelity required approximately 8 seconds per process. Although AQCE can be executed with multithreading, we adopted single-threaded execution because it was more efficient for large-scale parallel generation.

For augmented datasets, 480,000 samples were generated by distributing the work across 1000 independent processes, each producing 480 samples. Each process was submitted as a single job to the cluster. To reduce disk I/O contention, each process accumulated its 480 generated circuits in memory and wrote them to disk only after completion. Using a 40-node cluster (3840 cores), the full 4.95 million-record dataset was generated in approximately 8.5 hours.

Table 2 summarizes the source datasets included in MNISQ. For each source dataset, original train/test splits are provided together with augmented training data and all three fidelity targets.

Table 2 Breakdown of the MNISQ records by source dataset.

Full size table

Dual-form representation: quantum circuits and QASM descriptions

Quantum circuits for Qulacs can be loaded using methods of the form:

Qulacs_dataset.[type].loader.load_[type]_[data_type]_[fidelity]

In this notation:

type specifies the source dataset: mnist, fashion_mnist, or kuzushiji_mnist;
data_type specifies the split: original training, augmented training, or test;
fidelity specifies the AQCE target fidelity: f80, f90, or f95.

For example, the MNIST training dataset with target fidelity 95% can be loaded as:

Qulacs_dataset.mnist.loader.load_mnist_train_f95()

Executing one stored circuit embeds the corresponding classical input into a quantum state. This state can then be used directly as input to downstream quantum machine learning pipelines.

The Qulacs-oriented QASM representation uses the proprietary DenseMatrix instruction to encode gate blocks compactly. Since DenseMatrix is not part of standard OpenQASM, a second representation is also provided in which these operations are decomposed into standard gates for compatibility with Qiskit and PennyLane.

The DenseMatrix instruction has the form:

DenseMatrix(number of target qubits, number of control qubits, matrix elements a + bi ordered as “a, b”), target qubit column, control qubit column;

An example is:

DenseMatrix(2,0,0.900726,0,-0.267421,0,...,0.752815,0) q[2],q[6];

Within each archive, the directory structure is organized by record type:

fidelity fidelity values for all encoded samples;
label labels associated with each encoded sample;
qasm QASM circuit files;
state target state vectors corresponding to the encoded samples.

The main archive prefixes are:

train_orig 60,000 original encoded training samples in QASM with DenseMatrix;
base_train_orig the same original training samples in base QASM without DenseMatrix;
train 480,000 augmented training samples in QASM with DenseMatrix;
test 10,000 encoded test samples in QASM with DenseMatrix;
base_test the same test samples in base QASM without DenseMatrix.

The dataset identifiers are:

mnist_784 MNIST;
Fashion-MNIST Fashion-MNIST;
Kuzushiji-MNIST Kuzushiji-MNIST.

The fidelity suffixes are:

f80 fidelity ≥80%;
f90 fidelity ≥90%;
f95 fidelity ≥95%.

In addition to enforcing fidelity thresholds during AQCE generation, we carried out practical validation of the generated records by checking that the stored circuit representations reproduce the intended encoded states and associated source images upon execution. To support portability, the dataset includes both a Qulacs-oriented QASM representation using DenseMatrix and a base QASM representation intended for compatibility with other frameworks such as Qiskit and PennyLane. The integrity and usability of these files are further supported by the public tutorials and examples distributed with the dataset repository, as well as by subsequent reuse of MNISQ in the community.

Accordingly, archives such as test_mnist_784_f90.zip or base_train_orig_Fashion-MNIST_f95.zip can be interpreted directly from their filenames. All such archives are available through the Zenodo repository.

Technical Validation

Baseline for MNISQ

In this section, we present a baseline with quantum models and classical models. Our experiments serve to show the challenges depending on the data representations, indicating a relative advantage for quantum models in this scenario of circuit classification. In particular, the quantum representation directly encodes amplitudes of the target quantum state, while the classical representation corresponds to a syntactic description of the circuit. As a result, the two settings reflect different learning problems.

Quantum Machine Learning experiments

In the natural task of multiclass classification, MNISQ can be investigated by explicit (such as variational algorithms^3,31) and implicit quantum models (such as quantum kernel³²). In these first experiments, we chose to present results based on the Quantum Kernel approach. The use of a quantum kernel is motivated by its ability to compute classically hard or intractable kernels^9,32,33, the existing literature for classification tasks^34,35, along with their relative efficiency with parallel computation on simulators. In addition, recent studies have shown that quantum kernels can offer a quantum advantage over classical methods on carefully engineered datasets¹⁹ representing a very promising direction for the quantum community⁸.

In the literature, the classification of MNIST and image datasets has been explored in the variational^36,37,38, kernel³⁵ and tensor networks³⁹ literature in different ways. Among the frameworks, the most promising is the quantum convolutional network⁴⁰ with already significant works in the quantum classification field^38,41. Existing methods are generally based on one or multiple encoding of the data (often rescaled to fit less qubits) and, in limited cases, in the adoption of classical models in part of the pipeline^42,43,44. To the best of our knowledge, the existing classification of encoded data is mostly based on binary classification or 4-class classification, so we believe that the 10 classes of MNISQ may also present an interesting direction for future works on variational algorithms.

Quantum Support Vector Machine (QSVM)

The quantum experiments are based on QSVM, so we shortly introduce the main idea and leave in the supplementary material an introduction to the classical Support Vector Machine (SVMs), which differ from QSVM only in the computation of the kernel while leaving the same optimization procedure.

In fact, QSVMs differ from SVMs only because in the formulation of the decision function:

$$f(\vec{x})={\rm{sgn}}\left(\mathop{\sum }\limits_{n=1}^{N}{t}_{n}{\alpha }_{n}K({\vec{x}}^{(i)},\vec{x})\right),\vec{x}\in {\rm{TestSet}},$$

(3)

The Kernel function $K\left({\vec{x}}^{(i)},{\vec{x}}^{(j)}\right)$ is obtained by taking the inner product between quantum states, as detailed in Fig. 4. In our examples, the quantum states come from the encoding of our information (AQCE) available in the dataset, thus we can also write our quantum kernel as:

$$K\left({\vec{x}}^{(i)},{\vec{x}}^{(j)}\right)=| \left\langle 00...0\right|AQCE({\vec{x}}^{(i)})AQCE{({\vec{x}}^{(j)})}^{\dagger }\left|00...0\right\rangle {| }^{2}$$

(4)

QSVM implementation details

We use a support vector classifier with precomputed kernel (scikit-learn) with regularisation parameter C = 1.0 and no additional hyperparameter tuning.

In the ideal (noise-free) setting, kernel elements are computed via statevector simulation, allowing construction of the full kernel matrices on the dataset splits used in the experiments.

In contrast, in the noisy setting, each kernel element must be estimated via repeated circuit executions (shots) and error mitigation. Due to the resulting computational cost and the O(N²) scaling of kernel methods, full kernel evaluation was infeasible on our devices. Therefore, all noisy experiments are conducted on reduced subsets of size d = 200, using k = 10, 000 shots per circuit and n = 10 repeated simulations, as detailed in Fig. 5. In the ideal (statevector) setting, the QSVM evaluation is deterministic and does not exhibit variability across runs. In contrast, in the noisy setting, results are stochastic due to sampling, and variability is reported via standard deviations.

We compare our prediction results based on the accuracy of the decision function and the results are given in multiclass classification accuracy.

Quantum Baseline

Table 3 shows our results on the three subdatasets of MNISQ (mnist_784, Fashion-MNIST, Kuzushiji-MNIST). The two methods employed are a variation of the quantum support vector machines (QSVMs) where, instead of performing a binary classification task, we perform a multiclass classification.

Table 3 Experiments with the quantum kernel approach in Qulacs under ideal statevector simulation.

Full size table

Looking at Table 3, the one-versus-one approach fits $\frac{{n}_{{\rm{classes}}}({n}_{{\rm{classes}}}-1)}{2}$ QSVMs on all the possible couples of classes and finally classifies based on a voting scheme called one-versus-one. On the other side, one-versus-the-rest approach consists in training n_classes QSVMs (10 for MNISQ subdatasets), each one able to classify one class versus any other. We performed our experiments using Qulacs⁴⁵ quantum computer simulator for the quantum kernels and sklearn implementations for the classifiers⁴⁶. The one-versus-the-rest training is significantly faster because of its limited number of classifiers⁴⁷, but since every classifier is trained on a small subset of the dataset, the one-versus-one is likely more robust and, in fact, presents a slightly better performance. We observe that our best prediction is reached when the fidelity is at least 95% for the data. Thus, a higher fidelity circuit leads to a better approximation of the target state and a more effective feature map for the quantum kernel.

Performance of QML under noisy operations

The MNISQ dataset is a 10-qubit quantum circuit with a total number of quantum gates of about 100, which should be sufficient to run on a state-of-the-art quantum computer achieving quantum volume 2¹⁹. To demonstrate this, we introduce 1- and 2-qubit depolarizing noise after each gate with error probability p = 0.001 or p = 0.002, and estimate the kernel elements by sampling with error mitigation via zero-noise extrapolation (ZNE) using linear extrapolation^48,49. Depolarizing noise is a standard and widely used effective noise model in NISQ studies, as it provides a simple way to capture generic gate imperfections and loss of coherence without assuming device-specific calibration details.

Given that the datasets have 60, 000 training samples and 10, 000 test samples each, it is not feasible to compute the full noisy kernel matrix. The main bottleneck is that each circuit must be executed with a large number of shots (thousands to millions), resulting in runtimes of approximately one minute per circuit on our machines. Therefore, in the noisy setting we construct kernel matrices on subsets of size d = 200, using k = 10, 000 shots per circuit, n = 10 repeated simulations, and analyze the average residual noise after mitigation. We perform this study on the three dataset versions and report standard deviations due to the stochastic nature of the experiments.

In our results, the Train/Train, Test/Test, and Train/Test configurations correspond to different combinations of noise distributions applied to the kernel matrices. In the Train/Train and Test/Test settings, the same noise distribution is used for both training and testing, while Train/Test introduces a mismatch between the two.

We described the detailed algorithm in Fig. 5.

In Table 4 we show the distributions of the residual noise after mitigation for the MNISQ_784 dataset with different fidelities. Due to the limited number of shots, the residual noise can be well approximated by Gaussian distributions. As shown, the noise fitted on the training data exhibits a lower standard deviation ( ~ 0.008) compared to the test data (~0.150). This effect could be mitigated by increasing the number of simulations, assuming consistent noise settings across runs.

Table 4 Summary of train and test statistics.

Full size table

The reported standard deviations quantify the variability introduced by sampling, and this variability propagates to the observed differences in classification accuracy.

Finally, the accuracy scores for the QSVM obtained with the above statistical mitigation procedure are reported in Table 5. In the Train/Train configuration, the noise distribution fitted on the training data is used to perturb both training and test kernels. In Test/Test, the distribution fitted on the test data is used for both. In Train/Test, the training kernels are perturbed using the training noise distribution, while the test kernels use the test noise distribution, introducing a distribution shift.

Table 5 Accuracy results applying Algorithm 1.

Full size table

This mismatch explains the drop in accuracy observed in the Train/Test setting. When the noise distribution is consistent and concentrated (Train/Train), performance remains close to the ideal case, while larger variance in the noise (Test/Test) leads to a moderate degradation. These results indicate that the mitigated kernel remains competitive with the ideal kernel when the residual noise variance is sufficiently small.

Classical Machine Learning baseline

We evaluate the performance of classical machine learning methods in recognizing quantum circuits using classical deep neural networks by treating the problem as sequence classification. Specifically, we used Transformer⁵⁰, LSTM⁵¹, and Structured State Space (S4)⁵² models as sequence classifiers, which are trained to classify QASMs (Quantum Assembly Language) in the MNISQ datasets. While the literature presents examples of hybrid quantum-classical models^42,53, our approach is a new direction where data and models are purely classical for the classification of quantum circuits. Detailed experimental settings are described in the supplementary material.

The input QASMs were processed by removing unnecessary information, such as headers, and rounding dense matrix elements, e.g., from 0.12345 to 0.12, as shown in Fig. 6. We specify that we performed the experiments using the version of the dataset with QASM files with the proprietary extension Dense().

Table 6 presents test accuracy of the classical machine learning models on the MNISQ datasets. S4 achieved the highest accuracy, with a test accuracy of 77.78% on mnist_784 with a fidelity of 95. This performance surpassed that of the other models by a significant margin. Interestingly, as the fidelity decreased, both the accuracy of the quantum methods and the accuracy of the S4 and LSTM also dropped. On the other hand, the accuracy of the Transformer showed the opposite trend, where datasets with lower fidelity resulted in better performance. Because higher-fidelity datasets have more information and consist of longer QASM sequences, these results suggest that S4 and LSTM can effectively capture long-range dependencies while Transformer cannot. In particular, the S4 model was designed to capture long-range dependencies⁵², and thus, achieved the highest performance.

Table 6 Test multiclass accuracy of classical machine learning models.

Full size table

Table 7 shows test accuracy of the S4 model trained with and without augmented datasets. We can observe clear performance improvement on each dataset, indicating the effectiveness of data augmentation in MNISQ as in other machine learning problems³⁰. We may also expect that dataset size matters to machine learning methods for quantum-circuit recognition tasks from these results.

Table 7 Test multiclass accuracy of the S4 model trained with and without augmented data at fidelity 95.

Full size table

It is important to note that these results were achieved without incorporating any prior knowledge of quantum computing into the classical machine learning methods. Incorporating such knowledge could further improve the performance of these classical models in classifying quantum circuits.

In conclusion, MNISQ enables systematic studies with variational and gradient-based quantum models, hybrid quantum-classical approaches, and classical sequence models. It provides a controlled benchmark for analyzing robustness to noise and circuit complexity, evaluating error mitigation techniques, and developing data-driven methods to model noise effects in quantum circuits. While not designed for direct implementation of quantum error correction codes, the dataset can support benchmarking of noise-aware methods and preprocessing strategies relevant to error correction and fault-tolerant workflows. Given its scale and dual representation, the dataset also supports tasks such as circuit classification, generation, and quantum architecture search.

Data availability

The MNISQ dataset is publicly available via a persistent Zenodo repository at https://doi.org/10.5281/zenodo.19656638²³.

The dataset is also accessible via https://aqora.io/datasets/leopla/mnisq and https://pennylane.ai/datasets/mnisq.

Code availability

Code, documentation and usage are accessible on GitHub at https://github.com/FujiiLabCollaboration/MNISQ-quantum-circuit-dataset, where README file, tutorials, and installation with PyPI are openly available. Version information for the baseline software environment is recorded in the repository dependency files (pyproject.toml and poetry.lock), including Python, NumPy, Scikit-learn, and Qulacs versions used for the reported experiments. The dataset is also hosted on Aqora (https://aqora.io/datasets/leopla/mnisq) and partially on PennyLane (https://pennylane.ai/datasets/mnisq).

References

Deng, J. et al. Imagenet: A large-scale hierarchical image database. 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255, https://doi.org/10.1109/CVPR.2009.5206848 (IEEE, 2009).
Nielsen, M. A. & Chuang, I. L. Quantum Computation and Quantum Information, 10th anniversary edition edn. Cambridge University Press, Cambridge, UK https://doi.org/10.1017/CBO9780511976667 (2010).
Cerezo, M. et al. Variational quantum algorithms. Nature Reviews Physics 3(9), 625–644, https://doi.org/10.1038/s42254-021-00348-9 (2021).
Article ADS Google Scholar
Schuld, M. & Petruccione, F. Supervised Learning with Quantum Computers. Springer, Switzerland https://doi.org/10.1007/978-3-319-96424-9 (2018).
Preskill, J. Quantum computing in the nisq era and beyond. Quantum 2, 79, https://doi.org/10.22331/q-2018-08-06-79 (2018).
Article Google Scholar
Cerezo, M., Verdon, G., Huang, H.-Y., Cincio, L. & Coles, P. J. Challenges and opportunities in quantum machine learning. Nature Computational Science 2, 567–576, https://doi.org/10.1038/s43588-022-00068-7 (2022).
Article CAS PubMed Google Scholar
Huang, H.-Y. et al. Quantum advantage in learning from experiments. Science 376(6598), 1182–1186, https://doi.org/10.1126/science.abn7293 (2022).
Article ADS MathSciNet CAS PubMed Google Scholar
Schuld, M. & Killoran, N. Is quantum advantage the right goal for quantum machine learning? PRX Quantum 3(3), https://doi.org/10.1103/prxquantum.3.030101 (2022) .
Schuld, M. Supervised quantum machine learning models are kernel methods. https://doi.org/10.48550/arXiv.2101.11020 (2021).
Kerenidis, I., Landman, J., Luongo, A. & Prakash, A. Q-Means: A Quantum Algorithm for Unsupervised Machine Learning. Curran Associates Inc., Red Hook, NY, USA https://papers.nips.cc/paper_files/paper/2019/hash/16026d60ff9b54410b3435b403afd226-Abstract.html (2019).
Landman, J. Quantum Algorithms for Unsupervised Machine Learning and Neural Networks. https://doi.org/10.48550/arXiv.2111.03598 (2021)
Ostaszewski, M., Trenkwalder, L. M., Masarczyk, W., Scerri, E. & Dunjko, V. Reinforcement learning for optimization of variational quantum circuit architectures. Neural Information Processing Systems https://api.semanticscholar.org/CorpusID:229166765 (2021).
Jerbi, S., Gyurik, C., Marshall, S., Briegel, H. J. & Dunjko, V. Parametrized quantum policies for reinforcement learning. Neural Information Processing Systems https://api.semanticscholar.org/CorpusID:244843259 (2021).
Dong, D., Chen, C., Li, H. & Tarn, T.-J. Quantum reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 38(5), 1207–1220, https://doi.org/10.1109/tsmcb.2008.925743 (2008).
Article Google Scholar
Schatzki, L., Arrasmith, A., Coles, P. J. & Cerezo, M. Entangled datasets for quantum machine learning. arXiv preprint https://doi.org/10.48550/arXiv.2109.03400 (2021)
Perrier, E., Youssry, A. & Ferrie, C. Qdataset, quantum datasets for machine learning. Scientific Data 9(1), 582, https://doi.org/10.1038/s41597-022-01639-1 (2022).
Article PubMed PubMed Central Google Scholar
Chehimi, M. & Saad, W. Quantum federated learning with quantum data. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8617–8621, https://doi.org/10.1109/ICASSP43922.2022.9746622 (2022).
Nakayama, A., Mitarai, K., Placidi, L., Sugimoto, T. & Fujii, K. Vqe-generated quantum circuit dataset for machine learning. Physical Review Research 7(3), https://doi.org/10.1103/c43x-9866 (2025)
Liu, Y., Arunachalam, S. & Temme, K. A rigorous and robust quantum speed-up in supervised machine learning. Nature Physics 17(9), 1013–1017, https://doi.org/10.1038/s41567-021-01287-z (2021).
Article ADS CAS Google Scholar
Xiao, T. et al. Practical advantage of quantum machine learning in ghost imaging. Communications Physics 6, 171, https://doi.org/10.1038/s42005-023-01290-1 (2023).
Article ADS Google Scholar
OpenAI ChatGPT: OpenAI’s Language Model. Online https://openai.com/blog/chatgpt (2023).
Shirakawa, T., Ueda, H. & Yunoki, S. Automatic quantum circuit encoding of a given arbitrary quantum state. Physical Review Research 6(4), https://doi.org/10.1103/physrevresearch.6.043008 (2024)
Placidi, L. et al. MNISQ: A Large-Scale Quantum Circuit Dataset for Machine Learning in the NISQ Era. https://doi.org/10.5281/zenodo.19656638
Huang, H.-Y. et al. Power of data in quantum machine learning. Nature Communications 12(1), https://doi.org/10.1038/s41467-021-22539-9 (2021)
LeCun, Y. & Cortes, C. The mnist database of handwritten digits. Dataset available via OpenML. (2005).
Xiao, H., Rasul, K. & Vollgraf, R. Fashion-MNIST: A Novel Image Dataset for Benchmarking Machine Learning Algorithms. Dataset available via OpenML. (2017)
Clanuwat, T. et al. Deep learning for classical japanese literature. arXiv preprint arXiv:1812.01718 https://arxiv.org/abs/1812.01718 arXiv:1812.01718 [cs.CV](2018).
Vanschoren, J., Rijn, J. N., Bischl, B. & Torgo, L. Openml: networked science in machine learning. SIGKDD Explorations 15(2), 49–60, https://doi.org/10.1145/2641190.2641198 (2013).
Article Google Scholar
Shorten, C. & Khoshgoftaar, T. M. A survey on image data augmentation for deep learning. Journal of Big Data 6, 1–48 (2019).
Article Google Scholar
Perez, L. & Wang, J. The Effectiveness of Data Augmentation in Image Classification using Deep Learning https://doi.org/10.48550/arXiv.1712.04621 (2017).
Abohashima, Z., Elhosen, M., Houssein, E. H. & Mohamed, W. M. Classification with Quantum Machine Learning: A Survey https://doi.org/10.48550/arXiv.2006.12270 (2020).
Schuld, M. & Killoran, N. Quantum machine learning in feature hilbert spaces. Physical Review Letters 122(4) https://doi.org/10.1103/physrevlett.122.040504 (2019)
Mengoni, R. & Pierro, A. D. Kernel methods in quantum machine learning. Quantum Machine Intelligence 1, 65–71, https://doi.org/10.1007/s42484-019-00007-4 (2019).
Article Google Scholar
Rebentrost, P., Mohseni, M. & Lloyd, S. Quantum support vector machine for big data classification. Physical Review Letters 113(13), https://doi.org/10.1103/physrevlett.113.130503 (2014).
Haug, T., Self, C. N. & Kim, M. S. Quantum machine learning of large datasets using randomized measurements. Machine Learning: Science and Technology 4(1), 015005, https://doi.org/10.1088/2632-2153/acb0b4 (2023).
Article ADS Google Scholar
Farhi, E. & Neven, H. Classification with Quantum Neural Networks on Near Term Processors. https://doi.org/10.48550/arXiv.1802.06002 (2018).
Sahoo, S., Azad, U.& Singh, H. Quantum Phase Recognition using Quantum Tensor Networks. https://doi.org/10.48550/arXiv.2212.06207 (2022).
Hur, T., Kim, L. & Park, D. K. Quantum convolutional neural network for classical data classification. Quantum Machine Intelligence 4(1), https://doi.org/10.1007/s42484-021-00061-x (2022)
Stoudenmire, E. M. & Schwab, D. J. Supervised learning with tensor networks. Proceedings of the 30th International Conference on Neural Information Processing Systems. NIPS’16, pp. 4806–4814. Curran Associates Inc., Red Hook, NY, USA https://papers.nips.cc/paper/6211-supervised-learning-with-tensor-networks (2016).
Cong, I., Choi, S. & Lukin, M. D. Quantum convolutional neural networks. Nature Physics 15(12), 1273–1278, https://doi.org/10.1038/s41567-019-0648-8 (2019).
Article ADS CAS Google Scholar
Bokhan, D., Mastiukova, A. S., Boev, A. S., Trubnikov, D. N. & Fedorov, A. K. Multiclass classification using quantum convolutional neural networks with hybrid quantum-classical learning. Frontiers in Physics 10, https://doi.org/10.3389/fphy.2022.1069985 (2022)
Yang, C.-H. H., Qi, J., Chen, S. Y.-C., Tsao, Y. & Chen, P.-Y. When bert meets quantum temporal convolution learning for text classification in heterogeneous computing. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 8602–8606 https://doi.org/10.1109/ICASSP43922.2022.9746412 (2022)
Beaulieu, D., Miracle, D., Pham, A. & Scherr, W. Quantum Kernel for Image Classification of Real World Manufacturing Defects https://doi.org/10.48550/arXiv.2212.08693 (2022).
Choe, S. Continuous Variable Quantum MNIST Classifiers https://doi.org/10.48550/arXiv.2204.01194 (2022).
Suzuki, Y. et al. Qulacs: a fast and versatile quantum circuit simulator for research purpose. Quantum 5, 559, https://doi.org/10.22331/q-2021-10-06-559 (2021).
Article Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12, 2825–2830 (2011).
MathSciNet Google Scholar
Bishop, C. M. Pattern Recognition and Machine Learning (Information Science and Statistics). Springer, Berlin, Heidelberg (2006)
Li, Y. & Benjamin, S. C. Efficient variational quantum simulator incorporating active error minimization. Phys. Rev. X 7, 021050, https://doi.org/10.1103/PhysRevX.7.021050 (2017).
Article Google Scholar
Temme, K., Bravyi, S. & Gambetta, J. M. Error mitigation for short-depth quantum circuits. Phys. Rev. Lett. 119, 180509, https://doi.org/10.1103/PhysRevLett.119.180509 (2017).
Article ADS MathSciNet PubMed Google Scholar
Vaswani, A. et al. Attention is all you need. Guyon I., LuxburgU., Bengio S., Wallach H., Fergus R., VishwanathanS. V. N., Garnett R. (eds.) Advances in Neural Information Processing Systems, vol. 30. Curran Associates, Inc., Red Hook, NY, USA https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf (2017).
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Computation 9(8), 1735–1780, https://doi.org/10.1162/neco.1997.9.8.1735 (1997).
Article CAS PubMed Google Scholar
Gu, A., Goel, K. & Ré, C. Efficiently Modeling Long Sequences with Structured State Spaces. https://doi.org/10.48550/arXiv.2111.00396 (2022).
Gircha, A. I., Boev, A. S., Avchaciov, K., Fedichev, P. O. & Fedorov, A. K. Hybrid quantum-classical machine learning for generative chemistry and drug design. Scientific Reports. 13(1), https://doi.org/10.1038/s41598-023-32703-4 (2023).

Download references

Acknowledgements

This work is supported by MEXT Quantum Leap Flagship Program (MEXT Q-LEAP) Grant No. JPMXS0118067394 and JPMXS0120319794, and JST COI- NEXT Grant No. JPMJPF2014.

Author information

Authors and Affiliations

Graduate School of Engineering Science, The University of Osaka, 1-3 Machikaneyama, Toyonaka, 560-0043, Osaka, Japan
Leonardo Placidi, Toshio Mori, Koki Aoyama, Hayata Morisaki, Kosuke Mitarai & Keisuke Fujii
Center for Quantum Information and Quantum Biology, The University of Osaka, 1-2 Machikaneyama, Toyonaka, 560-0043, Osaka, Japan
Leonardo Placidi, Toshio Mori, Kosuke Mitarai & Keisuke Fujii
RIKEN Center for Advanced Intelligence Project (AIP), RIKEN, Nihonbashi 1-chome Mitsui Building, 15th floor, 1-4-1 Nihonbashi, Chuo-ku, 103-0027, Tokyo, Japan
Ryuichiro Hataya
RIKEN Center for Quantum Computing (RQC), RIKEN, Hirosawa 2-1, Wako, 351-0198, Saitama, Japan
Toshio Mori & Keisuke Fujii

Authors

Leonardo Placidi
View author publications
Search author on:PubMed Google Scholar
Ryuichiro Hataya
View author publications
Search author on:PubMed Google Scholar
Toshio Mori
View author publications
Search author on:PubMed Google Scholar
Koki Aoyama
View author publications
Search author on:PubMed Google Scholar
Hayata Morisaki
View author publications
Search author on:PubMed Google Scholar
Kosuke Mitarai
View author publications
Search author on:PubMed Google Scholar
Keisuke Fujii
View author publications
Search author on:PubMed Google Scholar

Contributions

L.P. led the project and coordinated the review and revisions. L.P., R.H., T.M, and K.F. designed the methodology. L.P. and R.H. performed the experiments. L.P. and T.M. conducted data preparation and generation. K.A., H.M., and K.F. contributed to data preparation and experiments. K.M. and K.F. supervised the research. L.P., R.H., T.M., and K.F. wrote the manuscript, and all authors contributed to editing.

Corresponding author

Correspondence to Leonardo Placidi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information (download PDF )

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Placidi, L., Hataya, R., Mori, T. et al. MNISQ: A Large-Scale Quantum Circuit Dataset for Machine Learning in the NISQ Era. Sci Data 13, 810 (2026). https://doi.org/10.1038/s41597-026-07493-9

Download citation

Received: 21 November 2024
Accepted: 20 May 2026
Published: 26 May 2026
Version of record: 01 June 2026
DOI: https://doi.org/10.1038/s41597-026-07493-9

Subjects

Abstract

Similar content being viewed by others

Quantum machine learning beyond kernel methods

Experimental quantum end-to-end learning on a superconducting processor

A quantum leaky integrate-and-fire spiking neuron and network

Background & Summary

Background

Related Works

Motivation and approach

Contributions

Methods

MNISQ Dataset: Construction and Characteristics

Quantum Encoding of Classical Information (AQCE method)

Dataset Construction

Data Records

Details on dataset generation

Dual-form representation: quantum circuits and QASM descriptions

Technical Validation

Baseline for MNISQ

Quantum Machine Learning experiments

Quantum Support Vector Machine (QSVM)

QSVM implementation details

Quantum Baseline

Performance of QML under noisy operations

Classical Machine Learning baseline

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information (download PDF )

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links