Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Data
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific data
  3. data descriptors
  4. article
An EEG dataset for handwriting imagery decoding of Chinese character strokes and Pinyin single vowels
Download PDF
Download PDF
  • Data Descriptor
  • Open access
  • Published: 02 February 2026

An EEG dataset for handwriting imagery decoding of Chinese character strokes and Pinyin single vowels

  • Fan Wang  ORCID: orcid.org/0009-0008-8794-53601,2,
  • Yanxiao Chen1,2,
  • Peng Wang1,2,
  • Anmin Gong3,
  • Jiaping Xu1 &
  • …
  • Yunfa Fu1,2 

Scientific Data , Article number:  (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Brain–machine interface
  • Cognitive neuroscience

Abstract

Non-invasive EEG-based brain-computer interfaces (BCI) for handwriting imagery can support the restoration of fine writing abilities in individuals with motor impairments. However, the development of high-performance decoding algorithms is constrained by scarce training datasets. To address this, we present the first open EEG dataset dedicated to handwriting imagery. The dataset comprises 32-channel EEG recordings (sampled at 1000 Hz) from 21 healthy participants across two sessions separated by at least 24 hours. A dual-paradigm design captures multidimensional neural features: a Chinese character stroke imagery task (five basic strokes, 200 trials per session) and a Pinyin single-vowel imagery task (six vowels, 240 trials per session). After rigorous quality screening, 18,480 standardized trials are provided, ensuring completeness, reliability, and adherence to the Brain Imaging Data Structure (BIDS) standard. This dataset enables the development and evaluation of algorithms for non-invasive BCI and supports research on restoring writing-based communication in individuals with motor impairments.

Similar content being viewed by others

An EEG Dataset for Visual Imagery-Based Brain–Computer Interface

Article Open access 03 January 2026

An EEG motor imagery dataset for brain computer interface in acute stroke patients

Article Open access 25 January 2024

EEG-based BCI Dataset of Semantic Concepts for Imagination and Perception Tasks

Article Open access 15 June 2023

Data availability

The EEG dataset generated and analyzed in this study is publicly available on figshare (https://doi.org/10.6084/m9.figshare.29987758.v4). The data are structured in compliance with the EEG-BIDS standard, featuring participant- and session-level directories alongside detailed metadata (e.g., data description, participant information, electrode locations, and event markers) to support reproducible research. Additional resources, including experimental stimuli and custom code for data preprocessing, model training, and visualization, are available within the same repository.

Code availability

All code scripts used in this study are publicly available in the same figshare repository as the dataset (https://doi.org/10.6084/m9.figshare.29987758.v4), under the code/ directory. Users can locate the code by scrolling through the online file list or downloading the full repository and navigating to /code. Detailed setup instructions—including mappings to the original BIDS format—are provided in code/README.md.

References

  1. Feigin, V. L. et al. Global, regional, and national burden of neurological disorders, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. 18, 459–480, https://doi.org/10.1016/s1474-4422(18)30499-x (2019).

    Google Scholar 

  2. Claassen, J. et al. Detection of brain activation in unresponsive patients with acute brain injury. N. Engl. J. Med. 380(26), 2497–2505, https://doi.org/10.1056/NEJMoa1812757 (2019).

    Google Scholar 

  3. Pels, E. G. M. et al. Estimated prevalence of the target population for brain-computer interface neurotechnology in the Netherlands. Neurorehabil. Neural Repair 31(7), 677–685, https://doi.org/10.1177/1545968317714577 (2017).

    Google Scholar 

  4. Willett, F. R. et al. High-performance brain-to-text communication via handwriting. Nature 593(7858), 249–254, https://doi.org/10.1038/s41586-021-03506-2 (2021).

    Google Scholar 

  5. Metzger, S. L. et al. Generalizable spelling using a speech neuroprosthesis in an individual with severe limb and vocal paralysis. Nat. Commun. 13(1), 6510, https://doi.org/10.1038/s41467-022-33611-3 (2022).

    Google Scholar 

  6. Moses, D. A. et al. Neuroprosthesis for decoding speech in a paralyzed person with anarthria. N. Engl. J. Med. 385(3), 217–227, https://doi.org/10.1056/nejmoa2027540 (2021).

    Google Scholar 

  7. Hochberg, L. R. et al. Reach and grasp by people with tetraplegia using a neurally controlled robotic arm. Nature 485(7398), 372–375, https://doi.org/10.1038/nature11076 (2012).

    Google Scholar 

  8. Willett, F. R. et al. A high-performance speech neuroprosthesis. Nature 620(7976), 1031–1036, https://doi.org/10.1038/s41586-023-06377-x (2023).

    Google Scholar 

  9. Metzger, S. L. et al. A high-performance neuroprosthesis for speech decoding and avatar control. Nature 620(7976), 1037–1046, https://doi.org/10.1038/s41586-023-06443-4 (2023).

    Google Scholar 

  10. Défossez, A. et al. Decoding speech perception from non-invasive brain recordings. Nat. Mach. Intell. 5(10), 1097–1107, https://doi.org/10.1038/s42256-023-00714-5 (2023).

    Google Scholar 

  11. Yang, X. et al. Reconstructing multi-stroke characters from brain signals toward generalizable handwriting brain-computer interfaces. IEEE Trans. Neural Syst. Rehabil. Eng. 32, 4230–4239, https://doi.org/10.1109/TNSRE.2024.3492191 (2024).

    Google Scholar 

  12. Qi, Y. et al. Human motor cortex encodes complex handwriting through a sequence of stable neural states. Nat. Hum. Behav. 9, 1260–1271, https://doi.org/10.1038/s41562-025-02157-x (2025).

    Google Scholar 

  13. Aflalo, T. et al. Decoding motor imagery from the posterior parietal cortex of a tetraplegic human. Science 348, 906–910, https://doi.org/10.1126/science.aaa5417 (2015).

    Google Scholar 

  14. Bouton, C. E. et al. Restoring cortical control of functional movement in a human with quadriplegia. Nature 533, 247–250, https://doi.org/10.1038/nature17435 (2016).

    Google Scholar 

  15. Jarosiewicz, B. et al. Virtual typing by people with tetraplegia using a self-calibrating intracortical brain-computer interface. Sci. Transl. Med. 7(313), 313ra179–313ra179, https://doi.org/10.1126/scitranslmed.aac7328 (2015).

    Google Scholar 

  16. Chen, X. et al. High-speed spelling with a noninvasive brain–computer interface. Proc. Natl. Acad. Sci. USA. 112(44), E6058–E6067, https://doi.org/10.1073/pnas.1508080112 (2015).

    Google Scholar 

  17. Boto, E. et al. Moving magnetoencephalography towards real-world applications with a wearable system. Nature 555(7698), 657–661, https://doi.org/10.1038/nature26147 (2018).

    Google Scholar 

  18. Abiri, R. et al. A comprehensive review of EEG-based brain–computer interface paradigms. J. Neural Eng. 16(1), 011001, https://doi.org/10.1088/1741-2552/aaf12e (2019).

    Google Scholar 

  19. Rashid, M. et al. Current status, challenges, and possible solutions of EEG-based brain-computer interface: a comprehensive review. Front. Neurorobot. 14, 25, https://doi.org/10.3389/fnbot.2020.00025 (2020).

    Google Scholar 

  20. He, H. & Wu, D. Transfer learning for brain–computer interfaces: A Euclidean space data alignment approach. IEEE Trans. Biomed. Eng. 67(2), 399–410, https://doi.org/10.1109/TBME.2019.2913914 (2019).

    Google Scholar 

  21. Dai, B. et al. A reliability-enhanced Brain–Computer Interface via Mixture-of-Graphs-driven Information Fusion. Inform. Fusion 120, 103069, https://doi.org/10.1016/j.inffus.2025.103069 (2025).

    Google Scholar 

  22. Yang, B. et al. A multi-day and high-quality EEG dataset for motor imagery brain-computer interface. Sci. Data 12(1), 488, https://doi.org/10.1038/s41597-025-04826-y (2025).

    Google Scholar 

  23. Liu, Y. et al. Lower limb motor imagery EEG dataset based on the multi-paradigm and longitudinal-training of stroke patients. Sci. Data 12(1), 314, https://doi.org/10.1038/s41597-025-04618-4 (2025).

    Google Scholar 

  24. Ma, J. et al. A large EEG dataset for studying cross-session variability in motor imagery brain-computer interface. Sci. Data 9(1), 531, https://doi.org/10.1038/s41597-022-01647-1 (2022).

    Google Scholar 

  25. Liu, B. et al. eldBETA: A large eldercare-oriented benchmark database of SSVEP-BCI for the aging population. Sci. Data 9(1), 252, https://doi.org/10.1038/s41597-022-01372-9 (2022).

    Google Scholar 

  26. Won, K. et al. EEG dataset for RSVP and P300 speller brain-computer interfaces. Sci. Data 9(1), 388, https://doi.org/10.1038/s41597-022-01509-w (2022).

    Google Scholar 

  27. Wang, F. et al. figshare, https://doi.org/10.6084/m9.figshare.29987758.v4 (2025).

  28. Pernet, C. R. et al. EEG-BIDS, an extension to the brain imaging data structure for electroencephalography. Sci. Data 6(1), 103, https://doi.org/10.1038/s41597-019-0104-8 (2019).

    Google Scholar 

  29. Lawhern, V. J. et al. EEGNet: a compact convolutional neural network for EEG-based brain–computer interfaces. J. Neural Eng. 15, 056013, https://doi.org/10.1088/1741-2552/aace8c (2018).

    Google Scholar 

Download references

Acknowledgements

This study was supported by the National Natural Science Foundation of China (Grant Nos. 62376112, 82172058, 81771926, 61763022, 62366026, and 62006246) and the China Postdoctoral Science Foundation (Grant No. 2023M734315).

Author information

Authors and Affiliations

  1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, 650500, China

    Fan Wang, Yanxiao Chen, Peng Wang, Jiaping Xu & Yunfa Fu

  2. Brain Cognition and Brain-computer Intelligence Integration Group, Kunming University of Science and Technology, Kunming, 650500, China

    Fan Wang, Yanxiao Chen, Peng Wang & Yunfa Fu

  3. School of Information Engineering, Chinese People’s Armed Police Force Engineering University, Xi’an, 710000, China

    Anmin Gong

Authors
  1. Fan Wang
    View author publications

    Search author on:PubMed Google Scholar

  2. Yanxiao Chen
    View author publications

    Search author on:PubMed Google Scholar

  3. Peng Wang
    View author publications

    Search author on:PubMed Google Scholar

  4. Anmin Gong
    View author publications

    Search author on:PubMed Google Scholar

  5. Jiaping Xu
    View author publications

    Search author on:PubMed Google Scholar

  6. Yunfa Fu
    View author publications

    Search author on:PubMed Google Scholar

Contributions

F.W. conceived the study, designed the experiments, analyzed the data, and wrote the original manuscript. Y.C. and P.W. conducted the experiments and performed data collection. A.G. assisted in securing partial funding support and, together with J.X., provided technical support and data interpretation. Y.F. supervised the project, acquired funding, and critically revised the manuscript. All authors reviewed and approved the final manuscript.

Corresponding author

Correspondence to Yunfa Fu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, F., Chen, Y., Wang, P. et al. An EEG dataset for handwriting imagery decoding of Chinese character strokes and Pinyin single vowels. Sci Data (2026). https://doi.org/10.1038/s41597-026-06708-3

Download citation

  • Received: 04 September 2025

  • Accepted: 23 January 2026

  • Published: 02 February 2026

  • DOI: https://doi.org/10.1038/s41597-026-06708-3

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Editors & Editorial Board
  • Journal Metrics
  • Policies
  • Open Access Fees and Funding
  • Calls for Papers
  • Contact

Publish with us

  • Submission Guidelines
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Data (Sci Data)

ISSN 2052-4463 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing