Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Data
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific data
  3. data descriptors
  4. article
CRA5 a high-fidelity compressed reanalysis atmospheric dataset for weather and climate research
Download PDF
Download PDF
  • Data Descriptor
  • Open access
  • Published: 18 May 2026

CRA5 a high-fidelity compressed reanalysis atmospheric dataset for weather and climate research

  • Tao Han  ORCID: orcid.org/0000-0003-1255-32701,2,
  • Dazhao Du1,
  • Zhenghao Chen3,
  • Song Guo1,
  • Wanli Ouyang2 &
  • …
  • Lei Bai2 

Scientific Data (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Atmospheric science
  • Climate and Earth system modelling

Abstract

The ERA5 reanalysis dataset, developed by the European Centre for Medium-Range Weather Forecasts (ECMWF), provides high-resolution, hourly global climate and weather data from 1950 to the present. However, its massive volume poses substantial storage and distribution challenges. To address this, we introduce CRA5, a highly compressed version of ERA5 generated by the neural network framework Aeolus. CRA5 reduces the 400 TB uncompressed float32 dataset to just 0.85 TB, achieving a 470× compression ratio. Notably, it offers over 100 times higher compression than the lossless GRIB files from the Copernicus Climate Data Store (CDS). Extensive experiments validate its numerical accuracy: CRA5 maintains consistent climatology and comparable power spectral density, yielding a mean absolute error of only 0.17 K for temperature across 37 vertical levels. Furthermore, it faithfully reconstructs extreme weather events and large-scale climatological patterns. By significantly lowering infrastructure barriers, CRA5 accelerates data access and facilitates broader collaboration in large-scale atmospheric research.

Similar content being viewed by others

Climate science data can be compressed efficiently by dual-stage extreme compression with a variational auto-encoder transformer

Article Open access 24 November 2025

Consistent responses to black carbon and carbon dioxide at global and 100 m model resolutions

Article Open access 01 July 2025

Bias-corrected CMIP6 global dataset for dynamical downscaling of the historical and future climate (1979–2100)

Article Open access 04 November 2021

Acknowledgements

This research was supported by funding from the Hong Kong RGC General Research Fund (152228/23E, 162161/24E, 162116/25E, and 162180/25E), the National Natural Science Foundation of China (NSFC) Key Program (No. 62532005), the Collaborative Research Fund (Nos. C1042-23GF and C5097-25G), the NSFC/RGC Collaborative Research Scheme (Grant Nos. 62461160332 and CRS_HKUST602/24), the Research Impact Fund (No. R5011-23F), the Areas of Excellence Scheme (AoE/E-601/22-R), and InnoHK (HKGAI). This work was also supported by the JC STEM Lab of AI for Science and Engineering, funded by The Hong Kong Jockey Club Charities Trust; the MTR Research Funding (MRF) Scheme (CHU-24003); and the Research Grants Council of Hong Kong (Project No. CUHK14213224).

Author information

Authors and Affiliations

  1. The Hong Kong University of Science and Technology, Computer Science and Engineering, Hong Kong SAR, China

    Tao Han, Dazhao Du & Song Guo

  2. Shanghai Artificial Intelligence Laboratory, No. 701, Yunjin Road, Xuhui District, Shanghai, China

    Tao Han, Wanli Ouyang & Lei Bai

  3. The University of Newcastle, New South Wales, Australia

    Zhenghao Chen

Authors
  1. Tao Han
    View author publications

    Search author on:PubMed Google Scholar

  2. Dazhao Du
    View author publications

    Search author on:PubMed Google Scholar

  3. Zhenghao Chen
    View author publications

    Search author on:PubMed Google Scholar

  4. Song Guo
    View author publications

    Search author on:PubMed Google Scholar

  5. Wanli Ouyang
    View author publications

    Search author on:PubMed Google Scholar

  6. Lei Bai
    View author publications

    Search author on:PubMed Google Scholar

Corresponding authors

Correspondence to Song Guo or Lei Bai.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Han, T., Du, D., Chen, Z. et al. CRA5 a high-fidelity compressed reanalysis atmospheric dataset for weather and climate research. Sci Data (2026). https://doi.org/10.1038/s41597-026-07381-2

Download citation

  • Received: 30 July 2025

  • Accepted: 28 April 2026

  • Published: 18 May 2026

  • DOI: https://doi.org/10.1038/s41597-026-07381-2

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Follow us on X
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Editors & Editorial Board
  • Journal Metrics
  • Policies
  • Open Access Fees and Funding
  • Calls for Papers
  • Contact

Publish with us

  • Submission Guidelines
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Data (Sci Data)

ISSN 2052-4463 (online)

nature.com footer links

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing