Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Scientific Data
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • RSS feed
  1. nature
  2. scientific data
  3. data descriptors
  4. article
Clinically validated dataset of 435 human colons segmented from CT colonography
Download PDF
Download PDF
  • Data Descriptor
  • Open access
  • Published: 14 January 2026

Clinically validated dataset of 435 human colons segmented from CT colonography

  • Martina Finocchiaro  ORCID: orcid.org/0000-0001-5151-11911,
  • Ronja Stern1,
  • Rikke Vilhelmsborg2,
  • Abraham George Smith1,
  • Jens Petersen  ORCID: orcid.org/0000-0003-0138-06931,
  • Kristoffer Cold  ORCID: orcid.org/0000-0001-7715-40733,
  • Lars Konge3,4,
  • Kenny Erleben1 &
  • …
  • Melanie Ganz  ORCID: orcid.org/0000-0002-9120-80981,5 

Scientific Data , Article number:  (2026) Cite this article

  • 830 Accesses

  • Metrics details

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

  • Biomedical engineering
  • Colon
  • Computed tomography
  • Computer science

Abstract

High-quality segmentation datasets are essential for advancing AI applications in medical imaging. However, it is challenging to generate such datasets for highly variable and complex organs like the colon. We introduce a dataset of 435 human colons, segmented from Computed Tomography Colonography (CTC) obtained from the publicly available The Cancer Imaging Archive (TCIA). Each scan includes a mask of the whole colon, including collapsed segments and the fluid, and a mask of only the gas-filled parts of the colon. The colon segmentation accuracy has been clinically validated by an expert abdominal radiologist. This is the first open-access dataset of segmented colons derived from CTC. This resource enables population-scale radiologic studies, supports the development of AI-based image analysis tools, and facilitates the creation of anatomically accurate digital models and simulators, both virtual and physical.

Similar content being viewed by others

REAL-Colon: A dataset for developing real-world AI applications in colonoscopy

Article Open access 25 May 2024

Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge

Article Open access 23 January 2024

Towards full integration of explainable artificial intelligence in colon capsule endoscopy’s pathway

Article Open access 18 February 2025

Data availability

The dataset is hosted in the Open Science Framework (OSF) repository “HQColon: High-Resolution Human Colon Segmentation” (https://doi.org/10.17605/OSF.IO/8TKPM).

Code availability

The code for the semi-automatic segmentation of the colon can be found in the following GitHub repository https://github.com/horizon-europe-2023-ire/colon-segmentation-dataset. The RootPainter project for segmenting the colon fluid is available in the folder “root_painter_colon_fluid_project.zip” of the OSF repository11.

References

  1. Alabduljabbar, A., Khan, S. U., Alsuhaibani, A., Almarshad, F. & Altherwy, Y. N. Medical imaging datasets, preparation, and availability for artificial intelligence in medical imaging. Journal of Alzheimer’s Disease Reports 8, 1471–1483 (2024).

    Google Scholar 

  2. Liu, X. et al. Towards more precise automatic analysis: a systematic review of deep learning-based multi-organ segmentation. BioMedical Engineering OnLine 23, 52 (2024).

    Google Scholar 

  3. Starck, S. et al. Using uk biobank data to establish population-specific atlases from whole body mri. Communications Medicine 4, 237 (2024).

    Google Scholar 

  4. Tang, C. et al. A roadmap for the development of human body digital twins. Nature Reviews Electrical Engineering 1(3), 199–207 (2024).

    Google Scholar 

  5. Finocchiaro, M. et al. Training simulators for gastrointestinal endoscopy: Current and future perspectives. Cancers 13(6), 1427 (2021).

    Google Scholar 

  6. Pore, A. et al. Colonoscopy navigation using end-to-end deep visuomotor control: A user study. In: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). pp. 9582–9588. IEEE (2022).

  7. Mang, T., Graser, A., Schima, W. & Maier, A. Ct colonography: techniques, indications, findings. European journal of Radiology 61(3), 388–399 (2007).

    Google Scholar 

  8. Smith, K. et al. Data from ct colonography https://doi.org/10.7937/K9/TCIA.2015.NWTESAY1 (2015).

  9. Smith, A. G. et al. Rootpainter: deep learning segmentation of biological images with corrective annotation. New Phytologist 236(2), 774–791 (2022).

    Google Scholar 

  10. Wasserthal, J. et al. Totalsegmentator: robust segmentation of 104 anatomic structures in ct images. Radiology: Artificial Intelligence 5(5) (2023).

  11. Finocchiaro, M., Stern, R., Ganz, M. High-resolution human colon segmentation dataset, Open Science Framework, https://doi.org/10.17605/OSF.IO/8TKPM (2025).

Download references

Acknowledgements

This work is funded by the European Union, grant number 101135082. Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union or the European Health and Digital Executive Agency. Neither the European Union nor the granting authority can be held responsible for them. Abraham George Smith is funded by Novo Nordisk Foundation Grant [NNF22OC0080177].

Author information

Authors and Affiliations

  1. Department of Computer Science, University of Copenhagen, Copenhagen, Denmark

    Martina Finocchiaro, Ronja Stern, Abraham George Smith, Jens Petersen, Kenny Erleben & Melanie Ganz

  2. Department of Radiology, Bispebjerg Hospital, Copenhagen, Denmark

    Rikke Vilhelmsborg

  3. Copenhagen Academy for Medical Education and Simulation, Center for Human Resources and Education, The Capital Region of Denmark, Copenhagen, Denmark

    Kristoffer Cold & Lars Konge

  4. Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark

    Lars Konge

  5. Neurobiology Research Unit, Copenhagen University Hospital, Copenhagen, Denmark

    Melanie Ganz

Authors
  1. Martina Finocchiaro
    View author publications

    Search author on:PubMed Google Scholar

  2. Ronja Stern
    View author publications

    Search author on:PubMed Google Scholar

  3. Rikke Vilhelmsborg
    View author publications

    Search author on:PubMed Google Scholar

  4. Abraham George Smith
    View author publications

    Search author on:PubMed Google Scholar

  5. Jens Petersen
    View author publications

    Search author on:PubMed Google Scholar

  6. Kristoffer Cold
    View author publications

    Search author on:PubMed Google Scholar

  7. Lars Konge
    View author publications

    Search author on:PubMed Google Scholar

  8. Kenny Erleben
    View author publications

    Search author on:PubMed Google Scholar

  9. Melanie Ganz
    View author publications

    Search author on:PubMed Google Scholar

Contributions

Martina Finocchiaro designed and implemented the segmentation methods with contributions from Ronja Stern and Abraham George Smith; performed post-processing, quality checks and manual corrections of the segmentatios; designed the clinical validation protocol with input from Kristoffer Cold and Lars Konge; designed and developed the interface for clinical validation; analyzed the results and wrote the manuscript. Martina Finocchiaro and Ronja Stern created the data and code repositories. Rikke Vilhelmsborg clinically validated the segmentations. Kenny Erleben advised on the technical content of the research project. Melanie Ganz supervised and coordinated the research project. All authors reviewed and edited the manuscript.

Corresponding author

Correspondence to Martina Finocchiaro.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Finocchiaro, M., Stern, R., Vilhelmsborg, R. et al. Clinically validated dataset of 435 human colons segmented from CT colonography. Sci Data (2026). https://doi.org/10.1038/s41597-025-06518-z

Download citation

  • Received: 01 August 2025

  • Accepted: 19 December 2025

  • Published: 14 January 2026

  • DOI: https://doi.org/10.1038/s41597-025-06518-z

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Collections
  • Follow us on Twitter
  • Sign up for alerts
  • RSS feed

About the journal

  • Aims and scope
  • Editors & Editorial Board
  • Journal Metrics
  • Policies
  • Open Access Fees and Funding
  • Calls for Papers
  • Contact

Publish with us

  • Submission Guidelines
  • Language editing services
  • Open access funding
  • Submit manuscript

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Scientific Data (Sci Data)

ISSN 2052-4463 (online)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2026 Springer Nature Limited

Nature Briefing AI and Robotics

Sign up for the Nature Briefing: AI and Robotics newsletter — what matters in AI and robotics research, free to your inbox weekly.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing: AI and Robotics