Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Advertisement

Nature Precedings
  • View all journals
  • Search
  • My Account Login
  • Content Explore content
  • About the journal
  • RSS feed
  1. nature
  2. nature precedings
  3. articles
  4. article
Fractal-like Distributions over the Rational Numbers in High-throughput Biological and Clinical Data
Download PDF
Download PDF
  • Manuscript
  • Open access
  • Published: 20 October 2010

Fractal-like Distributions over the Rational Numbers in High-throughput Biological and Clinical Data

  • Vladimir Trifonov1,
  • Laura Pasqualucci2,
  • Riccardo Dalla-Favera2 &
  • …
  • Raul Rabadan1 

Nature Precedings (2010)Cite this article

  • 271 Accesses

  • Metrics details

Abstract

Recent developments in extracting and processing biological and clinical data are allowing quantitative approaches to studying living systems. High-throughput sequencing, expression profiles, proteomics, and electronic health records are some examples of such technologies. Extracting meaningful information from those technologies requires careful analysis of the large volumes of data they produce. In this note, we present a set of distributions that commonly appear in the analysis of such data. These distributions present some interesting features: they are discontinuous in the rational numbers, but continuous in the irrational numbers, and possess a certain self-similar (fractal-like) structure. The first set of examples which we present here are drawn from a high-throughput sequencing experiment. Here, the self-similar distributions appear as part of the evaluation of the error rate of the sequencing technology and the identification of tumorogenic genomic alterations. The other examples are obtained from risk factor evaluation and analysis of relative disease prevalence and co-mordbidity as these appear in electronic clinical data. The distributions are also relevant to identification of subclonal populations in tumors and the study of the evolution of infectious diseases, and more precisely the study of quasi-species and intrahost diversity of viral populations.

Similar content being viewed by others

The influence of evolutionary history on human health and disease

Article 06 January 2021

Diversity and consequences of structural variation in the human genome

Article 21 January 2025

Binomial-discrete Erlang-truncated exponential mixture and its application in cancer disease

Article Open access 28 July 2023

Article PDF

Author information

Authors and Affiliations

  1. C2B2/DBMI, Columbia University https://www.nature.com/nature

    Vladimir Trifonov & Raul Rabadan

  2. ICG/HICCC, Columbia University https://www.nature.com/nature

    Laura Pasqualucci & Riccardo Dalla-Favera

Authors
  1. Vladimir Trifonov
    View author publications

    Search author on:PubMed Google Scholar

  2. Laura Pasqualucci
    View author publications

    Search author on:PubMed Google Scholar

  3. Riccardo Dalla-Favera
    View author publications

    Search author on:PubMed Google Scholar

  4. Raul Rabadan
    View author publications

    Search author on:PubMed Google Scholar

Corresponding authors

Correspondence to Vladimir Trifonov or Raul Rabadan.

Rights and permissions

Creative Commons Attribution 3.0 License.

Reprints and permissions

About this article

Cite this article

Trifonov, V., Pasqualucci, L., Dalla-Favera, R. et al. Fractal-like Distributions over the Rational Numbers in High-throughput Biological and Clinical Data. Nat Prec (2010). https://doi.org/10.1038/npre.2010.5037.1

Download citation

  • Received: 19 October 2010

  • Accepted: 20 October 2010

  • Published: 20 October 2010

  • DOI: https://doi.org/10.1038/npre.2010.5037.1

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Keywords

  • high-throughput sequencing
  • electronic health records
  • cancer genetics
  • fractals
  • probability
  • clinical data
Download PDF

Advertisement

Explore content

  • Research articles
  • News & Comment
  • Sign up for alerts
  • RSS feed

About the journal

  • Journal Information

Search

Advanced search

Quick links

  • Explore articles by subject
  • Find a job
  • Guide to authors
  • Editorial policies

Nature Precedings (Nat Preced)

nature.com sitemap

About Nature Portfolio

  • About us
  • Press releases
  • Press office
  • Contact us

Discover content

  • Journals A-Z
  • Articles by subject
  • protocols.io
  • Nature Index

Publishing policies

  • Nature portfolio policies
  • Open access

Author & Researcher services

  • Reprints & permissions
  • Research data
  • Language editing
  • Scientific editing
  • Nature Masterclasses
  • Research Solutions

Libraries & institutions

  • Librarian service & tools
  • Librarian portal
  • Open research
  • Recommend to library

Advertising & partnerships

  • Advertising
  • Partnerships & Services
  • Media kits
  • Branded content

Professional development

  • Nature Awards
  • Nature Careers
  • Nature Conferences

Regional websites

  • Nature Africa
  • Nature China
  • Nature India
  • Nature Japan
  • Nature Middle East
  • Privacy Policy
  • Use of cookies
  • Legal notice
  • Accessibility statement
  • Terms & Conditions
  • Your US state privacy rights
Springer Nature

© 2025 Springer Nature Limited

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing