Figure 3
From: The Biological Object Notation (BON): a structured file format for biological data

File size and sequence size comparisons for the data sets analysed here. The x-axis shows the data subset. The upper plots show the file size for each subset; for uncompressed files, the lighter colour marks the portion of that size occupied by the data alone. The lower violin plots show the range, from minimum to maximum, of the data sizes analysed within each subset. The “+” marks the median for the corresponding data set. File sizes and sample data sizes are indicated in the legend. (a) The Genome data set, comparing BON to the TinySeq XML format. Due to their small size, the Escherichia coli and Saccharomyces cerevisiae genomes are not plotted but are given in Supplementary Table 1. The original format is TinySeq XML. (b) The Collection data set, comparing BON to the TinySeq XML format. (c) The SRA data set, comparing BON to the FASTQ format. (d) The Protein data set, comparing BON to the TinySeq XML format. (e) The phylogenetic tree (“phylogenetic”) data set, comparing BON to the NeXML format.