Table 3 Statistics of benchmark datasets with sequence length distribution.
From: Comprehensive datasets for RNA design, machine learning, and beyond
| Benchmark | Number of samples | Length 1–500 | Length > 500 |
|---|---|---|---|
| Eterna | 100 | 100 | 0 |
| RnaBench (Inverse RNA Folding Dataset) | 68553 | 68553 | 0 |
| Our dataset (loop motifs with connecting stems extracted from the RNAsolo database) | 4921 | 4840 | 81 |
| Our dataset (loop motifs with connecting stems extracted from the Rfam database) | 320476 | 316832 | 3644 |