Fig. 1: Distribution of caption lengths in the Poly-Caption dataset.
From: Unified multimodal multidomain polymer representation for property prediction

The histogram shows the normalized frequency distribution of caption lengths, measured by the number of words, in the Poly-Caption dataset.