Table 1 General statistics of the training, validation and test splits of the WorldFloods  dataset. Since raw images from S2 can be many megapixels in size, we tile each image into 256-pixel square patches. The training set distribution has a higher percentage of cloudy pixels compared with the validation and test datasets; this is because we were interested in distinguishing water/flood pixels whereas detecting clouds is a byproduct of the model.

From: Towards global flood mapping onboard low cost satellites with machine learning

Dataset

Flood events

Flood maps

256x256 patches

Water pixels (%)

Land pixels

Cloud pixels

Invalid pixels

    

Flood

Permanent\(^{\dag }\)

(%)

(%)

(%)

Training

108

407

182,413

1.45

1.25

43.24

50.25

3.81

Validation

6

6

1132

3.14

5.19

76.72

13.27

1.68

Test

5

11

2029

20.23

1.16

59.05

16.21

3.34

  1. \(^{\dag }\) Permanent water obtained from the yearly water classification product of Pekel et al.44 available at the Google Earth Engine52.