Figure 4 | Scientific Reports

Figure 4

From: A realistic fish-habitat dataset to evaluate algorithms for underwater visual analysis

Figure 4

Deep learning methods. The architecture used for the four computer vision tasks of classification, counting, localization, and segmentation consists of two components. The first component is the ResNet-50 backbone which is used to extract features from the input image. The second component is either a feed-forward network that outputs a scalar value for the input image or an upsampling path that outputs a value for each pixel in the image.

Back to article page