Abstract
Applying deep learning to images of cropping systems provides new knowledge and insights in research and commercial applications. Semantic segmentation, or pixel-wise classification, of RGB images acquired at ground level into vegetation and background is a critical step in the estimation of several canopy traits. Current state-of-the-art methodologies based on convolutional neural networks (CNNs) are trained on datasets acquired under controlled or indoor environments. These models are unable to generalize to real-world images and hence need to be fine-tuned using new labelled datasets. This motivated the creation of the VegAnn - Vegetation Annotation - dataset, a collection of 3775 multi-crop RGB images acquired at different phenological stages using different systems and platforms under diverse illumination conditions. We anticipate that VegAnn will help improve segmentation algorithm performance, facilitate benchmarking and promote large-scale crop vegetation segmentation research.
Background & Summary
Over the last 10 to 15 years, there has been growing interest in image-based plant studies using automated digital cameras. Computer vision is being widely adopted to extract crop knowledge from these images for various applications, including on-farm decision support for irrigation or fertilization, harvest planning, disease and weed management, crop identification and the computation of biophysical variables1,2,3,4,5. In the last decade, the availability of crop genomic information has accelerated numerous breeding programs6,7, and extensive phenotypic measurements are increasingly used throughout the crop growth cycle to interpret the behavior of cultivars at finer time scales and to link the phenotype to the genotype8. Further, traits derived from image analysis, especially the green cover fraction and leaf area index, can also be used for the validation and calibration of remote sensing products9,10,11.
These different applications are supported by the rapid development of robotic technologies associated with image acquisition and analysis workflows. For such standardized, fully automated processing, RGB images are preferred because they are low-cost, versatile and of high spatial resolution. Crop traits of interest (e.g. green cover fraction, leaf area index, leaf spot disease, etc.) are often extracted from these images using fully automated pipelines in which semantic segmentation is a critical intermediate step. This step, applied before the other processing steps, is a pixel-level classification that separates the vegetation from the background, i.e. soil, rocks, dead leaves, etc. Hereafter referred to as “Vegetation Segmentation”, this is a well-established area of research with well-known drawbacks12,13.
Vegetation segmentation approaches fall into three broad categories:
- Color-based approaches: these include thresholding applied to pixel color values or to color indices such as the excess green (ExG) and other vegetation indices (VI)14. In most cases, such approaches require a user-defined threshold, which carries a significant risk of dataset bias and lacks robustness and consistency across datasets (a minimal sketch is given after this list).
- Machine learning approaches based on pixel-level features: these approaches utilize features computed from the spectral information contained in the pixels and may also include features computed from different color-space representations. However, such color-based techniques struggle to generalize across varying illumination conditions, chromatic aberrations (which can make some soil pixels appear green) and camera optics of varying quality. Further, in image regions saturated either by strong specular reflection or by under-exposure, it is difficult to reliably classify pixels using color information alone. The pixel color can also be misleading in certain situations, for example soil appearing greenish due to the presence of algae, or vegetation appearing brownish-yellow due to senescence. Additionally, the soil and crop residues in the background are difficult to distinguish from the senescent vegetation observed in the canopy since they encompass a similar range of brownish colors. Textural and contextual information should therefore be exploited to overcome these problems and better segment RGB images into vegetation and background.
- Machine learning approaches based on color-texture-shape characteristics: the methodologies in this category exploit contextual and spatial information in addition to the pixel values extracted from the images. To overcome the limitations of pixel-level features, researchers began using handcrafted features such as Bag of Words, SIFT, GLCM and Canny edge detectors15,16. Due to the high dimensionality of these features, a sizable amount of data is required to train the algorithms to distinguish between vegetation and background. Recent advances in deep learning have enabled automatic learning of the necessary features from the dataset, surpassing traditional hand-crafted features and machine learning approaches17.
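As a minimal illustration of the first category, the following sketch computes the ExG index from chromatic coordinates and applies a fixed threshold to produce a vegetation mask. The threshold value of 0.1 is an arbitrary assumption for illustration, not a value used in this work, and in practice it would need to be tuned per dataset, which is precisely the weakness discussed above.

```python
import numpy as np

def exg_segmentation(rgb, threshold=0.1):
    """Segment vegetation with the excess green (ExG) index.

    rgb: H x W x 3 array of 8-bit RGB values.
    threshold: user-defined cut-off (assumed value, dataset dependent).
    Returns a boolean H x W mask, True for vegetation.
    """
    rgb = rgb.astype(np.float32) / 255.0
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    total = r + g + b + 1e-8              # avoid division by zero
    # Chromatic coordinates, then ExG = 2g - r - b
    exg = (2 * g - r - b) / total
    return exg > threshold
```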
Deep learning methodologies have achieved notable success for several agricultural and phenotyping tasks, especially the characterisation of crop ‘traits’18,19,20,21,22. Their application to vegetation segmentation has therefore received increasing attention in recent years5,17. The organization of challenges and conferences23 and the availability of open labelled datasets acquired under controlled conditions17,24 have eased the adoption of deep learning methods for vegetation segmentation. However, the public datasets are limited to specific illumination conditions, crop varieties and soil types. Deep learning models trained on such small, domain-specific datasets tend to perform poorly on new domains. Thus, a key reason for the lack of deep learning solutions working under real-world conditions is the lack of diverse, publicly available labelled datasets for vegetation segmentation, in contrast to other types of datasets such as bounding-box annotations25,26,27. The curation of a large pixel-level labelled dataset for vegetation segmentation is indeed an expensive and tedious task that requires contributions from experts.
This need motivated our creation of the VegAnn dataset for outdoor vegetation segmentation from RGB images. To our knowledge, this is the first multi-crop image dataset for semantic segmentation specifically constituted by sampling a large range of crop species grown under diverse climatic and soil conditions. VegAnn assembles a total of 3775 images from various sub-datasets, with samples acquired over a large diversity of growing scenarios and throughout the crop growth cycle. This paper describes the dataset characteristics and shows how it can be used to develop a powerful crop segmentation algorithm. We also highlight the benefit of merging datasets from different crops/species and provide baseline state-of-the-art results on the VegAnn dataset28. We believe that this database will serve as a reliable tool for benchmarking new algorithms and will eventually boost research on vegetation segmentation.
Methods
Annotation rules
VegAnn28 was annotated following a simple rule: all pixels belonging to plants were labelled as vegetation (including stems, flowers, spikes and leaves, whether healthy or senescent) and the rest as background (which includes crop residues or dead leaves lying on the ground). This reduced potential bias among annotators since, for instance, excluding senescent plant leaves from the vegetation class would be prone to subjectivity. Indeed, deciding whether vegetation is healthy or not is not straightforward, as illustrated by the examples shown in Fig. 1.
Moreover, including the senescent parts of the leaves in the vegetation class retains information about leaf shape. This aligns with the reasoning of convolution-based approaches which, in contrast to pixel-based methods, use both texture and contextual information for decision making. Finally, once the vegetation is extracted from the image, it is relatively easy to apply color-based methods to extract the non-healthy parts, which can then no longer be confused with the background29.
Despite this simple annotation rule, there were cases where the decision was not unequivocal, for instance images containing crop residues as seen in Fig. 2. We therefore added a second rule stating that dead plants lying at ground level are considered as background. Residues are often observed when crop rotation is practiced; this kind of crop management benefits carbon sequestration and is prevalent in many cropping systems.
Creating VegAnn by assembling various sub-datasets of RGB images
The VegAnn dataset was aggregated from different sub-datasets collected by different institutions within the scope of various projects, each under specific acquisition configurations. This aggregation process encompassed a wide range of measurement conditions, crop species and phenological stages. The images were thus acquired using different cameras equipped with different focal-length optics, at variable distances from the top of the canopy. An important requirement for the integration of an external sub-dataset within VegAnn is to have downward-looking images that offer sufficient detail (i.e. spatial resolution) for an accurate visual distinction between the vegetation and the background. The cameras were positioned a few meters above the canopy with a ground sample distance (GSD) varying from 0.1 to 2 mm/pixel. The original raw images (referred to as images in the following) were cropped into several patches of 512 × 512 pixels (a minimal tiling sketch is given after the selection steps below). The VegAnn dataset content was optimized by selecting images within all the sub-datasets so that they represent the diversity of the samples well while keeping a good balance between plant species, development stages, environmental conditions and acquisition conditions.
To achieve this objective, several steps were followed:
- The first criterion was to prioritize the diversity of locations and select as many locations as possible. Among series corresponding to the same acquisition conditions, e.g. the same location and date, we selected a single image.
- We used stratified random sampling to include images representing all the phenological stages of the crops.
- We carried out a second round of image selection by training a deep learning model on a subset of the first selection. A U-Net30, a fully convolutional neural network with a standard encoder-decoder architecture and a ResNet34 backbone implemented in the Segmentation Models PyTorch library31, was used for this purpose. A visual inspection of the results allowed us to identify the types of images and domains (e.g. crop type and stage, acquisition conditions) that were not well represented, which we could then include in the final version of VegAnn.
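Returning to the patch extraction mentioned above, the following is a minimal sketch of how a raw image could be tiled into non-overlapping 512 × 512 patches. The file handling and the choice to discard incomplete tiles at the right and bottom borders are assumptions made for illustration, not the exact procedure used to build VegAnn.

```python
from PIL import Image

def tile_image(path, patch_size=512):
    """Tile a raw image into non-overlapping patch_size x patch_size patches.

    Incomplete tiles at the right/bottom borders are discarded
    (an assumption of this sketch; the actual cropping strategy
    used to build VegAnn may differ).
    """
    image = Image.open(path)
    width, height = image.size
    patches = []
    for top in range(0, height - patch_size + 1, patch_size):
        for left in range(0, width - patch_size + 1, patch_size):
            box = (left, top, left + patch_size, top + patch_size)
            patches.append(image.crop(box))
    return patches
```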
Table 1 summarizes the characteristics of the sub-datasets used to compose VegAnn, which originate from two scientific communities, namely plant phenotyping and satellite remote sensing.
The LITERAL dataset was acquired with a handheld system called LITERAL (Fig. 3). An operator holds a boom with a pair of Sony RX0 cameras fixed at its extremity. The 938 images cover a wide range of cereal crop species grown at several locations in France. Wheat images from the Global Wheat Head Detection (GWHD) dataset25,32, acquired in France and China (Nanjing), are also included in this dataset.
The PHENOMOBILE dataset was acquired with the Phenomobile system, an unmanned ground vehicle. This system uses flash lights synchronized with image acquisition, making the measurements independent of the natural illumination conditions.
INVITA (INnovations in Variety Testing in Australia) is a project led by The University of Queensland in Australia that aims to monitor the quality and performance of wheat variety trials33. This dataset covers a wide range of wheat cultivars grown at more than 100 locations, with photos collected with smartphones.
EasyPCC is a dataset from the University of Tokyo. It consists of rice and wheat time-series images acquired with a fixed sensor in the field. This dataset exhibits less variability since the images were acquired at the same location, albeit under different lighting conditions.
The P2S2 dataset11 was initially acquired for the validation of green cover fraction products derived from decametric-resolution satellites (e.g. Sentinel-2). It consists of images with a spatial resolution of 0.2 mm. Nine crop species, four sites (in France and Belgium) and five measurement dates were monitored across the growing seasons.
The DHP dataset corresponds to patches extracted from digital hemispherical photographs (Fig. 7). The acquisitions were performed to extract canopy structure characteristics from true-color images for the validation of Copernicus Global Land products derived from medium-spatial-resolution satellite observations34.
It thus covers various crops, locations and growing scenarios, and also includes some shrubs, herbaceous wetlands, grasslands, pastures and other herbaceous vegetation.
The Crowdsourcing dataset assembles diverse crop images from various sources, including the web, mostly acquired with smartphones. A proportion of the images (41) correspond to bare soil (i.e. background pixels with no vegetation) and were collected to better represent the variability of soil backgrounds in VegAnn.
We refer the reader to the cited references for more details about the different sub-datasets. Figure 3 shows examples of the different acquisition platforms used to compose VegAnn. Figure 4 displays the image locations with respect to their sub-datasets and number of images, and Fig. 5 shows examples of images along with their labels.
Raw images and their background/vegetation labels. From left to right: examples taken from Literal, Crowdsourcing and Phenomobile sub-datasets (Table 1). Images are from mixed crops cultivated in agroecology.
VegAnn metadata and characteristics
In this section we describe the metadata, listed in Table 2, that are associated with each image contained in VegAnn.
Dataset Name
The DatasetName attribute corresponds to the initial sub-dataset from which the image was extracted (see Table 1).
Latitude, Longitude and LocAcc
The GPS information in WGS84 coordinate reference system is stored in the Latitude and Longitude attributes. The attribute LocAcc is a boolean set to 1 if the location is exact and 0 if the location has been approximated due to missing information.
System of acquisition
Six different acquisition systems were used to build the VegAnn dataset; the proportion of images per system is shown in Fig. 6. Handheld cameras refers to high-resolution commercial cameras held by an operator with a boom or a tripod at 60–80 cm above the canopy (Fig. 3). DHP images were acquired by an operator using downward-looking cameras equipped with a fish-eye lens, at around 60–80 cm above the canopy. Due to the field of view of the fish-eye lens, the pixels of a DHP image represent quite different viewing orientations compared to the handheld cameras (Fig. 7). IOT refers to fixed cameras placed in the field looking downward, at a height of 20–60 cm above the crop depending on the growth stage. Phone Camera images were acquired with conventional smartphones, and such images are generally of lower quality. Phenomobile images were acquired with a mobile robot under controlled illumination conditions (Fig. 3), by synchronising a flash with the acquisition. A few images were acquired with a camera mounted on unmanned aerial vehicles (UAV) flying at low altitude. Finally, it was not possible to determine the origin of a few images; these are tagged as Na, referring to an unknown acquisition system.
Orientation
Four different viewing orientations can be found in VegAnn: nadir, where the viewing direction is close to the nadir (i.e. vertical) with a small camera field of view; 45, where the images were acquired with a camera inclined at 45° (Literal and Phenomobile datasets); DHP, for patches extracted from hemispherical images, for which the viewing direction is highly variable within the image due to the large field of view of the fish-eye lens; and Na, indicating that the viewing direction is unknown (Crowdsourcing dataset).
Species
The VegAnn dataset contains images of 26 crop types at different phenological stages, grown under various pedo-climatic conditions (Fig. 8). A high proportion of crops characterized by small leaves has been included, since small leaves combined with irregular spacing and high overlap between plants make pixel-wise segmentation of the vegetation more challenging. Wheat and rice are therefore highly represented since they are the most widely cultivated and studied small-leaf crops in the world. To complement the representativeness of this kind of canopy structure, we included a high proportion of more complex canopies composed of at least two species (Mix: crops with weeds or mixed crops cultivated in agroecology). Images acquired over larger-leafed crops of various shapes and sizes were also selected to incorporate some of the most cultivated and studied crops in the world (potato, sugarbeet, sunflower and maize). However, these are in lower proportion since their labelling is comparatively easier.
Training/Validation/Test sets of VegAnn
As VegAnn was primarily built for benchmarking segmentation approaches, we provide five distinct Training/Validation/Test (TVT) sets.
To generate these TVT sets, we randomly selected five crops that were represented by fewer than 100 images, namely Vetch, Brown Mustard, Potato, Sorghum, and Sugarbeet. In each TVT set, one of these five crops was included in the Test dataset, as follows: Set 1 (Vetch), Set 2 (Brown Mustard), Set 3 (Potato), Set 4 (Sorghum), and Set 5 (Sugarbeet).
In order to support the development of models that generalize across different domains, we created the training, validation and test datasets so that images sharing the same species, acquisition date and coordinates were, as far as possible, not spread across different sets. However, in some cases where too many images were available for the same species, acquisition date and coordinates, such occurrences could not be avoided. Note that the images from the EasyPCC dataset, acquired with a fixed sensor in the field, were included in the training sets. We aimed for a distribution of approximately 85%, 5%, and 15% in the training, validation, and test datasets, respectively, for each TVT set.
The attribute “TVT-split1” indicates the category to which the images belong in Set 1, “TVT-split2” for Set 2, and so on.
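As a minimal check of this grouping, the sketch below counts how many (species, date, coordinates) combinations end up spread over more than one of the Training/Validation/Test categories of the first TVT set. The column names used here (Species, Date, Latitude, Longitude, TVT-split1) are assumptions to be verified against Table 2 and the released CSV file.

```python
import pandas as pd

# Metadata file distributed with VegAnn (see the Data Records section).
meta = pd.read_csv("VegAnn-dataset.csv")

# Column names below are assumptions for illustration; check Table 2.
domain_cols = ["Species", "Date", "Latitude", "Longitude"]
sets_per_domain = meta.groupby(domain_cols)["TVT-split1"].nunique()

# Domains whose images are spread over more than one set of TVT set 1.
leaking = sets_per_domain[sets_per_domain > 1]
print(f"{len(leaking)} (species, date, coordinates) groups span several sets")
```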
Data Records
The dataset can be downloaded from Zenodo28 (https://doi.org/10.5281/zenodo.7636408) and is released under the CC-BY license, which allows reuse provided appropriate credit is given. Images are 512 × 512 pixels and are saved in 8-bit PNG format. Images and their associated labels are stored in the “images” and “annotations” folders under the same file name. Meta-information can be found in the VegAnn-dataset.csv file and is described in the sections above. All the available attributes are listed in Table 2.
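Under the assumption that the metadata file exposes a file-name column (here called Name) and the TVT-split1 attribute with Training/Validation/Test values, the sketch below pairs each image with its annotation mask and builds the file lists of the first TVT set. The exact column names and split labels should be checked against the released CSV.

```python
from pathlib import Path
import pandas as pd

root = Path("VegAnn")  # folder containing images/, annotations/ and VegAnn-dataset.csv
meta = pd.read_csv(root / "VegAnn-dataset.csv")

def file_pairs(split_value, split_column="TVT-split1"):
    """Return (image, mask) path pairs for one split of the first TVT set.

    'Name', 'TVT-split1' and the split labels are assumptions of this
    sketch; verify them against the released metadata file.
    """
    names = meta.loc[meta[split_column] == split_value, "Name"]
    return [(root / "images" / n, root / "annotations" / n) for n in names]

train_pairs = file_pairs("Training")
val_pairs = file_pairs("Validation")
test_pairs = file_pairs("Test")
```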
Technical Validation
The labelling work was subcontracted to a private company that offers labelling services performed by Photoshop experts. Each labelled image was then carefully verified by at least two agronomy experts from our team and was re-annotated if required. Images for which no consensus could be reached (poor illumination, poor quality, blur) were excluded from the dataset.
The technical validity of the VegAnn annotations was ensured by the iterative process used to construct the dataset. This was carried out in two ways:
1. During the labelling phase, an independent visual review of the labels of each image by at least two persons;
2. While training and evaluating different deep learning approaches for automatic background/vegetation segmentation with VegAnn, the images leading to poor segmentation performance were carefully checked to determine whether the poor performance was due to the approach or to the labelling. When necessary, the labelling was corrected and reviewed once again.
There are different possible usages of VegAnn. Considering its uniqueness in terms of crop species, crop phenological stages, pedo-climatic conditions and acquisition conditions, the main use is the benchmarking and improvement of segmentation approaches for crops. Other usages can also be foreseen: as the raw images are labelled with a crop type, they could be used to complement other datasets for automatic crop recognition or for the validation of land use maps. As an illustration of the potential of VegAnn, we used this dataset to train and evaluate a deep learning model to segment vegetation from background in images acquired over crops. This work was further used to estimate canopy structure (gap fraction, leaf area index, proportion of senescent vegetation) in phenotyping experiments29 and for the automatic processing of the P2S2 hemispherical images to derive ground truth for the validation of satellite leaf area index products9.
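As a minimal sketch of one such downstream use, the green fraction of an image can be derived from a predicted mask as the proportion of pixels classified as vegetation. The 0.5 probability threshold and the single-channel model output are assumptions of this sketch, not a description of the processing chain used in the cited studies.

```python
import torch

def green_fraction(model, image):
    """Green fraction = proportion of pixels predicted as vegetation.

    image: float tensor of shape (3, 512, 512), normalised as during training.
    The 0.5 probability threshold is an assumption of this sketch.
    """
    model.eval()
    with torch.no_grad():
        logits = model(image.unsqueeze(0))      # (1, 1, 512, 512)
        mask = torch.sigmoid(logits) > 0.5
    return mask.float().mean().item()
```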
Evaluation scores
We used the five fold sets provided by VegAnn and computed baseline metrics to evaluate the performance of the approach. The Intersection over Union (IOU) and F1 score of the pixel predictions were computed at the dataset and image level over the five folds. The results obtained over the five folds were then averaged and reported with their standard deviation. Note that the metrics reported at the dataset level are aggregated over the whole dataset and do not correspond to metrics averaged over each image. We recommend that users refer to the dataset-level metrics rather than the image-level ones to reduce the influence of “empty” images, i.e. images without vegetation.
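The two aggregation strategies can be illustrated with the following sketch, which computes the IOU of the vegetation class from binary prediction/label masks; it is an illustrative implementation, not the exact evaluation code of the repository.

```python
import numpy as np

def iou(pred, label):
    """IOU of the vegetation class for a single binary mask pair."""
    intersection = np.logical_and(pred, label).sum()
    union = np.logical_or(pred, label).sum()
    return intersection / union if union > 0 else np.nan

def image_level_iou(preds, labels):
    """Average the per-image IOU (sensitive to images with little or no vegetation)."""
    return float(np.nanmean([iou(p, l) for p, l in zip(preds, labels)]))

def dataset_level_iou(preds, labels):
    """Pool all pixels before computing a single IOU over the whole dataset."""
    inter = sum(np.logical_and(p, l).sum() for p, l in zip(preds, labels))
    union = sum(np.logical_or(p, l).sum() for p, l in zip(preds, labels))
    return inter / union
```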
Implementation details
The models were implemented in PyTorch version 1.10 with the PyTorch Lightning framework. For this first evaluation we used fully convolutional neural networks with a standard encoder-decoder architecture. Two variants, U-Net30 and DeepLabV335, were tested with ResNet34 and ResNet50 backbones implemented in the Segmentation Models PyTorch library31. The model weights were initialized from ImageNet pre-training36. We trained our models using the Adam optimizer37, with a learning rate of 1e-4 and the Dice loss as the cost function. The batch size was fixed at 16 and training was conducted for 15 epochs. More details about the implementation can be found at https://github.com/simonMadec/VegAnn.
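The configuration described above can be reproduced roughly as in the sketch below, using the Segmentation Models PyTorch library31. It shows the stated settings (U-Net, ResNet34 encoder pre-trained on ImageNet, Adam at 1e-4, Dice loss); the data loading, PyTorch Lightning wrapping and any augmentations of the original implementation are omitted, so this is an approximation of, not a substitute for, the repository code.

```python
import segmentation_models_pytorch as smp
import torch

# U-Net with a ResNet34 encoder pre-trained on ImageNet, as described above.
model = smp.Unet(
    encoder_name="resnet34",
    encoder_weights="imagenet",
    in_channels=3,
    classes=1,                       # single-channel vegetation/background mask
)

loss_fn = smp.losses.DiceLoss(mode="binary")
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

def training_step(batch):
    """One optimisation step on a batch of (image, mask) tensors.

    images: float tensor of shape (16, 3, 512, 512); masks: (16, 1, 512, 512).
    """
    images, masks = batch
    optimizer.zero_grad()
    logits = model(images)
    loss = loss_fn(logits, masks)
    loss.backward()
    optimizer.step()
    return loss.item()
```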
Evaluation of the dataset
We report the performances averaged over the 5 official cross-validation folds of VegAnn in Table 3.
With the U-Net model architecture and ResNet34 backbone feature extractor, average IOUs of 86.0% and 89.7% were achieved at the image and dataset level, respectively, over the five folds of VegAnn. Although different models and encoders were tested, the results showed only marginal differences between them. These binary vegetation/background classification results can be deemed satisfactory while still leaving room for improvement. The different metrics remain quite stable over the five folds (the standard deviations over the five folds at the dataset level are 1.4% and 0.8%), indicating the robustness of the approach.
The IOU scores computed for the different species present in the test folds of VegAnn are summarized in Fig. 9. Species with a low number of images may not be present in the test folds and are therefore not reported in this figure. Several visualizations of the model predictions, along with the ground-truth masks, are presented in Fig. 10.
The baseline approach presented in this study faces challenges when classifying scenes of certain species. As observed in Fig. 10, these difficulties may arise for various reasons, such as poor image quality, scene complexity, or the configuration of the sensor or acquisition set-up. For instance, the sorghum images in the VegAnn dataset were acquired using unmanned aerial vehicles and DHP cameras, which results in a lower spatial resolution. The lower scores reported for the Mix, Wheat and Rapeseed categories could also be attributed to the complexity of these scenes (Fig. 10).
Table 4 shows the per-system results obtained with the VegAnn generic approach. The highest performance was achieved for images captured under controlled illumination conditions with the Phenomobile robot, whereas images acquired with a smartphone yielded the lowest performance. However, other factors, including the crop types, could have influenced these results: notably, the majority of images captured with phone cameras depict wheat, which is a challenging crop to segment.
Additionally, we compared a crop-specific learning approach, i.e. a vegetation/background segmentation model trained on images acquired over a single crop, with the VegAnn generic approach, i.e. a model trained on images acquired over all crop species. The comparisons were performed separately for each crop. For the crop-specific approach, we only considered crop species with a sufficiently large number of images in both the training and test sets, namely maize, rapeseed, mixed crop, sunflower and wheat (Fig. 8). The VegAnn generic approach provides better results than the crop-specific approach, with an average IOU gain of 1.5 points and less variability across the five folds for all species (Fig. 11). This illustrates the benefit of merging images of different crops: leveraging the diversity of the images strengthens the model and improves background detection.
Code availability
The code to reproduce the baseline results presented above is available at https://github.com/simonMadec/VegAnn. We recommend that users start with the custom PyTorch dataloader to easily run, for instance, the training/evaluation with the crop-specific and the VegAnn generic approaches; more information can be found in the associated README file.
References
Mavridou, E., Vrochidou, E., Papakostas, G. A., Pachidis, T. & Kaburlasos, V. G. Machine vision systems in precision agriculture for crop farming. Journal of Imaging 5, 89 (2019).
Ouhami, M., Hafiane, A., Es-Saady, Y., El Hajji, M. & Canals, R. Computer vision, iot and data fusion for crop disease detection using machine learning: A survey and ongoing research. Remote Sensing 13, 2486 (2021).
Rakhmatulin, I., Kamilaris, A. & Andreasen, C. Deep neural networks to detect weeds from crops in agricultural environments in real-time: a review. Remote Sensing 13, 4486 (2021).
Sharma, A., Jain, A., Gupta, P. & Chowdary, V. Machine learning applications for precision agriculture: A comprehensive review. IEEE Access 9, 4843–4873 (2020).
Milioto, A., Lottes, P. & Stachniss, C. Real-time semantic segmentation of crop and weed for precision agriculture robots leveraging background knowledge in CNNs. In 2018 IEEE International Conference on Robotics and Automation (ICRA), 2229–2235, https://doi.org/10.1109/ICRA.2018.8460962 (2018).
Millet, E. J. et al. Genome-wide analysis of yield in Europe: allelic effects vary with drought and heat scenarios. Plant Physiology 172, 749–764, https://doi.org/10.1104/pp.16.00621 (2016).
Messina, C. D. et al. Leveraging biological insight and environmental variation to improve phenotypic prediction: Integrating crop growth models (cgm) with whole genome prediction (wgp). European Journal of Agronomy 100, 151–162 (2018).
Li, L., Zhang, Q. & Huang, D. A review of imaging techniques for plant phenotyping. Sensors 14, 20078–20111 (2014).
Jiang, J., Weiss, M., Liu, S. & Baret, F. Developing crop specific algorithms to derive accurate gai and chlorophyll content from sentinel-2 data: 4d modeling & machine learning. In Living Planet Symposium, 1–16 (2019).
Stehman, S. V. & Foody, G. M. Key issues in rigorous accuracy assessment of land cover products. Remote Sensing of Environment 231, 111199 (2019).
Weiss, M. et al. The p2s2 validation database for decametric resolution crop products: Green area index, fraction of intercepted light, green fraction and chlorophyll content. In IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium, 4588–4591, https://doi.org/10.1109/IGARSS.2019.8900400 (2019).
Hamuda, E., Glavin, M. & Jones, E. A survey of image processing techniques for plant extraction and segmentation in the field. Computers and electronics in agriculture 125, 184–199 (2016).
Bai, X. et al. Vegetation segmentation robust to illumination variations based on clustering and morphology modelling. Biosystems engineering 125, 80–97 (2014).
Meyer, G. E. & Neto, J. C. Verification of color vegetation indices for automated crop imaging applications. Computers and Electronics in Agriculture 63, 282–293, https://doi.org/10.1016/j.compag.2008.03.009 (2008).
Guo, W., Rage, U. K. & Ninomiya, S. Illumination invariant segmentation of vegetation for time series wheat images based on decision tree model. Computers and electronics in agriculture 96, 58–66 (2013).
Sadeghi-Tehran, P., Virlet, N., Sabermanesh, K. & Hawkesford, M. J. Multi-feature machine learning model for automatic segmentation of green fractional vegetation cover for high-throughput field phenotyping. Plant methods 13, 1–16 (2017).
Zenkl, R. et al. Outdoor plant segmentation with deep learning for high-throughput field phenotyping on a diverse wheat dataset. Frontiers in plant science 12 (2021).
Madec, S. et al. Ear density estimation from high resolution rgb imagery using deep learning technique. Agricultural and forest meteorology 264, 225–234 (2019).
Velumani, K. et al. An automatic method based on daily in situ images and deep learning to date wheat heading stage. Field Crops Research 252, 107793 (2020).
Velumani, K. et al. Estimates of maize plant density from uav rgb images using faster-rcnn detection model: impact of the spatial resolution. Plant Phenomics 2021 (2021).
Ubbens, J. R. & Stavness, I. Deep plant phenomics: a deep learning platform for complex plant phenotyping tasks. Frontiers in plant science 8, 1190 (2017).
Aich, S. et al. DeepWheat: Estimating phenotypic traits from crop images with deep learning. In 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), 323–332 (IEEE, 2018).
Scharr, H. et al. Leaf segmentation in plant phenotyping: a collation study. Machine Vision and Applications 27, 585–606 (2016).
Lameski, P., Zdravevski, E., Trajkovik, V. & Kulakov, A. Weed detection dataset with rgb images taken under variable light conditions. In International Conference on ICT Innovations, 112–119 (Springer, 2017).
David, E. et al. Global wheat head detection 2021: an improved dataset for benchmarking wheat head detection methods. Plant Phenomics 2021 (2021).
Garcin, C. et al. Pl@ntNet-300K: a plant image dataset with high label ambiguity and a long-tailed distribution. In NeurIPS 2021 - 35th Conference on Neural Information Processing Systems (2021).
Brown, C. F. et al. Dynamic world, near real-time global 10 m land use land cover mapping. Scientific Data 9, 1–17 (2022).
Madec, S. et al. Vegann: Vegetation annotation of multi-crop rgb images acquired under diverse conditions for segmentation. Zenodo https://doi.org/10.5281/zenodo.7636408 (2023).
Serouart, M. et al. Segveg: Segmenting rgb images into green and senescent vegetation by combining deep and shallow methods. Plant Phenomics https://doi.org/10.34133/2022/9803570 (2022).
Ronneberger, O., Fischer, P. & Brox, T. U-Net: Convolutional networks for biomedical image segmentation. CoRR abs/1505.04597 (2015).
Iakubovskii, P. Segmentation models pytorch. https://github.com/qubvel/segmentation_models.pytorch (2019).
David, E. et al. Global wheat head detection (gwhd) dataset: a large and diverse dataset of high-resolution rgb-labelled images to develop and benchmark wheat head detection methods. Plant Phenomics 2020 (2020).
Chapman, S. C. et al. INVITA and AGFEML–Monitoring and extending the value of NVT trials. (2022).
Camacho, F. et al. Crop specific algorithms trained over ground measurements provide the best performance for GAI and fAPAR estimates from Landsat-8 observations. Remote Sensing of Environment 260, 112453, https://doi.org/10.1016/j.rse.2021.112453 (2021).
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F. & Adam, H. Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European conference on computer vision (ECCV), 801–818 (2018).
Deng, J. et al. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, 248–255 (Ieee, 2009).
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. CoRR abs/1412.6980 (2015).
Guo, W. et al. EasyPCC: Benchmark datasets and tools for high-throughput measurement of the plant canopy coverage ratio under field conditions. Sensors 17, 798, https://doi.org/10.3390/s17040798 (2017).
Acknowledgements
We thank the people involved in the labelling review: F. Venault, M. Debroux, G. Studer. We thank all the people involved in the acquisition of the images. We also thank Zenodo for hosting the dataset. This work was supported by the projects Phenome-ANR-11-INBS-0012, P2S2-CNES-TOSCA-4500066524, GRDC UOQ2002-08RTX, GRDC UOQ2003-011RTX, JST AIP Acceleration Research JPMJCR21U3 and the French Ministry of Agriculture and Food (LITERAL CASDAR project).
Author information
Contributions
S.M.: Conceptualization, Methodology, Data curation, Writing - K.I.: Data curation, Validation - K.V.: Methodology, Validation, Review & editing, Data curation - E.D., M.S.: Methodology, Data curation, Validation - G.D., L.B.S., D.S., C.J.: Data curation, Validation - B.D.S., S.C., M.W., F.B., W.G., F.C.: Project administration, Resources, Supervision - All authors reviewed the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Cite this article
Madec, S., Irfan, K., Velumani, K. et al. VegAnn, Vegetation Annotation of multi-crop RGB images acquired under diverse conditions for segmentation. Sci Data 10, 302 (2023). https://doi.org/10.1038/s41597-023-02098-y