GobhiSet: Dataset of manually and automatically annotated RGB images of early phenological stages of Brassica oleracea var. Botrytis

Published: 8 February 2024| Version 1 | DOI: 10.17632/dcjjcwc5dh.1
Contributors:
Shubham Rana,
,
,
,
,
,
,

Description

The dataset comprises a repository of manually and automatically annotated RGB images taken across multiple dates of Brassica oleracea, distributed in uniform crop spaces. It is designed to facilitate crop detection, segmentation and growth modelling based on manual and automatically annotated pixel information. The repository is publicly available here and consists of 132 original RGB images captured over 3 different dates. Each image has a dimension of 5472×3648 pixels. Manual annotations were performed using Visual Geometry Group Image Annotator (VIA) using bounding boxes and the results were saved in Common Objects in Context (COCO) segmentation format. The labelling information such as region and shape attributes are detailed in Javascript Object Notation (JSON). Other attributes such as individual crop ID and repetitiveness of individual crop specimens are described in the Comma Separated Values (CSV) version of the annotation. For creation of automated annotations, the manual annotations were trained over a framework of Grounding DINO + Segment Anything Model (SAM) and the labels were saved in Pascal Visual Object Classes (PASCAL VOC) format. The segmentation masks are provided as Portable Network Graphics (PNG) images. The automated annotations facilitate growth monitoring over the crop phenology through evaluation based on binary masks of individually identified crop specimens across different dates. The codes used for these processes are accessible to ensure transparency and reproducibility. The dataset not only provides annotation information but can also help in refinement of different machine learning models. 

Files

Steps to reproduce

The acquisition of the images was done in field environments for three date imagery, usually around noon. These images were captured at nadir via DJI Phantom IV Pro Obsidian. This activity was an aerial drone-based image acquisition in a linear grid path. The flying altitude varied between 4.275 m and 4.749 due to wind turbulence. The manual annotations were done using VIA annotator version 1.0.6. The automated annotations were done using a combined framework of Grounding DINO and Segment Anything Model. The script aimed to extract binary masks based on manual and automatic annotations was written in python.

Institutions

  • Universita degli Studi di Napoli Federico II Dipartimento di Agraria
  • Universita degli Studi della Campania Luigi Vanvitelli Dipartimento di Ingegneria
  • Universita degli studi della Campania Luigi Vanvitelli Dipartimento di Scienze e Tecnologie Ambientali Biologiche e Farmaceutiche

Categories

Computer Vision, Annotation, Image Segmentation, Object Detection, Automated Segmentation, Pattern Recognition, Crop Management, Precision Agriculture, Deep Learning

Licence