MangoImageBD: An Extensive Image Dataset of Common and Popular Mango Varieties in Bangladesh for Identification and Classification
Description
Type of data: 504 x 1120 px mango images. Data format: JPEG. Contents of the dataset: Images (original, processed, and augmented) of common and popular varieties of mangoes in Bangladesh. Number of classes: Fifteen (15) common and popular varieties of mangoes in Bangladesh - (1) Amrapali, (2) Ashshina Classic, (3) Ashshina Zhinuk, (4) Banana Mango, (5) Bari-4, (6) Bari-11, (7) Fazli Classic, (8) Fazli Shurmai, (9) Gourmoti, (10) Harivanga, (11) Himsagor, (12) Katimon, (13) Langra, (14) Rupali, and (15) Shada. Number of images: Total number of images in the dataset: 28,515. (1) Total original (raw) images of mango cultivars (MangoOriginal) = 5,703, (2) Total processed images with a blend of both real and virtual backgrounds (MangoRealVirtual) = 5,703, and (3) Total augmented images (MangoAugmented)= 17,109. Distribution of instances: (1) Original (raw) images in each class of the mango cultivars (MangoOriginal): Amrapali = 135, Ashshina Classic = 571, Ashshina Zhinuk = 1,286, Banana Mango = 83, Bari-4 = 74, Bari-11 = 1,244, Fazli Classic = 171, Fazli Shurmai = 247, Gourmoti = 630, Harivanga = 265, Himsagor = 106, Katimon = 424, Langra = 120, Rupali = 184, and Shada = 163. (2) Processed images with a blend of both real and virtual backgrounds for each class of the mango cultivars (MangoRealVirtual): Amrapali = 135, Ashshina Classic = 571, Ashshina Zhinuk = 1,286, Banana Mango = 83, Bari-4 = 74, Bari-11 = 1,244, Fazli Classic = 171, Fazli Shurmai = 247, Gourmoti = 630, Harivanga = 265, Himsagor = 106, Katimon = 424, Langra = 120, Rupali = 184, and Shada = 163. (3) Augmented images for each class of the mango cultivars (MangoAugmented): Amrapali = 405, Ashshina Classic = 1,713, Ashshina Zhinuk = 3,858, Banana Mango = 249, Bari-4 = 222, Bari-11 = 3,732, Fazli Classic = 513, Fazli Shurmai = 741, Gourmoti = 1,890, Harivanga = 795, Himsagor = 318, Katimon = 1,272, Langra = 360, Rupali = 552, and Shada = 489. Dataset size: Total size of the dataset = 1.35 GB and the compressed ZIP file size = 1.16 GB. Data acquisition process: Images of various mango varieties are captured through high-definition smartphone cameras focusing from different angles. Data source location: Local wholesale and retail fruit markets located in six geographically distributed districts of Bangladesh, namely Chapai Nawabganj, Dhaka, Panchagarh, Rajshahi, Rangpur, and Satkhira which are renowned for diverse mango cultivation and availability. Where applicable: Training and evaluating machine learning and deep learning models to identify and classify mango varieties in Bangladesh which can be useful in smart horticulture, precision farming, supply chain automation, ecology and ecosystem health monitoring, and biodiversity and conservation efforts.