Tea Leaf Dataset

Published: 22 July 2025| Version 1 | DOI: 10.17632/94fzcdz8gz.1
Contributor:
Megha Gupta

Description

Tea leaf dataset consists of two folders: Healthy leaves and Diseased leaves. The folder: “Healthy leaves” consists of images of leaves that is free from infection. The folder: “diseased leaves” consists of into 5 classes: Blister Blight, Brown Blight, Tea Mosquito Bug, Leaf Red Rust and Red Spider Mite. Each class is balanced in the dataset, that is1500 images in each class. Using python programming, raw images are first resized to 256*256 dimensions and then augmented using zoom, flip, rotation, shift and shear. Images are further enhanced using median filter.

Files

Steps to reproduce

• Seven tea gardens of different districts of Assam, India during Dec 2024 to June 2025 are visited for data collection. • Three smartphone devices, including Samsung, iphone13 and iphone16 are used to acquire images. • Dataset includes two folders, namely “Healthy Leaves” and “Diseased Leaves”. • Total six classes, including “Healthy”, “Tea Mosquito Bug”, “Blister Blight”, “Brown Blight”, “Red Spider Mite” and “Leaf Red Rust” are considered in the dataset. • All classes are balanced in the dataset. • Using python programming, images are resized, augmented and cleaned. RGB and .jpg format images are resized to 256*256 dimensions. Augmentation techniques, including zoom, flip, rotation, shift and shear are applied to increase dataset diversity. Median filter further enhances the images.

Categories

Image Acquisition, Plant Diseases, Monitoring in Agriculture, Leaf Studies, Image Analysis

Licence