SmallFishBD: A Comprehensive Image Dataset of Common Small Fish Varieties in Bangladesh for Species Identification and Classification
Description
Type of data: 320x320 px fish images.
Data format: JPEG.
Contents of the dataset: Varieties of small fish in Bangladesh.
Number of classes: Ten small fish varieties: (1) Bele, (2) Nama Chanda, (3) Chela, (4) Guchi, (5) Kachki, (6) Mola, (7) Kata Phasa, (8) Pabda, (9) Puti, and (10) Tengra.
Number of images:
(A) Total images in the original dataset (SmallFishBD) = 1,700.
(B) Total images in the augmented dataset (Augmented SmallFishBD) = 20,400.
Distribution of instances:
(A) Images in each fish category of the original dataset (SmallFishBD): Bele = 205, Nama Chanda = 110, Chela = 190, Guchi = 164, Kachki = 247, Mola = 179, Kata Phasa = 129, Pabda = 125, Puti = 218, Tengra = 133.
(B) Images in each fish category of the augmented dataset (Augmented SmallFishBD): Bele = 2,460, Nama Chanda = 1,320, Chela = 2,280, Guchi = 1,968, Kachki = 2,964, Mola = 2,148, Kata Phasa = 1,548, Pabda = 1,500, Puti = 2,616, Tengra = 1,596.
Dataset size:
(A) Total size of the original dataset (SmallFishBD) = 36.2 MB; ZIP compressed size = 28.4 MB.
(B) Total size of the augmented dataset (Augmented SmallFishBD) = 617 MB; ZIP compressed size = 527 MB.
Data acquisition process: Images of the small fish categories were captured from different angles with high-definition smartphone cameras.
Data source location: Local wholesale fish markets in different areas of Dhaka, Bangladesh.
Where applicable: Training and evaluating machine learning and deep learning models that identify and classify small fish species in Bangladesh. Such models can be useful in aquaculture development, fisheries management and sustainable fishing, ecology and ecosystem health monitoring, and biodiversity and conservation efforts.
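The per-class counts listed above can be cross-checked against the stated totals with a few lines of Python. The sketch below is illustrative only: the dictionary and function names are hypothetical, and the counts are copied by hand from this description rather than read from the dataset files. It also derives two figures that the description does not state explicitly: the ratio of augmented to original images per class, and the ratio between the largest and smallest class, which may be worth knowing before training a classifier.

```python
# Per-class image counts copied from the description (not read from disk).
ORIGINAL = {
    "Bele": 205, "Nama Chanda": 110, "Chela": 190, "Guchi": 164,
    "Kachki": 247, "Mola": 179, "Kata Phasa": 129, "Pabda": 125,
    "Puti": 218, "Tengra": 133,
}
AUGMENTED = {
    "Bele": 2460, "Nama Chanda": 1320, "Chela": 2280, "Guchi": 1968,
    "Kachki": 2964, "Mola": 2148, "Kata Phasa": 1548, "Pabda": 1500,
    "Puti": 2616, "Tengra": 1596,
}


def summarize(original, augmented):
    """Return totals, per-class augmentation factors, and class imbalance."""
    # divmod gives (quotient, remainder); a remainder of 0 means the
    # augmented count is an exact multiple of the original count.
    factors = {divmod(augmented[name], original[name]) for name in original}
    counts = original.values()
    return {
        "original_total": sum(counts),
        "augmented_total": sum(augmented.values()),
        "factors": factors,
        "imbalance_ratio": round(max(counts) / min(counts), 2),
    }


summary = summarize(ORIGINAL, AUGMENTED)
print(summary["original_total"])   # 1700
print(summary["augmented_total"])  # 20400
print(summary["factors"])          # {(12, 0)}
print(summary["imbalance_ratio"])  # 2.25
```

Under these counts, every class in the augmented set is exactly 12 times its original size, so augmentation preserves the original class proportions. The largest class (Kachki) has roughly 2.25 times as many images as the smallest (Nama Chanda) in both versions.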