ceramics-defects-detection

Published: 18 August 2025| Version 1 | DOI: 10.17632/47x6jdbr5j.1
Contributors:
Lucileide Aquino do Nascimento, Otilio Paulo, Gilvan Moreira da Paz, Micherlane Rodrigues Machado da Silva

Description

The dataset used in this study comprises 1,600 RGB images of ceramic tiles, each with a resolution of 4,096 × 3,072 pixels. The images are divided into two categories: normal (800 samples) and defective (800 samples). Normal tiles meet quality standards without visible imperfections, whereas defective tiles present flaws such as cracks or deformations. Note: The images were collected from a limited number of physical samples. Therefore, while the total number of images is 1,600, the variability may be restricted. Users should consider this limitation when using the dataset for training or evaluation purposes. The original dataset is stored in the folder database/amostra/. For model training, the dataset was split into training (80%) and test (20%) sets, resulting in 1,280 images for training and 320 for testing, with balanced class distributions. To enhance model generalization and reduce overfitting, data augmentation was applied using the Albumentations library. Transformations included horizontal and vertical flips, rotations, shifts, scaling, brightness/contrast adjustments, gamma correction, and adaptive histogram equalization (CLAHE), applied with controlled probabilities to preserve structural features. After augmentation, the training set was expanded fivefold, totaling 7,000 images. The augmented dataset is stored in the folder database/amostra_processada/, providing increased variability for robust model training.

Files

Categories

Engineering Material

Licence