Contains a video dataset of road potholes for semantic segmentation purposes. Data is taken from a point of view perpendicular to the road. Contains a total of 619 videos (image, mask) with an aspect ratio of 1:1, 1080x1080 resolution and 48 image frames for each video. The data was divided manually with a ratio of 60/20/20 with 372 for training, 124 for validation, and 123 for test. Ground truth is made manually using Adobe After Effects. The videos in this dataset were captured in Hulu Sungai Tengah Regency, Indonesia
Steps to reproduce
Data was taken using xiaomi mi 10t with initial 4k resolution. The video is taken perpendicular to the road with a tilt angle of about 80-100 degrees, the distance between the camera and the pothole is about 80-120 cm. Data is cropped and resized to 1080x1080 resolution and 1:1 aspect ratio using Adobe Premiere Pro. Then the ground truth segmentation is annotated using pen tool and mask tracker feature in Adobe After Effects.