Development of an AI Dataset for Object Detection at Construction Sites

Name: Development of an AI Dataset for Object Detection at Construction Sites
Creator: jongho na
Published: 2024-09-24T07:31:39.042Z
Keywords: Construction Industry, Object Detection, Image Classification, Earthmoving Equipment, Civil Construction

na, jongho; Shin, HyuSoung; Yun, IlDong

doi:10.17632/rz8723t6d7.1

Development of an AI Dataset for Object Detection at Construction Sites

Published: 24 September 2024| Version 1 | DOI: 10.17632/rz8723t6d7.1

Contributors:

,

Description

This paper introduces a dataset for object detection training, which includes 12 types of construction machinery commonly operated at civil engineering sites. To collect this dataset, a housing development construction site in South Korea was selected, and a video collection system was operated over a six-month period. Frames were extracted from the collected video footage on a daily basis, resulting in a total of 87,766 images in the full training dataset. Using the COCO Annotator tool, labels and bounding box annotations were processed, generating a total of 856,485 objects.

Files

Steps to reproduce

To ensure transparency and reproducibility, the following outlines how the data was gathered for this research: Site Selection: A civil engineering construction site in South Korea was chosen for data collection. This site was selected based on ongoing earthmoving activities and the presence of various construction equipment types. Video Collection System Setup: A custom video collection system was designed and installed at the site. High-definition (1920x1080 resolution) cameras were positioned at key vantage points around the site to capture clear, comprehensive footage of equipment in operation. The system operated continuously over a six-month period to capture daily site activities under various weather and lighting conditions. Frame Extraction: Frames were extracted from the collected video footage at regular intervals, resulting in a dataset of images suitable for object detection tasks. This step ensured a diverse dataset covering a wide range of equipment movements, angles, and site conditions. Annotation Process: The extracted frames were processed using the COCO Annotator tool, where bounding boxes were manually drawn around 12 specific types of construction equipment (e.g., excavators, bulldozers, cranes). This step required the assistance of trained annotators to ensure accuracy and consistency across the dataset. Data Format: The annotations were then converted to the COCO (Common Objects in Context) format, which is a widely used standard for AI training datasets. This format includes object labels, bounding box coordinates, and other metadata necessary for training object detection models. Tools and Software: Key software and workflows used in this research include: Video collection system: Custom setup using high-definition cameras. Frame extraction: Automated using a custom script to sample images at regular intervals. COCO Annotator: For manual annotation and bounding box creation. Python-based scripts: For data processing and format conversion to the COCO standard. By following these steps, other researchers can reproduce the dataset creation process for similar object detection tasks in construction or civil engineering environments.

Institutions

Hankuk University of Foreign Studies, Korea Institute of Civil Engineering and Building Technology

Funding

Ministry of Science and ICT

20240143-001

Development of an AI Dataset for Object Detection at Construction Sites

Description

Files

Steps to reproduce

Institutions

Categories

Funding

Licence