A Practical Image Augmentation Method for Construction Safety Using Object Range Expansion Synthesis
This study aims to propose a practical and realistic synthetic data generation method for object recognition in hazardous and data-scarce environments, such as construction sites. Artificial intelligence (AI) applications in such dynamic domains require domain-specific datasets, yet collecting real-...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-04-01
|
| Series: | Buildings |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2075-5309/15/9/1447 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | This study aims to propose a practical and realistic synthetic data generation method for object recognition in hazardous and data-scarce environments, such as construction sites. Artificial intelligence (AI) applications in such dynamic domains require domain-specific datasets, yet collecting real-world data can be challenging due to safety concerns, logistical constraints, and high labor costs. To address these limitations, we introduce object range expansion synthesis (ORES), a lightweight and non-generative method for generating synthetic image data by inserting real object masks into varied background scenes using open datasets. ORES synthesizes new scenes, while preserving scale and ground alignment, enabling controllable and realistic data augmentation. A dataset of 30,000 synthetic images was created using the proposed method and used to train an object recognition model. When tested on real-world construction site images, the model achieved a mean average precision at IoU 0.50 (mAP50) of 98.74% and a recall of 54.55%. While recall indicates room for improvement, the high precision highlights the practical value of synthetic data in enhancing model performance without requiring extensive field data collection. This research contributes a scalable approach to data generation in safety-critical and data-deficient environments, reducing dependence on direct data acquisition, while maintaining model efficacy. It provides a foundation for accelerating the deployment of AI technologies in high-risk industries by overcoming data bottlenecks and supporting real-world applications through practical synthetic augmentation. |
|---|---|
| ISSN: | 2075-5309 |