Journal of Jilin University(Engineering and Technology Edition) ›› 2023, Vol. 53 ›› Issue (1): 226-233.doi: 10.13229/j.cnki.jdxbgxb20210510

Previous Articles    

Visual relationship detection method based on construction scene

Jun-jie WANG(),Yuan-jun NONG,Li-te ZHANG,Pei-chen ZHAI   

  1. School of Engineering,Ocean University of China,Qingdao 266100,China
  • Received:2021-06-04 Online:2023-01-01 Published:2023-07-23

Abstract:

The non-compliant interaction between workers, construction machinery and construction appliances in the construction site is an important cause of safety accidents. Therefore, a visual relationship detection method based on construction scene is proposed. Firstly, convolution neural network is used to build entity detection and relationship detection branches to extract entity features and relationship features in construction scene. Secondly, visual module, semantic module and space module are constructed to learn the extracted features, so that the network can fully perceive and understand visual information, semantic information and spatial information. Finally, a graphical contrastive loss function is designed to improve the visual relationship detection performance. The experimental results on the self-made construction relationship detection data set show that the proposed method achieves the R@20, R@50, R@100 recall rate of 75.89%, 77.64% and 78.93%. The proposed method has good visual relationship detection performance, and can accurately detect the objects and their interactions in the construction scene.

Key words: computer application technology, visual relationship detection, construction scene, convolutional neural network, scene graph, image understanding

CLC Number: 

  • TP319.4

Fig.1

Network structure of construction relationship detection model"

Fig.2

Entity feature and relation feature extracted by entity and relation detection branch"

Table 1

Number of images, entity targets and their interaction in different construction scenes"

场景

图像

数量

主语

(实体1)

谓语

(关系)

宾语

(实体2)

工人推/拉手推车145workerpush/pullcart
挖掘机挖土150excavatordigsoil
工人焊接铁制品150workerweldironwork
工人拿锤子145workerholdhammer
工人拿钻机150workerholdelectric drill
工人在脚手架上155workerstand onscaffold
工人爬梯子145workerclimbladder
工人佩戴安全帽160workerwearhardhat

Table 2

Visual relationship detection results of different methods"

方 法R@20/%R@50/%R@100/%
Large1273.5875.6276.56
Motifs1372.1673.8375.94
Graph1472.8374.7676.12
Unbiased1573.0474.9275.88
本文75.8977.6478.93

Fig.3

Visual relationship detection results of construction scene"

1 中华人民共和国住房和城乡建设部办公厅. 住房和城乡建设部办公厅关于2019年房屋市政工程生产安全事故情况的通报[EB/OL]. [2021-06-04]. .
2 Luo H, Wang M, Wong P K, et al. Construction machine pose prediction considering historical motions and activity attributes using gated recurrent unit (GRU)[J]. Automation in Construction, 2021,121:No.103444.
3 Wu J, Cai N, Chen W, et al. Automatic detection of hardhats worn by construction personnel: a deep learning approach and benchmark dataset[J]. Automation in Construction, 2019,106: No.102894.
4 Fang Q, Li H, Luo X, et al. Detecting non-hardhat-use by a deep learning method from far-field surveillance videos[J]. Automation in Construction, 2018,85:1-9.
5 Nath N D, Behzadan A H, Paal S G. Deep learning for site safety: real-time detection of personal protective equipment[J]. Automation in Construction, 2020,112: No.103085.
6 Lu C, Krishna R, Bernstein M, et al. Visual Relationship detection with language priors[J]. European Conference on Computer Vision, 2016,9905: 852-869.
7 Tang S, Roberts D, Golparvar-Fard M. Human-object interaction recognition for automatic construction site safety inspection[J]. Automation in Construction, 2020, 120: No.103356.
8 Xiong R, Song Y, Li H, et al. Onsite video mining for construction hazards identification with visual relationships[J]. Advance Engineering Informatics, 2019, 42: No.100966.
9 Zhang J, Shih K J, Elgammal A, et al. Graphical contrastive losses for scene graph parsing[J]. IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, USA,2019:11527-11535.
10 Krishna R, Zhu Y, Groth O, et al. Visual genome: connecting language and vision using crowdsourced dense image annotations[EB/OL]. [2021-06-04]. .
11 Dutta A, Zisserman A. The VIA annotation software for images, audio and video[EB/OL]. [2021-06- 04]. .
12 Zhang J, Kalantidis Y, Rohrbach M, et al. Large-scale visual relationship understanding[J]. AAAI Conference on Artificial Intelligence, 2019, 33(1):9185-9194.
13 Zellers R, Yatskar M, Thomson S, et al. Neural motifs: scene graph parsing with global context[J]. IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA,2018: 5831-5840.
14 Yang J, Lu J, Lee S, et al. Graph R-CNN for scene graph generation[J]. European Conference on Computer Vision, 2018,11205: 690-706.
15 Tang K H, Niu Y L, Huang J Q, et al. Unbiased scene graph generation from biased training[J]. IEEE Conference on Computer Vision and Pattern Recognition, Seattle, USA,2020:3716-3725.
[1] Zhen-hai ZHANG,Kun JI,Jian-wu DANG. Crack identification method for bridge based on BCEM model [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(5): 1418-1426.
[2] Pei-yong LIU,Jie DONG,Luo-feng XIE,Yang-yang ZHU,Guo-fu YIN. Surface defect detection algorithm of magnetic tiles based on multi⁃branch convolutional neural network [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(5): 1449-1457.
[3] Shan XUE,Ya-liang ZHANG,Qiong-ying LYU,Guo-hua CAO. Anti⁃unmanned aerial vehicle system object detection algorithm under complex background [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(3): 891-901.
[4] Tian BAI,Ming-wei XU,Si-ming LIU,Ji-an ZHANG,Zhe WANG. Dispute focus identification of pleading text based on deep neural network [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(8): 1872-1880.
[5] Fu-heng QU,Tian-yu DING,Yang LU,Yong YANG,Ya-ting HU. Fast image codeword search algorithm based on neighborhood similarity [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(8): 1865-1871.
[6] Xuan-jing SHEN,Xue-feng ZHANG,Yu WANG,Yu-bo JIN. Multi⁃focus image fusion algorithm based on pixel⁃level convolutional neural network [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(8): 1857-1864.
[7] Gui-he QIN,Jun-feng HUANG,Ming-hui SUN. Text input based on two⁃handed keyboard in virtual environment [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(8): 1881-1888.
[8] Huai-jiang YANG,Er-shuai WANG,Yong-xin SUI,Feng YAN,Yue ZHOU. Simplified residual structure and fast deep residual networks [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(6): 1413-1421.
[9] Ming-hua GAO,Can YANG. Traffic target detection method based on improved convolution neural network [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(6): 1353-1361.
[10] Ming LIU,Yu-hang YANG,Song-lin ZOU,Zhi-cheng XIAO,Yong-gang ZHANG. Application of enhanced edge detection image algorithm in multi-book recognition [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(4): 891-896.
[11] Shi-min FANG. Multiple source data selective integration algorithm based on frequent pattern tree [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(4): 885-890.
[12] Xue-zhi WANG,Qing-liang LI,Wen-hui LI. Spatio⁃temporal model of soil moisture prediction integrated with transfer learning [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(3): 675-683.
[13] Xiang-jun LI,Jie-ying TU,Zhi-bin ZHAO. Validity classification of melting curve based on multi⁃scale fusion convolutional neural network [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(3): 633-639.
[14] Xian-tong LI,Wei QUAN,Hua WANG,Peng-cheng SUN,Peng-jin AN,Yong-xing MAN. Route travel time prediction on deep learning model through spatiotemporal features [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(3): 557-563.
[15] Wen-zhi GAO,Yan-jun WANG,Xin-wei WANG,Pan ZHANG,Yong LI,Yang DONG. Real⁃time diagnosis for misfire fault of diesel engine based on convolutional neural network [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(2): 417-424.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!