Journal of Jilin University(Engineering and Technology Edition) ›› 2024, Vol. 54 ›› Issue (9): 2646-2657.doi: 10.13229/j.cnki.jdxbgxb.20221472

Previous Articles    

Object detection of high-resolution remote sensing images by neural architecture search

Jun YANG1,2(),Peng-fei HAN2   

  1. 1.School of Electronic and Information Engineering,Lanzhou jiaotong University,Lanzhou 730070,China
    2.Faculty of Geomatics,Lanzhou Jiaotong University,Lanzhou 730070,China
  • Received:2022-11-17 Online:2024-09-01 Published:2024-10-29

Abstract:

Aiming at the problems of traditional object detection methods on remote sensing images based on deep learning networks need hand-crafted architectures, which are overly dependent on expert experience and time-consuming, an object detection method for remote sensing images based on neural architecture search is proposed. The network is automatically built by pathwise sampling and evolutionary search strategy for object detection of remote sensing images. Experiments on DIOR dataset and RSOD dataset show that the mean average precision of object detection reached 67.8% and 85.5%, and the FLOPs are 208.47 G and 201.67 G, which are better than the network models such as Faster R-CNN, RetinaNet, NAS-FCOS, ResNet Strikes Back, HRNet and GRoIE in terms of detection accuracy and computational efficiency. The proposed method can automatically search the network architecture for object detection of high-resolution remote sensing images, which is superior to the hand-crafted classical networks.

Key words: remote sensing, high-resolution remote sensing image, network architecture search, object detection, pathwise sampling

CLC Number: 

  • TP751.1

Fig. 1

Architecture of the object detection network"

Fig.2

Final searched network architecture"

Fig.3

Detection Results on DIOR Dataset"

Table 1

Comparison of object detection precision of different network models on DIOR dataset"

网络mAPAP50AP75APsAPmAPl
Faster R-CNN66.188.874.212.250.876.2
GRoIE64.191.272.726.057.371.4
HRNet63.391.172.728.552.070.8
ResNet-SB52.076.457.59.731.262.1
RetinaNet34.355.136.73.919.442.8
NAS-FCOS55.780.460.610.435.865.5
本文67.891.378.336.160.674.4

Table 2

Comparison of AP(%) of different object categories on DIOR dataset"

类别Faster R-CNNGRoIEHRNetResNet-SBRetinaNetNAS-FCOS本文
机场66.060.063.442.210.952.258.2
车辆46.752.351.942.030.047.053.0
篮球场81.077.972.363.950.168.781.9
田径场82.278.180.265.646.965.581.4
风车56.154.047.543.230.543.356.5
船舶47.048.552.043.131.648.648.7
高速公路收费站74.372.072.063.350.557.476.4
网球场90.486.278.479.975.482.089.5
高尔夫球场61.853.661.043.932.256.362.2
立交桥57.059.057.144.322.238.961.9
储油罐67.370.667.162.554.171.371.0
棒球场86.585.284.777.675.983.585.4
港口41.734.344.326.38.434.051.4
体育场77.976.675.969.247.067.981.2
桥梁43.749.445.431.810.527.555.3
飞机81.578.074.967.859.975.580.6
火车站49.248.944.219.25.225.452.1
高速公路服务区70.162.863.744.09.653.168.7
水坝55.052.848.436.519.335.458.5
烟囱81.781.381.674.763.280.082.0

Fig.4

Confusion matrix on DIOR dataset"

Table 3

Comparison of parameters of different algorithms on DIOR dataset"

网络FLOPs/G参数/M搜索时间/h训练时间/hmAP
Faster R-CNN798.0432.9520.666.1
GRoIE545.0043.1222.164.1
HRNet298.6546.9721.163.3
ResNet-SB216.4041.2219.352.0
RetinaNet285.9820.0117.134.3
NAS-FCOS230.6238.1511.418.455.7
本文208.4725.3812.620.867.8

Fig.5

Curves on learning rate and training losses of the proposed network model"

Fig.6

Detection Results on RSOD Dataset"

Table 4

Comparison of object detection precision of different network models on RSOD dataset"

网络mAPAP50AP75APsAPmAPl
Faster R-CNN84.999.596.461.083.088.0
GRoIE84.599.496.360.782.287.9
HRNet84.199.297.359.982.587.2
ResNet-SB77.497.590.523.366.482.3
RetinaNet51.484.855.833.652.554.0
NAS-FCOS77.398.089.226.975.083.1
本文85.599.597.963.283.288.6

Table 5

Comparison of AP of different object categories on RSOD dataset"

网络飞机操场立交桥油罐
Faster R-CNN78.792.281.287.6
GRoIE79.190.381.087.5
HRNet78.589.881.486.9
ResNet-SB60.393.181.474.8
RetinaNet59.059.330.157.1
NAS-FCOS66.386.872.683.5
本文79.990.085.287.9

Fig.7

Confusion matrix on RSOD dataset"

Table 6

Comparison of parameters of different algorithms on RSOD dataset"

网络FLOPs/G参数/M训练时间/hmAP
Faster R-CNN841.4041.1410.184.9
GRoIE544.9243.0410.984.5
HRNet242.3444.6811.384.1
ResNet-SB298.5746.899.477.4
RetinaNet162.8919.689.151.4
NAS-FCOS216.3138.678.977.3
本文201.6732.7910.285.5
1 周鹏, 杨军. 采用神经网络架构搜索的遥感影像分割方法[J]. 西安电子科技大学学报, 2021, 48(5): 47-57, 77.
Zhou Peng, Yang Jun. Semantic segmentation of remote sensing images based on neural architecture search[J]. Journal of Xidian University, 2021, 48(5): 47-57, 77.
2 张晓东, 张力飞, 陈关州, 等. 基于深度学习的遥感影像地物目标检测和轮廓提取一体化模型[J]. 测绘地理信息, 2019, 44(6): 1-5.
Zhang Xiao-dong, Zhang Li-fei, Chen Guan-zhou, et al. An integrated model of object detection and contour extraction based on deep learning [J]. Journal of Geomatics, 2019, 44(6): 1-5.
3 田婷婷, 杨军. 基于多尺度特征融合网络的遥感影像目标检测[J]. 激光与光电子学进展, 2022, 59(16): 427-435.
Tian Ting-ting, Yang Jun. Object detection for remote sensing image using multi-scale feature fusion network[J]. Laser & Optoelectronics Progress, 2022, 59(16): 427-435.
4 Cao Y, Niu X, Dou Y. Region-based convolutional neural networks for object detection in very high-resolution remote sensing images[C]∥IEEE International Conference on Natural Computation, Hawaii, USA, 2016: 548-554.
5 Li K, Cheng G, Bu S, et al. Rotation-insensitive and context-augmented object detection in remote sensing images[J]. IEEE Transactions on Geoscience and Remote Sensing, 2018, 56(4): 2337-2348.
6 Zhong Y, Han X, Zhang L. Multi-class geospatial object detection based on a position-sensitive balancing framework for high spatial resolution remote sensing imagery[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2018, 138: 281-294.
7 Chen Z, Zhang T, Ouyang C. End-to-end airplane detection using transfer learning in remote sensing images[J]. Remote Sens, 2018, 10: No.139.
8 Liu W, Ma L, Chen H. Arbitrary-oriented ship detection framework in optical remote-sensing images [J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15(6): 937-41.
9 Zoph B, Le Q V. Neural architecture search with reinforcement learning[DB/OL]. [2016-11-05].
10 Zoph B, Vasudevan V, Shlens J, et al. Learning transferable architectures for scalable image recognition[C]∥IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA, 2018: 8697-8710.
11 Real E, Aggarwal A, Huang Y, et al. Regularized evolution for image classifier architecture search[C]∥Proceedings of the AAAI Conference on Artificial Intelligence, Hawaii, USA, 2019: 4780-4789.
12 Liu H, Simonyan K, Yang Y. DARTS: differentiable architecture search[C]∥International Conference on Learning Representations, Vancouver, Canada, 2018: 6-9.
13 Ghiasi G, Lin T Y, Le Q V. NAS-FPN: learning scalable feature pyramid architecture for object detection [C] ∥IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, 2019: 7036-7045.
14 Wang N, Gao Y, Chen H, et al. NAS-FCOS: fast neural architecture search for object detection [C] ∥IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA, 2020: 11940-11948.
15 Xu A, Yao A, Li A, et al. Auto-FPN: automatic network architecture adaptation for object detection beyond classification[C]∥IEEE/CVF International Conference on Computer Vision, Venice, Italy, 2020: 6648-6657.
16 Cao L, Zhang X, Wang Z. Arbitrary-oriented object detection on high-resolution images based on differentiable architecture search [J]. Canadian Journal of Remote Sensing, 2021, 47(5): 719-30.
17 Ma N N, Zhang X Y, Zheng H T, et al. ShuffleNet V2: practical guidelines for efficient CNN architecture design[J/OL]. [2022-11-12].
18 Xia G S, Bai X, Ding J, et al. DOTA: a large-scale dataset for object detection in aerial images[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA, 2018: 3974-3983.
19 Li K, Wan G, Cheng G, et al. Object detection in optical remote sensing images: a survey and a new benchmark[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2020, 159: 296-307.
20 Long Y, Gong Y, Xiao Z, et al. Accurate object localization in remote sensing images based on convolutional neural networks [J]. IEEE Transactions on Geoscience and Remote Sensing, 2017, 55(5): 2486-2498.
21 Xiao Z, Liu Q, Tang G, et al. Elliptic Fourier transformation-based histograms of oriented gradients for rotationally invariant object detection in remote-sensing images[J]. International Journal of Remote Sensing, 2015, 36(2): 618-644.
22 Ren S, He K, Girshick R, et al. Faster R-CNN: towards real-time object detection with region proposal networks [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.
23 Rossi L, Karimi A, Prati A. A novel region of interest extraction layer for instance segmentation[C]∥IEEE International Conference on Pattern Recognition, Chengdu, China, 2021: 2203-2209.
24 Sun K, Xiao B, Liu D, et al. Deep high-resolution representation learning for human pose estimation[C]∥IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, 2019: 5693-5703.
25 Wightman R, Touvron H, Jégou H. Resnet strikes back: an improved training procedure in timm [DB/OL].[2021-10-01].
26 Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[C]∥IEEE International Conference on Computer Vision, Venice, Italy, 2017: 2980-2988.
[1] Sheng-jie ZHU,Xuan WANG,Fang XU,Jia-qi PENG,Yuan-chao WANG. Multi-scale normalized detection method for airborne wide-area remote sensing images [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(8): 2329-2337.
[2] Xin-dong YOU,Lei GUO,Jing HAN,Xue-qiang LYU. An character recognition network for imprint character [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(7): 2072-2079.
[3] Yun-zuo ZHANG,Wei GUO,Wen-bo LI. Omnidirectional accurate detection algorithm for dense small objects in remote sensing images [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(4): 1105-1113.
[4] Xiong-fei LI,Zi-xuan SONG,Rui ZHU,Xiao-li ZHANG. Remote sensing change detection model based on multi⁃scale fusion [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(2): 516-523.
[5] Chun-hua WANG,En-ze LI,Min XIAO. Object detection in high-resolution remote sensing images based on multi-feature fusion and twin attention network [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(1): 240-250.
[6] Zhi-dan CAI,Ming FANG,Zhe LI,Jia-lu XU. Blind remote sensing image deblurring algorithm based on Gaussian curvature and reweighted graph total variation [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(9): 2649-2658.
[7] Jun-qing ZHU,Xue-ru ZHAO,Tao MA,Xiao-ming HUANG,Hong-zhou ZHU. Monitoring road geological disaster based on satellite remote sensing [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(6): 1861-1872.
[8] Shan XUE,Ya-liang ZHANG,Qiong-ying LYU,Guo-hua CAO. Anti⁃unmanned aerial vehicle system object detection algorithm under complex background [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(3): 891-901.
[9] Bo TAO,Fu-wu YAN,Zhi-shuai YIN,Dong-mei WU. 3D object detection based on high⁃precision map enhancement [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(3): 802-809.
[10] Li-bo CHENG,Xin-yue LI,Zhe LI,Xiao-ning JIA. Remote sensing image denoising method based on curvelet transform and goodness-of-fit test [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(11): 3207-3213.
[11] Wei-xuan MA,Yan ZHANG,Chuan-xiang MA,Sa ZHU. Edge detection algorithm of noisy remote sensing image under different illumination conditions [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(1): 241-247.
[12] Bing ZHU,Zi-wei LI,Qi LI. Building segmentation method of remote sensing image based on improved SegNet [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(1): 248-254.
[13] Bin WANG,Bing-hui HE,Na LIN,Wei WANG,Tian-yang LI. Tea plantation remote sensing extraction based on random forest feature selection [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(7): 1719-1732.
[14] Xiang-jiu CHE,He-yuan CHEN. Muti⁃Object dishes detection algorithm based on improved YOLOv4 [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(11): 2662-2668.
[15] You QU,Wen-hui LI. Single-stage rotated object detection network based on anchor transformation [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(1): 162-173.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!