Journal of Jilin University(Engineering and Technology Edition) ›› 2024, Vol. 54 ›› Issue (5): 1426-1434.doi: 10.13229/j.cnki.jdxbgxb.20220790

Previous Articles    

UAV target tracking algorithm based on high resolution siamese network

Dian-wei WANG1(),Chi ZHANG1,Jie FANG1,Zhi-jie XU2   

  1. 1.School of Communication and Information Engineering,Xi'an University of Posts and Telecommunications,Xi'an 710121,China
    2.School of Computing and Engineering,University of Huddersfield,Huddersfield HD13DH,UK
  • Received:2022-06-25 Online:2024-05-01 Published:2024-06-11

Abstract:

Unmanned Aerial Vehicle(UAV) target tracking tasks are often suffer from small target size, large scale variance and frequent viewpoint change. To address these issues, in this paper, an UAV target tracking algorithm based on high-resolution siamese network is proposed. Firstly, a high-resolution network is improved as the feature extraction backbone network (Lite-high resolution network, L-HRNet), and a dynamic multi-template strategy is used to mine the inter-frame information of the video. Secondly, a multi-frame feature fusion module is constructed to obtain fusion features that are beneficial to target localization. Finally, an anchor-free strategy is selected to locate the target position and obtain accurate tracking results. The experimental results show that the success rate and accuracy of the proposed algorithm are 66.0% and 84.7% on DTB70 dataset, 65.7% and 84.3% on UAV123 dataset respectively, which improves the target tracking performance effectively.

Key words: computer vision, target tracking, unmanned aerial vehicle, high-resolution siamese network, multi-frame feature

CLC Number: 

  • TP391.4

Fig.1

Structure of tracking framework"

Fig.2

Structure of L-HRNet"

Fig.3

Structure of multi-frame feature fusion module"

Table 1

Performance of different algorithms on DTB70 dataset"

算法成功率精确率
SiamFC0.4870.738
SiamRPN0.5880.790
DaSiamRPN0.6100.796
SiamRPN++0.6140.800
SiamCAR0.5960.802
SiamBAN0.6430.834
本文0.6600.847

Fig.4

Performance evaluation of algorithms on UAV123 dataset small target sequence"

Fig.5

Performance evaluation of algorithms on UAV123 dataset"

Table 2

Performance evaluation of algorithms on different attributes of UAV123 dataset"

算法低分辨率尺度变化视点改变纵横比改变相似物体全遮挡
SPSPSPSPSPSP
SiamFC0.3460.7190.4760.6970.4790.6860.4360.6610.4660.7060.3060.571
SiamRPN0.4000.6160.5440.7520.6080.7790.5360.7240.4870.6820.3490.565
DaSiamRPN0.4110.6630.5600.7540.5630.7530.5370.7390.5170.7470.3790.633
SiamRPN++0.4330.6660.5780.7750.6370.8330.5440.7560.5410.7400.3540.582
SiamCAR0.4650.6930.6050.7910.6460.8070.5800.7590.5630.7540.4190.660
SiamBAN0.4720.7190.6130.8130.6390.8240.5910.7960.5660.7770.4180.671
本文0.4940.7340.6410.8240.6900.8640.6340.7840.5710.7620.4150.658

Fig.6

Tracking results of algorithms on three sequences"

Table 3

Experimental results with different backbone network"

骨干网络成功率/%精确率/%
AlexNet61.378.5
HRNet63.081.7
L-HRNet65.784.3

Table 4

Experimental results with different numbers of templates"

模板数量成功率/%精确率/%帧率/(帧·s-1
161.580.646
264.883.034
365.784.329
1 谢才华, 王学伟, 付强, 等. 基于动态模板更新的孪生网络目标跟踪算法[J]. 吉林大学学报: 工学版, 2022, 52(5): 1106-1116.
Xie Cai-hua, Wang Xue-wei, Fu Qiang, et al. Siamese network target tracking algorithm based on dynamic template update[J]. Journal of Jilin University (Engineering Edition), 2022, 52(5): 1106-1116.
2 刘晶红, 邓安平, 陈琪琪, 等. 基于多重注意力机制的无锚框目标跟踪算法[J]. 吉林大学学报:工学版, 2023, 53(12): 3518-3528.
Liu Jing-hong, Deng An-ping, Chen Qi-qi, et al. Anchor-free target tracking algorithm based on multiple attention mechanism[J]. Journal of Jilin University (Engineering and Technology Edition), 2023, 53(12): 3518-3528.
3 Bertinetto L, Valmadre J, Henriques J F, et al. Fully convolutional Siamese networks for object tracking[C]∥Europe an Conference on Computer Vision. Berlin: Springer, 2016: 850-865.
4 Li B, Yan J, Wu W, et al. High performance visual tracking with Siamese region proposal network[C]∥Proceedings. of the IEEE Conference on Computer Vision and Pattern Recognition, Utah, USA, 2018: 8971-8980.
5 Li B, Wu W, Wang Q, et al. SiamRPN++: evolution of Siamese visual tracking with very deep networks[C]∥Proceedings. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, California, USA, 2019: 4282-4291.
6 He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]∥Proceedings. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nevada,USA,2016: 770 -778.
7 Guo D, Wang J, Cui Y, et al. SiamCAR: Siamese fully convolutional classification and regression for visual tracking[C]∥Proceedings. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020: 6269-6277.
8 Chen Z, Zhong B, Li G, et al. Siamese box adaptive network for visual tracking [C]∥Proceedings. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020: 6668-6677.
9 Xu Y, Wang Z, Li Z, et al. SiamFC++: towards robust and accurate visual tracking with target estimation guidelines[C]∥Proceedings. of the AAAI Conference on Artificial Intelligence, 2020, 34(7): 12549-12556.
10 Han K, Wang Y, Chen H, et al. A survey on visual transformer[J]. arXiv preprint arXiv: 2012.12556, 2020, 2(4).
11 Khan S, Naseer M, Hayat M, et al. Transformers in vision: a survey[J]. ACM Computing Surveys (CSUR), 2022, 54(10s): 1-41.
12 Chen X, Yan B, Zhu J, et al. Transformer tracking[C]∥Proceedings. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 8126-8135.
13 Wang N, Zhou W, Wang J, et al. Transformer meets tracker: exploiting temporal context for robust visual tracking [C]∥Proceedings. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021:1571-1580.
14 Fu Z, Liu Q, Fu Z, et al. Stmtrack: template-free visual tracking with space time memory networks[C]∥Proceedings. of the IEEE /CVF Conference on Computer Vision and Pattern Recognition, 2021: 13774-13783.
15 Zhao M, Okada K, Inaba M. TrTr: visual tracking with transformer[J]. arXiv preprint arXiv: 2105. 03817, 2021.
16 Cui Y, Jiang C, Wang L, et al. MixFormer: end-to-end tracking with iterative mixed attention[C]∥Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 13608-13618.
17 Cao Z, Huang Z, Pan L, et al. TCTrack: temporal contexts for aerial tracking[C]∥Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2022: 14798-14808.
18 Song Z, Yu J, Chen Y P, et al. Transformer tracking with cyclic shifting window attention[C]∥Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 8791-8800.
19 Cao Z, Fu C, Ye J, et al. HiFT: hierarchical feature transformer for aerial tracking[C]∥Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021: 15457-15466.
20 Fu C, Li S, Yuan X, et al. Ad2Attack: adaptive adversarial attack on real-time UAV tracking [J]. arXiv preprint arXiv:, 2022.
21 Xing D, Evangeliou N, Tsoukalas A, et al. Siamese transformer pyramid networks for real-time UAV tracking[C]∥Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022: 2139-2148.
22 Simonyan K, Zisserman A. Very deep convolutional networks for large scale image recognition[J]. arXiv: , 2014.
23 Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks[J]. Communications of the ACM, 2017, 60(6): 84-90.
24 Sun K, Xiao B, Liu D, et al. Deep high-resolution representation learning for human pose estimation[C]∥Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, California, USA, 2019: 5693-5703.
25 Zhang Y, Wang C, Wang X, et al. Fairmot: on the fairness of detection and reidentification in multiple object tracking[J]. International Journal of Computer Vision, 2021, 129(11): 3069-3087.
26 Russakovsky O, Deng J, Su H, et al. Imagenet large scale visual recognition challenge[J]. International Journal of Computer Vision, 2015, 115(3): 211-252.
27 Huang L, Zhao X, Huang K. Got-10k: a large high-diversity benchmark for generic object tracking in the wild[J]. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2019, 43(5): 1562-1577.
28 Li S, Yeung D Y. Visual object tracking for unmanned aerial vehicles: a benchmark and new motion models[C]∥Thirty-first AAAI Conference on Artificial Intelligence, San Francisco, USA, 2017.
29 Mueller M, Smith N, Ghanem B. A benchmark and simulator for UAV tracking [C]∥European Conference on Computer Vision. Belin: Springer, 2016: 445-446.
[1] Yu WANG,Kai ZHAO. Postprocessing of human pose heatmap based on sub⁃pixel location [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(5): 1385-1392.
[2] Yun-long GAO,Ming REN,Chuan WU,Wen GAO. An improved anchor-free model based on attention mechanism for ship detection [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(5): 1407-1416.
[3] Xiao-bo CHEN,Ling CHEN. Variational Bayesian cooperative target tracking with unknown localization noise statistics [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(4): 1030-1039.
[4] Ya-jing YU,Jian GUO,Rong-hao WANG,Wei QIN,Ming-wu SONG,Zheng-rong XIANG. Time⁃varying formation control of multiquadrotor unmanned aerial vehicles based on state observer [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(3): 871-882.
[5] Shan XUE,Ya-liang ZHANG,Qiong-ying LYU,Guo-hua CAO. Anti⁃unmanned aerial vehicle system object detection algorithm under complex background [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(3): 891-901.
[6] Long-long CHEN,Tian-yu FENG,Zong-yang LYU,Yu-hu WU. Finite⁃time sliding mode attitude control for coaxial tilt⁃rotor unmanned aerial vehicle [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(3): 883-890.
[7] Ying HE,Jun-song FAN,Wei WANG,Geng SUN,Yan-heng LIU. Joint optimization of secure communication and trajectory planning in unmanned aerial vehicle air⁃to⁃ground [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(3): 913-922.
[8] Guo-yuan QI,Hao CHEN. Observer⁃based control⁃anti⁃disturbance⁃obstacle avoidance of quadrotor unmanned aerial vehicle [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(3): 810-822.
[9] Hong-yang PAN,Zhao LIU,Bo YANG,Geng SUN,Yan-heng LIU. Overview of swarm intelligence methods for unmanned aerial vehicle systems based on new⁃generation information technology [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(3): 629-642.
[10] Jing-hong LIU,An-ping DENG,Qi-qi CHEN,Jia-qi PENG,Yu-jia ZUO. Anchorfree target tracking algorithm based on multiple attention mechanism [J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(12): 3518-3528.
[11] Bin XIAN,Jie-qi LI,Xun GU. Ground effects compensation for an unmanned aerial vehicle via nonlinear disturbance observer [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(8): 1926-1933.
[12] Qiang GUO,Yu-qiang CUI,Yong WANG. Nodes scheduling algorithm based on dynamic cluster in wireless sensor network [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(6): 1466-1476.
[13] Hang ZHU,Han-bo YU,Jia-hui LIANG,Hong-ze LI. Improved algorithm of UAV search based on electric field model and simulation analysis [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(12): 3029-3038.
[14] Kan WANG,Hang SU,Hao ZENG,Jian QIN. Deep target tracking using augmented apparent information [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(11): 2676-2684.
[15] Bin XIAN,Shi-jing ZHANG,Xiao-wei HAN,Jia-ming CAI,Ling WANG. Trajectory planning for unmanned aerial vehicle slung⁃payload aerial transportation system based on reinforcement learning [J]. Journal of Jilin University(Engineering and Technology Edition), 2021, 51(6): 2259-2267.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!