Journal of Jilin University(Engineering and Technology Edition) ›› 2022, Vol. 52 ›› Issue (11): 2669-2675.doi: 10.13229/j.cnki.jdxbgxb20220043

Previous Articles    

Image retrieval algorithm based on response value center weighted convolution feature

Xiao-ning LI1,2(),Hong-wei ZHAO1,3,Dan-yang ZHANG1,Yuan ZHANG1()   

  1. 1.College of Computer Science and Technology,Jilin University,Changchun 130012,China
    2.College of Computer Science and Technology,Changchun Normal University,Changchun 130032,China
    3.Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education,Jilin University,Changchun 130012,China
  • Received:2022-01-09 Online:2022-11-01 Published:2022-11-16
  • Contact: Yuan ZHANG E-mail:lixiaoning@ccsfu.edu.cn;yzhang@jlu.edu.cn

Abstract:

A novel cross-entropy-based loss is proposed, which is applied to a classical convolutional neural network to obtain a better embedding space. A convolutional feature aggregation algorithm based on the weighting of the center of the response value is designed to process the three-dimensional convolutional features obtained by the neural network. The algorithm obtains the spatial weighting coefficient of the feature map by calculating the current position response value and the Gaussian center. The algorithm reduces the loss of information when the 3D convolution feature map is reduced to a 1D image feature descriptor and realizes the enhancement of the target area. Finally, the obtained image feature descriptors are used for retrieval tasks. On the CUB-200-2011 dataset, the effectiveness of the loss function and the response value center weighting algorithm are respectively verified by ablation experiments. Compared with the current retrieval methods, higher precision and recall rates have been obtained on the four datasets of Paris6k, Oxford5k, CUB-200-2011 and CARS196.

Key words: computer applications, image retrieval, deep learning, convolutional feature aggregation

CLC Number: 

  • TP391

Fig.1

Schematic diagram"

Fig.2

Response value center weighting process"

Table 1

Fusion parameter μ parameter test"

方法CUB-200-2011 K
1248
μ=0.268.979.286.692.2
μ=0.469.480.087.792.7
μ=0.668.979.086.792.3
μ=0.868.178.786.692.0

Table 2

Modular ablation experiments on CUB-200-2011 dataset"

方法CUB-200-2011 K
124816
基础网络1366.477.285.491.194.6

基础网络+

本文损失

67.077.785.991.395.1

改进网络+响应

值中心加权算法

69.480.087.792.795.9

Table 3

Comparative experiment of loss on Paris6k and Oxford5k datasets"

损失mAP
三元组1458.9661.52
交叉熵1381.7367.64
本文82.3069.01

Table 4

Comparative experiment of loss on CUB-200-2011 datasets"

损失CUB-200-2011 K
124816
三元组1438.050.362.274.184.6
交叉熵1368.678.886.692.295.2
本文69.480.087.792.795.9

Table 5

Convolutional feature aggregation algorithms comparison experiment on Paris6k and Oxford5k datasets"

方法mAP
最大池化650.9854.62
求和池化551.9354.38
多尺度最大池化160.5961.83
卷积描述子选择算法1578.9267.81
响应值中心加权算法82.3069.01

Table 6

Convolutional feature aggregation algorithms comparison experiment on CUB-200-2011 datasets"

方法CUB-200-2011 K
124816
最大池化667.077.785.991.395.1
求和池化548.661.772.982.990.2

多尺度最大

池化1

53.065.576.485.992.3

响应值中心

加权算法

69.480.087.792.795.9

Table 7

Comparison of experimental results on CARS196 datasets"

方法CARS196 K
124816
文献[427.132.346.158.972.2
文献[1639.150.463.374.584.1
文献[1749.060.372.181.589.2
文献[1858.170.680.387.8-
文献[1953.966.777.786.3-
文献[2057.468.680.189.492.3
文献[2158.569.879.186.291.8
文献[2263.973.782.189.293.7
文献[2375.983.989.794.096.6
本文算法80.287.091.594.897.1
1 Tolias G, Sicre R, Jégou H. Particular object retrieval with integral max-pooling of CNN activations[J/OL]. [2021-11-18].
2 Chen W, Liu Y, Wang W, et al. Deep learning for instance retrieval: a survey[J/OL]. [2021-01-27].
3 Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks[J]. Communications of the ACM, 2017, 60(6): 84-90.
4 Bell S, Bala K. Learning visual similarity for product design with convolutional neural networks[J]. Transactions on Graphics, 2015, 34(4): 2766959.
5 Babenko A, Lempitsky V. Aggregating local deep features for image retrieval[C]∥IEEE International Conference on Computer Vision, Santiago, Chile, 2015: 1269-1277.
6 Razavian A S, Sullivan J, Carlsson S, et al. Visual instance retrieval with deep convolutional networks[J]. ITE Transactions on Media Technology and Applications, 2016, 4(3): 251-258.
7 Anwaar M U, Labintcev E, Kleinsteuber M. Compositional learning of image-text query for image retrieval[C]∥IEEE Winter Conference on Applications of Computer Vision, Waikoloa, USA, 2015: 1140-1149.
8 He K M, Zhang X Y, Ren S Q, et al. Deep residual learning for image recognition[J/OL]. [2021-12-10].
9 Krause J, Stark M, Deng J, et al. 3D object representations for fine-grained categorization[C]∥In Proceedings of Proceedings of the IEEE International Conference on Computer Vision Workshops, Sydney, Australia, 2013: 554-561.
10 Wah C, Branson S, Welinder P, et al. The caltech-UCSD birds-200-2011 dataset[J]. California Institute of Technology, 2011(1): 20111026.
11 Philbin J, Chum O, Isard M, et al. Object retrieval with large vocabularies and fast spatial matching[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, USA, 2007: 1-8.
12 Philbin J, Chum O, Isard M, et al. Lost in quantization: Improving particular object retrieval in large scale image databases[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, USA, 2008: 1-8.
13 Dubey A, Gupta O, Raskar R, et al. Maximum-entropy fine grained classification[C]∥32nd Conference on Neural Information Processing Systems,Montreal, Canada, 2018: 637-647.
14 Zhang X F, Zhou F, Lin Y Q, et al. Embedding label structures for fine-grained feature representation[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 1114-1123.
15 Wei X S, Luo J H, Wu J X, et al. Selective convolutional descriptor aggregation for fine-grained image retrieval[J]. IEEE Transactions on Image Processing, 2017, 26(6): 2868-2881.
16 Zhang X, Zhou F, Lin Y, et al. Embedding label structures for fine-grained feature representation[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 1114-1123.
17 Song H O, Yu X, Jegelka S, et al. Deep metric learning via lifted structured feature embedding[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Las VegasLas,USA,2016: 4004-4012.
18 Song H O, Jegelka S, Rathod V, et al. Deep metric learning via facility location[C]∥30th IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 5382-5390.
19 Sohn K. Improved deep metric learning with multi-class n-pair loss objective[C]∥Advances in Neural Information Processing Systems 29, Barcelona, Spain, 2016: 1857-1865.
20 Chen H, Chen C L, Tang X O. Local similarity-aware deep feature embedding[J/OL]. [2021-10-27].
21 Wei X S, Luo J H, Wu J, et al. Selective convolutional descriptor aggregation for fine-grained image retrieval[J]. IEEE Transactions on Image Processing, 2017, 26(6): 2868-2881.
22 Zheng X W, Ji R R, Sun X S, et al. Centralized ranking loss with weakly supervised localization for fine-grained object retrieval[C]∥27th International Joint Conference on Artificial Intelligence, Stockholm, Sweden, 2018: 1226-1233.
23 Zheng X W, Ji R R, Sun X X, et al. Towards optimal fine grained retrieval via decorrelated centralized loss with normalize-scale layer[C]∥In Proceedings of Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, USA, 2019: 9291-9298.
[1] Jin-wu GAO,Zhi-huan JIA,Xiang-yang WANG,Hao XING. Degradation trend prediction of proton exchange membrane fuel cell based on PSO⁃LSTM [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(9): 2192-2202.
[2] Xiao-ying LI,Ming YANG,Rui QUAN,Bao-hua TAN. Unbalanced text classification method based on deep learning [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(8): 1889-1895.
[3] Xuan-jing SHEN,Xue-feng ZHANG,Yu WANG,Yu-bo JIN. Multi⁃focus image fusion algorithm based on pixel⁃level convolutional neural network [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(8): 1857-1864.
[4] Dan HU,Xin MENG. Vessel search method by earth observation satellite based on time⁃varying grid [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(8): 1896-1903.
[5] Ming-hua GAO,Can YANG. Traffic target detection method based on improved convolution neural network [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(6): 1353-1361.
[6] Ji-hong OUYANG,Ze-qi GUO,Si-guang LIU. Dual⁃branch hybrid attention decision net for diabetic retinopathy classification [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(3): 648-656.
[7] Lin SONG,Li-ping WANG,Jun WU,Li-wen GUAN,Zhi-gui LIU. Reliability analysis based on cyber⁃physical system and digital twin [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(2): 439-449.
[8] Jie CAO,Jia-lin MA,Dai-lin HUANG,Ping YU. A fault diagnosis method based on multi Markov transition field [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(2): 491-496.
[9] Shuai-na HUANG,Yu-xiang LI,Yue-heng MAO,Ai-ying BAN,Zhi-yong ZHANG. Black-box transferable adversarial attacks based on ensemble advGAN [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(10): 2391-2398.
[10] You QU,Wen-hui LI. Single-stage rotated object detection network based on anchor transformation [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(1): 162-173.
[11] Gui-xia LIU,Zhi-yao PEI,Jia-zhi SONG. Prediction of protein-ATP binding site based on deep learning [J]. Journal of Jilin University(Engineering and Technology Edition), 2022, 52(1): 187-194.
[12] Jie ZHANG,Wen JING,Fu CHEN. Vulnerability detection of instant messaging network protocol based on passive clustering algorithm [J]. Journal of Jilin University(Engineering and Technology Edition), 2021, 51(6): 2253-2258.
[13] Li-li REN,Zhi-jun WANG,Dong-mei YAN. Improved multi⁃verse algorithm with combined slime mould foraging behavior [J]. Journal of Jilin University(Engineering and Technology Edition), 2021, 51(6): 2190-2197.
[14] Li-li DONG,Dan YANG,Xiang ZHANG. Large⁃scale semantic text overlapping region retrieval based on deep learning [J]. Journal of Jilin University(Engineering and Technology Edition), 2021, 51(5): 1817-1822.
[15] Li-sheng JIN,Bai-cang GUO,Fang-rong WANG,Jian SHI. Dynamic multiple object detection algorithm for vehicle forward based on improved YOLOv3 [J]. Journal of Jilin University(Engineering and Technology Edition), 2021, 51(4): 1427-1436.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!