Journal of Jilin University(Engineering and Technology Edition) ›› 2020, Vol. 50 ›› Issue (5): 1765-1770.doi: 10.13229/j.cnki.jdxbgxb20190755

Previous Articles    

Clothing classification algorithm based on landmark attention and channel attention

Hong-wei ZHAO1(),Xiao-han LIU1,Yuan ZHANG1,Li-li FAN1,Man-li LONG2(),Xue-bai ZANG1   

  1. 1.College of Computer Science and Technology,Jilin University,Changchun 130012,China
    2.School of Foreign Language Education, Jilin University, Changchun 130012,China
  • Received:2019-07-25 Online:2020-09-01 Published:2020-09-16
  • Contact: Man-li LONG E-mail:zhaohw@jlu.edu.cn;Longml@jlu.edu.cn

Abstract:

In order to solve the problems of clothing landmark detection, category classification and attribute prediction, a novel deep neural network based on the combination of landmark attention mechanism and channel attention mechanism was proposed. First, the network predicts clothing landmarks by convoluting the input feature map to extract features, deconvoluting to restore the feature map size. Then, it acquires the connection between the landmarks by adding a non-local structure, thus, obtaining the landmark attention. The landmark attention module emphasizes the characteristics of the discriminative area in the clothing, and then new feature maps are generated. In addition, channel attention increases the weight of some feature maps which are more useful for category classification and attribute prediction. The experimental results on the DeepFashion dataset show that the proposed method can improve the accuracy of category classification and the recall rate of attribute prediction compared with the existing methods.

Key words: computer application, clothing category classification, clothing attribute prediction, deep learning, attention mechanism

CLC Number: 

  • TP391

Fig.1

Landmark detection, category classification and attribute prediction"

Fig.2

Whole instruction of network"

Fig.3

Visualization of landmark detection"

Fig.4

Clothing classification and attribute prediction results"

Table 1

Experimental results for landmark detection"

方法左领口右领口左袖口右袖口左腰线右腰线左下摆右下摆平均值
文献[1]0.08540.09020.09730.09350.08540.08450.08120.08230.0872
文献[2]0.06280.06370.06580.06210.07260.07020.06580.06630.0660
文献[14]0.05700.06110.06720.06470.07030.06940.06240.06270.0643
文献[3]0.03320.03460.04870.05190.04420.04290.06200.06390.0474
本文0.03850.03900.05460.05700.04890.05170.05520.05850.0504

Table 2

Experimental results for category classification and attribute prediction"

方法分类纹理面料形状部分
top-3top-5top-3top-5top-3top-5top-3top-5top-3top-5
文献[15]43.7366.2624.2132.6525.3836.0623.3931.2626.3133.24
文献[16]59.4879.5836.1548.1536.6448.5235.8946.9339.1750.14
文献[1]82.5890.1737.4649.5239.3049.8439.3748.5944.1354.02
文献[17]86.3092.8053.6063.2039.1048.8050.1059.5038.8048.90
文献[3]91.1696.1256.1765.8343.2053.5258.2867.8046.9757.42
本文91.2495.9457.1166.6244.2554.5259.5668.9247.6058.01
1 Liu Z, Luo P, Qiu S, et al. DeepFashion: powering robust clothes recognition and retrieval with rich annotations[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 1096-1104.
2 Liu Z, Yan S, Luo P, et al. Fashion landmark detection in the wild[C]∥European Conference on Computer Vision, Amsterdam, Netherlands, 2016: 229-245.
3 Liu J, Lu H. Deep fashion analysis with feature map upsampling and landmark-driven attention[C]∥European Conference on Computer Vision, Munich Germany, 2018: 30-36.
4 Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Munich Germany, 2018: 7132-7141.
5 Buades A, Coll B, Morel J M. A non-local algorithm for image denoising[C]∥2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA, 2005: 60-65.
6 Wang X, Girshick R, Gupta A, et al. Non-local neural networks[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Munich Germany, 2018: 7794-7803.
7 Shih K J, Singh S, Hoiem D. Where to look: focus regions for visual question answering[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 4613-4621.
8 Yang Z, He X, Gao J, et al. Stacked attention networks for image question answering[C]∥IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 21-29.
9 纪超, 刘慧英, 孙景峰, 等. 基于空域和频域的图像显著区域检测[J]. 吉林大学学报: 工学版, 2014, 44(1): 117-183.
Ji Chao, Liu Hui-ying, Sun Jing-feng, et al. Image salient region detection based on spatial and frequency domains[J]. Journal of Jilin University(Engineering and Technology Edition), 2014, 44(1): 177-183.
10 董超, 刘晶红, 徐芳, 等. 光学遥感图像舰船目标快速检测方法[J]. 吉林大学学报: 工学版, 2019, 49(4): 1369-1376.
Dong Chao, Liu Jing-hong, Xu Fang, et al. Fast ship detection in optical remote sensing images[J]. Journal of Jilin University(Engineering and Technology Edition), 2019, 49(4): 1369-1376.
11 Newell A, Yang K, Deng J. Stacked hourglass networks for human pose estimation[C]∥European Conference on Computer Vision, Amsterdam, Netherlands, 2016: 483-499.
12 Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[J/OL].[2015-04-10].
13 Lin M, Chen Q, Yan S. Network in network[J].arXiv preprint arXiv:1312.4400, 2013.
14 Yan S, Liu Z, Luo P. Unconstrained fashion landmark detection via hierarchical recurrent transformer networks[C]∥ACM on Multimedia Conference, Silicon Valley, USA, 2017: 172-180.
15 Chen H, Gallagher A, Girod B. Describing clothing by semantic attributes[C]∥European Conference on Computer Vision, Florence Italy, 2012: 609-623.
16 Huang J, Feris R S, Chen Q, et al. Cross-domain image retrieval with a dual attribute-aware ranking network[C]∥IEEE International Conference on Computer Vision, Santiago, Chile, 2015: 1062-1070.
17 Corbiere C, Ben-Younes H, Rame A, et al. Leveraging weakly annotated data for fashion image retrieval and label prediction[C]∥IEEE International Conference on Computer Vision Workshop, Venice, Italy, 2017: 2268-2274.
[1] Xiang-jiu CHE,You-zheng DONG. Improved image recognition algorithm based on multi⁃scale information fusion [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(5): 1747-1754.
[2] Zhou-zhou LIU,Wen-xiao YIN,Qian-yun ZHANG,Han PENG. Sensor cloud intrusion detection based on discrete optimization algorithm and machine learning [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(2): 692-702.
[3] Xiao-hui WANG,Lu-shen WU,Hua-wei CHEN. Denoising of scattered point cloud data based on normal vector distance classification [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(1): 278-288.
[4] Xiao-dong ZHANG,Xiao-jun XIA,Hai-feng LYU,Xu-chao GONG,Meng-jia LIAN. Dynamic load balancing of physiological data flow in big data network parallel computing environment [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(1): 247-254.
[5] Man CHEN,Yong ZHONG,Zhen-dong LI. Multi-focus image fusion based on latent lowrank representation combining lowrank representation [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(1): 297-305.
[6] Shun-fu JIN,Xiu-chen QIE,Hai-xing WU,Zhan-qiang HUO. Clustered virtual machine allocation strategy in cloud computing based on new type of sleep-mode and performance optimization [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(1): 237-246.
[7] Jun-yi DENG,Yan-heng LIU,Shi FENG,Rong-cun ZHAO,Jian WANG. GSPN⁃based model to evaluate the performance and securi tytradeoff in Ad-hoc network [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(1): 255-261.
[8] Tie-jun WANG,Wei-lan WANG. Thangka image annotation based on ontology [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(1): 289-296.
[9] Xiong-fei LI,Jing WANG,Xiao-li ZHANG,Tie-hu FAN. Multi-focus image fusion based on support vector machines and window gradient [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(1): 227-236.
[10] Hong-yan WANG,He-lei QIU,Jia ZHENG,Bing-nan PEI. Visual tracking method based on low⁃rank sparse representation under illumination change [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(1): 268-277.
[11] You ZHOU,Sen YANG,Da-lin LI,Chun-guo WU,Yan WANG,Kang-ping WANG. Acceleration platform for face detection and recognition based on field⁃programmable gate array [J]. Journal of Jilin University(Engineering and Technology Edition), 2019, 49(6): 2051-2057.
[12] Hong-wei ZHAO,Peng WANG,Li-li FAN,Huang-shui HU,Ping-ping LIU. Similarity retention instance retrieval method [J]. Journal of Jilin University(Engineering and Technology Edition), 2019, 49(6): 2045-2050.
[13] Jun SHEN,Xiao ZHOU,Zu-qin JI. Implementation of service dynamic extended network and its node system model [J]. Journal of Jilin University(Engineering and Technology Edition), 2019, 49(6): 2058-2068.
[14] Bing-hai ZHOU,Qiong WU. Balancing and optimization of robotic assemble lines withtool and space constraint [J]. Journal of Jilin University(Engineering and Technology Edition), 2019, 49(6): 2069-2075.
[15] Xiang-jiu CHE,Hua-luo LIU,Qing-bin SHAO. Fabric defect recognition algorithm based onimproved Fast RCNN [J]. Journal of Jilin University(Engineering and Technology Edition), 2019, 49(6): 2038-2044.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!