Journal of Jilin University(Engineering and Technology Edition) ›› 2025, Vol. 55 ›› Issue (5): 1756-1762.doi: 10.13229/j.cnki.jdxbgxb.20240566

Previous Articles     Next Articles

Discrimination method for Pu-er tea varieties based on noise-robust feature extraction

Xiu-zhi ZHAO1,2(),De-hong XIE3()   

  1. 1.College of Artificial Intelligence,Zhejiang Industry& Trade Vocational College,Wenzhou,325002,China
    2.School of Computer and Artificial Intelligence,Wenzhou University,Wenzhou 325002,China
    3.College of Information Science and Technology,Nanjing Forestry University,Nanjing 210037,China
  • Received:2024-05-22 Online:2025-05-01 Published:2025-07-18
  • Contact: De-hong XIE E-mail:rgznzxz@zjitc.edu.cn;dehong.xie@gmail.com

Abstract:

When using near-infrared spectroscopy and machine learning methods to quickly identify the quality of Pu-er tea, the spectra collected by medium and low-end near-infrared spectroscopy acquisition equipment have the characteristics of high dimension, overlap and large noise, which seriously affects the accuracy of modeling. This paper proposes a noise-robust feature extraction method, which is combined with support vector machine (SVM) classifier to establish the quality identification method of Pu-er tea. Firstly, the noise-robust feature extraction method, principal component analysis (PCA) and successive projections algorithm (SPA) are used to extract the features from the obtained near-infrared spectral data. Then, SVM is used to train the data after feature extraction to obtain the identification model. The comparison of the identification results of the model shows that for the noiseresidual near-infrared spectral data, the noise robust feature extraction method in this paper can effectively resist the influence of noise and propose feature variables from the high-dimensional spectrum to improve the accuracy of the identification model. The accuracy, recall, specificity, accuracy and F-score predicted by the identification model were significantly higher than those obtained by the other two methods. For the detection of ancient Pu-er tea and non-ancient Pu-er tea, the accuracy and recall predicted by the identification model in this paper have reached 92.06% and 95.38% respectively, indicating that the identification model has good identification ability. The research results provide theoretical reference and basis for accurately judging the quality of Pu-er tea in practical application.

Key words: near-infrared spectroscopy, noise, rapid identification, Pu-er tea, feature extraction, machine learning

CLC Number: 

  • O657.3

Fig.1

NIR spectra of Pu-er tea samples"

Fig.2

Flowchart of tea quality identification"

Fig.3

2D projections of sample points by feature extraction method: PCA,SPA,our method"

Fig.4

Accuracy of identification model varying with the extracted features count"

Table1

Quality identification results of Pu-er tea"

方法正确率召回率特效度准确率平衡F分数
PCA-SVM0.842 90.853 20.831 70.845 50.849 3
SPA-SVM0.700 00.629 60.805 60.829 30.715 8
本文方法0.920 60.953 80.885 20.898 60.925 4
[1] 赵阳, 龚加顺, 王秋萍. 古树普洱茶生茶贮藏过程中香气成分的变化[J]. 食品科学, 2022, 43(4): 241-248.
Zhao Yang, Gong Jia-shun, Wang Qiu-ping. Change in aroma components of raw pu-erh tea from ancient tea trees during storage[J]. Food Science, 2022, 43(4): 241-248.
[2] 曾敏, 龚正礼. 基于主成分分析法构建云南古树普洱生茶香气质量评价模型[J]. 食品工业科技, 2017, 38(15): 264-269.
Zeng Min, Gong Zheng-li. Modeling for aroma quality evaluation of Yunnan Pu-erh raw tea made from ancient trees based on principal component analysis[J]. Science and Technology of Food Industry, 2017, 38(15): 264-269.
[3] 吴全金, 周喆, 孙威江. 近红外光谱技术在茶叶品质调控中的应用[J]. 广东农业科学, 2019, 46(1): 91-100.
Wu Quan-jin, Zhou Zhe, Sun Wei-jiang. Review on the application of near-infrared spectroscopy technology in tea quality management[J]. Guangdong Agricultural Sciences, 2019, 46(1): 91-100.
[4] 王胜鹏, 龚自明, 高士伟, 等. 基于近红外光谱技术的恩施玉露茶保存年份的快速无损鉴别[J]. 华中农业大学学报, 2015, 34(5): 111-114.
Wang Sheng-peng, Gong Zi-ming, Gao Shi-wei, et al. Identification of Enshi yulu tea conserved years based on near infrared spectroscopy[J]. Journal of Huazhong Agricultural University, 2015, 34(5): 111-114.
[5] Ren G, Wang Y, Ning J, et al. Highly identification of keemun black tea rank based on cognitive spectroscopy: near infrared spectroscopy combined with feature variable selection[J]. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2020, 230: 118079.
[6] 韩广, 王小燕, 陈思琪, 等. 提高近红外光谱法检测人体血液等复杂溶液成分准确度的研究进展[J]. 光谱学与光谱分析, 2021, 41(7): 1993-1997.
Han Guang, Wang Xiao-yan, Chen Si-qi, et al. Research progress on improving the accuracy of near inf rared spectroscopy detection of human blood and other complex solution components[J]. Spectroscopy and Spectral Analysis, 2021, 41(7): 1993-1997.
[7] 谢德红, 李俊锋, 刘菂, 等. 基于改进Hodrick-Prescott分解模型的近红外自适应降噪方法[J]. 光谱学与光谱分析, 2020, 40(5): 1650-1655.
Xie De-hong, Li Jun-feng, Liu Di, et al. An improved hodrick-prescott decomposition based near-infrared adaptive denoising method[J]. Spectroscopy and Spectral Analysis, 2020, 40(5): 1650-1655.
[8] Yang H, Li L L, Li G H, et al. A novel feature extraction method for ship-radiated noise[J]. Defence Technology, 2022, 18(4): 604-617.
[9] 董春旺, 梁高震, 安霆, 等. 红茶感官品质及成分近红外光谱快速检测模型建立[J]. 农业工程学报, 2018, 34(24): 306-313.
Dong Chun-wang, Liang Gao-zhen, An Ting, et al. Near-infrared spectroscopy detection model for sensory quality and chemical constituents of black tea[J]. Transactions of the Chinese Society of Agricultural Engineering, 2018, 34(24): 306-313.
[10] 刘鹏, 艾施荣, 杨普香, 等. 非线性流形降维方法结合近红外光谱技术快速鉴别不同海拔的茶叶[J]. 茶叶科学, 2019, 39(6): 715-722.
Liu Peng, Ai Shi-rong, Yang Pu-xiang, et al. Nonlinear manifold dimensionality reduction methods for quick discrimination of tea at different altitude by near infrared spectroscopy[J]. Journal of Tea Science, 2019, 39(6): 715-722.
[11] Canova L D S, Vallese F D, Pistonesi M F, et al. An improved successive projections algorithm version to variable selection in multiple linear regression[J]. Analytica Chimica Acta, 2023, 1274: 341560.
[12] Pang L, Wang L, Yuan P, et al. Rapid seed viability prediction of Sophora japonica by improved successive projection algorithm and hyperspectral imaging[J]. Infrared Physics & Technology, 2022, 123: 104143.
[13] Ghosh T, Kirby M. Linear centroid encoder for supervised principal component analysis[J]. Pattern Recognition, 2024, 155: 110634.
[14] Cardoso V G K, Poppi R J. Non-invasive identification of commercial green tea blends using NIR spectroscopy and support vector machine[J]. Microchemical Journal, 2021, 164: 106052.
[15] Pang Y, Wang Y, Lai X, et al. Enhanced kriging leave-one-out cross-validation in improving model estimation and optimization[J]. Computer Methods in Applied Mechanics and Engineering, 2023, 414: 116194.
[16] Luque A, Carrasco A, Martín A, et al. The impact of class imbalance in classification performance metrics based on the binary confusion matrix[J]. Pattern Recognition, 2019, 91: 216-231.
[1] Qing-liang JIN,Xin-sen ZHOU,Yi CHEN,Cheng-wen WU. Predictive model for identifying innovative university talents based on the swarm intelligence evolution enhanced kernel extreme learning machine [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(5): 1763-1771.
[2] Nan-nan ZHAO,Feng JIN,Hong-yu DING. Nonlinear active noise control algorithm based on weighted optimization AF [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(5): 1722-1727.
[3] Sheng-qi MEI,Xiao-dong LIU,Xing-ju WANG,Xu-feng LI,Teng WU,Xiang-xu CHENG. Prediction of high strength concrete creep based on parametric MIC analysis and machine learning algorithm [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(5): 1595-1603.
[4] Le-ting TAN,Hui DENG. Adaptive task scheduling for distributed fiber optic acoustic sensing signals with low signal-to-noise ratio under full waveform inversion [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(4): 1412-1418.
[5] Xiang-hai MENG,Guo-rui WANG,Ming-yang ZHANG,Bi-jiang TIAN. Traffic accident prediction model of mountain highways based on selection integration [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(4): 1298-1306.
[6] Yin-fei DAI,Xiu-zhen ZHOU,Yu-bao LIU,Zhi-yuan LIU. In⁃vehicle network intrusion detection system based on CAN bus data [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(3): 857-865.
[7] Na WANG,Yue-lei CUI,Yang LI,Zi-cong WANG. Rolling bearing fault diagnosis method via wavelet packet logarithmic-energy map [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(2): 494-502.
[8] Hua-song DONG,Yuan-feng LIAN. Lightweight detection algorithm for lossless transcoding and heavy compression of massive digital media videos [J]. Journal of Jilin University(Engineering and Technology Edition), 2025, 55(2): 741-747.
[9] Lei ZHANG,Jing JIAO,Bo-xin LI,Yan-jie ZHOU. Large capacity semi structured data extraction algorithm combining machine learning and deep learning [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(9): 2631-2637.
[10] Dan-hui LAI,Wei-feng LUO,Xu-dong YUAN,Zi-liang QIU. Key point feature extraction algorithms for multimodal gesture in complex environments [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(8): 2288-2294.
[11] Wen CHENG,Cheng-chun ZHANG,Xiao-wei SUN,Chun SHEN,Zheng-yang WU,Zheng-wu CHEN. Experimental on self-noise control of airfoil with ridge-like structure [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(8): 2385-2392.
[12] Yun-zuo ZHANG,Yu-xin ZHENG,Cun-yu WU,Tian ZHANG. Accurate lane detection of complex environment based on double feature extraction network [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(7): 1894-1902.
[13] Chang-jian WANG,Jiu-ming LIU,Jin-zhou ZHANG,Bin LI. Laser sequence pulse diagnosis method of planetary reducer fault based on high-speed photography technology [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(7): 1869-1875.
[14] Cheng CHEN,Pei-xin SHI,Peng-jiao JIA,Man-man DONG. Correlation analysis of shield driving parameters and structural deformation prediction based on MK-LSTM algorithm [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(6): 1624-1633.
[15] Shi-feng NIU,Shi-jie YU,Yan-jun LIU,Chong MA. Real-time detection method of angry driving behavior based on bracelet data [J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(12): 3505-3512.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] Liu Hui;Wang Mao-hua;Wang Yue-xuan;Ma Dao-kun;Li Hai-xia. Development of farmland soil moisture and temperature monitoring system based on wireless sensor network[J]. 吉林大学学报(工学版), 2008, 38(03): 604 -0608 .
[2] Xia Hong-wei,Ling Ming-xiang,Wang Chang-hong . Guaranteed cost controller design for a class of uncertain networked control systems[J]. 吉林大学学报(工学版), 2008, 38(01): 173 -177 .
[3] Wen Guang;Zhao Ding-xuan;Tang Xin-xing;Deng Le;Zeng Chun-ping . Bilateral hydraulic servo control system based on force sense for teleoperated engineering robot[J]. 吉林大学学报(工学版), 2006, 36(06): 919 -0923 .
[4] FU Hong,JIA Hui-min,ZHANG Xiao-xu,YU Jian-qun . 2D DEM calculation methods based on superquadratics particles modeling[J]. 吉林大学学报(工学版), 2008, 38(06): 1383 -1388 .
[5] JIA Hong-fei, SUN Bao-feng|LUO Qing-yu, HAN Jia-chen . Measurement and adaptation analysis of their facilities capacity for the subway passenger terminal[J]. 吉林大学学报(工学版), 2009, 39(增刊2): 199 -0203 .
[6] CHI Xue-fen, ZHAO Ying-ying. Tandem queuing network model of H2H and M2M mixed services[J]. 吉林大学学报(工学版), 2011, 41(6): 1783 -1787 .
[7] Shang Hai-bin,Cui Ping-yuan,Luan En-jie . Guidance strategy for lowthrust transfer trajectories based on optimal states feedback technology[J]. 吉林大学学报(工学版), 2007, 37(04): 949 -954 .
[8] XU Zhixing , CONG Wenlong , TANG Kehong. Computeraided Selective Assembly Method[J]. 吉林大学学报(工学版), 2005, 35(06): 613 -0616 .
[9] WANG Dian-hai|CHEN Yong-heng. Research on the level of service at shared use path sidewalk[J]. 吉林大学学报(工学版), 2009, 39(增刊2): 146 -0149 .
[10] QU Zhao-wei, LI Zhi-hui, HU Hong-yu, GUO Wei-wei, WEI Wei. Traffic conflict automatic discrimination at non-signalized intersection based on video processing[J]. 吉林大学学报(工学版), 2009, 39(增刊2): 163 -0167 .