Iris Recognition with Eyelid Occlusion Based on Vision Transformer

Journal of Jilin University (Information Science Edition) ›› 2026, Vol. 44 ›› Issue (3): 598-608.

XIA Zhicheng^1a, LIU Yuanning^1a,1b, ZHU Xiaodong^1a,1b, LIU Zhen^1a,2, CHEN Ying^3, GUO Zhimin^1a

1a. College of Computer Science and Technology; 1b. Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China; 2. Graduate School of Engineering, Nagasaki Institute of Applied Science, Nagasaki 851-0193, Japan; 3. School of Software, Nanchang Hangkong University, Nanchang 330036, China

Received: 2025-04-11 Online: 2026-06-02 Published: 2026-06-02
About the authors: XIA Zhicheng (2000- ), male, from Xinyang, Henan, master's student at Jilin University, research interest: biometric recognition, (Tel) 86-18240525708, (E-mail) xzc201908@163.com; LIU Yuanning (1962- ), male, from Changchun, professor and doctoral supervisor at Jilin University, research interest: biometric recognition, (Tel) 86-13904336786, (E-mail) liuyn@jlu.edu.cn.
Funding:
National Natural Science Foundation of China (61471181); National Key Research and Development Program of China (Guo Ke Fa Zi [2020] No. 151); Natural Science Foundation of Jiangxi Province (20242BAB26015)

Abstract

Abstract: Eyelid occlusion degrades the performance of iris recognition. To address this, a solution based on the Vision Transformer (ViT) is proposed. First, a Feature Fusion Module (FFM) extracts and fuses features at different scales, which counters the loss of information during feature extraction. Second, the local feature encoder is pre-trained by minimizing a reconstruction loss, so that irises from different classes that share the same dominant features are not grouped into a triplet; this prior knowledge makes the adjustment of model parameters interpretable to some degree. In addition, an interactive encoding structure is built around ViT and residual blocks, efficiently fusing information from different iris blocks into a comprehensive feature representation. Finally, the traditional triplet loss is improved by incorporating a threshold, which gives model training a clearer learning direction. Experimental results show that the proposed method effectively removes the negative impact of occlusion on iris recognition and significantly improves recognition performance.

Key words: iris recognition, Vision Transformer, feature fusion, triplet loss
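
The abstract states that the traditional triplet loss is combined with a threshold but gives no formula. As a rough illustration only, the sketch below contrasts the conventional triplet loss with one possible thresholded variant. The function names, the `margin` and `threshold` parameters, and the way the threshold terms are added are all assumptions made for this example; they are not the authors' formulation.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Conventional triplet loss: max(0, d(a, p) - d(a, n) + margin)."""
    d_ap = euclidean(anchor, positive)
    d_an = euclidean(anchor, negative)
    return max(0.0, d_ap - d_an + margin)


def thresholded_triplet_loss(anchor, positive, negative,
                             margin=0.2, threshold=1.0):
    """Hypothetical variant (an assumption, not the paper's loss).

    On top of the relative term, same-class pairs are penalised when
    their distance exceeds `threshold`, and different-class pairs are
    penalised when their distance falls below it.
    """
    d_ap = euclidean(anchor, positive)
    d_an = euclidean(anchor, negative)
    relative = max(0.0, d_ap - d_an + margin)
    same_class = max(0.0, d_ap - threshold)   # positive pair too far apart
    diff_class = max(0.0, threshold - d_an)   # negative pair too close
    return relative + same_class + diff_class


if __name__ == "__main__":
    a, p, n = [0.0, 0.0], [3.0, 4.0], [6.0, 8.0]
    print(triplet_loss(a, p, n))                             # relative term only
    print(thresholded_triplet_loss(a, p, n, threshold=2.0))  # adds absolute terms
```

In this toy triplet the negative is already farther from the anchor than the positive by more than the margin, so the conventional loss is zero, while the thresholded variant still penalises the positive pair for lying beyond the threshold. That is one way a threshold could supply a more explicit training signal, under the assumptions stated above.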

CLC number: TP391

XIA Zhicheng, LIU Yuanning, ZHU Xiaodong, LIU Zhen, CHEN Ying, GUO Zhimin. Iris Recognition with Eyelid Occlusion Based on Vision Transformer[J]. Journal of Jilin University (Information Science Edition), 2026, 44(3): 598-608.
