吉林大学学报(理学版) ›› 2025, Vol. 63 ›› Issue (3): 783-0794.

• • 上一篇    下一篇

基于多层次多尺度注意力融合网络的多模态眼底疾病诊断模型

郭晓新1,2, 杨梅1,2, 杨广奇1,2, 董洪良1, 徐海啸1   

  1. 1. 吉林大学 计算机科学与技术学院, 长春 130012; 2. 吉林大学 符号计算与知识工程教育部重点实验室, 长春 130012
  • 收稿日期:2023-12-13 出版日期:2025-05-26 发布日期:2025-05-26
  • 通讯作者: 郭晓新 E-mail:guoxx@jlu.edu.cn

Multimodal Retinal Disease Diagnosis Model Based on Multi-level and Multi-scale Attention Fusion Network

GUO Xiaoxin1,2, YANG Mei1,2, YANG Guangqi1,2, DONG Hongliang1, XU Haixiao1   

  1. 1. College of Computer Science and Technology, Jilin University, Changchun 130012, China;2. Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
  • Received:2023-12-13 Online:2025-05-26 Published:2025-05-26

摘要: 针对单模态眼底图像提取眼底特征的局限性, 提出一个基于多层次多尺度注意力融合网络的多模态眼底疾病诊断模型. 首先, 分别针对彩色眼底图像和视网膜光学相干断层成像设计多层次注意力网络和多尺度注意力网络, 并在特征层进行融合得到融合特征; 其次, 将两种模态的损失函数加权, 与融合特征的损失函数相加, 提取模态的独特和互补信息, 以提高眼底疾病诊断的准确率. 在数据集MMC-AMD和GAMMA上进行评估的实验结果表明, 该模型优于当前主流模型, 诊断效果优越.

关键词: 医学图像分类, 眼底疾病诊断模型, 多模态分类, 注意力机制

Abstract: Aiming at the limitations of extracting retinal features from single-mode retinal images, we proposed a multi-modal retinal disease diagnosis model based on multi-level and multi-scale attention fusion network. Firstly, the multi-level attention network and multi-scale attention network were designed for color retinal images  and retinal optical coherence tomography respectively, and the fusion features were obtained by merging at the feature layer. Secondly, the weighted loss function of the two modes and the loss function of the fusion features were added to extract the unique and complementary information of the two modes in order to  improve the accuracy of retinal disease diagnosis. The results of evaluation experiments  on the MMC-AMD dataset and GAMMA dataset show that the proposed model outperforms  the current mainstream models and has superior diagnostic effect.

Key words: medical image classification, retinal disease diagnosis model, multi-modal classification, attention mechanism

中图分类号: 

  • TP391.4