Journal of Jilin University Science Edition ›› 2025, Vol. 63 ›› Issue (3): 783-0794.

Previous Articles     Next Articles

Multimodal Retinal Disease Diagnosis Model Based on Multi-level and Multi-scale Attention Fusion Network

GUO Xiaoxin1,2, YANG Mei1,2, YANG Guangqi1,2, DONG Hongliang1, XU Haixiao1   

  1. 1. College of Computer Science and Technology, Jilin University, Changchun 130012, China;2. Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
  • Received:2023-12-13 Online:2025-05-26 Published:2025-05-26

Abstract: Aiming at the limitations of extracting retinal features from single-mode retinal images, we proposed a multi-modal retinal disease diagnosis model based on multi-level and multi-scale attention fusion network. Firstly, the multi-level attention network and multi-scale attention network were designed for color retinal images  and retinal optical coherence tomography respectively, and the fusion features were obtained by merging at the feature layer. Secondly, the weighted loss function of the two modes and the loss function of the fusion features were added to extract the unique and complementary information of the two modes in order to  improve the accuracy of retinal disease diagnosis. The results of evaluation experiments  on the MMC-AMD dataset and GAMMA dataset show that the proposed model outperforms  the current mainstream models and has superior diagnostic effect.

Key words: medical image classification, retinal disease diagnosis model, multi-modal classification, attention mechanism

CLC Number: 

  • TP391.4