Journal of Jilin University (Information Science Edition) ›› 2025, Vol. 43 ›› Issue (4): 747-754.

Previous Articles     Next Articles

mage Generation Method of Rice Disease Based on ViT-WGAN-GP

LU Yang1, XU Siyuan1, TAO Xianpeng2, LIU Qiwang1, GUAN Chuang3   

  1. 1. School of Information and Electrical Engineering, Heilongjiang Bayi Agricultural Reclamation University, Daqing 163319, China; 2. Intelligent Vehicle Control Department, Shanghai Shangtai Automotive Information System Limited, Shanghai 200020, China; 3. Heilongjiang Key Laboratory of Networking and Intelligent Control, Northeast Petroleum University, Daqing 163318, China
  • Received:2024-06-06 Online:2025-08-15 Published:2025-08-14

Abstract: In order to solve the problem that the accuracy of deep neural network model learning is affected by the small sample of rice disease image dataset, an improved adversarial generative network model ViT-WGAN-GP (The Fusion of Vision Transformer and Wasserstein Generative Adversarial Networks with Gradient Penalty) is proposed for enhancing the image dataset. Firstly, the Vision Transformer structure is introduced in the generation model to enhance the learning of global features. Secondly, the WGAN-GP structure is used in the discrimination model to ensure the stability of the model training and improved the effect of the generated images through the Wasserstein measure function and the gradient penalty term. Finally, the enhanced sample set is used to train the deep neural network model. The experimental results show that the ViT-WGAN-GP model generates images with significant improvement compared with GAN and WGAN-GP. The average accuracy of rice disease recognition is 94. 3%,96. 2%, and 97. 5% for VGG16, ResNet34, which are improved by 9. 7%, 2. 8%, and 4.8%, respectively. The proposed ViT-WGAN-GP model can generate more realistic rice disease images and can improve the recognition accuracy of deep neural network models significantly with small sample sets.

Key words: image generation, vision transformer, wasserstein generative adversarial networks with gradient penalty(WGAN-GP), generative adversarial networks, rice diseases

CLC Number: 

  • TP391.41