A Cross-Domain Transfer Learning Framework for Robust Plant Disease Identification from Leaf Imagery
DOI:
https://doi.org/10.37934/sijml.6.1.1124Keywords:
Food crops diseases, image recognition, vision transformer, attention mechanism, deep learningAbstract
Recent advancements in transformer-based models, particularly Vision Transformers (ViT), have revolutionized agricultural image analysis by capturing complex, non-linear patterns. Despite their effectiveness, ViTs require large labeled datasets, posing challenges in plant disease identification due to similar symptoms and limited data. This study explores data augmentation and transfer learning with two ViT variants, ViT-Base and ViT-Large, using training-from-scratch and feature extraction techniques. The RoboFlow augmentation combined with ViT-GZSL feature extraction achieved 96.62% accuracy. The work uses transfer learning in the background. ViT-Base excelled in classifying corn, chilli, tea, and tomato diseases, while ViT-Large performed best on peanut and potato crops, reaching up to 98.85 % accuracy.
