CBC-ViT: Blood Cell Type Classification Using a Hybrid CNN-Vision Transformer with Explainable AI
Students & Supervisors
Student Authors
Supervisors
Abstract
Accurate classification of blood cell types plays a pivotal role in automated hematological analysis and serves as a fundamental step toward early blood-related disorder detection and diagnosis. In this study, we propose CBC-ViT, a novel hybrid Convolutional Vision Transformer model designed to classify five distinct blood cell types: basophil, erythroblast, monocyte, myeloblast, and neutrophil. The proposed architecture integrates a Convolutional Neural Network (CNN) backbone for hierarchical feature extraction with a Vision Transformer (ViT) encoder employing multi-head self-attention to capture long-range dependencies and contextual information. The model is optimized using the Adam optimizer with carefully tuned learning parameters to ensure stable convergence. Experimental results demonstrate the superior performance of CBC-ViT, achieving a test accuracy of 99.20%, with precision, recall, and F1-scores all exceeding 99%, thereby highlighting the robustness and reliability of the proposed approach. To further ensure transparency and trustworthiness, we incorporated Explainable AI (XAI) techniques, including Grad-CAM, Integrated Gradients, and SmoothGrad, which provide interpretable visual explanations of the model’s predictions, thereby assisting medical practitioners in validating automated outputs. The results underscore the potential of CBC-ViT as a reliable and explainable decision-support tool in hematological diagnostics.
Keywords
Publication Details
- Type of Publication:
- Conference Name: IEEE International Conference on Biomedical Engineering, Computer and Information Technology for Health 2025 (BECITHCON 2025)
- Date of Conference: 29/11/2025 - 29/11/2025
- Venue: Eastern University, Dhaka, Bangladesh
- Organizer: IEEE Bangladesh section and IEEE Engineering in Medicine and Biology Society Bangladesh