Hybrid CNN-Based Transformer Pipeline With Radiomic Fusion for Multi-Class Lung Cancer Detection
- 1 Department of Computer Science and Engineering, Galgotias University, Greater Noida, India
- 2 Graduate School, Duy Tan University, Da Nang 550000, Vietnam
Abstract
Early detection of lung cancer remains challenging due to high intra-class variation and inter-class similarity in Computed Tomography (CT) images. In this paper, we propose a hybrid deep learning model that combines convolutional, attention-based, and transformer-guided representations to address these challenges in multi-class lung cancer classification. For deep feature extraction, we use the EfficientNetV2-S architecture augmented with a Convolutional Block Attention Module to emphasize salient spatial and channel information. A transformer encoder captures global contextual dependencies, and texture-based radiomic features are incorporated to further enrich the representation. The resulting features are fused into a single embedding, which is then classified as normal, benign, or malignant. Experiments on the IQ-OTHNCCD dataset demonstrate that the proposed framework achieves superior performance across multiple metrics, accuracy, recall, precision, F1 score, and AUC, and outperforms state-of-the-art methods.
DOI: https://doi.org/10.3844/jcssp.2026.1785.1796
Copyright: © 2026 Aanchal Vij, Kuldeep Singh Kaswan and Anand Nayyar. This is an open access article distributed under the terms of the
Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 43 Views
- 11 Downloads
- 0 Citations
Download
Keywords
- Lung Cancer
- CT Imaging
- EfficientNetV2-S
- CBAM
- Transformer
- Hybrid Deep Learning