e-ISSN 2231-8526
ISSN 0128-7680
Gan Mengyun, Hazlina Hamdan, Mahmud Dwi Sulistiyo, and Abdulrahman Dira Khalaf
Pertanika Journal of Science & Technology, Pre-Press
DOI: https://doi.org/10.47836/pjst.34.2.02
Keywords: Convolutional neural networks, data augmentation, deep learning, EfficientNet, MambaATT Model, ResNet50, skin cancer, vision transformer
Published: 2026-04-01
Skin cancer poses a vast worldwide health issue because of its fast progression and high mortality rate. Early diagnosis and precise prognosis are important for the treatment of these patients. In this paper, we propose an enhanced hybrid deep learning model composed of a combination of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) for the dermoscopic image classification task. The designed architecture adopts ResNet50, EfficientNet, and ViT as strong feature extractors with an innovative attention module, MambaATT, for precise modelling and utilisation of global context information. Methods for statistical growth (rotation, flips, and scaling) have been tested to improve the model's generalisation and its variety, even if the dataset is short. When evaluating the entire dataset, we found that our proposed version CNN_ViT_MambaATT reaches an advanced-class accuracy of 92%, outperforming the traditional methods for individual networks. The findings highlight the power of combining strong CNN and VIT models for both mechanisms of interest and information growth strategies, offering a robust and accurate approach to early skin cancer analysis.
ISSN 0128-7702
e-ISSN 2231-8534
Share this article