Triple-Stream Transformer Architecture for Multi-Class Skin Cancer Classification in Dermoscopic Images
DOI: https://doi.org/10.63332/joph.v5i3.853

Keywords: Skin Cancer Classification, DeepLabV3+, Derm-ViT, Swin Transformer V2, ConvNeXt V2, triple-stream transformer

Abstract
Skin cancer is the most common cancer worldwide, and melanoma is its deadliest form. Early and accurate diagnosis is critical to successful treatment. Traditional diagnosis relies on visual inspection by dermatologists, which can be subjective and inconsistent; recent advances in deep learning, however, show considerable promise for automated skin cancer detection. This work presents a new hybrid model for multi-class skin cancer classification from dermoscopic images. Our method comprises five steps: segmenting skin lesions with DeepLabV3+ on a ResNet50 backbone; extracting features with a triple-stream transformer architecture (Derm-ViT, Swin Transformer V2, and ConvNeXt V2); fusing the extracted features; selecting features with the ReliefF algorithm; and classifying with k-nearest neighbors (kNN). Each transformer stream captures complementary aspects of skin lesions, including fine-grained texture, multiscale characteristics, and dermatology-specific patterns. On the ISIC-2019 dataset, which spans eight diagnostic categories, the proposed method achieves 94.42% accuracy, 94.13% precision, 92.99% sensitivity, and 98.96% specificity, outperforming the individual transformer models and state-of-the-art approaches. These results demonstrate that the hybrid approach addresses the challenges of multi-class skin cancer classification and offers a reliable tool to support dermatologists in routine clinical practice.
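As a rough illustration of the pipeline described in the abstract, the sketch below wires the five steps together from off-the-shelf components. It is a minimal sketch, not the authors' implementation: Derm-ViT has no public release, so a standard ViT from timm stands in for it; torchvision provides DeepLabV3 (not V3+) with a ResNet50 backbone, used here with generic pretrained weights in place of a lesion-segmentation fine-tune; and the fusion-by-concatenation, ReliefF settings, and kNN's k are assumed values.

```python
import torch
import torch.nn.functional as F
import timm
from torchvision.models.segmentation import deeplabv3_resnet50
from skrebate import ReliefF
from sklearn.neighbors import KNeighborsClassifier

device = "cuda" if torch.cuda.is_available() else "cpu"

# Step 1: lesion segmentation. torchvision ships DeepLabV3 (not V3+) with a
# ResNet50 backbone; its generic pretrained weights stand in for a model
# fine-tuned on dermoscopic lesion masks.
segmenter = deeplabv3_resnet50(weights="DEFAULT").eval().to(device)

# Step 2: three parallel feature extractors. Derm-ViT is not publicly
# packaged, so a plain ViT stands in (assumption); input sizes follow each
# model's default configuration.
stream_specs = [
    ("vit_base_patch16_224", 224),     # Derm-ViT stand-in (assumption)
    ("swinv2_tiny_window8_256", 256),  # Swin Transformer V2
    ("convnextv2_tiny", 224),          # ConvNeXt V2
]
streams = [
    (timm.create_model(name, pretrained=True, num_classes=0).eval().to(device), size)
    for name, size in stream_specs
]

@torch.no_grad()
def extract_features(batch):  # batch: (B, 3, H, W) normalized images
    # Step 1: binary lesion mask (non-background argmax), applied to the image.
    mask = (segmenter(batch)["out"].argmax(1, keepdim=True) != 0).float()
    lesion = batch * mask
    # Steps 2-3: per-stream embeddings, fused by concatenation (assumed fusion).
    feats = []
    for model, size in streams:
        x = F.interpolate(lesion, size=(size, size), mode="bilinear",
                          align_corners=False)
        feats.append(model(x))  # num_classes=0 -> pooled embedding (B, D)
    return torch.cat(feats, dim=1).cpu().numpy()

# Steps 4-5: ReliefF feature selection (skrebate), then kNN classification.
def fit_and_predict(X_train, y_train, X_test):
    selector = ReliefF(n_features_to_select=500, n_neighbors=100)  # assumed settings
    X_tr = selector.fit_transform(X_train, y_train)
    X_te = selector.transform(X_test)
    knn = KNeighborsClassifier(n_neighbors=5)  # k=5 is an assumption
    return knn.fit(X_tr, y_train).predict(X_te)
```

In use, extract_features would be run over the training and test images to build the fused feature matrices X_train and X_test before calling fit_and_predict with the diagnostic labels.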
License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.