โก Quick Summary
This study introduces a hybrid deep learning model for the multiclass classification of skin cancer, achieving an impressive 93.48% accuracy on the ISIC 2019 dataset. The model’s innovative architecture combines ConvNeXtV2 blocks and separable self-attention mechanisms to enhance feature extraction and classification performance.
๐ Key Details
- ๐ Dataset: ISIC 2019, featuring eight distinct skin lesion categories
- ๐งฉ Features used: Visual characteristics of skin lesions
- โ๏ธ Technology: Hybrid model utilizing ConvNeXtV2 and separable self-attention
- ๐ Performance: Accuracy 93.48%, Precision 93.24%, Recall 90.70%, F1-score 91.82%
- ๐ Model size: Compact design with only 21.92 million parameters
๐ Key Takeaways
- ๐ Innovative architecture enhances feature extraction for skin lesion classification.
- ๐ก ConvNeXtV2 blocks effectively capture fine-grained local features.
- ๐ Separable self-attention prioritizes diagnostically relevant regions.
- ๐ Exceptional performance compared to over twenty other models tested.
- ๐ High efficiency with a compact model design suitable for deployment.
- ๐ Potential for early and accurate skin cancer diagnosis in clinical settings.
๐ Background
Skin cancer is a major global health issue, where early and accurate diagnosis is crucial for improving treatment outcomes and patient survival rates. The visual similarities between benign and malignant lesions often complicate the classification process, necessitating advanced technological solutions to enhance diagnostic accuracy.
๐๏ธ Study
The study proposed a novel hybrid deep learning framework that integrates ConvNeXtV2 blocks and separable self-attention mechanisms. This model was trained and validated on the ISIC 2019 dataset, employing advanced techniques such as data augmentation and transfer learning to bolster its robustness and reliability.
๐ Results
The proposed model achieved remarkable performance metrics, including 93.48% accuracy, 93.24% precision, 90.70% recall, and a 91.82% F1-score. These results indicate a significant improvement over more than twenty other models, including both Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), tested under similar conditions.
๐ Impact and Implications
The implications of this study are profound, as the proposed model establishes a reliable framework for early and accurate skin cancer diagnosis. By leveraging advanced deep learning techniques, healthcare professionals can enhance diagnostic precision, ultimately leading to improved patient outcomes and survival rates. This innovation could pave the way for broader applications in dermatology and beyond.
๐ฎ Conclusion
This research highlights the transformative potential of deep learning in the field of skin cancer diagnosis. The hybrid model not only demonstrates exceptional accuracy and efficiency but also sets a new standard for future developments in medical imaging and diagnostics. Continued exploration in this area could significantly enhance clinical practices and patient care.
๐ฌ Your comments
What are your thoughts on this innovative approach to skin cancer classification? We would love to hear your insights! ๐ฌ Leave your comments below or connect with us on social media:
A robust deep learning framework for multiclass skin cancer classification.
Abstract
Skin cancer represents a significant global health concern, where early and precise diagnosis plays a pivotal role in improving treatment efficacy and patient survival rates. Nonetheless, the inherent visual similarities between benign and malignant lesions pose substantial challenges to accurate classification. To overcome these obstacles, this study proposes an innovative hybrid deep learning model that combines ConvNeXtV2 blocks and separable self-attention mechanisms, tailored to enhance feature extraction and optimize classification performance. The inclusion of ConvNeXtV2 blocks in the initial two stages is driven by their ability to effectively capture fine-grained local features and subtle patterns, which are critical for distinguishing between visually similar lesion types. Meanwhile, the adoption of separable self-attention in the later stages allows the model to selectively prioritize diagnostically relevant regions while minimizing computational complexity, addressing the inefficiencies often associated with traditional self-attention mechanisms. The model was comprehensively trained and validated on the ISIC 2019 dataset, which includes eight distinct skin lesion categories. Advanced methodologies such as data augmentation and transfer learning were employed to further enhance model robustness and reliability. The proposed architecture achieved exceptional performance metrics, with 93.48% accuracy, 93.24% precision, 90.70% recall, and a 91.82% F1-score, outperforming over ten Convolutional Neural Network (CNN) based and over ten Vision Transformer (ViT) based models tested under comparable conditions. Despite its robust performance, the model maintains a compact design with only 21.92ย million parameters, making it highly efficient and suitable for model deployment. The Proposed Model demonstrates exceptional accuracy and generalizability across diverse skin lesion classes, establishing a reliable framework for early and accurate skin cancer diagnosis in clinical practice.
Author: [‘Ozdemir B’, ‘Pacal I’]
Journal: Sci Rep
Citation: Ozdemir B and Pacal I. A robust deep learning framework for multiclass skin cancer classification. A robust deep learning framework for multiclass skin cancer classification. 2025; 15:4938. doi: 10.1038/s41598-025-89230-7