โก Quick Summary
This study explored the acoustic characteristics of voice production in patients with organic vocal cord lesions, utilizing machine learning to classify these lesions and predict malignancy. The XGBoost model achieved an impressive AUC of 0.735 for classification, while LightGBM excelled in malignancy prediction with an AUC of 0.924, highlighting the potential of AI in voice disorder diagnostics.
๐ Key Details
- ๐ Dataset: 157 participants (127 with lesions, 30 healthy controls)
- ๐งฉ Features used: Acoustic parameters from vowel sounds
- โ๏ธ Technology: Machine learning models (XGBoost, LightGBM)
- ๐ Performance: XGBoost: AUC 0.735; LightGBM: AUC 0.924
๐ Key Takeaways
- ๐ Acoustic analysis can effectively differentiate between benign and malignant vocal cord lesions.
- ๐ค Machine learning models significantly enhance diagnostic accuracy for voice disorders.
- ๐ Six key acoustic parameters were identified as highly diagnostic for lesion classification.
- ๐ฅ AI-assisted voice analysis offers a promising non-invasive tool for early detection.
- ๐ Study involved a diverse cohort, emphasizing the need for further validation in larger populations.
- ๐ง Age and specific acoustic parameters were found to be significant predictors in the models.
๐ Background
Voice disorders, often stemming from organic vocal cord lesions, pose a significant challenge in clinical settings due to the lack of reliable non-invasive diagnostic tools. Traditional methods can lead to delays in treatment, making it crucial to explore innovative approaches. The integration of acoustic analysis with artificial intelligence (AI) presents a promising avenue for enhancing diagnostic capabilities in this field.
๐๏ธ Study
Conducted with a total of 157 participants, this study aimed to identify distinctive acoustic biomarkers associated with organic vocal cord lesions. The cohort included 127 patients with lesions (109 benign and 18 malignant) and 30 healthy controls. Acoustic analysis focused on vowel sounds to assess vocal fold vibration parameters, while machine learning models were developed to classify lesion types and predict malignancy.
๐ Results
The study revealed 63 statistically significant differences in acoustic parameters between healthy individuals and those with lesions. Notably, six key parameters demonstrated high diagnostic value in distinguishing between benign and malignant lesions. The XGBoost model achieved an AUC of 0.735 for classifying vocal cord lesions, while the LightGBM model excelled in malignancy prediction with an AUC of 0.924, indicating a strong potential for clinical application.
๐ Impact and Implications
The findings from this study underscore the transformative potential of integrating acoustic analysis with machine learning in the diagnosis of voice disorders. By providing a non-invasive method for early detection and accurate classification of vocal cord lesions, this approach could significantly improve clinical decision-making and patient outcomes. The implications extend beyond voice disorders, suggesting a broader application of AI in various medical diagnostics.
๐ฎ Conclusion
This research highlights the remarkable potential of AI-assisted voice analysis in enhancing diagnostic accuracy for vocal cord lesions. The integration of acoustic characteristics with machine learning not only aids in differentiating benign from malignant cases but also paves the way for future innovations in non-invasive diagnostics. Continued research and validation in larger cohorts will be essential to refine these predictive algorithms for widespread clinical use.
๐ฌ Your comments
What are your thoughts on the integration of AI in voice disorder diagnostics? We invite you to share your insights and engage in a discussion! ๐ฌ Leave your comments below or connect with us on social media:
Acoustic signatures of organic lesions and the role of artificial intelligence in voice disorder diagnostics.
Abstract
OBJECTIVE: Voice disorders resulting from organic vocal cord lesions, whether benign or malignant, often lack reliable non-invasive diagnostic tools, which can lead to delays in treatment. This study aims to identify distinctive acoustic biomarkers and develop machine learning models for accurate classification of these lesions and prediction of malignancy. We investigated the acoustic characteristics of voice production in patients with organic vocal cord lesions, comparing benign and malignant cases, and evaluated the diagnostic potential of machine learning models in distinguishing between healthy and pathological voices.
METHODS: A total of 157 participants were enrolled, including 127 patients with organic vocal cord lesions (109 benign, 18 malignant) and 30 healthy controls. Acoustic analysis was performed on vowel sounds, assessing vocal fold vibration parameters. Machine learning models (eXtreme Gradient Boosting (XGBoost), Light Gradient-Boosting Machine (LightGBM)) were trained to classify lesion types and predict malignancy. Receiver operating characteristic analysis identified key diagnostic parameters.
RESULTS: Comparative analysis revealed 63 statistically significant differences in acoustic parameters between healthy and lesion-affected groups, with skewness and kurtosis being particularly discriminative. Six key parameters (/u/skew, /i/skew, /o/kurt, /o/shapefactor, /o/impulsefactor, and /a/peak2valley) demonstrated high diagnostic value in distinguishing benign from malignant lesions. The XGBoost model achieved the best performance in classifying vocal cord lesions (area under the curve (AUC)โ=โ0.735), while LightGBM excelled in malignancy prediction (AUCโ=โ0.924). Age and specific acoustic parameters were significant predictors in the models.
CONCLUSION: The integration of acoustic analysis with machine learning significantly enhances the diagnostic accuracy for vocal cord lesions, particularly in differentiating between benign and malignant cases. These findings underscore the potential of artificial intelligence (AI)-assisted voice analysis as a non-invasive tool for early detection and clinical decision-making. Further validation in larger cohorts is necessary to refine predictive algorithms for broader clinical application.
Author: [‘Ma K’, ‘Wang Y’, ‘Zhou Y’, ‘Chen L’, ‘Zhang T’, ‘Xu F’, ‘Peng X’]
Journal: Digit Health
Citation: Ma K, et al. Acoustic signatures of organic lesions and the role of artificial intelligence in voice disorder diagnostics. Acoustic signatures of organic lesions and the role of artificial intelligence in voice disorder diagnostics. 2025; 11:20552076251376264. doi: 10.1177/20552076251376264