Convolutional Neural Network Classifies Pathological Voice Change in Laryngeal Cancer with High Accuracy
- PMID: 33113785
- PMCID: PMC7692693
- DOI: 10.3390/jcm9113415
Convolutional Neural Network Classifies Pathological Voice Change in Laryngeal Cancer with High Accuracy
Abstract
Voice changes may be the earliest signs in laryngeal cancer. We investigated whether automated voice signal analysis can be used to distinguish patients with laryngeal cancer from healthy subjects. We extracted features using the software package for speech analysis in phonetics (PRAAT) and calculated the Mel-frequency cepstral coefficients (MFCCs) from voice samples of a vowel sound of /a:/. The proposed method was tested with six algorithms: support vector machine (SVM), extreme gradient boosting (XGBoost), light gradient boosted machine (LGBM), artificial neural network (ANN), one-dimensional convolutional neural network (1D-CNN) and two-dimensional convolutional neural network (2D-CNN). Their performances were evaluated in terms of accuracy, sensitivity, and specificity. The result was compared with human performance. A total of four volunteers, two of whom were trained laryngologists, rated the same files. The 1D-CNN showed the highest accuracy of 85% and sensitivity and sensitivity and specificity levels of 78% and 93%. The two laryngologists achieved accuracy of 69.9% but sensitivity levels of 44%. Automated analysis of voice signals could differentiate subjects with laryngeal cancer from those of healthy subjects with higher diagnostic properties than those performed by the four volunteers.
Keywords: deep learning; larynx cancer; machine learning; voice change; voice pathology classification.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
References
-
- Polesel J., Furlan C., Birri S., Giacomarra V., Vaccher E., Grando G., Gobitti C., Navarria F., Schioppa O., Minatel E., et al. The impact of time to treatment initiation on survival from head and neck cancer in north-eastern Italy. Oral Oncol. 2017;67:175–182. doi: 10.1016/j.oraloncology.2017.02.009. - DOI - PubMed
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
