GFCC-based x-vectors for Reinke’s Edema Detection
2022
scientific article
english
- x-vectors
- Reinke's edema
- voice pathology classification
EN Automatic assessment of voice disorders is one of the most important applications of speech signal analysis. Various algorithms utilizing both sustained vowels and continuous speech have been successfully used to perform detection of many voice pathologies, e.g. dysphonia, laryngitis, and vocal folds paralysis. However, algorithms described in literature used for classification of Reinke’s edema – one of the most severe smoking-induced voice conditions – are scarce and rely mostly on speech signals containing sustained vowels. In this paper, a method incorporating gammatone frequency cepstral coefficients (GFCC) based x-vectors extracted from continuous speech is presented. The extracted x-vectors are used to train a SGD classifier performing Reinke’s edema detection. For validation folds, the proposed method yielded AUC ROC, accuracy, recall, and specificity of 0.96 (±0.03), 0.94 (±0.02), 0.92 (±0.03), and 0.94 (±0.02), respectively. For testing set, the method yielded AUC ROC, accuracy, recall, and specificity of 0.98, 0.89, 0.88, and 0.89, respectively.
2022307-1 - 2022307-7
article number: 2022307
CC BY (attribution alone)
open journal
final published version
public
70