Aaron Earned an Iron Urn: Speech-to-IPA Models Improve Diagnostic of Pronunciation
[ 1 ] Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ 2 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ S ] student | [ SzD ] doctoral school student | [ P ] employee
2023
chapter in monograph / paper
English
- Wav2Vec
- IPA
- pronunciation diagnostic
- ASR
EN Learning proper pronunciation is one of the key aspects of foreign language acquisition. Assessing the correctness of pronunciation requires expert phoneticians and linguists, which severely limits the scalability of learning solutions. However, the recent adaptation of the Transformer architecture to the audio domain opens the way for automatic, model-based assessment of pronunciation. In this paper, we present the pronunciation diagnostic tool developed at Poznan University of Technology (PUT) and experimentally evaluate the correlation between expert human assessment and automatic model assessment. By combining the Wav2Vec model with the IPA representation, we show that pronunciation assessment can be performed automatically with high precision.
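The evaluation idea in the abstract (compare an automatic score with expert ratings) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which distance or correlation measure the paper uses, so the Levenshtein-based phoneme error rate, the Pearson coefficient, the function names, and the sample IPA strings below are all assumptions made for illustration. The IPA transcriptions are taken as given strings, as if already produced by a speech-to-IPA model, and each Unicode code point is treated as one symbol.

```python
from math import sqrt


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two symbol sequences (assumed metric)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def phoneme_error_rate(ref: str, hyp: str) -> float:
    """Edit distance normalised by the reference length (hypothetical score)."""
    if not ref:
        raise ValueError("reference transcription must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


def pearson(xs, ys) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)


# Illustrative values only: a reference IPA string for "iron" and a
# hypothetical model transcription of a learner's attempt.
score = phoneme_error_rate("aɪɚn", "aɪrɔn")  # 2 edits / 4 symbols = 0.5
```

Given one such score per recording and a matching list of expert ratings, `pearson(scores, ratings)` would quantify how closely the automatic assessment tracks the human one. A lower error rate should correspond to a better rating, so a strong negative correlation would be the expected sign of agreement under these assumptions.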
273 - 279
for everyone, within the scope of permitted use
open repository
final published version
at the time of publication
20