Coding Effects on Changes in Formant Frequencies in Japanese Speech Signals
2019
artykuł naukowy
angielski
- speech
- speech coding
- formant
EN This paper presents results of research on effects of lossy coding on formant frequencies for Japanese speech signals. Additionally changes in pitch of the voice were inspected. For this research four most popular lossy coding standards were chosen, MP3, WMA, AAC and OGG, and compared to original WAVE files. Audio files were created by the author based on ITU-T P.501 recommendation in two sampling frequencies, 16 kHz and 48 kHz, and converted into chosen codecs. To extract the data from audio files, open license software Praat was used. Due to discovered differences in time duration between original and encoded files, that also differed between individual codecs, only OGG and WMA standards were compared directly. MP3 and AAC standards were divided into Japanese syllables, averaged and then compared into also averaged WAVE files. Results were additionally compared to FLAC lossless codec.
2019131-1 - 2019131-8
CC BY (uznanie autorstwa)
publiczny
40
70