Real Time Recognition of Speakers from Internet Audio Stream
[ 1 ] Katedra Sterowania i Inżynierii Systemów, Wydział Informatyki, Politechnika Poznańska | [ P ] employee
2015
scientific article
English
- speaker recognition
- GMM
- internet radio
- ISOMAP
EN In this paper we present an automatic speaker recognition technique that operates on lossy (encoded) speech signal streams from Internet radio. We show the influence of audio encoder settings (e.g., bitrate) on speaker model quality. Each speaker was modeled using the Gaussian mixture model (GMM) approach. Both the speaker recognition and the subsequent analysis used short utterances to facilitate real-time processing. The neighborhoods of the speaker models were analyzed with the ISOMAP algorithm. The experiments were based on four 1-hour public debates with 7–8 speakers (including the moderator), acquired from the Polish radio Internet services. The presented software was developed in the MATLAB environment.
223 - 233
CC BY-NC-ND (Attribution - NonCommercial - NoDerivatives)
open journal
final published version
after publication
public
15