Real Time Recognition of Speakers from Internet Audio Stream
[ 1 ] Katedra Sterowania i Inżynierii Systemów, Wydział Informatyki, Politechnika Poznańska | [ P ] employee
2015
scientific article
english
- speaker recognition
- GMM
- internet radio
- ISOMAP
EN In this paper we present an automatic speaker recognition technique with the use of the Internet radio lossy (encoded) speech signal streams. We show an influence of the audio encoder (e.g., bitrate) on the speaker model quality. The model of each speaker was calculated with the use of the Gaussian mixture model (GMM) approach. Both the speaker recognition and the further analysis were realized with the use of short utterances to facilitate real time processing. The neighborhoods of the speaker models were analyzed with the use of the ISOMAP algorithm. The experiments were based on four 1-hour public debates with 7–8 speakers (including the moderator), acquired from the Polish radio Internet services. The presented software was developed with the MATLAB environment.
223 - 233
CC BY-NC-ND (attribution - noncommercial - no derivatives)
open journal
final published version
after publication
public
15