Chapter

Title

Dominance based post-processing in ASR system

Authors

[ 1 ] Instytut Informatyki (II), Wydział Informatyki i Zarządzania, Politechnika Poznańska | [ P ] employee

Publication year

2003

Chapter type

conference paper

Publication language

English

Abstract

EN This paper presents different approaches to post-processing of N-best recognition hypotheses. When N-best solutions are obtained from an HMM-based recognizer, the segmental scores can be computed for each. Statistical modeling of a segment involves two models: for correct and incorrect labeling; such two models are built for each phoneme in the dictionary. Segmental score for the chosen feature (such as phoneme duration) is calculated as the total log likelihood normalized by the number of segments. In the described method, it is possible that there are many segmental scores (coming from different features, possibly based on differently defined 'segments'). If that is the case, they have to be combined into one confidence score.
In this paper the examined features are phoneme-duration and markovian likelihood. The effectiveness of each segmental model is presented and two methods of combining confidence scores are compared: statistical and dominance-based rough-set approach. It is shown that when all integrated scores are likelihoods (thus the "preference direction" is defined) the dominance-based approach can yield better results than Gaussian-mixture statistical modeling.
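
The segmental score described in the abstract can be illustrated with a minimal sketch. It rests on assumptions that are not taken from the paper: each phoneme's duration is modeled by a univariate Gaussian, the per-segment log likelihood is taken as the log-likelihood ratio between the "correct" and "incorrect" models, and all names and parameter values (`MODELS`, `segmental_score`, the durations in milliseconds) are hypothetical.

```python
import math


def gaussian_log_pdf(x, mean, std):
    """Log density of a univariate Gaussian (an assumed model form)."""
    return -0.5 * math.log(2 * math.pi * std ** 2) - (x - mean) ** 2 / (2 * std ** 2)


def segmental_score(segments, models):
    """Total log-likelihood ratio over segments, divided by the segment count.

    segments: list of (phoneme, duration) pairs for one hypothesis.
    models:   phoneme -> ((mean, std) for correct labeling,
                          (mean, std) for incorrect labeling).
    """
    if not segments:
        raise ValueError("hypothesis has no segments")
    total = 0.0
    for phoneme, duration in segments:
        correct, incorrect = models[phoneme]
        # Assumption: a segment's contribution is log p(correct) - log p(incorrect).
        total += gaussian_log_pdf(duration, *correct) - gaussian_log_pdf(duration, *incorrect)
    return total / len(segments)


# Hypothetical duration models in milliseconds, invented for illustration only.
MODELS = {
    "a": ((90.0, 20.0), (60.0, 40.0)),
    "t": ((50.0, 15.0), (40.0, 35.0)),
}

# Two hypothetical N-best entries: one with typical durations, one with atypical ones.
hyp_plausible = [("a", 95.0), ("t", 48.0)]
hyp_implausible = [("a", 10.0), ("t", 150.0)]

score_plausible = segmental_score(hyp_plausible, MODELS)
score_implausible = segmental_score(hyp_implausible, MODELS)
```

Under these assumptions, a hypothesis whose durations sit close to the "correct" models receives a higher score, so the score could be used to re-rank N-best entries or be combined with other segmental scores.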

Pages (from-to)

41 - 46

Book

Signal processing '2003 : workshop proceedings, Poznan 10th October 2003

Presented at

Signal Processing Workshop '2003, 10.10.2003, Poznań, Poland
