Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

Dominance based post-processing in ASR system

Authors

[ 1 ] Instytut Informatyki (II), Wydział Informatyki i Zarządzania, Politechnika Poznańska | [ P ] employee

Year of publication

2003

Chapter type

paper

Publication language

english

Abstract

EN This paper presents different approaches to post-processing of N-best recognition hypotheses. When N-best solutions are obtained from an HMM-based recognizer, the segmental scores can be computed for each. Statistical modeling of a segment involves two models: for correct and incorrect labeling; such two models are built for each phoneme in the dictionary. Segmental score for the chosen feature (such as phoneme duration) is calculated as the total log likelihood normalized by the number of segments. In the described method, it is possible that there are many segmental scores (coming from different features, possibly based on differently defined 'segments'). If it is the case, they have to be combined to one confidene score.
In this paper the examined features are phoneme-duration and markovian likelihood. The effectiveness of each segmental model is presented and two methods of combining confidence scores are compared: statistical and dominance-based rough-set approach. It is shown that when all integrated scores are likelihoods (thus the "preference direction" is defined) the dominance-based approach can yield better results than Gaussian-mixture statistical modeling.

Pages (from - to)

41 - 46

Book

Signal processing '2003 : workshop proceedings, Poznan 10th October 2003

Presented on

Signal Processing Workshop '2003, 10.10.2003, Poznań, Poland

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.