Depending on the amount of data to process, file generation may take longer.

If it takes too long to generate, you can limit the data by, for example, reducing the range of years.

Chapter

Download BibTeX

Title

Post-processing of automatic segmentation of speech using dynamic programming

Authors

[ 1 ] Instytut Informatyki (II), Wydział Informatyki i Zarządzania, Politechnika Poznańska | [ P ] employee

Year of publication

2006

Chapter type

chapter in monograph / paper

Publication language

english

Abstract

EN Building unit-selection speech synthesisers requires a precise annotation of large speech corpora. Manual segmentation of speech is a very laborious task, hence there is the need for automatic segmentation algorithms. As it was observed that the common HMM-based method is prone to systematical errors, some boundary refinement approaches, like boundary-specific correction, were introduced. Last year, a dynamic programming fine-tuning approach was proposed, that combined two sources information, boundary error distribution and boundary MFCC statistical models. In this paper we verify the usefulness of incorporating several other data, boundary energy dynamics models and the signal periodicity information.

Pages (from - to)

523 - 530

DOI

10.1007/11846406_66

URL

https://link.springer.com/chapter/10.1007/11846406_66

Book

Text, Speech and Dialogue : 9th International Conference TSD 2006, Brno, Czech Republic, September 11-15, 2006. Proceedings

Presented on

9th International Conference Text, Speech and Dialogue, TSD 2006, 11-15.09.2006, Brno, Czech Republic

This website uses cookies to remember the authenticated session of the user. For more information, read about Cookies and Privacy Policy.