Post-processing of automatic segmentation of speech using dynamic programming

Marcin Szymański; Stefan Grocholewski

doi:10.1007/11846406_66

Scientific Information System of the Poznań University of Technology

PL EN

Main page / Publications / Post-processing of automatic segmentation of speech using dynamic programming

Submit a comment

Chapter

Download BibTeX

Title

Post-processing of automatic segmentation of speech using dynamic programming

Authors

Marcin Szymański ^{[ 1 ]}
Stefan Grocholewski ^{[ 1 ][ P ]}

^{[ 1 ]} Instytut Informatyki (II), Wydział Informatyki i Zarządzania, Politechnika Poznańska | ^{[ P ]} employee

Year of publication

2006

Chapter type

chapter in monograph / paper

Publication language

english

Abstract

EN Building unit-selection speech synthesisers requires a precise annotation of large speech corpora. Manual segmentation of speech is a very laborious task, hence there is the need for automatic segmentation algorithms. As it was observed that the common HMM-based method is prone to systematical errors, some boundary refinement approaches, like boundary-specific correction, were introduced. Last year, a dynamic programming fine-tuning approach was proposed, that combined two sources information, boundary error distribution and boundary MFCC statistical models. In this paper we verify the usefulness of incorporating several other data, boundary energy dynamics models and the signal periodicity information.

Pages (from - to)

523 - 530

DOI

10.1007/11846406_66

URL

https://link.springer.com/chapter/10.1007/11846406_66

Book

Text, Speech and Dialogue : 9th International Conference TSD 2006, Brno, Czech Republic, September 11-15, 2006. Proceedings

Presented on

9th International Conference Text, Speech and Dialogue, TSD 2006, 11-15.09.2006, Brno, Czech Republic

System created by Poznań University of Technology and Poznan Supercomputing and Networking Center

Log in through eKonto to add to SIS