Cloning the voice and speech of Piotr Fronczewski for Polish speech synthesis

Krzysztof Szklanny

doi:10.21008/j.0860-6897.2024.1.12

System Informacji Naukowej Politechniki Poznańskiej

PL EN

Strona główna / Publikacje / Cloning the voice and speech of Piotr Fronczewski for Polish speech synthesis

Zgłoś uwagę

Artykuł

Pobierz plik Pobierz BibTeX

Tytuł

Cloning the voice and speech of Piotr Fronczewski for Polish speech synthesis

Autorzy

Krzysztof Szklanny

Rok publikacji

2024

Opublikowano w

Vibrations in Physical Systems

Rocznik: 2024 | Tom: vol. 35 | Numer: no. 1

Typ artykułu

artykuł naukowy

Język publikacji

angielski

Słowa kluczowe

EN

voice
emotions
corpus
recordings
speech synthesis
Piotr Fronczewski

Streszczenie

EN The quality of synthetically generated speech has improved significantly in recent years, largely due to the technological development of speech synthesis systems, in particular those based on deep neural networks (DNN). However, the problem of emotion in speech synthesis still remains a challenge. Most of the existing speech synthesis systems do not convey the pervasive emotional contexts in human-human interaction. The lack of expression limits the emotional intelligence of current speech synthesis systems. This work aimed to develop a recording method for preparing a balanced corpus of emotional recordings in the Polish language for use in speech synthesis based on artificial intelligence (AI) algorithms. An essential aspect of the work was the selection of a voice-over artist who would allow the recording of the spectrum of an actor's voice, emphasizing the actor's interpretations and emotions derived from the content. Outstanding actor Piotr Fronczewski was chosen for the role.

Strony (od-do)

2024112-1 - 2024112-10

DOI

10.21008/j.0860-6897.2024.1.12

URL

https://vibsys.put.poznan.pl/_journal/2024-35-1/articles/vps_2024112.pdf

Uwagi

Article number: 2024112

Typ licencji

CC BY (uznanie autorstwa)

Tryb otwartego dostępu

otwarte czasopismo