A major obstacle to the acceptance of speech synthesis is its lack of expressivity. To convey emotions or other expressive states appropriately, the sound of the synthetic voice must change; however, newer speech synthesis methods offer too little control over the relevant parameters.
In current speech synthesis technology, naturalness and flexibility are mutually exclusive: newer corpus-based unit selection methods often sound natural, but they can realise only a single speaking style, which is fixed when the speech corpus is recorded. In contrast, older methods such as formant or diphone synthesis are parametrisable but sound quite unnatural. No current synthesis method combines the naturalness of corpus-based synthesis with the parametrisability of earlier systems.
The PAVOQUE project aims to make a core contribution to reconciling synthesis quality with parametrisability. Within a current corpus-based speech synthesis system, it investigates methods for parametrising the key dimensions of vocal emotion expression: prosody (intonation and rhythm) and voice quality. Two strategies are pursued: parameter-based selection of units from the corpus, and post-processing of the synthetic speech signal with signal manipulation methods. Together, these should allow a high degree of expressivity while maintaining good speech signal quality.
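The first strategy, parameter-based unit selection, can be illustrated with a minimal sketch. The idea is that the target cost used to rank candidate corpus units also penalises deviation from the requested prosody (pitch, duration) and voice quality. The feature set, the `breathiness` proxy for voice quality, and the weights below are illustrative assumptions, not the project's actual cost function:

```python
from dataclasses import dataclass

@dataclass
class Unit:
    """A candidate speech unit annotated with prosodic and
    voice-quality features (hypothetical feature set)."""
    pitch_hz: float       # mean fundamental frequency
    duration_ms: float    # unit duration
    breathiness: float    # voice-quality proxy in [0, 1]

def target_cost(candidate: Unit, target: Unit,
                w_pitch: float = 1.0, w_dur: float = 0.5,
                w_vq: float = 2.0) -> float:
    """Weighted distance between a candidate unit and the expressive
    target specification; the weights steer selection toward units
    matching the requested prosody and voice quality."""
    return (w_pitch * abs(candidate.pitch_hz - target.pitch_hz) / target.pitch_hz
            + w_dur * abs(candidate.duration_ms - target.duration_ms) / target.duration_ms
            + w_vq * abs(candidate.breathiness - target.breathiness))

def select_unit(candidates: list[Unit], target: Unit) -> Unit:
    """Pick the candidate with the lowest target cost."""
    return min(candidates, key=lambda u: target_cost(u, target))

# Example: a breathy, lowered-pitch target (e.g. a 'soft' style)
target = Unit(pitch_hz=180, duration_ms=120, breathiness=0.8)
candidates = [
    Unit(220, 100, 0.1),  # neutral, tense
    Unit(185, 115, 0.7),  # close match
    Unit(250,  90, 0.9),  # breathy but high-pitched
]
best = select_unit(candidates, target)
```

In a full system this target cost would be combined with a join cost between adjacent units; raising `w_vq` relative to the prosodic weights trades prosodic accuracy for voice-quality fidelity, which is exactly the kind of control knob the project seeks to expose.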