WO2007103520A3

WO2007103520A3 - Codebook-less speech conversion method and system

Info

Publication number: WO2007103520A3
Application number: PCT/US2007/005962
Authority: WO
Inventors: Oytun Turk; Levent Arslan; Fred Deutsch
Original assignee: Voxonic Inc; Oytun Turk; Levent Arslan; Fred Deutsch
Priority date: 2006-03-08
Filing date: 2007-03-07
Publication date: 2008-03-27
Also published as: WO2007103520A2; US20070213987A1

Abstract

The conversion of speech can be used to transform an utterance by a source speaker to match the speech characteristic of a target speaker, for applications such as dubbing a motion picture. During a training phase, utterances corresponding to the same sentences by both the target speaker and source speaker are force aligned according to the phonemes within the sentences. A transformation or mapping is trained so that each frame of the source utterances is mapped to a corresponding frame of the target utterance. After the completion of the training phase, a source utterance is divided into frames, which are transformed into target frames. After all target frames are created from the sequence of frames from the source utterance, a target utterance is created having the speech of the source speaker, but with the vocal characteristics of the target speaker.