09934775 is referenced by 2 patents and cites 4865 patents.

Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units are selected. Predicted statistical parameters of acoustic features associated with the sequence of target units are determined. The predicted statistical parameters of acoustic features are used to determine target costs and concatenation costs associated with the plurality of candidate speech segments. Based on a combined cost determined from the target costs and concatenation costs, a subset of candidate speech segments is selected from the plurality of candidate speech segments. Speech corresponding to the received text is generated using the subset of candidate speech segments.

Title
Unit-selection text-to-speech synthesis based on predicted concatenation parameters
Application Number
15/266930
Publication Number
9934775 (B2)
Application Date
September 15, 2016
Publication Date
April 3, 2018
Inventor
David A Winarsky
Cupertino
CA, US
Ladan Golipour
Cupertino
CA, US
Alistair D Conkie
Cupertino
CA, US
Kishore Sunkeswari Prahallad
Cupertino
CA, US
Tuomo J Raitio
Sunnyvale
CA, US
Agent
Dentons US
Assignee
Apple
CA, US
IPC
G10L 13/06
G10L 13/033
G10L 13/10
View Original Source